diff --git a/README.md b/README.md index ee85df9..90c6435 100644 --- a/README.md +++ b/README.md @@ -122,68 +122,68 @@ Contains CPPs with natural or modified amino acids. - 1.1. POSEIDON - - Contains heterogeneous experimental data regarding CPP (natural and non-natural amino acids) activity measurements (.csv format), which are: - - peptide name, - - target cell line CPP was tested on cell penetration ability, - - delivered molecule/protein, - - paper PubMed ID, - - cellular uptake measurement + measurement units, - - CPP+cargo concentration, - - incubation time, - - incubation temperature, - - determination method, - - uptake type, - - sequence. + 1.1. POSEIDON + + Contains heterogeneous experimental data regarding CPP (natural and non-natural amino acids) activity measurements (.csv format), which are: + - peptide name, + - target cell line CPP was tested on cell penetration ability, + - delivered molecule/protein, + - paper PubMed ID, + - cellular uptake measurement + measurement units, + - CPP+cargo concentration, + - incubation time, + - incubation temperature, + - determination method, + - uptake type, + - sequence. ### 2. Natural CPPs Contains only sequences with natural amino acids. - 2.1. CPPBase - - Contains sequences of CPPs with experimentally proved activity in .fasta format. + 2.1. CPPBase + + Contains sequences of CPPs with experimentally proved activity in .fasta format. - - 2.2. Experimental and Experimental2 - - Contain more sequences of CPPs with experimentally proved activity in .txt format. - - 2.3. Experimental_high_uptake - - Contains CPP sequences with high (but not stated) uptake in .txt format. + 2.2. Experimental and Experimental2 + + Contain more sequences of CPPs with experimentally proved activity in .txt format. - 2.4. Balanced_dataset - - Represents a balanced dataset of CPPs and non-CPPs; often used for model benchmarking. + + 2.3. Experimental_high_uptake + + Contains CPP sequences with high (but not stated) uptake in .txt format. + + 2.4. Balanced_dataset + + Represents a balanced dataset of CPPs and non-CPPs; often used for model benchmarking. ### 3. Non-CPPs Contains negative CPP samples in .txt format. - 3.1. Generated - - Contains randomly generated sequences treated as negative. + 3.1. Generated + + Contains randomly generated sequences treated as negative. - - 3.2. Experimental - - Contains non-CPP sequences shown not to demonstrate activity experimentally. + + 3.2. Experimental + + Contains non-CPP sequences shown not to demonstrate activity experimentally. ### 4. Non-Natural CPPs Contains CPPs consisting of non-natural amino acids. - 4.1. CPPBase_modified - - Contains a list of modified CPPs with experimentally proved activity in .fasta format. + 4.1. CPPBase_modified + + Contains a list of modified CPPs with experimentally proved activity in .fasta format. - - 4.2. CPPBase_modified_symbols - - Contains a list of abbreviations for modified amino acids in .txt format (ABBREVIATION: NAME; ...: ...). + + 4.2. CPPBase_modified_symbols + + Contains a list of abbreviations for modified amino acids in .txt format (ABBREVIATION: NAME; ...: ...). ## Useful tools :bookmark_tabs: