From 42d52a65fa3cb5e4e4c22ae494620bc44b719a41 Mon Sep 17 00:00:00 2001 From: ACID Design Lab <82499756+acid-design-lab@users.noreply.github.com> Date: Fri, 7 Jun 2024 00:27:15 +0300 Subject: [PATCH] Update README.md --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 18ad76e..b719667 100644 --- a/README.md +++ b/README.md @@ -17,6 +17,7 @@ ### Project pipeline Here are the main steps which will allow you to build a precise model for CPP design: + **1. Data curation and cleaning.** All inappropriate or ambiguous data should be removed or corrected. **2. Data unification.** The data presented in Datasets are heterogeneous and should be unified in terms of variables, measurement units etc. **3. System parametriation.** You need to choose the set of parameters to describe CPPs as well as experimental setup. Most of the models use symbolic representations lacking physico-chemical properties crucial for CPP activity prediction.