You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Assess different features that can be generated from RDkit.
For example, convert the SMILES to morgan fingerprint as features, and then use a simple neural network to perform prediction. Assess the performance using testing data. Compare the performance with what is reported in kaggle currently so we know how much to improve.
Paste the code here.
The text was updated successfully, but these errors were encountered:
I have generated morgan fingerprint, protein name (one hot encoding) and binds (labels) for all train and validation data as numpy array format, into trunks. Each trunk has 500,000 rows, the data are located in "/mnt/isilon/wang_lab/shared/Belka/analysis/morgan" and "/mnt/isilon/wang_lab/shared/Belka/analysis/morgan_validation"
Assess different features that can be generated from RDkit.
For example, convert the SMILES to morgan fingerprint as features, and then use a simple neural network to perform prediction. Assess the performance using testing data. Compare the performance with what is reported in kaggle currently so we know how much to improve.
Paste the code here.
The text was updated successfully, but these errors were encountered: