-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to execute #13
Comments
Hi, I looked at your file and it seems the main issue was that you did not change the task of the data set. You need to run:
This will make the data suitable for a classification task. Here is an executable Python script I compiled from your notebook that does the training on your data and contains solutions to some of the errors (again had to change the extension): Thanks for the interest in the framework and let us know how it went! |
thanks Martin for your kind assistance. certainly i will update you. Today I am going to refine the work as per your valuable suggestion and provide you the details. Thanks again for your time. Thanks |
after issuing following codes i am able to generate classification model model = SklearnModel( CrossValAssessor("roc_auc")(model, dataset) TestSetAssessor("roc_auc")(model, dataset) but i found Mean ROC (AUC= 0.68) i think i need to use TopologicalFP
) dataset.getDF().head() BUT I AM ENCOUNTERED BY SOME ERROR stated bellowArgumentError Traceback (most recent call last) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/tables/qspr.py:875, in QSPRDataset.prepareDataset(self, smiles_standardizer, data_filters, split, feature_calculators, feature_filters, feature_standardizer, feature_fill_value, applicability_domain, drop_outliers, recalculate_features, shuffle, random_state) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/tables/qspr.py:802, in QSPRDataset.addFeatures(self, feature_calculators, recalculate) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/tables/qspr.py:525, in QSPRDataset.addDescriptors(self, descriptors, recalculate, featurize, *args, **kwargs) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/tables/mol.py:831, in MoleculeTable.addDescriptors(self, descriptors, recalculate, fail_on_invalid, *args, **kwargs) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/tables/mol.py:633, in MoleculeTable.processMols(self, processor, proc_args, proc_kwargs, add_props, as_rdkit, chunk_size, n_jobs) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/tables/mol.py:544, in MoleculeTable.runMolProcess(cls, props, func, add_rdkit, smiles_col, *args, **kwargs) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/descriptors/fingerprints.py:72, in Fingerprint.call(self, mols, props, *args, **kwargs) File ~/miniconda3/envs/drugex/lib/python3.11/site-packages/qsprpred/data/descriptors/fingerprints.py:197, in TopologicalFP.getDescriptors(self, mols, props, *args, **kwargs) ArgumentError: Python argument types in Please Help. |
Hi, this looks like something might have changed in how the rdkit function is supposed to be called. Can you let me know the version of rdkit that you have installed? This command should do it: |
(drugex) sneha@sneha:~$ pip freeze | grep rdkit |
Hi @snehachem, sorry for the delay. I have been fairly busy lately. I tried to run the workflow with topological fingerprints instead and it worked just fine. Can you confirm that you still get the error even when using this notebook? I did not find any issues with calculating those descriptors and my environment looks like this:
If you do not find a fix in this shared notebook, please, share the notebook you are trying to execute with the input files as well and I will try to debug it. |
One thing I missed before, you are calling the fingerprint class like this: |
thanks, i will update accordingly |
During my modeling,I faced similar issues. I am able to solved them on the basis of said suggestions. I am interested to build regression models. I have few quarries
|
IKK_refine.csv
I am trying my best to execute this tool. i am trying to build my model using my data. Whenever i use tutorial data it works. However getting confused with my customized data curated from chembl data base. I want to build a random Forest classification model. the steps I followed as per the tutorial given bellow
I attached the necessary files in HTML format (HTML can not be attached, so i changed .HTML to .csv, After reversing one can view the file)
What is the straightforward way to build random forest model, which I want to use for DrugEx
QSAR_Pred_2.csv
Attached Raw input data
The text was updated successfully, but these errors were encountered: