Modeling Drug Synergy

This repository holds the code for a machine learning-based model for predicting synergistic drug behaviors on cancer cell lines. The ML is largely performed on top of the scikit-learn library, with additional work done in numpy/scipy/pandas. Each IPython Notebook holds a different version of the model and/or a major step in data manipulation or feature engineering. Research is still in process.

Several key components include:

Cross-validation of several different regressors and classifiers, including random forest, adaboost, gradient boosting, and svms.
Drug-drug mapping through shared targets
Construction of a PPI (protein-protein interaction) network and implementation of a path-searching algorithm to map target interactions to drug interactions
Implementation of a genetic algorithm to perform parameter tuning and feature selection

The model currently achieves ~0.66 average pearson correlation for regression, and a 0.73 average classification accuracy of synergistic vs. non-synergistic compound/compound/cell line combinations (0.76 AUC, 0.79 F1).

When tasked with identifying clinically significant cases of synergy (>30% change integrated over a log2 concentration space), it achieves a classification accuracy of 0.83 and an AUC of 0.79.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Modeling Drug Synergy

Files

README.md

Latest commit

History

README.md

File metadata and controls

Modeling Drug Synergy