DrugEx Version 3.4.0
Change Log
From v3.3.0 to v3.4.0
Fixes
None.
Changes
Major refactoring of drugex.training
-
Moving generators from
drugex.training.models
todrugex.training.generators
, and harmonizing and renaming themRNN
->SequenceRNN
GPT2Model
->SequenceTransformer
GraphModel
->GraphTransformer
-
Moving explorers from
drugex.training.models
todrugex.training.explorers
, harmonizing and renaming themSmilesExplorerNoFrag
->SequenceExplorer
SmilesExplorer
->FragSequenceExplorer
GraphExplorer
->FragGraphExplorer
-
Removal of all obsolete modules related to the two discontinued fragment-based LSTM models from DrugEx v3.
-
The generators'
sample_smiles()
has been replaced by agenerate()
function -
Clafification of the terms qualifying the generated molecules to have the following unique and constant definitions (replacing ambigous
VALID
andDESIRE
terms)Valid
: molecule can be parsed with rdkitAccurate
: molecule contains given input fragmentsDesired
: molecule fulfils all given objectives
-
Revise implementation of Tanimoto distance-based Pareto ranking scheme(
SimilarityRanking
) to correspond to the method described in DrugEx v2. Add option to use minimum Tanimoto distance between molecules in a front instead the mean distance. -
Remove all references to NN-based RAscore (already discontinued)
Refactoring of CLI
- Refactoring
dataset.py
andtrain.py
to object based - Writting a single
.txt.vocab
file per dataset preprocessing instead of separate (duplicate) files for each subset indataset.py
Removed
--save_voc
argument indataset.py
as redundant--pretrained_model
argment intrain.py
(merged with--agent_path
)memory
parameter and all associated code from inSequenceRNN
New Features
- GRU-based RNN added to the CLI
- added another possible implementation of similarity ranking (
MutualSimilaritySortRanking
), this is based on the code in the original repository of DrugEx