Skip to content

Commit

Permalink
Merge pull request #2 from StephenSzwiec/dev
Browse files Browse the repository at this point in the history
Dev
  • Loading branch information
StephenSzwiec authored Oct 18, 2023
2 parents 626937d + 9b39901 commit 21c5cb2
Show file tree
Hide file tree
Showing 158 changed files with 15,796 additions and 2,626 deletions.
71 changes: 66 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,69 @@
# pyQSARplus
Library of tools for the analysis of QSAR/QSPR datasets and models.
# qsarify

# What is included?
-----------------
qsarify is a library of tools for the analysis of QSAR/QSPR datasets and models. This library is intended to be used to produce models which relate a set of calculated chemical descriptors to a given numeric endpoint. Many great tools will take the geometry or string data of a given chemical and compute **descriptors**, which are numeric measures of the properties of these, but you can generate some of these with another one of my scripts, [Free Descriptors](https://github.com/StephenSzwiec/free_descriptors).

- Data preprocessing: `data_tools`
# Dependencies

- Python 3
- [numpy](https://numpy.org/)
- [pandas](https://pandas.pydata.org/)
- [scikit-learn](https://scikit-learn.org)
- [matplotlib](https://matplotlib.org)


# Installation

`pip install qsarify`

# What is included right now?

- Data preprocessing tools: `data_tools`
- Dimensionality reduction via clustering: `clustering`
- Feature selection:
- Single threaded: `feature_selection_single`
- Multi-threaded: `feature_selection_multi`
- Model Export and Visualization: `model_export`
- Cross Valiidation: `cross_validation`

# How to use

The best way to learn how to use this library is to look at the example notebook in the `examples` folder. This notebook will walk you through the workflow of using this library to build a QSAR model.

# Future Plans

- Massively parallel feature selection methods:
- CUDA acceleration
- MPI acceleration
- Include Shannon Entropy as a dimensionality reduction metric in clustering
- Embedded kernel methods
- More visualization tools
- More cross validation tools
- Feature selection tools for categorical data

# Contributing


If you would like to contribute to this project, please feel free to fork this repository and submit a pull request. Otherwise, you may also submit an issue. I will try to respond to issues as quickly as possible.

# License


This project is licensed under the GNU GPLv3 license. See the LICENSE file for more details.

# Citation

If you use this library in your work, please cite it as follows:

Szwiec, Stephen. (2023). qsarify: A high performance library for QSAR model development.

BibTex:
```
@misc{szwiec2023qsarify,
author = {Szwiec, Stephen},
title = {qsarify: A high performance library for QSAR model development},
year = {2023},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/stephenszwiec/qsarify}},
}
```
1,780 changes: 0 additions & 1,780 deletions Untitled.ipynb

This file was deleted.

1 change: 0 additions & 1 deletion __init__.py

This file was deleted.

240 changes: 0 additions & 240 deletions cross_validation.py

This file was deleted.

Binary file added dist/qsarify-0.1-py2.py3-none-any.whl
Binary file not shown.
Binary file added dist/qsarify-0.1.tar.gz
Binary file not shown.
Loading

0 comments on commit 21c5cb2

Please sign in to comment.