UrbsLab

All

25 repositories

scikit-FIBERS
Public
scikit-FIBERS (Feature Inclusion Bin Evolver for Risk Stratification) is a scikit-learn compatible machine learning algorithm for modeling or feature learning in survival analyses where feature 'burden' may be predictive of risk strata. Originally designed to identify amino-acid positions where mismatch burden predicts kidney graft failure risk.
Jupyter Notebook
•
MIT License
•2•2•0•0•Updated Nov 4, 2024Nov 4, 2024
scikit-rebate
Public
A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
Python
•
MIT License
•73•1•0•0•Updated Oct 28, 2024Oct 28, 2024
STREAMLINE
Public
Simple Transparent End-To-End Automated Machine Learning Pipeline for Supervised Learning in Tabular Binary Classification Data
data-science data-visualization feature-selection statistical-analysis supervised-learning imputation binary-classification model-application automl-pipeline machine-learning
Jupyter Notebook
•
GNU General Public License v3.0
•9•68•2•0•Updated Aug 14, 2024Aug 14, 2024
scikit-ExSTraCS
Public
A scikit-learn implementation based on ExSTraCS 2.0
Jupyter Notebook
•
GNU General Public License v3.0
•1•7•0•2•Updated Jun 17, 2024Jun 17, 2024
scikit-eLCS
Public
A scikit-learn-compatible Python implementation of eLCS, a supervised learning variant of Learning Classifier Systems
Jupyter Notebook
•
GNU General Public License v3.0
•8•19•1•2•Updated Jun 17, 2024Jun 17, 2024
scikit-XCS
Public
scikit learn compatible implementation of XCS, the most popular and best studied learning classifier system algorithm to date.
Jupyter Notebook
•
GNU General Public License v3.0
•8•13•0•3•Updated Jun 17, 2024Jun 17, 2024
GAMETES
Public
Source code for the Genetic Architecture Model Emulator for Testing and Evaluating Software (GAMETES) is an algorithm for the generation of complex single nucleotide polymorphism (SNP) models for simulated association studies.
Java
•1•4•0•0•Updated Jun 11, 2024Jun 11, 2024
survival-LCS
Public
Python
•
GNU General Public License v3.0
•0•0•0•0•Updated Apr 12, 2024Apr 12, 2024
I2C2-Documentation
Public
Documentation and informational resources for LPC use
Python
•
MIT License
•8•2•0•0•Updated Nov 27, 2023Nov 27, 2023
LCS-Visualization-Pipeline
Public
LCS Discovery and Visualization Environment (LCS-DIVE)
Python
•
GNU General Public License v3.0
•1•3•0•4•Updated Jul 19, 2023Jul 19, 2023
FIBERS
Public
Feature Inclusion Bin Evolver for Risk Stratification (FIBERS) is an evolutionary algorithm that constructs bins of features, seeking to optimize the bins' stratification of event risk over time.
Python
•
GNU General Public License v3.0
•0•0•0•0•Updated May 5, 2023May 5, 2023
scikit-RARE
Public
scikit-RARE is scikit compatible pypi package for the RARE (Relevant Association Rare-variant-bin Evolver) evolutionary algorithm.
Python
•
MIT License
•0•0•0•0•Updated Mar 3, 2023Mar 3, 2023
RARE
Public
RARE: Relevant Association Rare-variant-bin Evolver (under development); an evolutionary algorithm approach to binning rare variants as a rare variant association analysis tool. Applications, visualizations, and modifications currently in works.
Python
•
GNU General Public License v3.0
•1•0•0•0•Updated Jun 30, 2022Jun 30, 2022
scikit-ExSTraCS-RuleInit
Public
Experimental variation of scikit-ExSTraCS that allows the user to import an initial rule population that will get initially evaluated and assigned fitness values prior to the start of learning iterations. This allows for the import of manually curated expert knowledge derived rules, or rules derived from other sources.
Jupyter Notebook
•
GNU General Public License v3.0
•3•0•0•0•Updated May 11, 2022May 11, 2022
AutoMLPipe-BC
Public
An automated, rigorous, and largely scikit-learn based machine learning analysis pipeline for binary classification. Adopts current best practices to avoid bias, optimize performance, ensure replicatability, capture complex associations (e.g. interactions and heterogeneity), and enhance interpretability. Includes (1) exploratory analysis, (2) da…
Jupyter Notebook
•
GNU General Public License v3.0
•1•7•0•0•Updated May 7, 2022May 7, 2022
auto_term_harm_pipe
Public
A set of Python-based Jupyter notebooks illustrating a documented example of a semi-automated term harmonization pipeline applied to harmonizing medical history terms across 28 clinical trials of pulminary arterial hypertension
Jupyter Notebook
•
GNU General Public License v3.0
•0•0•0•0•Updated Oct 6, 2021Oct 6, 2021
scikit_ML_Pipeline_Binary_Notebook
Public
An (updated and expanded) rigorous, well documented machine learning analysis pipeline for binary classification datasets assembled as a Jupyter Notebook. Includes exploratory analysis, data processing, feature processing, ML modeling (13 algorithms) with hyperparameter sweeps, visualizations, and statistical analysis. A comprehensive starting p…
Jupyter Notebook
•
GNU General Public License v3.0
•6•10•0•0•Updated Jun 16, 2021Jun 16, 2021
PyKE_ExpertSystem_Example_BMIN520
Public
Example PyKE code and Jupyter Notebook for a simple backwards chaining expert system as described in this lecture on YouTube: https://www.youtube.com/watch?v=mzsk5_EmZq8
Jupyter Notebook
•
GNU General Public License v3.0
•8•24•0•0•Updated May 24, 2021May 24, 2021
scikit_ML_Pipeline_Binary_Parallel
Public
An rigorous, machine learning analysis pipeline for binary classification datasets assembled as parallelizable command line modules. Includes exploratory analysis, data processing, feature processing, ML modeling (11 algorithms) with hyperparameter sweeps, visualizations, and statistical analysis. A comprehensive starting point to adapt to your …
Python
•
GNU General Public License v3.0
•0•0•0•0•Updated Apr 23, 2021Apr 23, 2021
gametes_archive_gen
Public
Python scripts to generate an diverse archive of simulated SNP datasets using GAMETES
Python
•
GNU General Public License v3.0
•0•0•0•0•Updated Dec 3, 2020Dec 3, 2020
GP-LCS
Public
Supplemental materials and code for our GP-LCS project, adapting ExSTraCS to evolve GP trees rather than rules for comparison to other stand-alone GP algorithms
Python
•
GNU General Public License v3.0
•0•0•0•0•Updated Sep 15, 2020Sep 15, 2020
ExSTraCS_ML_Pipeline_Binary_Notebook
Public
An rigorous, well documented machine learning analysis pipeline for binary classification datasets assembled as a Jupyter Notebook. Includes exploratory analysis, data processing, feature processing, ML modeling (9 algorithms, including the original ExSTraCS algorithm) with hyperparameter sweeps, visualizations, and statistical analysis. A compr…
Python
•
GNU General Public License v3.0
•3•10•1•0•Updated Sep 1, 2020Sep 1, 2020
Pancreatic_Cancer_ML_Notebook_Analysis
Public
Code and results for an investigation of pancreatic cancer datasets applying our binary classification machine learning analysis pipeline notebook. Includes analysis and comparison of three pancreatic cancer datasets.
Jupyter Notebook
•
GNU General Public License v3.0
•3•3•0•0•Updated Aug 26, 2020Aug 26, 2020
ML_Pipeline_Notebooks
Public
This repository includes educational materials on machine learning and a basic example machine learning analysis pipeline. These materials were originally developed for a workshop series at the University of Pennsylvania.
HTML
•
GNU General Public License v3.0
•7•8•0•0•Updated May 21, 2020May 21, 2020
independent-study-18fall
Public
Assembly of Jupyter notebooks comprising basic machine learning pipeline tasks. This student driven, independent study project will eventually evolve into a user-friendly starting point for ML pipeline example notebooks.
Jupyter Notebook
•
GNU General Public License v3.0
•1•1•0•0•Updated Oct 15, 2018Oct 15, 2018