SageML is an automated machine learning pipeline that democratizes ML by providing a unified, end-to-end solution for data processing, analysis, and model building. It eliminates the complexity of traditional ML workflows, making machine learning accessible to users without advanced technical expertise.
- Multi-format Data Support: Process PDF, DOCX, XLSX, HTML, and Images
- Automated Data Pre-processing: Intelligent handling of missing values, outliers, and feature engineering
- Advanced NLP Capabilities: Built-in text processing with tokenization, lemmatization, and specialized analysis
- Automated Model Selection: Smart algorithm selection and hyperparameter optimization
- User-friendly Interface: Simple PyQt-based GUI for seamless interaction
- Windows 10/11 (64-bit)
- Minimum 8GB RAM
- 2GB free disk space
- Screen resolution: 1280x720 or higher
- Download the latest release from the Releases page
- Extract the ZIP file to your desired location
- Run
SageML.exe
from the extracted folder
- Launch SageML by double-clicking
SageML.exe
- Click "Select Files" to import your dataset
- Select the Data Type of the dataset (Structured/Unstructured)
- Select the Model Type for the dataset (Regression/Classification/Clustering/NLP)
- Click "Start Training" to begin the automated ML pipeline
SAGE saves two files for each analysis:
model.joblib
: Serialized machine learning model that can be loaded for predictionsmodel_specs.json
: Comprehensive model specifications including:- Model type and parameters
- Performance metrics
- Preprocessing steps
- Feature importance
- Creation date and dataset information
- Research Analysis: Process research papers and extract meaningful insights
- Business Intelligence: Analyze business data for pattern recognition and prediction
- Educational: Learn about ML workflows and model performance
- Data Science: Rapid prototyping and baseline model creation
- 95% accuracy in text extraction from PDFs and images
- 40% reduction in data cleaning time
- 50% faster hyperparameter optimization
- 30% overall workflow speedup
While the source code is private, we welcome:
- Feature requests
- Bug reports
- Documentation improvements
- Use case suggestions
Please use the Issues section for any contributions.
This project is licensed under the MIT License - see the LICENSE file for details.
For support or queries:
- Create an issue on GitHub
- Connect with us on LinkedIn