Machine Learning Project @FCUL

In this project, our group had to use Python 3 and Jupyter Notebook, together with Scikit-learn, Orange3, or both, to perform a ML task.

The dataset analysed was the Donors_dataset.csv, downloaded from Kaggle: Donors-Prediction.

In this project, our team was supposed to use only tabular data (not Images or Image Metadata) and see how far we could go in predicting donations and understanding the donors. We had to use both supervised and unsupervised learning to tackle 2 tasks:

Task 1 (Supervised Learning) - Predicting Donation and Donation Type
Task 2 (Unsupervised Learning) - Characterizing Donors

An important preliminary step, consisted on Data Cleaning and Preprocessing. The following had to be considered:

Data can contain errors/typos, whose correction might improve the analysis.
Some features can contain many values, whose grouping in categories (aggregation into bins) might improve the analysis.
Data can contain missing values, that you might decide to fill. You might also decide to eliminate instances/features with high percentages of missing values.
Not all features are necessarily important for the analysis.
Depending on the analysis, some features might have to be excluded.
Class distribution is an important characteristic of the dataset that should be checked. Class imbalance might impair machine learning.

This project includes all necessary files, including the dataset (Donors_dataset.csv), Jupyter Notebook (AA_202021_Final_Project_Group25.ipynb) and several Orange3 files.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Orange3		Orange3
AA_202021_Final_Project_Group25.ipynb		AA_202021_Final_Project_Group25.ipynb
DONATION_TYPE_CN2.ows		DONATION_TYPE_CN2.ows
DONATION_TYPE_CN2_V2.ows		DONATION_TYPE_CN2_V2.ows
DONATION_TYPE_CN2_V3_redux.ows		DONATION_TYPE_CN2_V3_redux.ows
DONATION_TYPE_CN2_V4_redux.ows		DONATION_TYPE_CN2_V4_redux.ows
DONATION_TYPE_Orange3.csv		DONATION_TYPE_Orange3.csv
Donors_dataset.csv		Donors_dataset.csv
README.md		README.md
TARGET_B_CN2.ows		TARGET_B_CN2.ows
TARGET_B_Orange3.csv		TARGET_B_Orange3.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Machine Learning Project @FCUL

About

Releases

Packages

Languages

milcs40/MachineLearningProject_FCUL

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Project @FCUL

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages