Skip to content

mCrozzoli/KMeans_NaiveBayes_from_SCRATCH_plus_data_preprocessing_and_EDA

Repository files navigation

Learning Data Mining

In my first exam for the Data Mining course at ITU in Copenhagen, I tackled the challenge of building KMeans and Naive Bayes algorithms from scratch, as well as pre-processing data and performing exploratory data analysis (EDA).

The primary objective of the assignment was to gain expertise in various data mining techniques. To accomplish this, I focused on:

  1. Cleaning and preparing data for analysis
  2. Understanding the data and formulating relevant questions
  3. Developing a clustering algorithm (K-means), executing it, and interpreting the outcomes
  4. Developing an unsupervised prediction algorithm (Naive Bayes), executing it, and interpreting the results

Overall, this experience helped me gain a deeper understanding of the intricacies of data mining and enabled me to hone my skills in data pre-processing, EDA, and algorithm development.

About

Learning data pre-processing, EDA, KMeans and Naive Bayes (from scratch!)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published