Skip to content

This repository contains a Jupyter Notebook that focuses on Churn Analysis within the E-Commerce sector. The project aims to predict customer churn using various machine learning algorithms and techniques such as data preprocessing, feature selection, and model optimization.

Notifications You must be signed in to change notification settings

matthieukhl/e_commerce_churn_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

E-Commerce Churn Analysis

Description

This repository contains a Jupyter Notebook that focuses on Churn Analysis within the E-Commerce sector. The project aims to predict customer churn using various machine learning algorithms and techniques such as data preprocessing, feature selection, and model optimization.

Objective

The primary objective of this project is educational. I've undertaken this analysis to practice my data science and machine learning skills. The project serves as a valuable addition to my portfolio, demonstrating my capability to handle end-to-end data science projects.

Technologies Used

  • Python
  • Jupyter Notebook
  • Pandas
  • Scikit-Learn
  • Matplotlib
  • Seaborn
  • XGBoost

Steps Involved

  1. Data Loading and Exploration: Loaded the dataset and performed initial exploration to understand the data.
  2. Data Cleaning: Removed missing or irrelevant information.
  3. Data Preprocessing: Transformed categorical variables into dummy variables.
  4. Feature Selection: Used Recursive Feature Elimination for feature selection.
  5. Model Training: Utilized Logistic Regression for the initial model training.
  6. Model Optimization: Applied GridSearchCV for hyperparameter tuning and used XGBoost for ensemble learning.
  7. Evaluation and Interpretation: Evaluated the model using various metrics and interpreted the results.

How to Use

  1. Clone this repository.
  2. Open the Jupyter Notebook to view the code and explanations.

Feel free to explore the notebook and provide any feedback or contributions. Thank you!

About Dataset

The dataset used for this project is sourced from Kaggle and can be found here. It provides various features that are significant for predicting customer churn in the E-Commerce sector.

About

This repository contains a Jupyter Notebook that focuses on Churn Analysis within the E-Commerce sector. The project aims to predict customer churn using various machine learning algorithms and techniques such as data preprocessing, feature selection, and model optimization.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published