Skip to content

K-mart sales strategy for the year 2020 based on the insights from the sales data in 2019.

Notifications You must be signed in to change notification settings

OmerTDK/E-commerce-Data-Analysis-for-Sales-Strategy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

E-commerce Data Analysis for Sales Strategy

In their annual sales review meeting, Kmart, one of the largest online retailers in the US, must decide on their sales strategy for 2020 based on insights from the 2019 sales data.

This project focuses on generating key insights regarding Kmart's sales for each month of 2019. The data will be used to assist Kmart's sales team in fine-tuning sales strategies as the year progresses.

Business Problem Statements

Observations on the following are made:

  1. What was the best month for sales?
  2. Which city had the highest number of sales?
  3. Best time to display advertising to maximize sales?
  4. Best-selling product & Why?
  5. What products are most often sold together?

Dataset

  • Data belongs to Kmart: A leading online retailer in the US
  • Time Period :  January 2019  -  December 2019
  • Unique Products:  19
  • Total Orders :  178,437
  • Cities :  9
  • KPI’s : Total Sales, Total Products Sold

Data Analysis Using Python

  1. Loaded data for each month and created a data frame using Pandas.
  2. For 2019 sales, an aggregated dataset was created by concatenating multiple datasets together.
  3. Data handling for null values and junk data.
  4. Preprocessed data to make a filtered dataset.
  5. Business problem analysis and their solutions. (visualizations using matplotlib and seaborn library)

What was the best month for sales?

  • Create a new dataset with all records grouped by month
  • Plot the graph using matplotlib

image

image

Which city had the highest number of sales?

  • Extract the “Purchase Address” column containing the city information into a separate dataframe.
  • Group this dataframe by City and each group will have a sum of all the sales in that city.
  • Visualize the graph using matplotlib

imageimage

Best time to display advertising to maximize sales?

  • Extract the Hours from the Order Date.
  • Group the data by Hours and depict the graph using matplotlib.

image

Best-selling product & Why?

  • Determine the sum of the "Quantity Ordered" by grouping by "Product".
  • Visual representation of the Quantity Ordered for each Product.

image

  • Let us also see a graphic representation of the Prices for each product grouped by Product.

image

What products are most often sold together?

  • Group the product by the Order ID to know which products were sold together.
  • Find the duplicate values of the “Order ID” by using the .duplicated() method.
  • Using .transform() method, create a new column called "Grouped" to combine values from multiple rows into one.
  • Drop the duplicates created when products were merged for each order ID.
  • Display the top 5 products most often sold together.

image

Tools Used

Jupyter Notebook Python Pandas Matplotlib Plotly

  • Jupyter Notebook is used as IDE.
  • Among the Python libraries, Pandas is used for handling and preprocessing data.
  • Plotly, Seaborn, and Matplotlib are used for visualizing plots.

For more details, please go through the Jupyter Notebook attached above.

Conclusion

  • The analysis above clearly illustrates that month 12 (December) boasts the maximum sales in 2019 with roughly $9,226,886.
  • According to the graph, San Francisco has the largest number of sales.
  • The optimum time to display advertising to increase the probability of buyers purchasing the product/s is shortly before 12 pm and/or right before 7 pm.
  • The top selling product is 'AAA Batteries (4-pack)'. The top selling products seem to have a correlation with the price of the product. The cheaper the product, higher the quantity ordered and vice versa.

About

K-mart sales strategy for the year 2020 based on the insights from the sales data in 2019.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published