Nowadays almost every streaming / service providing companies has weekly/monthly/yearly subscription plans , So it is very necessary to retain a customer. If there are high chances that a customer is goingto leave the subscription then the company can come with some strategy to avoid it .This can be predicted with the help of churn prediction , and here wehave a churn prediction problem problem of a music streamingcompany called kkbox where the aim is to predict whether the customer will churn the subscription after one months or not , orbasically the customer will renew subscription in 30 days or not .Some of the important features to determine churn rate aretransaction date , membership expiration rate and is_cancel , whereis_cancel is our target variable .Date is a time series data , divided over the period of months ,Train data is the data of the customer whose subscription is expiring inmonth of february,2017 and test data dataset contains the data hosesubscription expires in month of march,2017 so we are to see churnrenewal for train data in month of march,2017 and churn renewal fortest data in month of april ,2017.User logs , information of members and transaction details are given inthe dataWe have to predict whether the customer will churn or not (0 or 1) .Once the company knows the customer is going to churn it can beavoided by some marketing techniques etc.
- https://www.kaggle.com/c/kkbox-churn-prediction-challenge/overview
- https://www.kaggle.com/c/kkbox-churn-prediction-challenge/data
- Member_v3 - Information about the customers (customer_id,gender,Payment method,Payment date)
- Information about behaviour of different user (number of uniquesongs , different percentiles of how many songs played , total songs played)
- Transaction - Information regarding the each transaction and its details(payment_method,plan_price , validity , is_autorenew, membership expirationdate , is_cancel)
- Train - (customer_id,is_churn)
- Sample_Submission_zero - The predicted Values that it will churn or not
Rest 4 files are User_logs,transaction, train, sample_submission but it's the data for the predicted renewal month