0% found this document useful (0 votes)
141 views

Capstone Project Weekly Progress Report

This weekly progress report summarizes the work done by a student group on their capstone project for customer churn prediction. In the reporting week, they performed data cleaning and initial exploratory data analysis on their dataset containing over 7000 customer records. Their analysis found patterns in customer demographics, services used, and contract and payment types. In the next week, they plan to continue exploratory analysis, detect links between features and churn, use visualization to explore these relationships, and perform feature extraction. They encountered difficulties clearly linking monthly charges to churn and analyzing new customers.

Uploaded by

Rohit N
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
141 views

Capstone Project Weekly Progress Report

This weekly progress report summarizes the work done by a student group on their capstone project for customer churn prediction. In the reporting week, they performed data cleaning and initial exploratory data analysis on their dataset containing over 7000 customer records. Their analysis found patterns in customer demographics, services used, and contract and payment types. In the next week, they plan to continue exploratory analysis, detect links between features and churn, use visualization to explore these relationships, and perform feature extraction. They encountered difficulties clearly linking monthly charges to churn and analyzing new customers.

Uploaded by

Rohit N
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

AML-2404 AI/ML Capstone Project

Capstone Project Weekly Progress Report

Project Title Customer Churn Prediction Project

Group Name C

Student Nehalkumar Jesadiya c0793769, Rohit Nanawati c0796684, Shah Razzakh


names/Student IDs Mohammed c0794302, Sharjeel Ahmed c0806695

Reporting Week 29-May-2021 to 04-June-2021

Faculty Supervisor Dr.Parisa Naraei

1. Tasks Outlined in Previous Weekly Progress Report (Provide detailed information on the
tasks to be completed in this week)
The dataset we obtained has 7043 records with 21 features. We have performed Data
Wrangling and initiated the Exploratory Data Analysis to enable:
1. Pro- active retention strategy as opposed to reactive one.
2. Insights on subscriber’ churn behavior
3. Enable the company to move from business rules-based campaigns to analytics led campaigns.
4. Enable reduction in marketing spend.

We plan to proceed further with the EDA to gain more insights from the Data and use
some Data Visualizations to find the dependencies of ‘Churn’ on features.

2. Progress Made in Reporting Week (Provide detailed information on the progress that you
made in the reporting week. Limit your write-up to no more than two page)

This week we performed cleaning for our obtained dataset and initiated our Exploratory Data
Analysis.

We have dropped the ‘CustomerID’ from our dataset as it simply gives unique identity and has
no importance in our analysis. We have indexing in our DataFrame for this purpose.

Our dataset surprisingly has no missing values. Hence, no processing had to be done for it.

When we started to explore the data, we found out:

• ‘SeniorCitizen’ is actually a categorical hence the 25%-50%-75% distribution is not


proper.
• 75% customers have tenure less than 55 months.
• Average Monthly charges are USD 64.76 whereas 25% customers pay more than USD
89.85 per month.

@2021 Lambton College in Toronto Page 1


AML-2404 AI/ML Capstone Project

On analyzing the ‘Churn’, we reached the following conclusion:

• Data is highly imbalanced, ratio = 73:27


• So, we analyze the data with other features while taking the target values separately to
get some insights.

Now, we further proceeded with analyzing our data and results are:

• Gender: The customer distribution is almost equal with ratio 1.01:1.


• Partner: This category is also almost equal with ratio 1.07:1.
• Dependent: The distribution for this category is uneven and measured 2.33:1 [No:Yes].
• PhoneService: Almost all the customers use this service. Only 1 out of 10 customers
does not use this service.
• InternetService: 43% users have a Fibre Optic connection as compared to 34% having
DSL. And 23% not subscribed to the Internet Service.
• OnlineSecurity: Out of the customers subscribed to the internet service, only 37% have
subscribed to this service.
• OnlineBackup: Out of the customers subscribed to the internet service, 44% customers
have subscribed to this service.
• DeviceProtection: Out of the customers subscribed to the internet service, 44%
customers have subscribed to this service.
• TechSupport: Out of the customers subscribed to the internet service, 37% customers
have taken some sort of Technical Support.
• StreamingTV: Out of the customers subscribed to the internet service, 49% customers
have subscribed to this service.
• StreamingMovies: Out of the customers subscribed to the internet service, 50%
customers have subscribed to this service.
• Contract: 55% customer are on a Month-to-Month contract as compared to 21%
subscribed to a Yearly and 24% to Bi-Yearly contract.
• PaperlessBilling: Majority of customer have agreed to paperless billing while 40% still
going the traditional way.
• PaymentMethod: 33% uses Electronic Check, 23% Mailed Check, 22% Auto Bank
Transfer, 22% Auto Credit Card. Comprising of 44% customer opting for Auto Debit
option.
• SeniorCitizen: Only 16% customer falls under this category.
• Tenure: This is quite unevenly distributed as the provider has been in the market for a
considerable amount of time. But the majority of the customers still are either new or
already using it for more than 70 months.

@2021 Lambton College in Toronto Page 2


AML-2404 AI/ML Capstone Project
3. Difficulties Encountered in Reporting Week (Provide detailed information on the difficulties
and issues that you encountered in the reporting week. Limit your write-up to no more than
one page)

The analysis till now has provided us with trivial links of features to the churning. Although it has
helped us understand the customer base, nature of the customers and their choice preferences
as well as behavior to some extent. We could not detect any specific pattern with the monthly
charges. The following are the problem we need to find solution for:

1. Figure out the pattern with the monthly charges.


2. How to analyze the customers that are newly acquired.
3. Find concrete links of these features to our target variable i.e Churning.

4. Tasks to Be Completed in Next Week (Outline the tasks to be completed in the following
week)

The coming week, our analytics team is planning to proceed further with our Data Exploration
and Analysis. We also intend to plot some graphs and figure out the reasons behind Churning
with the help of Data Visualization.

1. Perform the Exploratory Data Analysis.


2. Detect the link between Churning and the features.
3. Use Data Visualization to extract the relation between Churning and each feature.
4. Use the extracted relations to perform Feature Extraction.

@2021 Lambton College in Toronto Page 3

You might also like