PHASE 3

MODEL PLANNING AND BUILDING

Prepared by: shouq alrahma 202102114, mariam alhammadi 202118366, maitha alateeqi 202117547

Prepared for: Mohammad Tubaishat

Importing data and downloading necessary files:

Inspecting the first few rows of the telecom data with `head` gives an
early indication of the factors that may be related to customer churn.
The customers shown with short tenure (1-2 months) have a churn status
of "Yes", whereas a customer who has been with the company for 45
months has not churned. The customers on month-to-month agreements in
this sample also churn more often than those on one-year contracts.
Payment method may matter as well: three customers who paid by
Electronic check have left, which points to a possible link between
this payment method and churn. Customers without extra features such as
Technical Support, Device Protection, and Streaming TV appear more
likely to cancel, suggesting that bundling services could improve
retention. A handful of rows cannot establish these patterns, but they
motivate examining tenure, contract type, payment method, and extra
services in the analysis that follows.

In this step, we prepared the data for analysis by converting the target
variable Churn into a factor, which is essential for classification
tasks. The Churn column, initially containing "Yes" and "No" values as
strings, was transformed into a factor so that the model interprets it
as a categorical variable with two levels: customers who have left
("Yes") and those who have stayed ("No"). With Churn set as a factor,
the model treats it as a binary outcome and fits a classification tree,
which keeps the predictions and their interpretation in terms of the
two churn classes.

In this phase, we proceed with Model Planning and Building by first
dividing our data into training and testing sets, which is essential for
evaluating the model's performance. We set a random seed
(set.seed(123)) to ensure reproducibility, so that the split is the same
each time the code runs. Using a 70-30 split ratio, we assign 70% of the
data to the training set (used to build the model) and the remaining 30%
to the testing set (used to evaluate the model's accuracy). Because the
model never sees the testing set during training, its accuracy there is
a more reliable measure of how it will perform on unseen data.
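
The split described above can be sketched as follows. This is a minimal
illustration in Python rather than the R code the report used. The
function name `split_indices` is hypothetical, the row count of 7,043 is
inferred from the 4,930 training rows plus the 2,113 test cases that the
confusion matrix cells add up to, and a Python seed of 123 does not
select the same rows as R's set.seed(123).

```python
import random


def split_indices(n_rows, train_frac=0.7, seed=123):
    """Return (train, test) row indices for a seeded random split."""
    rng = random.Random(seed)          # fixed seed -> same split on every run
    indices = list(range(n_rows))
    rng.shuffle(indices)
    n_train = int(train_frac * n_rows)  # 70% of the rows, rounded down
    return sorted(indices[:n_train]), sorted(indices[n_train:])


# 7,043 rows is an inferred total, not a figure stated in the report.
train_idx, test_idx = split_indices(7043)
print(len(train_idx), len(test_idx))
```

Every row lands in exactly one of the two sets, and calling the function
again with the same seed returns the same split.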

In this step, we constructed a Decision Tree Model for classification to
predict customer churn based on various factors. Using the rpart
function, the model analyzes the relationship between the target
variable (Churn) and 19 independent variables such as gender, tenure,
contract type, payment method, and additional services like Tech Support
and Streaming TV. The output shows the tree structure starting with
4,930 training observations at the root node, of which 1,308 are
customers who churned ("Yes") and the remaining 3,622 did not ("No"), so
the root node itself predicts "No." The model splits on key factors like
contract type, tenure, and Internet Service, progressively narrowing
down groups of customers to make more accurate predictions. For example,
customers with a month-to-month contract have a higher churn
probability, while those with longer contracts (e.g., one-year or
two-year agreements) are less likely to churn. Similarly, customers
using Fiber optic services or lacking Tech Support are identified as
high-risk groups.

In this step, we visualize our decision tree model for predicting
customer churn by plotting it. Each internal node in the tree represents
a decision point based on a feature such as "Contract,"
"InternetService," or "Tenure," and splits customers by that
characteristic to predict their likelihood of churn ("Yes" or "No").
For example, the top node (root) splits on the "Contract" feature, where
customers with month-to-month contracts are more likely to churn than
those with one-year or two-year contracts. Specifically, out of 2,706
training customers with month-to-month contracts, 1,539 have a churn
status of "No," while 1,167 have a churn status of "Yes." That is a
churn share of about 43%, compared with about 27% (1,308 of 4,930) in
the training set as a whole. The label in each node gives the predicted
churn status ("Yes" or "No"), and the numbers within each node display
the customer count and the distribution of churn outcomes. Blue nodes
generally indicate a prediction of "No" (not churning), while green
nodes indicate "Yes" (churning).

This visual representation of the decision tree helps us understand
which factors contribute most to customer churn and how customer
segments differ based on these attributes.
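
To illustrate why a split on contract type is attractive to the tree,
the sketch below computes the drop in Gini impurity for that split. It
is written in Python rather than R and rests on stated assumptions: that
the report kept rpart's default Gini criterion, that the root split
separates month-to-month customers from all others, and that 1,167 of
the 2,706 month-to-month customers are the churners (the only reading
consistent with 1,308 churners in the whole training set). The function
names are hypothetical.

```python
def gini(n_yes, n_total):
    """Gini impurity of a two-class node: 2 * p * (1 - p)."""
    p = n_yes / n_total
    return 2.0 * p * (1.0 - p)


def gini_reduction(parent, left, right):
    """Impurity drop from splitting parent into left and right.

    Each argument is a (churners, customers) pair.
    """
    n = parent[1]
    weighted = (left[1] / n) * gini(*left) + (right[1] / n) * gini(*right)
    return gini(*parent) - weighted


root = (1308, 4930)            # churners, customers in the training set
month_to_month = (1167, 2706)  # assumed reading of the node counts
# Everyone else: one-year and two-year contracts (derived by subtraction).
longer_contracts = (root[0] - month_to_month[0], root[1] - month_to_month[1])

print(round(gini(*root), 3))
print(round(gini_reduction(root, month_to_month, longer_contracts), 3))
```

A positive reduction means the two child nodes are purer than the root.
The longer-contract group ends up with very few churners, which is what
makes this split informative.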

This boxplot visualizes "Monthly Charges" for customers who stayed
("No") versus those who left ("Yes"). Customers who churned tend to
have higher monthly charges: their median is slightly above $70,
compared with around $60 for those who did not churn. The spread for
churned customers is narrower, indicating more consistent monthly
charges among this group. This aligns with the decision tree results,
where higher monthly charges were a factor contributing to customer
churn, and suggests that pricing strategies matter for retaining
high-paying customers.

This boxplot shows the "Tenure" (in months) of customers who stayed
("No") compared to those who left ("Yes"). Customers who stayed
generally have longer tenures, with a median close to 40 months, while
customers who left have a median closer to 10 months. This supports the
findings from the decision tree, where shorter tenure was a key factor
for predicting churn: customers who have spent less time with the
company are more likely to leave.

In this step, we evaluate the model's performance by generating a
confusion matrix, which compares the model's predictions with the
actual churn status in the test data to measure how well it
distinguishes between customers who churn ("Yes") and those who do not
("No").
According to the matrix, the model correctly predicted 1,396 instances
of non-churning customers and 262 instances of churning customers.
However, it misclassified 299 customers as not churning when they
actually churned (false negatives), and 156 customers as churning when
they did not (false positives). The model thus catches 262 of the 561
churners in the test set, so missed churners are the larger source of
error and the main area for improvement.

In this step, the overall accuracy of the decision tree model is
calculated to assess its performance. The accuracy is determined by
dividing the number of correctly classified instances (both "Yes" and
"No") by the total number of test cases in the dataset.
The accuracy score obtained is 0.7847, which means the model classifies
78.47% of the test customers correctly. Because about 73% of the
training customers (3,622 of 4,930) did not churn, accuracy should be
read alongside the confusion matrix rather than on its own.
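
The reported accuracy can be checked directly from the four confusion
matrix counts given above. The sketch below is in Python rather than R,
and the function name is hypothetical. It only needs to know which cells
are correct and which are errors.

```python
def accuracy(correct_counts, error_counts):
    """Share of test cases that were classified correctly."""
    correct = sum(correct_counts)
    total = correct + sum(error_counts)
    return correct / total


# Correct cells: 1,396 and 262. Misclassified cells: 156 and 299.
acc = accuracy(correct_counts=(1396, 262), error_counts=(156, 299))
print(f"{acc:.4f}")  # prints 0.7847
```

The counts sum to 2,113 test cases, of which 1,658 are correct, which
reproduces the 0.7847 figure.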

This t-test evaluates the "Contract Tenure Hypothesis" by comparing the
mean tenure of customers who churned ("Yes") against those who did
not churn ("No"). The results show a significant difference between the
two groups, as indicated by a t-value of -34.824 and a p-value less than
2.2e-16. The mean tenure for churned customers is 17.98 months, while
for non-churned customers it is 37.57 months, a difference of about
19.6 months, with a 95% confidence interval for the mean difference
ranging from -20.69 to -18.49. These findings strongly support the
hypothesis that customers with shorter tenure are more likely to churn,
reinforcing the importance of early engagement strategies to retain new
customers.
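
The reported t-test figures can be cross-checked against each other. In
a two-sample t-test, the t-value is the difference in means divided by
its standard error, and the confidence interval is centred on that
difference. The Python sketch below (variable names are hypothetical,
and the report itself used R) recovers the standard error and the
implied critical value from the published numbers.

```python
# Figures as reported in the t-test output.
mean_churned, mean_stayed = 17.98, 37.57
t_value = -34.824
ci_low, ci_high = -20.69, -18.49

diff = mean_churned - mean_stayed          # difference in mean tenure
std_error = diff / t_value                 # since t = diff / standard error
ci_midpoint = (ci_low + ci_high) / 2       # should equal the difference
critical = ((ci_high - ci_low) / 2) / std_error  # half-width / standard error

print(f"difference {diff:.2f}, CI midpoint {ci_midpoint:.2f}")
print(f"standard error {std_error:.3f}, implied critical value {critical:.2f}")
```

The interval midpoint matches the difference in means, and the implied
critical value is close to the 1.96 expected for a 95% interval with a
large sample, so the reported numbers are mutually consistent.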

This Chi-Squared test evaluates the "Additional Services Hypothesis" by
analyzing the relationship between customers having technical support
and their likelihood of churning ("Yes" or "No"). The test results show
a significant association between these factors, with an X-squared
value of 828.2, 2 degrees of freedom, and a highly significant p-value
of less than 2.2e-16. These results indicate that churn is strongly
associated with whether a customer has technical support.
The findings reveal that customers who do not have technical support
are far more likely to churn compared to those who have it, emphasizing
the importance of offering or improving technical support services. This
analysis supports the hypothesis and suggests that providing technical
support may help keep customers and reduce churn, although the test
shows an association rather than a cause.

This Chi-Squared test evaluates the "Billing and Payment Methods
Hypothesis" by analyzing the relationship between customers' payment
methods and their likelihood of churning ("Yes" or "No"). The test
results reveal a significant association between these factors, with an
X-squared value of 648.14, 3 degrees of freedom, and a highly
significant p-value of less than 2.2e-16. These findings demonstrate
that churn differs significantly across payment methods.
The analysis shows that customers using automatic payment methods,
such as Bank Transfer and Credit Card, are far less likely to churn, with
churn rates of 16.7% and 15.2%, respectively. In contrast, customers
using Electronic Check show a much higher churn rate of 45.3%,
making this group the most likely to leave. Customers paying by Mailed
Check also have a lower churn rate of 19.1%.

These findings strongly support the hypothesis and highlight the
importance of promoting automatic payment methods as a strategy to
reduce churn. Additionally, the high churn rate for electronic check
users suggests a need to address issues related to this payment method,
such as user experience or satisfaction, to improve customer retention.
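
For readers who want to see what the X-squared statistic measures, the
following Python sketch computes Pearson's chi-squared statistic and its
degrees of freedom from a table of counts. The function name is
hypothetical and the counts are made up for illustration (100 customers
per payment method, loosely echoing the reported churn rates). They are
not the report's data, so the statistic will not match 648.14.

```python
def chi_squared(table):
    """Pearson chi-squared statistic and degrees of freedom for a count table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected in this cell if rows and columns were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


# Hypothetical (stayed, churned) counts per payment method, NOT the report's data.
toy_counts = [
    [83, 17],  # Bank transfer
    [85, 15],  # Credit card
    [55, 45],  # Electronic check
    [81, 19],  # Mailed check
]
statistic, dof = chi_squared(toy_counts)
print(f"X-squared = {statistic:.2f}, df = {dof}")
```

Four payment methods against two churn outcomes give (4 - 1) x (2 - 1) =
3 degrees of freedom, as in the reported test. By the same formula, the
2 degrees of freedom in the technical support test imply a table with
three support categories.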

Proposal and Recommendations:

Target Month-to-Month Contract Customers: Since these customers have
shown higher churn rates, offer incentives such as discounts or
loyalty programs to encourage them to switch to longer contracts.

Focus on Early Tenure Customers: Customers in the early stages of their
tenure are often at higher risk of churn. Implementing onboarding
programs, personalized support, or early loyalty incentives could
improve retention among new customers.

Improve Service for Fiber Optic Customers: If the analysis shows fiber
optic users are more likely to churn, investigate service quality or
pricing issues and consider offering tailored support or premium features
to improve satisfaction among these customers.

Enhance Support Services: The availability of tech support seems to
correlate with churn likelihood. Investing in improved customer support,
especially for customers who initially decline tech support, could help
reduce churn.
