

The 2nd International Conference on Computing and Data Science (CONF-CDS 2021) IOP Publishing
Journal of Physics: Conference Series 1881 (2021) 032061 doi:10.1088/1742-6596/1881/3/032061

Analysis and Comparison of Forecasting Algorithms for Telecom Customer Churn

Gui’e Jiao1,* and Hong Xu2


1 Shanghai University/Shanghai Jian Qiao University, No. 1111 Hucheng Ring Road, Pudong New District, Shanghai, China
2 Shanghai Ocean University, No. 999 Hucheng Ring Road, Pudong New District, Shanghai, China

*Corresponding author e-mail: [email protected]

Abstract. Ensemble algorithms are highly flexible data analysis and prediction algorithms. In many big data competitions at home and abroad, the winning teams largely adopt ensemble methods such as random forest, GBDT and XGBoost, which shows that ensemble algorithms still hold a clear accuracy advantage in predictive classification. The main task of this article is to predict telecom customer churn. With the telecom market saturated, retaining existing customers is the primary task of every telecom operator. This article compares the predictive performance of four prediction models on a telecom data set; the final performance evaluation indices also show that the random forest and XGBoost models, both built on ensemble ideas, deliver better predictions.

Keywords: Data Mining, Customer Churn, Classification Prediction

1. Introduction
For the predictive analysis of customer classification status, commonly used methods include clustering, association rules [1] and machine learning models [2-7]; this work relies mainly on machine learning models. The data for predicting customer churn was taken from publicly available data sets on the Internet, and private information such as user names has been removed.
In the data preprocessing stage, a simple statistical analysis of the data is performed: viewing the characteristic attributes of the data set and their data types, and plotting the distribution of each attribute split by whether the customer churned, with the two churn classes shown side by side. The .isna().sum() method is used to count the missing values of each attribute and to filter out the attributes that have them; the missing values are then handled according to the specific data situation. The results show that some attributes have missing values. Missing values are generally handled by deletion or filling. The attribute with missing values here is the total annual consumption. Since this attribute involves neither time series nor continuity, filling with the previous or next value is unnecessary; to minimize the impact on the prediction result, the mean value is used instead.
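A minimal sketch of this preprocessing step, assuming pandas; the file name and the 'TotalCharges' column name for total annual consumption are illustrative assumptions, not taken from the paper:

    import pandas as pd

    # Load the telecom churn data set (file name is hypothetical).
    df = pd.read_csv("telecom_churn.csv")

    # Count missing values per attribute, as described above,
    # and keep only the attributes that actually have them.
    missing = df.isna().sum()
    print(missing[missing > 0])

    # The attribute with missing values has no time-series structure,
    # so fill with the column mean to minimize the impact.
    df["TotalCharges"] = df["TotalCharges"].fillna(df["TotalCharges"].mean())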

Fig 1. Attribute loss ratio


The form of the data in the original data set is generally not well suited for mining, so before model prediction the data needs to be normalized. In this data set, many characteristic attribute values exist as strings; to apply the models' prediction algorithms, string-typed data is converted to numeric form. Binary attribute values are mapped to 0 and 1, while ternary attribute values are converted, according to their actual meaning, either into binary attribute values or into one-hot encodings.
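A sketch of these conversions, continuing the pandas example above; the column names 'Churn' (binary) and 'InternetService' (ternary) are assumed for illustration:

    # Binary string attributes are mapped to 0 and 1.
    df["Churn"] = df["Churn"].map({"No": 0, "Yes": 1})

    # Ternary attributes are one-hot encoded (or collapsed to two
    # values when their actual meaning allows it).
    df = pd.get_dummies(df, columns=["InternetService"])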

Fig 2. User usage time

2. Experiment and Results Analysis

This article compares four commonly used predictive analysis models: the logistic regression model, the random forest model, the AdaBoost model and the XGBoost model. The last three are ensemble algorithms. In the model test, the data set is split between training and test sets at a ratio of 7:3.
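The 7:3 split can be sketched with scikit-learn; the feature/label variable names follow the assumed 'Churn' column from the preprocessing sketch:

    from sklearn.model_selection import train_test_split

    X = df.drop(columns=["Churn"])  # feature attributes
    y = df["Churn"]                 # churn label

    # 7:3 split between training and test sets, as stated above.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=42)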

2.1 Experiments
(1) Logistic regression is a commonly used machine learning model, typically applied to data mining, analysis and prediction [8]. For predicting telecom customer churn, the traditional sigmoid hypothesis function is adopted, and the regression prediction model is built on a linear combination of the independent variables passed through that hypothesis function. The processed data is fed to the logistic regression model to obtain the best weight coefficients. The loss function of this model borrows from, and improves on, that of the linear regression model:

$J(\beta) = \frac{1}{m}\sum_{i=1}^{m}\left\{-y^{(i)}\log h(x^{(i)}) - \left(1-y^{(i)}\right)\log\left[1-h(x^{(i)})\right]\right\}$

The gradient descent method is used to find the optimal solution of the model. The model is trained to find the best classification regression coefficients, and the user life cycle is predicted from the characteristic attributes of the user information in the data set. The individual feature attributes matter greatly when training these weight coefficients and affect the quality of the final prediction results.
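A minimal NumPy sketch of this model: the sigmoid hypothesis, the cross-entropy loss J(β) above, and batch gradient descent. The learning rate and iteration count are illustrative choices, not the paper's settings:

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def fit_logistic(X, y, lr=0.1, n_iter=1000):
        """Batch gradient descent on the cross-entropy loss J(beta)."""
        X = np.hstack([np.ones((X.shape[0], 1)), X])  # prepend bias column
        m, n = X.shape
        beta = np.zeros(n)
        for _ in range(n_iter):
            h = sigmoid(X @ beta)        # hypothesis h(x)
            grad = X.T @ (h - y) / m     # gradient of J(beta)
            beta -= lr * grad            # descent step
        return beta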

Random forest, XGBoost and AdaBoost are all ensemble learning algorithms whose main idea is to combine weak learners into a final strong learner. Ensemble learning divides into two schools. In the first, there is no correlation between the weak learners, which are trained independently; random forest is a typical algorithm of this kind. In the other school, the weak learners influence one another and are built sequentially; typical algorithms are AdaBoost and XGBoost.
(2) Random forest model: training a random forest builds multiple decision trees and merges them to obtain a more accurate and stable model; the final output is determined jointly by many decision trees. In a classification model, the final classification results are produced by voting among these decision trees. Every decision tree making up the random forest matters: each tree selects and splits on the characteristic attributes of the data. In random forest, the ranking of feature attributes is determined by information entropy, an index that measures the uncertainty of an attribute. In this telecom customer churn prediction, the mutual influence of multiple feature attributes must be considered to determine the final prediction.
When predicting telecom customer churn, the random forest combines 1000 decision trees; the minimum number of samples required to split a node is 3, the minimum number of samples per leaf node is 3, and the maximum number of feature attributes is determined internally by the algorithm. One advantage of the random forest model is that it can output the importance ranking of the feature attributes; the ranking for this data set is shown in Figure 3.

Fig 3. Random forest algorithm's ranking of feature attributes
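A sketch of this configuration, assuming scikit-learn; the maximum number of features is left to the library default, matching the text's statement that it is determined internally:

    from sklearn.ensemble import RandomForestClassifier

    # 1000 trees, minimum 3 samples to split a node, minimum 3 samples
    # per leaf node; max_features is left to the algorithm's default.
    rf = RandomForestClassifier(n_estimators=1000,
                                min_samples_split=3,
                                min_samples_leaf=3,
                                random_state=42)
    rf.fit(X_train, y_train)

    # The importance ranking of the feature attributes (Figure 3).
    for name, score in sorted(zip(X.columns, rf.feature_importances_),
                              key=lambda t: t[1], reverse=True):
        print(f"{name}: {score:.4f}")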


The advantages of the random forest algorithm are that training can be highly parallelized, giving it a speed advantage on the large samples found in big data, and that decision tree nodes can split on randomly selected features, so samples can still be trained effectively when the feature dimension is high. It can report the importance of each feature to the output, and the randomly sampled training model has small variance and strong generalization ability. It is relatively simple to implement and not very sensitive to missing values, although it can still lead to overfitting.
(3) The principle of the XGBoost algorithm is to keep adding trees on top of the decision tree. As with random forests, the weak learners are decision trees. The difference is that the algorithm adds one tree at a time: in going from n-1 trees to n trees, the accuracy of the algorithm is continuously improved [9].
The objective function of XGBoost is

$L_t = \sum_{i=1}^{m} L\big(y_i,\, f_{t-1}(x_i) + h_t(x_i)\big) + \gamma J + \frac{\lambda}{2}\sum_{j=1}^{J} \omega_{tj}^2$,

which is mainly composed of a loss function and a regularization term. In the function, J is the number of leaf nodes and $\omega_{tj}$ is the optimal value of the j-th leaf node. I is the input training sample set, T is the maximum number of iterations, and L is the loss function, which indicates how well the model fits the data; the regularization coefficients $\lambda$ and $\gamma$ control the complexity of the model. The output of the objective function is the strong learner f(x). In the XGBoost model, the loss function is processed with Taylor's formula. When the node loss is calculated in the current situation, the relevant split-gain formula is

$\text{score} = \max\left(\text{score},\ \frac{1}{2}\frac{G_L^2}{H_L+\lambda} + \frac{1}{2}\frac{G_R^2}{H_R+\lambda} - \frac{1}{2}\frac{(G_L+G_R)^2}{H_L+H_R+\lambda} - \gamma\right)$,

where $\gamma$ is the threshold for the split loss: a node is split only when the gain is greater than $\gamma$, which plays a pruning role.
The XGBoost algorithm is very flexible and has achieved good results in many data mining tasks. Like random forest in ensemble learning, it sorts and outputs the importance of the data set's attributes; Figure 4 shows the importance ranking of each attribute in the XGBoost model.

Fig 4. Feature importance values in the XGBoost model
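A sketch assuming the xgboost Python package; the hyperparameter values are illustrative, with gamma and reg_lambda corresponding to the $\gamma$ and $\lambda$ terms in the objective above:

    from xgboost import XGBClassifier

    # gamma and reg_lambda map to the gamma and lambda terms in the
    # objective function; the values shown are illustrative only.
    xgb = XGBClassifier(n_estimators=200, learning_rate=0.1,
                        gamma=0.1, reg_lambda=1.0,
                        eval_metric="logloss")
    xgb.fit(X_train, y_train)

    # Like random forest, XGBoost exposes the attribute importance
    # ranking shown in Figure 4.
    print(xgb.feature_importances_)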


(4) AdaBoost (Adaptive Boosting) belongs to the Boosting family of algorithms, in which the weak learners are strongly dependent on one another. AdaBoost is an iterative algorithm: each round increases the weights of samples that were classified incorrectly in the previous round and reduces the weights of samples that were classified correctly. The final strong classifier is assembled as a weighted linear sum of the weak learners: a weak learner with a low error rate receives a larger weight, and one with a higher error rate receives a smaller weight. In the end, the strong learner is composed of the weak learners, whose weights are determined by the proportion of their samples classified correctly, where the correct rate is computed with respect to the data sample weights; the higher the correct rate, the higher the weight. To obtain good weight coefficients, it is best for a weak learner to correctly classify the heavily weighted samples that were misclassified in the previous round, so that it earns a higher weight during classification training.
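A sketch of this model assuming scikit-learn; the weak learners default to decision stumps, and n_estimators is an illustrative choice:

    from sklearn.ensemble import AdaBoostClassifier

    # Each round re-weights the samples: misclassified samples gain
    # weight, correctly classified samples lose weight, and the final
    # strong learner is a weighted linear sum of the weak learners.
    ada = AdaBoostClassifier(n_estimators=200, random_state=42)
    ada.fit(X_train, y_train)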

2.2 Results and Analysis

In the algorithm debugging stage, grid search and cross-validation are mainly used: grid search selects the best combination of parameters, and cross-validation checks how well the trained model fits. When training a model, debugging is an extremely important step, since the best parameters bring the model to its best state. In the debugging for this paper, 3-fold and 5-fold cross-validation are compared; in general, 3-fold cross-validation gives better results here, and the prediction accuracy of the models has also improved. The following table shows the final performance of each model, where f1 is the evaluation index, and a sketch of the tuning procedure follows the table.
Table 1. Comparison of the four models

Model                       Training set f1   Test set f1   Cross-validation f1
Logistic regression model   58.45%            56.09%        57.37%
Random forest model         67.41%            57.17%        57.22%
XGBoost model               73.01%            55.78%        56.69%
AdaBoost model              99.39%            52.77%        55.11%
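The tuning described above can be sketched with scikit-learn's grid search; the parameter grid is illustrative, not the paper's:

    from sklearn.model_selection import GridSearchCV

    # Grid search over an illustrative parameter grid, with 3-fold
    # cross-validation and f1 as the evaluation index.
    param_grid = {"n_estimators": [500, 1000],
                  "min_samples_leaf": [1, 3, 5]}
    search = GridSearchCV(RandomForestClassifier(random_state=42),
                          param_grid, cv=3, scoring="f1")
    search.fit(X_train, y_train)
    print(search.best_params_, search.best_score_)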
The most commonly used evaluation tool for machine learning algorithms is the confusion matrix, which has four first-level indicators: TP is the number of samples predicted positive that are actually positive; FP is the number predicted positive that are actually negative; FN is the number predicted negative that are actually positive; and TN is the number predicted negative that are actually negative. The second-level indicator f1 used above is the harmonic mean of precision and recall, f1 = 2 · Precision · Recall / (Precision + Recall).
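A sketch of computing these indicators with scikit-learn, using the random forest model trained above:

    from sklearn.metrics import confusion_matrix, f1_score

    y_pred = rf.predict(X_test)
    # Rows are actual classes, columns are predicted classes:
    # [[TN, FP], [FN, TP]] for a binary problem.
    print(confusion_matrix(y_test, y_pred))
    print(f1_score(y_test, y_pred))  # harmonic mean of precision and recall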
The analysis of results in this experiment mainly uses the ROC curve and the AUC area. In the coordinate system, the abscissa and ordinate are both derived from the confusion matrix: the abscissa is the false positive rate, $FPR = FP/(FP+TN)$, and the ordinate is the true positive rate, $TPR = TP/(TP+FN)$. AUC, the area under the ROC curve, is mainly used to measure the generalization ability of the model; as a single value, it is better suited to comparative analysis than the curve itself. The criteria for judging model performance by AUC are: when AUC = 1, the classifier is perfect and achieves ideal prediction, although such a model generally does not exist; if AUC < 0.5, the model's predictions are very poor, and such a model would need to predict in reverse; models in general have AUC values between 0.5 and 1, and the larger the value, the better the performance [10].
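A sketch of plotting the ROC curve and computing AUC with scikit-learn, again using the random forest model from above:

    from sklearn.metrics import roc_curve, roc_auc_score
    import matplotlib.pyplot as plt

    proba = rf.predict_proba(X_test)[:, 1]   # predicted churn probability
    fpr, tpr, _ = roc_curve(y_test, proba)   # FPR on x-axis, TPR on y-axis
    print("AUC:", roc_auc_score(y_test, proba))

    plt.plot(fpr, tpr)
    plt.xlabel("False positive rate")
    plt.ylabel("True positive rate")
    plt.show()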
The results of the four algorithms are compared as follows:

Fig 5. Comparison of the results of the four models


As can be seen from Figure 5, the random forest and XGBoost models, built on ensemble learning ideas, give better prediction performance, while the logistic regression model, based on a linear model, gives the least satisfactory classification predictions on this data set.

3. Conclusion
A comprehensive comparison of four machine learning prediction algorithms, the logistic regression algorithm and three ensemble learning algorithms, found that the random forest, XGBoost and AdaBoost algorithms are more effective, although for many modeling scholars they behave like black boxes: the internal operation of a random forest is opaque. The models used here are relatively basic, with only simple parameter adjustments; future research can improve the loss function and the objective function. In addition, computing is now developing toward artificial intelligence, and data mining is gradually moving into the field of deep learning. Neural networks are a very successful deep learning technology, so the next step is to consider using deep learning and neural networks to process the data set [11-12].

References
[1] Wang, Xinyan and Jiao, Guie. Research on Association Rules of Course Grades Based on Parallel FP-Growth Algorithm. 1 Jan. 2020: 759-769.
[2] Tianqi Chen and Carlos Guestrin. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '16), August 2016, pp. 785-794.
[3] S. A. Qureshi, A. S. Rehman, A. M. Qamar, A. Kamal and A. Rehman, "Telecommunication subscribers' churn prediction model using machine learning," Eighth International Conference on Digital Information Management (ICDIM 2013), Islamabad, 2013, pp. 131-136, doi: 10.1109/ICDIM.2013.6693977.
[4] TELE-INFO'06: Proceedings of the 5th WSEAS International Conference on Telecommunications and Informatics, May 2006, pp. 281-286.
[5] So Young Sohn, Jae Kang Lee. Competing risk model for mobile phone service. Technological Forecasting & Social Change, 2008, 75(9).
[6] Proceedings of the 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE 2017), 2018-January, pp. 1-6.
[7] Scott A. Neslin, Sunil Gupta, Wagner Kamakura, Junxiang Lu, Charlotte H. Mason. Defection Detection: Measuring and Understanding the Predictive Accuracy of Customer Churn Models. Journal of Marketing Research, 2006, 43(2).
[8] Gopal R.K., Meher S.K. (2008) Customer Churn Time Prediction in Mobile Telecommunication Industry Using Ordinal Regression. In: Washio T., Suzuki E., Ting K.M., Inokuchi A. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2008. Lecture Notes in Computer Science, vol 5012. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68125-0_88
[9] Abdelrahim Kasem Ahmad, Assef Jafar, Kadan Aljoumaa. Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data, 2019, 6(1).
[10] Ascarza, Eva. Retention Futility: Targeting High-Risk Customers Might Be Ineffective (January 6, 2018). Columbia Business School Research Paper No. 16-28. Available at SSRN: https://ssrn.com/abstract=2759170 or https://dx.doi.org/10.2139/ssrn.2759170.
[11] Guolin Ke, Zhenhui Xu, Jia Zhang, Jiang Bian, and Tie-Yan Liu. 2019. DeepGBM: A Deep Learning Framework Distilled by GBDT for Online Prediction Tasks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '19). Association for Computing Machinery, New York, NY, USA, 384-394. DOI: https://doi.org/10.1145/3292500.3330858
[12] J. Hu et al., "pRNN: A Recurrent Neural Network based Approach for Customer Churn Prediction in Telecommunication Sector," 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 2018, pp. 4081-4085, doi: 10.1109/BigData.2018.8622094.
