Predicting Churn
Predicting Churn
https://www.gainsight.com/your-success/what-is-customer-
churn-analysis/
What are its benefits
• Converts structured and unstructured data/information into
meaningful insights
• Utilizes these insights to predict customers who are likely to churn
• Identifies the causes for churn and works to resolve those issues
• Engages with customers to foster relationships
• Implements effective programs for customer retention
https://www.gainsight.com/your-success/what-is-customer-
churn-analysis/
Cross-industry
standard process
for data mining
2. Data Understanding
1. Selecting the Data: Customer Data.xlsx
2. Check the Headline (Name of Attribute). If there are no headers, remove
the annotation, telling RM that the data starts directly in the first row. All
attribute will get generic names such as att1, att2, etc.
3. Checking Data Types. Eg., PostalCode type as integer is ok but do we want
RM to perform mathematical operation on PostalCode? So polynominal is
better.
4. Data Exploration using the Statistics View.
Cross-industry
standard process
for data mining
Data Preparation
• Issues found:
• Missing values: ChurnDate contains a lot of missing values. Age and Gender
too.
• Range: Customer should between 16 to 100.
• Gender: With have four!
• Irrelevant attributes: Which one?
• ID Attributes: Can confuse the algorithm.. Remove them.
• No Label: Make RM ignore rowNumber attribute by assigning either the
predefined id role or a custom role. Use Set Role operator.
3. Modeling using Machine Learning
Machine learning is an application of artificial intelligence (AI) that
provides systems the ability to automatically learn and improve from
experience without being explicitly programmed.
Wikipedia