0% found this document useful (0 votes)

174 views37 pages

Customer Segmentation in Python Chapter4

The document discusses different methods for customer segmentation using k-means clustering in Python. It covers steps like data preprocessing, choosing the number of clusters using the elbow method or silhouette scores, running k-means clustering, and analyzing the results by looking at average values for each cluster. The document also provides examples of profiling the customer segments by creating summaries, snake plots to compare attributes, and calculating relative importance of attributes compared to the overall population.

Uploaded by

Fgpeqw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

174 views37 pages

Customer Segmentation in Python Chapter4

Uploaded by

Fgpeqw

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Practical implementation of
k-means clustering

Karolis Urbonas
Head of Data Science, Amazon
DataCamp Customer Segmentation in Python

Key steps
Data pre-processing
Choosing a number of clusters
Running k-means clustering on pre-processed data
Analyzing average RFM values of each cluster
DataCamp Customer Segmentation in Python

Data pre-processing

We've completed the pre-processing steps and have these two objects:

datamart_rfm

datamart_normalized

Code from previous lesson:

import numpy as np
datamart_log = np.log(datamart_rfm)

from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
scaler.fit(datamart_log)

datamart_normalized = scaler.transform(datamart_log)
DataCamp Customer Segmentation in Python

Methods to define the number of clusters

Visual methods - elbow criterion
Mathematical methods - silhouette coefficient
Experimentation and interpretation
DataCamp Customer Segmentation in Python

Running k-means

Import KMeans from sklearn library and initialize it as kmeans

from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=2, random_state=1)

Compute k-means clustering on pre-processed data

kmeans.fit(datamart_normalized)

Extract cluster labels from labels_ attribute

cluster_labels = kmeans.labels_
DataCamp Customer Segmentation in Python

Analyzing average RFM values of each cluster

Create a cluster label column in the original DataFrame:

datamart_rfm_k2 = datamart_rfm.assign(Cluster = cluster_labels)

Calculate average RFM values and size for each cluster:

datamart_rfm_k2.groupby(['Cluster']).agg({
'Recency': 'mean',
'Frequency': 'mean',
'MonetaryValue': ['mean', 'count'],
}).round(0)
DataCamp Customer Segmentation in Python

Analyzing average RFM values of each cluster

The result of a simple 2-cluster solution:

DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Let's practice running k-

means clustering!
DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Choosing number of
clusters

Karolis Urbonas
Head of Data Science, Amazon
DataCamp Customer Segmentation in Python

Methods
Visual methods - elbow criterion
Mathematical methods - silhouette coefficient
Experimentation and interpretation
DataCamp Customer Segmentation in Python

Elbow criterion method

Plot the number of clusters against within-cluster sum-of-squared-errors (SSE) -
sum of squared distances from every data point to their cluster center
Identify an "elbow" in the plot
Elbow - a point representing an "optimal" number of clusters
DataCamp Customer Segmentation in Python

Elbow criterion method

# Import key libraries
from sklearn.cluster import KMeans
import seaborn as sns
from matplotlib import pyplot as plt

# Fit KMeans and calculate SSE for each k

sse = {}
for k in range(1, 11):
kmeans = KMeans(n_clusters=k, random_state=1)
kmeans.fit(data_normalized)
sse[k] = kmeans.inertia_ # sum of squared distances to closest cluster cente

# Plot SSE for each k

plt.title('The Elbow Method')
plt.xlabel('k'); plt.ylabel('SSE')
sns.pointplot(x=list(sse.keys()), y=list(sse.values()))
plt.show()
DataCamp Customer Segmentation in Python

Elbow criterion method

The elbow criterion chart:

DataCamp Customer Segmentation in Python

Elbow criterion method

The elbow criterion chart:

DataCamp Customer Segmentation in Python

Using elbow criterion method

Best to choose the point on elbow, or the next point
Use as a guide but test multiple solutions
Elbow plot built on datamart_rfm
DataCamp Customer Segmentation in Python

Experimental approach - analyze segments

Build clustering at and around elbow solution
Analyze their properties - average RFM values
Compare against each other and choose one which makes most business sense
DataCamp Customer Segmentation in Python

Experimental approach - analyze segments

Previous 2-cluster solution

3-cluster solution on the same normalized RFM dataset

DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Let's practice finding the

optimal number of clusters!
DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Profile and interpret

segments

Karolis Urbonas
Head of Data Science, Amazon
DataCamp Customer Segmentation in Python

Approaches to build customer personas

Summary statistics for each cluster e.g. average RFM values
Snake plots (from market research
Relative importance of cluster attributes compared to population
DataCamp Customer Segmentation in Python

Summary statistics of each cluster

Run k-means segmentation for several k values around the recommended value.

Create a cluster label column in the original DataFrame:

datamart_rfm_k2 = datamart_rfm.assign(Cluster = cluster_labels)

Calculate average RFM values and sizes for each cluster:

datamart_rfm_k2.groupby(['Cluster']).agg({
'Recency': 'mean',
'Frequency': 'mean',
'MonetaryValue': ['mean', 'count'],
}).round(0)

Repeat the same for k=3

DataCamp Customer Segmentation in Python

Summary statistics of each cluster

Compare average RFM values of each clustering solution
DataCamp Customer Segmentation in Python

Snake plots to understand and compare segments

Market research technique to compare different segments
Visual representation of each segment's attributes
Need to first normalize data (center & scale)
Plot each cluster's average normalized values of each attribute
DataCamp Customer Segmentation in Python

Prepare data for a snake plot

Transform datamart_normalized as DataFrame and add a Cluster column

datamart_normalized = pd.DataFrame(datamart_normalized,
index=datamart_rfm.index,
columns=datamart_rfm.columns)
datamart_normalized['Cluster'] = datamart_rfm_k3['Cluster']

Melt the data into a long format so RFM values and metric names are stored in 1
column each

datamart_melt = pd.melt(datamart_normalized.reset_index(),
id_vars=['CustomerID', 'Cluster'],
value_vars=['Recency', 'Frequency', 'MonetaryValue'],
var_name='Attribute',
value_name='Value')
DataCamp Customer Segmentation in Python

Visualize a snake plot

plt.title('Snake plot of standardized variables')
sns.lineplot(x="Attribute", y="Value", hue='Cluster', data=datamart_melt)
DataCamp Customer Segmentation in Python

Relative importance of segment attributes

Useful technique to identify relative importance of each segment's attribute
Calculate average values of each cluster
Calculate average values of population
Calculate importance score by dividing them and subtracting 1 (ensures 0 is
returned when cluster average equals population average)
cluster_avg = datamart_rfm_k3.groupby(['Cluster']).mean()

population_avg = datamart_rfm.mean()

relative_imp = cluster_avg / population_avg - 1

DataCamp Customer Segmentation in Python

Analyze and plot relative importance

The further a ratio is from 0, the more important that attribute is for a segment
relative to the total population.
relative_imp.round(2)

Recency Frequency MonetaryValue

Cluster
0 -0.82 1.68 1.83
1 0.84 -0.84 -0.86
2 -0.15 -0.34 -0.42

Plot a heatmap for easier interpretation:

plt.figure(figsize=(8, 2))
plt.title('Relative importance of attributes')
sns.heatmap(data=relative_imp, annot=True, fmt='.2f', cmap='RdYlGn')
plt.show()
DataCamp Customer Segmentation in Python

Relative importance heatmap

Heatmap plot:

vs. printed output:

Recency Frequency MonetaryValue

Cluster
0 -0.82 1.68 1.83
1 0.84 -0.84 -0.86
2 -0.15 -0.34 -0.42
DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Your time to experiment

with different customer
profiling techniques!
DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Implement end-to-end
segmentation solution

Karolis Urbonas
Head of Data Science, Amazon
DataCamp Customer Segmentation in Python

Key steps of the segmentation project

Gather data - updated data with an additional variable
Pre-process the data
Explore the data and decide on the number of clusters
Run k-means clustering
Analyze and visualize results
DataCamp Customer Segmentation in Python

Updated RFM data

Same RFM values plus additional Tenure variable

Tenure - time since the first transaction

Defines how long the customer has been with the company
DataCamp Customer Segmentation in Python

Goals for this project

Remember key pre-processing rules
Apply data exploration techniques
Practice running several k-means iterations
Analyze results quantitatively and visually
DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Let's dig in!

DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Final thoughts

Karolis Urbonas
Head of Data Science, Amazon
DataCamp Customer Segmentation in Python

What you have learned

Cohort analysis and visualization
RFM segmentation
Data pre-processing for k-means
Customer segmentation with k-means
Evaluating number of clusters
Reviewing and visualizing segmentation solutions
DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Congratulations!

Low Code AIML USL Project CreditCardCustomerSegmentation Vijay Borade Aug23
67% (3)
Low Code AIML USL Project CreditCardCustomerSegmentation Vijay Borade Aug23
66 pages
Segmentation Analysis
No ratings yet
Segmentation Analysis
17 pages
Credit Risk Modeling in Python Chapter3
No ratings yet
Credit Risk Modeling in Python Chapter3
35 pages
Customer Segmentation in Python Chapter2
No ratings yet
Customer Segmentation in Python Chapter2
33 pages
Customer Segmentation Using RFM Analysis: Overview
No ratings yet
Customer Segmentation Using RFM Analysis: Overview
11 pages
Introduction To Data Visualization With Seaborn Chapter3
100% (1)
Introduction To Data Visualization With Seaborn Chapter3
32 pages
Designing Machine Learning Workflows in Python Chapter2
No ratings yet
Designing Machine Learning Workflows in Python Chapter2
39 pages
Customer Segmentation in Python Chapter3
No ratings yet
Customer Segmentation in Python Chapter3
25 pages
Day 4
No ratings yet
Day 4
62 pages
Chapter1 PDF
No ratings yet
Chapter1 PDF
37 pages
Chapter 2
No ratings yet
Chapter 2
33 pages
PDF Custome Segmentation
No ratings yet
PDF Custome Segmentation
18 pages
Customer Segmentation Report
No ratings yet
Customer Segmentation Report
8 pages
RFM How To Automatically Segment Customers Using Purchase Data and A Few Lines of Python
No ratings yet
RFM How To Automatically Segment Customers Using Purchase Data and A Few Lines of Python
8 pages
K-Means Clustering For Customer Segmentation - A Practical Example - Kimberly Coffey, PH.D - PDF
100% (2)
K-Means Clustering For Customer Segmentation - A Practical Example - Kimberly Coffey, PH.D - PDF
41 pages
Customer Segmentation in Python
No ratings yet
Customer Segmentation in Python
71 pages
Lab 11 - HT
No ratings yet
Lab 11 - HT
4 pages
DAB 303 Project 2
No ratings yet
DAB 303 Project 2
12 pages
DWDM PPT
No ratings yet
DWDM PPT
13 pages
Suwarti - Final Project
No ratings yet
Suwarti - Final Project
20 pages
Phase 2
No ratings yet
Phase 2
5 pages
Ads Phase 4
No ratings yet
Ads Phase 4
12 pages
Customer Segmentation IEEE Report
No ratings yet
Customer Segmentation IEEE Report
2 pages
A Beginner's Guide To Customer Segmentation With Python - by Sigli Mumuni - Medium
No ratings yet
A Beginner's Guide To Customer Segmentation With Python - by Sigli Mumuni - Medium
14 pages
Customer Segmentation With K-Means and RMF
No ratings yet
Customer Segmentation With K-Means and RMF
13 pages
Customer Segmentation Using K
No ratings yet
Customer Segmentation Using K
16 pages
ML0101EN Clus K Means Customer Seg Py v1
100% (1)
ML0101EN Clus K Means Customer Seg Py v1
8 pages
Chapter 1,2 Report
No ratings yet
Chapter 1,2 Report
5 pages
RFM Model For Customer Purchase Behaviour Using K-Means Algorithm
No ratings yet
RFM Model For Customer Purchase Behaviour Using K-Means Algorithm
55 pages
Chapter 5 CLUSTERING
No ratings yet
Chapter 5 CLUSTERING
36 pages
DWDM Report
No ratings yet
DWDM Report
6 pages
Machine Learning and Business Analytics Surprize Quiz
No ratings yet
Machine Learning and Business Analytics Surprize Quiz
5 pages
Energy Consumption Prediction System
No ratings yet
Energy Consumption Prediction System
21 pages
Full Customer Segmentation
No ratings yet
Full Customer Segmentation
11 pages
Customer Segmentation
No ratings yet
Customer Segmentation
9 pages
Updated Thesis
No ratings yet
Updated Thesis
29 pages
Lab 8-DA
No ratings yet
Lab 8-DA
1 page
Tasks For Students
No ratings yet
Tasks For Students
4 pages
Customer Segmentation Using Machine Learning
100% (1)
Customer Segmentation Using Machine Learning
28 pages
Customer Segmentation New
No ratings yet
Customer Segmentation New
11 pages
ML Assignment 4
No ratings yet
ML Assignment 4
6 pages
Another Project-Creating Customer Segments
No ratings yet
Another Project-Creating Customer Segments
31 pages
Unsupervised Machine Learning (Customer Segmentation) Online Retail
No ratings yet
Unsupervised Machine Learning (Customer Segmentation) Online Retail
43 pages
Factor Analysis - Segmentation New
No ratings yet
Factor Analysis - Segmentation New
142 pages
Customer Segemntation
No ratings yet
Customer Segemntation
26 pages
Universitas Yapis Papua (2024) - Segmentasi Pelanggan Dan RFM
No ratings yet
Universitas Yapis Papua (2024) - Segmentasi Pelanggan Dan RFM
12 pages
Lecture - 7 - Practical - DBSCAN Clustering in Python
No ratings yet
Lecture - 7 - Practical - DBSCAN Clustering in Python
3 pages
Customer Segmentation E-Commerce
No ratings yet
Customer Segmentation E-Commerce
22 pages
VL2024250504566 Ast03
No ratings yet
VL2024250504566 Ast03
2 pages
Updated Thesis
No ratings yet
Updated Thesis
28 pages
Ads Phase 5
No ratings yet
Ads Phase 5
23 pages
Data Analysis and Data Science Task - 3
No ratings yet
Data Analysis and Data Science Task - 3
3 pages
Tasks For Students-1
No ratings yet
Tasks For Students-1
3 pages
IEEE Conference Template
No ratings yet
IEEE Conference Template
5 pages
Customer Segmentation
No ratings yet
Customer Segmentation
15 pages
BT 4065 Report
No ratings yet
BT 4065 Report
32 pages
Exp 8ml
No ratings yet
Exp 8ml
5 pages
IEEE Conference Template
No ratings yet
IEEE Conference Template
5 pages
Name: Aditya Parade Roll No: 281047 PRN: 22311577 Batch: A-2 Assignment 5
No ratings yet
Name: Aditya Parade Roll No: 281047 PRN: 22311577 Batch: A-2 Assignment 5
3 pages
Workshop Project Report
No ratings yet
Workshop Project Report
10 pages
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING: CLUSTER ANALYSIS and kNN CLASSIFIERS. Examples with MATLAB
César Pérez López
No ratings yet
Python Beyond Limits: Python, #3
From Everand
Python Beyond Limits: Python, #3
AnwaarX
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
Spoken Language Processing in Python Chapter3
No ratings yet
Spoken Language Processing in Python Chapter3
26 pages
Spoken Language Processing in Python Chapter4
No ratings yet
Spoken Language Processing in Python Chapter4
46 pages
Spoken Language Processing in Python Chapter2
No ratings yet
Spoken Language Processing in Python Chapter2
23 pages
Preparing Your Gures To Share With Others: Ariel Rokem
No ratings yet
Preparing Your Gures To Share With Others: Ariel Rokem
35 pages
Spoken Language Processing in Python Chapter1
No ratings yet
Spoken Language Processing in Python Chapter1
17 pages
Chapter3 PDF
No ratings yet
Chapter3 PDF
36 pages
Introduction To Data Visualization With Matplotlib: Ariel Rokem
No ratings yet
Introduction To Data Visualization With Matplotlib: Ariel Rokem
30 pages
Introduction To Data Visualization With Matplotlib Chapter2
No ratings yet
Introduction To Data Visualization With Matplotlib Chapter2
27 pages
Changing Plot Style and Color: Erin Case
No ratings yet
Changing Plot Style and Color: Erin Case
54 pages
Introduction To Data Visualization With Seaborn Chapter2
No ratings yet
Introduction To Data Visualization With Seaborn Chapter2
38 pages
Designing Machine Learning Workflows in Python Chapter4
No ratings yet
Designing Machine Learning Workflows in Python Chapter4
38 pages
Designing Machine Learning Workflows in Python Chapter1
No ratings yet
Designing Machine Learning Workflows in Python Chapter1
32 pages
Introduction To Data Visualization With Seaborn Chapter1
No ratings yet
Introduction To Data Visualization With Seaborn Chapter1
26 pages
Designing Machine Learning Workflows in Python Chapter3
No ratings yet
Designing Machine Learning Workflows in Python Chapter3
42 pages
Cleaning Data With PySpark Chapter4
No ratings yet
Cleaning Data With PySpark Chapter4
23 pages
Customer Segmentation in Python Chapter4
No ratings yet
Customer Segmentation in Python Chapter4
37 pages
Analyzing IoT Data in Python Chapter4
No ratings yet
Analyzing IoT Data in Python Chapter4
34 pages
Credit Risk Modeling in Python Chapter4
100% (1)
Credit Risk Modeling in Python Chapter4
35 pages
Cleaning Data With PySpark Chapter3
No ratings yet
Cleaning Data With PySpark Chapter3
25 pages
Cleaning Data With PySpark Chapter2
100% (1)
Cleaning Data With PySpark Chapter2
25 pages
Cleaning Data With PySpark Chapter1
0% (1)
Cleaning Data With PySpark Chapter1
20 pages
Building Chatbots in Python Chapter4
No ratings yet
Building Chatbots in Python Chapter4
20 pages
Analyzing IoT Data in Python Chapter3
No ratings yet
Analyzing IoT Data in Python Chapter3
30 pages
Building Chatbots in Python Chapter2 PDF
No ratings yet
Building Chatbots in Python Chapter2 PDF
41 pages
Analyzing IoT Data in Python Chapter1
100% (1)
Analyzing IoT Data in Python Chapter1
27 pages
Analyzing IoT Data in Python Chapter2
No ratings yet
Analyzing IoT Data in Python Chapter2
35 pages
Emerging Methods For Genome-Scale Metabolic Modeling of Microbial Communities
No ratings yet
Emerging Methods For Genome-Scale Metabolic Modeling of Microbial Communities
16 pages
Signed Off Statistics and Probability11 q2 m3 Random Sampling and Sampling Distribution v3
No ratings yet
Signed Off Statistics and Probability11 q2 m3 Random Sampling and Sampling Distribution v3
64 pages
SOP For Handling of Market Complaints in Pharmaceuticals - Pharmaceutical Guidelines
100% (1)
SOP For Handling of Market Complaints in Pharmaceuticals - Pharmaceutical Guidelines
6 pages
Diagnostic Exam - Prac Research 2 q1-m1-m2
No ratings yet
Diagnostic Exam - Prac Research 2 q1-m1-m2
4 pages
Q2 Module 2
No ratings yet
Q2 Module 2
3 pages
Roadside Video Data Analysis Deep Learning 1st Edition Brijesh Verma - Download The Ebook Now and Own The Full Detailed Content
No ratings yet
Roadside Video Data Analysis Deep Learning 1st Edition Brijesh Verma - Download The Ebook Now and Own The Full Detailed Content
58 pages
PR Final
No ratings yet
PR Final
59 pages
Reviewer
No ratings yet
Reviewer
68 pages
Practical Research 1 2111
No ratings yet
Practical Research 1 2111
3 pages
Marketing and Product Promotion Report
0% (1)
Marketing and Product Promotion Report
3 pages
Proposal 1
100% (1)
Proposal 1
19 pages
Dissertation Mci
100% (2)
Dissertation Mci
8 pages
MSA Type II - Gage Repeatability & Reproducibility Linearity and Stability
No ratings yet
MSA Type II - Gage Repeatability & Reproducibility Linearity and Stability
1 page
Enger-Ross - Concepts in Biology 10e HQ
100% (1)
Enger-Ross - Concepts in Biology 10e HQ
516 pages
Sip Report Sagar Malik
No ratings yet
Sip Report Sagar Malik
72 pages
Sample Presentation of RESEARCH TITLE During TITLE DEFENSE
No ratings yet
Sample Presentation of RESEARCH TITLE During TITLE DEFENSE
7 pages
Media Literacy in Support of Critical Thinking
No ratings yet
Media Literacy in Support of Critical Thinking
6 pages
Development - and - Validation - of - SSES ARTYKUŁ 1991 Heatheron & Polivy
No ratings yet
Development - and - Validation - of - SSES ARTYKUŁ 1991 Heatheron & Polivy
16 pages
Intro To Business Psycchology
No ratings yet
Intro To Business Psycchology
16 pages
Relationship Between Hotels Website Quality and Consumers Booking Intentions With Internet Experience As Moderator
No ratings yet
Relationship Between Hotels Website Quality and Consumers Booking Intentions With Internet Experience As Moderator
22 pages
Teacher Self-Efficacy Profiles-Determinants, Outcomes, and Generalizability Across Teaching Level
No ratings yet
Teacher Self-Efficacy Profiles-Determinants, Outcomes, and Generalizability Across Teaching Level
18 pages
Mfa Thesis Format
100% (2)
Mfa Thesis Format
6 pages
Brenda Putri BR Sinuhaji - Review Jurnal
No ratings yet
Brenda Putri BR Sinuhaji - Review Jurnal
12 pages
HM Literature Review
100% (1)
HM Literature Review
4 pages
Collaboration Roundtable Partnership Toolkit
No ratings yet
Collaboration Roundtable Partnership Toolkit
134 pages
Lab 1 Introduction To Data
No ratings yet
Lab 1 Introduction To Data
11 pages
Anaphora Resolution PDF
No ratings yet
Anaphora Resolution PDF
63 pages
Action Plan For Student Activity Coordinator
No ratings yet
Action Plan For Student Activity Coordinator
2 pages
Assignment 2
No ratings yet
Assignment 2
4 pages
The 7 Biggest Problems Facing Science
No ratings yet
The 7 Biggest Problems Facing Science
35 pages

Customer Segmentation in Python Chapter4

Uploaded by

Customer Segmentation in Python Chapter4

Uploaded by

DataCamp Customer Segmentation in Python

CUSTOMER SEGMENTATION IN PYTHON

Code from previous lesson:

from sklearn.preprocessing import StandardScaler

Methods to define the number of clusters

Import KMeans from sklearn library and initialize it as kmeans

from sklearn.cluster import KMeans

Compute k-means clustering on pre-processed data

Extract cluster labels from labels_ attribute

Analyzing average RFM values of each cluster

Create a cluster label column in the original DataFrame:

datamart_rfm_k2 = datamart_rfm.assign(Cluster = cluster_labels)

Calculate average RFM values and size for each cluster:

Analyzing average RFM values of each cluster

The result of a simple 2-cluster solution:

CUSTOMER SEGMENTATION IN PYTHON

Let's practice running k-

CUSTOMER SEGMENTATION IN PYTHON

Elbow criterion method

Elbow criterion method

# Fit KMeans and calculate SSE for each *k*

# Plot SSE for each *k*

Elbow criterion method

The elbow criterion chart:

Elbow criterion method

The elbow criterion chart:

Using elbow criterion method

Experimental approach - analyze segments

Experimental approach - analyze segments

3-cluster solution on the same normalized RFM dataset

CUSTOMER SEGMENTATION IN PYTHON

Let's practice finding the

CUSTOMER SEGMENTATION IN PYTHON

Profile and interpret

Approaches to build customer personas

Summary statistics of each cluster

Create a cluster label column in the original DataFrame:

datamart_rfm_k2 = datamart_rfm.assign(Cluster = cluster_labels)

Calculate average RFM values and sizes for each cluster:

Repeat the same for k=3

Summary statistics of each cluster

Snake plots to understand and compare segments

Prepare data for a snake plot

Transform datamart_normalized as DataFrame and add a Cluster column

Visualize a snake plot

Relative importance of segment attributes

relative_imp = cluster_avg / population_avg - 1

Analyze and plot relative importance

Recency Frequency MonetaryValue

Plot a heatmap for easier interpretation:

Relative importance heatmap

vs. printed output:

Recency Frequency MonetaryValue

CUSTOMER SEGMENTATION IN PYTHON

Your time to experiment

CUSTOMER SEGMENTATION IN PYTHON

Key steps of the segmentation project

Updated RFM data

Tenure - time since the first transaction

Goals for this project

CUSTOMER SEGMENTATION IN PYTHON

Let's dig in!

CUSTOMER SEGMENTATION IN PYTHON

What you have learned

CUSTOMER SEGMENTATION IN PYTHON

You might also like

# Fit KMeans and calculate SSE for each k

# Plot SSE for each k