0% found this document useful (0 votes)

28 views28 pages

Cluster Analysis vs. Market Segmentation2

The document compares cluster analysis and market segmentation, highlighting their definitions, methodologies, and applications. Cluster analysis groups similar objects based on data-driven techniques, while market segmentation categorizes consumers for targeted marketing based on various attributes. Both approaches aim to identify distinct groups to enhance marketing effectiveness, but have different methodologies and limitations.

Uploaded by

anushakaran.29

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views28 pages

Cluster Analysis vs. Market Segmentation2

Uploaded by

anushakaran.29

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 28

Cluster Analysis vs.

Market
Segmentation
Pavel Brusilovsky
Objectives
• Introduce cluster analysis and market segmentation by
discussing:
– Concept of cluster analysis and basic ideas and
algorithms
– Concept of market segmentation and basic ideas
– Comparison of these two approaches

2
Cluster Analysis Algorithms

• There appear to be more

algorithms for clustering
data than data to analyze

Quant People
Folklore

3
What is Cluster Analysis?
• Cluster is a group of similar objects (cases, points, observations,
examples, members, customers, patients, locations, etc)

• Cluster Analysis is a set of data-driven partitioning techniques

designed to group a collection of objects into clusters, such that
– the number of groups (clusters) as well as their forms are
unknown
– the degree of association or similarity
• is strong between members of the same cluster
• is weak between members of different clusters

• The nature of Cluster Analysis is data exploration that conducted in

repetitive fashion. Clusterization is not a single grouping, but the
process of getting well interpretable groups of objects under
consideration.

4
What is not Cluster Analysis?
• Supervised classification, for example, Discriminant Analysis, Naïve
Bayes Classifier, Support Vector Machines, etc.
– Have class label information

• Simple segmentation
– Doctors’ segmentation by specialty, assuming that each doctor’s
specialty is known
– Customer segmentation by sex, education level, geography and
response rate (assuming that these customer attributes are
known)

• Results of a query (groupings are the outcome of an external

specification)

5
Supervised vs. Unsupervised
 Cluster analysis is a product of at least two different quantitative
fields: statistics and machine learning
 Machine learning
 Unsupervised is a learning from raw data (no examples of
correct classification). In other words, class label information is
unavailable.
 No measure of success
 Heuristic arguments for judgments
 Lots of methods developed
 Supervised is a learning from data where the correct
classification of examples is given (class label information is
available)

6
Questions about groups
• Groups are unknown
– Are there groups in the data?
• Traditional Cluster Analysis
• Kohonen Vector Quantization

• Groups are known

– Given the groups, are there differences in the central tendency
of the groups?
• ANOVA (one dependent variable)
• MANOVA (several dependent variables)
– To which groups does this new object belong?
• Discriminant Analysis

7
Market segmentation
– Market segmentation is one of the most fundamental strategic
marketing concepts:
• grouping people (with the willingness, purchasing power, and the
authority to buy) according to their similarity in several
dimensions related to a product under consideration.

– The better the segments chosen for targeting by a particular

organization, the more successful the organization is in the
marketplace. The objectives are accurately predict the needs of
customers and improve the profitability.

8
Variables used in market segmentation
• Demographics
– Age
– Gender
– Education
– Income
– Home ownership, etc.
• Psychographics
– Lifestyle
– Attitude
– Beliefs
– Personality
– Buying motives, etc.
• Brand Loyalty
• Geography
– State
– ZIP
– City size
– Rural vs. Urban, etc.
9
Market Segmentation and Cluster Analysis
• Help marketers discover distinct groups in their customer bases, and
then use this knowledge to develop targeted marketing programs

• The underlying definition of cluster analysis procedures mimic the

goals of market segmentation:
– to identify groups of respondents that minimizes differences
among members of the same group
• highly internally homogeneous groups
– while maximizing differences between different groups
• highly externally heterogeneous groups

• Market Segmentation solution depends on

– variables used to segment the market
– method used to arrive at a certain segmentation
10
Criteria for Successful Market Segmentation
• Identifiability
– Can we see clear differences between segments?
• Substantiality
– Are the segments large enough to warrant separate marketing
targeting?
• Accessibility
– Can we reach our customers?
• Stability
– Do our segments stable over a certain period of time?
• Responsiveness
– Is the response to our marketing effort segment specific?
• Actionability
– Do the segmentation provides direction of marketing efforts?

11
Types of Clustering

• Partitional clustering
– A division of objects into non-overlapping subsets (clusters)
such that each object is in exactly one cluster

• Hierarchical clustering
– A set of nested clusters organized as a hierarchical tree

12
Other Distinctions Between Different
Clustering
• Different treatment of object characteristics vs. even treatment
– Characteristics are subdivided into two groups: dependent
variable and independent variables (Classification and
Regression trees)
– There is no such a subdivision (K-means)
• Model-based vs. Non-model-based
– A model is hypothesized for each of the clusters and the idea is
to find the best fit of that model to each cluster (Latent Class
Clustering)

13
Limitations and Problems of
Traditional Cluster Analysis Methods
• Need to specify K (number of clusters) in advance
• Applicable only for interval variables (only numeric data)
• Has problems when clusters are of differing
– Sizes
– Densities
– Non-globular shapes
• Unable to handle noisy data and outliers

14
Latent Class Cluster Analysis (LCCA)
• LCCA is a model-based approach:
– Statistical model is postulated for the population from which the
data sample is obtained
– LC model do not rely on the traditional modeling assumptions
(linearity, normality, homogeneity)
– It is assumed that a mixture of underlying probability distributions
generates the data
– LC model includes a K-category latent variable, each category
represents a cluster
– Objects are classified into clusters based upon membership
probabilities that are estimated directly from the data

15
Advantages of Latent Class Cluster
Analysis (LCCA)
• Optimal number of clusters is determined as a result of LCCA, using
rigorous statistical tests

• No decisions have to be made about the scaling of the observed

variables

• Variables maybe continuous, nominal, ordinal, count, or any

combination of these

16
Theory and Cluster Analysis
 Is clustering a theory?
 A theory could be true or false
 Unlike a theory, a clustering is neither true nor false, and should be
judged largely on the interpretability and usefulness of results
 No measure of success
 Heuristic arguments for judgments
 Selection of right method is a problem
 However, a clustering may be useful for suggesting a theory, which
could then be tested

17
References
 Leonard Kaufman and Peter Rousseeuw (2005), Finding Groups in
Data: An Introduction to Cluster Analysis, Wiley Series in
Probability and Statistics, 337 p.
 Mark Aldenderfer and Roger Blashfield (1984), Cluster Analysis
(Quantitative Applications in the Social Sciences), SAGE
Publications, Inc., 90 p.
 Brian Everitt, Sabine Landau and Morven Leese (2001) Cluster
Analysis, Oxford University Press, 248 p.
 Marketing Segmentation (
http://www.beckmanmarketing8e.nelson.com/ppt/chapter03.pps. )

18
Application of clustering and customer
segmentation to survey data

19
Case study: background, objectives, and
methodology
• Producer and distributor of health and beauty products launched a new product.
The product can be ordered only on the website.
• In six month an internet survey was conducted. Only three simple questions were
asked:
– How many adults are in your household?
– How many of them adopted the product?
– How many of them did not adopt the product?
• When the total number of adopters and non-adopters is less than the number of
adults in a household, the difference is treated as the number of unknowns. There
are some other situation when the number of unknown makes sense to introduce.

• The client asked us to analyze the survey data (obviously it is not the most
informative survey BI Solutions dealt with).
• The objectives of the study was to extract as much as possible useful information
from the survey data in order to understand the distribution and the usage of the
product among households, associate with each household a corresponding
likelihood of adoption, and develop methodology to employ this info in the
marketing programs.
• Methodology: synergy of cluster analysis of proportional data and intuitive
20
segmentation.
Clustering of households
• We calculated the following three variables:
– P1 is a proportion of customers in a household with unknown product
adaption behavior
– P2 is a proportion of customers in a household who adopted the
product
– P3 is a proportion of customers in a household who did not adopt the
product
• Therefore, each household is characterized by a point in three-
dimensional proportion space. Once again, it was the only available
information (that we got from the client).

• We decided to employ synergy of cluster analysis and customer

segmentation. Six clusters were identified as the result of K-means
clustering.

• Variables importance in K-means clustering:

21
Household representation in three
dimensional proportion space
Each household is
represented by a data
point (a square) in 3-
dimensional proportion
space.

Households form a 2-
dimensional triangle.

Each square might

represent several
customers.

22
Clusters of households: mean value of
proportions and cluster interpretation

Mean value (%)

Number of
Cluster Interpretation customers P1 (Non-Adopters) P2 (Adopters) P3 (Unknown)
1 Non-Adopters 1463 96.4 0.6 3.0
2 Adopters 3915 0.3 97.9 1.8
Mixed households with
adopters and non-
3 adopters 474 46.3 51.8 1.9
Mixed households with
non-adopters and
4 unknowns 537 54.0 4.3 41.7
5 Unknowns 1097 0.8 0.9 98.3
Households with adopters
6 and unknowns 1201 0.3 57.9 41.8

23
Cluster Profiling: good separation and
good interpretability

24
Households as a set of three proportions /
percentages
Three proportions:
P1 (unknown)
P2 (non-adopters)
P3 (adopters)

Different color reflect clusters

Cluster 1 (Non-Adopters)

Cluster 2 (Adopters)

Cluster 3 (Adopters and Non-Adopters)

Cluster 4 (Non-Adopters and Unknown)

Cluster 5 (Uncertain)

Cluster 6 (Adopters and Unknown)

25
Cluster /Sub-cluster profile
Number of Number of
Households in Cluster Likelihood Households in
Cluster Cluster Profile of Adoption Segmentation Rule Sub-Cluster

1 1,463 Non-Adopters Low All households in cluster 1,463

High Likely to adopt > 40% The 1,874
2 3,915 Adopters Medium Rest 2,041

Mixed households
with adopters and Medium Likely/Unlikely >= 40% The 401
3 474 non-adopters Low rest 73

Mixed households
with non-adopters
4 537 and unknowns Low All households in cluster 537

5 1,097 Unknowns Unknown All households in cluster 1,097

Households with High Likely to adopt > 60% 378

adopters and Low Likely/Unlikely >= 40% The 723
6 1,201 unknowns Unknown rest 100

26
Likelihood of the new product adoption

27
Next steps
• Customer profiling
– Data enrichment
• Data enrichment (ZIP level census data)
• Usage other health/beauty products (household level data)
• Estimation of the likelihood of the product adoption by data mining predictive
analysts / scoring households with unknown purchasing behaviour

• Identifying customers with high likelihood of the product adoption for targeting

• Developing program for increasing up-sell and cross-sell

• Developing program for customer retention

• Spatial clustering of potential and real customers

2108 03929
No ratings yet
2108 03929
201 pages
Spatial Data Mining: Theory and Application
No ratings yet
Spatial Data Mining: Theory and Application
337 pages
Data Extracted For XAI in CyberSec Paper
No ratings yet
Data Extracted For XAI in CyberSec Paper
354 pages
Graph Neural Network Introduction
No ratings yet
Graph Neural Network Introduction
88 pages
Sih Report
No ratings yet
Sih Report
33 pages
Machine Learning, ML Ass 6
No ratings yet
Machine Learning, ML Ass 6
11 pages
Industrial Market Segmentation
100% (1)
Industrial Market Segmentation
23 pages
Segmentation & Targeting
No ratings yet
Segmentation & Targeting
71 pages
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
ITM Marketing Analytics Dec 2024
No ratings yet
ITM Marketing Analytics Dec 2024
149 pages
Core Concepts in Statistical Learning
From Everand
Core Concepts in Statistical Learning
Tushar Gulati
No ratings yet
Cluster Analysis - Part A
No ratings yet
Cluster Analysis - Part A
77 pages
Market Research - Vincent
No ratings yet
Market Research - Vincent
109 pages
Analysis On Credit Card Fraud Detection Methods
0% (1)
Analysis On Credit Card Fraud Detection Methods
7 pages
Project Report On Instagram Clone
No ratings yet
Project Report On Instagram Clone
28 pages
AI in Regulatory Reporting - Whitepaper - Digital PDF
No ratings yet
AI in Regulatory Reporting - Whitepaper - Digital PDF
16 pages
UNIT II-Segmentation, Positioning, and Product Optimization
No ratings yet
UNIT II-Segmentation, Positioning, and Product Optimization
48 pages
Microsoft Azure Ai Fundamentals Certification Companion Guide To Prepare For The Ai900 Exam 1st Edition Krunal S Trivedi Download
No ratings yet
Microsoft Azure Ai Fundamentals Certification Companion Guide To Prepare For The Ai900 Exam 1st Edition Krunal S Trivedi Download
82 pages
Cluster Analysis
No ratings yet
Cluster Analysis
49 pages
T Extacy
No ratings yet
T Extacy
184 pages
Unit5 Clustering
No ratings yet
Unit5 Clustering
74 pages
BM2515 Week 2 Segmentation Slides
No ratings yet
BM2515 Week 2 Segmentation Slides
63 pages
ASMD Theory
No ratings yet
ASMD Theory
63 pages
Cluster Analysis
No ratings yet
Cluster Analysis
46 pages
Unit IV 3
No ratings yet
Unit IV 3
59 pages
ML Quiz-1
No ratings yet
ML Quiz-1
4 pages
Customer Segemntation
No ratings yet
Customer Segemntation
26 pages
Chapter 5 CLUSTERING
No ratings yet
Chapter 5 CLUSTERING
36 pages
Hyper Parameter Turning
No ratings yet
Hyper Parameter Turning
4 pages
Cluster Analysis: Learning Objectives
No ratings yet
Cluster Analysis: Learning Objectives
53 pages
MR - Session 16
No ratings yet
MR - Session 16
66 pages
Automatic Player Face Detection and Recognition For Players in Cricket Games - PPT 3
No ratings yet
Automatic Player Face Detection and Recognition For Players in Cricket Games - PPT 3
38 pages
Cluster Analysis: Prepared By: (Group-5) Ashish Goyal Jitendra Jain Nitesh Sadani
100% (1)
Cluster Analysis: Prepared By: (Group-5) Ashish Goyal Jitendra Jain Nitesh Sadani
19 pages
Notes For Chapter 3
No ratings yet
Notes For Chapter 3
15 pages
Comparison of Malware Classification Methods Using Convolutional Neural Network Based On Api Call Stream
No ratings yet
Comparison of Malware Classification Methods Using Convolutional Neural Network Based On Api Call Stream
19 pages
Cluster Analysis - Part B
No ratings yet
Cluster Analysis - Part B
25 pages
Chapter 23 - Cluster Analysis
100% (1)
Chapter 23 - Cluster Analysis
16 pages
Prof. Swapna Dutta Khan: Submitted by
100% (1)
Prof. Swapna Dutta Khan: Submitted by
14 pages
Artificial Intelligence Use in Disasters Management
No ratings yet
Artificial Intelligence Use in Disasters Management
3 pages
10clustering - Han and Kamber
No ratings yet
10clustering - Han and Kamber
93 pages
ML Module 5
No ratings yet
ML Module 5
15 pages
Clustering Methods
No ratings yet
Clustering Methods
14 pages
Concepts and Techniques: - Chapter 10
No ratings yet
Concepts and Techniques: - Chapter 10
97 pages
Deep Learning A Tutorial
No ratings yet
Deep Learning A Tutorial
16 pages
Cluster Analysis and Multidimensional Scaling Ignou PDF
No ratings yet
Cluster Analysis and Multidimensional Scaling Ignou PDF
11 pages
COMPUTATIONAL METHODS AND TECHNIQUES - 1st - 23-24 - ALL - P1PGCC01 - Mtech
No ratings yet
COMPUTATIONAL METHODS AND TECHNIQUES - 1st - 23-24 - ALL - P1PGCC01 - Mtech
2 pages
Q-Learning Algorithm
No ratings yet
Q-Learning Algorithm
13 pages
Data Segmentation
No ratings yet
Data Segmentation
27 pages
BT4211 Data-Driven Marketing: Product: Segmentation, Targeting
No ratings yet
BT4211 Data-Driven Marketing: Product: Segmentation, Targeting
31 pages
Tema 4 PRIM 2023 24
No ratings yet
Tema 4 PRIM 2023 24
13 pages
Week 1
No ratings yet
Week 1
40 pages
Marketing Management
No ratings yet
Marketing Management
23 pages
A Survey On Machine Learning Algorithms Techniques and
No ratings yet
A Survey On Machine Learning Algorithms Techniques and
6 pages
ML Project Report
No ratings yet
ML Project Report
22 pages
Segmentation and Targeting: Basics Market Definition Segmentation Research and Methods Behavior-Based Segmentation
No ratings yet
Segmentation and Targeting: Basics Market Definition Segmentation Research and Methods Behavior-Based Segmentation
77 pages
Applying Artificial Intelligence Techniques To Prevent Cyber Assaults
No ratings yet
Applying Artificial Intelligence Techniques To Prevent Cyber Assaults
8 pages
Clustering K Means Agnes
No ratings yet
Clustering K Means Agnes
36 pages
Segmentation and Profiling Using SPSS For Windows
No ratings yet
Segmentation and Profiling Using SPSS For Windows
44 pages
Customer Segmentation Report
No ratings yet
Customer Segmentation Report
31 pages
Lecture 1 PDF
No ratings yet
Lecture 1 PDF
23 pages
Lecture 1 PDF
No ratings yet
Lecture 1 PDF
23 pages
Segmentation and Profiling Using SPSS For Windows: Kate Grayson
No ratings yet
Segmentation and Profiling Using SPSS For Windows: Kate Grayson
44 pages
Post Graduate Program in Business Analytics: M&CRM: STP, Customer Intelligence
No ratings yet
Post Graduate Program in Business Analytics: M&CRM: STP, Customer Intelligence
31 pages
Market Segmentation - Cluster Analysis
No ratings yet
Market Segmentation - Cluster Analysis
18 pages
Why Artificial Intelligence Needs To Understand Consequences
No ratings yet
Why Artificial Intelligence Needs To Understand Consequences
12 pages
Search Algorithms and Systems: Definitive Reference for Developers and Engineers
From Everand
Search Algorithms and Systems: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Cluster Analysis
No ratings yet
Cluster Analysis
26 pages
New Criteria For Market Segmentation - HBR
No ratings yet
New Criteria For Market Segmentation - HBR
12 pages
Market Segmentation - A Case Study
No ratings yet
Market Segmentation - A Case Study
12 pages
Augmented Intelligence Ebook
No ratings yet
Augmented Intelligence Ebook
20 pages
IJCRT22A6129
No ratings yet
IJCRT22A6129
9 pages
Final Synopsis
No ratings yet
Final Synopsis
9 pages
Career Transition Handbook
No ratings yet
Career Transition Handbook
8 pages
Cluster Analysis For Market Segmentation
No ratings yet
Cluster Analysis For Market Segmentation
24 pages
Leeds Artificial Intelligence Course Guide
No ratings yet
Leeds Artificial Intelligence Course Guide
11 pages
Audience Profiling and Segmentation
No ratings yet
Audience Profiling and Segmentation
4 pages
STP Cluster Discriminant
No ratings yet
STP Cluster Discriminant
14 pages
Mall Customer Segmentation Kalash Daf
No ratings yet
Mall Customer Segmentation Kalash Daf
12 pages
Cluster Analysis
No ratings yet
Cluster Analysis
5 pages
Dolnicar 2003 SHARE
No ratings yet
Dolnicar 2003 SHARE
9 pages
Industrial Statistics: Application of Multivariate Statistical Methods in Marketing Research
No ratings yet
Industrial Statistics: Application of Multivariate Statistical Methods in Marketing Research
15 pages
Kanenus College
No ratings yet
Kanenus College
6 pages
Abstract
No ratings yet
Abstract
2 pages
Cluster Analysis
No ratings yet
Cluster Analysis
2 pages
Mall Customer Segmentation Using Cluster
No ratings yet
Mall Customer Segmentation Using Cluster
6 pages
Multi-Label Classification With Weighted Labels Using Learning Classifier Systems
No ratings yet
Multi-Label Classification With Weighted Labels Using Learning Classifier Systems
7 pages
Discriminative Clustering For Market Segmentation
No ratings yet
Discriminative Clustering For Market Segmentation
9 pages
Customer Segmentation
No ratings yet
Customer Segmentation
7 pages
Segmentation: Foundation of Marketing Strategy: Gelb Consulting Group, Inc
No ratings yet
Segmentation: Foundation of Marketing Strategy: Gelb Consulting Group, Inc
8 pages
Monitor Making Segmentation Work
No ratings yet
Monitor Making Segmentation Work
5 pages

Cluster Analysis vs. Market Segmentation2

Uploaded by

Cluster Analysis vs. Market Segmentation2

Uploaded by

Cluster Analysis vs.

• There appear to be more

• Cluster Analysis is a set of data-driven partitioning techniques

• The nature of Cluster Analysis is data exploration that conducted in

• Results of a query (groupings are the outcome of an external

• Groups are known

– The better the segments chosen for targeting by a particular

• The underlying definition of cluster analysis procedures mimic the

• Market Segmentation solution depends on

• No decisions have to be made about the scaling of the observed

• Variables maybe continuous, nominal, ordinal, count, or any

• We decided to employ synergy of cluster analysis and customer

• Variables importance in K-means clustering:

Each square might

Mean value (%)

Different color reflect clusters

Cluster 3 (Adopters and Non-Adopters)

Cluster 4 (Non-Adopters and Unknown)

Cluster 6 (Adopters and Unknown)

1 1,463 Non-Adopters Low All households in cluster 1,463

5 1,097 Unknowns Unknown All households in cluster 1,097

Households with High Likely to adopt > 60% 378

• Developing program for increasing up-sell and cross-sell

• Developing program for customer retention

• Spatial clustering of potential and real customers

You might also like