0% found this document useful (0 votes)

264 views56 pages

Topic 7 - Discriminant and Cluster Analysis

Uploaded by

Ne Ne

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

264 views56 pages

Topic 7 - Discriminant and Cluster Analysis

Uploaded by

Ne Ne

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 56

TOPIC 7

Discriminant and
Cluster Analysis
BRM9 - Group 1
Table of contents

01 Technique overview

03 References

02 Technique practicing
01
Technique overview
DISCRIMINANT ANALYSIS

A statistical technique used to classify observations into

different categories based on a set of predictor variables

It predicts a categorical outcome variable.

Identifying the characteristics that distinguish between different groups of customers.

It can be used to develop marketing strategies that are tailored to specific customer
segments.
CLUSTER ANALYSIS

A class of techniques
Objects in each cluster
used to classify
tend to be similar to
objects or cases into
each other and
relatively
dissimilar to objects in
homogeneous groups
the other clusters
called clusters

There is no a priori
information about the
group or cluster
membership for any of
the objects
Which situation, in terms of variables’ characteristics,
are these techniques appropriate to be used?
Useful for analyzing data when the criterion or dependent
variable is categorical and the predictor or independent
variables are interval scaled.

Two-group discriminant analysis Discriminant

Discriminant analysis technique where the analysis
criterion variable has two categories.

Multiple discriminant analysis

Discriminant analysis technique where the criterion
variable involves three or more categories.
Which situation, in terms of variables’ characteristics,
are these techniques appropriate to be used?
Numerical
Cluster analysis algorithms use numerical data
to calculate distances between data points.

Cluster
Continuous
analysis
Cluster analysis algorithms typically work
best with continuous data.

Uncorrelated
Cluster analysis algorithms try to group data points
together based on their similarity.
Which situation, in terms of variables’ characteristics,
are these techniques appropriate to be used?

The number of clusters is unknown a priori

Cluster analysis algorithms can be used to identify
the optimal number of clusters in the data.
Cluster
The clusters are not well-defined analysis
Cluster analysis algorithms can be used to
identify clusters that are not well-defined in
terms of their shape or boundaries.
Differences

Discriminant Analysis
Cluster Analysis
Number of groups: Known
Number of groups: Unknown
Groups: Well-defined
Groups: Not well-defined
Goal: Predict the group
membership of new observations
Goal: Identify groups of similar
data points
With discriminant analysis:

Predict the group membership of

new data points

Identify the variables that are most important

for discriminating between the groups

Assess the accuracy of the model in

predicting the group membership of
new data points
BUSINESS QUESTIONS
With cluster analysis:

Identify groups of similar data points

Understand the patterns in the data

Segment the data into different groups

for targeted marketing or other
purposes
BUSINESS QUESTIONS
02
Technique practicing
Dependent Datasets Independent
variables variables
Which factors significantly explain the differences between online shopping
adopting groups and online shopping refusing groups?

Perceived value (PEV) Computer skill (COM) Shopping experience (SHE)

Consumers who perceive Consumers with higher Consumers with more

more value in online computer skills are more shopping experience
shopping are more likely likely to adopt online are more likely to adopt
to adopt online channels channels for shopping. online channels for
for shopping. shopping.
Which factors significantly explain the differences between online shopping
adopting groups and online shopping refusing groups?

Income (INC) Education (EDU)

Consumers with higher Consumers with higher

income are more likely to education are more likely
adopt online channels for to adopt online channels
shopping. for shopping.
Tests of Equality of Group Means

Not significant in explaining the differences

> 0.05
between consumers who adopt online
channels and consumers who refuse to make
online purchase.
2. How many percentages of the differences between these two groups of
consumers are explained by the predictors?

How much the factors can account for the

reasons why the two groups of consumers are
different from each other.

One group that adopts online shopping and

another group that refuses to adopt it. We want
to understand why they differ.
3. Among the significant factors, which one contributes most and least to
the differences between adopting and refusing group

Which factors have the biggest and

smallest impact on the differences
between the group of consumers who
adopt online shopping and the group that
refuses to adopt it.

For example: the predators that have the

function coefficient of 0.5 are more
meaningful than the one that has the
function coefficient of 0.1.
4. Assume that there are two potential consumers having following characteristics;
identify which group (adopting or refusal) each of them may belong to.

Shopping Perceived Computer

Consumer Income Education Gender Age
experience value skill

A 7 2 5 3 8 Female 25

B 1 6 7 6 3 Male 30

Compare the mean of adopting group and refusal group to A and B discriminant function,
cases with scores near to a centroid are predicted as belonging to that group
1 As many groups as
possible

Analysis of a survey
conducted to understand
2 2 groups

students' choices to enroll

3
at UEH 3 groups

Optimal number of
4 groups
Datasets

Independent
variables

5-point Likert scale

Optimal Number
Two Groups of Groups

As Many Groups
As Possible Three groups
K-Means Hierarchical
Clustering (k=2) Clustering
Two distinct Optimal number
groups will be K-Means of cluster
Cluster Analysis
formed Clustering (k=3)
(K-Means)
Each belong to Three distinct
a specific groups will be
cluster formed
BRM9 - G1

Discriminant
Analysis

TOPIC 7
Managers of an online retailer want to investigate the differences between two
groups of consumers who adopt online channel and who refuse to adopt online
channel for shopping (OSA) based on some criterion: shopping experience
(SHE), income (INC), education (EDU), perceived value (PEV), computer skill
(COM), gender and age.

Dependent
OSA
Variable

Independent
SHE, INC, EDU, PEV, COM, GENDER AND AGE
Variables
Steps for running a two-group discriminant analysis
Select ANALYZE from the SPSS menu bar

Click CLASSIFY and then

DISCRIMINANT.
Click CLASSIFY and then DISCRIMINANT.
Move “OSA” into the GROUPING VARIABLE box.

Click DEFINE RANGE. 1 for

MINIMUM and 2 for MAXIMUM.
Move “Age,” “PEV,” “COM,” “EDU,” “INC” and “Gender” into
the INDEPENDENTS box
Click on “Statistic”
Click on “Classify”
OUTPUT
1.Which factors significantly explain for the differences between online
shopping adopting group and online shopping refusing group?
Sig (SHE); Sig (Gender); Sig (Age) > 0.05: NOT significant in explaining the differences

Sig (INC); Sig (EDU); Sig (PEV); Sig (COM) < 0.05: significantly explains for the differences

=> Income (INC), education (EDU), perceived value (PEV) and computer skill (COM) are
the factors that significantly explain the differences between consumers who adopt
andconsumers who refuse to make online purchases.
2. How many percentages of the differences between these two groups of
consumers are explained by the predictors?

The Canonical Correlation coefficient is 0.57

=> The predictors (Income, Education, Perceived value and Computer
skill) explain 32.49% (= 0.57^2)
3. Among the significant factors, which one contributes most and least to
the differences between adopting and refusing group?

Shopping experience (SHE), gender and age are

NEGATIVE
=> The two groups of consumers cannot be differentiated

Education (0.585) is the MOST meaningful factor

used to explain the differences between the two groups of
consumers followed by computer skill (0.308), perceived
value (0.242) Income (0.074) is the LEAST.
4. Assume that there are two potential consumers having following
characteristics; identify which group (adopting or refusal) each of them may
belong to.
Group A: D= (0.038x2) + (0.324x5) + (0.133x3) + (0.185x8) + (-0.015x7) - 2.621
=> Group A: D= 0.849

Group B: D= (0.038x6) + (0.324x7) + (0.133x6) + (0.185x3) -0.015 - 2.621

=> Group B: D= 1,213
Group A: D= (0.038x2) + (0.324x5) + (0.133x3) + (0.185x8) + (-0.015x7) - 2.621
=> Group A: D= 0.849 (near 0.949 -> REFUSAL GROUP)

Group B: D= (0.038x6) + (0.324x7) + (0.133x6) + (0.185x3) -0.015 - 2.621

=> Group B: D= 1,213 (near 0.949 -> REFUSAL GROUP)

=> Cases with scores near to a centroid are predicted

as belonging to that group
BRM9 - G1

Cluster
Analysis

TOPIC 7
UEH DATA
The survey has been conducted to investigate students’ choice to enrol UEH. The
sample includes 50 participants who are currently UEH students. The
questionnaire is used to measure some key factors (5-point Likert scale)
including:
University reputation, coded as UR
Lecturers – FA
Learning program – PC
Financial support – CF
Facility – FACI
Student career development – CD
Social influence – PI
Classify as many groups as possible

Analyze > Classify > Hierachical Cluster

Classify as many groups as possible

Move the variables and label cases by SID

Choose Plots, then tick the Dendrogram
Choose Method, then choose Ward’s methos and Squared Eulidean distance
Classify as many groups as possible

Using the classification tree to determine

the number of cluster is a selective
process.

50 clusters
Classify as many groups as possible

Move variables and set label cases by SID

Type “50” in Number of Clusters

Create 50 clusters by Analyze > Classify > K-Means Cluster

Classify as many groups as possible
Classify as 2 groups

Create 2 clusters by K-Means Cluster

Classify as 2 groups

Group 1: Moderate Satisfaction (19,000 students)

Group 2: High Satisfaction with Emphasis on
Reputation and Teaching (31,000 students)
Classify as 3 groups

Group 1: Moderate Satisfaction (3,000 students)

Group 2: High Satisfaction Group (27,000 students)
Group 3: Moderate Satisfaction Group with Emphasis on Reputation and Lecturers (20,000
students)
Optimal number of group
Starting from the
right, between 10 and
25 there are two clear
clusters.
The gap is bridged
between 3 clusters
and 4 clusters.
However, when come
to 5 clusters, it’s a
sudden jump (gap)
=> The solution before
the gap indicate the good
solution.
In this case, it’s 4 clusters
Optimal number of group

By K-Mean Cluster, we create 4 groups by:

Group 1: High Satisfaction (20,000 students)
Group 2: High Satisfaction with Varied Preferences (11,000 students)
Group 3: Low Satisfaction Group (3,000 students)
Group 4: Moderate Satisfaction with Emphasis on Reputation (16,000 students)
BRM9 - G1

Thanks
for listening

TOPIC 7
03
References
(1) Malhotra NK. Marketing research : an applied orientation.
Upper Saddle River, Nj ; London: Prentice Hal l; 2010.

Cool Mate Digital Marketing Proposal
No ratings yet
Cool Mate Digital Marketing Proposal
29 pages
Session 5 - Correlation and Regression
100% (1)
Session 5 - Correlation and Regression
32 pages
Data Structure & Algorithms Lab Manual V1.2-1
No ratings yet
Data Structure & Algorithms Lab Manual V1.2-1
97 pages
Chapter 3 With Answers
100% (1)
Chapter 3 With Answers
5 pages
Chapter 14
100% (2)
Chapter 14
16 pages
Individual Assignment - MKT201 - Đoàn Hoàng Anh - hs140521 - MKT1406
No ratings yet
Individual Assignment - MKT201 - Đoàn Hoàng Anh - hs140521 - MKT1406
7 pages
Ananas: Microenvironment Analysis
No ratings yet
Ananas: Microenvironment Analysis
28 pages
EIM Knowledge Test
No ratings yet
EIM Knowledge Test
768 pages
Bách Hóa Xanh Team B
No ratings yet
Bách Hóa Xanh Team B
58 pages
Câu hỏi trắc nghiệm thương hiệu chương 4
No ratings yet
Câu hỏi trắc nghiệm thương hiệu chương 4
11 pages
Culture of Fear
No ratings yet
Culture of Fear
3 pages
(CB-G8) Case Session 5
No ratings yet
(CB-G8) Case Session 5
14 pages
G7 Case
No ratings yet
G7 Case
15 pages
Final Strategy & Innovation PDF
No ratings yet
Final Strategy & Innovation PDF
21 pages
Chuong/ Chapter 3: Consumer Behavior: Multiple Choice
No ratings yet
Chuong/ Chapter 3: Consumer Behavior: Multiple Choice
5 pages
Critical Factors Influencing Consumer Online Purchase Intention For Cosmetics and Personal Care Products in Vietnam
No ratings yet
Critical Factors Influencing Consumer Online Purchase Intention For Cosmetics and Personal Care Products in Vietnam
11 pages
Online Knowledge Test 1 Attempt Review
No ratings yet
Online Knowledge Test 1 Attempt Review
12 pages
giải case study marketingggg
No ratings yet
giải case study marketingggg
7 pages
CBVN GTP'24 - Carlsberg Case Challenge - Week 1
No ratings yet
CBVN GTP'24 - Carlsberg Case Challenge - Week 1
8 pages
End-Module Assignment: Tiki Analysis
No ratings yet
End-Module Assignment: Tiki Analysis
12 pages
Chapter 14-Cullture
No ratings yet
Chapter 14-Cullture
64 pages
Rie Nevan
No ratings yet
Rie Nevan
8 pages
Glo Bus Quiz 1 Answers
No ratings yet
Glo Bus Quiz 1 Answers
20 pages
QUESTIONNAIRE-Trần Hoàng Thành-11194734
No ratings yet
QUESTIONNAIRE-Trần Hoàng Thành-11194734
4 pages
Prior-To-Class Quiz 10 - Statistics For Business-T123PWB-1
No ratings yet
Prior-To-Class Quiz 10 - Statistics For Business-T123PWB-1
6 pages
Asm 2a MP
No ratings yet
Asm 2a MP
17 pages
Huỳnh Nguyễn Nam Phương - Individual Reflective Assignment - Consumer Behavior-T22324PWB-1
No ratings yet
Huỳnh Nguyễn Nam Phương - Individual Reflective Assignment - Consumer Behavior-T22324PWB-1
29 pages
TRAN THI THUY AN - 2045519130 - K58BFA - Strategy & Innovations
No ratings yet
TRAN THI THUY AN - 2045519130 - K58BFA - Strategy & Innovations
17 pages
Consumer Behavior Final Exam Reviw
100% (1)
Consumer Behavior Final Exam Reviw
11 pages
Swot Analysis Strengths Weaknesses
No ratings yet
Swot Analysis Strengths Weaknesses
8 pages
Project Management - Session 15 - Revision For The Final Examination
No ratings yet
Project Management - Session 15 - Revision For The Final Examination
113 pages
QT Marketing Trac Nghiem
No ratings yet
QT Marketing Trac Nghiem
133 pages
Chapter 21: Theory of Consumer Choice Section A
No ratings yet
Chapter 21: Theory of Consumer Choice Section A
8 pages
Class Submission
No ratings yet
Class Submission
3 pages
6 Case Study Có Đáp Án
No ratings yet
6 Case Study Có Đáp Án
12 pages
Strategic Brand Management Keller 10 Rev Measuring Outcomes of BE Market Performance 0010
No ratings yet
Strategic Brand Management Keller 10 Rev Measuring Outcomes of BE Market Performance 0010
32 pages
Consumer Behaviour Mcqs (Set-4) : Answer: A
No ratings yet
Consumer Behaviour Mcqs (Set-4) : Answer: A
6 pages
Online Knowledge Test 1 Attempt Review 1
No ratings yet
Online Knowledge Test 1 Attempt Review 1
11 pages
Chapter 12-Income and Social Class
0% (1)
Chapter 12-Income and Social Class
43 pages
(CB-G8) Case Session 8
No ratings yet
(CB-G8) Case Session 8
15 pages
Group 6 Haidilao
No ratings yet
Group 6 Haidilao
36 pages
Chapter 6 Testbank
No ratings yet
Chapter 6 Testbank
22 pages
Chapter 9 MCQs
100% (1)
Chapter 9 MCQs
3 pages
Nielsen Mock Test: Brand Super Spa
No ratings yet
Nielsen Mock Test: Brand Super Spa
15 pages
Review Questions Case Study
No ratings yet
Review Questions Case Study
45 pages
END-MODULE Strategy
No ratings yet
END-MODULE Strategy
23 pages
Luyen Tap Chuong 2 - and SOLUTION - Daily Transaction
No ratings yet
Luyen Tap Chuong 2 - and SOLUTION - Daily Transaction
8 pages
Sales Management Chapter 1 The Nature and Role of Selling
No ratings yet
Sales Management Chapter 1 The Nature and Role of Selling
12 pages
OSCM Tự luận
No ratings yet
OSCM Tự luận
21 pages
Exercises On Central Bank Intervention
No ratings yet
Exercises On Central Bank Intervention
4 pages
Question Bank - Marketing Metrics
No ratings yet
Question Bank - Marketing Metrics
29 pages
Group-9 INS4003.02 Netfix
No ratings yet
Group-9 INS4003.02 Netfix
23 pages
MARKETING STRATEGY OF PHUC LONG - Nguyen Dinh Cat Tuong 2273201081992
No ratings yet
MARKETING STRATEGY OF PHUC LONG - Nguyen Dinh Cat Tuong 2273201081992
22 pages
Chapter 3 Test Bank Test Bank
No ratings yet
Chapter 3 Test Bank Test Bank
46 pages
Final Report
No ratings yet
Final Report
20 pages
MKT318M - GROUP ASSIGNMENT Bitis
No ratings yet
MKT318M - GROUP ASSIGNMENT Bitis
62 pages
Assigment 1: Business Strategy: Task 1.1 Strategy Process Vinamilk Vision
No ratings yet
Assigment 1: Business Strategy: Task 1.1 Strategy Process Vinamilk Vision
7 pages
Solomon, Michael R - Consumer Behavior - Buying, Having, and Being (Global Edition) - Pearson (2017) - 203
No ratings yet
Solomon, Michael R - Consumer Behavior - Buying, Having, and Being (Global Edition) - Pearson (2017) - 203
1 page
Quizz 1 Mkt201
No ratings yet
Quizz 1 Mkt201
4 pages
QTH cuối kì
No ratings yet
QTH cuối kì
18 pages
Analysis Case Processing Summary
No ratings yet
Analysis Case Processing Summary
8 pages
SKYMARK C-Series Water-Cooled Self-Contained Units Catalog (1112)
No ratings yet
SKYMARK C-Series Water-Cooled Self-Contained Units Catalog (1112)
24 pages
MIT's Undergraduate String Theory Project
100% (13)
MIT's Undergraduate String Theory Project
18 pages
Amazon Braket: Developer Guide
No ratings yet
Amazon Braket: Developer Guide
54 pages
Using Third Overtone Crystals
No ratings yet
Using Third Overtone Crystals
11 pages
BIHANA2015 - Hollis - Performance Tuning in Sap Hana PDF
No ratings yet
BIHANA2015 - Hollis - Performance Tuning in Sap Hana PDF
75 pages
Bomba Kobe T200 - Manual de Partes
100% (1)
Bomba Kobe T200 - Manual de Partes
13 pages
Shell Diala S2 ZU-I Gasoil Tariff: Performance, Features & Benefits Main Applications
No ratings yet
Shell Diala S2 ZU-I Gasoil Tariff: Performance, Features & Benefits Main Applications
2 pages
Ethylene Oxide: Jump To
100% (1)
Ethylene Oxide: Jump To
31 pages
0 - A Manual For The Part-Compositor Framework
No ratings yet
0 - A Manual For The Part-Compositor Framework
10 pages
FT (06) - Answerkey (RM) Phase02
No ratings yet
FT (06) - Answerkey (RM) Phase02
22 pages
Vendor P Q App
No ratings yet
Vendor P Q App
441 pages
Industrial Filters PDF
No ratings yet
Industrial Filters PDF
48 pages
SAP2000 Tutorial Example: Analysis and Design of Continuous RC Beam
No ratings yet
SAP2000 Tutorial Example: Analysis and Design of Continuous RC Beam
21 pages
Condenser Cladding Info
0% (1)
Condenser Cladding Info
37 pages
Progs
No ratings yet
Progs
22 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
4 pages
Chapter 4 Measures of Location
No ratings yet
Chapter 4 Measures of Location
37 pages
PH and PH Meter-1
100% (1)
PH and PH Meter-1
9 pages
Customize Pricing Procedure
No ratings yet
Customize Pricing Procedure
5 pages
Chapter 3
No ratings yet
Chapter 3
77 pages
Heliax AVA5-50 Coaxial Cable: One Company. A World of Solutions
No ratings yet
Heliax AVA5-50 Coaxial Cable: One Company. A World of Solutions
2 pages
Rotational Motion - Torque and Center of Gravity
No ratings yet
Rotational Motion - Torque and Center of Gravity
39 pages
Buckley 2005
No ratings yet
Buckley 2005
11 pages
Formulation In-Vitro Evaluation of Sulfanilamide 15% Vaginal Cream
No ratings yet
Formulation In-Vitro Evaluation of Sulfanilamide 15% Vaginal Cream
3 pages
Accelerated Synthesis of Novel Materials
No ratings yet
Accelerated Synthesis of Novel Materials
12 pages
BODMAS 1new
No ratings yet
BODMAS 1new
2 pages
Nasa 5020a - Its All in The Preload - Predictive Engineering Fea Consulting Engineering Service 20201230
No ratings yet
Nasa 5020a - Its All in The Preload - Predictive Engineering Fea Consulting Engineering Service 20201230
8 pages
Probability of One Event
No ratings yet
Probability of One Event
14 pages
Relativistic Electrodynamics PDF
No ratings yet
Relativistic Electrodynamics PDF
10 pages

Topic 7 - Discriminant and Cluster Analysis

Uploaded by

Topic 7 - Discriminant and Cluster Analysis

Uploaded by

TOPIC 7

A statistical technique used to classify observations into

It predicts a categorical outcome variable.

Identifying the characteristics that distinguish between different groups of customers.

Two-group discriminant analysis Discriminant

Multiple discriminant analysis

The number of clusters is unknown a priori

Predict the group membership of

Identify the variables that are most important

Assess the accuracy of the model in

Identify groups of similar data points

Understand the patterns in the data

Segment the data into different groups

Perceived value (PEV) Computer skill (COM) Shopping experience (SHE)

Consumers who perceive Consumers with higher Consumers with more

Income (INC) Education (EDU)

Consumers with higher Consumers with higher

Not significant in explaining the differences

How much the factors can account for the

One group that adopts online shopping and

Which factors have the biggest and

For example: the predators that have the

Shopping Perceived Computer

students' choices to enroll

5-point Likert scale

Click CLASSIFY and then

Click DEFINE RANGE. 1 for

The Canonical Correlation coefficient is 0.57

Shopping experience (SHE), gender and age are

Education (0.585) is the MOST meaningful factor

Group B: D= (0.038x6) + (0.324x7) + (0.133x6) + (0.185x3) -0.015 - 2.621

Group B: D= (0.038x6) + (0.324x7) + (0.133x6) + (0.185x3) -0.015 - 2.621

=> Cases with scores near to a centroid are predicted

Analyze > Classify > Hierachical Cluster

Move the variables and label cases by SID

Using the classification tree to determine

Move variables and set label cases by SID

Create 50 clusters by Analyze > Classify > K-Means Cluster

Create 2 clusters by K-Means Cluster

Group 1: Moderate Satisfaction (19,000 students)

Group 1: Moderate Satisfaction (3,000 students)

By K-Mean Cluster, we create 4 groups by:

You might also like