7.2. Topic 7. Part 2
7.2. Topic 7. Part 2
Topic 7.
MULTIVARIABLE ANALYSIS. PART 2
Lecturer : Inna Alexeeva A.
[email protected]
7.4. Multidimensional Scale (MDS)
2
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
Allocation of labels
Evaluation of the Decision on the
to the dimensions &
reliability and number of
interpretation of the
validity dimensions
configurations
3
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
4
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
5
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
6
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
Entry data acquisition
7
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
8
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
9
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
10
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.4. Multidimensional Scale (MDS)
Application in Marketing:
• Analysis of the competition in the market
• Perceptions of the customers
Sport
• Preferences of the customers (high)
Spatial configuration
conjointly of the car Prestigious,
Economic
high cost
brands and preferences
of the consumers
(low)
Application in Marketing:
Preference data consist of the ordered ranges of the brands according to the
preferences of the consumers.
The ideal points of the interviewees are allocated on the same spatial graph
representation. Each ideal point represents the reference locus of each individual.
This way, the interviewee 1 (labeled with I1) prefers sport cars: Porsche, Jaguar and
Audi; while the interviewee 2 (labelled with I2) prefers luxurious cars: Continental,
Mercedes, Lexus and Cadillac. Sport
(high)
Prestigious,
Economic
high cost
(low)
13
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
14
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
15
Marketing research. Topic 7: Multivariable Analysis. Part 1
7.5. Correspondence Analysis
Example
16
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
Example
The Football Clubs: Barcelona, Real Madrid, Atlético de Madrid, Athletic Club
Bilbao and Valencia CF.
Preference: the first one will be which they follow, the second one – the nicest
of the most friendly and the fifth one – the most unpleasant.
The goal: to analyse the relative position of the followers of these teams.
Followers Barça R. Madrid At. Madrid At. Bilbao Valencia Active margin
17
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
Example
Results: The 1st dimensions, the first axis (we can say the main component, in
terms of PCA) explains at 66.3% of the inertia (= variance).
The 2nd dimension explains at 22% → the two dimensions together explain at
88.3%.
19
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
Example
… and the coordinates of the points of the COLUMN:
20
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
Example
Points of the rows and columns Preferences
2 Followers
1 At.Bilbao
At.Bilbao
Barça
Dimension 2
-1 At. Madrid
At. Madrid
-2
-2 -1 0 1 2 Dimension 1
21
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
22
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.5. Correspondence Analysis
ADVANTAGES DISADVANTAGES
In comparison with other techniques of The distances between sets
multidimensional scaling, the Correspondence (between the row and the
Analysis reduces the requirements of data
collection implied on the respondents, since
column) cannot be
only binary or categorical data are obtained. interpreted in a significate
way.
The respondents must only revise what
attributes are applied to each of the brands. The The Correspondence
input data are the number of affirmative Analysis is a technique of
answers for each brand in each attribute. exploratory data analysis
The brands and attributes are exposed later in which is not adequate for
the same multidimensional space. hypothesis test.
23
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.6. Conjoint Analysis
25
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.6. Conjoint Analysis
Application in Marketing
26
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
27
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
A REAL situation of
conglomeration
28
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
APPLICATION
Market segmentation: possibility to group customers according to the benefits they look
for in the purchase of a product.
Identify opportunities for new products: by clustering brands or products, it’s possible to
determine competitive sets within the market.
Reduce data: useful to develop subgroups of data that would be easier to handle than
individual observations.
29
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
30
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
Phases
31
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
1
• Plan the problem
2
• Select the measure of distance
3
• Choose clustering procedure
4
• Decide the number of clusters
5
• Interpret and describe the clusters
6
• Evaluate the validity of clusters
32
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
▪ The variables must be chosen on the basis of the previous research, theory
or consideration of the evaluated hypothesis.
▪ The set of chosen variables must describe the similarity among the objects
in relevant terms for the research problem.
• If the important variables are not included in the analysis, the
obtained results will be poor and biased.
• Inclusion of one or more irrelevant variables will distort clustering
solution.
33
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
ID V1 V2
1º 4 3 𝒅 𝟏𝟐 = (𝟒 − 𝟕) 𝟐 +(𝟑 − 𝟕) 𝟐 = 𝟓
2º 7 7
▪ The formula for this distance between a point X (X1, X2, etc.) and a
point Y (Y1, Y2, etc.) is the following:
35
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
Source: pbarrett.net
36
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
Source: pbarrett.net
37
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
38
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
Matrix of distances
𝒅 (𝒓,𝒔) = (𝒙 𝒓𝒊 − 𝒙 𝒔𝒊 ) 𝟐
𝒊=𝟏
39
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
40
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
1 2 3 4 5
1 0
2 0,49 0
3 0,38 0,53 0
4 0,24 0,32 0,75 0
5 0,59 0,67 1,40 0,11 0
41
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
1 2 3 4-5
1 0 Distances are not
2 0,49 0 modified
3 0,38 0,53 0
4-5 0,24 0,32 0,75 0
42
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
1-4-5 2 3
1-4-5 0
2 0,32 0
3 0,38 0,53 0
1-2-4-5 3
1-2-4-5 0
3 0,38 0
43
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
Non-hierarchical
Hierarchical Others
(k-means)
Variance
Sequential Parallel Optimal
Linkage (Ward Centroids
threshold threshold division
technique)
Complete Average
Single linkage
linkage linkage
44
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
Hierarchical Non-hierarchical
45
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
Max.
distance
Min.
distance
Conglomerate 1 Conglomerate 2 Conglomerate 1 Conglomerate 2
Average linkage
Average
distance
Conglomerate 1 Conglomerate 2
46
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
Centroid technique
A distance between two clusters is
the distance between their centroids
(the means of all variables).
Every time the objects are grouped, a
new centroid is computed.
47
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
Dendrogram using Ward technique
Cases
Cases
Source: https://docs.roguewave.com/
49
Market Research. Topic 7: Multivariable Analysis. Part 2. Rescaled distance of the combined clusters
Sou rce: https://docs.roguewave.com/
7.7. Cluster Analysis
3. Choose clustering procedure
Dendrogram using Ward technique
S o u r c e : DOI:
10.1109/AVSS.2014.6918690
50
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
3. Choose clustering procedure
51
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
4. Decide the number of clusters
52
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
5. Interpret and describe clusters
▪ Cluster 1 can have relatively high values in some variables, for example, V1
(go shopping is fun) and V3 (when I go shopping I have lunch outdoors). But
it can also have a low value, like in the variable V5 (I’m not interested in go
shopping). So, this cluster can be labelled as “friendly and interested
shoppers”. (the case of 20 variables used for practice in class)
Centroids of clusters
Metrics of the variables
Cluster Nº
53
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
5. Interpret and describe clusters
▪ Cluster 2 is just the opposite: with low values V1 and V3, and high value in V5,
so, this cluster can be labelled as “apathetic shoppers”. (Cases 2, 5, 9, 11, 13
and 20.)
▪ Cluster 3 has high values in V2 (shopping unbalances my budget), V4 (I try to
find the best offers when I go shopping) and V6 (one can save much money if
you compare prices) → “thrifty shoppers”. (Cases 4, 10, 14, 16, 18 and 19.)
Centroids of clusters
Metrics of the variables
Cluster Nº
54
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
▪ It’s useful to describe clusters in terms of variables that were NOT used by
grouping. This includes demographic and psychographic variables and
those of the use of the product or use of the communication means, etc.
▪ For example, the clusters may be derived on the basis of the searched
profits. It’s possible to provide a more detailed description in terms of the
demographic and psychographic variables to address marketing efforts
towards each cluster.
▪ Discriminant analysis and analysis of variance of one factor allow to
distinguish clearly among the clusters.
55
Market Research. Topic 7: Multivariable Analysis. Part 2.
7.7. Cluster Analysis
6. Evaluate the validity of clusters
• Carry out cluster analysis with the same data but with different distance metrics and
1 compare the results to determine stability of solutions.
• In non-hierarchic cluster the solution may depend on the sequence of the cases
5 in the set of data.
57