75% found this document useful (4 votes)

2K views19 pages

Assignment 2

This document discusses segmenting consumers of bath soap using cluster analysis in SAS. It explores segmenting consumers based on demographics, purchase behavior, purchase basis, and a combination of purchase behavior and basis. For each segmentation method, it provides visualizations of the cluster sizes, variable importance, cluster means, and segment profiles. The best segmentation is identified as using both purchase behavior and basis variables, as this combination provides the most informative segmentation of consumers while meeting the business objective of having 5 or fewer segments.

Uploaded by

kuldeep_das_2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

75% found this document useful (4 votes)

2K views19 pages

Assignment 2

Uploaded by

kuldeep_das_2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 19

Segmenting consumers on bath soap

BIDM Assignment 2

Section B, Group 7 Kuldeep Das PGP26282 Nitul Das PGP26105 Amit Roykaran PGP26196

Contents
Introduction ............................................................................................................................................ 3 Understanding the business problem & objectives ................................................................................ 3 Business Objectives............................................................................................................................. 3 Data mining objectives........................................................................................................................ 3 Data preparation ( Done in excel file) ..................................................................................................... 4 Clustering Analysis Using SAS ................................................................................................................. 5 Clustering based on Demographics .................................................................................................... 6 Clustering based on purchase behaviour............................................................................................ 9 Clustering based on Purchase Basis .................................................................................................. 10 Clustering based on Purchase behaviour + Purchase basis .............................................................. 15 Question 2 ............................................................................................................................................. 17

ASSIGNMENT2 - BIDM
Introduction
CRISA is an Asian market research agency that specializes in tracking consumer purchase behaviour in consumer goods. CRISA has recorded the data of household consumption pattern. The households were selected using stratified sampling techniques. The data captured by CRISA contains the following information: Demographics of the households (updated annually) Possession of durable goods: This data is used to calculate the affluence index Purchase data of product categories and brands (updated monthly)

In this project, we have used k-means clustering to identify clusters based on parameters such as: Purchase behaviour (volume, frequency, susceptibly to discounts, and brand loyalty) Basis of purchase (price, selling proposition)

And then we have combined the above variables to find segmentation based on both purchase behaviour and Basis of purchase.

Understanding the business problem & objectives

Business Objectives
The data needs to be analyzed by segmenting the variables into various clusters based on criterion other than demographics. The customers display different levels of brand loyalty based on the price, choice criteria, promotions, affluence, social & economic status etc. If we can segment the customers based on certain important variables as given in the data set, we can target them more specifically by providing customized branding and promotional campaigns. Hence, the business objective is to form segments of customers that shows similar purchase behaviour and are affected similarly by any kind of selling proposition or promotional campaigns so that segments can be targeted in particular for branding and promotional activities.

Data mining objectives

To divide the variables into clusters or segments based on: Purchase behaviour (volume, frequency, susceptibly to discounts, and brand loyalty) Basis of purchase (price, selling proposition) Variables that describe both purchase behaviour and basis of purchase

To find the best segmentation of these clusters using demographic variables also in combination with the above variables. There is an upper cap on the number of clusters due to the number of promotional campaigns that can be run which is 5. Hence, an ideal clustering should not exceed more than 5 clusters.

Data preparation ( Done in excel file)

Note:- The transformed columns have been highlighted in red in the excel file. The given data has many missing values or values that do not represent any particular category. Hence, imputation of the values were done a) Imputing missing values - Many sex variables are 0, converted them to female - FEH = 1 assuming major population is vegetarian - MT=5 (Hindi speaking) - Number of people in the household is 5 - EDU = 5 (12th standard as it is the majority value in the data) - CS = 1 (majority of the data points have this value) b) Derivation for Brand Loyalty Index The brand loyalty index is a measure of 3 criteria (ceteris paribus, the volume of transactions) No. Of brands Brand Runs Volume of purchases attributed to each brand

Each of these criteria is normalized (between 0 to 1) so as to remove the bias of higher numeric values for a given criteria. a) No. Of brands As the number of brands increases, the probability of switching between the brands increases, hence the lower the number of brands its better. Hence we assign a lower score to rows which have low number of brands thus indicating a better brand loyalty. b) Brand Runs The lower the number of brand runs, the better it is. A higher number of brand runs increases the probability of having brand runs for multiple brands, therefore indicating a higher switching behaviour. Hence we assign a lower score to rows which have lower number of brand runs. c) Volume of purchases attributed to each brand The higher the purchase for a given brand, the better it is and hence we attribute a lower score to this parameter. The way we have worked out the score for this criteria is that We find the max % volume attributed to any one of the given brands From the given table below we assign the score to this variable (Note that the score increases as the % volume decreases, this is to ensure that we get a lower score for the brand loyalty index in consistent with the other 2 criterias) Score 0.0

% volume of purchase to a given brand 100%

90% 80% 70% 60% 50% 40% 30% 20% 10% 0%

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0

The final score for brand loyalty index is therefore a linear combination of the three criteria mentioned above with different weights assigned to indicate relative importance. Volume of purchase is given low importance. This is illustrated by the example below. A customer might buy a brand less number of times, but the times he buy a brand, purchases in bulk quantities, in this case he is less loyal than a person, who visits to buy a brand more number of times, but buys in less quantities. Brand Loyalty Index = 0.4 * No. Of brands_score + 0.4 * Brand Run_score + 0.2 * Volume of purchase attributed to a given brand_score The lower the brand loyalty index, the better it is.

Clustering Analysis Using SAS

The process flow diagram is shown below

Figure 1. SAS Flow diagram

Clustering based on Demographics

All the demographic variables were used in clustering. In all there are 10 variables. Below diagrams show the cluster and segment plot.

Figure 2: Segment and Cluster Plot, Demographic clustering

Figure 3: Variable Importance, Demographic clustering As we can see, that Affluence_Index is the most important variable among the demographic variables

Figure 4: Mean statistics of the generated clusters, Demographic clustering

Figure 5: Segment Plot of the generated clusters, Demographic clustering

Figure 6: Segment Profile of the generated clusters, Demographic clustering Segment 1 2 3 4 5 Comment on Affluence Index Little less than average Very low Very high Little higher than average Average

Clustering based on purchase behaviour

Figure 7: Segment size, clustering based on purchase behaviour

Figure 8: Variable importance, clustering based on purchase behaviour As we can see, Total Volume, No_of_trans, and Brand Loyal are the important variables.

Figure 9: Segment plot, clustering based on purchase behaviour

Figure 10: Mean statistics of generated clusters, Purchase behaviour

Figure 11: Segment profile, Purchase Behaviour

Clustering based on Purchase Basis

Since there are many variables included in Price Category (Pr_Cat_1 to Pr_Cat_4) and Selling proposition (PropCat_5 to PropCat_15) , when we run the clustering tool, we find a large number of

clusters (> 15). Hence we manually limit the number of clusters to 4, 5&6 and then come to the conclusion that the best cluster is 5. (Below diagrams illustrate that cluster size 5 gives the best distribution)

Figure 12: Cluster proximities, Cluster size 5

Figure 13: Cluster proximities, Cluster size 4

Figure 14: Cluster proximities, Cluster size 6

We proceed with Cluster size 5

Figure 15: Segment size plot, Purchase Basis

Figure 16: Variable importance, Purchase Basis As we can see, that Pr_Cat_2 is the most important variable.

Figure 17: Mean statistics, Purchase Basis

Figure 18: Segment plot, Purchase Basis

Figure 19: Segment profile, Purchase Basis Segment 1 2 3 4 5 Comment on Pr_Cat_2 variable distribution in the cluster Lowest among the all Less than average Significantly higher than average Less than average Higher than average

Clustering based on Purchase behaviour + Purchase basis

Figure 20: Segment size, both purchase basis + purchase behaviour

Figure 21: Cluster proximities, both purchase basis + purchase behaviour

Figure 22: Variable importance, both purchase basis + purchase behaviour

Figure 23: Segment plot, both purchase basis + purchase behaviour

Figure 24: Mean Statistics, both purchase basis + purchase behaviour

Figure 25: Segment profile, both purchase basis + purchase behaviour As it can be seen, that purchase behaviour variables dominate more than the purchase basis variables from the variables importance table.

Question 2
To identify the best segmentation basis out of the 3 profiles (purchase behavior, basis for purchase, both basis for purchase and purchase behavior) we have to see the distance between the clusters. The following plots shows the distance between the clusters in the 3 profiles used:

Basis of Purchase

Demographic

Both basis of purchase and purchase behavior Hence, we can see that a combination of both basis for purchase and purchase behavior gives the highest degree of separation between the clusters and hence is the best segmentation criteria. Based on the segment profile of this segmentation basis, we can say that the segments have the following membership Segment 1 2 3 4 5 Key Characteristics Less than average volume purchase, Least brand loyalty, Less than average price category 1 More than average volume purchase, Less than average brand loyalty, More than average price category 1 Average Volume purchase, Average brand loyalty, Average price category 1 Lowest Volume purchase, Highest brand loyalty, Highest price category 1 Highest Volume purchase, More than average brand loyalty, Lowest price category 1

Sing Sing Sing - FULL Big Band - Andrew Sisters
100% (1)
Sing Sing Sing - FULL Big Band - Andrew Sisters
51 pages
Capstone Chapter 9 Case Problem Grey Code Corporation SBA 1 2
No ratings yet
Capstone Chapter 9 Case Problem Grey Code Corporation SBA 1 2
10 pages
Instructions AKS134 Date 2008 02 EN PDF
No ratings yet
Instructions AKS134 Date 2008 02 EN PDF
101 pages
Business Statistics: Correlation Study Alumni Giving Case
No ratings yet
Business Statistics: Correlation Study Alumni Giving Case
4 pages
LPP Sensitivity Report
No ratings yet
LPP Sensitivity Report
7 pages
Marketing Assignment
No ratings yet
Marketing Assignment
14 pages
QT Presentation
No ratings yet
QT Presentation
16 pages
Group Assignment: Points:10 Uday Singh'S Changed Travel Plan
50% (2)
Group Assignment: Points:10 Uday Singh'S Changed Travel Plan
2 pages
Chapter 5. Solution To End-of-Chapter Comprehensive/Spreadsheet Problem
No ratings yet
Chapter 5. Solution To End-of-Chapter Comprehensive/Spreadsheet Problem
11 pages
02 10-Motherboards PDF
No ratings yet
02 10-Motherboards PDF
19 pages
Case Questions
75% (4)
Case Questions
2 pages
FRA Class Notes
100% (1)
FRA Class Notes
16 pages
Tanaya and Akansha Major Project
No ratings yet
Tanaya and Akansha Major Project
74 pages
Logisticskarnataka Engineering Case PDF Free
No ratings yet
Logisticskarnataka Engineering Case PDF Free
9 pages
What Do You Think Hilton Leadership Should Do After The Blackstone Acquisition? Should They Further Invest in CRM or Simply Maintain The Status Quo?
No ratings yet
What Do You Think Hilton Leadership Should Do After The Blackstone Acquisition? Should They Further Invest in CRM or Simply Maintain The Status Quo?
1 page
Faculty PPT-Customer Life Time Value Analytics PDF
No ratings yet
Faculty PPT-Customer Life Time Value Analytics PDF
40 pages
Exercises On Asset Analysis
100% (1)
Exercises On Asset Analysis
2 pages
2015 10 10
No ratings yet
2015 10 10
19 pages
Assignment 1 - Main
No ratings yet
Assignment 1 - Main
9 pages
Marketing Strategy Instructional Manual Version 2 - 4
No ratings yet
Marketing Strategy Instructional Manual Version 2 - 4
17 pages
Final Exam 2023 Corporate Valuation
No ratings yet
Final Exam 2023 Corporate Valuation
5 pages
VAR Package Pricing at Mission Hospital
No ratings yet
VAR Package Pricing at Mission Hospital
6 pages
Cost Sheet Practice
No ratings yet
Cost Sheet Practice
1 page
Operations Research-Sec D
No ratings yet
Operations Research-Sec D
5 pages
Scalene Works-HR Analytics
0% (1)
Scalene Works-HR Analytics
10 pages
Cadm Pre Mid Term 2016 - Soln
No ratings yet
Cadm Pre Mid Term 2016 - Soln
5 pages
Forecasting Mini Cases
100% (1)
Forecasting Mini Cases
2 pages
CH 032
No ratings yet
CH 032
57 pages
Project Report Adv Stat V1.0
No ratings yet
Project Report Adv Stat V1.0
5 pages
Supporting Sheet - UV21001
No ratings yet
Supporting Sheet - UV21001
15 pages
Model Solution - Assessment 2
100% (2)
Model Solution - Assessment 2
12 pages
EVALUATION OF BANK OF MAHARASTRA-Devanshu
100% (2)
EVALUATION OF BANK OF MAHARASTRA-Devanshu
7 pages
EPGP 07 BPP Hand Book Final
No ratings yet
EPGP 07 BPP Hand Book Final
12 pages
Planning Public Facilities in Airports: Quantitative Methods
No ratings yet
Planning Public Facilities in Airports: Quantitative Methods
11 pages
4587 2261 10 1487 54 Budgeting
No ratings yet
4587 2261 10 1487 54 Budgeting
46 pages
Statistical Methods For Decision Making
100% (1)
Statistical Methods For Decision Making
15 pages
Mayank M - B75 - C0-RNo20 - QM - Assign01
No ratings yet
Mayank M - B75 - C0-RNo20 - QM - Assign01
16 pages
This Study Resource Was: Supply Chain Management
No ratings yet
This Study Resource Was: Supply Chain Management
4 pages
FinValley 5.0 Case Study
No ratings yet
FinValley 5.0 Case Study
3 pages
Round: 0 Dec. 31, 2017 Andrews Baldwin Chester
No ratings yet
Round: 0 Dec. 31, 2017 Andrews Baldwin Chester
19 pages
Chapter 20
No ratings yet
Chapter 20
93 pages
Linear Programming
100% (1)
Linear Programming
6 pages
Final Assignment BUS-202: Independent University Bangladesh (IUB) Assignment On
No ratings yet
Final Assignment BUS-202: Independent University Bangladesh (IUB) Assignment On
1 page
DS II Mid Term 2017 Solution
No ratings yet
DS II Mid Term 2017 Solution
20 pages
Moderating Effect of The Relationship Between Private Label Share and Store Loyalty PDF
No ratings yet
Moderating Effect of The Relationship Between Private Label Share and Store Loyalty PDF
15 pages
Vayutel Case Study
No ratings yet
Vayutel Case Study
10 pages
Fundamental and Means Objective
No ratings yet
Fundamental and Means Objective
3 pages
Karnataka Engineering Company Limited (KECL)
No ratings yet
Karnataka Engineering Company Limited (KECL)
13 pages
R Code Default Data PDF
No ratings yet
R Code Default Data PDF
10 pages
Assignment 2.1
No ratings yet
Assignment 2.1
19 pages
Final Wacc Assignment
No ratings yet
Final Wacc Assignment
10 pages
PBM SPSS
No ratings yet
PBM SPSS
7 pages
Effect of Demonetisation On IS-LM Curve.: Money Market Equilibrium
No ratings yet
Effect of Demonetisation On IS-LM Curve.: Money Market Equilibrium
5 pages
Bookbinders Case 2
0% (3)
Bookbinders Case 2
6 pages
Round: 2 Dec. 31, 2022: Selected Financial Statistics
No ratings yet
Round: 2 Dec. 31, 2022: Selected Financial Statistics
15 pages
Chi Forest-Case Study
No ratings yet
Chi Forest-Case Study
15 pages
Case Study:Acceptable Pins
100% (1)
Case Study:Acceptable Pins
31 pages
SAPM Formulas
No ratings yet
SAPM Formulas
1 page
Synd - 7 Business Eco EMBA 59A Decision Time at The Aromatic Coffee Co
No ratings yet
Synd - 7 Business Eco EMBA 59A Decision Time at The Aromatic Coffee Co
18 pages
Course Material BM QT 2019 PDF
No ratings yet
Course Material BM QT 2019 PDF
44 pages
2016-BIDM Assignment No2. and 3
No ratings yet
2016-BIDM Assignment No2. and 3
2 pages
Projects PDF
No ratings yet
Projects PDF
12 pages
SGOS RelNotes 6.2.2
No ratings yet
SGOS RelNotes 6.2.2
48 pages
OBD II System Monitors
No ratings yet
OBD II System Monitors
22 pages
Lesson 1: Network Architecture Standard: Network Components and Terminology
No ratings yet
Lesson 1: Network Architecture Standard: Network Components and Terminology
12 pages
JSA CTU Sand Clean Out BPP
No ratings yet
JSA CTU Sand Clean Out BPP
8 pages
S26MC - MK - 6 - Project - Guide M20 M25 M32 M43 M281-332C M451-453 M551-552 M601C PDF
100% (2)
S26MC - MK - 6 - Project - Guide M20 M25 M32 M43 M281-332C M451-453 M551-552 M601C PDF
241 pages
Getting Started: 1.2 Expressions and Assignment Statement
No ratings yet
Getting Started: 1.2 Expressions and Assignment Statement
29 pages
GP Raw
No ratings yet
GP Raw
20 pages
Guidelines For Entrapment Hazards: Making Pools and Spas Safer
No ratings yet
Guidelines For Entrapment Hazards: Making Pools and Spas Safer
40 pages
HTML Web
No ratings yet
HTML Web
5 pages
RAM Plan
No ratings yet
RAM Plan
3 pages
2930 IT Specialist Closing Meeting Agenda - QCC
No ratings yet
2930 IT Specialist Closing Meeting Agenda - QCC
2 pages
Dulce Battle PDF Tunnel Alien Abduction
No ratings yet
Dulce Battle PDF Tunnel Alien Abduction
191 pages
Aircraft Movement On Ground
100% (2)
Aircraft Movement On Ground
69 pages
IECEx SIR 06.0054 004
No ratings yet
IECEx SIR 06.0054 004
5 pages
I 9403 DS1124776 1
No ratings yet
I 9403 DS1124776 1
2 pages
Wojo Space Offices New Configuration Ibis Top-Présentation1
No ratings yet
Wojo Space Offices New Configuration Ibis Top-Présentation1
1 page
CSS Image Opacity - Transparency
No ratings yet
CSS Image Opacity - Transparency
3 pages
Lecture Computer Codes
No ratings yet
Lecture Computer Codes
87 pages
Jordan MK708 Bulletin
100% (1)
Jordan MK708 Bulletin
28 pages
Power Quality Instruments (PQI) An Over View by Schneider
No ratings yet
Power Quality Instruments (PQI) An Over View by Schneider
8 pages
The Unhoneymooners _ Christina Lauren _ download on Z-Library
No ratings yet
The Unhoneymooners _ Christina Lauren _ download on Z-Library
4 pages
Design Standards No 14 Chapter 1
100% (1)
Design Standards No 14 Chapter 1
53 pages
Peugeot 607 Owners Manual 2003
67% (3)
Peugeot 607 Owners Manual 2003
98 pages
New Gear Locking Design in Synchromesh Gearbox Which Reduces Gear Shift Effort
No ratings yet
New Gear Locking Design in Synchromesh Gearbox Which Reduces Gear Shift Effort
8 pages
Reflection 3 Needs Assessment
No ratings yet
Reflection 3 Needs Assessment
3 pages
CME Corlist
No ratings yet
CME Corlist
11 pages
DRUM ROOM ENG
No ratings yet
DRUM ROOM ENG
16 pages

Assignment 2

Uploaded by

Assignment 2

Uploaded by

Segmenting consumers on bath soap

Understanding the business problem & objectives

Data mining objectives

Data preparation ( Done in excel file)

% volume of purchase to a given brand 100%

90% 80% 70% 60% 50% 40% 30% 20% 10% 0%

Clustering Analysis Using SAS

Figure 1. SAS Flow diagram

Clustering based on Demographics

Figure 2: Segment and Cluster Plot, Demographic clustering

Figure 4: Mean statistics of the generated clusters, Demographic clustering

Figure 5: Segment Plot of the generated clusters, Demographic clustering

Clustering based on purchase behaviour

Figure 7: Segment size, clustering based on purchase behaviour

Figure 9: Segment plot, clustering based on purchase behaviour

Figure 10: Mean statistics of generated clusters, Purchase behaviour

Figure 11: Segment profile, Purchase Behaviour

Clustering based on Purchase Basis

Figure 12: Cluster proximities, Cluster size 5

Figure 13: Cluster proximities, Cluster size 4

Figure 14: Cluster proximities, Cluster size 6

We proceed with Cluster size 5

Figure 15: Segment size plot, Purchase Basis

Figure 17: Mean statistics, Purchase Basis

Figure 18: Segment plot, Purchase Basis

Clustering based on Purchase behaviour + Purchase basis

Figure 20: Segment size, both purchase basis + purchase behaviour

Figure 21: Cluster proximities, both purchase basis + purchase behaviour

Figure 22: Variable importance, both purchase basis + purchase behaviour

Figure 23: Segment plot, both purchase basis + purchase behaviour

Figure 24: Mean Statistics, both purchase basis + purchase behaviour

You might also like