Segmenting Stores Using Clustering
Authors:
Nitin Kalé, University of Southern California
Nancy Jones, San Diego State University

Revised:
Liz Simmons, July 2022

OBJECTIVE
The objective of this exercise is to segment retail stores based on various attributes to help with sales
promotions.

ACTIVITIES
• Import and prepare data.
• Apply Smart Grouping cluster analysis.
• Merge data.
• Create data visualizations.
• Analyze and interpret output from models.

SOFTWARE PREREQUISITES
• SAP Analytics Cloud
• Microsoft Excel

DATA SET
Data file titled Stores.csv

Scenario
The Country Manager of a retail chain (which has 150 stores) is finalizing plans for three sales
promotion strategies. Data pertaining to the stores such as store location, sales turnover, store
size, staff, and profit margin are stored in a CSV file. The manager wants to segment the 150
stores into three different groups based on sales turnover, profit margin, store size, and staff
size so specific strategies can be applied to each store segment. You will use clustering of retail
stores data to assist the manager in developing promotion strategies.

Cluster Analysis
Given a dataset, organizing it into meaningful groups is a basic and useful approach to data
mining and data analysis. Clustering classifies samples into groups using a measure of
association so that data points within a group are similar to one another and data points from
different groups are not. Data points are multidimensional; that is, they consist of several
variables. Visualization is not practical for humans when datasets have more than three dimensions.
The input to a clustering exercise is a dataset and the number of clusters. The result of the
analysis is a set of clusters. K-means clustering is a method of finding clusters and their
centers (R) given a choice of the number of clusters (K). It is often used for market
segmentation. The goal is to make the inter-cluster difference (distance) high and the
intra-cluster difference (distance) low.
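
To make the idea concrete, here is a minimal sketch of plain k-means in Python. It is not how SAC implements Smart Grouping; the function name, the choice of the first k points as starting centers, and the data values are all assumptions made for illustration.

```python
import math

def kmeans(points, k, max_iterations=100):
    """Return (centers, labels) after running plain k-means on a list of tuples."""
    # Assumption: start from the first k points so the run is deterministic.
    centers = list(points[:k])
    labels = None
    for _ in range(max_iterations):
        # Assignment step: attach each point to its nearest center.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centers[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the clusters are stable
        labels = new_labels
        # Update step: move each center to the mean of its assigned points.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster ends up empty
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centers, labels

# Made-up two-dimensional points (not taken from Stores.csv).
points = [
    (1.0, 1.2), (5.0, 5.1), (9.0, 1.0),
    (1.1, 0.9), (5.2, 4.9), (9.2, 1.1),
    (0.8, 1.0), (4.8, 5.0), (8.9, 0.8),
]
centers, labels = kmeans(points, 3)
print(labels)
print(centers)
```

Each pass lowers the intra-cluster distance by reassigning points and recomputing centers. The result of k-means generally depends on the starting centers, which is why practical tools often pick them randomly or run several starts.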

1. Visualize the Store Data

1. In SAP Analytics Cloud (SAC), select Stories → Create New → Canvas.
2. Add data → Data uploaded from file.
3. Select Source File and choose the Stores.csv file provided to you, Open.
a. Use first row as column headers should be selected and the CSV Delimiter
should be set to Auto-detect.
b. Import.
c. You will be directed to the Data view. You should have 150 rows of data: four
Measures (Profit Margin, Sales Turnover, Staff Size and Store Size) and one
Dimension (Store) in the dataset.
4. Select the Story view. You will now create a visualization of the relationships among
the variables of the data set as a first step to helping the Country Manager understand
the dynamics of each of the stores in her area of responsibility.

a. Insert Chart.
b. Select Bubble Chart from the Correlation charts.
c. Configure the Chart Structure as follows:
(1) + Add Sales Turnover to the X-Axis.
(2) + Add Staff Size to the Y-Axis.
(3) + Add Profit Margin to Size.
(4) + Add Store to the Dimensions.
(5) + Add a Tooltip Measure as shown in Figure 1. You can find Add Tooltip after
clicking the three dots icon next to Chart Structure.

Figure 1: Adding a Tooltip

(6) Tooltip Measures will now show as a Chart Structure option. + Add Store Size
to Tooltip Measures.
(7) You will now see a Bubble chart of the first three measures by Store.

Figure 2: A Bubble Chart of Store Data

2. Creating the Cluster Analysis

You may hover over any of the bubbles of the Bubble chart to get more information about the
data point. You may also filter to stores of interest. However, I think you will agree this chart is
of limited usefulness. Let’s group similar stores using k-means clustering. In SAC,
clustering is done using Smart Grouping.
1. Toggle on Smart Grouping (near the bottom of the Builder panel).
2. Change the Number of Groups to 3 (3 is k in the k-means algorithm).
3. Change the Group Label to “Cluster” just to be consistent with your understanding of
cluster analysis.
4. Select Include Tooltip Measures in grouping so all four Measures are considered in
the cluster analysis.

Figure 3: Configure Smart Grouping

5. The clusters in the default monochromatic color scheme tend to blend together, so you
may want to change the Color palette. You should now see three distinct groups
(clusters) in your chart. You can filter on the clusters by clicking the cluster number you
wish to examine.

Question 1: Add your name to the title of the clustered Bubble chart and
submit a screenshot of the chart.

3. Visualization and Interpretation

The results of the grouping (clustering) can be further analyzed by associating the cluster
numbers with the data in the Stores data “model”. (The original Stores.csv file is stored as a
private or embedded “model” within your SAC story.)
Each cluster may be analyzed individually. That means you can create visualizations for the
data filtered by cluster. However, since the manager wants to compare the clusters of stores
with one another, it will be useful to create a new data set that includes all three clusters
together. To do this, you will export the data from each of the three clusters to a spreadsheet
and add a cluster identifier.
1. Merging data sets.
a. Filter to Cluster 1 by clicking on the Legend of the Bubble chart you created. Refer
to Figure 4.

Figure 4: Filter the Cluster

(1) Notice that SAC Smart Grouping will continue to break down the filtered data set
into even smaller clusters. You can ignore these new groups.
(2) Select Export from the chart dropdown list.

Figure 5: Export the Clustered Data

(3) Name the .csv file “Cluster_1”. The data from Cluster 1 will be downloaded to
your computer.
b. Repeat these steps for Clusters 2 and 3 and name the files “Cluster_2”
and “Cluster_3” respectively.
NOTE: Be sure to remove the chart filter (click the X to the right of 1 Filter in the
header) and replace it with the next cluster number before downloading the data.
You should have three downloaded .csv files.
c. Now you will prepare the cluster data for integration with the Stores data model in
SAC.
(1) The first step is to clean up the header information so it is only one row. Open the
Cluster_1.csv file.
(i) Move content of cells B1:D1 to cells B2:D2.
(ii) Delete row 1.
(2) Next add a column called “Cluster”.
(i) Add the cluster number to all the rows of data.
(3) Save the .csv file.

(4) You can see the results of your clean up in the following before and after Figures:

Figure 6: The .csv file from SAC

Figure 7: The .csv file after "Wrangling"

d. Repeat these steps for Clusters 2 and 3.
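
If you would rather script the clean-up than do it by hand in Excel, the same steps can be sketched in Python. The export layout below is an assumption based on the instructions above (measure names in row 1, columns B to D, and the dimension name in row 2, column A); the store names and numbers are made up, and `tidy_cluster_export` is a hypothetical helper name.

```python
import csv
import io

# Hypothetical stand-in for an exported Cluster_1.csv with a two-row header.
raw_export = (
    ",Sales Turnover,Staff Size,Profit Margin\n"
    "Store,,,\n"
    "Store 001,120,14,0.21\n"
    "Store 002,135,16,0.19\n"
)

def tidy_cluster_export(text, cluster_number):
    """Collapse the two header rows into one and append a Cluster column."""
    rows = list(csv.reader(io.StringIO(text)))
    top, second, data = rows[0], rows[1], rows[2:]
    # Keep the row-2 label where present, otherwise take the label from row 1.
    header = [low or high for high, low in zip(top, second)]
    header.append("Cluster")
    # Add the cluster number to every data row.
    return [header] + [row + [str(cluster_number)] for row in data]

tidy_rows = tidy_cluster_export(raw_export, 1)

# Write the cleaned rows back out as CSV text.
out = io.StringIO()
csv.writer(out, lineterminator="\n").writerows(tidy_rows)
tidy_csv = out.getvalue()
print(tidy_csv)
```

To use this on real files, read the text with `open("Cluster_1.csv", newline="")` instead of the in-memory string, and check the header of your own export first, since its layout may differ from the one assumed here.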

e. To merge the cluster data with the Stores data, do the following:
(1) Go to Data view, Grid mode.
(2) Use the dropdown menu next to Stores to select + Add New Data.

Figure 8: Adding Data to the Analysis

(3) Select Data uploaded from a file.

(4) Select Source Cluster_1.csv.
(i) Use first row as header should be selected.
(ii) Import.

(5) On the Save dropdown select Open With Basic Data Preparation. This
will allow you to append the files for clusters 2 and 3.

Figure 9: Open with Basic Data Preparation

(6) Select Reimport Data from the Data ribbon.

Figure 10: Reimport Data

(7) Select Cluster_2.csv.

(8) When you see the following screen, select Append.

Figure 11: Append a File

(9) Finish.
(10) Repeat the append for Cluster_3.csv.
(i) Now look at the data in the Clusters data set and you should find stores in all
three clusters and 150 rows.
f. Save.
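
Appending simply stacks the rows of files that share the same header. The following sketch, which uses shortened made-up file contents and a hypothetical `append_files` helper, shows the operation along with the sanity checks suggested above (all three clusters present, no store listed twice).

```python
import csv
import io

# Hypothetical, shortened contents of the three cleaned files.
cluster_files = {
    "Cluster_1.csv": "Store,Sales Turnover,Cluster\nStore 001,120,1\nStore 002,135,1\n",
    "Cluster_2.csv": "Store,Sales Turnover,Cluster\nStore 003,310,2\n",
    "Cluster_3.csv": "Store,Sales Turnover,Cluster\nStore 004,545,3\nStore 005,560,3\n",
}

def append_files(files):
    """Stack the data rows of several CSV texts that share one header."""
    header, combined = None, []
    for name, text in files.items():
        rows = list(csv.reader(io.StringIO(text)))
        if header is None:
            header = rows[0]
        elif rows[0] != header:
            raise ValueError(f"{name} has a different header: {rows[0]}")
        combined.extend(rows[1:])  # skip each file's header row
    return header, combined

header, combined = append_files(cluster_files)

# Sanity checks: every cluster appears and no store is duplicated.
clusters_present = sorted({row[header.index("Cluster")] for row in combined})
store_names = [row[header.index("Store")] for row in combined]
print(len(combined), clusters_present)
```

With the real exports, the combined row count should come to 150, matching the number of stores.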
2. To visualize the Stores and Clusters data, go to the Story view.
a. Add a new page with either a Canvas or a Responsive page.
b. Add a chart.
c. Add a Calculated Measure for Count of Stores as shown below:

Figure 12: Count of Stores

d. In Builder, select the Link Dimensions icon to the right of Data Source.
e. You now need to choose the matching dimensions from each data set. In your case,
the “Stores” dimension in the Stores data matches the “Stores” dimension in the
Cluster data.
f. Click the More dots to the right of Dimension to select Data Samples > ID to see
samples of the linked values. The Link Dimension settings are shown below. Then
click Set, and then Done.

Figure 13: Link Dimensions

g. Leave the chart as a Column chart. To add variables to the chart, you will now have a
choice of which data set you would like to use. You will see them as a drop down
when you add a Measure or Dimension. SAC calls this a blended data chart.
(1) Add Count of Stores from the Stores data set to Measures.
(2) Add Cluster from the Clusters data set to Dimensions.
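
Conceptually, a blended chart joins the two data sets on the linked dimension and then aggregates. Here is a rough Python sketch of that idea, assuming the link is on the store name; the rows are made up and do not reflect the real Stores.csv, and SAC's own blending logic may differ in detail.

```python
from collections import Counter

# Hypothetical rows standing in for the Stores and Clusters data sets.
stores = [
    {"Store": "Store 001", "Sales Turnover": 120},
    {"Store": "Store 002", "Sales Turnover": 135},
    {"Store": "Store 003", "Sales Turnover": 128},
    {"Store": "Store 004", "Sales Turnover": 310},
    {"Store": "Store 005", "Sales Turnover": 545},
    {"Store": "Store 006", "Sales Turnover": 560},
]
clusters = [
    {"Store": "Store 001", "Cluster": "1"},
    {"Store": "Store 002", "Cluster": "1"},
    {"Store": "Store 003", "Cluster": "1"},
    {"Store": "Store 004", "Cluster": "2"},
    {"Store": "Store 005", "Cluster": "3"},
    {"Store": "Store 006", "Cluster": "3"},
]

# Link on the shared store name, then count stores per cluster.
cluster_by_store = {row["Store"]: row["Cluster"] for row in clusters}
store_counts = Counter(
    cluster_by_store[row["Store"]]
    for row in stores
    if row["Store"] in cluster_by_store  # unmatched stores are left out
)
largest_cluster, largest_count = store_counts.most_common(1)[0]
print(store_counts)
```

If the store names in the two data sets do not match exactly (for example, because of extra spaces), rows drop out of the link, so it is worth confirming that the counts add up to 150.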

Question 2: Which Cluster has the highest number of stores?
Support your answer with a screenshot.

3. Create visualizations to answer the following questions:

Question 3: Provide the name of one store in each cluster. Include a
screenshot of the store name from within each cluster.
(Hover over the cluster circles to see the data they represent.)

Question 4: How do Average Profit Margin, Average Sales Turnover, and
Average Staff Size compare among the clusters?
Support your answers with a screenshot.
Hint: You will need to create Calculated Measures to determine
the averages.
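
As a cross-check outside SAC, per-cluster averages amount to grouping rows by cluster and taking the mean of each measure. The sketch below uses invented numbers purely to show the calculation; it does not represent the actual results for Stores.csv.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical blended rows (one per store); all values are made up.
blended = [
    {"Cluster": "1", "Profit Margin": 0.20, "Sales Turnover": 120, "Staff Size": 14},
    {"Cluster": "1", "Profit Margin": 0.22, "Sales Turnover": 140, "Staff Size": 16},
    {"Cluster": "2", "Profit Margin": 0.30, "Sales Turnover": 310, "Staff Size": 30},
    {"Cluster": "3", "Profit Margin": 0.10, "Sales Turnover": 540, "Staff Size": 50},
    {"Cluster": "3", "Profit Margin": 0.14, "Sales Turnover": 560, "Staff Size": 54},
]
measures = ["Profit Margin", "Sales Turnover", "Staff Size"]

# Group the rows by cluster.
by_cluster = defaultdict(list)
for row in blended:
    by_cluster[row["Cluster"]].append(row)

# Average each measure within each cluster.
averages = {
    cluster: {m: mean(r[m] for r in rows) for m in measures}
    for cluster, rows in by_cluster.items()
}
for cluster in sorted(averages):
    print(cluster, averages[cluster])
```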

Challenge Activity 1 (Optional, Not Graded)

Choose one cluster to analyze further using visualizations. Provide a detailed
description/analysis of the stores within the cluster you have chosen. Based on what you see
in this cluster, what kind of marketing strategy would you recommend to improve sales for the
stores in the cluster?

Challenge Activity 2 (Optional, Not Graded)

Design tables and/or visualizations to determine the density of each of the three clusters.
Hint: You may want to create some calculated measures for measures of dispersion or
you could also use the variance tool.
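
One possible measure of density, offered here only as an illustration, is the average distance from each store to its cluster's centroid: the smaller the value, the tighter the cluster. The points below are made up, and `mean_distance_to_centroid` is a hypothetical helper, not an SAC function.

```python
import math
from statistics import mean

# Hypothetical two-measure points per cluster; all values are made up.
cluster_points = {
    "1": [(1.0, 1.0), (1.0, 3.0)],
    "2": [(5.0, 5.0), (5.0, 9.0), (5.0, 1.0)],
    "3": [(9.0, 2.0), (10.0, 2.0)],
}

def mean_distance_to_centroid(points):
    """Average Euclidean distance from each point to the cluster centroid."""
    centroid = tuple(mean(col) for col in zip(*points))
    return mean(math.dist(p, centroid) for p in points)

spread = {c: mean_distance_to_centroid(pts) for c, pts in cluster_points.items()}
tightest = min(spread, key=spread.get)  # smallest spread = densest cluster
print(spread, tightest)
```

Because Euclidean distance is dominated by whichever measure has the largest numeric range, measures on very different scales would need to be rescaled before a comparison like this is meaningful.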
