Binder 3

The document discusses using a hierarchical clustering method and k-means clustering to define the optimal number of clusters in a dataset. It describes using the elbow method by plotting within-group sum of squares against the number of clusters and selecting the elbow point. It also discusses validating the cluster analysis by examining the impact of initial seeds, the selected method, and relevant variables. An SPSS example is provided to illustrate defining 4 clusters from a dataset.

Uploaded by

Atiqul Islam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views10 pages

Binder 3

Uploaded by

Atiqul Islam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Suggested approach

1. First perform a hierarchical

method to define the number of
clusters
2. Then use the k-means procedure
to actually form the clusters
Defining the number of
clusters: elbow rule (1)
Agglomeration Schedule
n
Stage Cluster First
Stage Number of clusters Cluster Combined Appears
0 12 StageCluster 1 Cluster 2CoefficientsCluster 1 Cluster 2Next Stage
1 11 1 4 7 .015 0 0 4
2 10 2 6 10 .708 0 0 5
3 9 3 8 9 .974 0 0 4
4 8 4 4 8 1.042 1 3 6
5 7 5 1 6 1.100 0 2 7
6 6 6 4 5 3.680 4 0 7
7 5 7 1 4 3.492 5 6 8
8 4 8 1 11 6.744 7 0 9
9 3 9 1 2 8.276 8 0 10
10 2 10 1 12 8.787 9 0 11
11 1 11 1 3 11.403 10 0 0
Elbow rule (2): the
scree diagram
12

8
Distance

0
11 10 9 8 7 6 5 4 3 2 1
Number of clusters
Validating the
analysis
• Impact of initial seeds / order of
cases
• Impact of the selected method
• Consider the relevance of the
chosen set of variables
SPSS Example
1.5 MATTHEW
JULIA

1.0 LUCY
JENNIFER
.5 NICOLE

0.0

JOHN
-.5 PAMELA
THOMAS ARTHUR

-1.0
Component2

-1.5 FRED

-2.0
-1.5 -1.0 -.5 0.0 .5 1.0 1.5 2.0

Component1
Agglomeration Schedule

Stage Cluster First

Cluster Combined Appears
Stage Cluster 1 Cluster 2 Coefficients Cluster 1 Cluster 2 Next Stage
1 3 6 .026 0 0 8
2 2 5 .078 0 0 7
3 4 9 .224 0 0 5
4 1 7 .409 0 0 6
5 4 10 .849 3 0 8
6 1 8 1.456 4 0 7
7 1 2 4.503 6 2 9
8 3 4 9.878 1 5 9
9 1 3 18.000 7 8 0

Number of clusters: 10 – 6 = 4
1.5 MATTHEW
JULIA

1.0 LUCY
JENNIFER
.5 NICOLE

0.0

JOHN
-.5 PAMELA
THOMAS ARTHUR
Cluster Number of Ca

-1.0 4
Component2

3
-1.5 FRED
2

-2.0 1
-1.5 -1.0 -.5 0.0 .5 1.0 1.5 2.0

Component1
Open the dataset
supermarkets.sav
From your N: directory (if you saved it
there last time
Or download it from:
http://www.rdg.ac.uk/~aes02mm/
supermarket.sav
• Open it in SPSS
The supermarkets.sav
dataset

DM-MICA TELTEK Piyush Singh
100% (3)
DM-MICA TELTEK Piyush Singh
12 pages
MCSL-223 2024-25 em
0% (1)
MCSL-223 2024-25 em
13 pages
Business Report Data Mining
91% (11)
Business Report Data Mining
18 pages
K-Means Clustering Using Python
No ratings yet
K-Means Clustering Using Python
30 pages
K-MEANS CLUSTERING PPT Kpu
No ratings yet
K-MEANS CLUSTERING PPT Kpu
4 pages
Cp4252-Machine Learning Lab Manual 23-24
No ratings yet
Cp4252-Machine Learning Lab Manual 23-24
28 pages
Unit 4
No ratings yet
Unit 4
63 pages
Aula - Análise de Clusters
No ratings yet
Aula - Análise de Clusters
93 pages
Natural Language Processing With Java - Sample Chapter
100% (1)
Natural Language Processing With Java - Sample Chapter
33 pages
Business Research Methods: Cluster Analysis
No ratings yet
Business Research Methods: Cluster Analysis
46 pages
ML Seminar
No ratings yet
ML Seminar
37 pages
Presentation Malo
No ratings yet
Presentation Malo
65 pages
K Means Clustering Algorithm
No ratings yet
K Means Clustering Algorithm
12 pages
SPSS Week7
No ratings yet
SPSS Week7
42 pages
SPSS Week7
No ratings yet
SPSS Week7
42 pages
AI Chapter 3 Part 5
No ratings yet
AI Chapter 3 Part 5
30 pages
DWM Exp8 C49
No ratings yet
DWM Exp8 C49
10 pages
Un Supervised Learning
No ratings yet
Un Supervised Learning
22 pages
19 - Sessionppt - Clusteringalgos
No ratings yet
19 - Sessionppt - Clusteringalgos
36 pages
Unit 4 Aam
No ratings yet
Unit 4 Aam
26 pages
Cluster Analysis Finalllll
No ratings yet
Cluster Analysis Finalllll
24 pages
Module 4 - 5TH Sem
No ratings yet
Module 4 - 5TH Sem
23 pages
UNIT - 3 - Clustering
No ratings yet
UNIT - 3 - Clustering
21 pages
MODULE 4 Clustering
No ratings yet
MODULE 4 Clustering
23 pages
Cluster Analysis
No ratings yet
Cluster Analysis
24 pages
Cluster Analysis
No ratings yet
Cluster Analysis
25 pages
K Means Clustering
No ratings yet
K Means Clustering
27 pages
AI Week 11
No ratings yet
AI Week 11
21 pages
Machine Learning Unit 4
No ratings yet
Machine Learning Unit 4
22 pages
Data Mining Business Report 2
No ratings yet
Data Mining Business Report 2
18 pages
Assignment 4 A
No ratings yet
Assignment 4 A
15 pages
Final Group 1
No ratings yet
Final Group 1
31 pages
Clustering Approach For Analyzing The Student's Efficiency and Performance Based
No ratings yet
Clustering Approach For Analyzing The Student's Efficiency and Performance Based
18 pages
Cluster Analysis
No ratings yet
Cluster Analysis
30 pages
K-Means Clustering
No ratings yet
K-Means Clustering
14 pages
SPSS Tutorial Cluster Analysis
No ratings yet
SPSS Tutorial Cluster Analysis
42 pages
Session-13b BRM PDF
No ratings yet
Session-13b BRM PDF
18 pages
SPSS Tutorial Cluster Analysis PDF
No ratings yet
SPSS Tutorial Cluster Analysis PDF
42 pages
K-Means Clustering Optimization Using The Elbow Method and Early Centroid Determination Based On Mean and Median Formula
No ratings yet
K-Means Clustering Optimization Using The Elbow Method and Early Centroid Determination Based On Mean and Median Formula
9 pages
Methods To Find Optimal K Value
No ratings yet
Methods To Find Optimal K Value
14 pages
K Means Clustering
No ratings yet
K Means Clustering
13 pages
DWDM Unit5
No ratings yet
DWDM Unit5
14 pages
Data Mining-4
No ratings yet
Data Mining-4
9 pages
K-Means Clustering Algorithm
No ratings yet
K-Means Clustering Algorithm
13 pages
KMean Merged
No ratings yet
KMean Merged
13 pages
Elbow Method For Optimal Cluster Number in K-Means
No ratings yet
Elbow Method For Optimal Cluster Number in K-Means
8 pages
K-Means Clustering
No ratings yet
K-Means Clustering
7 pages
K-Mean Clustering
No ratings yet
K-Mean Clustering
8 pages
Neurology Clinics-Current Advances and Future Trends in Vascular Neurology 2024
No ratings yet
Neurology Clinics-Current Advances and Future Trends in Vascular Neurology 2024
141 pages
66 Yash DM PR9
No ratings yet
66 Yash DM PR9
4 pages
K-Means Clustering
No ratings yet
K-Means Clustering
8 pages
FullMarks - Clustering StudentSolution 2
No ratings yet
FullMarks - Clustering StudentSolution 2
13 pages
K-Means Algorithm
No ratings yet
K-Means Algorithm
6 pages
Determining Clusters
No ratings yet
Determining Clusters
4 pages
ML Practical 4
No ratings yet
ML Practical 4
2 pages
SAP HANA PAL - K-Means Algorithm or How To Do Cust... - SAP Community-1W
No ratings yet
SAP HANA PAL - K-Means Algorithm or How To Do Cust... - SAP Community-1W
2 pages
Kmeans Clustering
No ratings yet
Kmeans Clustering
3 pages
Cluster Analysis
No ratings yet
Cluster Analysis
5 pages
SAP HANA PAL - K-Means Algorithm or How To Do Cust... - SAP Community-1E
No ratings yet
SAP HANA PAL - K-Means Algorithm or How To Do Cust... - SAP Community-1E
2 pages
Early Prediction For Chronic Kidney Disease Detection A Progressive Approach To Health Management
No ratings yet
Early Prediction For Chronic Kidney Disease Detection A Progressive Approach To Health Management
34 pages
Final R20 M.Tech AI - ML Syllabus
No ratings yet
Final R20 M.Tech AI - ML Syllabus
50 pages
BT-2016 SEM-IV Project Report (Review 1)
No ratings yet
BT-2016 SEM-IV Project Report (Review 1)
42 pages
IE506 Bagging Boosting April5 6
No ratings yet
IE506 Bagging Boosting April5 6
14 pages
IPO Performance Prediction During Covid-19 Using Decision Tree Algorithum
No ratings yet
IPO Performance Prediction During Covid-19 Using Decision Tree Algorithum
12 pages
Basic Concepts of Statistics
No ratings yet
Basic Concepts of Statistics
13 pages
Project Report Arjun
No ratings yet
Project Report Arjun
95 pages
NLP Study Material
No ratings yet
NLP Study Material
8 pages
2021-Application of Artificial Intelligence and Machine Learning To Detect DrillingAnomalies Leading To Stuck Pipe Incidents
No ratings yet
2021-Application of Artificial Intelligence and Machine Learning To Detect DrillingAnomalies Leading To Stuck Pipe Incidents
11 pages
Lecture 07 On Decision Trees
No ratings yet
Lecture 07 On Decision Trees
36 pages
B1 Major Project Paper
No ratings yet
B1 Major Project Paper
8 pages
Big Data Assignments Answer
No ratings yet
Big Data Assignments Answer
15 pages
GANS
No ratings yet
GANS
22 pages
Major Premise: All Students Attend School Regularly Minor Premise: John Is A Student Conclusion: John Attends School Regularly
No ratings yet
Major Premise: All Students Attend School Regularly Minor Premise: John Is A Student Conclusion: John Attends School Regularly
41 pages
M.SC Statistics101011 PDF
No ratings yet
M.SC Statistics101011 PDF
35 pages
Practice Questions AI & Robotics
No ratings yet
Practice Questions AI & Robotics
17 pages
9071 PDF
No ratings yet
9071 PDF
16 pages
Assignment 6 AI Travelling Salesman Problem - Jupyter Notebook
No ratings yet
Assignment 6 AI Travelling Salesman Problem - Jupyter Notebook
1 page
Crop Yield Prediction Based On Indian Agriculture Using Machine Learning
No ratings yet
Crop Yield Prediction Based On Indian Agriculture Using Machine Learning
5 pages
Final Project Journal C4.5 Algorithm Decision Tree
No ratings yet
Final Project Journal C4.5 Algorithm Decision Tree
8 pages
Predicting Gold Prices: Megan Potoski
No ratings yet
Predicting Gold Prices: Megan Potoski
5 pages
Patient AttendanceNo-Show Prediction
No ratings yet
Patient AttendanceNo-Show Prediction
10 pages
1 s2.0 S131915781730544X Main
No ratings yet
1 s2.0 S131915781730544X Main
7 pages
AIDI 1002 FinalExam Section 01
No ratings yet
AIDI 1002 FinalExam Section 01
2 pages
Quiz Solution IE683
No ratings yet
Quiz Solution IE683
1 page
Organic Reaction Mechanisms 2014: An annual survey covering the literature dated January to December 2014
From Everand
Organic Reaction Mechanisms 2014: An annual survey covering the literature dated January to December 2014
A. C. Knipe
No ratings yet
D3 4.x数据可视化实战手册（第2版）: Chinese Edition
From Everand
D3 4.x数据可视化实战手册（第2版）: Chinese Edition
Posts & Telecom Press
No ratings yet
Green Catalysis: Homogeneous Catalysis
From Everand
Green Catalysis: Homogeneous Catalysis
Wiley
No ratings yet
INVENRELATION
From Everand
INVENRELATION
Shih Yu Chang
No ratings yet
Green Catalysis: Heterogeneous Catalysis
From Everand
Green Catalysis: Heterogeneous Catalysis
Wiley
No ratings yet
Physical Pharmaceutics-II Lab Manual as per the PCI Syllabus
From Everand
Physical Pharmaceutics-II Lab Manual as per the PCI Syllabus
A. Pavani
No ratings yet

Binder 3

Uploaded by

Binder 3

Uploaded by

Suggested approach

1. First perform a hierarchical

Stage Cluster First

You might also like