AI20 - Hierarchical-Clustering

Hierarchical clustering generates a hierarchy of partitions from a dataset, allowing users to identify sub-populations. It includes two main methods: agglomerative, which merges clusters, and divisive, which splits them, with various distance measures to determine cluster similarity. While hierarchical clustering is easy to implement and does not require prior knowledge of the number of clusters, it can be sensitive to outliers and is not suitable for large datasets.


Hierarchical Clustering
• Hierarchical methods generate a hierarchy of partitions, i.e.
• a partition P1 into 1 cluster (the entire collection)
• a partition P2 into 2 clusters
– …
• a partition Pn into n clusters (each object forms its own cluster)

• It is then up to the user to decide which of the partitions reflects
  actual sub-populations in the data.
• Representing data objects in the form of a hierarchy is useful for
  data summarization and visualization.

Note: A sequence of partitions is called "hierarchical" if each cluster
in a given partition is the union of clusters in the next larger partition.

[Figure: partitions P4, P3, P2, P1. Top: a hierarchical sequence of partitions; bottom: a non-hierarchical sequence.]
HC methods come in two varieties: agglomerative and divisive.

Agglomerative methods [AGNES (AGglomerative NESting)]:
• Start with partition Pn, where each object forms its own cluster.
• Merge the two closest clusters, obtaining Pn-1.
• Repeat merging until only one cluster is left.

Divisive methods [DIANA (DIvisive ANAlysis)]:
• Start with P1.
• Split the collection into two clusters that are as homogeneous (and
  as different from each other) as possible.
• Apply the splitting procedure recursively to the resulting clusters.

• Agglomerative methods require a rule to decide which clusters to merge.
  Typically one defines a distance between clusters and then merges the
  two clusters that are closest.
• Divisive methods require a rule for splitting a cluster.
Hierarchical Agglomerative Clustering
• Define a distance between clusters
• Initialize: every example is its own cluster
• Iterate:
  – Compute distances between all clusters (store for efficiency)
  – Merge the two closest clusters
• Save both the clustering and the sequence of cluster operations
  (the “dendrogram”)

[Figure: iterations 1–3 of the merging process; initially, every datum is its own cluster.]

• Builds up a sequence of clusters (“hierarchical”)
• Because two clusters are merged into one at each iteration, and each
  cluster contains at least one object, an agglomerative method requires
  at most n iterations.
• Algorithm complexity: O(N²)
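
As a concrete illustration of the loop above, here is a minimal sketch in Python/NumPy (function and variable names are my own, not from the slides); it uses single linkage as the cluster distance and records every merge:

```python
import numpy as np

def agglomerative_sketch(X):
    """Naive HAC sketch: repeatedly merge the two closest clusters.

    X: (n, d) array of observations. Uses single linkage as the
    cluster distance. Returns the list of merges performed.
    """
    # Partition P_n: every observation starts in its own cluster.
    clusters = [[i] for i in range(len(X))]
    merges = []

    def single_link(a, b):
        # Minimum pairwise distance between members of the two clusters.
        d = np.linalg.norm(X[a][:, None, :] - X[b][None, :, :], axis=-1)
        return d.min()

    # n - 1 merges take us from P_n down to P_1.
    while len(clusters) > 1:
        # Find the two closest clusters.
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                dij = single_link(clusters[i], clusters[j])
                if best is None or dij < best[0]:
                    best = (dij, i, j)
        dij, i, j = best
        merges.append((clusters[i], clusters[j], dij))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return merges
```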
Dendrograms
The result of hierarchical clustering can be represented as a binary tree:
• The root of the tree represents the entire collection
• Terminal nodes represent observations
• Each interior node represents a cluster
• Each subtree represents a partition

Note: For HAC methods, the merge order defines a sequence of n subtrees
of the full tree. For HDC methods, a sequence of subtrees can be defined
if there is a figure of merit for each split.

A clustering is obtained by cutting the dendrogram at a desired level:
each connected component then forms a cluster.
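
For reference, this tree-and-cut view maps directly onto SciPy's hierarchy routines; a minimal sketch (the data array is made up for illustration, and dendrogram() needs matplotlib to actually draw):

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, dendrogram, fcluster

X = np.random.rand(20, 2)            # illustrative data, 20 observations
Z = linkage(X, method="average")     # (n-1) x 4 merge table = the binary tree

# Each interior node of the tree is one row of Z:
# [left child id, right child id, merge distance, size of new cluster].
dendrogram(Z)                        # draws the tree (requires matplotlib)

# Cutting the dendrogram at height t: every connected component below
# the cut becomes one cluster.
labels = fcluster(Z, t=0.4, criterion="distance")
```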
Hierarchical agglomerative clustering
Need to define a distance d(P,Q) between groups, given a distance
measure d(x,y) between observations.
Commonly used distance measures:
1. d1(P,Q) = min d(x,y), for x in P, y in Q  (single linkage)
2. d2(P,Q) = ave d(x,y), for x in P, y in Q  (average linkage)
3. d3(P,Q) = max d(x,y), for x in P, y in Q  (complete linkage)
4. d4(P,Q) = ||x̄P − x̄Q||²  (centroid method)
5. d5(P,Q) = (|P|·|Q| / (|P| + |Q|)) · ||x̄P − x̄Q||²  (Ward’s method)

Here x̄P denotes the mean (centroid) of cluster P and |P| the number of
objects in P. d5 is called Ward’s distance.
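
As a concrete reference, the five measures can be written directly in NumPy; a minimal sketch with helper names of my own (P and Q are arrays whose rows are the observations in each cluster):

```python
import numpy as np

def pairwise(P, Q):
    # All pairwise distances d(x, y) for x in P, y in Q.
    return np.linalg.norm(P[:, None, :] - Q[None, :, :], axis=-1)

def d1(P, Q):  # single linkage: minimum pairwise distance
    return pairwise(P, Q).min()

def d2(P, Q):  # average linkage: average pairwise distance
    return pairwise(P, Q).mean()

def d3(P, Q):  # complete linkage: maximum pairwise distance
    return pairwise(P, Q).max()

def d4(P, Q):  # centroid method: squared distance between cluster means
    return np.sum((P.mean(axis=0) - Q.mean(axis=0)) ** 2)

def d5(P, Q):  # Ward's method: |P||Q| / (|P| + |Q|) times d4
    return len(P) * len(Q) / (len(P) + len(Q)) * d4(P, Q)
```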


Motivation for Ward’s distance:
• Let Pk = {P1, …, Pk} be a partition of the observations into k groups.
• Measure the goodness of a partition by the sum of squared distances
  of observations from their cluster means:

  RSS(Pk) = Σ_{i=1..k} Σ_{j ∈ Pi} ||xj − x̄Pi||²

• Consider all possible (k−1)-partitions obtainable from Pk by a merge.
• Merging the two clusters with the smallest Ward’s distance gives the
  (k−1)-partition with the smallest RSS, i.e. it optimizes the goodness
  of the new partition.
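
This claim can be checked numerically: the increase in RSS caused by merging two clusters equals their Ward distance d5, so merging the pair with the smallest d5 gives the best (k−1)-partition reachable by a single merge. A small sketch with made-up random clusters:

```python
import numpy as np

def rss(clusters):
    # Sum of squared distances of observations from their cluster means.
    return sum(np.sum((C - C.mean(axis=0)) ** 2) for C in clusters)

def ward(P, Q):
    # d5(P, Q) = |P||Q| / (|P| + |Q|) * ||mean(P) - mean(Q)||^2
    return len(P) * len(Q) / (len(P) + len(Q)) * np.sum(
        (P.mean(axis=0) - Q.mean(axis=0)) ** 2)

rng = np.random.default_rng(0)
P = rng.normal(size=(5, 2))
Q = rng.normal(size=(8, 2))
R = rng.normal(size=(4, 2))

# Merging P and Q increases RSS by exactly d5(P, Q).
delta = rss([np.vstack([P, Q]), R]) - rss([P, Q, R])
print(np.isclose(delta, ward(P, Q)))   # True
```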
Cluster Distances
• Single linkage produces a minimal spanning tree.
• Complete linkage avoids elongated clusters.

Single Link
• Use the maximum similarity of pairs (the most similar pair):
  sim(ci, cj) = max_{x ∈ ci, y ∈ cj} sim(x, y)
• Can result in “straggly” (long and thin) clusters due to the
  chaining effect.
• After merging ci and cj, the similarity of the resulting cluster to
  another cluster ck is:
  sim((ci ∪ cj), ck) = max(sim(ci, ck), sim(cj, ck))


Single Link Example



Complete Link Agglomerative Clustering
• Use the minimum similarity of pairs (the least similar pair):
  sim(ci, cj) = min_{x ∈ ci, y ∈ cj} sim(x, y)
• Makes “tighter,” more spherical clusters that are typically preferable.
• After merging ci and cj, the similarity of the resulting cluster to
  another cluster ck is:
  sim((ci ∪ cj), ck) = min(sim(ci, ck), sim(cj, ck))
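
Both update rules are one-liners; a minimal sketch with made-up similarity values, showing how the similarity to a third cluster ck is carried forward after ci and cj are merged:

```python
# Illustrative similarities of clusters ci and cj to a third cluster ck.
sim_ci_ck, sim_cj_ck = 0.8, 0.3

# Single link: the merged cluster is as similar to ck as its most
# similar part.
sim_single = max(sim_ci_ck, sim_cj_ck)    # 0.8

# Complete link: the merged cluster is only as similar to ck as its
# least similar part.
sim_complete = min(sim_ci_ck, sim_cj_ck)  # 0.3
```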



Complete Link Example


Comparing the distance measures
• The minimum and maximum measures represent two extremes in measuring
  the distance between clusters. They tend to be overly sensitive to
  outliers or noisy data.
• The use of mean or average distance is a compromise between the
  minimum and maximum distances and overcomes the outlier sensitivity
  problem.
• Whereas the mean distance is the simplest to compute, the average
  distance is advantageous in that it can handle categorical as well as
  numeric data, because the mean vector can be difficult or impossible
  to define for categorical data.
Solved example: single-linkage clustering of the 1-D points 18, 22, 25, 27, 42, 43 (distance = absolute difference)
Step 1: initial distance matrix

18 22 25 27 42 43

18 0 4 7 9 24 25

22 4 0 3 5 20 21

25 7 3 0 2 17 18

27 9 5 2 0 15 16

42 24 20 17 15 0 1

43 25 21 18 16 1 0
Step 2: merge 42 and 43 (closest pair, distance 1)

18 22 25 27 42, 43

18 0 4 7 9 24

22 4 0 3 5 20

25 7 3 0 2 17

27 9 5 2 0 15

42, 43 24 20 17 15 0
Step 3: merge 25 and 27 (distance 2)

18 22 25, 27 42, 43

18 0 4 7 24

22 4 0 3 20

25, 27 7 3 0 15

42, 43 24 20 15 0
Step 4: merge 22 with {25, 27} (distance 3)

18 22, 25, 27 42, 43

18 0 4 24

22, 25, 27 4 0 15

42, 43 24 15 0
Step 5: merge 18 with {22, 25, 27} (distance 4)

18, 22, 25, 27 42, 43

18, 22, 25, 27 0 15

42, 43 15 0
Step 6: merge {18, 22, 25, 27} with {42, 43} (distance 15); one cluster remains

18, 22, 25, 27, 42, 43

18, 22, 25, 27, 42, 43 0
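
The merge sequence above can be reproduced with SciPy's single-linkage implementation (a sketch; the points are the 1-D values from the example):

```python
import numpy as np
from scipy.cluster.hierarchy import linkage

points = np.array([[18], [22], [25], [27], [42], [43]], dtype=float)
Z = linkage(points, method="single")
print(Z)
# Merge order and distances match the steps above:
# 42 with 43 at distance 1, 25 with 27 at 2, 22 with {25, 27} at 3,
# 18 with {22, 25, 27} at 4, and the two remaining clusters at 15.
```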


Exit criteria
• Can work with a pre-determined number of clusters
• Can set a threshold on the dissimilarity of clusters:
  • In HAC, stop merging once the distance between the nearest clusters
    exceeds the threshold
  • In HDC, stop splitting once the distances between members of a
    cluster fall below the threshold
• Can be decided from the dendrogram
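
With SciPy, both exit criteria correspond to how the linkage matrix is cut; a brief sketch reusing the points from the solved example (the threshold value is illustrative):

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

points = np.array([[18], [22], [25], [27], [42], [43]], dtype=float)
Z = linkage(points, method="single")

# Exit with a pre-determined number of clusters:
labels_k = fcluster(Z, t=2, criterion="maxclust")    # e.g. [1 1 1 1 2 2]

# Exit with a dissimilarity threshold: stop merging once the nearest
# clusters are farther apart than t.
labels_t = fcluster(Z, t=10, criterion="distance")   # same two clusters here
```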
Advantages
• Easy to implement and understand
• No prior information about the number of clusters is required
• Outliers can be detected with the help of the dendrogram
• Deterministic and predictable
Disadvantages
• Not suitable for large datasets
• Difficulty handling clusters of different sizes
• Sensitive to outliers and noise in the dataset
Challenge with divisive methods
• How to partition a large cluster into several smaller ones?
• For example, there are 2^(n−1) − 1 possible ways to partition a set of n
  objects into two non-empty, exclusive subsets.
• When n is large, it is computationally prohibitive to examine all
possibilities.
• A divisive method typically uses heuristics in partitioning, which can
lead to inaccurate results. For the sake of efficiency, divisive methods
typically do not backtrack on partitioning decisions that have been made.
Once a cluster is partitioned, any alternative partitioning of this cluster
will not be considered again. Due to the challenges in divisive methods,
there are many more agglomerative methods than divisive methods.
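
The count 2^(n−1) − 1 is easy to verify by brute force for small n; a quick sketch (exhaustive enumeration is only feasible for tiny n, which is exactly the point):

```python
from itertools import combinations

def count_bisections(n):
    # Enumerate every split of {0, ..., n-1} into two non-empty subsets.
    items = set(range(n))
    splits = set()
    for r in range(1, n):
        for subset in combinations(sorted(items), r):
            rest = tuple(sorted(items - set(subset)))
            splits.add(frozenset([subset, rest]))
    return len(splits)

for n in range(2, 8):
    print(n, count_bisections(n), 2 ** (n - 1) - 1)   # the two counts agree
```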
