Lecture 11: Hierarchical Clustering

By: Abdul Hameed

1
Hierarchical Clustering

• Hierarchical clustering (also called hierarchical cluster analysis or HCA) is a method of cluster analysis which seeks to build a hierarchy of clusters. E.g. all files and folders on our hard disk are organized in a hierarchy.

2
Hierarchical Clustering - Dendrogram
• It constructs a binary tree over the data that successively combines related groups of points. The graphical representation of the resulting hierarchy is a tree-structured graph called a dendrogram.

3
Hierarchical Clustering - Strategies

• Agglomerative (bottom-up):
• Begin with singletons (sets with one element each)
• Merge them until the full set S is reached as the root
• In each step, the two closest clusters are aggregated into a new combined cluster
• In this way, the number of clusters in the dataset is reduced at each step
• Eventually, all records/elements are combined into a single huge cluster
• It is the most common approach
• Divisive (top-down):
• All records start combined in one big cluster
• The most dissimilar records are then split off, recursively partitioning S until
  singleton sets are reached

4
Hierarchical Clustering - Strategies

5
Hierarchical Agglomerative Clustering - Steps

1. Start by assigning each item to its own cluster, so that if you have N items,
   you now have N clusters, each containing just one item. Let the distances
   (similarities) between the clusters equal the distances (similarities)
   between the items they contain.
2. Find the closest (most similar) pair of clusters and merge them into a single
   cluster, so that now you have one less cluster.
3. Compute distances (similarities) between the new cluster and each of the old
   clusters.
4. Repeat steps 2 and 3 until all items are clustered into a single cluster of
   size N. (A short code sketch of these steps follows.)
6
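
The loop below is a from-scratch sketch of steps 1-4 (my own illustration, not from the slides), using single-link distances on 1-D data; library routines such as SciPy's scipy.cluster.hierarchy.linkage implement the same procedure far more efficiently.

```python
# Minimal sketch of steps 1-4 on 1-D data using single-link distances.
# Illustrative only; the function and variable names are not from the lecture.
def agglomerate(points):
    # Step 1: every item starts in its own cluster.
    clusters = [[i] for i in range(len(points))]
    merges = []
    while len(clusters) > 1:
        # Step 2: find the closest pair of clusters (single link = smallest
        # distance between any member of one and any member of the other).
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = min(abs(points[i] - points[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merges.append((clusters[a], clusters[b], d))
        # Step 3: replace the two clusters with their union; distances to the
        # new cluster are recomputed on the next pass.
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    # Step 4: the loop ends when a single cluster of size N remains.
    return merges

print(agglomerate([10, 7, 28, 20, 35]))  # the marks used in the next example
```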
Example - Problem

• Suppose I want to divide my students into different groups.


• I have the marks scored by each student in an assignment and based on
these marks, I want to segment them into groups.
• There’s no fixed target here as to how many groups to have.
• Since I don’t know what type of students should be assigned to which
group, it cannot be solved as a supervised learning problem.
• So, I shall try to apply hierarchical clustering here and segment the students
into different groups.

7
Example - Dataset

• Let’s take a sample of five students

Reg# Marks
1 10
2 7
3 28
4 20
5 35

8
Example – Proximity Matrix

• The proximity matrix stores the distance between every pair of points.

Reg#  Marks
1     10
2     7
3     28
4     20
5     35

Distances have been calculated using the Euclidean distance. For example, the
distance between students 1 and 2 is |10 - 7| = 3.

9
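
For reference, the whole proximity matrix can be reproduced in a couple of lines (a sketch of mine, assuming NumPy and SciPy are installed; not part of the slides):

```python
# Pairwise Euclidean distances between the students' marks.
import numpy as np
from scipy.spatial.distance import pdist, squareform

marks = np.array([[10], [7], [28], [20], [35]], dtype=float)
proximity = squareform(pdist(marks, metric='euclidean'))
print(proximity)  # e.g. the entry for students 1 and 2 is |10 - 7| = 3
```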
Example – Step 1

• Assign each point to its own individual cluster.

10
Example – Step 2

• Find the smallest distance in the proximity matrix and merge the points with
  that smallest distance. Here the smallest distance is 3, between students 1
  and 2.

11
Example – Step 2

• Update the tables. Before the merge:

Reg#  Marks
1     10
2     7
3     28
4     20
5     35

After merging students 1 and 2 into the cluster (1, 2):

Reg#   Marks
(1,2)  10
3      28
4      20
5      35

12
Example – Step 3

• Now repeat step 2 until only a single cluster is left.

[Dendrogram built over the marks 10, 7, 28, 20, 35]

13
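
SciPy can build and draw the full dendrogram in a few lines. This is my own sketch (assuming scipy and matplotlib are installed), not the slides' method: the worked example above represents a merged cluster by a single mark, so the exact merge heights depend on which linkage rule you choose.

```python
# Build and plot a dendrogram for the marks data (sketch).
import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

marks = np.array([[10], [7], [28], [20], [35]], dtype=float)
Z = linkage(marks, method='average')              # average-link agglomeration
dendrogram(Z, labels=['1', '2', '3', '4', '5'])   # leaves labelled by Reg#
plt.ylabel('Distance')
plt.show()
```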
Example – How many clusters?

• Now we can set a threshold distance and draw a horizontal line. (Generally, we
  try to set the threshold in such a way that it cuts the tallest vertical
  line.) Let's set this threshold as 12 and draw a horizontal line.
• The number of clusters will be the number of vertical lines intersected by the
  line drawn at the threshold. Since this line intersects 2 vertical lines, we
  will have 2 clusters: one cluster with the samples (1, 2, 4) and the other
  with the samples (3, 5).

14
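
Cutting the tree at a chosen threshold can also be done programmatically with SciPy's fcluster; a sketch under the same assumptions as above (the exact grouping can depend on the linkage rule and on how distance ties are broken):

```python
# Cut the hierarchy at distance threshold 12 to obtain flat cluster labels.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

marks = np.array([[10], [7], [28], [20], [35]], dtype=float)
Z = linkage(marks, method='average')
labels = fcluster(Z, t=12, criterion='distance')  # one label per student
print(labels)
```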
Distance Measures

• Single link: smallest distance between an element in one cluster and an
  element in the other, i.e.
  dist(Ci, Cj) = min { d(x, y) : x ∈ Ci, y ∈ Cj }
• Complete link: largest distance between an element in one cluster and an
  element in the other, i.e.
  dist(Ci, Cj) = max { d(x, y) : x ∈ Ci, y ∈ Cj }
• Average: average distance between elements in one cluster and elements in the
  other, i.e.
  dist(Ci, Cj) = (1 / (|Ci| |Cj|)) Σ_{x ∈ Ci, y ∈ Cj} d(x, y)

15
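
As a small numerical check of the three rules, the pairwise distances between two hypothetical 1-D clusters can be aggregated directly (example values are mine, not from the slides):

```python
# Single, complete and average link distances between two clusters.
from itertools import product

A, B = [10, 7], [20, 28]                       # hypothetical 1-D clusters
pairs = [abs(a - b) for a, b in product(A, B)]

single = min(pairs)                            # smallest pairwise distance
complete = max(pairs)                          # largest pairwise distance
average = sum(pairs) / len(pairs)              # mean of all pairwise distances

print(single, complete, average)               # 10 21 15.5
```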
Summary

• Hierarchical Clustering
  • For a dataset consisting of n points:
  • O(n²) space: it requires storing the distance matrix
  • O(n³) time complexity in most cases (agglomerative clustering)

• Advantages
  • Dendrograms are great for visualization
  • Provides hierarchical relations between clusters

• Disadvantages
  • Not easy to define levels for clusters
  • Can never undo what was done previously
  • Sensitive to cluster distance measures and noise/outliers
  • Experiments showed that other clustering techniques outperform hierarchical
    clustering

• There are several variants to overcome its weaknesses
  • BIRCH: scalable to a large data set
  • ROCK: clustering categorical data
  • CHAMELEON: hierarchical clustering using dynamic modelling
16
Divisive Clustering

17
Divisive Hierarchical Clustering

• Divisive clustering, or DIANA (DIvisive ANAlysis Clustering), is a top-down
  clustering approach.
• The process starts at the root with all the points as one cluster.
• It recursively splits the higher-level clusters to build the dendrogram.
• It can be considered a global approach.
• Divisive clustering is good at identifying large clusters, while agglomerative
  clustering is good at identifying small clusters.
18
Hierarchical Clustering - Strategies

19
Hierarchical Clustering - Steps

1. Start with all data points in one cluster.
2. After each iteration, remove the outsiders/heterogeneous objects from the
   cluster.
3. Stop when each example is in its own singleton cluster; otherwise go to
   step 2.
20
Example - Divisive Hierarchical Clustering

Data points:

ID  X1  X2  X3
1   1   6   -1
2   3   7   0
3   3   5   -2
4   4   8   -1
5   5   8   0

Proximity Matrix using Manhattan Distance:

ID  1  2  3  4  5
1   0  4  4  5  7
2   4  0  4  3  3
3   4  4  0  5  7
4   5  3  5  0  2
5   7  3  7  2  0

The top-level cluster A = {1, 2, 3, 4, 5} contains all 5 points.
21
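
The Manhattan-distance proximity matrix above can be reproduced with SciPy's cityblock metric (my sketch, not part of the slides):

```python
# Pairwise Manhattan (cityblock) distances for the five 3-D points.
import numpy as np
from scipy.spatial.distance import pdist, squareform

X = np.array([[1, 6, -1],
              [3, 7,  0],
              [3, 5, -2],
              [4, 8, -1],
              [5, 8,  0]], dtype=float)
D = squareform(pdist(X, metric='cityblock'))
print(D.astype(int))
```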
Example

ID  1  2  3  4  5
1   0  4  4  5  7
2   4  0  4  3  3
3   4  4  0  5  7
4   5  3  5  0  2
5   7  3  7  2  0

Proximity Matrix using Manhattan Distance

Find the most dissimilar point: take the average distance to the other points.
For example, for point 1 we compute (4 + 4 + 5 + 7) / 4 = 5.00.

Similarly, the average distances for all the points are:

Point     1     2     3     4     5
Distance  5.00  3.50  5.00  3.75  4.75

22
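
The averages in the table are just row means of the proximity matrix (excluding the diagonal); a sketch of the computation (variable names are mine, not from the slides):

```python
# Average distance of each point to the other points, used to pick the most
# dissimilar point as the seed of the splinter group.
import numpy as np

D = np.array([[0, 4, 4, 5, 7],
              [4, 0, 4, 3, 3],
              [4, 4, 0, 5, 7],
              [5, 3, 5, 0, 2],
              [7, 3, 7, 2, 0]], dtype=float)
avg = D.sum(axis=1) / (D.shape[0] - 1)
print(avg)  # [5.   3.5  5.   3.75 4.75]
```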
Example

Point     1     2     3     4     5
Distance  5.00  3.50  5.00  3.75  4.75

• Since points 1 and 3 are tied for the most dissimilar, we pick one of these
  arbitrarily.
• I will use point 1.
• Now we have
  • A = {2, 3, 4, 5}
  • B = {1}

23
Example

ID  1  2  3  4  5
1   0  4  4  5  7
2   4  0  4  3  3
3   4  4  0  5  7
4   5  3  5  0  2
5   7  3  7  2  0

Proximity Matrix using Manhattan Distance

Now we want to move any points that are closer to B than to (the other points
in) A into B. So for each point x in A we compute d(x, A) and d(x, B). For
example, for point 2:

    d(2, A) = (4 + 3 + 3) / 3 = 3.33,  d(2, B) = 4,  so d(2, A) - d(2, B) = -0.67

Similarly, the d(x, A) - d(x, B) differences for all the points in A are:

Point       2      3     4      5
Difference  -0.67  1.33  -1.67  -3.00

24
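
The same matrix gives all of the d(x, A) - d(x, B) differences at once; a sketch for the current split A = {2, 3, 4, 5}, B = {1} (the code uses 0-based indices, and the names are mine):

```python
# For each x in A, compare its average distance to the rest of A with its
# average distance to B; a positive value means x is closer to B.
import numpy as np

D = np.array([[0, 4, 4, 5, 7],
              [4, 0, 4, 3, 3],
              [4, 4, 0, 5, 7],
              [5, 3, 5, 0, 2],
              [7, 3, 7, 2, 0]], dtype=float)
A, B = [1, 2, 3, 4], [0]                      # 0-based: points 2-5 and point 1
for x in A:
    d_A = D[x, [y for y in A if y != x]].mean()
    d_B = D[x, B].mean()
    print(x + 1, round(d_A - d_B, 2))         # 2 -0.67, 3 1.33, 4 -1.67, 5 -3.0
```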
Example

Point       2      3     4      5
Difference  -0.67  1.33  -1.67  -3.00

• Only point 3 has a difference greater than zero, so we move it to cluster B.
• Now we have
  • A = {2, 4, 5}
  • B = {1, 3}

25
Example

ID  1  2  3  4  5
1   0  4  4  5  7
2   4  0  4  3  3
3   4  4  0  5  7
4   5  3  5  0  2
5   7  3  7  2  0

Proximity Matrix using Manhattan Distance

We check whether any additional points should be moved. Again, we compute
d(x, A) - d(x, B) for each point in A. The differences are:

Point       2     4     5
Difference  -1.0  -2.5  -4.5

All are negative (that is, the remaining points in A are closer to A than to B),
so we stop this division and we have the two clusters {2, 4, 5} and {1, 3}.
26
Example

ID  1  2  3  4  5
1   0  4  4  5  7
2   4  0  4  3  3
3   4  4  0  5  7
4   5  3  5  0  2
5   7  3  7  2  0

Proximity Matrix using Manhattan Distance

For the next step, we choose the cluster with the largest diameter, that is, the
cluster with the greatest distance between two of its points. The diameter of
{2, 4, 5} is 3 (e.g. the distance between points 2 and 4), while the diameter of
{1, 3} is 4 (the distance between points 1 and 3).

So cluster {1, 3} has the largest diameter. Trivially, it is split into {1} and
{3}. So now we have clusters {2, 4, 5}, {1} and {3}.

Now recursively apply the same steps to {2, 4, 5} to split it further.

27
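
Putting the splinter-selection and point-moving steps together, here is a compact sketch of one DIANA-style split of a cluster (my own illustration, not the lecture's code); applied to the full point set it reproduces the {2, 4, 5} / {1, 3} split above:

```python
# One DIANA-style split: seed B with the most dissimilar point, then keep
# moving over any point whose average distance to B is smaller than to A.
import numpy as np

def diana_split(D, members):
    members = list(members)
    avg = [D[x, [y for y in members if y != x]].mean() for x in members]
    B = [members[int(np.argmax(avg))]]                 # splinter seed
    A = [x for x in members if x not in B]
    moved = True
    while moved and len(A) > 1:
        moved = False
        diffs = {x: D[x, [y for y in A if y != x]].mean() - D[x, B].mean()
                 for x in A}
        x, best = max(diffs.items(), key=lambda kv: kv[1])
        if best > 0:                                   # x is closer to B than to A
            A.remove(x)
            B.append(x)
            moved = True
    return A, B

D = np.array([[0, 4, 4, 5, 7],
              [4, 0, 4, 3, 3],
              [4, 4, 0, 5, 7],
              [5, 3, 5, 0, 2],
              [7, 3, 7, 2, 0]], dtype=float)
print(diana_split(D, range(5)))  # 0-based indices: ([1, 3, 4], [0, 2])
```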
Example - Dendrogram

ID  1  2  3  4  5
1   0  4  4  5  7
2   4  0  4  3  3
3   4  4  0  5  7
4   5  3  5  0  2
5   7  3  7  2  0

Proximity Matrix using Manhattan Distance

28
