Hierarchical Clustering

The document discusses hierarchical clustering, detailing two main types: agglomerative (bottom-up) and divisive (top-down), along with their respective algorithms and methodologies. It explains the process of merging or splitting clusters based on distance metrics and illustrates the concept using examples and distance matrices. Additionally, it introduces the dendrogram as a visual representation of the clustering hierarchy.


Hierarchical Clustering

• Clusters are created in levels, producing a set of clusters at each level.
• Types:
– Agglomerative
• Initially each item is in its own cluster
• Iteratively clusters are merged together
• Bottom Up approach is followed
– Divisive
• Initially all items are in one cluster
• Large clusters are successively divided
• Top Down approach is followed
Types of hierarchical clustering
• Divisive (top down) clustering: starts with all data points in one cluster (the root), then
  – splits the root into a set of child clusters; each child cluster is recursively divided further
  – stops when only singleton clusters of individual data points remain, i.e., each cluster contains only a single point

• Agglomerative (bottom up) clustering: the dendrogram is built from the bottom level by
  – merging the most similar (or nearest) pair of clusters
  – stopping when all the data points are merged into a single cluster (i.e., the root cluster)
Hierarchical Clustering
• A distance matrix is used as the clustering criterion. This method does not require the number of clusters k as an input, but it needs a termination condition.

[Figure: Agglomerative Nesting (AGNES) proceeds from Step 0 to Step 4, merging a and b into ab, d and e into de, c and de into cde, and finally ab and cde into abcde; Divisive Analysis (DIANA) performs the same splits in reverse, starting from the single cluster abcde and ending with the individual points.]
Hierarchical Algorithms
Algorithms are based on how the distance between two clusters is defined:
• Single Link – the distance between two clusters is the minimum distance between the two groups (the similarity of two clusters is the similarity of their most similar members)
• Complete Link – the distance between two clusters is the maximum distance between the two groups (the similarity of two clusters is the similarity of their most dissimilar members)
• Average Link – the distance between two clusters is the average of all pairwise distances between members of the two groups
Dendrogram
• Dendrogram: a tree data
structure which illustrates
hierarchical clustering
techniques.
• Each level shows clusters for
that level.
– Leaf – individual (singleton) clusters
– Root – a single cluster containing all items
• A cluster at level i is the union of its child clusters at level i+1.
Levels of Clustering
Single Link
• View all items with links (distances) between them.
• Finds maximal connected components in this graph.
• Two clusters are merged if there is at least one edge which
connects them.
• Uses threshold distances at each level.
• Could be agglomerative or divisive.
AGNES (Agglomerative Nesting)
• Introduced in Kaufman and Rousseeuw (1990)
• Uses the Single-Link method and the dissimilarity
matrix.
– Merge nodes that have the least dissimilarity
– Continue the merging of nodes in a non-descending fashion
– Eventually all nodes belong to the same cluster

[Figure: three scatter plots (axes 0 to 10) showing the data points being progressively merged into larger clusters by AGNES.]
DIANA (Divisive Analysis)
• Introduced in Kaufman and Rousseeuw (1990)
• Inverse order of AGNES
• Eventually each node forms a cluster on its own
• A simple divisive algorithm is based on the MST version of the single-link algorithm:
– All items are initially placed in one cluster
– Clusters are split into two until all items are in their own cluster

[Figure: three scatter plots (axes 0 to 10) showing DIANA progressively splitting the single cluster into smaller clusters.]
AGNES - Example 1
• Consider the data points A(1,1), B(1.5, 1.5), C (5,5), D(3,4),
E(4,4) and F(3,3.5).
• Example reference: people.revoledu.com/kardi/tutorial/Clustering/Numerical%20Example.htm (listed in the References slide)
• The distance (Euclidean) matrix is
Dist A B C D E F
A 0.00 0.71 5.66 3.61 4.24 3.20
B 0.71 0.00 4.95 2.92 3.54 2.50
C 5.66 4.95 0.00 2.24 1.41 2.50
D 3.61 2.92 2.24 0.00 1.00 0.50
E 4.24 3.54 1.41 1.00 0.00 1.12
F 3.20 2.50 2.50 0.50 1.12 0.00
Example 1 (cont’d)
• Initially, each of the 6 objects is in a cluster of its own.
• In the beginning we therefore have 6 clusters. We iterate until a single cluster contains all six original objects.
• In each step of the iteration, we find the closest pair of clusters.
• The closest pair is clusters D and F, with the shortest distance of 0.5.
• Thus, we group clusters D and F into cluster (D, F).
• Distances between the ungrouped clusters do not change from the original distance matrix.
• Then we update the distance matrix as given below.
Dist A B C D,F E
A 0.00 0.71 5.66 ? 4.24
B 0.71 0.00 4.95 ? 3.54
C 5.66 4.95 0.00 ? 1.41
D,F ? ? ? 0.00 ?
E 4.24 3.54 1.41 ? 0.00
Example 1 (cont’d)
• Calculate the distance between the newly grouped cluster (D, F) and the other clusters.
  – Use the linkage rule (single link): with single linkage, we take the minimum distance between the original objects of the two clusters.
Dist A B C D,F E
A 0.00 0.71 5.66 ? 4.24
B 0.71 0.00 4.95 ? 3.54
C 5.66 4.95 0.00 ? 1.41
D,F ? ? ? 0.00 ?
E 4.24 3.54 1.41 ? 0.00
Example 1 (cont’d) – Distance table update:
Using the input distance matrix, the distance between cluster (D, F) and cluster A is computed as
d((D,F), A) = min(d(D,A), d(F,A)) = min(3.61, 3.20) = 3.20

The distance between cluster (D, F) and cluster B is
d((D,F), B) = min(d(D,B), d(F,B)) = min(2.92, 2.50) = 2.50

Similarly, the distance between cluster (D, F) and cluster C is
d((D,F), C) = min(d(D,C), d(F,C)) = min(2.24, 2.50) = 2.24

Finally, the distance between cluster E and cluster (D, F) is calculated as
d((D,F), E) = min(d(D,E), d(F,E)) = min(1.00, 1.12) = 1.00
Then, the updated distance matrix becomes:
Dist A B C D,F E
A 0.00 0.71 5.66 3.20 4.24
B 0.71 0.00 4.95 2.50 3.54
C 5.66 4.95 0.00 2.24 1.41
D,F 3.20 2.50 2.24 0.00 1.00
E 4.24 3.54 1.41 1.00 0.00
Example 1 (cont’d)
• Looking at the lower-triangular part of the updated distance matrix (previous slide), the closest distance is between cluster B and cluster A (0.71). Thus, we group cluster A and cluster B into a single cluster named (A, B).
  – Now we update the distance matrix. Except for the first row and first column, all the other elements of the new distance matrix are unchanged.
New matrix (distances to the merged cluster (A, B) still to be computed):
Dist A,B C D,F E
A,B 0.00 ? ? ?
C ? 0.00 2.24 1.41
D,F ? 2.24 0.00 1.00
E ? 1.41 1.00 0.00

Previous matrix, for reference:
Dist A B C D,F E
A 0.00 0.71 5.66 3.20 4.24
B 0.71 0.00 4.95 2.50 3.54
C 5.66 4.95 0.00 2.24 1.41
D,F 3.20 2.50 2.24 0.00 1.00
E 4.24 3.54 1.41 1.00 0.00
Example 1 (cont’d)
• Using the input distance matrix (size 6 by 6), the distance between cluster C and cluster (D, F) is computed as
  d(C, (D,F)) = min(d(C,D), d(C,F)) = min(2.24, 2.50) = 2.24
• The distance between cluster (D, F) and cluster (A, B) is the minimum distance between all objects involved in the two clusters:
  d((D,F), (A,B)) = min(d(D,A), d(D,B), d(F,A), d(F,B)) = min(3.61, 2.92, 3.20, 2.50) = 2.50
• Similarly, the distance between cluster E and (A, B) is
  d(E, (A,B)) = min(d(E,A), d(E,B)) = min(4.24, 3.54) = 3.54
• Then the updated distance matrix is

Dist A,B C D,F E
A,B 0.00 4.95 2.50 3.54
C 4.95 0.00 2.24 1.41
D,F 2.50 2.24 0.00 1.00
E 3.54 1.41 1.00 0.00
Example 1 (cont’d)
• From the updated distance matrix, the closest distance is between cluster E and cluster (D, F), at distance 1.00.
  – Thus, we merge them into cluster ((D, F), E).
• The updated distance matrix is given below.
Dist A,B C (D,F),E
A,B 0.00 4.95 2.50
C 4.95 0.00 1.41
(D,F),E 2.50 1.41 0.00
Example 1 (cont’d)
• The distance between cluster ((D, F), E) and cluster (A, B) is calculated as
  d(((D,F),E), (A,B)) = min(d(D,A), d(D,B), d(F,A), d(F,B), d(E,A), d(E,B)) = min(3.61, 2.92, 3.20, 2.50, 4.24, 3.54) = 2.50
• The distance between cluster ((D, F), E) and cluster C yields the minimum distance of 1.41:
  d(((D,F),E), C) = min(d(D,C), d(F,C), d(E,C)) = min(2.24, 2.50, 1.41) = 1.41
  – Hence, we merge cluster ((D, F), E) and cluster C into a new cluster named (((D, F), E), C).
• The updated distance matrix, before and after merging C, is:

Dist A,B C (D,F),E
A,B 0.00 4.95 2.50
C 4.95 0.00 1.41
(D,F),E 2.50 1.41 0.00

Dist A,B ((D,F),E),C
A,B 0.00 2.50
((D,F),E),C 2.50 0.00
Example 1 (cont’d)

• The minimum distance of 2.50 is the result of the following computation:
  d((((D,F),E),C), (A,B)) = min(d(D,A), d(D,B), d(F,A), d(F,B), d(E,A), d(E,B), d(C,A), d(C,B))
                          = min(3.61, 2.92, 3.20, 2.50, 4.24, 3.54, 5.66, 4.95) = 2.50

Dist A,B ((D,F),E),C
A,B 0.00 2.50
((D,F),E),C 2.50 0.00
Example 1 (cont’d)
Summary
• In the beginning we have 6 clusters: A, B, C, D, E and F
• We merge cluster D and F into cluster (D, F) at distance 0.50
• We merge cluster A and cluster B into (A, B) at distance 0.71
• We merge cluster E and (D, F) into ((D, F), E) at distance 1.00
• We merge cluster ((D, F), E) and C into (((D, F), E), C) at distance 1.41
• We merge cluster (((D, F), E), C) and (A, B) into ((((D, F), E), C), (A, B)) at
distance 2.50
• The last cluster contains all the objects.
• Using this information, we can now draw the final dendrogram. The dendrogram is drawn based on the distances at which the clusters above were merged.
Example 1 (cont’d)

• The hierarchy is given as ((((D, F), E), C), (A, B)). We can also plot the clustering hierarchy in XY space using the original coordinates:
x1 x2
A 1 1
B 1.5 1.5
C 5 5
D 3 4
E 4 4
F 3 3.5
Example 2
• Consider the distance matrix:
  – The minimum distance is between clusters E and F (1.31); hence they are clustered together first.
Dist. A B C D E F
A 0 7.56 12.15 34.34 45.11 46.42
B 7.56 0 19.71 41.9 52.67 53.98
C 12.15 19.71 0 22.19 32.96 34.27
D 34.34 41.9 22.19 0 10.77 20.08
E 45.11 52.67 32.96 10.77 0 1.31
F 46.42 53.98 34.27 12.08 1.31 0


Example 2 (cont’d)
• A “?” in a row or column means that two elements have been merged at this point of the calculation, but the distances from the new cluster to the other points have not yet been computed.
Dist. A B C D E, F
A 0 7.56 12.15 34.34 ?
B 7.56 0 19.71 41.9 ?
C 12.15 19.71 0 22.19 ?
D 34.34 41.9 22.19 0 ?
E, F ? ? ? ? 0

• The minimum distance is between clusters A and B, at 7.56.
• Clusters A and B are grouped into a single cluster named (A, B).
Dist. A, B C D E, F
A, B 0 ? ? ?
C ? 0 22.19 32.96
D ? 22.19 0 10.77
E, F ? 32.96 10.77 0
Example 2 (cont’d)
• The missing distances to the new clusters are computed with the single-link rule:
  – Distance between cluster (E, F) and A is min(45.11, 46.42) = 45.11
  – Distance between cluster (E, F) and B is min(52.67, 53.98) = 52.67, so d((E, F), (A, B)) = 45.11
• From the resulting distance matrix (below), the closest distance between clusters is between cluster D and (E, F), at distance 10.77.
Dist. A, B C D E, F
A, B 0 12.15 34.34 45.11
C 12.15 0 22.19 32.96
D 34.34 22.19 0 10.77
E, F 45.11 32.96 10.77 0
Example 2 (cont’d)
• The closest distance between clusters is between cluster D and (E, F), at distance 10.77. Thus, these are clustered together into cluster ((E, F), D).
Dist. A, B C D E, F
A, B 0 12.15 34.34 45.11
C 12.15 0 22.19 32.96
D 34.34 22.19 0 10.77
E, F 45.11 32.96 10.77 0

• In the updated matrix (below), the minimum distance appears between cluster (A, B) and C, at distance 12.15. Thus, they are clustered together into ((A, B), C).
Dist. A, B C (E, F), D
A, B 0 12.15 34.34
C 12.15 0 22.19
(E, F), D 34.34 22.19 0
Example 2 (cont’d)
• The minimum distance appears between cluster (A, B) and C, at distance 12.15. Thus, they are clustered together into ((A, B), C).

Dist. ((A, B), C) ((E, F), D)


((A, B), C) 0 22.19
((E, F), D) 22.19 0

• From the distance matrix, ((E, F), D) and ((A, B), C) are finally merged, at distance 22.19, into the cluster {((E, F), D), ((A, B), C)}.
  – This cluster contains all the objects, and thus the agglomerative hierarchical clustering terminates.
Example 3
Example of Complete Linkage Clustering
• Clustering starts by computing the distance between every pair of units to be clustered; these distances form the distance matrix (shown as a table in the original slides).
• The smallest distance is between items 3 and 5, so they get merged first into the cluster "35".
• Using complete linkage clustering, the distance between "35" and every other item is the maximum of the distances between that item and 3 and between that item and 5.
• For example, d(1,3) = 3 and d(1,5) = 11. So D(1,"35") = max(3, 11) = 11. This gives us the new distance matrix.
  – If it had been average linkage clustering, the distance would be (3+11)/2 = 7 (see the small check below).
• The items with the smallest distance get clustered next. This will be 2 and 4.
Example 3 (cont’d)

• Now, clusters 1 and (2,4) will get • Updated Distance


clustered at a height of 9.
• Finally we have a cluster of all 5 Matrices:
objects. 1 2,4 3,5
• On this plot given below, the y-axis 1 0
shows the distance between the 2,4 9 0
objects at the time they were 3,5 11 10 0
clustered. This is called the cluster
height.
1, 2,4 3,5
1,2,4 0
3,5 11 0
Example 3 (cont’d)
• The slides also show the single linkage dendrogram for the same distance matrix. It again starts with the cluster "35", but the distance between "35" and each item is now the minimum of d(x,3) and d(x,5). So D(1,"35") = min(3, 11) = 3.
Divisive Clustering – Example 1
• Consider the graph and its adjacency matrix
[Figure: a weighted graph on the five nodes A, B, C, D, E whose edge weights are given by the adjacency matrix below.]

  A B C D E
A 0 1 2 2 3
B 1 0 2 4 3
C 2 2 0 1 5
D 2 4 1 0 3
E 3 3 5 3 0

• Reference: Margaret H. Dunham, “Data Mining: Introductory and Advanced Topics”, Pearson, 2012.
Example 1 ( cont’d)
The Minimum Spanning Tree (MST) of this graph keeps the edges A-B (weight 1), B-C (weight 2), C-D (weight 1) and D-E (weight 3).

[Figure: the graph from the previous slide with only the MST edges highlighted, shown alongside the weight matrix.]
Example 1 ( cont’d)
• Cut edges from the MST repeatedly, starting from the largest weight down to the smallest (see the sketch after this list).
• Step 1: All items are in one cluster: {A, B, C, D, E}
• Step 2: The largest MST edge is between D and E; cutting it results in 2 clusters: {E}, {A, B, C, D}
• Step 3: Removing the edge between B and C results in {E}, {A, B}, {C, D}
• Step 4: Removing the edges between A and B (and between C and D) results in {E}, {A}, {B}, {C}, {D}
References
• https://newonlinecourses.science.psu.edu/stat555/node/86/
• https://people.revoledu.com/kardi/tutorial/Clustering/Numerical%20Example.htm
• Margaret H. Dunham, “Data Mining: Introductory and Advanced Topics”, Pearson, 2012.
