Hierarchical Clustering Algorithm
ROSHINI SELVAKUMAR
2021503041
INTRODUCTION
A hierarchical clustering method works by grouping data into a tree of clusters.
Hierarchical clustering begins by treating every data point as a separate cluster.
Then, it repeatedly executes the following two steps:
1. Identify the two clusters that are closest together, and
2. Merge the two closest clusters into one.
This continues until all the clusters are merged together.
Agglomerative Clustering
Initially, consider every data point as an individual cluster; at every step, merge
the nearest pair of clusters. (It is a bottom-up method.)
At first, every data point is considered an individual entity or cluster.
At every iteration, the closest clusters merge, until only one cluster is
formed.
AGGLOMERATIVE CLUSTERING
The algorithm for agglomerative hierarchical clustering is:
1. Consider every data point as an individual cluster.
2. Calculate the similarity of each cluster with all the other clusters (the proximity matrix): compute the distance between each pair of data points using a distance function, such as Euclidean distance, and fill the matrix with these distances. The proximity matrix is a square matrix with dimensions n x n, where n is the number of data points.
3. Merge the clusters that are most similar, i.e. closest to each other.
4. Recalculate the proximity matrix for the merged cluster.
5. Repeat Steps 3 and 4 until only a single cluster remains.
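The steps above can be sketched as a minimal single-linkage implementation in Python. The function name `agglomerative` and the sample points are illustrative, not from the slides; real work would normally use a library such as scipy or scikit-learn.

```python
import math

def agglomerative(points, num_clusters=1):
    """Single-linkage agglomerative clustering (bottom-up) sketch.

    `points` is a list of coordinate tuples; each starts as its own cluster.
    The two closest clusters are merged repeatedly until `num_clusters` remain.
    """
    # Step 1: every data point is an individual cluster
    clusters = [[p] for p in points]

    def cluster_dist(c1, c2):
        # Single linkage: distance between the closest pair of members
        return min(math.dist(p, q) for p in c1 for q in c2)

    while len(clusters) > num_clusters:
        # Steps 2-3: find and merge the closest pair of clusters
        i, j = min(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda ij: cluster_dist(clusters[ij[0]], clusters[ij[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
        # Steps 4-5: the proximity is recomputed on the next loop iteration
    return clusters

# Illustrative usage: two obvious groups in the plane
points = [(0, 0), (0, 1), (5, 5), (5, 6)]
print(agglomerative(points, num_clusters=2))
# → [[(0, 0), (0, 1)], [(5, 5), (5, 6)]]
```

Recomputing every pairwise cluster distance each iteration keeps the sketch short but costs O(n^3) overall; library implementations cache the proximity matrix instead.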
AGGLOMERATIVE CLUSTERING -
EXAMPLE
• Step-1: Consider each letter as a single cluster and calculate the distance of each cluster from all the
other clusters.
• Step-2: Comparable clusters are merged together to form a single cluster. Say cluster (B) and
cluster (C) are very similar to each other, so we merge them; similarly for clusters (D) and (E).
We are left with the clusters [(A), (BC), (DE), (F)].
• Step-3: We recalculate the proximity matrix according to the algorithm and merge the two nearest
clusters, (DE) and (F), to form the new clusters [(A), (BC), (DEF)].
• Step-4: Repeating the same process, the clusters (DEF) and (BC) are the most comparable and are
merged together. We are now left with the clusters [(A), (BCDEF)].
• Step-5: Finally, the two remaining clusters are merged together to form a single cluster [(ABCDEF)].
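The walk-through above can be replayed in code. The 1-D coordinates below are hypothetical, chosen only so that single-linkage merging reproduces the slide's merge order; the slides themselves give no coordinates.

```python
# Hypothetical coordinates chosen so the merge order matches the example.
coords = {"A": 0.0, "B": 10.0, "C": 10.5, "D": 20.0, "E": 20.6, "F": 21.5}

clusters = [[name] for name in coords]  # each letter starts as its own cluster

def linkage(c1, c2):
    # Single linkage on the 1-D coordinates
    return min(abs(coords[p] - coords[q]) for p in c1 for q in c2)

merges = []
while len(clusters) > 1:
    # Find and merge the closest pair, recording the state after each merge
    i, j = min(
        ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
        key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]),
    )
    clusters[i] = clusters[i] + clusters[j]
    del clusters[j]
    merges.append(["".join(c) for c in clusters])

for state in merges:
    print(state)
# → ['A', 'BC', 'D', 'E', 'F']
#   ['A', 'BC', 'DE', 'F']
#   ['A', 'BC', 'DEF']
#   ['A', 'BCDEF']
#   ['ABCDEF']
```

Each printed line corresponds to one step of the example: (B, C) merge first, then (D, E), then (DE, F), then (BC, DEF), and finally everything collapses into (ABCDEF).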
DIVISIVE CLUSTERING
We can say that divisive hierarchical clustering is
precisely the opposite of agglomerative hierarchical
clustering.
It is a top-down method.
In divisive hierarchical clustering, we take all of the
data points as a single cluster and, in every iteration, we
split off the data points that are not comparable to the rest
of their cluster.
In the end, we are left with N clusters, where N is the
number of data points.
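The top-down idea can be sketched as follows. The splitting rule here (seed each split with the two farthest-apart members of the widest cluster) is one simple illustrative choice, not prescribed by the slides; real divisive methods such as DIANA use more refined criteria.

```python
import math

def divisive(points, num_clusters):
    """Top-down (divisive) clustering sketch.

    Start with all points in one cluster; repeatedly split the cluster with
    the largest diameter by seeding with its two farthest-apart members and
    assigning every member to the nearer seed.
    """
    clusters = [list(points)]  # one all-inclusive cluster

    def diameter(c):
        return max((math.dist(p, q) for p in c for q in c), default=0.0)

    while len(clusters) < num_clusters:
        widest = max(clusters, key=diameter)
        if diameter(widest) == 0.0:
            break  # nothing left to split
        # The two farthest members seed the two new clusters
        a, b = max(((p, q) for p in widest for q in widest),
                   key=lambda pq: math.dist(*pq))
        left = [p for p in widest if math.dist(p, a) <= math.dist(p, b)]
        right = [p for p in widest if math.dist(p, a) > math.dist(p, b)]
        clusters.remove(widest)
        clusters += [left, right]
    return clusters

# Illustrative usage: the single starting cluster splits into two groups
print(divisive([(0, 0), (0, 1), (5, 5), (5, 6)], num_clusters=2))
```

Running the loop until every cluster holds one point yields the N singleton clusters the slide describes.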
ADVANTAGES AND DISADVANTAGES
ADVANTAGES:
• The ability to handle non-convex clusters and clusters of different sizes and densities.
• The ability to handle missing data and noisy data.
• The ability to reveal the hierarchical structure of the data, which can be useful for understanding the relationships among the clusters.
DISADVANTAGES:
• The need for a criterion to stop the clustering process and determine the final number of clusters.
• The computational cost and memory requirements of the method can be high, especially for large datasets.
• The results can be sensitive to the initial conditions, linkage criterion, and distance metric used.