
HIERARCHICAL CLUSTERING
Unsupervised ML

HIERARCHICAL CLUSTERING ALGORITHM
Hierarchical clustering, also called hierarchical cluster analysis (HCA), is an unsupervised clustering algorithm that creates clusters with a predominant ordering from top to bottom.
For example, all files and folders on a hard disk are organized in a hierarchy.
The algorithm groups similar objects into groups called clusters. The endpoint is a set of clusters, where each cluster is distinct from every other cluster, and the objects within each cluster are broadly similar to each other.
This clustering technique is divided into two types:
Agglomerative Hierarchical Clustering
Divisive Hierarchical Clustering
AGGLOMERATIVE HIERARCHICAL CLUSTERING
Agglomerative hierarchical clustering is the most common type of hierarchical clustering, used to group objects into clusters based on their similarity. It is also known as AGNES (Agglomerative Nesting). It is a “bottom-up” approach: each observation starts in its own cluster, and pairs of clusters are merged as one moves up the hierarchy.
HOW DOES IT WORK?
1. Make each data point a single-point cluster → forms N clusters.
2. Take the two closest data points and merge them into one cluster → forms N−1 clusters.
3. Take the two closest clusters and merge them into one cluster → forms N−2 clusters.
4. Repeat step 3 until you are left with only one cluster.
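A minimal sketch of this bottom-up procedure, using scipy on made-up 2-D points (the data and the choice of single linkage are illustrative assumptions):

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Made-up 2-D observations for illustration
X = np.array([[1.0, 2.0], [1.5, 1.8], [5.0, 8.0],
              [8.0, 8.0], [1.0, 0.6], [9.0, 11.0]])

# Each row of Z records one merge: the indices of the two clusters
# merged, the distance at which they merged, and the new cluster size.
# N points produce N-1 merges, ending in a single cluster.
Z = linkage(X, method="single", metric="euclidean")
print(Z)

# Cut the hierarchy into a flat assignment of, say, 2 clusters
labels = fcluster(Z, t=2, criterion="maxclust")
print(labels)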
LINKAGE METHODS FOR CLUSTER OBSERVATIONS
There are several ways to measure the distance between clusters in order to decide the rules for clustering; these are often called linkage methods. Some of the common linkage methods are:
Complete-linkage: the distance between two clusters is defined as the longest distance between a point in one cluster and a point in the other. When clusters k and l merge into cluster m, its distance to any other cluster j is
d(m, j) = max(d(k, j), d(l, j))
Single-linkage: the distance between two clusters is defined as the shortest distance between a point in one cluster and a point in the other. Because outliers stay far from everything else, single linkage merges them only at the very end, which can help you detect them.
d(m, j) = min(d(k, j), d(l, j))
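To illustrate these update rules, here is a small sketch (the distance matrix values are made up) that merges two clusters and updates the distances under complete or single linkage:

import numpy as np

def merge_update(D, k, l, mode="complete"):
    # Merge clusters k and l into m; the merged cluster reuses row k.
    # Complete linkage: d(m, j) = max(d(k, j), d(l, j))
    # Single linkage:   d(m, j) = min(d(k, j), d(l, j))
    agg = np.maximum if mode == "complete" else np.minimum
    row = agg(D[k], D[l])
    D = D.copy()
    D[k], D[:, k] = row, row
    D[k, k] = 0.0
    keep = [i for i in range(len(D)) if i != l]
    return D[np.ix_(keep, keep)]

# Made-up symmetric distance matrix for three clusters
D = np.array([[0.0, 2.0, 6.0],
              [2.0, 0.0, 5.0],
              [6.0, 5.0, 0.0]])
print(merge_update(D, 0, 1, "complete"))  # d(m, j) = max(6, 5) = 6
print(merge_update(D, 0, 1, "single"))    # d(m, j) = min(6, 5) = 5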
LINKAGE METHODS
Average-linkage: the distance between two clusters is defined as the average distance between each point in one cluster and every point in the other cluster.
Centroid-linkage: finds the centroid of cluster 1 and the centroid of cluster 2, and then calculates the distance between the two before merging.
LINKAGE METHODS
Ward: with Ward's linkage method, the distance between two clusters is based on the sum of squared deviations from points to centroids. The objective of Ward's linkage is to minimize the within-cluster sum of squares.
Merging clusters A and B costs the increase in that sum:
Δ(A, B) = Σ_{x ∈ A∪B} ||x − c_(A∪B)||² − Σ_{x ∈ A} ||x − c_A||² − Σ_{x ∈ B} ||x − c_B||²
where c denotes the centroid of a cluster.
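A quick comparison of the linkage methods above with scipy (the data is made up; note that "centroid" and "ward" assume Euclidean distances):

import numpy as np
from scipy.cluster.hierarchy import linkage

rng = np.random.default_rng(0)
X = rng.normal(size=(10, 2))  # made-up 2-D observations

for method in ["single", "complete", "average", "centroid", "ward"]:
    Z = linkage(X, method=method)
    # The last row of Z is the final merge; column 2 holds its distance
    print(f"{method:9s} final merge distance: {Z[-1, 2]:.3f}")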
WHAT IS A DENDROGRAM?
A dendrogram is a type of tree diagram showing hierarchical relationships between different sets of data.
A dendrogram records the memory of the hierarchical clustering algorithm, so just by looking at it you can tell how each cluster was formed.
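As a minimal sketch (with made-up data), a dendrogram like the one on the next slide can be drawn with scipy and matplotlib:

import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

rng = np.random.default_rng(42)
X = rng.normal(size=(8, 2))    # made-up observations

Z = linkage(X, method="ward")  # full merge history
dendrogram(Z)                  # height of each join = merge distance
plt.xlabel("observation index")
plt.ylabel("distance")
plt.show()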
DENDROGRAM
EXAMPLE: HIERARCHICAL CLUSTER ANALYSIS
We asked people how many hours a week they spend on social media platforms and at the gym.
HOW IS A HIERARCHICAL CLUSTER ANALYSIS CALCULATED?
With this we can now start to create the clusters. In the first step we assign a cluster to each point, so we have as many clusters as we have persons.
The goal now is to merge more and more clusters little by little, until finally all points are in one cluster.
For this we need to determine two things:
1. How the distance between two points is measured.
2. How points in a cluster are connected.
DISTANCE BETWEEN TWO POINTS
Let's start with the question: how do we calculate the distance between two points? The most common distance measures are:
Euclidean Distance
Manhattan Distance
Maximum Distance
EUCLIDEAN DISTANCE
The straight-line distance between two points a and b:
d(a, b) = √((a₁ − b₁)² + (a₂ − b₂)²)
MANHATTAN DISTANCE
The sum of the absolute coordinate differences:
d(a, b) = |a₁ − b₁| + |a₂ − b₂|
MAXIMUM DISTANCE
The largest single coordinate difference (also called the Chebyshev distance):
d(a, b) = max(|a₁ − b₁|, |a₂ − b₂|)
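A minimal sketch of the three measures for two made-up points (hours on social media, hours at the gym):

import numpy as np

a = np.array([3.0, 7.0])  # person A: made-up weekly hours
b = np.array([1.0, 2.0])  # person B

euclidean = np.sqrt(np.sum((a - b) ** 2))  # straight-line distance
manhattan = np.sum(np.abs(a - b))          # sum of absolute differences
maximum = np.max(np.abs(a - b))            # largest coordinate difference

print(euclidean, manhattan, maximum)  # ≈5.385, 7.0, 5.0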
LINKING METHOD
Now that we know how to calculate the distances between points, we need to determine how to link the points within a cluster.
SINGLE-LINKAGE
Single-linkage uses the distance between the closest elements of the two clusters. In our example this is the distance between Caro and Joe.
COMPLETE-LINKAGE
Complete-linkage uses the distance between the farthest elements of the two clusters, here between Max and Joe.
AVERAGE-LINKAGE
Average-linkage uses the average of all pairwise distances: the distance is calculated for every combination of a point from one cluster and a point from the other, and these distances are averaged.
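To make this concrete, here is a small sketch. The names Caro, Max, and Joe follow the slides, but their weekly hours are made-up assumptions; with these numbers, single linkage is indeed the Caro-Joe distance and complete linkage the Max-Joe distance:

import numpy as np
from itertools import product

# Made-up (social media hours, gym hours) per person
points = {"Caro": np.array([2.0, 5.0]),
          "Max": np.array([1.0, 7.0]),
          "Joe": np.array([6.0, 1.0])}

cluster_a, cluster_b = ["Caro", "Max"], ["Joe"]
dists = {(p, q): np.linalg.norm(points[p] - points[q])
         for p, q in product(cluster_a, cluster_b)}

print("single  :", min(dists.values()))              # closest pair (Caro, Joe)
print("complete:", max(dists.values()))              # farthest pair (Max, Joe)
print("average :", sum(dists.values()) / len(dists))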
DIVISIVE HIERARCHICAL CLUSTERING
Divisive clustering, or DIANA (DIvisive ANAlysis Clustering), is a top-down clustering method: we assign all of the observations to a single cluster and then partition that cluster into the two least similar clusters. We proceed recursively on each cluster until there is one cluster for each observation. This clustering approach is exactly the opposite of agglomerative clustering.
THE STEPS TO FORM DIVISIVE CLUSTERING
Step 1: Start with all data points in one cluster.
Step 2: After each iteration, remove the “outsiders” from the least cohesive cluster.
Step 3: Stop when each example is in its own singleton cluster; otherwise go to step 2.
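DIANA itself is not shipped with scipy or scikit-learn. As a hedged stand-in for the top-down idea, the sketch below recursively bisects the largest cluster with 2-means (bisecting k-means); note this uses a splitting rule different from DIANA's "outsider" heuristic:

import numpy as np
from sklearn.cluster import KMeans

def divisive(X, n_clusters):
    # Top-down sketch: start with one cluster, repeatedly bisect
    # the most populous cluster until n_clusters remain.
    labels = np.zeros(len(X), dtype=int)
    while labels.max() + 1 < n_clusters:
        target = np.bincount(labels).argmax()
        idx = np.where(labels == target)[0]
        km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X[idx])
        labels[idx[km.labels_ == 1]] = labels.max() + 1
    return labels

# Made-up data: two loose blobs
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 1, (10, 2)), rng.normal(6, 1, (10, 2))])
print(divisive(X, 2))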
DENDROGRAM
