Hierarchical Clustering in Machine Learning
Dendrogram
A dendrogram, a tree-like figure produced by
hierarchical clustering, depicts the hierarchical
relationships between groups.
Individual data points are located at the bottom of
the dendrogram,
while the largest clusters, which include all the data
points, are located at the top.
In order to generate different numbers of clusters, the
dendrogram can be sliced at various heights.
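As a minimal illustration of cutting the tree at different heights, the sketch below assumes a small made-up 2-D NumPy array of observations and uses SciPy's linkage and fcluster functions (both of which appear again later in these slides).

import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Made-up sample observations for illustration only.
X = np.array([[1.0, 1.0], [1.5, 1.2], [5.0, 5.2], [5.3, 4.9], [9.0, 0.5]])
Z = linkage(X, method='ward')  # build the hierarchy (the dendrogram's merge tree)

# Cutting the dendrogram at different heights yields different cluster counts.
print(fcluster(Z, t=1.0, criterion='distance'))  # lower cut height -> more clusters
print(fcluster(Z, t=8.0, criterion='distance'))  # higher cut height -> fewer clusters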
A dendrogram
The x-axis of the dendrogram represents the individual data observations (the leaves),
while the y-axis represents the distance (for example, the Euclidean distance) at which observations or clusters are merged.
Hierarchical clustering types
1. Agglomerative: Initially, each object is considered to be its own cluster. According to a particular criterion, the clusters are then merged step by step until a single cluster remains. At the end of the merging process, one cluster containing all the elements is formed.
Hierarchical clustering types
2. Divisive: The Divisive method is the opposite of the Agglomerative method. Initially, all objects are considered to be in a single cluster. The division process is then performed step by step until each object forms its own cluster. The splitting is carried out according to a criterion such as the maximum distance between neighboring objects in the cluster.
Agglomerative
Agglomerative: The steps for agglomerative clustering can be summarized as follows:
1. Treat each data point as a single cluster and compute the proximity (distance) matrix.
2. Merge the two closest clusters.
3. Update the proximity matrix to reflect the newly formed cluster.
4. Repeat steps 2 and 3 until only a single cluster remains.
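The following from-scratch sketch mirrors these steps using single (minimum-distance) linkage; the small data array is a made-up example, and in practice the SciPy and Scikit-Learn functions shown later are used instead.

import numpy as np

def naive_agglomerative(X, n_clusters=1):
    # Step 1: every observation starts as its own cluster.
    clusters = [[i] for i in range(len(X))]
    while len(clusters) > n_clusters:
        # Step 2: find the pair of clusters with the smallest
        # single-linkage (minimum pairwise) distance.
        best = (0, 1, np.inf)
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = min(np.linalg.norm(X[i] - X[j])
                        for i in clusters[a] for j in clusters[b])
                if d < best[2]:
                    best = (a, b, d)
        a, b, _ = best
        # Step 3: merge the two closest clusters and repeat.
        clusters[a].extend(clusters[b])
        del clusters[b]
    return clusters

X = np.array([[1.0, 1.0], [1.5, 1.2], [5.0, 5.2], [5.3, 4.9], [9.0, 0.5]])
print(naive_agglomerative(X, n_clusters=2))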
Euclidean Distance
The Pythagorean Theorem can be used to calculate the
distance between two points, as shown in the figure below.
If the points are (x1, y1) and (x2, y2) in 2-dimensional space, the distance is d = sqrt((x2 - x1)^2 + (y2 - y1)^2).
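A quick numeric check of the formula, using two made-up example points; math.dist is the standard-library helper that computes the same value.

import math

p1, p2 = (1, 2), (4, 6)
d = math.sqrt((p2[0] - p1[0]) ** 2 + (p2[1] - p1[1]) ** 2)
print(d)                  # 5.0
print(math.dist(p1, p2))  # same result via the standard library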
Manhattan Distance
Euclidean distance may not be suitable for measuring the distance between different locations, for example when movement is restricted to a street grid.
The Manhattan distance is the sum of the absolute horizontal and vertical components: d = |x2 - x1| + |y2 - y1|.
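The same made-up points as above illustrate the Manhattan distance: it is simply the sum of the absolute coordinate differences.

p1, p2 = (1, 2), (4, 6)
manhattan = abs(p2[0] - p1[0]) + abs(p2[1] - p1[1])
print(manhattan)  # 3 + 4 = 7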
Computing a proximity matrix
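A minimal sketch of building a proximity matrix, assuming a small made-up 2-D array of observations; pdist computes all pairwise distances and squareform arranges them into a symmetric matrix.

import numpy as np
from scipy.spatial.distance import pdist, squareform

X = np.array([[1.0, 1.0], [1.5, 1.2], [5.0, 5.2], [5.3, 4.9]])
proximity = squareform(pdist(X, metric='euclidean'))
print(np.round(proximity, 2))  # symmetric matrix of pairwise Euclidean distances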
Similarity between Clusters
The main question in hierarchical clustering is
how to calculate the distance between clusters
and update the proximity matrix.
There are many different approaches used to
answer that question.
The choice will depend on whether there is noise in the data set, whether the shape of the clusters is circular or not, and the density of the data points.
A numerical example
There are two clusters in the sample data set, as shown in the figure.
Min (Single) Linkage
One way to measure the distance between
clusters is to find the minimum distance
between points in those clusters.
That is, we can find the point in the first
cluster nearest to a point in the other cluster
and calculate the distance between those
points.
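A small sketch of the single (min) linkage idea with two made-up example clusters: the cluster-to-cluster distance is the smallest entry in the matrix of pairwise point distances.

import numpy as np
from scipy.spatial.distance import cdist

cluster_a = np.array([[1.0, 1.0], [1.5, 1.2], [1.2, 0.8]])
cluster_b = np.array([[5.0, 5.2], [5.3, 4.9]])
print(cdist(cluster_a, cluster_b).min())  # single-linkage (minimum) distance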
Max (Complete) Linkage
Another approach is to use the maximum distance between points in the two clusters.
That is, we find the points in each cluster that are furthest away from each other and calculate the distance between those points.
Max (Complete) Linkage
MAX is less sensitive to noise and outliers than the MIN method.
However, MAX can break large clusters and tends to be biased towards globular clusters.
Centroid Linkage
The Centroid method defines the distance between clusters as being the
distance between their centers/centroids.
After calculating the centroid for each cluster, the distance between those
centroids is computed using a distance function.
Average Linkage
The Average method defines the distance between clusters as
the average pairwise distance among all pairs of points in the
clusters.
For simplicity, only some of the lines connecting pairs of points are shown in the figure.
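For the same kind of made-up example clusters, the complete (max), average, and centroid linkage distances can be sketched directly from the pairwise distance matrix and the cluster centroids:

import numpy as np
from scipy.spatial.distance import cdist

cluster_a = np.array([[1.0, 1.0], [1.5, 1.2], [1.2, 0.8]])
cluster_b = np.array([[5.0, 5.2], [5.3, 4.9]])
pairwise = cdist(cluster_a, cluster_b)

print(pairwise.max())   # complete (max) linkage: furthest pair of points
print(pairwise.mean())  # average linkage: mean of all pairwise distances
print(np.linalg.norm(cluster_a.mean(axis=0) - cluster_b.mean(axis=0)))  # centroid linkage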
Ward Linkage
The Ward approach analyzes the variance of the clusters rather than measuring distances directly, minimizing the increase in within-cluster variance at each merge.
With the Ward method, the distance between
two clusters is related to how much the sum of
squares (SS) value will increase when
combined.
Ward Linkage
In other words, the Ward method attempts to
minimize the sum of the squared distances of
the points from the cluster centers.
Compared to the distance-based measures, the
Ward method is less susceptible to noise and
outliers.
Therefore, Ward's method is often preferred over the other linkage methods.
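The sketch below illustrates the Ward idea numerically for two made-up clusters: merging them increases the total within-cluster sum of squares, and Ward linkage joins the pair of clusters for which this increase is smallest.

import numpy as np

def ss(points):
    # Sum of squared distances of the points from their centroid.
    return ((points - points.mean(axis=0)) ** 2).sum()

a = np.array([[1.0, 1.0], [1.5, 1.2], [1.2, 0.8]])
b = np.array([[5.0, 5.2], [5.3, 4.9]])
increase = ss(np.vstack([a, b])) - (ss(a) + ss(b))
print(increase)  # the SS increase caused by merging clusters a and b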
Hierarchical Clustering with Python
In Python, the Scipy and Scikit-Learn libraries provide functions for hierarchical clustering.
First, we'll import NumPy, matplotlib, and seaborn
(for plot styling):
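The slides' import-and-setup code is not reproduced in this text, so the sketch below is an assumed stand-in: it imports NumPy, matplotlib, and seaborn, and generates a small sample data set with scikit-learn's make_blobs (the data set used in the original slides may differ).

import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.datasets import make_blobs

sns.set()  # seaborn plot styling

# Hypothetical sample data: 50 two-dimensional points around 3 centers.
X, _ = make_blobs(n_samples=50, centers=3, cluster_std=1.0, random_state=42)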
Hierarchical Clustering with Python
We can graph this data set as a scatter plot:
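Continuing the assumed setup above, a scatter plot of the stand-in data set:

import matplotlib.pyplot as plt

plt.scatter(X[:, 0], X[:, 1], s=40)
plt.show()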
Hierarchical Clustering using Scipy
The Scipy library has the linkage function for
hierarchical (agglomerative) clustering.
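A sketch of calling linkage on the stand-in data set X from the setup above, using the Ward method (the method referred to later with fcluster):

from scipy.cluster.hierarchy import linkage

Z = linkage(X, method='ward')  # Z encodes the sequence of cluster merges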
Hierarchical Clustering using Scipy
By passing the linkage matrix to the dendrogram function, we can view a plot of these linkages with matplotlib:
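A sketch of plotting the dendrogram for the linkage matrix Z computed above:

import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import dendrogram

dendrogram(Z)
plt.show()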
RESULT: Dendrogram
Hierarchical Clustering using Scipy
Finally, let's use the fcluster function to find
the clusters for the Ward linkage:
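A sketch of extracting flat cluster labels from the Ward linkage, assuming we ask for 3 clusters to match the stand-in data generated above:

from scipy.cluster.hierarchy import fcluster

labels = fcluster(Z, t=3, criterion='maxclust')
print(labels)  # cluster id (1..3) for each observation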
Hierarchical Clustering using Scikit-Learn
Hierarchical Clustering using Scikit-Learn
Using sklearn is slightly different from using scipy.
We need to import the AgglomerativeClustering
class, then instantiate it with the number of desired
clusters and the distance (linkage) function to use.
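A sketch of the scikit-learn version, again assuming the stand-in data set X and 3 clusters:

from sklearn.cluster import AgglomerativeClustering

model = AgglomerativeClustering(n_clusters=3, linkage='ward')
labels = model.fit_predict(X)  # cluster id (0..2) for each observation
print(labels)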
Hierarchical Clustering using Scikit-Learn
Result:
Clustering a real dataset
This example uses a dataset from the book Biostatistics with R, which contains information on nine different protein sources and their respective consumption across various countries.
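A hedged sketch of clustering this data with pandas and SciPy; the file name protein.csv and its column layout are assumptions, so adjust them to your copy of the Biostatistics with R data.

import pandas as pd
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

# Assumed layout: rows are countries, columns are the nine protein sources.
df = pd.read_csv('protein.csv', index_col=0)
Z = linkage(df.values, method='ward')
dendrogram(Z, labels=df.index.to_list())
plt.show()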
Dendrogram
Result