
Hierarchical Clustering

Presented by:
Ravi Ranjan
Anisha Bharti
Runu Sukeerti
Clustering in Machine Learning
Definition: Unsupervised learning technique for grouping data points into clusters based on similarity.

Types of Clustering Methods

1 Partitioning Methods
The K-Means algorithm divides data into non-overlapping subsets. It aims to minimize within-cluster distances.

2 Hierarchical Clustering
Creates a tree of clusters. Includes agglomerative (bottom-up) and divisive (top-down) approaches.

3 Density-Based Methods
The DBSCAN algorithm finds clusters based on density. It can detect clusters of arbitrary shape.

4 Model-Based Clustering
Gaussian Mixture Models assume data comes from a mixture of probability distributions.
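As a quick illustration of the four families, the sketch below runs one representative algorithm from each on the same toy dataset; the data, parameter values, and use of scikit-learn are illustrative assumptions rather than part of the original slides.

# One representative algorithm per clustering family (illustrative sketch).
import numpy as np
from sklearn.cluster import KMeans, AgglomerativeClustering, DBSCAN
from sklearn.mixture import GaussianMixture

X = np.random.RandomState(0).rand(60, 2)                                         # toy 2-D data

kmeans_labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X)   # partitioning
hier_labels = AgglomerativeClustering(n_clusters=3).fit_predict(X)               # hierarchical
dbscan_labels = DBSCAN(eps=0.15, min_samples=5).fit_predict(X)                   # density-based
gmm_labels = GaussianMixture(n_components=3, random_state=0).fit_predict(X)      # model-based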
Hierarchical Clustering (Definition)
Hierarchical clustering is a method of clustering that builds a hierarchy of clusters. It
creates a tree-like structure (called a dendrogram), where data points are grouped
based on their similarity. It can either start by treating each data point as a separate
cluster and then merging them (agglomerative) or by treating all data points as a
single cluster and splitting them (divisive).

Need for Hierarchical Clustering

No need to specify the number of clusters: Unlike K-means, you don’t have to
define the number of clusters beforehand.

Dendrogram visualization: It provides a visual representation (dendrogram) that shows the cluster formation process and helps in determining the optimal number of clusters.

Suitable for small datasets: Works well with smaller datasets where visual interpretation and understanding of cluster hierarchies are important.

Captures nested clusters: Useful for data with a natural hierarchical structure, like
taxonomies or evolutionary trees.
Types of Hierarchical Clustering
Agglomerative Clustering: Bottom-Up Approach
How it Works: This method starts with each data point as its own individual cluster. For example, if you have 100 data
points, you will initially have 100 separate clusters.

Merging Process: Then, similar clusters are identified based on their proximity (using distance metrics like Euclidean
distance). After that, these similar clusters are merged to form larger clusters.

Step-by-Step Merging: This process continues until all clusters are merged into a single cluster that represents the
entire dataset.

When to Use: Agglomerative clustering is suitable for small to medium-sized datasets. This approach can be slower
and may not be efficient for large datasets.
Agglomerative Clustering: Bottom-Up Approach
Individual Clusters
Start with each data point as a separate cluster. For 100 points, begin with
100 clusters.

Similarity Calculation
Compute distances between clusters. Use metrics like Euclidean distance
to measure similarity.

Merging
Combine the most similar clusters. This process continues until one
cluster remains.

Dendrogram Analysis
Examine the dendrogram to determine optimal cluster number. Cut the
tree at appropriate level.
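A minimal sketch of these steps, assuming scikit-learn is available; the six example points and the choice of average linkage are illustrative.

# Agglomerative (bottom-up) clustering on six points that form two groups.
import numpy as np
from sklearn.cluster import AgglomerativeClustering

X = np.array([[1, 2], [1, 4], [1, 0],
              [10, 2], [10, 4], [10, 0]])

# Each point starts as its own cluster; the closest clusters (Euclidean
# distance by default) are merged repeatedly until n_clusters remain.
model = AgglomerativeClustering(n_clusters=2, linkage="average")
labels = model.fit_predict(X)
print(labels)   # e.g. [0 0 0 1 1 1]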
Divisive Clustering: Top-Down Approach
How it Works: In this method, the entire dataset is initially treated as a single cluster. This means all data points are
grouped together at the start.

Splitting Process: The single cluster is then gradually divided into smaller clusters based on their similarity or
distance. This process is recursive, meaning that at each step, a cluster is further split until each data point has its own
cluster.
Step-by-Step Splitting: Clusters are divided based on similarity or dissimilarity, and this continues until meaningful,
smaller clusters are formed.

Dendrogram: A dendrogram is also used in this method, but it starts from the top and moves downward as smaller
clusters are created. In this dendrogram, you can see where to stop splitting to obtain optimal clusters.

When to Use: Divisive clustering is generally used when you have a large dataset or when you want to display a clear hierarchical structure. It can be more computationally intensive than the agglomerative approach, but it can still be effective at that scale.

Divisive Clustering: Top-Down Approach
1 Single Cluster
Start with all data points in one cluster. This represents
the entire dataset.

2 Splitting Process
Divide the cluster based on dissimilarity. Create
smaller, more homogeneous groups.

3 Recursive Division
Continue splitting until each data point is its own cluster, or stop at the desired level.
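Neither SciPy nor scikit-learn ships a ready-made divisive algorithm, so the sketch below illustrates the top-down idea with a simple bisecting heuristic: the largest cluster is repeatedly split in two with 2-means. The splitting rule and the toy data are assumptions for illustration, not something the slides prescribe.

# Divisive (top-down) clustering via repeated bisection (illustrative sketch).
import numpy as np
from sklearn.cluster import KMeans

def divisive_clustering(X, n_clusters):
    clusters = [np.arange(len(X))]                       # start: one cluster holding every point
    while len(clusters) < n_clusters:
        # split the largest remaining cluster into two
        idx = max(range(len(clusters)), key=lambda i: len(clusters[i]))
        members = clusters.pop(idx)
        split = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X[members])
        clusters.append(members[split == 0])
        clusters.append(members[split == 1])
    labels = np.empty(len(X), dtype=int)
    for label, members in enumerate(clusters):
        labels[members] = label
    return labels

X = np.vstack([np.random.RandomState(seed).normal(center, 0.3, size=(20, 2))
               for seed, center in enumerate([0, 3, 6])])
print(divisive_clustering(X, 3))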
Distance Metrics and Linkage Criteria
Understanding distance metrics and linkage criteria is crucial in
the field of clustering, as they determine how similarity between
data points is measured and how clusters are formed. This
overview explores the key concepts and their applications.
Euclidean Distance
Definition: Euclidean distance is the straight-line distance between two points in a multidimensional space. It is the most commonly used distance metric in clustering algorithms.

Use Cases: Euclidean distance is particularly effective when the data has a clear cluster structure and the clusters are compact and well-separated.

Limitations: It can be sensitive to differences in scale and may not perform well when the data has varying densities or complex shapes.
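For two example points a and b, the computation looks like this (the vectors are arbitrary):

# Euclidean distance: square root of the summed squared coordinate differences.
import numpy as np
from scipy.spatial.distance import euclidean

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 6.0, 3.0])

d_manual = np.sqrt(np.sum((a - b) ** 2))
d_scipy = euclidean(a, b)
print(d_manual, d_scipy)   # both 5.0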
Manhattan Distance
Definition: Also known as the city block distance, Manhattan distance measures the sum of the absolute differences between the coordinates of two points.

Characteristics: Manhattan distance is less sensitive to outliers and more robust to noise than Euclidean distance.

Applications: It is often used in tasks where the data has a grid-like structure, such as image processing and text analysis.

Comparison: Manhattan distance is generally more computationally efficient than Euclidean distance, making it a popular choice for large-scale clustering problems.
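The same two example points give a Manhattan distance of 7 (the vectors are again arbitrary):

# Manhattan (city block) distance: sum of absolute coordinate differences.
import numpy as np
from scipy.spatial.distance import cityblock

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 6.0, 3.0])

d_manual = np.sum(np.abs(a - b))   # |1-4| + |2-6| + |3-3| = 7
d_scipy = cityblock(a, b)
print(d_manual, d_scipy)           # both 7.0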
Cosine Similarity

Vector Orientation
Cosine similarity measures the cosine of the angle between two non-zero
vectors, focusing on their orientation rather than magnitude.

Text Analysis
It is commonly used in text mining and information retrieval, where it measures
the similarity between document vectors.

Clustering Preference
Cosine similarity is often preferred when the data has no clear scale, such as in
high-dimensional text data or gene expression data.
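A small sketch of the calculation; the two vectors are arbitrary and chosen to point in the same direction.

# Cosine similarity: cosine of the angle between two vectors, ignoring magnitude.
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([2.0, 4.0, 6.0])   # same direction as a, twice the length

cos_sim = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
print(cos_sim)                  # 1.0: identical orientation despite different magnitudes

The corresponding distance used for clustering is 1 minus this value, which SciPy exposes as scipy.spatial.distance.cosine.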
Linkage Criteria
1 Single Linkage
Merges clusters based on the minimum distance
between any two points in the clusters.

2 Complete Linkage
Merges clusters based on the maximum distance
between any two points in the clusters.

3 Average Linkage
Merges clusters based on the average distance
between all pairs of points in the clusters.
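The sketch below clusters the same synthetic data with each of the three criteria using SciPy; the data and the choice of three clusters are illustrative assumptions.

# Compare single, complete, and average linkage on the same data.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

X = np.vstack([np.random.RandomState(seed).normal(center, 0.5, size=(15, 2))
               for seed, center in enumerate([0, 4, 8])])

for method in ("single", "complete", "average"):
    Z = linkage(X, method=method, metric="euclidean")   # full merge history
    labels = fcluster(Z, t=3, criterion="maxclust")     # cut the tree into 3 clusters
    print(method, np.bincount(labels)[1:])              # resulting cluster sizes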
Single Linkage
1 Minimum Distance
Single linkage uses the minimum distance between any two points in the clusters to determine when to merge them.

2 Chaining Effect
This method can be sensitive to outliers and may result in long, chain-like clusters.

3 Applications
Single linkage is often used in exploratory data analysis to identify potential cluster structures.
Complete Linkage
Maximum Distance
Complete linkage uses the maximum distance between
any two points in the clusters to determine when to
merge them.

Compact Clusters
This method tends to produce more compact, spherical
clusters by favoring the merger of clusters with the
smallest maximum distance.

Sensitivity to Outliers
Complete linkage is more sensitive to outliers than
single linkage, as it is influenced by the maximum
distance between points.
Average Linkage
Compromise
Average linkage is a compromise between single and complete linkage, using the average distance between all pairs of points in the clusters.

Flexibility
This method can produce clusters with varying shapes and sizes, making it a versatile choice for many clustering problems.

Robustness
Average linkage is generally less sensitive to outliers and noise than other linkage methods.
Ward's Method: Principles and Assumptions
Ward's method is a hierarchical clustering algorithm that uses a
specific distance metric to group data points. The core principle is to
minimize the variance within clusters, which leads to a more
balanced and cohesive clustering.

by Runu Sukeerti
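SciPy implements Ward's criterion directly; the short sketch below (with an assumed toy dataset) shows how each merge is chosen to minimize the increase in within-cluster variance.

# Ward's method: merge the pair of clusters with the smallest variance increase.
import numpy as np
from scipy.cluster.hierarchy import linkage

X = np.vstack([np.random.RandomState(seed).normal(center, 0.4, size=(10, 2))
               for seed, center in enumerate([0, 3, 6])])

Z = linkage(X, method="ward")   # Ward's method requires Euclidean distances
print(Z[:5])                    # each row: [cluster_a, cluster_b, merge distance, new cluster size]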
Constructing a Dendrogram
1 Initial Steps
Each data point starts as its own cluster.

2 Merging Clusters
At each step, the two closest clusters are merged based on Ward's distance criterion (the smallest increase in within-cluster variance).

3 Hierarchical Structure
The merging process continues until all data points
are in one cluster, forming a hierarchical tree
structure.
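These steps map directly onto SciPy's linkage and dendrogram functions; the data and plot labels below are illustrative, and matplotlib is assumed to be installed.

# Build the merge hierarchy with Ward's method and draw the dendrogram.
import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

X = np.vstack([np.random.RandomState(seed).normal(center, 0.4, size=(8, 2))
               for seed, center in enumerate([0, 3, 6])])

Z = linkage(X, method="ward")   # merge history, one row per merge
dendrogram(Z)                   # leaves are data points; height is the merge distance
plt.xlabel("Data point index")
plt.ylabel("Merge distance")
plt.show()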
Interpreting Dendrogram Structures
Branching Patterns: The dendrogram's branching patterns reveal the relationships between clusters and their proximity.

Cluster Height: The height of each cluster indicates the distance between merged clusters, providing insights into their similarity.

Cluster Size: The size of a cluster indicates the number of data points belonging to it.
Practical Applications of Dendrograms
Customer Segmentation
Group customers based on their purchasing behavior or
demographics for targeted marketing.

Gene Expression Analysis
Identify genes with similar expression patterns for understanding biological processes.

Image Analysis
Cluster pixels or regions of an image to segment objects
or identify patterns.
Limitations and Considerations

High Computational Cost
Ward's method can be computationally expensive for large datasets.

Sensitivity to Outliers
Outliers can distort the dendrogram and affect clustering results.

Difficulty with Complex Data
Ward's method might struggle with data that has non-spherical clusters or varying densities.
Selecting Optimal Clusters from a Dendrogram
Dendrograms, tree-like diagrams, visually represent hierarchical
clustering. Selecting the optimal number of clusters is crucial for
insightful analysis.
Determining the Optimal Number of Clusters
Elbow Method: Look for a distinct "elbow" in the dendrogram where the rate of change in cluster distance decreases significantly.

Cut-Off Height: Choose a specific height on the dendrogram and cut the tree horizontally; the height represents the similarity threshold, and the branches below it become the clusters.

Domain Knowledge: Consider prior knowledge about the data and the desired number of clusters based on the problem's context.
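In code, cutting the tree at a height or asking for a fixed number of clusters both map onto SciPy's fcluster; the cut height of 2.5 and the choice of three clusters below are illustrative assumptions.

# Extract flat clusters from the dendrogram by height or by cluster count.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

X = np.vstack([np.random.RandomState(seed).normal(center, 0.4, size=(10, 2))
               for seed, center in enumerate([0, 4, 8])])
Z = linkage(X, method="ward")

labels_by_height = fcluster(Z, t=2.5, criterion="distance")   # cut horizontally at height 2.5
labels_by_count = fcluster(Z, t=3, criterion="maxclust")      # request exactly 3 clusters
print(np.bincount(labels_by_count)[1:])                       # sizes of the 3 clusters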
Advantages of Hierarchical Clustering

No Predefined Cluster Number and Cluster Size
Hierarchical clustering does not require prior knowledge of the number of clusters.

Visual Insights
Dendrograms provide a visual representation of the clustering process, aiding in understanding the relationships between data points.

Hierarchical Structure and Unsupervised Learning
Hierarchical clustering allows for understanding the relationships between clusters at different levels of granularity.
Limitations and Disadvantages of Hierarchical Clustering
Computational Complexity
Hierarchical clustering can be computationally expensive, especially for large datasets.

Irreversible Merging
Once clusters are merged, they cannot be undone, making it difficult to adjust the
clustering based on new insights.

Sensitivity to Outliers
Outliers can significantly influence the clustering results, leading to inaccurate cluster
formations.
Conclusion and Future Directions

Hierarchical clustering is a versatile method for exploring data and identifying underlying relationships.

Further research can focus on improving computational efficiency and developing methods to handle outliers effectively.

Integrating hierarchical clustering with other techniques, such as k-means clustering, can lead to hybrid approaches for improved accuracy and interpretability.
