
• Hierarchical clustering is a method for grouping similar objects into clusters, forming a hierarchical, tree-like representation called a dendrogram. It does not require the number of clusters to be specified beforehand, making it useful for exploratory data analysis. Hierarchical clustering can be broadly classified into two types: agglomerative (bottom-up) and divisive (top-down).
• The hierarchical clustering technique has two approaches:

• Agglomerative: a bottom-up approach in which the algorithm starts with every data point as its own cluster and merges the closest pairs until only one cluster is left.
• Divisive: the reverse of the agglomerative algorithm; a top-down approach that starts with all data points in a single cluster and recursively splits it.
• Agglomerative Hierarchical Clustering

• The agglomerative hierarchical clustering algorithm is a popular example of HCA (Hierarchical Cluster Analysis). To group the data into clusters, it follows the bottom-up approach: the algorithm treats each data point as a single cluster at the beginning and then repeatedly merges the closest pair of clusters. It continues until all clusters are merged into a single cluster containing the entire dataset.
• This hierarchy of clusters is represented in the form of a dendrogram.
• The dendrogram is a tree-like structure that records each merge step performed by the hierarchical clustering algorithm. In a dendrogram plot, the y-axis shows the distance (e.g., Euclidean) at which clusters were merged, and the x-axis shows the individual data points of the dataset.
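As a concrete illustration of the dendrogram described above, the sketch below builds one with SciPy's hierarchical-clustering utilities. The six 2-D points and the choice of Ward linkage are assumptions for the example; any dataset and linkage method could be substituted.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, dendrogram

# Six 2-D points forming two loose groups (illustrative data).
X = np.array([[1.0, 1.0], [1.5, 1.2], [1.2, 0.8],
              [8.0, 8.0], [8.5, 8.2], [7.8, 7.9]])

# Agglomerative clustering: each row of Z records one merge
# (the two clusters joined and the distance at which they merged).
Z = linkage(X, method="ward")

# no_plot=True returns the tree layout (leaf order, merge heights)
# without drawing; the same call without it renders via matplotlib.
tree = dendrogram(Z, no_plot=True)
print(tree["ivl"])  # leaf labels in left-to-right dendrogram order
```

The merge distances in `Z` are what the y-axis of the plotted dendrogram displays.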
• The working of the algorithm can be summarized in the following steps:
• Step 1: Compute the proximity matrix using a chosen distance metric
• Step 2: Assign each data point to its own cluster
• Step 3: Merge the two clusters that are most similar according to a linkage criterion
• Step 4: Update the proximity matrix
• Step 5: Repeat Steps 3 and 4 until only a single cluster remains
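The steps above can be sketched from scratch as follows. Single linkage (distance between the closest members of two clusters) and Euclidean distance are assumptions for this sketch; other linkage criteria and metrics work the same way.

```python
import math

def euclidean(a, b):
    # Euclidean distance between two points given as tuples.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def agglomerative(points):
    # Step 2: each data point starts in its own cluster.
    clusters = [[i] for i in range(len(points))]
    merges = []
    while len(clusters) > 1:
        # Steps 1 & 4: (re)compute pairwise cluster distances
        # (single linkage: distance between the closest members).
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(euclidean(points[a], points[b])
                        for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        # Step 3: merge the closest pair of clusters.
        d, i, j = best
        merges.append((sorted(clusters[i]), sorted(clusters[j]), d))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
        # Step 5: the loop repeats until one cluster remains.
    return merges

history = agglomerative([(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)])
print(history[-1])  # the final merge joins the two groups
```

Recomputing all pairwise distances each pass makes this O(n³); library implementations cache and update the proximity matrix instead, which is the optimization Step 4 refers to.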
APPLICATIONS:
• Biology and Bioinformatics: Hierarchical clustering is widely used in genomics and bioinformatics to analyze gene expression data, identify gene regulatory networks, and classify biological samples based on their expression profiles.
• Marketing and Customer Segmentation: In marketing, hierarchical clustering is applied to segment customers based on their purchasing behavior, demographics, or preferences. This information can be used for targeted marketing campaigns and product recommendations.
• Image Analysis and Computer Vision: Hierarchical clustering is used in image processing and computer vision for tasks such as image segmentation, object recognition, and content-based image retrieval.
• Text Mining and Document Clustering: In text analysis, hierarchical clustering is applied to group similar documents or words together, enabling tasks such as document clustering, topic modeling, and sentiment analysis.
• Social Network Analysis: Hierarchical clustering can be used to analyze social networks by grouping users or communities based on their interactions or network properties.

ADVANTAGES:
• No Need for a Predefined Number of Clusters: Hierarchical clustering does not require specifying the number of clusters beforehand, making it suitable for exploratory data analysis.
• Hierarchical Representation: Hierarchical clustering provides a hierarchical structure (dendrogram) that can be visually inspected to understand the relationships between clusters at different levels of granularity.
• Interpretability: The hierarchical structure produced by hierarchical clustering can be interpreted to gain insights into the natural grouping of data points.
• Flexibility: Hierarchical clustering can handle different types of data (e.g., numerical, categorical) and distance metrics, allowing for a flexible approach to clustering.
DISADVANTAGES:
• Computational Complexity: Agglomerative hierarchical clustering can be computationally expensive, especially for large datasets, as it requires calculating pairwise distances between all data points or clusters.
• Memory Usage: Hierarchical clustering may require storing the entire distance matrix in memory, making it memory-intensive for large datasets.
• Difficulty in Handling Noise and Outliers: Hierarchical clustering may struggle with noisy or outlier data points, as they can affect the merging process and lead to suboptimal clustering results.
• Subjectivity in Dendrogram Interpretation: Interpreting the dendrogram produced by hierarchical clustering can be subjective, and determining the appropriate number of clusters or the level of granularity can be challenging.