Lect 11 DM

DISCLAIMER

In preparation of these slides, material has been taken from different online sources in the form of books, websites, research papers, presentations, etc. However, the author does not intend to take any benefit of these in her/his own name. This lecture (audio, video, slides, etc.) is prepared and delivered only for educational purposes and is not intended to infringe upon copyrighted material. Sources have been acknowledged where applicable. The views expressed are the presenter's alone and do not necessarily represent the actual author(s) or the institution.
Data Mining

Clustering

Clustering Approaches
1. Partitioning Methods
2. Hierarchical Methods
3. Density-Based Methods
Hierarchical Clustering
• Two main types of hierarchical clustering
  – Agglomerative:
    • Start with the points as individual clusters
    • At each step, merge the closest pair of clusters until only one cluster (or k clusters) is left
  – Divisive:
    • Start with one, all-inclusive cluster
    • At each step, split a cluster until each cluster contains a single point (or there are k clusters)
• Traditional hierarchical algorithms use a similarity or distance matrix
  – Merge or split one cluster at a time
  – Image segmentation mostly uses simultaneous merge/split
Hierarchical clustering
• Agglomerative (Bottom-up)
  – Compute all pair-wise pattern-pattern similarity coefficients
  – Place each of the n patterns into a class of its own
  – Merge the two most similar clusters into one
    • Replace the two clusters with the new cluster
    • Re-compute inter-cluster similarity scores w.r.t. the new cluster
  – Repeat the above step until there are k clusters left (k can be 1); see the sketch below
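The merge loop above can be written down directly. A minimal, illustrative sketch in Python/NumPy (the function name `agglomerative` and the use of single-link distance are assumptions made for this example, not part of the slides):

```python
import numpy as np

def agglomerative(points, k=1):
    """Naive bottom-up clustering: start with singleton clusters and
    repeatedly merge the closest pair until only k clusters remain."""
    clusters = [[i] for i in range(len(points))]          # each pattern in its own class
    dist = lambda a, b: np.linalg.norm(points[a] - points[b])

    while len(clusters) > k:
        # find the two most similar (closest) clusters, single-link style
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(dist(a, b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]            # replace the pair with the merged cluster
        del clusters[j]
    return clusters

pts = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.0], [5.1, 4.9], [9.0, 0.0]])
print(agglomerative(pts, k=2))   # -> [[0, 1], [2, 3, 4]]
```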
Hierarchical clustering
• Agglomerative (Bottom-up)
[Figure: a sequence of slides (iterations 1 through 5, then the final k clusters) shows the closest pair of clusters being merged at each step.]
Hierarchical clustering
• Divisive (Top-down)
  – Start at the top with all patterns in one cluster
  – The cluster is split using a flat clustering algorithm
  – This procedure is applied recursively until each pattern is in its own singleton cluster (a rough sketch follows)
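The slides do not fix a particular flat clustering algorithm for the split step; a common choice is 2-means (bisecting). A rough, illustrative sketch assuming scikit-learn is available (the function name `divisive` is made up for this example):

```python
import numpy as np
from sklearn.cluster import KMeans

def divisive(points, indices=None):
    """Top-down clustering: recursively split each cluster with 2-means
    until every pattern ends up in its own singleton cluster."""
    if indices is None:
        indices = np.arange(len(points))
    if len(indices) <= 1:                                  # singleton reached: stop splitting
        return [indices.tolist()]
    labels = KMeans(n_clusters=2, n_init=10).fit_predict(points[indices])
    left, right = indices[labels == 0], indices[labels == 1]
    print("split", indices.tolist(), "->", left.tolist(), "+", right.tolist())
    return divisive(points, left) + divisive(points, right)

pts = np.array([[0.0, 0.0], [0.1, 0.2], [5.0, 5.0], [5.1, 4.9]])
print(divisive(pts))   # every point ends up in its own singleton cluster
```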
Hierarchical Clustering: The Algorithm

• Hierarchical clustering takes as input a set of points
• It creates a tree in which the points are leaves and the internal nodes reveal the similarity structure of the points.
  – The tree is often called a "dendrogram."
• The method is summarized below (a SciPy sketch follows):

  Place all points into their own clusters
  While there is more than one cluster, do
    Merge the closest pair of clusters

• The behavior of the algorithm depends on how "closest pair of clusters" is defined
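In practice this loop is usually not hand-coded; SciPy's `scipy.cluster.hierarchy` module builds the same tree directly. A small sketch, with made-up coordinates for illustration:

```python
import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

# six made-up 2-D points, labelled A-F
pts = np.array([[0, 0], [1, 0], [4, 0], [5, 0], [9, 3], [10, 3]], dtype=float)

# method='single' uses the closest-pair (single-link) definition of cluster distance
Z = linkage(pts, method='single', metric='euclidean')

dendrogram(Z, labels=list('ABCDEF'))   # leaves are the points, internal nodes are merges
plt.ylabel('merge distance')
plt.show()
```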
Hierarchical Clustering: Example
This example illustrates single-link clustering in Euclidean space on six points (A-F).
[Figure: a scatter of the points A-F and the corresponding single-link dendrogram over the leaves A B C D E F.]
Hierarchical Clustering
• Produces a set of nested clusters organized as a hierarchical tree
• Can be visualized as a dendrogram
  – A tree-like diagram that records the sequences of merges or splits
[Figure: nested clusters over points 1-6 and the corresponding dendrogram; the vertical axis (0 to 0.2) gives the distance at which each merge occurs.]
Strengths of Hierarchical Clustering

• Do not have to assume any particular number of clusters
  – Any desired number of clusters can be obtained by 'cutting' the dendrogram at the proper level (see the sketch below)
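Cutting the dendrogram at a level is one call with SciPy's `fcluster`; a brief sketch, reusing the linkage matrix `Z` built in the earlier snippet:

```python
from scipy.cluster.hierarchy import fcluster

# cut the tree wherever it yields exactly 3 clusters
labels_k3 = fcluster(Z, t=3, criterion='maxclust')

# or cut at a fixed merge distance (2.5 here is an arbitrary choice)
labels_dist = fcluster(Z, t=2.5, criterion='distance')

print(labels_k3, labels_dist)          # one cluster id per original point
```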
Hierarchical Clustering: Merging Clusters

Single Link: Distance between two clusters is the distance between the closest pair of points, one from each cluster. Also called "neighbor joining."

Average Link: Distance between two clusters is the average distance over all pairs of points, one from each cluster. (Taking the distance between the cluster centroids instead is centroid linkage.)

Complete Link: Distance between two clusters is the distance between the farthest pair of points.
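These definitions can be checked directly on two small, made-up clusters; a minimal NumPy/SciPy sketch:

```python
import numpy as np
from scipy.spatial.distance import cdist

X = np.array([[0.0, 0.0], [1.0, 0.0]])   # cluster 1
Y = np.array([[4.0, 0.0], [6.0, 0.0]])   # cluster 2
D = cdist(X, Y)                          # all cross-cluster pairwise distances

single   = D.min()                       # closest pair of points
complete = D.max()                       # farthest pair of points
average  = D.mean()                      # average over all cross-cluster pairs
centroid = np.linalg.norm(X.mean(axis=0) - Y.mean(axis=0))   # centroid linkage

print(single, complete, average, centroid)   # 3.0 6.0 4.5 4.5
```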
How to Define Inter-Cluster Similarity
[Figure: two groups of points p1-p5 with their proximity matrix; the question is how to define the similarity between the two clusters.]
• MIN
• MAX
• Group Average
• Distance Between Centroids
• Other methods driven by an objective function
  – Ward's Method uses squared error
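All of these inter-cluster definitions, including Ward's squared-error criterion, are available as the `method` argument of SciPy's `linkage`; a brief sketch on random, made-up data:

```python
import numpy as np
from scipy.cluster.hierarchy import linkage

pts = np.random.default_rng(0).normal(size=(20, 2))   # made-up data

# each 'method' corresponds to one of the inter-cluster similarity definitions above
for method in ('single', 'complete', 'average', 'centroid', 'ward'):
    Z = linkage(pts, method=method)
    print(method, 'distance of the final merge:', round(Z[-1, 2], 3))
```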
An example
Let us consider a gene measured in a set of 5 experiments: A, B, C, D, and E. The values measured in the 5 experiments are:
A = 100, B = 200, C = 500, D = 900, E = 1100

We will construct the hierarchical clustering of these values using Euclidean distance, centroid linkage, and an agglomerative approach.
An example
SOLUTION (a short script replaying these merges follows):
• The closest two values are 100 and 200 => the centroid of these two values is 150.
• Now we are clustering the values: 150, 500, 900, 1100.
• The closest two values are 900 and 1100 => the centroid of these two values is 1000.
• The remaining values to be joined are: 150, 500, 1000.
• The closest two values are 150 and 500 => the centroid of these two values is 325.
• Finally, the two resulting subtrees are joined in the root of the tree.
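The merge order can be verified by replaying it in code. A small sketch that follows the slide's convention of taking the new centroid as the average of the two merged centroids:

```python
values = [100.0, 200.0, 500.0, 900.0, 1100.0]   # A, B, C, D, E

while len(values) > 1:
    values.sort()
    # in one dimension the closest pair of centroids is always an adjacent pair
    gaps = [(values[i + 1] - values[i], i) for i in range(len(values) - 1)]
    _, i = min(gaps)
    merged = (values[i] + values[i + 1]) / 2    # centroid of the two merged clusters
    print(f"merge {values[i]} and {values[i + 1]} -> centroid {merged}")
    values[i:i + 2] = [merged]

# merge 100.0 and 200.0 -> centroid 150.0
# merge 900.0 and 1100.0 -> centroid 1000.0
# merge 150.0 and 500.0 -> centroid 325.0
# merge 325.0 and 1000.0 -> centroid 662.5
```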
An example:
Two dendrograms showing the hierarchical clustering of the expression values of a single gene measured in 5 experiments.
[Figure: two dendrograms over the leaves A = 100, B = 200, C = 500, D = 900, and E = 1100, drawn with different leaf orderings.]

The dendrograms are identical: both diagrams show that:
• A is most similar to B
• C is most similar to the group (A, B)
• D is most similar to E
In the left dendrogram A and E are plotted far from each other; in the right dendrogram A and E are immediate neighbors.

THE PROXIMITY IN A HIERARCHICAL CLUSTERING DOES NOT NECESSARILY CORRESPOND TO SIMILARITY
Example: Single Link Method
[Figure: a sequence of slides works through single-link clustering step by step on an example dataset, ending with the resulting dendrogram.]

Example: Complete Link Method
[Figure: the same example clustered with complete linkage, with the resulting dendrogram.]

Example: Group Average Method
[Figure: the same example clustered with group-average linkage.]
Acknowledgements
 Introduction to Machine Learning, Alpaydin
 "Pattern Classification" by Duda et al., John Wiley & Sons.
 Read GMM from "Automated Detection of Exudates in Colored Retinal Images for Diagnosis of Diabetic Retinopathy", Applied Optics, Vol. 51, No. 20, 4858-4866, 2012.

Material in these slides has been taken from the following resources:
 Biomisa.org