AI20 - Hierarchical-Clustering
AI20 - Hierarchical-Clustering
Hierarchical Clustering
Hierarchical Clustering
• Hierarchical methods generate a hierarchy of partitions, i.e.
• a partition P1 into 1 clusters (the entire collection)
• a partition P2 into 2 clusters
– …
• a partition Pn into n clusters (each object forms its own cluster)
P4 P3 P2 P1
4. d 4 ( P, Q) = x P − xQ ( centroid method )
PQ 2
5. d5 ( P, Q) = 2 x P − xQ ( Ward’s method )
P + Q
RSS (Pk ) =
i =1 j Pi
x j − x Pi
Single Link
• Use minimum similarity of pairs:
sim (ci ,c j ) = min sim ( x, y )
xci , yc j
Ci Cj Ck
Sec. 17.2
18 22 25 27 42 43
18 0 4 7 9 24 25
22 4 0 3 5 20 21
25 7 3 0 2 17 18
27 9 5 2 0 15 16
42 24 20 17 15 0 1
43 25 21 18 16 1 0
Step 2
18 22 25 27 42, 43
18 0 4 7 9 24
22 4 0 3 5 20
25 7 3 0 2 17
27 9 5 2 0 15
42, 43 24 20 17 15 0
Step 3
18 22 25, 27 42, 43
18 0 4 7 24
22 4 0 3 20
25, 27 7 3 0 15
42, 43 24 20 15 0
Step 4
18 0 4 24
22, 25, 27 4 0 15
42, 43 24 15 0
Step 5
42, 43 15 0
Step 5