DBSCAN
DBSCAN
Learning
Topic Contents
DBSCAN
Recommended
3
Reading
3
Clustering
Problem description
Given:
• Resistant to Noise
• Can handle clusters of different shapes and sizes
When DBSCAN Does NOT Work Well
(MinPts=4, Eps=9.75).
Original Points
• Varying densities
• High-dimensional data
(MinPts=4, Eps=9.92)
Statistical Framework for Correlation
1 1
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
y
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
x x
BSS Ci ( m mi ) 2
i
– Where |Ci| is the size of cluster i
Internal Measures: Cohesion and
Separation
Example: SSE
– BSS + WSS = constant
m
1 m1 2 3 4 m2 5
cohesion separation
Internal Measures: Silhouette Coefficient
b
– Typically between 0 and 1. a
– The closer to 1 the better.