Clustering
Clustering
Clustering
Clustering
Clustering
1. Prototype-based clustering
2. Density clustering
3. Hierarchical clustering
4. Model-based clustering
k-Means Clustering
k-Means
k-means clustering is a prototype-based clustering method where the data set
is divided into k clusters.
Objective: find a prototype data point for each cluster; all the data points are
then assigned to the nearest prototype, which then forms a cluster
k Partitions
Voronoi partition. (“Euclidean Voronoi Diagram” by Raincomplex – personal work. Licensed under Creative
Commons Zero, Public Domain Dedication via Wikimedia Commons
k Partitions
Evaluation of Clusters
Groups of core points form distinct clusters. If two core points are within ε of
each other, then both core points are within the same cluster.
The initial centroids are values of random data objects from the data set.
Data objects are selected one by one and assigned to the nearest centroid.
Step 4: Centroid Update
Update the data values of the nearest centroid of the data object, proportional
to the difference between the centroid and the data object.
Step 4: Centroid Update
Step 5: Termination