UNIT 3 (2marks) TA
UNIT 3 (2marks) TA
key difference between clustering and categorization is that clustering is often used to
identify new patterns and insights in a dataset. By contrast, categorization is used to classify
new data points based on pre-existing knowledge about the categories.
Selecting features
Initialization Phase: In this phase, the initial centroids of the clusters are selected randomly or
based on some prior knowledge of the data.
Iterative Refinement Phase: In this phase, each data point is assigned to the nearest centroid,
and the centroids are updated based on the mean of the data points assigned to them.