Homework#6
Homework#6
a.) What are 3 clusters and their centers after one iteration? Show the detailed steps, same
as questions b.
b. What are 3 clusters and their centers after two iterations?
Dataset A (write A1 or A2, same in the following question);
_ In A1 clusters, a few points are located outside the circle but the average position of the data is
not positioned during K-means, then K-means are used to apply to A2.
Dataset B
_ B1 Cluster the far right side of the red cluster is nearer to other clusters. Also the rest clusters
Dataset C
_ C2 is greater than C1, because the C1 has less than C2, which we can say the C1 has clusters
are separate and has less cluster and C2 is a connected cluster and it has a larger cluster than C1.
Dataset D
_The colors at the extreme ends are also nearer to other centroids with respect to their own
Dataset E
_ The line that segregates the light blue from the deep blue in E1 has a downward or negative
slope. However, because the centroid of light blue is a little higher than that of deep blue, the
dividing line must maintain a positive slope. So, the K-means was thus applied to E2.
Dataset F
_ The centroid of blue in dataset F1 is clearly visible and nearer to the red dataset. Which means
clusters are well separated. So, the K-means is applied for F2.
a. What is the distance between the two farthest members? (max or complete link) (round
= 5.620910252
d. What is the center distance between two clusters?
e. Among all four distances above, which one is robust to noise? Answer either “complete”,
_ Among all four distances above, the robust to noise is "average" distance. The average
distance between all pairs considers the distances between all points in the clusters and provides
Border Points: None (as all points within ε neighborhood of core points)