Case Study K-Means Clustering

K-Means clustering is an unsupervised learning algorithm that groups data points into a specified number of clusters based on their similarities. It was used to categorize 50 countries into 4 clusters based on their murder rate, assault rate, percentage of population living in urban areas, and crimes against women. The algorithm computes centroids for each cluster and iteratively assigns countries to clusters to minimize distances between countries and their assigned centroids until optimal clusters are identified.

Uploaded by

Mrunmai

K-Means Clustering

Clustering: Clustering is the task of dividing a population or set of data points into a number of groups such that data points in the same group are more similar to each other than to those in other groups. In simple words, the aim is to segregate groups with similar traits and assign them into clusters.

K-Means Clustering: K-Means Clustering is an unsupervised learning algorithm that is used to solve clustering problems in data science. The K-means clustering algorithm computes a centroid for each cluster, reassigns each data point to its nearest centroid, and repeats until the centroids stop changing.

It is commonly assumed that crime levels are higher in urban than in rural areas, and an important question is whether this holds. According to crime statistics, community size does make a difference: crime rates are higher in urban than in rural areas. Crime rates in countries with a high urban population are three to four times as high as the rates in countries with a low urban population. These statistics hold for nearly all types of crime. Next, we want to put 50 countries into four clusters based on different crimes and urban population.

Suppose there are 50 countries that we want to categorize into four clusters based on four attributes: murder rate, assault rate, urban population, and crime against women. All the data is given in Table 1 (Data Set Sheet). Classify the data into four clusters with the help of K-Means clustering.
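
The centroid computation and reassignment loop described above can be sketched in plain Python. This is a minimal illustration, not the procedure used in the case study: the function names (euclidean, assign, update, kmeans) are hypothetical, Euclidean distance is assumed as the similarity measure, and the first k points are assumed as the starting centroids so that the run is deterministic.

```python
import math

def euclidean(p, q):
    """Straight-line distance between two equal-length numeric tuples."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def assign(points, centroids):
    """Assignment step: index of the nearest centroid for every point."""
    return [min(range(len(centroids)), key=lambda j: euclidean(p, centroids[j]))
            for p in points]

def update(points, labels, centroids):
    """Update step: move each centroid to the mean of its assigned points."""
    new_centroids = []
    for j, old in enumerate(centroids):
        members = [p for p, lab in zip(points, labels) if lab == j]
        if not members:
            # Empty cluster: keep the previous centroid unchanged.
            new_centroids.append(old)
            continue
        new_centroids.append(tuple(sum(col) / len(members) for col in zip(*members)))
    return new_centroids

def kmeans(points, k, max_iter=100):
    # Assumption: the first k points seed the centroids (illustrative choice).
    centroids = [tuple(float(x) for x in p) for p in points[:k]]
    labels = assign(points, centroids)
    for _ in range(max_iter):
        centroids = update(points, labels, centroids)
        new_labels = assign(points, centroids)
        if new_labels == labels:
            # Assignments are stable, so the centroids will not move again.
            break
        labels = new_labels
    return labels, centroids

# Toy data (not from the case study): two obvious groups in two dimensions.
toy = [(0, 0), (0, 1), (10, 10), (10, 11)]
labels, centroids = kmeans(toy, k=2)
print(labels, centroids)
```

For the case study, each point would be a four-value tuple (murder, assault, urban population, crime against women) and k would be 4. The result depends on the starting centroids, so a different seeding may give different clusters.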
Table 1 (Data Set Sheet)
Country Murder Assault UrbanPop Crime against women
L1 12 193 51 21
L2 9 222 39 42
L3 6 226 72 30
L4 8 177 50 17
L5 7 235 86 39
L6 5 170 78 37
L7 0 35 75 7
L8 4 198 70 15
L9 13 259 74 30
L10 15 136 52 24
L11 5 32 82 17
L12 0 54 47 12
L13 10 218 79 20
L14 7 26 65 19
L15 2 0.1 53 9
L16 3 80 56 16
L17 7 70 43 15
L18 15 159 61 19
L19 1 0.1 45 7
L20 10 260 63 28
L21 2 138 82 13
L22 11 251 68 35
L23 1 3 62 14
L24 13 209 44 13
L25 8 80 61 28
L26 5 19 45 15
L27 4 10 56 13
L28 11 239 75 43
L29 0 0.1 53 6
L30 5 155 87 19
L31 12 229 64 32
L32 9 161 85 23
L33 13 321 45 13
L34 0.1 27 44 6
L35 8 119 73 18
L36 4 116 59 20
L37 4 103 58 29
L38 4 57 67 13
L39 2 97 83 5
L40 13 210 46 21
L41 3 1 43 12
L42 14 96 58 27
L43 13 137 79 25
L44 0 49 74 21
L45 0.1 44 24 8
L46 7 100 58 16
L47 1 62 73 24
L48 5 65 34 9
L49 0 0.1 60 11
L50 6 143 51 11
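
As a worked example of a single assignment step on this data, the sketch below finds which of four starting centroids is closest to country L5. It assumes Euclidean distance on the raw values and, purely for illustration, treats rows L1 to L4 as the starting centroids; neither choice is stated in the source, and the helper name nearest_seed is hypothetical.

```python
import math

# Rows copied from the table: (Murder, Assault, UrbanPop, Crime against women)
rows = {
    "L1": (12, 193, 51, 21),
    "L2": (9, 222, 39, 42),
    "L3": (6, 226, 72, 30),
    "L4": (8, 177, 50, 17),
    "L5": (7, 235, 86, 39),
}

# Assumption: L1 to L4 act as the four starting centroids (illustrative only).
seeds = ["L1", "L2", "L3", "L4"]

def nearest_seed(country):
    """Return the closest seed label and the squared distance to every seed."""
    sq = {s: sum((a - b) ** 2 for a, b in zip(rows[country], rows[s]))
          for s in seeds}
    return min(sq, key=sq.get), sq

label, squared = nearest_seed("L5")
print(label, round(math.sqrt(squared[label]), 2))  # prints: L3 18.95
```

The squared distances from L5 are 3338 (L1), 2391 (L2), 359 (L3) and 5145 (L4), so L5 would join the cluster seeded by L3. Assault spans a much wider numeric range than Murder, so it dominates the raw distance; whether to rescale the columns first is a modelling choice the source does not address.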
