Experiment 3.1 K-Mean

The document discusses implementing K-Means clustering. It shows how to identify clusters in 1D and 2D data using scikit-learn KMeans. It generates scatter plots to visualize clustering for different numbers of clusters on randomly generated data.

Uploaded by

Arslan Mansoori

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views8 pages

Experiment 3.1 K-Mean

Uploaded by

Arslan Mansoori

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 8

EXPERIMENT 9

Aim: Implementation of K-Mean Clustering

COURSE OUTCOMES

CO4 Evaluate machine learning model’s performance and apply learning strategy to
improve the performance of supervised and unsupervised learning model.

CO5 Develop a suitable model for supervised and unsupervised learning algorithm and
optimize the model on the expected accuracy.

K Means Clustering
In this model Data is divided into clusters on the basis of nearest mean to each cluster.
1. Identify 2 groups in 1D Array
from sklearn.cluster import KMeans
import numpy as np

data = np.array([1,2,3,4,5,6,7,8,9,10,91,92,93,94,95,96,97,98,99,100])

kmeans = KMeans(n_clusters=2).fit(data.reshape(-1,1))
kmeans.predict(data.reshape(-1,1))

1. Identify 5 groups in 1D Array

from sklearn.cluster import KMeans
import numpy as np
data = np.array([101, 107, 106, 199, 204, 205, 207, 306, 310, 312, 312, 314, 317, 318, 380, 377,
379, 382, 466, 469, 471, 472, 557, 559, 562, 566, 569])

kmeans = KMeans(n_clusters=5).fit(data.reshape(-1,1))
kmeans.predict(data.reshape(-1,1))

2. Identify 2 groups in 2 D Array

from sklearn.cluster import KMeans
import numpy as np
X = np.array([[1, 2], [1, 4], [1, 0], [10, 2], [10, 4], [10, 0]])
kmeans = KMeans(n_clusters=2, random_state=0).fit(X)
kmeans.predict([[0, 0], [12, 3]])
kmeans.predict([[11,11], [8, 9]])
kmeans.predict([[2,20], [4, 4]])
Explanation:
1 2
1 4
1 0
10 2
10 4
10 0
Ans is [1,0]
[0,0] will be predicted in Column No 1
[12,3] will be predicted in Column No 0

Similarly check [11,11] [8,9] it must come in [0,0]

And Check[2,2][4,4] it must come in [1,1]

3. Plotting K means cluster for 2D Group for 2 Clusters

from sklearn.cluster import KMeans
import numpy as np
X = np.array([[1, 2], [1, 4], [1, 0], [10, 2], [10, 4], [10, 0]])
kmeans = KMeans(n_clusters=2, random_state=0).fit(X)
y_predict= kmeans.fit_predict(X)
#kmeans.predict([[0, 0], [12, 3]])

import matplotlib.pyplot as mtp

mtp.scatter(X[y_predict == 0, 0], X[y_predict == 0, 1], s = 100, c = 'blue', label = 'Cluster 1')

#for first cluster
mtp.scatter(X[y_predict == 1, 0], X[y_predict == 1, 1], s = 100, c = 'green', label = 'Cluster 2')
#for second cluster
mtp.xlim(0,10)
mtp.ylim(0,10)
mtp.show()

4. Plot a scatter Chart for 300 random numbers

%matplotlib inline
import matplotlib.pyplot as plt
import seaborn as sns; sns.set() # for plot styling
import numpy as np
from sklearn.datasets import make_blobs
X, y_true = make_blobs(n_samples=300, centers=4,
cluster_std=0.60, random_state=0)
plt.scatter(X[:, 0], X[:, 1], s=50);
# The scatter() function plots one dot for each observation. It needs two arrays of the same
length, one for the values of the x-axis, and one for values on the y-axis.
# Using : means that we take all elements in the correspond array dimension.
# s tells the size of the marker. (This is the size of the marker)

Now seeing this chart we can identify that there are 4 different clusters.
The k-means algorithm does this automatically, and in Scikit-Learn uses the typical estimator
API:

from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=4)
kmeans.fit(X)
y_kmeans = kmeans.predict(X)
plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')
centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200, alpha=0.5);

5. Plot a scatter Chart for 300 random numbers (For the same data increase the clusters to 5
say)
from sklearn.cluster import KMeans
kmeans = KMeans(n_clusters=5)
kmeans.fit(X)
y_kmeans = kmeans.predict(X)

plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')

centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200, alpha=0.5);

Figure 30: 5 Clusters

6. Plot a scatter Chart for 300 random numbers (For the same data increase the clusters to 6
say)

from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=6)
kmeans.fit(X)
y_kmeans = kmeans.predict(X)

plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')

centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200, alpha=0.5);

Figure 31: 6 Clusters

Similarly do the same for 7 Clusters and 8 Clusters

Figure 32: 7 Clusters

Figure 33: 12 Clusters

Viva Questions
1. What is the main difference between k-Means and k-Nearest Neighbours?
2. How is Entropy used as a Clustering Validation Measure?
3. How to determine k using the Elbow Method?
4. What is the difference between Classical k-Means and Spherical k-Means?
5. What is the difference between k-Means and k-Medians and when would you use one
over another?

01 K Means - Merged
No ratings yet
01 K Means - Merged
26 pages
ML - K-Means
No ratings yet
ML - K-Means
12 pages
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
No ratings yet
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
7 pages
Wa0033.
No ratings yet
Wa0033.
38 pages
Lab Report6 - B21CI014
No ratings yet
Lab Report6 - B21CI014
8 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
SE KMeansClustering
No ratings yet
SE KMeansClustering
21 pages
K Means Clustering - Experiment 12
No ratings yet
K Means Clustering - Experiment 12
3 pages
K-Means in Python - Solution
No ratings yet
K-Means in Python - Solution
6 pages
DWM Exp4
No ratings yet
DWM Exp4
9 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
K Means
No ratings yet
K Means
3 pages
Practical 03
No ratings yet
Practical 03
3 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
ML0101EN Clus K Means Customer Seg Py v1
100% (1)
ML0101EN Clus K Means Customer Seg Py v1
8 pages
K.means Clustering
No ratings yet
K.means Clustering
8 pages
3.1 K - Means
No ratings yet
3.1 K - Means
16 pages
EXP-6 K Mean Clustring
No ratings yet
EXP-6 K Mean Clustring
6 pages
Aam Unit 4 QB With Answer
No ratings yet
Aam Unit 4 QB With Answer
11 pages
2.3 Aiml Rishit
No ratings yet
2.3 Aiml Rishit
7 pages
Unsupervisd Learning Algorithm
No ratings yet
Unsupervisd Learning Algorithm
6 pages
Machine Learning K Means - Unsupervised
No ratings yet
Machine Learning K Means - Unsupervised
5 pages
Yunsu Han KNN K Means
No ratings yet
Yunsu Han KNN K Means
8 pages
Presentation 1
No ratings yet
Presentation 1
47 pages
K Means Clustering
No ratings yet
K Means Clustering
11 pages
Aiml Lab
No ratings yet
Aiml Lab
37 pages
ML Minors Exp7
No ratings yet
ML Minors Exp7
6 pages
Experiment-7: Implementation of K-Means Clustering Algorithm
No ratings yet
Experiment-7: Implementation of K-Means Clustering Algorithm
3 pages
K-Means Algorithm
No ratings yet
K-Means Algorithm
29 pages
MLT 8 KK
No ratings yet
MLT 8 KK
2 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
20 pages
K-Means Clustering Report
No ratings yet
K-Means Clustering Report
2 pages
02.1 K-Means Example
No ratings yet
02.1 K-Means Example
12 pages
Drawback of Standard K-Means Algorithm
No ratings yet
Drawback of Standard K-Means Algorithm
5 pages
Week 8. K-Means
No ratings yet
Week 8. K-Means
7 pages
AI Week 11
No ratings yet
AI Week 11
21 pages
Aml - Lab (1-6)
No ratings yet
Aml - Lab (1-6)
15 pages
ML2 Practical List
No ratings yet
ML2 Practical List
80 pages
Lab-7 Clustering
No ratings yet
Lab-7 Clustering
4 pages
Da Exp 10
No ratings yet
Da Exp 10
6 pages
Kmeans Clustering
No ratings yet
Kmeans Clustering
3 pages
Da Exp 10
No ratings yet
Da Exp 10
6 pages
LAB7 Kmeans
No ratings yet
LAB7 Kmeans
11 pages
Yogesh Siddiq Edited
No ratings yet
Yogesh Siddiq Edited
6 pages
AdityaGaur BDA Exp8
No ratings yet
AdityaGaur BDA Exp8
4 pages
Ex No: Date: K-Means Clustering Using Python: Scatter
No ratings yet
Ex No: Date: K-Means Clustering Using Python: Scatter
10 pages
Rajeek8 12
No ratings yet
Rajeek8 12
21 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
DS - ML - 7 - 60019210046 1
No ratings yet
DS - ML - 7 - 60019210046 1
6 pages
Document
No ratings yet
Document
4 pages
Clustering
No ratings yet
Clustering
1 page
09.unsupervised Learning
No ratings yet
09.unsupervised Learning
50 pages
Aam Codes
No ratings yet
Aam Codes
8 pages
K-Means Cluster
No ratings yet
K-Means Cluster
2 pages
Untitled Document
No ratings yet
Untitled Document
1 page
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
3 pages
Experiment 9
No ratings yet
Experiment 9
10 pages
UNIT - 3 - Clustering
No ratings yet
UNIT - 3 - Clustering
21 pages
JAVIER KMeans Clustering Jupyter Notebook
No ratings yet
JAVIER KMeans Clustering Jupyter Notebook
7 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Unit 3
No ratings yet
Unit 3
28 pages
PyCaret Regression
No ratings yet
PyCaret Regression
13 pages
Eswd Assignment
No ratings yet
Eswd Assignment
10 pages
De Lab Worksheet
No ratings yet
De Lab Worksheet
5 pages
Design Automatic Street Light Using LDR
No ratings yet
Design Automatic Street Light Using LDR
4 pages
Michelle Cook
No ratings yet
Michelle Cook
273 pages
Business Analytics 705 v1 468
100% (1)
Business Analytics 705 v1 468
468 pages
Stda 1
No ratings yet
Stda 1
93 pages
Chapter 1
No ratings yet
Chapter 1
35 pages
Project Description1
No ratings yet
Project Description1
6 pages
Research Gate - Asthama Diagnosis
No ratings yet
Research Gate - Asthama Diagnosis
7 pages
PHD Thesis Data Mining Bioinformatics
100% (2)
PHD Thesis Data Mining Bioinformatics
6 pages
08 ML WEKA Classification
No ratings yet
08 ML WEKA Classification
73 pages
Predicting Social Media Performance Metr
No ratings yet
Predicting Social Media Performance Metr
11 pages
Unit V
No ratings yet
Unit V
23 pages
List of Experiments BI LAb-1
No ratings yet
List of Experiments BI LAb-1
2 pages
Data Mining 1-3
No ratings yet
Data Mining 1-3
29 pages
AIML Lect5 Assignment ID3
No ratings yet
AIML Lect5 Assignment ID3
2 pages
Rcse 001
No ratings yet
Rcse 001
2 pages
Data Mining in Cloud Computing PDF
No ratings yet
Data Mining in Cloud Computing PDF
5 pages
Data Warehousing Mining MCQs
No ratings yet
Data Warehousing Mining MCQs
12 pages
A Study On Artificial Intelligence in HCL Info System
No ratings yet
A Study On Artificial Intelligence in HCL Info System
67 pages
M.SC Part II Syllabus
No ratings yet
M.SC Part II Syllabus
41 pages
Automated Image Captioning With Convnets and Recurrent Nets: Andrej Karpathy, Fei-Fei Li
No ratings yet
Automated Image Captioning With Convnets and Recurrent Nets: Andrej Karpathy, Fei-Fei Li
105 pages
Strategic Alignment Process and Decision Support Systems
100% (3)
Strategic Alignment Process and Decision Support Systems
385 pages
3 1 Results
No ratings yet
3 1 Results
16 pages
CS614 Current FinalTerm Paper 20 August 2016
No ratings yet
CS614 Current FinalTerm Paper 20 August 2016
15 pages
Datamining Mod3
No ratings yet
Datamining Mod3
21 pages
Factors Influencing Book Borrowing in Universities: Nicholas Muriithi
No ratings yet
Factors Influencing Book Borrowing in Universities: Nicholas Muriithi
12 pages
Data Analysis 2020
No ratings yet
Data Analysis 2020
56 pages
T SNE
No ratings yet
T SNE
11 pages
Detection of Breast Cancer Using Data Mining Tool WEKA PDF
No ratings yet
Detection of Breast Cancer Using Data Mining Tool WEKA PDF
5 pages
Railway Signal Intelligent Monitoring System Based On Data Mining
No ratings yet
Railway Signal Intelligent Monitoring System Based On Data Mining
4 pages
Practice
No ratings yet
Practice
5 pages
Keras Cheat Sheet Python For Data Science: Model Architecture Inspect Model
No ratings yet
Keras Cheat Sheet Python For Data Science: Model Architecture Inspect Model
1 page