PGM 7

This document applies the k-means clustering algorithm and expectation-maximization (EM) algorithm to cluster iris flower data. It compares the results of k-means and EM clustering by evaluating the accuracy and confusion matrices. Both algorithms are able to reasonably cluster the iris data into three groups corresponding to the three iris species, with EM achieving slightly better accuracy than k-means.

Uploaded by

badeni


12/26/21, 4:21 PM PGM7-EM-K-MEANS.ipynb - Colaboratory

#Apply EM algorithm to cluster a set of data stored in a .CSV file. Use the same data set
#for clustering using k-Means algorithm. Compare the results of these two algorithms and
#comment on the quality of clustering. You can add Java/Python ML library classes/API in
#the program.

import matplotlib.pyplot as plt

from sklearn import datasets
from sklearn.cluster import KMeans
import sklearn.metrics as sm
import pandas as pd
import numpy as np

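l1 = [0, 1, 2]

def rename(s):
    # Relabel cluster ids in order of first appearance:
    # the first id seen becomes 0, the next new id 1, and so on.
    # Note: s is modified in place and also returned.
    l2 = []
    for i in s:
        if i not in l2:
            l2.append(i)

    for i in range(len(s)):
        pos = l2.index(s[i])
        s[i] = l1[pos]

    return s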
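The `rename` helper relabels cluster ids in order of first appearance, which lines them up with the iris targets only because the dataset is sorted by species. A minimal sketch of the same idea that does not modify its input is shown below; `relabel_by_first_appearance` is a hypothetical name used for illustration and is not part of the original notebook.

```python
def relabel_by_first_appearance(labels):
    """Return a new list: the first distinct label seen maps to 0, the next to 1, ..."""
    mapping = {}  # original label -> new label
    out = []
    for lab in labels:
        if lab not in mapping:
            mapping[lab] = len(mapping)
        out.append(mapping[lab])
    return out


example = [2, 2, 0, 0, 1, 1, 0]
relabelled = relabel_by_first_appearance(example)
print(relabelled)  # [0, 0, 1, 1, 2, 2, 1]
print(example)     # input is left unchanged: [2, 2, 0, 0, 1, 1, 0]
```

Because the mapping depends only on the order in which labels appear, shuffled data would need a different alignment strategy.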

# import some data to play with

iris = datasets.load_iris()

print("\n IRIS FEATURES :\n", iris.feature_names)
print("\n IRIS TARGET :\n", iris.target)
print("\n IRIS TARGET NAMES:\n", iris.target_names)

# Store the inputs as a Pandas Dataframe and set the column names
X = pd.DataFrame(iris.data)

#print(X)
X.columns = ['Sepal_Length', 'Sepal_Width', 'Petal_Length', 'Petal_Width']

#print(X.columns)
#print("X:", x)
#print("Y:", y)
y = pd.DataFrame(iris.target)
y.columns = ['Targets']

# Set the size of the plot

plt.figure(figsize=(14,7))

# Create a colormap
colormap = np.array(['red', 'lime', 'black'])

# Plot Sepal
plt.subplot(1,2,1)
plt.scatter(X.Sepal_Length,X.Sepal_Width, c=colormap[y.Targets], s=40)
plt.title('Sepal')

plt.subplot(1,2,2)
plt.scatter(X.Petal_Length,X.Petal_Width, c=colormap[y.Targets], s=40)
plt.title('Petal')
plt.show()

print("Actual Target is:\n", iris.target)

# K Means Cluster
model = KMeans(n_clusters=3)
model.fit(X)

# Set the size of the plot

plt.figure(figsize=(14,7))

# Create a colormap
colormap = np.array(['red', 'lime', 'black'])

# Plot the Original Classifications

plt.subplot(1,2,1)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y.Targets], s=40)
plt.title('Real Classification')

# Plot the model's classifications

plt.subplot(1,2,2)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[model.labels_], s=40)
plt.title('K-Means Classification')
plt.show()

km = rename(model.labels_)
print("\nWhat KMeans thought: \n", km)
print("Accuracy of KMeans is ", sm.accuracy_score(y, km))
print("Confusion Matrix for KMeans is \n", sm.confusion_matrix(y, km))
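To make the two reported metrics concrete, here is a small pure-Python sketch of what an accuracy score and a confusion matrix measure. The helper names `toy_accuracy` and `toy_confusion_matrix` and the six-sample labels are made up for illustration; the notebook itself relies on `sklearn.metrics`.

```python
def toy_confusion_matrix(y_true, y_pred, n_classes):
    """Rows index the true class, columns the predicted class."""
    matrix = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


def toy_accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical labels: one sample of class 1 is assigned to class 2.
y_true_toy = [0, 0, 1, 1, 2, 2]
y_pred_toy = [0, 0, 1, 2, 2, 2]

cm = toy_confusion_matrix(y_true_toy, y_pred_toy, 3)
for row in cm:
    print(row)
# [2, 0, 0]
# [0, 1, 1]
# [0, 0, 2]
print(toy_accuracy(y_true_toy, y_pred_toy))  # 5 correct out of 6
```

The diagonal counts samples whose cluster label matches the species; off-diagonal entries show which species were mixed up.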

# The GaussianMixture scikit-learn class can be used to model this problem
# and estimate the parameters of the distributions using the
# expectation-maximization algorithm.

from sklearn import preprocessing

scaler = preprocessing.StandardScaler()
scaler.fit(X)
xsa = scaler.transform(X)
xs = pd.DataFrame(xsa, columns = X.columns)
print("\n", xs.sample(5))
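The scaling step rescales each feature to zero mean and unit variance. A minimal standard-library sketch of that transformation for a single column follows; `standardize` and the sample values are hypothetical, and the population standard deviation is used here, which is what `StandardScaler` computes.

```python
from statistics import mean, pstdev


def standardize(values):
    """Return z-scores: (value - mean) / population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


column = [2, 4, 4, 4, 5, 5, 7, 9]  # mean 5, population std 2
z = standardize(column)
print(z)  # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
```

Standardizing matters for the mixture model because features measured on larger scales would otherwise dominate the fitted covariances.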

from sklearn.mixture import GaussianMixture

gmm = GaussianMixture(n_components=3)

gmm.fit(xs)

y_cluster_gmm = gmm.predict(xs)

plt.subplot(1, 2, 1)
plt.scatter(X.Petal_Length, X.Petal_Width, c=colormap[y_cluster_gmm], s=40)
plt.title('GMM Classification')
plt.show()

em = rename(y_cluster_gmm)
print("\nWhat EM thought: \n", em)
print("Accuracy of EM is ", sm.accuracy_score(y, em))
print("Confusion Matrix for EM is \n", sm.confusion_matrix(y, em))
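Cluster ids are arbitrary, so the accuracy printed for either algorithm depends on how the ids are matched to species. As an alternative to first-appearance relabelling, one option is to try every permutation of the cluster ids and keep the best match. The sketch below assumes a small number of clusters (the search is factorial in `n_clusters`); `best_permutation_accuracy` and the nine-sample labels are hypothetical and not part of the original notebook.

```python
from itertools import permutations


def best_permutation_accuracy(y_true, y_cluster, n_clusters):
    """Try every cluster-id -> class-id mapping and return (best accuracy, mapping)."""
    best_acc, best_map = -1.0, None
    for perm in permutations(range(n_clusters)):
        # perm[c] is the class assigned to cluster id c
        mapped = [perm[c] for c in y_cluster]
        acc = sum(t == m for t, m in zip(y_true, mapped)) / len(y_true)
        if acc > best_acc:
            best_acc, best_map = acc, perm
    return best_acc, best_map


# Hypothetical labels: cluster ids are permuted and one sample is misassigned.
y_true_demo = [0, 0, 0, 1, 1, 1, 2, 2, 2]
y_cluster_demo = [1, 1, 1, 2, 2, 0, 0, 0, 0]

acc, mapping = best_permutation_accuracy(y_true_demo, y_cluster_demo, 3)
print(mapping)  # (2, 0, 1): cluster 0 -> class 2, cluster 1 -> class 0, cluster 2 -> class 1
print(acc)      # 8 of 9 samples agree under the best mapping
```

Applying the same matching rule to both the k-means and the EM labels keeps the comparison between the two algorithms independent of the order of the rows.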
