0% found this document useful (0 votes)

161 views4 pages

03 - K Means Clustering On Iris Datasets

This document demonstrates k-means clustering on the iris dataset using Python's scikit-learn library. It loads the iris data, selects the feature columns for clustering, runs k-means clustering with different values of k, and calculates the elbow method to select the optimal number of clusters. It then visualizes the clustered data in 2D plots of sepal length/width and petal length/width to show how the algorithm has grouped the samples.

Uploaded by

John Wick

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

161 views4 pages

03 - K Means Clustering On Iris Datasets

Uploaded by

John Wick

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Practical - 3

AIM :- K Means Clustering On Iris Datasets

Import libraries

In [3]:

1 import numpy as np
2 import pandas as pd
3 import matplotlib.pyplot as plt
4 import seaborn as sns
5 from sklearn.model_selection import train_test_split
6 from sklearn.cluster import KMeans

import the dataset

In [4]:

1 data = pd.read_csv('iris.csv')
2 data.head(6)

Out[4]:

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

5 6 5.4 3.9 1.7 0.4 Iris-setosa

X is the selected columns

In [5]:

1 X = data[['SepalLengthCm', 'SepalWidthCm', 'PetalLengthCm',

2 'PetalWidthCm']].values
3 X[0:5]

Out[5]:

array([[5.1, 3.5, 1.4, 0.2],

[4.9, 3. , 1.4, 0.2],

[4.7, 3.2, 1.3, 0.2],

[4.6, 3.1, 1.5, 0.2],

[5. , 3.6, 1.4, 0.2]])


Specifing the correct value of k selecting randomly and applying elbow
method

In [6]:

1 kmeans5 = KMeans(n_clusters=5)
2 y_kmeans5 = kmeans5.fit_predict(X)
3 print(y_kmeans5)
4
5 kmeans5.cluster_centers_

[1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

1 1 1 1 1 1 1 1 1 1 1 1 1 3 3 3 4 3 3 3 4 3 4 4 3 4 3 4 3 3 4 3 4 3 4 3 3

3 3 3 3 3 4 4 4 4 3 4 3 3 3 4 4 4 3 4 4 4 4 4 3 4 4 0 3 2 0 0 2 4 2 0 2 0

0 0 3 0 0 0 2 2 3 0 3 2 3 0 2 3 3 0 2 2 2 0 3 3 2 0 0 3 0 0 0 3 0 0 0 3 0

0 3]

Out[6]:

array([[6.52916667, 3.05833333, 5.50833333, 2.1625 ],

[5.006 , 3.418 , 1.464 , 0.244 ],

[7.475 , 3.125 , 6.3 , 2.05 ],

[6.20769231, 2.85384615, 4.74615385, 1.56410256],

[5.508 , 2.6 , 3.908 , 1.204 ]])

In [13]:

1 Error = []
2 for i in range(1, 11):
3 kmeans = KMeans(n_clusters=i).fit(X)
4 kmeans.fit(X)
5 Error.append(kmeans.inertia_)
6
7 import matplotlib.pyplot as plt
8
9 plt.grid()
10 plt.plot(range(1, 11), Error, 'r')
11 plt.plot(range(1, 11), Error, 'o')
12 plt.title('Elbow method')
13 plt.xlabel('No of clusters')
14 plt.ylabel('Error')
15 plt.show()
In [8]:

1 kmeans3 = KMeans(n_clusters=3)
2 y_kmeans3 = kmeans3.fit_predict(X)
3 print(y_kmeans3)
4
5 kmeans3.cluster_centers_

[1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

1 1 1 1 1 1 1 1 1 1 1 1 1 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 2 2 2 2 0 2 2 2 2

2 2 0 0 2 2 2 2 0 2 0 2 0 2 2 0 0 2 2 2 2 2 0 2 2 2 2 0 2 2 2 0 2 2 2 0 2

2 0]

Out[8]:

array([[5.9016129 , 2.7483871 , 4.39354839, 1.43387097],

[5.006 , 3.418 , 1.464 , 0.244 ],

[6.85 , 3.07368421, 5.74210526, 2.07105263]])

Visualizing Clustering

Clustring Sepal Length and Sepal Width

In [9]:

1 plt.scatter(X[:, 0], X[:, 1], c=y_kmeans3, cmap="rainbow")

2 SepalLength = 5.1
3 SepalWidth = 3.5
4 plt.scatter(SepalLength, SepalWidth, cmap='rainbow', marker='*')
5 plt.title('KMeans clustering')
6 plt.xlabel('SepalLength')
7 plt.ylabel('SepalWidth')

Out[9]:

Text(0, 0.5, 'SepalWidth')

Clustering Petal Length and Petal Width

In [10]:

1 plt.scatter(X[:, 2], X[:, 3], c=y_kmeans3, cmap="rainbow")

2 PetalLength = 1.4
3 PetalWidth = 0.2
4 plt.scatter(PetalLength, PetalWidth, cmap='rainbow', marker='*')
5 plt.title('KMeans clustering')
6 plt.xlabel('PetalLength')
7 plt.ylabel('PetalWidth')

Out[10]:

Text(0, 0.5, 'PetalWidth')

DPN TSC Exam Quiz Pass
No ratings yet
DPN TSC Exam Quiz Pass
22 pages
BECE Computer Studies Objective Questions and Answer
0% (1)
BECE Computer Studies Objective Questions and Answer
5 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
5 pages
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
No ratings yet
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
12 pages
Day 5 Supervised Technique-Decision Tree For Classification PDF
100% (1)
Day 5 Supervised Technique-Decision Tree For Classification PDF
58 pages
Ensemble Methods Bagging Boosting and Stacking
100% (1)
Ensemble Methods Bagging Boosting and Stacking
19 pages
LDAP Directories Explained
No ratings yet
LDAP Directories Explained
291 pages
Chandigarh Group of Colleges College of Engineering Landran, Mohali
No ratings yet
Chandigarh Group of Colleges College of Engineering Landran, Mohali
47 pages
Ibm Enhances Its Advanced Project and Portfolio Management Capabilities Using Ibm Rational Portfolio Manager Compress
0% (1)
Ibm Enhances Its Advanced Project and Portfolio Management Capabilities Using Ibm Rational Portfolio Manager Compress
7 pages
02 ML Supervised Learning
No ratings yet
02 ML Supervised Learning
32 pages
Classification and Regression Trees
100% (1)
Classification and Regression Trees
60 pages
Intro SVM New Example PDF
100% (1)
Intro SVM New Example PDF
56 pages
Seminar Report Machine Learning
No ratings yet
Seminar Report Machine Learning
20 pages
Support Vector Machines PDF
100% (1)
Support Vector Machines PDF
37 pages
Unit - 4 Machine Learning
100% (1)
Unit - 4 Machine Learning
84 pages
Chapter 6 ML Classifications
100% (1)
Chapter 6 ML Classifications
51 pages
ML Projects For Final Year
No ratings yet
ML Projects For Final Year
7 pages
Sachin's DBA Blog - Automatic Storage Management (ASM)
No ratings yet
Sachin's DBA Blog - Automatic Storage Management (ASM)
14 pages
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
100% (1)
Heart: Our "Goal" Predict The Presence of Heart Disease in The Patient
73 pages
Linode UnderstandingDatabases ExtendedEdition
No ratings yet
Linode UnderstandingDatabases ExtendedEdition
259 pages
Support Vector Machines: Dominik Wisniewski Wojciech Wawrzyniak
No ratings yet
Support Vector Machines: Dominik Wisniewski Wojciech Wawrzyniak
16 pages
C2M2 - Assignment: 1 Risk Models Using Tree-Based Models
100% (1)
C2M2 - Assignment: 1 Risk Models Using Tree-Based Models
38 pages
08250771
No ratings yet
08250771
8 pages
Lecture Week 2 KNN and Model Evaluation PDF
100% (1)
Lecture Week 2 KNN and Model Evaluation PDF
53 pages
Lecture 9 PDF
100% (1)
Lecture 9 PDF
28 pages
K Means Clustering Lecture
No ratings yet
K Means Clustering Lecture
32 pages
REPORT On DECISION TREE
No ratings yet
REPORT On DECISION TREE
40 pages
Lecture 03 Gradient Descent
No ratings yet
Lecture 03 Gradient Descent
26 pages
1.linear Regression PSP
No ratings yet
1.linear Regression PSP
92 pages
Report GIS Application
No ratings yet
Report GIS Application
16 pages
K-Means and PCA
No ratings yet
K-Means and PCA
69 pages
02 - Decision Tree Classification On Iris Dataset
No ratings yet
02 - Decision Tree Classification On Iris Dataset
6 pages
Multicollinearity Exercise
100% (1)
Multicollinearity Exercise
6 pages
Ain Shams University Faculty of Engineering
No ratings yet
Ain Shams University Faculty of Engineering
2 pages
Already Table Exists in Oracle How To Append Table Row Using Data Pump - Google Search
No ratings yet
Already Table Exists in Oracle How To Append Table Row Using Data Pump - Google Search
2 pages
Unit V - Classification and Prediction 2020-21
100% (1)
Unit V - Classification and Prediction 2020-21
68 pages
Outline: Problem Statement Definitions & Examples Strategies
No ratings yet
Outline: Problem Statement Definitions & Examples Strategies
7 pages
Decision Support System: Unit 1
No ratings yet
Decision Support System: Unit 1
34 pages
An Introduction Of: Support Vector Machine
No ratings yet
An Introduction Of: Support Vector Machine
36 pages
Email Analysis Overview
No ratings yet
Email Analysis Overview
47 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
24 pages
Jntuk R20 ML Unit-Ii
No ratings yet
Jntuk R20 ML Unit-Ii
37 pages
Introduction To Tree Methods
No ratings yet
Introduction To Tree Methods
15 pages
K Means
No ratings yet
K Means
22 pages
Concepts and Techniques: Data Mining
No ratings yet
Concepts and Techniques: Data Mining
52 pages
Loading The Dataset: 'Churn - Modelling - CSV'
No ratings yet
Loading The Dataset: 'Churn - Modelling - CSV'
6 pages
Expectation Maximization
No ratings yet
Expectation Maximization
23 pages
L3 - Supervised and Unsupervised Learning
100% (3)
L3 - Supervised and Unsupervised Learning
24 pages
Heart Disease Prediction - Jupyter Notebook
100% (1)
Heart Disease Prediction - Jupyter Notebook
9 pages
ML0101EN Clas K Nearest Neighbors CustCat Py v1
100% (1)
ML0101EN Clas K Nearest Neighbors CustCat Py v1
11 pages
Support Vector Machines
No ratings yet
Support Vector Machines
14 pages
Exercise 4: Simple and Multiple Linear Regression Analysis
No ratings yet
Exercise 4: Simple and Multiple Linear Regression Analysis
15 pages
A Structured Approach To SQL Query Design
No ratings yet
A Structured Approach To SQL Query Design
21 pages
Unit 4
No ratings yet
Unit 4
4 pages
CH 6
No ratings yet
CH 6
72 pages
SVM
No ratings yet
SVM
12 pages
Oil Export Indonesia
100% (1)
Oil Export Indonesia
12 pages
Assignment No - 6-1
100% (1)
Assignment No - 6-1
3 pages
Link To New World
No ratings yet
Link To New World
2 pages
A Comparison of Classification Techniques On Prediction of Student Performance
No ratings yet
A Comparison of Classification Techniques On Prediction of Student Performance
6 pages
ML Unit-2
No ratings yet
ML Unit-2
26 pages
Chapter 1 Gis in The Web Era 1: Preface Ix Acknowledgments Xiii
No ratings yet
Chapter 1 Gis in The Web Era 1: Preface Ix Acknowledgments Xiii
3 pages
IFLA Web Forms: Status Message
No ratings yet
IFLA Web Forms: Status Message
4 pages
Recommendation System in Python
No ratings yet
Recommendation System in Python
13 pages
Matplotlib Fundamentals
No ratings yet
Matplotlib Fundamentals
31 pages
Clustering
No ratings yet
Clustering
8 pages
Unsupervised Learning 2024-PPG
No ratings yet
Unsupervised Learning 2024-PPG
85 pages
ET4248E - Chap9 - K-Means and GMM
No ratings yet
ET4248E - Chap9 - K-Means and GMM
27 pages
Chapter
100% (1)
Chapter
101 pages
Lecture 3 Data Mining
No ratings yet
Lecture 3 Data Mining
30 pages
Vinee
100% (1)
Vinee
28 pages
Operating Systems - File-System Interface
No ratings yet
Operating Systems - File-System Interface
13 pages
Sqlfordevscom Next Level Database Techniques For Developers 37 40
No ratings yet
Sqlfordevscom Next Level Database Techniques For Developers 37 40
4 pages
Data Preprocesing JavaPoint
No ratings yet
Data Preprocesing JavaPoint
19 pages
SHOE: A Platform For Semantic Web Language Usage and Analysis
No ratings yet
SHOE: A Platform For Semantic Web Language Usage and Analysis
5 pages
Template BBR, LC, HM 2023
No ratings yet
Template BBR, LC, HM 2023
2 pages
Biometric-Based Students Attendance System
No ratings yet
Biometric-Based Students Attendance System
21 pages
Department Of: Computer Science & Engineering
No ratings yet
Department Of: Computer Science & Engineering
4 pages
Unit - 1 Part - 2concept of Information Systems and Software
No ratings yet
Unit - 1 Part - 2concept of Information Systems and Software
30 pages
DBMS Complete Notes For Exam
No ratings yet
DBMS Complete Notes For Exam
21 pages
Introduction To Database Management Systems DBMS
100% (2)
Introduction To Database Management Systems DBMS
10 pages
DBMS Question Bank
No ratings yet
DBMS Question Bank
16 pages
SQL Post
No ratings yet
SQL Post
6 pages
ML Lecture 15 Ensemble
No ratings yet
ML Lecture 15 Ensemble
27 pages
Burrows-Wheeler Aligner
No ratings yet
Burrows-Wheeler Aligner
5 pages
085
No ratings yet
085
4 pages
ML Lab Report
No ratings yet
ML Lab Report
6 pages
Big Data Complete Notes
No ratings yet
Big Data Complete Notes
33 pages
DWDM Notes
No ratings yet
DWDM Notes
11 pages
DBMS Unit-4&5 10 Marks QB Answers
No ratings yet
DBMS Unit-4&5 10 Marks QB Answers
24 pages