0% found this document useful (0 votes)

37 views5 pages

SVM and Kmeans - Iris Dataset - Ipynb - Colab

Uploaded by

termp89

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views5 pages

SVM and Kmeans - Iris Dataset - Ipynb - Colab

Uploaded by

termp89

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

11/29/24, 9:30 PM SVM and Kmeans -Iris dataset.

ipynb - Colab

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
import matplotlib.pyplot as plt
import seaborn as sns

!kaggle datasets download -d uciml/iris

Dataset URL: https://www.kaggle.com/datasets/uciml/iris

License(s): CC0-1.0
Downloading iris.zip to /content
0% 0.00/3.60k [00:00<?, ?B/s]
100% 3.60k/3.60k [00:00<00:00, 7.28MB/s]

Loading of the dataset and creating dataframe

!unzip iris.zip

Archive: iris.zip
inflating: Iris.csv
inflating: database.sqlite

df = pd.read_csv('Iris.csv')
print(df.head())

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa
1 2 4.9 3.0 1.4 0.2 Iris-setosa
2 3 4.7 3.2 1.3 0.2 Iris-setosa
3 4 4.6 3.1 1.5 0.2 Iris-setosa
4 5 5.0 3.6 1.4 0.2 Iris-setosa

Changing categorical to numbers

df['Species'] = df['Species'].astype('category').cat.codes

Selection of columns and assigning to X and Y

X = df.iloc[:, :-1].values
y = df.iloc[:, -1].values
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
print("Training set shape:", X_train.shape)
print("Test set shape:", X_test.shape)

Training set shape: (120, 5)

Test set shape: (30, 5)

Training of the SVM Model

svm_model = SVC(kernel='linear', C=1.0, random_state=42)

Model fitting and Prediction

svm_model.fit(X_train, y_train)

y_pred = svm_model.predict(X_test)

Evaluation Metrics and Parameters

accuracy = accuracy_score(y_test, y_pred)

print("Accuracy:", accuracy)
print("\nClassification Report:")
print(classification_report(y_test, y_pred))

Accuracy: 1.0

Classification Report:
precision recall f1-score support

0 1.00 1.00 1.00 10

https://colab.research.google.com/drive/1kDkVaGxeyPshe6mgQPxShanNabVTF1v_#scrollTo=oHHbeiXRnXVu&printMode=true 1/5
11/29/24, 9:30 PM SVM and Kmeans -Iris dataset.ipynb - Colab
1 1.00 1.00 1.00 9
2 1.00 1.00 1.00 11

accuracy 1.00 30
macro avg 1.00 1.00 1.00 30
weighted avg 1.00 1.00 1.00 30

Confusion Matrix

conf_matrix = confusion_matrix(y_test, y_pred)

print("\nConfusion Matrix:")
print(conf_matrix)

Confusion Matrix:
[[10 0 0]
[ 0 9 0]
[ 0 0 11]]

HeatMap

sns.heatmap(conf_matrix, annot=True, cmap="YlGnBu", fmt='g')

plt.title("Confusion Matrix")
plt.xlabel("Predicted")
plt.ylabel("Actual")
plt.show()

K MEANS Implementation

import numpy as np

df=pd.read_csv('/content/Iris.csv')
df.head()

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

Next steps: Generate code with df

toggle_off View recommended plots New interactive sheet

df.info()

https://colab.research.google.com/drive/1kDkVaGxeyPshe6mgQPxShanNabVTF1v_#scrollTo=oHHbeiXRnXVu&printMode=true 2/5
11/29/24, 9:30 PM SVM and Kmeans -Iris dataset.ipynb - Colab
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 150 entries, 0 to 149
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Id 150 non-null int64
1 SepalLengthCm 150 non-null float64
2 SepalWidthCm 150 non-null float64
3 PetalLengthCm 150 non-null float64
4 PetalWidthCm 150 non-null float64
5 Species 150 non-null object
dtypes: float64(4), int64(1), object(1)
memory usage: 7.2+ KB

df.drop(['Id'] ,axis=1, inplace=True)

df.isnull().sum()

SepalLengthCm 0

SepalWidthCm 0

PetalLengthCm 0

PetalWidthCm 0

Species 0

dtype: int64

df.describe()

SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm

count 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.054000 3.758667 1.198667

std 0.828066 0.433594 1.764420 0.763161

min 4.300000 2.000000 1.000000 0.100000

25% 5.100000 2.800000 1.600000 0.300000

50% 5.800000 3.000000 4.350000 1.300000

75% 6.400000 3.300000 5.100000 1.800000

max 7.900000 4.400000 6.900000 2.500000

df.head()

SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 5.1 3.5 1.4 0.2 Iris-setosa

1 4.9 3.0 1.4 0.2 Iris-setosa

2 4.7 3.2 1.3 0.2 Iris-setosa

3 4.6 3.1 1.5 0.2 Iris-setosa

4 5.0 3.6 1.4 0.2 Iris-setosa

Next steps: Generate code with df

toggle_off View recommended plots New interactive sheet

df_imp = df.iloc[:,0:4]
from sklearn.cluster import KMeans
k_meansclus = range(1,10)
sse = []

for k in k_meansclus :
km = KMeans(n_clusters =k)
km.fit(df_imp)
sse.append(km.inertia_)

plt.title('The Elbow Method')

plt.plot(k_meansclus,sse)
plt.show()

https://colab.research.google.com/drive/1kDkVaGxeyPshe6mgQPxShanNabVTF1v_#scrollTo=oHHbeiXRnXVu&printMode=true 3/5
11/29/24, 9:30 PM SVM and Kmeans -Iris dataset.ipynb - Colab

km1 = KMeans(n_clusters=3,max_iter=300 , random_state=0)

km1.fit(df_imp)
y_means = km1.fit_predict(df_imp)

km1.cluster_centers_

array([[5.88360656, 2.74098361, 4.38852459, 1.43442623],

[5.006 , 3.418 , 1.464 , 0.244 ],
[6.85384615, 3.07692308, 5.71538462, 2.05384615]])

df_imp = np.array(df_imp)

plt.scatter(df_imp[y_means==0,2 ],df_imp[y_means==0,3 ], color='g' , label='Iris-versicolor ')

plt.scatter(df_imp[y_means==1,2 ],df_imp[y_means==1,3 ], color='r' , label='Iris-setosa')
plt.scatter(df_imp[y_means==2,2 ],df_imp[y_means==2,3 ], color='b', label='Iris-virginica')
plt.legend()
plt.show()

plt.scatter(df_imp[y_means==0,0 ],df_imp[y_means==0,1], color='g' , label='Iris-versicolor ')

plt.scatter(df_imp[y_means==1,0 ],df_imp[y_means==1,1 ], color='r' , label='Iris-setosa')
plt.scatter(df_imp[y_means==2,0 ],df_imp[y_means==2,1 ], color='b', label='Iris-virginica')

plt.legend()
plt.show()

https://colab.research.google.com/drive/1kDkVaGxeyPshe6mgQPxShanNabVTF1v_#scrollTo=oHHbeiXRnXVu&printMode=true 4/5
11/29/24, 9:30 PM SVM and Kmeans -Iris dataset.ipynb - Colab

https://colab.research.google.com/drive/1kDkVaGxeyPshe6mgQPxShanNabVTF1v_#scrollTo=oHHbeiXRnXVu&printMode=true 5/5

(Livestock Health Ii (Livestock Parasites)
No ratings yet
(Livestock Health Ii (Livestock Parasites)
5 pages
Elementary Surveying
75% (8)
Elementary Surveying
36 pages
Iris Flower Classification Project
No ratings yet
Iris Flower Classification Project
9 pages
Assignment No - 6-1
100% (1)
Assignment No - 6-1
3 pages
Random Forest 1737667979
No ratings yet
Random Forest 1737667979
11 pages
Dsbda Ouput 1-10
No ratings yet
Dsbda Ouput 1-10
89 pages
Codes and Other Relevant Explanations For Supervised Learning (Part 1) - Session by Sabyasachi Mukhopadhyay - August 3
No ratings yet
Codes and Other Relevant Explanations For Supervised Learning (Part 1) - Session by Sabyasachi Mukhopadhyay - August 3
5 pages
Postmodernism and Biology in John Fowles S The French Lieutenant's Woman
No ratings yet
Postmodernism and Biology in John Fowles S The French Lieutenant's Woman
23 pages
Assignment - 10 - Pandas
No ratings yet
Assignment - 10 - Pandas
53 pages
Towngas ESG Report 2020
No ratings yet
Towngas ESG Report 2020
59 pages
Somali Aggregate
No ratings yet
Somali Aggregate
120 pages
Class 9 Economics Project On Toothpaste
No ratings yet
Class 9 Economics Project On Toothpaste
12 pages
Noun and Question Tag
No ratings yet
Noun and Question Tag
8 pages
SA Health Cleaning Standard 2014 - (v1.1) CDCB Ics 20180301 PDF
No ratings yet
SA Health Cleaning Standard 2014 - (v1.1) CDCB Ics 20180301 PDF
48 pages
Rotational Mechanics
No ratings yet
Rotational Mechanics
17 pages
ML LabReport Final Index Edited
No ratings yet
ML LabReport Final Index Edited
35 pages
19mid0034 (Chandru) - ML Lab Fat - Jupyter Notebook
No ratings yet
19mid0034 (Chandru) - ML Lab Fat - Jupyter Notebook
4 pages
Citizen CLP-8301 Technical Manual
No ratings yet
Citizen CLP-8301 Technical Manual
259 pages
ML Keshav
No ratings yet
ML Keshav
23 pages
Lab Manual ML
No ratings yet
Lab Manual ML
23 pages
Chapter 6 (Convective Heat Transfer Only)
No ratings yet
Chapter 6 (Convective Heat Transfer Only)
28 pages
Iris Flower Classification
No ratings yet
Iris Flower Classification
47 pages
Eurotile Pricelist2015 3
No ratings yet
Eurotile Pricelist2015 3
147 pages
Chemistry Lab Report 3
No ratings yet
Chemistry Lab Report 3
22 pages
ML Lab Programs
No ratings yet
ML Lab Programs
23 pages
Nano Sweep BT
No ratings yet
Nano Sweep BT
38 pages
Lab Manual
No ratings yet
Lab Manual
32 pages
AML Lab3 2021wb15156
No ratings yet
AML Lab3 2021wb15156
13 pages
Mlpy 2
No ratings yet
Mlpy 2
18 pages
Release Notes Isatis - Neo 2020.06: Last Update: June 28, 2020
No ratings yet
Release Notes Isatis - Neo 2020.06: Last Update: June 28, 2020
35 pages
UHPC White Paper
No ratings yet
UHPC White Paper
32 pages
ML L - Ab
No ratings yet
ML L - Ab
13 pages
Practical 5
No ratings yet
Practical 5
11 pages
ABHAYMLFILE
No ratings yet
ABHAYMLFILE
16 pages
Curves Math 473 Introduction To Differential Geometry: Dr. Nasser Bin Turki
No ratings yet
Curves Math 473 Introduction To Differential Geometry: Dr. Nasser Bin Turki
14 pages
ML N PY Programs
No ratings yet
ML N PY Programs
17 pages
03 Q1M1 Chapter 3 Research Methodology
No ratings yet
03 Q1M1 Chapter 3 Research Methodology
47 pages
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
No ratings yet
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
8 pages
EXP 9 DWM - Merged
No ratings yet
EXP 9 DWM - Merged
11 pages
4c Sklearn-Classification-Regression-Bkhw-Spring 2019
No ratings yet
4c Sklearn-Classification-Regression-Bkhw-Spring 2019
20 pages
L6 Tutorial - KNN - Jupyter Notebook
No ratings yet
L6 Tutorial - KNN - Jupyter Notebook
7 pages
Ts X Biology Final Exam Revision 2023-24
No ratings yet
Ts X Biology Final Exam Revision 2023-24
7 pages
PR 6
No ratings yet
PR 6
6 pages
KNN - Jupyter Notebook
No ratings yet
KNN - Jupyter Notebook
8 pages
Iris - Copy1 - Jupyter Notebook
No ratings yet
Iris - Copy1 - Jupyter Notebook
8 pages
Dsbda Assig 6 Data Analytcs 3
No ratings yet
Dsbda Assig 6 Data Analytcs 3
6 pages
Experiment-2-1-Ml Kritika
No ratings yet
Experiment-2-1-Ml Kritika
11 pages
Import As Import As Import As From Import Import As Import
No ratings yet
Import As Import As Import As From Import Import As Import
7 pages
E23CSEU2241 LAB9 Data Mining
No ratings yet
E23CSEU2241 LAB9 Data Mining
5 pages
KPMG 2024 Zimbabwe National Budget Highlights
No ratings yet
KPMG 2024 Zimbabwe National Budget Highlights
11 pages
ML Lab Manual
No ratings yet
ML Lab Manual
6 pages
Trần Mạnh Hùng 20192643.Ipynb - Colab
No ratings yet
Trần Mạnh Hùng 20192643.Ipynb - Colab
6 pages
KNN ALGORITHM - Ipynb - Colab
No ratings yet
KNN ALGORITHM - Ipynb - Colab
4 pages
Lab06 KNN 01
No ratings yet
Lab06 KNN 01
3 pages
L3 - Classification - RandomForest - Jupyter Notebook
No ratings yet
L3 - Classification - RandomForest - Jupyter Notebook
6 pages
Iris - Ipynb - Colaboratory
No ratings yet
Iris - Ipynb - Colaboratory
8 pages
TranMinhTu1 bt2 2
No ratings yet
TranMinhTu1 bt2 2
5 pages
Pra 8
No ratings yet
Pra 8
4 pages
Cota12 6
No ratings yet
Cota12 6
4 pages
Seed Germination Chamber
No ratings yet
Seed Germination Chamber
6 pages
DSBDA6
No ratings yet
DSBDA6
3 pages
Practical No - 1
No ratings yet
Practical No - 1
5 pages
Delhi Public School Vadodara: Academic Session 2024-2025 Practice Paper-14
No ratings yet
Delhi Public School Vadodara: Academic Session 2024-2025 Practice Paper-14
5 pages
ML 1
No ratings yet
ML 1
4 pages
Iris - Regression - Jupyter Notebook
No ratings yet
Iris - Regression - Jupyter Notebook
5 pages
DS 6
No ratings yet
DS 6
2 pages
NaiveBayesClassifier - Jupyter Notebook
No ratings yet
NaiveBayesClassifier - Jupyter Notebook
2 pages
343 Kokutai
No ratings yet
343 Kokutai
2 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
SVM and KNN
No ratings yet
SVM and KNN
3 pages
Model - Ipynb - Colaboratory
No ratings yet
Model - Ipynb - Colaboratory
3 pages
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
No ratings yet
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
5 pages
02 - Decision Tree Classification On Iris Dataset
No ratings yet
02 - Decision Tree Classification On Iris Dataset
6 pages
30 - 11 - 24 - Ensemble - Based Learning
No ratings yet
30 - 11 - 24 - Ensemble - Based Learning
1 page
Jumper (3.5e Class) - D&D Wiki
No ratings yet
Jumper (3.5e Class) - D&D Wiki
8 pages
BDA pr2
No ratings yet
BDA pr2
2 pages
Iris - Ipynb - Colab
No ratings yet
Iris - Ipynb - Colab
1 page
Machine Learning Algorithm
No ratings yet
Machine Learning Algorithm
18 pages
Xerox Workcentre 5735 / 5740 / 5745 / 5755 Multifunction Printer
No ratings yet
Xerox Workcentre 5735 / 5740 / 5745 / 5755 Multifunction Printer
8 pages
b21 DSBDA Assignment No 10
No ratings yet
b21 DSBDA Assignment No 10
1 page
PGM 7
No ratings yet
PGM 7
3 pages
Comparison of Classifiers
No ratings yet
Comparison of Classifiers
6 pages
Chem Lab Report
No ratings yet
Chem Lab Report
6 pages
SPARX 10 - "Kellogg's Mate" - Why It Failed
No ratings yet
SPARX 10 - "Kellogg's Mate" - Why It Failed
3 pages
Support Vector Machine
No ratings yet
Support Vector Machine
7 pages
Response of Newly Collected Acetobacter Isolates in Sweet Corn (Zea Mays L. Saccharata)
No ratings yet
Response of Newly Collected Acetobacter Isolates in Sweet Corn (Zea Mays L. Saccharata)
5 pages
Ap2 F.E BLUE PRINT (OLD SYL 2007)
No ratings yet
Ap2 F.E BLUE PRINT (OLD SYL 2007)
4 pages
Stem Cell Reflection
No ratings yet
Stem Cell Reflection
2 pages
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)

SVM and Kmeans - Iris Dataset - Ipynb - Colab

Uploaded by

SVM and Kmeans - Iris Dataset - Ipynb - Colab

Uploaded by

11/29/24, 9:30 PM SVM and Kmeans -Iris dataset.

!kaggle datasets download -d uciml/iris

Dataset URL: https://www.kaggle.com/datasets/uciml/iris

Loading of the dataset and creating dataframe

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

Changing categorical to numbers

Selection of columns and assigning to X and Y

Training set shape: (120, 5)

Training of the SVM Model

svm_model = SVC(kernel='linear', C=1.0, random_state=42)

Model fitting and Prediction

Evaluation Metrics and Parameters

accuracy = accuracy_score(y_test, y_pred)

0 1.00 1.00 1.00 10

conf_matrix = confusion_matrix(y_test, y_pred)

sns.heatmap(conf_matrix, annot=True, cmap="YlGnBu", fmt='g')

Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

Next steps: Generate code with df

df.drop(['Id'] ,axis=1, inplace=True)

SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm

count 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.054000 3.758667 1.198667

std 0.828066 0.433594 1.764420 0.763161

min 4.300000 2.000000 1.000000 0.100000

25% 5.100000 2.800000 1.600000 0.300000

50% 5.800000 3.000000 4.350000 1.300000

75% 6.400000 3.300000 5.100000 1.800000

max 7.900000 4.400000 6.900000 2.500000

SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 5.1 3.5 1.4 0.2 Iris-setosa

1 4.9 3.0 1.4 0.2 Iris-setosa

2 4.7 3.2 1.3 0.2 Iris-setosa

3 4.6 3.1 1.5 0.2 Iris-setosa

4 5.0 3.6 1.4 0.2 Iris-setosa

Next steps: Generate code with df

plt.title('The Elbow Method')

km1 = KMeans(n_clusters=3,max_iter=300 , random_state=0)

array([[5.88360656, 2.74098361, 4.38852459, 1.43442623],

plt.scatter(df_imp[y_means==0,2 ],df_imp[y_means==0,3 ], color='g' , label='Iris-versicolor ')

plt.scatter(df_imp[y_means==0,0 ],df_imp[y_means==0,1], color='g' , label='Iris-versicolor ')

You might also like