
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050

The document discusses implementing two machine learning algorithms on an iris dataset: 1) A k-nearest neighbors algorithm is used to classify the iris data, achieving 96.67% accuracy. 2) A naive Bayes classifier is also implemented on the iris data, with its accuracy to be computed.


10/8/21, 1:09 PM 20190802050_DS_Lab4

AI/ML LAB-4
Name: Pratik Jadhav

PRN: 20190802050

AIM: To implement two algorithms on a data set and compute the accuracy score of the predictions

Q1. Write a program to implement k-Nearest Neighbour algorithm to classify the iris data
set. Print both correct and wrong predictions. Java/Python ML library classes can be used
for this problem.

In [1]:
%matplotlib inline
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

In [2]:
iris_data = pd.read_csv("Iris.csv")
iris_data.head()

Out[2]:
   Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm      Species
0   1            5.1           3.5            1.4           0.2  Iris-setosa
1   2            4.9           3.0            1.4           0.2  Iris-setosa
2   3            4.7           3.2            1.3           0.2  Iris-setosa
3   4            4.6           3.1            1.5           0.2  Iris-setosa
4   5            5.0           3.6            1.4           0.2  Iris-setosa

In [3]:
len(iris_data)

Out[3]:
150

In [4]:
iris_data.isna().sum()

Out[4]:
Id               0
SepalLengthCm    0
SepalWidthCm     0
PetalLengthCm    0
PetalWidthCm     0
Species          0
dtype: int64


In [5]:
X = iris_data.drop("Species", axis=1)
y = iris_data["Species"]
len(X), len(y)

Out[5]:
(150, 150)
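
Note that X as built above still contains the Id column, so the classifiers in this lab receive the row identifier as a feature. If the rows of the CSV are ordered by species (the first five rows shown above are all Iris-setosa), Id carries label information that has nothing to do with the flower measurements. A minimal sketch of excluding it follows. It rebuilds an equivalent frame from scikit-learn's bundled iris data, which is an assumption made only so the snippet runs without Iris.csv; the scores reported in this lab were not produced with it.

```python
import pandas as pd
from sklearn.datasets import load_iris

# Stand-in for pd.read_csv("Iris.csv"): same column names as the lab's file,
# built from scikit-learn's bundled copy of the data (an assumption).
bunch = load_iris()
iris_data = pd.DataFrame(
    bunch.data,
    columns=["SepalLengthCm", "SepalWidthCm", "PetalLengthCm", "PetalWidthCm"],
)
iris_data.insert(0, "Id", range(1, len(iris_data) + 1))
iris_data["Species"] = ["Iris-" + bunch.target_names[t] for t in bunch.target]

# Drop both the label and the row identifier from the feature matrix.
X = iris_data.drop(columns=["Id", "Species"])
y = iris_data["Species"]

print(X.columns.tolist())
print(len(X), len(y))
```

With this change only the four measurements remain as features, so any later accuracy figure reflects the measurements alone.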

In [6]:
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.2,
                                                    random_state=1)
clf = KNeighborsClassifier(n_neighbors=3)
clf.fit(X_train, y_train)
clf.score(X_test, y_test)

Out[6]:
0.9666666666666667
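
To make the idea behind KNeighborsClassifier(n_neighbors=3) concrete, here is a minimal from-scratch sketch of k-nearest-neighbour prediction with Euclidean distance and an unweighted majority vote. The function name knn_predict and the toy data are hypothetical, and this is not how scikit-learn implements the classifier internally (tie-breaking and neighbour search differ); it only illustrates the technique.

```python
from collections import Counter

import numpy as np


def knn_predict(X_train, y_train, x, k=3):
    """Predict the label of one sample by majority vote of its k nearest neighbours."""
    X_train = np.asarray(X_train, dtype=float)
    # Euclidean distance from x to every training sample.
    distances = np.linalg.norm(X_train - np.asarray(x, dtype=float), axis=1)
    # Indices of the k smallest distances.
    nearest = np.argsort(distances)[:k]
    # Count the labels of those neighbours and return the most frequent one.
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Toy data (hypothetical values, not taken from Iris.csv).
X_toy = [[1.0, 1.0], [1.2, 0.9], [0.9, 1.1], [5.0, 5.0], [5.1, 4.8], [4.9, 5.2]]
y_toy = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(X_toy, y_toy, [1.1, 1.0]))
print(knn_predict(X_toy, y_toy, [5.0, 5.1]))
```

Each query point sits inside one of the two clusters, so its three nearest neighbours all share a label and the vote is unanimous.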

In [7]:
y_preds = clf.predict(X_test)
y_preds[:10]

Out[7]:
array(['Iris-setosa', 'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa',
       'Iris-virginica', 'Iris-versicolor', 'Iris-virginica',
       'Iris-setosa', 'Iris-setosa', 'Iris-virginica'], dtype=object)

In [8]:
y_preds_proba = clf.predict_proba(X_test)
y_preds_proba[:10]

Out[8]:
array([[1., 0., 0.],
       [0., 1., 0.],
       [0., 1., 0.],
       [1., 0., 0.],
       [0., 0., 1.],
       [0., 1., 0.],
       [0., 0., 1.],
       [1., 0., 0.],
       [1., 0., 0.],
       [0., 0., 1.]])

In [9]:
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

accuracy = accuracy_score(y_preds, y_test)
print(f"The accuracy of the ML model for iris data: {accuracy * 100:.2f}%\n")
print(f"Classification Report:\n{classification_report(y_preds, y_test)}\n")
print(f"Confusion Matrix: \n{confusion_matrix(y_preds, y_test)}")

The accuracy of the ML model for iris data: 96.67%

Classification Report:
                 precision    recall  f1-score   support

    Iris-setosa       1.00      1.00      1.00        11
Iris-versicolor       0.92      1.00      0.96        12
 Iris-virginica       1.00      0.86      0.92         7

       accuracy                           0.97        30
      macro avg       0.97      0.95      0.96        30
   weighted avg       0.97      0.97      0.97        30

Confusion Matrix:
[[11  0  0]
 [ 0 12  0]
 [ 0  1  6]]
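
scikit-learn's metric functions are documented as taking the true labels first and the predictions second, whereas the cell above passes y_preds first. Accuracy is unaffected by the order, but the confusion matrix comes out transposed and the precision and recall columns of the report trade places. The sketch below shows the effect on a few toy labels (hypothetical values, not the lab's data).

```python
from sklearn.metrics import accuracy_score, confusion_matrix

# Toy labels (hypothetical): one true "b" sample is predicted as "a".
y_true = ["a", "a", "b", "b", "b"]
y_pred = ["a", "a", "a", "b", "b"]

# Accuracy does not depend on the argument order.
print(accuracy_score(y_true, y_pred), accuracy_score(y_pred, y_true))

# Documented order: rows are true labels, columns are predicted labels.
cm = confusion_matrix(y_true, y_pred)
# Swapped order: the same counts, transposed.
cm_swapped = confusion_matrix(y_pred, y_true)

print(cm)
print(cm_swapped)
```

Read with this in mind, the matrix printed above has predicted species on the rows and actual species on the columns, so its single off-diagonal entry is a sample predicted as Iris-virginica whose true label is Iris-versicolor.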

In [10]:
from sklearn.model_selection import cross_val_score

cvs = cross_val_score(clf, X, y)
print(cvs)
print(f"Mean of each testing data set: {np.mean(cvs) * 100:.2f}%")

[0.66666667 1.         1.         1.         0.7       ]
Mean of each testing data set: 87.33%
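
When cross_val_score is called on a classifier without a cv argument, recent scikit-learn versions (0.22 and later) use five stratified folds taken in file order, without shuffling, which is why five scores are printed above. The following is a sketch of making the splitter explicit and shuffling the rows first. It uses scikit-learn's bundled iris measurements rather than Iris.csv, which is an assumption, and the fold count and seed are illustrative choices; the scores reported above were not produced with it.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Bundled iris measurements stand in for Iris.csv (an assumption);
# this array holds only the four measurements, with no Id column.
X, y = load_iris(return_X_y=True)

clf = KNeighborsClassifier(n_neighbors=3)

# Explicit splitter: 5 stratified folds, shuffled with a fixed seed.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)
scores = cross_val_score(clf, X, y, cv=cv)

print(scores)
print(f"Mean accuracy over {len(scores)} folds: {np.mean(scores) * 100:.2f}%")
```

Passing the splitter explicitly documents how the folds were formed and keeps the result reproducible across library versions.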

In [11]:
y_testing = pd.Series(y_test).reset_index().drop("index", axis=1)
y_predictions = pd.Series(y_preds)

In [12]:
predictions_df = pd.DataFrame(data={
    "Species": y_testing["Species"],
    "Predicted Species": y_predictions
})

In [13]:
predicts = []
for index, i in enumerate(y_testing["Species"]):
    if i == y_preds[index]:
        predicts.append("Correct")
    else:
        predicts.append("Wrong")

In [14]:
predictions_df["Correct or Wrong"] = pd.Series(predicts)
predictions_df.head()

Out[14]:
           Species Predicted Species Correct or Wrong
0      Iris-setosa       Iris-setosa          Correct
1  Iris-versicolor   Iris-versicolor          Correct
2  Iris-versicolor   Iris-versicolor          Correct
3      Iris-setosa       Iris-setosa          Correct
4   Iris-virginica    Iris-virginica          Correct

In [15]:
print(f"Total Correct or Wrong Predictions:\n\
{predictions_df['Correct or Wrong'].value_counts()}")

Total Correct or Wrong Predictions:
Correct    29
Wrong       1
Name: Correct or Wrong, dtype: int64
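
Q1 asks for both the correct and the wrong predictions to be printed, while the cells above show only the first five rows and the totals. A possible way to label the rows without an explicit loop and then print each group is sketched below. The three toy samples are hypothetical stand-ins for y_test and y_preds, so the printed rows are not the lab's results.

```python
import numpy as np
import pandas as pd

# Toy stand-ins for y_test and y_preds (hypothetical values).
y_test = pd.Series(
    ["Iris-setosa", "Iris-versicolor", "Iris-virginica"], index=[14, 98, 75]
)
y_preds = np.array(["Iris-setosa", "Iris-virginica", "Iris-virginica"])

predictions_df = pd.DataFrame({
    "Species": y_test.to_numpy(),  # positional values, so the shuffled index is ignored
    "Predicted Species": y_preds,
})

# Element-wise comparison replaces the explicit loop.
predictions_df["Correct or Wrong"] = np.where(
    predictions_df["Species"] == predictions_df["Predicted Species"],
    "Correct",
    "Wrong",
)

correct = predictions_df[predictions_df["Correct or Wrong"] == "Correct"]
wrong = predictions_df[predictions_df["Correct or Wrong"] == "Wrong"]
print("Correct predictions:\n", correct, sep="")
print("Wrong predictions:\n", wrong, sep="")
```

Filtering on the label column gives the two groups directly, and the same frame still supports value_counts() for the totals.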

Q2. Write a program to implement the naïve Bayesian classifier for a sample training data
set stored as a .CSV file. Compute the accuracy of the classifier, considering a few test data
sets.

In [16]:
iris_data = pd.read_csv("Iris.csv")
iris_data.head()

Out[16]:
   Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm      Species
0   1            5.1           3.5            1.4           0.2  Iris-setosa
1   2            4.9           3.0            1.4           0.2  Iris-setosa
2   3            4.7           3.2            1.3           0.2  Iris-setosa
3   4            4.6           3.1            1.5           0.2  Iris-setosa
4   5            5.0           3.6            1.4           0.2  Iris-setosa

In [17]:
X = iris_data.drop("Species", axis=1)
y = iris_data["Species"]
len(X), len(y)

Out[17]:
(150, 150)

In [18]:
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.3,
                                                    random_state=1)
gnb = GaussianNB()
gnb.fit(X_train, y_train)
gnb.score(X_test, y_test)

Out[18]:
1.0
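
For readers who want to see what a Gaussian naive Bayes classifier computes, here is a minimal from-scratch sketch: per class it estimates a prior and a mean and variance for each feature, then predicts the class with the largest log-posterior under the assumption that features are independent given the class. The function names, the fixed 1e-9 variance offset, and the toy data are hypothetical simplifications; GaussianNB handles variance smoothing differently, so this is an illustration of the technique rather than a reimplementation of the library.

```python
import numpy as np


def fit_gaussian_nb(X, y):
    """Estimate per-class priors, feature means and feature variances."""
    X, y = np.asarray(X, dtype=float), np.asarray(y)
    params = {}
    for label in np.unique(y):
        rows = X[y == label]
        params[label] = (
            len(rows) / len(X),       # prior P(class)
            rows.mean(axis=0),        # per-feature mean
            rows.var(axis=0) + 1e-9,  # per-feature variance; offset avoids division by zero
        )
    return params


def predict_gaussian_nb(params, x):
    """Return the class with the highest log-posterior for one sample."""
    x = np.asarray(x, dtype=float)
    best_label, best_score = None, -np.inf
    for label, (prior, mean, var) in params.items():
        # log P(class) plus the sum of log Gaussian densities, one per feature.
        log_likelihood = -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var)
        score = np.log(prior) + log_likelihood
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy data (hypothetical values, not taken from Iris.csv).
X_toy = [[1.0, 2.0], [1.2, 1.8], [0.8, 2.1], [4.0, 5.0], [4.2, 5.1], [3.9, 4.8]]
y_toy = ["a", "a", "a", "b", "b", "b"]

model = fit_gaussian_nb(X_toy, y_toy)
print(predict_gaussian_nb(model, [1.1, 2.0]))
print(predict_gaussian_nb(model, [4.1, 4.9]))
```

Working in log space keeps the product of many small densities from underflowing, which is the usual reason for summing log-probabilities instead of multiplying probabilities.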

In [19]:
y_preds = gnb.predict(X_test)
y_preds[:10]

Out[19]:
array(['Iris-setosa', 'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa',
       'Iris-virginica', 'Iris-versicolor', 'Iris-virginica',
       'Iris-setosa', 'Iris-setosa', 'Iris-virginica'], dtype='<U15')

In [20]:
from sklearn.metrics import accuracy_score

accuracy = accuracy_score(y_preds, y_test)
print(f"The accuracy of the ML model for iris data: {accuracy * 100:.2f}%")

The accuracy of the ML model for iris data: 100.00%

In [21]:
from sklearn.model_selection import cross_val_score

cvs = cross_val_score(gnb, X, y)
print(cvs)
print(f"Mean of each testing data set: {np.mean(cvs) * 100:.2f}%")

[0.96666667 1.         1.         1.         1.        ]
Mean of each testing data set: 99.33%

In [22]:
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

accuracy = accuracy_score(y_preds, y_test)
print(f"The accuracy of the ML model for iris data: {accuracy * 100:.2f}%\n")
print(f"Classification Report:\n{classification_report(y_preds, y_test)}\n")
print(f"Confusion Matrix: \n{confusion_matrix(y_preds, y_test)}")

The accuracy of the ML model for iris data: 100.00%

Classification Report:
                 precision    recall  f1-score   support

    Iris-setosa       1.00      1.00      1.00        14
Iris-versicolor       1.00      1.00      1.00        18
 Iris-virginica       1.00      1.00      1.00        13

       accuracy                           1.00        45
      macro avg       1.00      1.00      1.00        45
   weighted avg       1.00      1.00      1.00        45

Confusion Matrix:
[[14  0  0]
 [ 0 18  0]
 [ 0  0 13]]

Conclusion: We implemented the k-nearest neighbours and Gaussian naive Bayes algorithms on
the iris data set and evaluated their predictions with the accuracy score, the classification
report, and the confusion matrix. The k-nearest neighbours classifier reached an accuracy of
96.67% on the held-out test set and a mean of 87.33% across the five cross-validation folds.
The naive Bayes classifier reached 100% on its held-out test set and a mean of 99.33% across
the five cross-validation folds.

