0% found this document useful (0 votes)

4 views4 pages

KNN

The document outlines a KNN (K-Nearest Neighbors) classification example using the Iris dataset, focusing on sepal length and width for visualization. It includes steps for data loading, preprocessing, feature scaling, training the KNN model, and evaluating its performance using metrics like confusion matrix and classification report. The model achieved perfect accuracy on the test data with a confusion matrix showing no misclassifications.

Uploaded by

abdelazizasma80

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views4 pages

KNN

Uploaded by

abdelazizasma80

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

10/04/2022 07:51 KNN

KNN
In [17]:

import matplotlib.pyplot as plt

import numpy as np
import pandas as pd
import seaborn as sns

from sklearn import datasets

from sklearn.model_selection import train_test_split , KFold
from sklearn.preprocessing import Normalizer
from sklearn.metrics import accuracy_score
from sklearn.neighbors import KNeighborsClassifier
import matplotlib.pyplot as plt # import de Matplotlib
from collections import Counter

We are going to use a very famous dataset called Iris. Attributes: sepal length in cm sepal width in cm petal
length in cm petal width in cm We will just use two features for easier visualization, sepal length and width. Class:
Iris Setosa Iris Versicolour Iris Virginica #Load the Dataset

In [3]:

# import iris dataset

iris = datasets.load_iris()
# np.c_ is the numpy concatenate function
iris_df = pd.DataFrame(data= np.c_[iris['data'], iris['target']],
columns= iris['feature_names'] + ['target'])
iris_df.head()

Out[3]:

sepal length (cm) sepal width (cm) petal length (cm) petal width (cm) target

0 5.1 3.5 1.4 0.2 0.0

1 4.9 3.0 1.4 0.2 0.0

2 4.7 3.2 1.3 0.2 0.0

3 4.6 3.1 1.5 0.2 0.0

4 5.0 3.6 1.4 0.2 0.0

file:///C:/Users/pc/Downloads/KNN.html 1/4
10/04/2022 07:51 KNN

In [4]:

iris_df.describe()

Out[4]:

sepal length (cm) sepal width (cm) petal length (cm) petal width (cm) target

count 150.000000 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.057333 3.758000 1.199333 1.000000

std 0.828066 0.435866 1.765298 0.762238 0.819232

min 4.300000 2.000000 1.000000 0.100000 0.000000

25% 5.100000 2.800000 1.600000 0.300000 0.000000

50% 5.800000 3.000000 4.350000 1.300000 1.000000

75% 6.400000 3.300000 5.100000 1.800000 2.000000

max 7.900000 4.400000 6.900000 2.500000 2.000000

In [7]:

iris_df = pd.DataFrame(iris.data, columns=iris.feature_names)

x=pd.DataFrame(iris.data)

y=pd.DataFrame(iris.target)

x.columns=['Sepal_Length','Sepal_width','Petal_Length','Petal_width']

Out[7]:

Sepal_Length Sepal_width Petal_Length Petal_width

0 5.1 3.5 1.4 0.2

1 4.9 3.0 1.4 0.2

2 4.7 3.2 1.3 0.2

3 4.6 3.1 1.5 0.2

4 5.0 3.6 1.4 0.2

... ... ... ... ...

145 6.7 3.0 5.2 2.3

146 6.3 2.5 5.0 1.9

147 6.5 3.0 5.2 2.0

148 6.2 3.4 5.4 2.3

149 5.9 3.0 5.1 1.8

150 rows × 4 columns

file:///C:/Users/pc/Downloads/KNN.html 2/4
10/04/2022 07:51 KNN

In [22]:

--------------------------------------------------------------------------
-
NameError Traceback (most recent call las
t)
<ipython-input-22-668994d18e71> in <module>
----> 1 X = dataset.iloc[:, :-1].values
2 y = dataset.iloc[:, 4].values

NameError: name 'dataset' is not defined

In [8]:

y.columns=['Targets']
y

Out[8]:

Targets

0 0

1 0

2 0

3 0

4 0

... ...

145 2

146 2

147 2

148 2

149 2

150 rows × 1 columns

In [20]:

#train test split

from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(x, y, test_size=0.20)

Feature Scaling Before making any actual predictions, it is always a good practice to scale the features so that all
of them can be uniformly evaluated.

In [21]:

from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
scaler.fit(X_train)

X_train = scaler.transform(X_train)
X_test = scaler.transform(X_test)

file:///C:/Users/pc/Downloads/KNN.html 3/4
10/04/2022 07:51 KNN

Training and Predictions It is extremely straight forward to train the KNN algorithm and make predictions with it,
especially when using Scikit-Learn.

In [23]:

#Create KNN Classifier

#Number of neighbors to use by default for kneighbors queries.
from sklearn.neighbors import KNeighborsClassifier
classifier = KNeighborsClassifier(n_neighbors=5)
classifier.fit(X_train, y_train)

C:\ProgramData\Anaconda3\lib\site-packages\ipykernel_launcher.py:3: DataCo
nversionWarning: A column-vector y was passed when a 1d array was expecte
d. Please change the shape of y to (n_samples, ), for example using ravel
().
This is separate from the ipykernel package so we can avoid doing import
s until

Out[23]:

KNeighborsClassifier(algorithm='auto', leaf_size=30, metric='minkowski',

metric_params=None, n_jobs=None, n_neighbors=5, p=2,
weights='uniform')

The final step is to make predictions on our test data.

In [24]:

y_pred = classifier.predict(X_test)

Evaluating the Algorithm For evaluating an algorithm, confusion matrix, precision, recall and f1 score are the
most commonly used metrics. The confusion_matrix and classification_report methods of the sklearn.metrics can
be used to calculate these metrics.

In [25]:

from sklearn.metrics import classification_report, confusion_matrix

print(confusion_matrix(y_test, y_pred))
print(classification_report(y_test, y_pred))

[[11 0 0]
[ 0 10 0]
[ 0 0 9]]
precision recall f1-score support

0 1.00 1.00 1.00 11

1 1.00 1.00 1.00 10
2 1.00 1.00 1.00 9

accuracy 1.00 30
macro avg 1.00 1.00 1.00 30
weighted avg 1.00 1.00 1.00 30

In [ ]:

file:///C:/Users/pc/Downloads/KNN.html 4/4

Codes and Other Relevant Explanations For Supervised Learning (Part 1) - Session by Sabyasachi Mukhopadhyay - August 3
No ratings yet
Codes and Other Relevant Explanations For Supervised Learning (Part 1) - Session by Sabyasachi Mukhopadhyay - August 3
5 pages
Class X - Artificial Intelligence - Evaluation - Question Bank
83% (6)
Class X - Artificial Intelligence - Evaluation - Question Bank
8 pages
Rahul Raj - Ipynb - Colab
No ratings yet
Rahul Raj - Ipynb - Colab
50 pages
Machine Learning With Python - Machine Learning Algorithms - KNN
No ratings yet
Machine Learning With Python - Machine Learning Algorithms - KNN
15 pages
ML Lab 8
No ratings yet
ML Lab 8
9 pages
Pratham ML
No ratings yet
Pratham ML
14 pages
Experiment 7 Ids
No ratings yet
Experiment 7 Ids
12 pages
L6 Tutorial - KNN - Jupyter Notebook
No ratings yet
L6 Tutorial - KNN - Jupyter Notebook
7 pages
Practical 5
No ratings yet
Practical 5
11 pages
ML Keshav
No ratings yet
ML Keshav
23 pages
ML - Labtask5.ipynb - K - Colab
No ratings yet
ML - Labtask5.ipynb - K - Colab
8 pages
FDS Lab Manual
No ratings yet
FDS Lab Manual
10 pages
ML Yogesh
No ratings yet
ML Yogesh
23 pages
KNN Rainfall
No ratings yet
KNN Rainfall
9 pages
Implementing KNN Algorithm On The Iris Dataset
No ratings yet
Implementing KNN Algorithm On The Iris Dataset
7 pages
Bi 6 New
No ratings yet
Bi 6 New
6 pages
ML Lab Manual
No ratings yet
ML Lab Manual
6 pages
Lab Manual ML
No ratings yet
Lab Manual ML
23 pages
Dsbda Assignment 6
No ratings yet
Dsbda Assignment 6
5 pages
KNN Colab Illustration
No ratings yet
KNN Colab Illustration
5 pages
SPPUML5
No ratings yet
SPPUML5
4 pages
KNN ALGORITHM - Ipynb - Colab
No ratings yet
KNN ALGORITHM - Ipynb - Colab
4 pages
Module 3 Lab 2
No ratings yet
Module 3 Lab 2
6 pages
Mnbnmnbnnmbbhhuyrgh
No ratings yet
Mnbnmnbnnmbbhhuyrgh
3 pages
'Classified Data': Import As Import As Import As Import As
No ratings yet
'Classified Data': Import As Import As Import As Import As
3 pages
It - S All About Neighbors - Completed
No ratings yet
It - S All About Neighbors - Completed
14 pages
AIML - ECE304 - Assign-2 - Kartikeya - Kandpal - Ajitesh - S.ipynb - Colab
No ratings yet
AIML - ECE304 - Assign-2 - Kartikeya - Kandpal - Ajitesh - S.ipynb - Colab
3 pages
ML - Lab-8.ipynb - Colab
No ratings yet
ML - Lab-8.ipynb - Colab
4 pages
KNN Classifier
No ratings yet
KNN Classifier
5 pages
Python Code For KNN Classifier 1. Initial Message
No ratings yet
Python Code For KNN Classifier 1. Initial Message
7 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
7 pages
KNN - Predictive Analysis
No ratings yet
KNN - Predictive Analysis
6 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
ML Lab Programs 2
No ratings yet
ML Lab Programs 2
16 pages
ML Project
No ratings yet
ML Project
2 pages
Week 11 KNN
No ratings yet
Week 11 KNN
5 pages
Machine Learning Assignment 3
No ratings yet
Machine Learning Assignment 3
7 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
Lab Program 9
No ratings yet
Lab Program 9
5 pages
Import As Import As Import As Import As From Import
No ratings yet
Import As Import As Import As Import As From Import
3 pages
Prac7 23bme053
No ratings yet
Prac7 23bme053
2 pages
Lab06 KNN 01
No ratings yet
Lab06 KNN 01
3 pages
Experiment 7
No ratings yet
Experiment 7
3 pages
ML Functions
No ratings yet
ML Functions
12 pages
AIML Lab 3 4
No ratings yet
AIML Lab 3 4
5 pages
Week10 KNN Practical
No ratings yet
Week10 KNN Practical
4 pages
Updated K-Nearest Neighbors in Machine Learning
No ratings yet
Updated K-Nearest Neighbors in Machine Learning
11 pages
Machine Learnin1
100% (1)
Machine Learnin1
41 pages
Dhanashree ML Report
No ratings yet
Dhanashree ML Report
3 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
8 pages
Implementing KNN Algorithm: Importing Libraries
No ratings yet
Implementing KNN Algorithm: Importing Libraries
6 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
Aam Codes
No ratings yet
Aam Codes
8 pages
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
No ratings yet
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
5 pages
Exercise and Experiment 3
No ratings yet
Exercise and Experiment 3
14 pages
Practical 6
No ratings yet
Practical 6
8 pages
Lab7.ipynb - Colaboratory
100% (1)
Lab7.ipynb - Colaboratory
5 pages
Python For Data Science Cheat Sheet: Scikit-Learn Create Your Model Evaluate Your Model's Performance
100% (1)
Python For Data Science Cheat Sheet: Scikit-Learn Create Your Model Evaluate Your Model's Performance
1 page
LLM ML Interview Q
No ratings yet
LLM ML Interview Q
43 pages
DS Capstone Presentation
No ratings yet
DS Capstone Presentation
46 pages
Session 1 Evaluation Model
No ratings yet
Session 1 Evaluation Model
58 pages
Monkeypox Diagnosis With Interpretable Deep Learning
No ratings yet
Monkeypox Diagnosis With Interpretable Deep Learning
16 pages
Unit 5 Classification PDF
No ratings yet
Unit 5 Classification PDF
131 pages
Shivam Intership
100% (1)
Shivam Intership
18 pages
Chapter - 1
No ratings yet
Chapter - 1
56 pages
Evaluation Metrics-ML
No ratings yet
Evaluation Metrics-ML
16 pages
Unit-2 AI Python
No ratings yet
Unit-2 AI Python
57 pages
Se 345
No ratings yet
Se 345
9 pages
ML - LAB - FILE Pankaj
No ratings yet
ML - LAB - FILE Pankaj
13 pages
Mechanical and Electrical Faults Detection in Induction Motor Across Multiple Sensors With CNN-LSTM Deep Learning Model
No ratings yet
Mechanical and Electrical Faults Detection in Induction Motor Across Multiple Sensors With CNN-LSTM Deep Learning Model
11 pages
Military Safety, Weapon Detection
No ratings yet
Military Safety, Weapon Detection
9 pages
SRS Sentiment Analysis Project
No ratings yet
SRS Sentiment Analysis Project
4 pages
ML Question Answer
No ratings yet
ML Question Answer
21 pages
Answer Key Sample Paper 3 AI Class 10
No ratings yet
Answer Key Sample Paper 3 AI Class 10
10 pages
Ransomware Detection Using Machine Learning With EBPF
No ratings yet
Ransomware Detection Using Machine Learning With EBPF
20 pages
Asteroid Hazard Prediction
No ratings yet
Asteroid Hazard Prediction
8 pages
Spam Detection
No ratings yet
Spam Detection
10 pages
Demystifying The Confusion Matrix A Guide To Model Evaluation in Python With OpenCV
No ratings yet
Demystifying The Confusion Matrix A Guide To Model Evaluation in Python With OpenCV
8 pages
Isprs Archives XLVIII 3 2024 401 2024
No ratings yet
Isprs Archives XLVIII 3 2024 401 2024
6 pages
Performance Metrics Deep Dive
No ratings yet
Performance Metrics Deep Dive
7 pages
Confusion Matrix & Box Plot
No ratings yet
Confusion Matrix & Box Plot
5 pages
Detecting and Classifying DNS Tunneling Through Novel Machine Learning Approach
No ratings yet
Detecting and Classifying DNS Tunneling Through Novel Machine Learning Approach
7 pages
RANDOM FOREST (Binary Classification)
No ratings yet
RANDOM FOREST (Binary Classification)
5 pages
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy Classification Algorithms
No ratings yet
Comparative Analysis of Weighted Emphirical Optimization Algorithm and Lazy Classification Algorithms
6 pages
Optimalisasi Klasifikasi Kanker Payudara Menggunakan Forward Selection Pada Naive Bayes
No ratings yet
Optimalisasi Klasifikasi Kanker Payudara Menggunakan Forward Selection Pada Naive Bayes
5 pages
Confusion Matrix: Example Table of Confusion References External Links
No ratings yet
Confusion Matrix: Example Table of Confusion References External Links
3 pages
Sentimental Analysis Using NLP
No ratings yet
Sentimental Analysis Using NLP
1 page
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
From Everand
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
Vladimir Kiselev
No ratings yet
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet

KNN

Uploaded by

KNN

Uploaded by

10/04/2022 07:51 KNN

import matplotlib.pyplot as plt

from sklearn import datasets

# import iris dataset

0 5.1 3.5 1.4 0.2 0.0

1 4.9 3.0 1.4 0.2 0.0

2 4.7 3.2 1.3 0.2 0.0

3 4.6 3.1 1.5 0.2 0.0

4 5.0 3.6 1.4 0.2 0.0

count 150.000000 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.057333 3.758000 1.199333 1.000000

std 0.828066 0.435866 1.765298 0.762238 0.819232

min 4.300000 2.000000 1.000000 0.100000 0.000000

25% 5.100000 2.800000 1.600000 0.300000 0.000000

50% 5.800000 3.000000 4.350000 1.300000 1.000000

75% 6.400000 3.300000 5.100000 1.800000 2.000000

max 7.900000 4.400000 6.900000 2.500000 2.000000

iris_df = pd.DataFrame(iris.data, columns=iris.feature_names)

Sepal_Length Sepal_width Petal_Length Petal_width

0 5.1 3.5 1.4 0.2

1 4.9 3.0 1.4 0.2

2 4.7 3.2 1.3 0.2

3 4.6 3.1 1.5 0.2

4 5.0 3.6 1.4 0.2

... ... ... ... ...

145 6.7 3.0 5.2 2.3

146 6.3 2.5 5.0 1.9

147 6.5 3.0 5.2 2.0

148 6.2 3.4 5.4 2.3

149 5.9 3.0 5.1 1.8

150 rows × 4 columns

NameError: name 'dataset' is not defined

150 rows × 1 columns

#train test split

from sklearn.preprocessing import StandardScaler

#Create KNN Classifier

KNeighborsClassifier(algorithm='auto', leaf_size=30, metric='minkowski',

The final step is to make predictions on our test data.

from sklearn.metrics import classification_report, confusion_matrix

0 1.00 1.00 1.00 11

You might also like