Lecture-11-K Nearest Neighbors-Part2 - Jupyter Notebook

This document is a Jupyter Notebook detailing the implementation of the K Nearest Neighbors (KNN) algorithm using Python. It covers data import, standardization, train-test splitting, model training, and evaluation, including the use of confusion matrices and classification reports. The notebook also discusses selecting an optimal K value using the elbow method.


Lecture 11-K Nearest Neighbour-Part 2

K Nearest Neighbors with Python

Import Libraries
In [10]: 1 import pandas as pd
2 import seaborn as sns
3 import matplotlib.pyplot as plt
4 import numpy as np
5 %matplotlib inline

Get the Data


In [11]: 1 df = pd.read_csv('Downloads/KNN_Project_Data')

In [12]: 1 df.head()

Out[12]:
          XVPM         GWYH         TRAT        TLLZ         IGGA         HYKR         EDFS  ...

0  1636.670614   817.988525  2565.995189  358.347163   550.417491  1618.870897  2147.641254  ...

1  1013.402760   577.587332  2644.141273  280.428203  1161.873391  2084.107872   853.404981  ...

2  1300.035501   820.518697  2025.854469  525.562292   922.206261  2552.355407   818.676686  ...

3  1059.347542  1066.866418   612.000041  480.827789   419.467495   685.666983   852.867810  ...

4  1018.340526  1313.679056   950.622661  724.742174   843.065903  1370.554164   905.469453  ...

Standardize the Variables


In [4]: 1 from sklearn.preprocessing import StandardScaler

In [5]: 1 scaler = StandardScaler()

In [13]: 1 scaler.fit(df.drop('TARGET CLASS',axis=1))

Out[13]: StandardScaler()


In [16]: 1 scaled_features = scaler.transform(df.drop('TARGET CLASS',axis=1))

In [20]: 1 df_feat = pd.DataFrame(scaled_features,columns=df.columns[:-1])
2 df_feat.head()

Out[20]:
XVPM GWYH TRAT TLLZ IGGA HYKR EDFS GUUB MGJM

0 1.568522 -0.443435 1.619808 -0.958255 -1.128481 0.138336 0.980493 -0.932794 1.008313

1 -0.112376 -1.056574 1.741918 -1.504220 0.640009 1.081552 -1.182663 -0.461864 0.258321

2 0.660647 -0.436981 0.775793 0.213394 -0.053171 2.030872 -1.240707 1.149298 2.184784

3 0.011533 0.191324 -1.433473 -0.100053 -1.507223 -1.753632 -1.183561 -0.888557 0.162310

4 -0.099059 0.820815 -0.904346 1.609015 -0.282065 -0.365099 -1.095644 0.391419 -1.365603
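
As a side note, the fit and transform steps above can be combined; a minimal sketch (not part of the original notebook), assuming the same DataFrame df:

    # Equivalent one-liner: fit the scaler and transform the features in one call
    scaled_features = StandardScaler().fit_transform(df.drop('TARGET CLASS', axis=1))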

Train Test Split


In [21]: 1 from sklearn.model_selection import train_test_split

In [22]: 1 X_train, X_test, y_train, y_test = train_test_split(scaled_features,df['TARGET CLASS'],
2                                                      test_size=0.30)
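
Note that train_test_split shuffles the rows randomly, so the exact numbers shown below will vary from run to run. A minimal sketch (not part of the original notebook) of a reproducible split, using a hypothetical seed value of 101:

    X_train, X_test, y_train, y_test = train_test_split(
        scaled_features, df['TARGET CLASS'], test_size=0.30, random_state=101)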

Using KNN
Remember that we are trying to come up with a model that predicts whether an observation falls into the TARGET CLASS (1) or not (0). We'll start with k=1.
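
For intuition, here is a minimal sketch (not part of the original notebook) of what KNeighborsClassifier does for a single query point, assuming plain Euclidean distance and a simple majority vote among the k nearest training points:

    def knn_predict_one(X_train, y_train, query, k=1):
        # Distance from the query point to every training point
        distances = np.linalg.norm(X_train - query, axis=1)
        # Indices of the k closest training points
        nearest = np.argsort(distances)[:k]
        # Majority vote among the labels of those k neighbours
        labels, counts = np.unique(np.asarray(y_train)[nearest], return_counts=True)
        return labels[np.argmax(counts)]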

In [23]: 1 from sklearn.neighbors import KNeighborsClassifier

In [24]: 1 knn = KNeighborsClassifier(n_neighbors=1)

In [25]: 1 knn.fit(X_train,y_train)

Out[25]: KNeighborsClassifier(n_neighbors=1)

In [26]: 1 pred = knn.predict(X_test)

Predictions and Evaluations


Let's evaluate our KNN model!

In [27]: 1 from sklearn.metrics import classification_report,confusion_matrix


In [28]: 1 print(confusion_matrix(y_test,pred))

[[109 45]
[ 33 113]]

In [29]: 1 print(classification_report(y_test,pred))

              precision    recall  f1-score   support

           0       0.77      0.71      0.74       154
           1       0.72      0.77      0.74       146

    accuracy                           0.74       300
   macro avg       0.74      0.74      0.74       300
weighted avg       0.74      0.74      0.74       300
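
As a sanity check (not part of the original notebook), the 0.74 accuracy reported above is simply the fraction of correct predictions, which can be read straight off the confusion matrix:

    # Unpack the 2x2 confusion matrix and recompute accuracy by hand
    tn, fp, fn, tp = confusion_matrix(y_test, pred).ravel()
    print((tn + tp) / (tn + fp + fn + tp))   # (109 + 113) / 300 ≈ 0.74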

Choosing a K Value
Let's go ahead and use the elbow method to pick a good K Value:

In [30]: 1 error_rate = []
2 
3 # Will take some time
4 for i in range(1,40):
5 
6     knn = KNeighborsClassifier(n_neighbors=i)
7     knn.fit(X_train,y_train)
8     pred_i = knn.predict(X_test)
9     error_rate.append(np.mean(pred_i != y_test))


In [31]: 1 plt.figure(figsize=(10,6))
2 plt.plot(range(1,40),error_rate,color='blue', linestyle='dashed', marker='o',
3          markerfacecolor='red', markersize=10)
4 plt.title('Error Rate vs. K Value')
5 plt.xlabel('K')
6 plt.ylabel('Error Rate')

Out[31]: Text(0, 0.5, 'Error Rate')
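
A possible follow-up (not part of the original notebook): besides reading the "elbow" off the plot, the K with the lowest error rate on this particular test split can be picked programmatically:

    # K values start at 1, so shift the argmin index by one
    best_k = int(np.argmin(error_rate)) + 1
    print(best_k, error_rate[best_k - 1])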


In [32]: 1 # FIRST A QUICK COMPARISON TO OUR ORIGINAL K=1
2 knn = KNeighborsClassifier(n_neighbors=1)
3 
4 knn.fit(X_train,y_train)
5 pred = knn.predict(X_test)
6 
7 print('WITH K=1')
8 print('\n')
9 print(confusion_matrix(y_test,pred))
10 print('\n')
11 print(classification_report(y_test,pred))

WITH K=1

[[109 45]
[ 33 113]]

              precision    recall  f1-score   support

           0       0.77      0.71      0.74       154
           1       0.72      0.77      0.74       146

    accuracy                           0.74       300
   macro avg       0.74      0.74      0.74       300
weighted avg       0.74      0.74      0.74       300


In [35]: 1 # NOW WITH K=23
2 knn = KNeighborsClassifier(n_neighbors=23)
3 
4 knn.fit(X_train,y_train)
5 pred = knn.predict(X_test)
6 
7 print('WITH K=23')
8 print('\n')
9 print(confusion_matrix(y_test,pred))
10 print('\n')
11 print(classification_report(y_test,pred))

WITH K=23

[[114 40]
[ 20 126]]

              precision    recall  f1-score   support

           0       0.85      0.74      0.79       154
           1       0.76      0.86      0.81       146

    accuracy                           0.80       300
   macro avg       0.80      0.80      0.80       300
weighted avg       0.81      0.80      0.80       300
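
One caveat: the elbow method above scores every K on the same single train/test split, so the chosen K depends on that split. A minimal sketch (not part of the original notebook) of a more robust alternative, assuming scikit-learn's cross_val_score with 5 folds over the scaled features:

    from sklearn.model_selection import cross_val_score

    cv_scores = []
    for k in range(1, 40):
        knn = KNeighborsClassifier(n_neighbors=k)
        # Mean accuracy across 5 cross-validation folds of the full dataset
        scores = cross_val_score(knn, scaled_features, df['TARGET CLASS'], cv=5)
        cv_scores.append(scores.mean())

    # Highest mean accuracy corresponds to the lowest cross-validated error
    best_k = int(np.argmax(cv_scores)) + 1
    print(best_k, cv_scores[best_k - 1])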

