K Nearest Neighbours

This notebook uses K-Nearest Neighbors (KNN) to classify apple quality as good or bad. It loads the 'apple_quality' dataset, drops the ID column, inspects feature distributions and correlations, tunes a KNN classifier with a grid search, and reports classification metrics on the training and test splits.


21bce5695-knn-lab7

March 13, 2024

21BCE5695 M. Ashwin

1 K Nearest Neighbours
1.1 Importing required libraries
[ ]: from sklearn import datasets
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.neighbors import KNeighborsClassifier,KNeighborsRegressor
from sklearn.dummy import DummyClassifier, DummyRegressor
from sklearn.metrics import classification_report, mean_squared_error
from sklearn.preprocessing import StandardScaler, LabelEncoder
from sklearn.decomposition import PCA

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from tqdm import tqdm

1.2 Importing Dataset

[ ]: df = pd.read_csv('apple_quality.csv')

[ ]: print(df.head(2))

   A_id      Size    Weight  Sweetness  Crunchiness  Juiciness  Ripeness  \
0     0 -3.970049 -2.512336   5.346330    -1.012009   1.844900   0.32984
1     1 -1.195217 -2.839257   3.664059     1.588232   0.853286   0.86753

    Acidity Quality
0 -0.491590    good
1 -0.722809    good

Dropping the ID column since it is not relevant to the machine learning model.
[ ]: df.drop(['A_id'], axis=1, inplace=True)

Splitting into input and output data

[ ]: x = df.drop(['Quality'], axis=1)
y = df['Quality']

1.3 Data Analysis

[ ]: plt.figure(figsize=(25,10))
# One histogram per input feature
for (i,v) in enumerate(x.columns):
    plt.subplot(3,df.shape[1],i+1)
    plt.hist(df.iloc[:,i],bins="sqrt")
    plt.title(df.columns[i],fontsize=9)

Encoding the categorical output values into binary values

[ ]: label = []
# Map 'bad' to 0 and every other value ('good') to 1
for i in tqdm(df['Quality']):
    if i=='bad':
        label.append(0)
    else:
        label.append(1)

df['Quality'] = label

100%|██████████| 4000/4000 [00:00<00:00, 945994.70it/s]
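
The same encoding can be written as one vectorized expression, which avoids the Python-level loop and the progress bar. This is a minimal sketch on a toy Series; the `quality` values below are hypothetical stand-ins for the real 'Quality' column, not data from the file.

```python
import pandas as pd

# Toy stand-in for df['Quality'] (hypothetical values, not the real data).
quality = pd.Series(["good", "bad", "good", "bad"], name="Quality")

# Same rule as the loop: 'bad' becomes 0, every other value becomes 1.
encoded = (quality != "bad").astype(int)

print(encoded.tolist())  # prints [1, 0, 1, 0]
```

On the notebook's DataFrame the equivalent assignment would be `df['Quality'] = (df['Quality'] != 'bad').astype(int)`.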

[ ]: dfinfo = pd.DataFrame(df.dtypes,columns=["dtypes"])
# Add the non-null count and the missing-value count for each column
for (m,n) in zip([df.count(),df.isna().sum()],["count","isna"]):
    dfinfo = dfinfo.merge(pd.DataFrame(m,columns=[n]),right_index=True,left_index=True,how="inner")

# DataFrame.append was removed in pandas 2.0; pd.concat stacks the same rows
pd.concat([dfinfo.T, df.describe()])

[ ]:               Size      Weight   Sweetness Crunchiness   Juiciness    Ripeness  \
     dtypes     float64     float64     float64     float64     float64     float64
     count         4000        4000        4000        4000        4000        4000
     isna             0           0           0           0           0           0
     count       4000.0      4000.0      4000.0      4000.0      4000.0      4000.0
     mean     -0.503015   -0.989547   -0.470479    0.985478    0.512118    0.498277
     std       1.928059    1.602507    1.943441    1.402757    1.930286    1.874427
     min      -7.151703   -7.149848   -6.894485   -6.055058   -5.961897   -5.864599
     25%      -1.816765    -2.01177   -1.738425    0.062764   -0.801286   -0.771677
     50%      -0.513703   -0.984736   -0.504758    0.998249    0.534219    0.503445
     75%       0.805526    0.030976    0.801922    1.894234    1.835976    1.766212
     max       6.406367    5.790714    6.374916    7.619852    7.364403    7.237837

                Acidity     Quality
     dtypes     float64       int64
     count         4000        4000
     isna             0           0
     count       4000.0      4000.0
     mean      0.076877       0.501
     std        2.11027    0.500062
     min      -7.010538         0.0
     25%      -1.377424         0.0
     50%       0.022609         1.0
     75%       1.510493         1.0
     max       7.404736         1.0

Correlation matrix
[ ]: df.corr().round(2).style.background_gradient(cmap="viridis")

[ ]: <pandas.io.formats.style.Styler at 0x78992d29c3d0>
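
The Styler object renders only in a live notebook, so the colored matrix does not survive in a text export. A plain-text alternative is to rank each feature by the absolute value of its correlation with the encoded target. The sketch below uses a small synthetic frame; the `toy` data and its two feature columns are assumptions for illustration, not values from 'apple_quality.csv'.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for the apple data: 'Size' drives the label, 'Sweetness' is noise.
rng = np.random.default_rng(0)
toy = pd.DataFrame({
    "Size": rng.normal(size=200),
    "Sweetness": rng.normal(size=200),
})
toy["Quality"] = (toy["Size"] + 0.1 * rng.normal(size=200) > 0).astype(int)

# Correlation of every feature with the target, strongest first.
target_corr = (
    toy.corr()["Quality"]
    .drop("Quality")
    .sort_values(key=lambda s: s.abs(), ascending=False)
)
print(target_corr.round(2))
```

On the notebook's data the same chain would start from `df.corr()["Quality"]` once 'Quality' has been encoded as 0/1.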

[ ]: print(df.head(3))

       Size    Weight  Sweetness  Crunchiness  Juiciness  Ripeness   Acidity  \
0 -3.970049 -2.512336   5.346330    -1.012009   1.844900  0.329840 -0.491590
1 -1.195217 -2.839257   3.664059     1.588232   0.853286  0.867530 -0.722809
2 -0.292024 -1.351282  -1.738429    -0.342616   2.838636 -0.038033  2.621636

   Quality
0        1
1        1
2        0

1.4 Model building and testing

Splitting data into training and testing
[ ]: x_train,x_test,y_train,y_test = train_test_split(x,y,test_size=0.3,stratify=y,random_state=30)
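
Passing `stratify=y` keeps the class proportions the same in both splits, which is why the support counts in the reports further down (1397/1403 for training, 599/601 for testing) stay close to 50/50. The following sketch checks this on synthetic data; the array `X` and the balanced label vector are assumptions made for illustration, not the apple dataset.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic stand-in: 1000 samples, 7 features, perfectly balanced labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 7))
y = np.array(["bad", "good"] * 500)

# Same split settings as the notebook cell.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=30
)

# Share of 'good' labels in each split; stratification keeps them equal.
train_good = np.mean(y_train == "good")
test_good = np.mean(y_test == "good")
print(len(y_train), len(y_test), train_good, test_good)
```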

[ ]: model = KNeighborsClassifier(algorithm="auto");
parameters = {"n_neighbors":[1,3,5],
"weights":["uniform","distance"]}
model_optim = GridSearchCV(model, parameters, cv=5,scoring="accuracy");

Training the model

[ ]: model_optim.fit(x_train,y_train)

[ ]: GridSearchCV(cv=5, estimator=KNeighborsClassifier(),
param_grid={'n_neighbors': [1, 3, 5],
'weights': ['uniform', 'distance']},
scoring='accuracy')

[ ]: model_optim.best_estimator_

[ ]: KNeighborsClassifier(weights='distance')
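
The notebook imports `StandardScaler` but never applies it. Because KNN relies on distances, a common variant is to put the scaler and the classifier in one `Pipeline` so that scaling is refit inside every cross-validation fold. The sketch below shows that variant on synthetic data from `make_classification`; the dataset, the step names `scale` and `knn`, and the variable names are assumptions, and this is not the configuration the notebook actually ran. The apple features already have similar spreads, so scaling may change little there.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the seven apple features (not the real data).
X, y = make_classification(
    n_samples=400, n_features=7, n_informative=5, n_redundant=0, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=30
)

# Scaling happens inside the pipeline, so each CV fold fits its own scaler.
pipe = Pipeline([("scale", StandardScaler()), ("knn", KNeighborsClassifier())])

# Same grid as the notebook, with the step-name prefix a Pipeline requires.
param_grid = {
    "knn__n_neighbors": [1, 3, 5],
    "knn__weights": ["uniform", "distance"],
}
search = GridSearchCV(pipe, param_grid, cv=5, scoring="accuracy")
search.fit(X_train, y_train)

print(search.best_params_)           # chosen hyperparameters
print(round(search.best_score_, 3))  # mean cross-validated accuracy
```

`best_score_` is the mean accuracy over the five validation folds, which is usually a more honest estimate than accuracy measured on the training data itself.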

Model metrics
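[ ]: # Loop variables are named so they do not overwrite the x and y defined earlier
for (name,x_part,y_part) in zip(["Train","Test"],[x_train,x_test],[y_train,y_test]):
    print("Classification kNN",name," report:\n",classification_report(y_part,model_optim.predict(x_part)))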

Classification kNN Train report:
              precision    recall  f1-score   support

         bad       1.00      1.00      1.00      1397
        good       1.00      1.00      1.00      1403

    accuracy                           1.00      2800
   macro avg       1.00      1.00      1.00      2800
weighted avg       1.00      1.00      1.00      2800

Classification kNN Test report:
              precision    recall  f1-score   support

         bad       0.91      0.90      0.91       599
        good       0.90      0.91      0.91       601

    accuracy                           0.91      1200
   macro avg       0.91      0.91      0.91      1200
weighted avg       0.91      0.91      0.91      1200
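
The perfect training scores are largely a side effect of the selected `weights='distance'` setting: when the model predicts a point it was trained on, that point is its own nearest neighbour at distance zero, so its own label decides the vote. The sketch below illustrates this on synthetic data with purely random labels (an assumption chosen to make the effect obvious; it is not the apple dataset), where nothing can genuinely be learned.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Random features and random labels: there is no real signal to learn.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 7))
y = rng.integers(0, 2, size=200)

# Score each weighting scheme on the same data it was fitted on.
scores = {}
for weights in ["uniform", "distance"]:
    model = KNeighborsClassifier(n_neighbors=5, weights=weights).fit(X, y)
    scores[weights] = model.score(X, y)

print(scores)
```

With distance weighting the training accuracy is 1.0 even on noise, so the test report is the figure that reflects how well the classifier generalizes.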
