
Absolutely!

Let's now dive deep into the K Nearest Neighbors (KNN) algorithm: an intuitive explanation, a step-by-step worked example, visuals, advantages and disadvantages, when to (and when not to) use it, Python code with output, and more!

👣 K Nearest Neighbors (KNN) — In-Depth, Beginner-Friendly Guide

📘 What is KNN?
K-Nearest Neighbors (KNN) is a supervised machine learning algorithm used for classification and
regression. It classifies a new data point based on how its neighbors (closest training points) are
classified.
It’s based on the principle:

“Birds of a feather flock together.”

🧠 How Does KNN Work?


1. Choose a value for K (number of neighbors).
2. Calculate distance (e.g., Euclidean) between the test point and all training data.
3. Sort the distances and find the K nearest neighbors.
4. Majority voting (for classification) or average value (for regression).
5. Assign the class or value accordingly (the sketch below shows these steps in code).
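To make these five steps concrete, here is a minimal from-scratch sketch in plain Python. The function name and structure are illustrative only; the scikit-learn example later in this guide does the same job with far less code.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Step 2: Euclidean distance from the query to every training point
    distances = [(math.dist(query, point), label)
                 for point, label in zip(train_X, train_y)]
    # Step 3: sort by distance and keep the k closest neighbors
    neighbors = sorted(distances)[:k]
    # Steps 4-5: majority vote among those neighbors decides the class
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]
```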

✅ Why Use KNN?


Easy to understand and implement
Non-parametric (makes no assumptions about data)
Works well for low-dimensional data

🧮 Step-by-Step: KNN Example


Dataset
Let’s take a toy example of fruits:
| Weight | Size | Fruit  |
|--------|------|--------|
| 150    | 7.0  | Apple  |
| 170    | 7.5  | Apple  |
| 140    | 6.5  | Apple  |
| 130    | 6.0  | Orange |
| 120    | 5.5  | Orange |
| 110    | 5.0  | Orange |

Predict the fruit for a test input: Weight = 135, Size = 6.4

Step 1: Choose K = 3

Step 2: Compute the distance from each training point to the test input


Use Euclidean distance:

$d = \sqrt{(x_1 - x_2)^2 + (y_1 - y_2)^2}$

Compute distance to all 6 training points.
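Working these out for the test point (135, 6.4):

To (150, 7.0): √(15² + 0.6²) ≈ 15.01
To (170, 7.5): √(35² + 1.1²) ≈ 35.02
To (140, 6.5): √(5² + 0.1²) ≈ 5.00
To (130, 6.0): √(5² + 0.4²) ≈ 5.02
To (120, 5.5): √(15² + 0.9²) ≈ 15.03
To (110, 5.0): √(25² + 1.4²) ≈ 25.04

Notice that the weight difference dominates every distance because weight is on a much larger scale than size; this is exactly why feature scaling matters for KNN (see the tips below).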

Step 3: Select the 3 closest neighbors


Based on the distances above, the three nearest neighbors are:
(140, 6.5) → Apple
(130, 6.0) → Orange
(150, 7.0) → Apple

Step 4: Voting
2 Apple 🆚 1 Orange → Predict: Apple
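As a quick check (not part of the original walkthrough), the same toy dataset can be fed to scikit-learn's KNeighborsClassifier, which reproduces this result:

```python
from sklearn.neighbors import KNeighborsClassifier

# Toy fruit dataset from the table above: features are [weight, size]
X = [[150, 7.0], [170, 7.5], [140, 6.5], [130, 6.0], [120, 5.5], [110, 5.0]]
y = ["Apple", "Apple", "Apple", "Orange", "Orange", "Orange"]

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X, y)
print(knn.predict([[135, 6.4]]))   # ['Apple']
```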

🔧 Python Example using sklearn

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import classification_report, confusion_matrix

# Load dataset
iris = load_iris()
X, y = iris.data, iris.target

# Split dataset
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# KNN classifier with K=3
knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X_train, y_train)

# Predict
y_pred = knn.predict(X_test)

# Evaluation
print("Classification Report:\n", classification_report(y_test, y_pred))
print("Confusion Matrix:\n", confusion_matrix(y_test, y_pred))
```
🖨️ Output:
```
Classification Report:
               precision    recall  f1-score   support

           0       1.00      1.00      1.00        16
           1       1.00      0.89      0.94         9
           2       0.91      1.00      0.95        11

    accuracy                           0.97        36
   macro avg       0.97      0.96      0.96        36
weighted avg       0.97      0.97      0.97        36

Confusion Matrix:
 [[16  0  0]
  [ 0  8  1]
  [ 0  0 11]]
```

📊 Visualizing KNN
```python
import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd

# Create a dataframe for visualization
df = pd.DataFrame(iris.data, columns=iris.feature_names)
df['target'] = iris.target

# Plot 2 features
sns.scatterplot(data=df, x='sepal length (cm)', y='sepal width (cm)',
                hue='target', palette='deep')
plt.title('Iris Dataset - Sepal Length vs Width')
plt.show()
```
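The scatter plot above shows only the raw data. As an optional extension (reusing iris, KNeighborsClassifier, and plt from the earlier snippets), a mesh grid of points can be classified one by one to shade the KNN decision regions on two features:

```python
import numpy as np

# Train KNN on just two features so the decision regions can be drawn in 2D
X2 = iris.data[:, :2]                      # sepal length, sepal width
knn2 = KNeighborsClassifier(n_neighbors=3).fit(X2, iris.target)

# Build a grid covering the feature space and classify every grid point
xx, yy = np.meshgrid(np.linspace(X2[:, 0].min() - 1, X2[:, 0].max() + 1, 200),
                     np.linspace(X2[:, 1].min() - 1, X2[:, 1].max() + 1, 200))
Z = knn2.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)

plt.contourf(xx, yy, Z, alpha=0.3, cmap='Pastel1')   # shaded decision regions
plt.scatter(X2[:, 0], X2[:, 1], c=iris.target, edgecolor='k')
plt.xlabel('sepal length (cm)')
plt.ylabel('sepal width (cm)')
plt.title('KNN (K=3) decision regions on two iris features')
plt.show()
```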

✅ Advantages of KNN
| Pros | Description |
|------|-------------|
| 🧠 Simple | Very easy to implement |
| 🪞 No Training | No model built ahead of time |
| 🧭 Non-Parametric | No assumptions about data |
| 🔍 Adaptable | Works for classification and regression |


⚠️ Disadvantages
| Cons | Description |
|------|-------------|
| 🧮 Slow on Large Datasets | Every prediction computes distance to all training points |
| ❄️ Sensitive to Noise | Outliers can distort predictions |
| 📊 Requires Feature Scaling | Distance metrics require normalization (e.g., MinMax) |
| 💡 Curse of Dimensionality | Doesn't work well in high-dimensional spaces |

🧠 When to Use / Not Use


| Use KNN When... | Avoid KNN When... |
|-----------------|-------------------|
| You have a small to medium dataset | Your data has many irrelevant features |
| The decision boundary is nonlinear | You care about runtime efficiency |
| Data is clean and not high-dimensional | The dataset is large or sparse |

💡 How It Handles High-Dimensional Data


Poorly. As dimensions increase:
Distances between points become less meaningful
All points start looking equally distant
Model performance degrades

📌 Use dimensionality reduction (e.g., PCA, LDA) before KNN!
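As an illustration of that tip, here is a hedged sketch using scikit-learn's pipeline utilities, reusing X_train, X_test, y_train, y_test from the iris example above (reducing to 2 components is an arbitrary choice for demonstration):

```python
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier

# Scale the features, project onto 2 principal components, then run KNN
pca_knn = make_pipeline(StandardScaler(),
                        PCA(n_components=2),
                        KNeighborsClassifier(n_neighbors=3))
pca_knn.fit(X_train, y_train)
print("Accuracy with PCA + KNN:", pca_knn.score(X_test, y_test))
```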

🧮 Complexity
| Aspect | Complexity |
|--------|------------|
| Training | O(1) (no training!) |
| Prediction | O(n · d), where n = training samples, d = features |
| Space | O(n · d) |

⚙️ Tips for Using KNN


Always scale your features using MinMaxScaler or StandardScaler
Use GridSearchCV to choose the best value of K (see the sketch after this list)
If class distribution is imbalanced, use distance-weighted voting
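A minimal sketch of the GridSearchCV tip, again reusing the iris split from above (the search range of 1 to 15 neighbors is just an example):

```python
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier

# Scale features, then search over K and the voting scheme with 5-fold CV
pipe = Pipeline([("scale", StandardScaler()), ("knn", KNeighborsClassifier())])
param_grid = {"knn__n_neighbors": list(range(1, 16)),
              "knn__weights": ["uniform", "distance"]}
search = GridSearchCV(pipe, param_grid, cv=5)
search.fit(X_train, y_train)
print("Best parameters:", search.best_params_)
print("Cross-validated accuracy:", round(search.best_score_, 3))
```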

🧪 Try Distance Weighting in sklearn

```python
KNeighborsClassifier(n_neighbors=5, weights='distance')
```
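With weights='distance', closer neighbors count more heavily in the vote than farther ones, which is the distance-weighted voting suggested in the tips above for imbalanced classes.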

Would you like the visual explanation of KNN (with 2D plots) or how to tune K using cross-validation next?
Or do you want to continue to Naive Bayes or SVM in the same style?
