
Introduction to KNN

KNN, which stands for K Nearest Neighbors, is a supervised machine learning algorithm that
classifies a new data point into a target class based on the features of its neighboring
data points. To understand how the KNN algorithm works, consider the following scenario.

Figure 1 Two classes with one unknown object


In the above image, we have two classes of data, namely Class A (squares) and Class B
(triangles). The problem is to assign the new input data point to one of these two classes
using the KNN algorithm. The first step in the KNN algorithm is to define the value of K.
But what does the K in KNN stand for? K stands for the number of nearest neighbors, and
hence the name K Nearest Neighbors (KNN).

Figure 2 K=3 (Three Nearest Neighbors)


In the above figure we defined the value of K as 3. This means that the algorithm will
consider the three neighbors that are closest to the new data point in order to decide its
class. Closeness between data points is calculated using measures such as Euclidean or
Manhattan distance. At K = 3, the neighbors include two squares and one triangle, so the
new data point would be assigned to Class A (squares).

Figure 3 K=7 (Seven Nearest Neighbors)


But what if the value of K is set to 7? Then the algorithm looks for the seven nearest
neighbors and classifies the new data point into the class it is most similar to. At
K = 7, the neighbors include three squares and four triangles, so the new data point
would be assigned to Class B (triangles), since the majority of its neighbors belong to
Class B.

Figure 4 Unknown object assigned to the majority class, Class B
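
The voting step itself is simple to express in code. Below is a minimal Python sketch of how the majority vote flips as K grows; the neighbor labels and their ordering are invented to match Figures 2-4 ('A' = square, 'B' = triangle):

```python
from collections import Counter

# Labels of the neighbors, sorted from nearest to farthest.
# This ordering is invented purely to mirror Figures 2-4.
neighbors_by_distance = ['A', 'A', 'B', 'B', 'B', 'A', 'B']

for k in (3, 7):
    votes = Counter(neighbors_by_distance[:k])  # count classes among the K nearest
    predicted = votes.most_common(1)[0][0]      # class with the most votes
    print(f"K={k}: votes={dict(votes)} -> class {predicted}")

# K=3: votes={'A': 2, 'B': 1} -> class A
# K=7: votes={'A': 3, 'B': 4} -> class B
```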


Features of KNN Algorithm
The KNN algorithm has the following features:
1. KNN is a supervised learning algorithm: it uses a labeled input data set to predict
the output for new data points.
2. It is one of the simplest machine learning algorithms and can be easily implemented
for a wide variety of problems.
3. It is based mainly on feature similarity: KNN checks how similar a data point is to
its neighbors and classifies it into the class it is most similar to.
4. KNN is a non-parametric model, which means that it makes no assumptions about the
underlying data distribution. This helps it handle realistic data.
5. It memorizes the training data set instead of learning a discriminative function from
the training data.
6. KNN can be used for solving both classification and regression problems (a short
library example follows Figure 5).

Figure 5 Features of the KNN algorithm: supervised learning, based on feature similarity, simple machine learning algorithm, non-parametric, memorizes the training data set, used for classification and regression
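
Because KNN handles both classification and regression, libraries such as scikit-learn expose it as two separate estimators. A minimal sketch, assuming scikit-learn is installed; the toy data points and labels here are invented purely for illustration:

```python
from sklearn.neighbors import KNeighborsClassifier, KNeighborsRegressor

# Toy labeled data: two features per point, two classes.
X = [[1, 1], [1, 2], [2, 1], [6, 6], [6, 7], [7, 6]]
y = ['A', 'A', 'A', 'B', 'B', 'B']

clf = KNeighborsClassifier(n_neighbors=3)  # K = 3
clf.fit(X, y)                              # "training" just stores the data
print(clf.predict([[2, 2], [6, 5]]))       # -> ['A' 'B']

# The same idea works for regression: the prediction is the average
# of the K nearest targets instead of a majority vote.
reg = KNeighborsRegressor(n_neighbors=2)
reg.fit(X, [1.0, 1.0, 1.0, 9.0, 9.0, 9.0])
print(reg.predict([[2, 2]]))               # -> [1.]
```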


Outline of Proposed Approach

1. Start.
2. Compute the distance between the given unknown object and all other objects.
3. Initialize the value of K.
4. Select the K nearest neighbors.
5. Apply majority voting over the K nearest neighbors.
6. Decide the class of the unknown object.
7. End.

Figure 6 Outline of KNN Algorithm


KNN Algorithm Pseudo Code
1. Calculate D(x, xᵢ) for i = 1, 2, ..., n, where D denotes the Euclidean distance
between the points.
2. Arrange the n calculated Euclidean distances in non-decreasing order.
3. Let k be a positive integer; take the first k distances from this sorted list.
4. Find the k points corresponding to these k distances.
5. Let kᵢ denote the number of points belonging to the i-th class among the k nearest
points (kᵢ ≥ 0).
6. If kᵢ > kⱼ for all j ≠ i, then put x in class i.
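
The pseudo code translates almost line for line into Python. A minimal sketch, using Euclidean distance and ignoring the tie case (which the worked example below runs into); the tiny 2-D data set at the bottom is invented for illustration:

```python
import math
from collections import Counter

def knn_classify(train, labels, x, k):
    """Classify point x from labeled training points, following the pseudo code."""
    # Step 1: Euclidean distance from x to every training point xi.
    dists = [math.dist(x, xi) for xi in train]
    # Step 2: sort indices by distance (non-decreasing).
    order = sorted(range(len(train)), key=lambda i: dists[i])
    # Steps 3-4: take the labels of the k nearest points.
    nearest_labels = [labels[i] for i in order[:k]]
    # Steps 5-6: count class membership and return the majority class.
    return Counter(nearest_labels).most_common(1)[0][0]

# Tiny invented example: two clusters in 2-D space.
train = [(1, 1), (2, 1), (8, 8), (9, 8)]
labels = ['A', 'A', 'B', 'B']
print(knn_classify(train, labels, (2, 2), k=3))  # -> 'A'
```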
Illustrate with Example
Table 1 Simple training data set with 12 records

S.No.  Customer Name  Age  Loan     Class (Default)
1      John           25   40000    N
2      Smith          35   60000    N
3      Pat            40   62000    Y
4      Alex           45   80000    N
5      Jade           20   20000    N
6      Jim            48   220000   Y
7      Jack           33   150000   Y
8      Kate           35   120000   N
9      Mark           52   18000    N
10     Anil           23   95000    Y
11     George         60   100000   N
12     Andrew         48   142000   ?

We need to predict Andrew's default status.

Euclidean distance: The Euclidean distance between any two instances is the
length of the line segment connecting them. In this example, each record has 2
numeric attributes (Age and Loan), so each record is a point in 2-dimensional
space. In general, if x = (x₁, x₂, ..., xₙ) and y = (y₁, y₂, ..., yₙ) are two
points, then the distance from x to y is given by:

Euclidean distance = √( Σᵢ₌₁ⁿ (xᵢ − yᵢ)² )

The first step is to calculate the Euclidean distance.

Dist(John (X₁, Y₁), Andrew (X₂, Y₂))
= √((X₁ − X₂)² + (Y₁ − Y₂)²)
= √((25 − 48)² + (40000 − 142000)²)

Dist(John, Andrew) ≈ 102000

Similarly, we need to calculate the distance for all other objects.
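
The distance column of Table 2 below can be reproduced in a few lines of Python. A minimal sketch, with the records typed in from Table 1 (Andrew's loan is 142000):

```python
import math

# (name, age, loan, default) records from Table 1.
records = [
    ("John", 25, 40000, "N"),  ("Smith", 35, 60000, "N"),
    ("Pat", 40, 62000, "Y"),   ("Alex", 45, 80000, "N"),
    ("Jade", 20, 20000, "N"),  ("Jim", 48, 220000, "Y"),
    ("Jack", 33, 150000, "Y"), ("Kate", 35, 120000, "N"),
    ("Mark", 52, 18000, "N"),  ("Anil", 23, 95000, "Y"),
    ("George", 60, 100000, "N"),
]
andrew = (48, 142000)  # (age, loan) of the unknown object

for name, age, loan, default in records:
    d = math.dist((age, loan), andrew)   # Euclidean distance to Andrew
    print(f"{name:8s} {default}  {d:>10.0f}")

# Jack is nearest (~8000), then Kate (~22000), George (~42000), ...
```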
Table 2 Euclidean distance of each object from the unknown object (Andrew)

S.No.  Customer Name  Age  Loan     Class (Default)  Distance
1      John           25   40000    N                102000
2      Smith          35   60000    N                82000
3      Pat            40   62000    Y                80000
4      Alex           45   80000    N                62000
5      Jade           20   20000    N                122000
6      Jim            48   220000   Y                78000
7      Jack           33   150000   Y                8000
8      Kate           35   120000   N                22000
9      Mark           52   18000    N                124000
10     Anil           23   95000    Y                47000
11     George         60   100000   N                42000
12     Andrew         48   142000   ?                -

Table 3 Five nearest neighbors with minimum distance

S.No.  Customer Name  Age  Loan     Class (Default)  Distance  Rank (nearest first)
1      John           25   40000    N                102000
2      Smith          35   60000    N                82000
3      Pat            40   62000    Y                80000
4      Alex           45   80000    N                62000     5
5      Jade           20   20000    N                122000
6      Jim            48   220000   Y                78000
7      Jack           33   150000   Y                8000      1
8      Kate           35   120000   N                22000     2
9      Mark           52   18000    N                124000
10     Anil           23   95000    Y                47000     4
11     George         60   100000   N                42000     3
12     Andrew         48   142000   ?                -
Let K = 2
With K = 2, there is one Default = Y (Jack, 33, 150000, Y) and one Default = N (Kate,
35, 120000, N). The vote is tied, so we cannot decide the class for Andrew. (Ties like
this are why an odd K is usually preferred for two-class problems.)

Let K = 3
With K = 3, there is one Default = Y (Jack, 33, 150000, Y) and two Default = N (Kate,
35, 120000, N) and (George, 60, 100000, N). Out of the three closest objects, only one
is Y and two are N, so we can say the default class for Andrew is N.

Let K = 4
With K = 4, there are two Default = Y (Jack, 33, 150000, Y) and (Anil, 23, 95000, Y)
and two Default = N (Kate, 35, 120000, N) and (George, 60, 100000, N). Out of the four
closest objects, two are Y and two are N, so again we cannot decide the class for Andrew.

Let K = 5
With K = 5, there are two Default = Y (Jack, 33, 150000, Y) and (Anil, 23, 95000, Y)
and three Default = N (Kate, 35, 120000, N), (George, 60, 100000, N) and (Alex, 45,
80000, N). Out of the five closest objects, two are Y and three are N, so we can say
the default class for Andrew is N.
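
Running the walk-through above in code makes the ties at K = 2 and K = 4 explicit. A self-contained Python sketch over the Table 1 data (the tie check assumes exactly two classes, as in this example):

```python
import math
from collections import Counter

# Table 1 records as (name, age, loan, default); Andrew is the query point.
records = [
    ("John", 25, 40000, "N"),  ("Smith", 35, 60000, "N"),
    ("Pat", 40, 62000, "Y"),   ("Alex", 45, 80000, "N"),
    ("Jade", 20, 20000, "N"),  ("Jim", 48, 220000, "Y"),
    ("Jack", 33, 150000, "Y"), ("Kate", 35, 120000, "N"),
    ("Mark", 52, 18000, "N"),  ("Anil", 23, 95000, "Y"),
    ("George", 60, 100000, "N"),
]
andrew = (48, 142000)

# Sort every record by its Euclidean distance to Andrew (as in Table 2).
by_dist = sorted(records, key=lambda r: math.dist((r[1], r[2]), andrew))

for k in (2, 3, 4, 5):
    votes = Counter(r[3] for r in by_dist[:k])
    if len(votes) > 1 and len(set(votes.values())) == 1:
        print(f"K={k}: {dict(votes)} -> tie, cannot decide")
    else:
        print(f"K={k}: {dict(votes)} -> class {votes.most_common(1)[0][0]}")

# K=2 and K=4 are ties; K=3 and K=5 both predict class N for Andrew.
```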
Pros of KNN
1. Simple to implement.
2. Flexible in the choice of features and distance measure.
3. Naturally handles multi-class cases.
4. Can perform well in practice given enough representative data.
Cons of KNN
1. The value of the parameter K (the number of nearest neighbors) must be chosen.
2. Computation cost is quite high, because the distance from each query instance to
all training samples must be computed.
3. The entire training data set must be stored.
4. A meaningful distance function is required.
