05 K-Nearest Neighbors
https://machinelearningmastery.com/tutorial-to-implement-k-nearest-neighbors-in-python-from-scratch/
K-Nearest Neighbors
• The model for kNN is the entire training dataset.
• When a prediction is required for an unseen data instance, the kNN
algorithm will search through the training dataset for the k-most
similar instances.
• The prediction attribute of the most similar instances is summarized
and returned as the prediction for the unseen instance.
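A minimal sketch of the neighbor search described above, assuming numeric input attributes with the class label in the last column; the helper names (euclidean_distance, get_neighbors) are illustrative, not taken verbatim from the linked tutorial.

from math import sqrt

def euclidean_distance(row1, row2):
    # Similarity over the input attributes; the last column is assumed to be the label.
    return sqrt(sum((a - b) ** 2 for a, b in zip(row1[:-1], row2[:-1])))

def get_neighbors(train, test_row, k):
    # The "model" is just the training data: rank every row by its distance
    # to the unseen instance and keep the k most similar ones.
    return sorted(train, key=lambda row: euclidean_distance(row, test_row))[:k]

# Tiny worked example with two classes (label in the last column).
train = [[1.0, 1.1, 0], [1.2, 0.9, 0], [3.0, 3.2, 1], [3.1, 2.9, 1]]
print(get_neighbors(train, [1.1, 1.0, None], k=3))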
K-Nearest Neighbors
• The similarity measure depends on the type of data.
• For real-valued data, the Euclidean distance can be used.
• For other types of data, such as categorical or binary data, the
Hamming distance can be used.
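A small sketch of the two similarity measures mentioned above; the function names are illustrative.

from math import sqrt

def euclidean_distance(a, b):
    # Suitable for real-valued attributes.
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def hamming_distance(a, b):
    # Suitable for categorical or binary attributes: the number of positions that differ.
    return sum(x != y for x, y in zip(a, b))

print(euclidean_distance([2.0, 4.0], [5.0, 8.0]))                   # 5.0
print(hamming_distance(["red", "small", 1], ["red", "large", 0]))   # 2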
K-Nearest Neighbors
• In the case of regression problems, the average of the prediction
attribute may be returned.
• In the case of classification, the most prevalent class may be
returned.
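A sketch of how the neighbors' prediction attribute might be summarized in each case, assuming the neighbors' target values have already been collected into a list; the function names are illustrative.

def summarize_regression(neighbor_targets):
    # Regression: return the average of the neighbors' values.
    return sum(neighbor_targets) / len(neighbor_targets)

def summarize_classification(neighbor_targets):
    # Classification: return the most prevalent class among the neighbors.
    return max(set(neighbor_targets), key=neighbor_targets.count)

print(summarize_regression([3.1, 2.9, 3.4]))        # ~3.13
print(summarize_classification(["a", "b", "a"]))    # 'a'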
K-Nearest Neighbors
• The kNN algorithm belongs to the family of
• instance-based,
• competitive learning and
• lazy learning algorithms.
Competitive learning algorithms
• It is a competitive learning algorithm, because it internally uses
competition between model elements (data instances) in order to
make a predictive decision.
• The objective similarity measure between data instances causes each
data instance to compete to “win” or be most similar to a given
unseen data instance and contribute to a prediction.
Instance-based algorithms
• Instance-based algorithms are those algorithms that model the
problem using data instances (or rows) in order to make predictive
decisions.
• The kNN algorithm is an extreme form of instance-based methods
because all training observations are retained as part of the model.
Lazy learning
• Lazy learning refers to the fact that the algorithm does not build a
model until the time that a prediction is required.
• It is lazy because it only does work at the last second.
• This has the benefit of using only the data relevant to the unseen
instance, resulting in a localized model.
• A disadvantage is that it can be computationally expensive to repeat
the same or similar searches over larger training datasets.
K-nearest neighbors
• Finally, kNN is powerful because it does not assume anything about
the data, other than that a distance measure can be calculated
consistently between any two instances.
• As such, it is called non-parametric or non-linear as it does not
assume a functional form.
Extensions (Asynchronous) By Pairs Sept 23
• Tune KNN. Try larger and larger k values to see if you can improve
the performance of the algorithm on the Iris dataset.
• Regression. Adapt the example and apply it to a regression
predictive modeling problem (e.g., predict a numerical value).
• More Distance Measures. Implement other distance measures that
you can use to find similar historical data, such as Hamming
distance, Manhattan distance and Minkowski distance (a brief
sketch follows this list).
• Data Preparation. Distance measures are strongly affected by the
scale of the input data. Experiment with standardization and other
data preparation methods in order to improve results.
• More Problems. As always, experiment with the technique on more
and different classification and regression problems.
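A hedged sketch of the additional distance measures named in the More Distance Measures extension (Manhattan and Minkowski); the function names are illustrative.

def manhattan_distance(a, b):
    # Sum of absolute differences; equivalent to Minkowski with p = 1.
    return sum(abs(x - y) for x, y in zip(a, b))

def minkowski_distance(a, b, p=2):
    # Generalizes Manhattan (p = 1) and Euclidean (p = 2).
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1.0 / p)

print(manhattan_distance([2.0, 4.0], [5.0, 8.0]))        # 7.0
print(minkowski_distance([2.0, 4.0], [5.0, 8.0], p=2))   # 5.0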
Expected Output (By Pairs, Sept 23)
• Tune KNN. Try larger and larger k values to see if you can
improve the performance of the algorithm on the Iris dataset.
• Use at least 5 different values for k.
• Using a table, present the accuracy and briefly explain.
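A sketch of the kind of experiment expected, assuming scikit-learn is available for loading the Iris dataset and for a reference kNN classifier; the from-scratch predictor from the linked tutorial could be substituted for KNeighborsClassifier, and the chosen k values and split are only examples.

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)

print("k  accuracy")
for k in (1, 3, 5, 7, 9):   # at least 5 different values of k
    model = KNeighborsClassifier(n_neighbors=k).fit(X_train, y_train)
    acc = accuracy_score(y_test, model.predict(X_test))
    print(f"{k:<2} {acc:.3f}")

The printed k/accuracy pairs can be copied into the required table and used as the basis for the brief explanation of how k affects performance.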