K- Nearest Neighbor
Carla P. Gomes
CS4700
1-Nearest Neighbor
One of the simplest of all machine learning classifiers
Simple idea: label a new point the same as the closest known point
(In the figure, the closest known point is red, so label the new point red.)
Distance Metrics
Different metrics can change the decision surface
Dist(a,b) = (a1 – b1)^2 + (a2 – b2)^2        vs.        Dist(a,b) = (a1 – b1)^2 + (3a2 – 3b2)^2
Standard Euclidean distance metric:
– Two-dimensional: Dist(a,b) = sqrt((a1 – b1)^2 + (a2 – b2)^2)
– Multivariate: Dist(a,b) = sqrt(∑ (ai – bi)^2)
Adapted from “Instance-Based Learning” lecture slides by Andrew Moore, CMU.
1-NN’s Aspects as an Instance-Based Learner:
A distance metric
– Euclidean
– When different units are used for each dimension, normalize each dimension by its standard deviation
– For discrete data, can use Hamming distance:
D(x1,x2) = number of features on which x1 and x2 differ
– Others (e.g., normal, cosine)
How many nearby neighbors to look at?
– One
How to fit with the local points?
– Just predict the same output as the nearest neighbor (see the sketch below).
Adapted from “Instance-Based Learning” lecture slides by Andrew Moore, CMU.
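As a concrete illustration of these aspects, here is a minimal 1-NN sketch in Python (not from the original slides; the function names and the tiny data set are illustrative assumptions). It uses Euclidean distance for numeric features, Hamming distance for discrete ones, and normalizes each dimension by its standard deviation before measuring distance.

```python
import numpy as np

def euclidean(a, b):
    # standard Euclidean distance: sqrt of the sum of squared per-dimension differences
    return np.sqrt(np.sum((a - b) ** 2))

def hamming(a, b):
    # for discrete data: number of features on which the two examples differ
    return np.sum(a != b)

def nn1_predict(X_train, y_train, x_new, dist=euclidean):
    # 1-NN: find the single closest stored example and copy its label
    distances = [dist(x, x_new) for x in X_train]
    return y_train[int(np.argmin(distances))]

# Illustrative data: dividing each dimension by its standard deviation keeps
# the large-valued second attribute from swamping the first one.
X_train = np.array([[1.0, 200.0], [2.0, 180.0], [8.0, 150.0]])
y_train = np.array(["red", "red", "blue"])
stds = X_train.std(axis=0)
print(nn1_predict(X_train / stds, y_train, np.array([1.5, 190.0]) / stds))  # -> red
```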
k – Nearest Neighbor
Generalizes 1-NN to smooth away noise in the labels
A new point is now assigned the most frequent label of its k nearest neighbors.
In the figure: label it red when k = 3, but blue when k = 7.
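A minimal sketch of this majority-vote rule (illustrative code, not from the slides; names are assumptions):

```python
from collections import Counter
import numpy as np

def knn_predict(X_train, y_train, x_new, k=3):
    # rank stored examples by Euclidean distance to the query point
    dists = np.sqrt(((X_train - x_new) ** 2).sum(axis=1))
    nearest = np.argsort(dists)[:k]
    # assign the most frequent label among the k nearest neighbors
    return Counter(y_train[i] for i in nearest).most_common(1)[0][0]
```

Running the same query with k = 3 and then k = 7 can return different labels, exactly as in the figure above.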
KNN Example
   Food (3)   Chat (2)   Fast (2)   Price (3)   Bar (2)   BigTip
1  great      yes        yes        normal      no        yes
2  great      no         yes        normal      no        yes
3  mediocre   yes        no         high        no        no
4  great      yes        yes        normal      yes       yes
Similarity metric: Number of matching attributes (k=2)
New examples:
– Example 1 (great, no, no, normal, no) → predict Yes
Most similar: number 2 (1 mismatch, 4 matches) – yes
Second most similar: number 1 (2 mismatches, 3 matches) – yes
– Example 2 (mediocre, yes, no, normal, no) → Yes/No (tie: with k = 2, the two nearest neighbors disagree)
Most similar: number 3 (1 mismatch, 4 matches) – no
Second most similar: number 1 (2 mismatches, 3 matches) – yes
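The worked example above can be reproduced with a short script (illustrative, not part of the slides); the matching-attributes similarity simply counts how many of the five attributes agree.

```python
from collections import Counter

# Training examples from the table: (Food, Chat, Fast, Price, Bar) -> BigTip
train = [
    (("great",    "yes", "yes", "normal", "no"),  "yes"),   # 1
    (("great",    "no",  "yes", "normal", "no"),  "yes"),   # 2
    (("mediocre", "yes", "no",  "high",   "no"),  "no"),    # 3
    (("great",    "yes", "yes", "normal", "yes"), "yes"),   # 4
]

def matches(a, b):
    # similarity = number of attributes on which the two examples agree
    return sum(x == y for x, y in zip(a, b))

def knn_vote(query, k=2):
    ranked = sorted(train, key=lambda ex: matches(query, ex[0]), reverse=True)
    return Counter(label for _, label in ranked[:k]).most_common()

print(knn_vote(("great", "no", "no", "normal", "no")))      # both nearest neighbors say "yes"
print(knn_vote(("mediocre", "yes", "no", "normal", "no")))  # split vote: one "no", one "yes"
```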
Selecting the Number of Neighbors
Increase k:
– Makes KNN less sensitive to noise
Decrease k:
– Allows capturing finer structure of space
Pick a k that is neither too large nor too small (the right value depends on the data).
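The slides leave the choice of k to the data; one common way to make that choice concrete (an assumption, not something the slides prescribe) is to measure accuracy on a held-out validation set for several candidate values:

```python
import numpy as np
from collections import Counter

def knn_predict(X, y, x_new, k):
    # majority vote among the k nearest training points (Euclidean distance)
    dists = np.sqrt(((X - x_new) ** 2).sum(axis=1))
    return Counter(y[i] for i in np.argsort(dists)[:k]).most_common(1)[0][0]

def accuracy_for_k(X_train, y_train, X_val, y_val, k):
    # fraction of held-out points whose k-NN prediction matches the true label
    preds = [knn_predict(X_train, y_train, x, k) for x in X_val]
    return float(np.mean([p == t for p, t in zip(preds, y_val)]))

# Hypothetical usage: evaluate a few odd values of k and keep the best one.
# best_k = max([1, 3, 5, 7, 9], key=lambda k: accuracy_for_k(X_tr, y_tr, X_va, y_va, k))
```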
Curse of Dimensionality
Prediction accuracy can quickly degrade when the number of attributes grows.
– Irrelevant attributes easily “swamp” information from relevant attributes
– When there are many irrelevant attributes, the similarity/distance measure becomes less reliable
Remedy
– Try to remove irrelevant attributes in a pre-processing step
– Weight attributes differently (see the sketch after this list)
– Increase k (but not too much)
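A small sketch of the “weight attributes differently” remedy (illustrative assumption; the slides do not prescribe specific weights): standardize each attribute, then down-weight the ones believed to be irrelevant so they cannot swamp the distance.

```python
import numpy as np

def weighted_distance(a, b, weights):
    # per-attribute weights scale each dimension's contribution to the distance;
    # a weight of 0 removes the attribute entirely
    return np.sqrt(np.sum(weights * (a - b) ** 2))

# Illustrative data: three attributes, the third suspected to be irrelevant noise.
X = np.array([[1.0, 10.0, 500.0], [2.0, 12.0, 900.0], [3.0, 11.0, 100.0]])
X_std = (X - X.mean(axis=0)) / X.std(axis=0)   # standardize each attribute
weights = np.array([1.0, 1.0, 0.1])            # down-weight the noisy attribute
print(weighted_distance(X_std[0], X_std[1], weights))
```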
Advantages and Disadvantages of KNN
Disadvantages:
– Needs a distance/similarity measure and attributes that “match” the target function.
– Must make a pass through the entire training set for each classification, which can be prohibitive for large data sets.
– Prediction accuracy can quickly degrade when the number of attributes grows.
Advantages:
– Simple to implement;
– Requires little tuning;
– Often performs quite well!
(Try it first on a new learning problem.)