Nearest-Neighbor Classifier: MTL 782 Iit Delhi

machine learning KNN

Uploaded by

webdev397

Nearest-Neighbor Classifier (KNN)
MTL 782
IIT DELHI
Instance-Based Classifiers
• Store the training records
• Use training records to predict the class label of unseen cases
[Figure: a set of stored cases (attributes Atr1 … AtrN, with class labels A, B, B, C, A, C, B) and an unseen case (attributes Atr1 … AtrN) whose class is to be predicted]
Instance Based Classifiers
• Examples:
– Rote-learner
• Memorizes entire training data and performs classification only if attributes
of record match one of the training examples exactly

– Nearest neighbor
• Uses k “closest” points (nearest neighbors) for performing classification
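The rote-learner described above can be sketched in a few lines. This is only an illustrative sketch: the class name `RoteLearner` and its `fit`/`predict` methods are assumptions made for the example, not part of the slides.

```python
class RoteLearner:
    """Memorizes training records; classifies only on an exact attribute match."""

    def __init__(self):
        self._table = {}

    def fit(self, records, labels):
        # Store every training record verbatim, keyed by its attribute tuple.
        for attrs, label in zip(records, labels):
            self._table[tuple(attrs)] = label
        return self

    def predict(self, attrs):
        # Returns None when no stored record matches the attributes exactly.
        return self._table.get(tuple(attrs))


rote = RoteLearner().fit([(1, 0), (0, 1)], ["A", "B"])
exact = rote.predict((1, 0))        # matches a stored record
near_miss = rote.predict((1, 0.1))  # no exact match, so no prediction
```

The near miss gets no label at all, which is the limitation that motivates using the k "closest" points instead of exact matches.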
Nearest Neighbor Classifiers
• Basic idea:
– If it walks like a duck, quacks like a duck, then it’s probably a duck

[Figure: compute the distance from the test record to each training record, then choose the k “nearest” records]
Nearest-Neighbor Classifiers
• Requires three things
– The set of stored records
– Distance metric to compute distance between records
– The value of k, the number of nearest neighbors to retrieve
• To classify an unknown record:
– Compute distance to other training records
– Identify k nearest neighbors
– Use class labels of nearest neighbors to determine the class label of unknown record (e.g., by taking majority vote)
Definition of Nearest Neighbor

[Figure: a record X with (a) its 1-nearest neighbor, (b) its 2-nearest neighbors, (c) its 3-nearest neighbors]

The k-nearest neighbors of a record x are the data points that have the k smallest distances to x
1-Nearest-Neighbor: Voronoi Diagram
Nearest Neighbor Classification
• Compute distance between two points:
– Euclidean distance
$d(p, q) = \sqrt{\sum_i (p_i - q_i)^2}$
– Manhattan distance
$d(p, q) = \sum_i |p_i - q_i|$
– q-norm distance
$d(p, q) = \left( \sum_i |p_i - q_i|^q \right)^{1/q}$
• Determine the class from nearest neighbor list
– Take the majority vote of class labels among the k-nearest neighbors
$y' = \arg\max_v \sum_{(\mathbf{x}_i, y_i) \in D_z} I(v = y_i)$
where $D_z$ is the set of k closest training examples to z.
– Weigh the vote according to distance
$y' = \arg\max_v \sum_{(\mathbf{x}_i, y_i) \in D_z} w_i \times I(v = y_i)$
• weight factor, $w = 1/d^2$
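The distance formulas and the distance-weighted vote above can be sketched as follows. The function names `minkowski` and `weighted_vote` are hypothetical, and the small `eps` term is an added assumption (not in the slides) that guards against division by zero when a neighbor coincides with the query.

```python
from collections import defaultdict


def minkowski(p, q, order=2):
    """q-norm distance: order=1 gives Manhattan, order=2 gives Euclidean."""
    return sum(abs(pi - qi) ** order for pi, qi in zip(p, q)) ** (1.0 / order)


def weighted_vote(neighbors, eps=1e-12):
    """neighbors: list of (distance, label) pairs; each vote is weighted by 1/d^2.

    eps is an assumption added here to avoid dividing by zero.
    """
    totals = defaultdict(float)
    for d, label in neighbors:
        totals[label] += 1.0 / (d * d + eps)
    return max(totals, key=totals.get)


manhattan = minkowski((0, 0), (3, 4), order=1)
euclidean = minkowski((0, 0), (3, 4), order=2)

# One close "A" neighbor against two more distant "B" neighbors.
winner = weighted_vote([(0.5, "A"), (2.0, "B"), (2.0, "B")])
```

In this toy case a plain majority vote would pick "B" (two votes to one), while the 1/d² weighting lets the single close neighbor decide.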
The KNN classification algorithm
Let k be the number of nearest neighbors and D be the set of training examples.
1. for each test example z = (x’, y’) do
2. Compute d(x’, x), the distance between z and every example (x, y) ∈ D
3. Select Dz ⊆ D, the set of k closest training examples to z.
4. $y' = \arg\max_v \sum_{(\mathbf{x}_i, y_i) \in D_z} I(v = y_i)$
5. end for
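A minimal Python sketch of the numbered steps above, using Euclidean distance and a plain majority vote. The function name `knn_predict` and the toy data are assumptions for illustration; how ties between classes are broken is not specified in the slides, and here they simply fall to whichever label was counted first.

```python
import heapq
import math
from collections import Counter


def knn_predict(D, test_points, k):
    """D: list of (x, y) training examples; test_points: list of attribute tuples."""
    predictions = []
    for x_prime in test_points:                                   # step 1
        # Step 2: distance between the test example and every (x, y) in D.
        distances = [(math.dist(x_prime, x), y) for x, y in D]
        # Step 3: D_z, the k closest training examples.
        D_z = heapq.nsmallest(k, distances, key=lambda pair: pair[0])
        # Step 4: majority vote over the labels in D_z.
        counts = Counter(y for _, y in D_z)
        predictions.append(counts.most_common(1)[0][0])
    return predictions


D = [((1.0, 1.0), "A"), ((1.2, 0.8), "A"), ((0.9, 1.3), "A"),
     ((5.0, 5.0), "B"), ((5.5, 4.5), "B")]
labels = knn_predict(D, [(1.1, 1.0), (5.2, 4.9)], k=3)
```

Every query scans all of D, so the cost per test example grows linearly with the number of stored records.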
KNN Classification
[Figure: scatter plot of loan amount (Loan$, $0 to $2,50,000) against Age (0 to 70), with points marked Non-Default and Default]
Nearest Neighbor Classification…
• Choosing the value of k:
– If k is too small, sensitive to noise points
– If k is too large, neighborhood may include points from other classes
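The effect of k can be seen on a small made-up one-dimensional data set. The points, the query value, and the helper `knn_label` below are all invented for illustration.

```python
from collections import Counter


def knn_label(points, query, k):
    """points: list of (value, label) pairs on a line; majority vote over the k nearest."""
    nearest = sorted(points, key=lambda p: abs(p[0] - query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


points = [
    (1.0, "+"), (1.2, "+"), (1.4, "+"),
    (1.3, "-"),                                   # a noise point inside the "+" region
    (5.0, "-"), (5.2, "-"), (5.4, "-"), (5.6, "-"), (5.8, "-"),
]
query = 1.31
by_k = {k: knn_label(points, query, k) for k in (1, 3, 9)}
```

With k = 1 the query takes the label of the noise point ("-"); with k = 3 the surrounding "+" points outvote it; with k = 9 the neighborhood covers the whole data set and the larger, distant "-" class wins again.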

Nearest Neighbor Classification…
• Scaling issues
– Attributes may have to be scaled to prevent distance measures from
being dominated by one of the attributes
– Example:
• height of a person may vary from 1.5 m to 1.8 m
• weight of a person may vary from 60 kg to 100 kg
• income of a person may vary from Rs 10K to Rs 2 lakh
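A rough sketch of why scaling matters, using the attribute ranges from the example above. Min-max scaling to [0, 1] is one common choice and is an assumption here (the slides do not name a scaling method); the three people are made-up records.

```python
import math


def min_max_scale(rows):
    """Rescale each column to [0, 1]; assumes every column has max > min."""
    columns = list(zip(*rows))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    return [tuple((v - lo) / (hi - lo) for v, lo, hi in zip(row, lows, highs))
            for row in rows]


# (height in m, weight in kg, income in Rs)
people = [
    (1.5, 60.0, 10_000.0),    # short, light, low income
    (1.8, 100.0, 11_000.0),   # tall, heavy, similar income
    (1.5, 60.0, 200_000.0),   # same build as the first, very different income
]

raw_01 = math.dist(people[0], people[1])
raw_02 = math.dist(people[0], people[2])

scaled = min_max_scale(people)
scaled_01 = math.dist(scaled[0], scaled[1])
scaled_02 = math.dist(scaled[0], scaled[2])
```

On the raw values both distances are essentially just the income difference (about 1,000 and 190,000), so height and weight have almost no influence. After scaling, the large difference in height and weight between the first two people shows up in the distance as well.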
Nearest Neighbor Classification…
• Problem with Euclidean measure:
– High dimensional data
• curse of dimensionality: all vectors are almost equidistant to the query vector
– Can produce undesirable results
111111111110 vs 011111111111: d = 1.4142
100000000000 vs 000000000001: d = 1.4142
• Solution: Normalize the vectors to unit length
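The two pairs of binary vectors above can be checked directly. This sketch computes the Euclidean distance before and after scaling each vector to unit length; the helper names `to_vector` and `unit` are hypothetical.

```python
import math


def to_vector(bits):
    """Turn a string such as "1010" into a list of floats."""
    return [float(b) for b in bits]


def unit(v):
    """Scale v to unit Euclidean length; assumes v is not all zeros."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


a, b = to_vector("111111111110"), to_vector("011111111111")
c, d = to_vector("100000000000"), to_vector("000000000001")

raw_ab = math.dist(a, b)                  # about 1.4142
raw_cd = math.dist(c, d)                  # about 1.4142
norm_ab = math.dist(unit(a), unit(b))     # about 0.4264
norm_cd = math.dist(unit(c), unit(d))     # about 1.4142
```

The first pair shares ten of its eleven 1s while the second pair shares none, yet the raw distances are identical. After normalization the first pair is much closer than the second, which matches intuition.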

Nearest neighbor Classification…
• k-NN classifiers are lazy learners
– They do not build models explicitly
– Unlike eager learners such as decision tree induction and rule-based systems
– Classifying unknown records is relatively expensive
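To make the "lazy" behavior concrete, the sketch below counts distance computations. The class `LazyOneNN` and its counter are hypothetical and only meant to show where the work happens: nothing is computed when the records are stored, and every stored record is visited for each query.

```python
import math


class LazyOneNN:
    """1-nearest-neighbor classifier that counts its distance computations."""

    def __init__(self):
        self.records = []
        self.distance_calls = 0

    def fit(self, records):
        # "Training" only stores the records; no model is built.
        self.records = list(records)
        return self

    def predict(self, x):
        best_label, best_dist = None, math.inf
        for attrs, label in self.records:
            self.distance_calls += 1
            dist = math.dist(attrs, x)
            if dist < best_dist:
                best_label, best_dist = label, dist
        return best_label


model = LazyOneNN().fit([((0.0, 0.0), "A"), ((1.0, 1.0), "B"), ((4.0, 4.0), "B")])
calls_after_fit = model.distance_calls
label = model.predict((0.2, 0.1))
calls_after_one_query = model.distance_calls
```

An eager learner would pay its cost up front when building the model; here the cost per query is one distance computation per stored record.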
Thank You
