Lecture 12


Data Science
CSE-4075
(K-Nearest Neighbor)

"If you want to annoy your neighbors, tell the truth about them."
Pietro Aretino
Different Learning Methods
• Eager Learning
  – Learns an explicit description of the target function over the whole training set
• Instance-based Learning
  – Learning = storing all training instances
  – Classification = assigning a target function value to a new instance
  – Referred to as "lazy" learning
Eager Learning
(Illustration) "Any random movement => it's a mouse."
"I saw a mouse!"

Instance-based Learning
(Illustration) "It's very similar to a desktop!"
Instance-based Learning
• Approximates real-valued or discrete-valued target functions
• Learning consists simply of storing the presented training data
• When a new query instance is encountered, a set of similar related instances is retrieved from memory and used to classify the new query instance
• A disadvantage of instance-based methods is that the cost of classifying new instances can be high
• Nearly all computation takes place at classification time rather than at learning time
K-Nearest Neighbor Algorithm
• Most basic instance-based method
• Data are represented in a vector space
• Supervised learning
WHY NEAREST NEIGHBOR?
• Used to classify objects based on the closest training examples in the feature space
  – Feature space: raw data transformed into sample vectors of fixed length using feature extraction (the training data)
• One of the top 10 data mining algorithms
  – ICDM paper – December 2007
• Among the simplest of all data mining algorithms
  – A classification method
• An implementation of a lazy learner
  – All computation is deferred until classification time
K NEAREST NEIGHBOR
• Requires 3 things:
  – Feature space (training data)
  – Distance metric
    • to compute the distance between records
  – The value of k
    • the number of nearest neighbors to retrieve, from which to take the majority class
• To classify an unknown record:
  – Compute its distance to all training records
  – Identify the k nearest neighbors
  – Use the class labels of the nearest neighbors to determine the class label of the unknown record (a minimal sketch follows below)
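Taken together, these three steps amount to only a few lines of code. A minimal sketch in Python, assuming NumPy arrays, Euclidean distance, and majority voting (the function and variable names are illustrative, not from the lecture):

```python
import numpy as np
from collections import Counter

def knn_classify(X_train, y_train, x_query, k=3):
    """Classify x_query by majority vote among its k nearest training records."""
    # 1. Compute the distance from the query to every training record (Euclidean).
    distances = np.sqrt(((X_train - x_query) ** 2).sum(axis=1))
    # 2. Identify the k nearest neighbors.
    nearest = np.argsort(distances)[:k]
    # 3. Use the neighbors' class labels to determine the query's label (majority vote).
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]
```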
Feature Space
• Training data: { (x^(1), f(x^(1))), (x^(2), f(x^(2))), ..., (x^(n), f(x^(n))) }
• Each instance is a vector x = (x_1, x_2, ..., x_d) in R^d
• Euclidean distance between two instances x and y:
  ||x - y|| = sqrt( sum_{i=1}^{d} (x_i - y_i)^2 )
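As a quick numerical check of this formula, the hand-computed distance agrees with NumPy's built-in norm (a small sketch; the two sample vectors reuse two of the training instances from the worked example later in this lecture):

```python
import numpy as np

x = np.array([7.0, 7.0])   # training instance (I) from the worked example
y = np.array([3.0, 4.0])   # training instance (III) from the worked example

# Euclidean distance: sqrt(sum_i (x_i - y_i)^2)
d_manual = np.sqrt(np.sum((x - y) ** 2))
d_numpy = np.linalg.norm(x - y)
assert np.isclose(d_manual, d_numpy)   # both evaluate to 5.0 here
```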


K NEAREST NEIGHBOR
(Illustration: a query point "?" surrounded by square-class and triangle-class training points)
• k = 1: the query belongs to the square class
• k = 3: the query belongs to the triangle class
• k = 7: the query belongs to the square class

• Choosing the value of k:
  – If k is too small, the classifier is sensitive to noise points
  – If k is too large, the neighborhood may include points from other classes
  – Choose an odd value for k to eliminate ties
(Source: ICDM, Top Ten Data Mining Algorithms, k-nearest neighbor classification, December 2006)
How to Determine a Good Value for k
• Determined experimentally
• Start with k = 1 and use a test set to validate the error rate of the classifier
• Repeat with k = k + 2
• Choose the value of k for which the error rate is minimum (a sketch of this search follows below)
• Note: k should be an odd number to avoid ties
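A sketch of this search in Python, assuming a held-out test set and the knn_classify helper sketched earlier (the candidate range k_max and all names are illustrative):

```python
import numpy as np

def choose_k(X_train, y_train, X_test, y_test, k_max=15):
    """Try odd values of k and return the one with the lowest test error rate."""
    best_k, best_error = None, float("inf")
    for k in range(1, k_max + 1, 2):          # k = 1, 3, 5, ... (odd to avoid ties)
        predictions = [knn_classify(X_train, y_train, x, k) for x in X_test]
        error = np.mean(np.array(predictions) != np.array(y_test))
        if error < best_error:                # keep the k with the minimum error rate
            best_k, best_error = k, error
    return best_k
```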


Training Data
  (I) (7, 7), False    (II) (7, 4), False    (III) (3, 4), True    (IV) (1, 4), True

Testing Instance: ?

Parameters:
  Distance metric = Euclidean distance
  Number of nearest neighbors = K = 3

Neighbor ranking (by Euclidean distance to the testing instance):

  Instance        Neighbor Closeness   Neighbor Class
  (I)   (7, 7)    N = 3                False
  (II)  (7, 4)    N = 4                False
  (III) (3, 4)    N = 1                True
  (IV)  (1, 4)    N = 2                True

Decision: For K = 3, True = 2 > False = 1. Hence, the testing instance is classified as True.
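The slide's example can be reproduced with the knn_classify sketch from above (a minimal sketch; the slide does not state the coordinates of the testing instance, so the query point (3, 7) below is a hypothetical placeholder that happens to give the same neighbor ranking):

```python
import numpy as np

# Training data from the slide: instances (I)-(IV)
X_train = np.array([[7, 7], [7, 4], [3, 4], [1, 4]], dtype=float)
y_train = np.array([False, False, True, True])

# Hypothetical query point; the slide does not give the real testing instance.
x_query = np.array([3.0, 7.0])

label = knn_classify(X_train, y_train, x_query, k=3)
print(label)  # majority vote among the 3 nearest neighbors -> True
```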
When to Consider Nearest Neighbors
• Instances map to points in R^d
• Fewer than 20 features (attributes) per instance, typically normalized
• Lots of training data

Advantages:
• Training is very fast
• Can learn complex target functions
• Does not lose information

Disadvantages:
• Slow at query time
  – Presorting and indexing training samples into search trees reduces query time (see the sketch below)
• Easily fooled by irrelevant features (attributes)
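One common way to do that presorting in practice is a KD-tree. The sketch below uses scikit-learn's KNeighborsClassifier with a KD-tree index (an assumption: scikit-learn is not part of the lecture, it is just one widely used implementation; the training data are the four instances from the worked example and the query point is hypothetical):

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

X_train = np.array([[7, 7], [7, 4], [3, 4], [1, 4]], dtype=float)
y_train = np.array([0, 0, 1, 1])  # 0 = False, 1 = True

# Index the training samples in a KD-tree so queries avoid a full linear scan.
clf = KNeighborsClassifier(n_neighbors=3, algorithm="kd_tree", metric="euclidean")
clf.fit(X_train, y_train)

print(clf.predict([[3.0, 7.0]]))  # hypothetical query point -> class 1 (True)
```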
