K-Nearest Neighbor Learning
Classification is the process of assigning data to predefined class labels. Clustering is similar to classification, but there are no predefined class labels. Classification is a form of supervised learning; clustering, by contrast, is known as unsupervised learning.
Nearest Neighbor Classifier
Nearest-neighbor classifiers are
based on learning by analogy, that
is, by comparing a given test tuple
with training tuples that are similar
to it.
The training tuples are described by
n attributes. Each tuple represents a
point in an n-dimensional space. In
this way, all of the training tuples
are stored in an n-dimensional
pattern space.
When given an unknown tuple, a k-
nearest-neighbor classifier searches
the pattern space for the k training
tuples that are closest to the
unknown tuple. These k training
tuples are the k “nearest neighbors”
of the unknown tuple.
Closeness is defined in terms of a distance metric, such as Euclidean distance. The Euclidean distance between two points or tuples, say, X1 = (x11, x12, …, x1n) and X2 = (x21, x22, …, x2n), is

d(X1, X2) = sqrt(sum for i = 1 to n of (x1i - x2i)^2)
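The distance formula above can be sketched in a few lines of Python (an illustrative snippet; the function name is my own, not from the slides):

```python
import math

def euclidean_distance(x1, x2):
    """Euclidean distance between two equal-length tuples of numbers."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x1, x2)))

# Distance between (3, 7) and (7, 7): sqrt((3-7)^2 + (7-7)^2) = 4.0
print(euclidean_distance((3, 7), (7, 7)))  # 4.0
```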
Class Label
For k-nearest-neighbor
classification, the unknown tuple is
assigned the most common class
among its k nearest neighbors.
When k = 1, the unknown tuple is
assigned the class of the training
tuple that is closest to it in pattern
space.
[Figures: 1-nearest-neighbor and 3-nearest-neighbor decision examples]
K-Nearest Neighbor
An arbitrary instance x is represented by the feature vector
(a1(x), a2(x), a3(x), …, an(x))
where ar(x) denotes the value of the r-th feature of x.
The Euclidean distance between two instances xi and xj is
d(xi, xj) = sqrt(sum for r = 1 to n of (ar(xi) - ar(xj))^2)
For a continuous-valued target function, the prediction is the mean value of the k nearest training examples.
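The continuous-valued case, predicting the mean target of the k nearest training examples, can be sketched as follows (the function name and sample data are hypothetical):

```python
import math

def knn_regress(query, training, k):
    """Predict a continuous target as the mean of the k nearest
    training examples. `training` is a list of (features, target) pairs."""
    dist = lambda x1, x2: math.sqrt(sum((a - b) ** 2 for a, b in zip(x1, x2)))
    nearest = sorted(training, key=lambda ex: dist(query, ex[0]))[:k]
    return sum(target for _, target in nearest) / k

# Hypothetical data: the target roughly tracks the single feature
data = [((1,), 1.0), ((2,), 2.0), ((3,), 3.0), ((10,), 10.0)]
print(knn_regress((2,), data, 3))  # mean of 2.0, 1.0, 3.0 -> 2.0
```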
K-Nearest Neighbor
Here is how to apply the K-nearest neighbor algorithm, step by step:
1. Determine the parameter K, the number of nearest neighbors.
2. Calculate the distance between the query instance and all the training samples.
3. Sort the distances and determine the nearest neighbors as the K samples with the smallest distances.
4. Gather the categories Y of those nearest neighbors.
5. Use the majority category of the nearest neighbors as the predicted class of the query instance.
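The five steps above can be sketched as a small Python function (a minimal illustration, assuming numeric feature tuples; the names are my own):

```python
import math
from collections import Counter

def knn_classify(query, training, k):
    """K-nearest-neighbor classification.
    `training` is a list of (features, label) pairs."""
    # Step 2: distance between the query instance and every training sample
    dist = lambda x1, x2: math.sqrt(sum((a - b) ** 2 for a, b in zip(x1, x2)))
    distances = [(dist(query, x), y) for x, y in training]
    # Step 3: sort the distances and keep the k nearest neighbors
    neighbors = sorted(distances, key=lambda d: d[0])[:k]
    # Step 4: gather the categories of the nearest neighbors
    labels = [y for _, y in neighbors]
    # Step 5: majority vote decides the predicted class
    return Counter(labels).most_common(1)[0][0]
```

Note that ties (both in distance and in the vote) are broken arbitrarily here; a production implementation would need an explicit tie-breaking rule.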
Example
We have data with two attributes, used to classify whether a special tissue is ‘good’ or ‘bad’:

X1 = Acid Durability (seconds) | X2 = Strength (kg/square meter) | Y = Classification
7 | 7 | Bad
7 | 4 | Bad
3 | 4 | Good
1 | 4 | Good
Now the factory produces a new tissue that passes the laboratory test with X1 = 3 and X2 = 7. Without another expensive survey, can we guess what the classification of this new tissue is?
1. Determine the parameter K, the number of nearest neighbors. Suppose we use K = 3.
2. Calculate the distance between the query instance (X1 = 3, X2 = 7) and all the training samples. Steps 3 and 4 (ranking the distances and gathering the categories) are shown in the same table:

X1 = Acid Durability (seconds) | X2 = Strength (kg/square meter) | Squared distance to query | Rank | Included in 3 nearest neighbors? | Y = Category
7 | 7 | (7-3)^2 + (7-7)^2 = 16 | 3 | Yes | Bad
7 | 4 | (7-3)^2 + (4-7)^2 = 25 | 4 | No | -
3 | 4 | (3-3)^2 + (4-7)^2 = 9 | 1 | Yes | Good
1 | 4 | (1-3)^2 + (4-7)^2 = 13 | 2 | Yes | Good
5. Use the majority category of the nearest neighbors as the predicted class of the query instance. We have 2 Good and 1 Bad among the 3 nearest neighbors, so the new tissue is classified as Good.
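The whole worked example can be reproduced with a short self-contained script (variable names are illustrative):

```python
import math
from collections import Counter

# Training data from the tissue example
training = [((7, 7), "Bad"), ((7, 4), "Bad"), ((3, 4), "Good"), ((1, 4), "Good")]
query = (3, 7)  # the new tissue: X1 = 3, X2 = 7
k = 3

dist = lambda p, q: math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))
# Sort training samples by distance to the query and keep the 3 nearest
neighbors = sorted(training, key=lambda ex: dist(query, ex[0]))[:k]
votes = Counter(label for _, label in neighbors)
print(votes)                       # Counter({'Good': 2, 'Bad': 1})
print(votes.most_common(1)[0][0])  # Good
```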
Exercise
Find the class label for the instance (Age = 26, Income = 18).