
Introduction to Data Analytics
ITE 5201
Lecture 11: K Nearest Neighbor Classification
Instructor: Parisa Pouladzadeh
Email: [email protected]
www.udemy.com/course/python-for-data-science-and-machine-learning
Nearest Neighbor Classifiers
➢ The KNN algorithm is one of the simplest classification algorithms.
➢ It is non-parametric: it makes no assumptions about the underlying data distribution.
➢ It is a lazy learning algorithm:
  ➢ there is no explicit training phase, or it is very minimal
  ➢ this also means that the training phase is quite fast
  ➢ the lack of generalization means that KNN keeps all the training data
➢ Its purpose is to use a database in which the data points are separated into several classes to predict the class of a new sample point.
Nearest Neighbor Classifiers

Basic idea:
◦ If it walks like a duck, quacks like a duck, then it’s probably a duck

[Figure: the distance from a test record is computed to each of the training records.]



Nearest Neighbor Classifiers

The KNN algorithm is based on feature similarity: how closely out-of-sample features resemble our training set determines how we classify a given data point.



Basic Idea

➢ The k-NN classification rule assigns to a test sample the majority category label of its k nearest training samples.
➢ In practice, k is usually chosen to be odd, so as to avoid ties (illustrated below).
➢ The k = 1 rule is generally called the nearest-neighbor classification rule.
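A tiny illustration, not from the slides, of why an even k can produce a tied vote in a two-class problem while an odd k cannot:

from collections import Counter

# Hypothetical neighbour labels for a two-class problem.
votes_k4 = ["A", "A", "B", "B"]   # even k = 4: a 2-2 tie, no majority
votes_k3 = ["A", "A", "B"]        # odd k = 3: clear majority for "A"

print(Counter(votes_k4).most_common())         # [('A', 2), ('B', 2)]
print(Counter(votes_k3).most_common(1)[0][0])  # 'A'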



Definition of Nearest Neighbor

[Figure: (a) 1-nearest neighbor, (b) 2-nearest neighbor, (c) 3-nearest neighbor]

The k-nearest neighbors of a record x are the data points that have the k smallest distances to x.



Classification steps

➢ Training phase: a model is constructed from the training instances.
  ➢ the classification algorithm finds relationships between predictors and targets
  ➢ the relationships are summarised in a model
➢ Testing phase: test the model on a test sample whose class labels are known but were not used for training the model.
➢ Usage phase: use the model for classification on new data whose class labels are unknown. (A sketch of all three phases follows.)
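A minimal sketch of the three phases, assuming scikit-learn and its bundled Iris dataset (the slides name no particular tools or data):

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# Training phase: "construct" the model (KNN merely stores the training data).
knn = KNeighborsClassifier(n_neighbors=5).fit(X_train, y_train)

# Testing phase: evaluate on labelled samples withheld from training.
print("test accuracy:", knn.score(X_test, y_test))

# Usage phase: classify new data whose class label is unknown.
print("prediction:", knn.predict([[5.1, 3.5, 1.4, 0.2]]))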



K-Nearest Neighbor
Features
◦ All instances correspond to points in an n-dimensional Euclidean space
◦ Classification is delayed until a new instance arrives
◦ Classification is done by comparing the feature vectors of the different points
◦ The target function may be discrete or real-valued


K-Nearest Neighbor

➢ An arbitrary instance x is represented by its feature vector (a_1(x), a_2(x), ..., a_n(x)), where a_r(x) denotes the r-th feature.
➢ Euclidean distance between two instances:
  d(x_i, x_j) = sqrt( Σ_{r=1..n} ( a_r(x_i) − a_r(x_j) )² )
➢ For a continuous-valued target function, the prediction is the mean value of the k nearest training examples.
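A direct Python transcription of the distance formula (the example vectors are hypothetical):

import math

def euclidean_distance(xi, xj):
    # d(x_i, x_j) = sqrt( sum over r of (a_r(x_i) - a_r(x_j))^2 )
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(xi, xj)))

print(euclidean_distance([1.0, 2.0], [4.0, 6.0]))  # 5.0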



Euclidean Distance
• K-nearest neighbours uses the local neighborhood to obtain a prediction.
• The k memorized examples most similar to the one being classified are retrieved.
• A distance function is needed to compare the similarity of examples.
• This means that if we change the distance function, we change how examples are classified (see the sketch below).
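A small sketch, with hypothetical points, showing that swapping Euclidean for Manhattan distance can change which example is nearest:

import math

def manhattan(u, v):
    return sum(abs(x - y) for x, y in zip(u, v))

q, a, b = (0, 0), (2, 2), (3, 0)
print(math.dist(q, a), math.dist(q, b))  # ~2.83 vs 3.0 -> a is nearer
print(manhattan(q, a), manhattan(q, b))  # 4 vs 3       -> b is nearer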



Normalization
If the ranges of the features differ, features with larger values will dominate the distance. In general, feature values are normalized prior to the distance calculation.
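The slides do not name a normalization method; one common choice is min-max scaling to [0, 1], sketched here with made-up data:

def min_max_normalize(rows):
    # Rescale each feature column to [0, 1] so no feature dominates.
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    hi = [max(c) for c in cols]
    return [[(v - l) / (h - l) if h > l else 0.0
             for v, l, h in zip(row, lo, hi)]
            for row in rows]

data = [[170, 70000], [180, 30000], [160, 50000]]  # e.g. height (cm), income ($)
print(min_max_normalize(data))  # [[0.5, 1.0], [1.0, 0.0], [0.0, 0.5]]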



Voronoi diagram
• We frequently need to find the nearest hospital, surgery or supermarket.
• A map divided into cells, each cell covering the region closest to a particular centre, can assist us in our quest.
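The nearest-facility lookup is exactly a 1-nearest-neighbour query; a sketch assuming SciPy, with made-up coordinates:

import numpy as np
from scipy.spatial import cKDTree

hospitals = np.array([[1.0, 2.0], [4.0, 1.0], [3.0, 5.0]])  # hypothetical sites
tree = cKDTree(hospitals)

dist, idx = tree.query([2.5, 2.5])  # our current location
print("nearest hospital:", hospitals[idx], "at distance", dist)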



Voronoi diagram
Another practical problem is to choose a location for a new service, such as a school, which is as far as possible from existing schools while still serving the maximum number of families.
A Voronoi diagram can be used to find the largest empty circle amid a collection of points, giving the ideal location for the new school. Of course, numerous parameters other than distance must be considered, but access time is often the critical factor.
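A sketch of the largest-empty-circle idea, assuming SciPy and random site coordinates (a real siting study would also clip candidates to the service region):

import numpy as np
from scipy.spatial import Voronoi, cKDTree

schools = np.random.default_rng(0).random((10, 2))  # existing school sites
vor = Voronoi(schools)
tree = cKDTree(schools)

# Voronoi vertices are equidistant from their nearest sites, so the
# vertex farthest from any school is the largest empty circle's centre.
radii, _ = tree.query(vor.vertices)
best = int(np.argmax(radii))
print("new school at", vor.vertices[best], "radius", radii[best])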



Numerical Example
Steps (sketched in code below):
1. Determine parameter K = the number of nearest neighbors
2. Calculate the distance between the query instance and all the training samples
3. Sort the distances and determine the nearest neighbors based on the K-th minimum distance
4. Gather the categories of the nearest neighbors
5. Use the simple majority of the categories of the nearest neighbors as the prediction for the query instance
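A from-scratch sketch of these five steps; the training data are made up for illustration:

from collections import Counter
import math

def knn_classify(query, training, k=3):
    # `training` is a list of (feature_vector, label) pairs.
    # Step 2: distance from the query to every training sample.
    dists = [(math.dist(query, x), label) for x, label in training]
    # Step 3: sort and keep the k nearest neighbours.
    neighbours = sorted(dists)[:k]
    # Steps 4-5: gather labels and take a simple majority vote.
    labels = [label for _, label in neighbours]
    return Counter(labels).most_common(1)[0][0]

train = [([7, 7], "Bad"), ([7, 4], "Bad"), ([3, 4], "Good"), ([1, 4], "Good")]
print(knn_classify([3, 7], train, k=3))  # -> 'Good'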



Example

[Figure: worked KNN example]


Voronoi diagram

[Figure: example Voronoi diagram]
