Week 7 Part 1: KNN (K-Nearest Neighbor) Classification

K-Nearest Neighbors (KNN) is a simple machine learning algorithm used for classification and regression. It makes predictions based on the labels of the k nearest training examples in feature space. For classification, it assigns a new example the majority class of its k nearest neighbors. For regression, it predicts the average target value of the k nearest neighbors. KNN is non-parametric, easy to implement, and interpretable, but computationally expensive for large datasets. Choosing an optimal value for k is important to avoid overfitting or underfitting.


Week 7, Part 1

K-NN
K-Nearest Neighbors (KNN) is a simple and versatile machine learning algorithm used for both classification and regression tasks. It is based on the principle of similarity: the predicted label or value of a new data point is determined by the labels or values of its k nearest neighbors in the training dataset.
How does KNN work?

Training: The KNN algorithm first requires a labelled training dataset. Each data point in the dataset has features (attributes) and a corresponding label (class or target value).

Prediction: When a new, unlabelled data point arrives, KNN searches for its k nearest neighbors in the training dataset based on a chosen distance metric (e.g., Euclidean distance).

Classification: For classification tasks, the algorithm assigns the new data point the majority class label of its k nearest neighbors.

Regression: For regression tasks, the algorithm predicts the value of the new data point as the average of the values of its k nearest neighbors, as illustrated in the sketch below.
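
As a concrete illustration of the steps above, here is a minimal from-scratch sketch in Python. The function names and toy data are illustrative, not from the slides, and numeric feature vectors are assumed:

import math
from collections import Counter

def euclidean(p, q):
    # Straight-line distance between two feature vectors
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def knn_predict(X_train, y_train, x_new, k=3, task="classification"):
    # Rank every training point by its distance to the query point
    neighbors = sorted(zip(X_train, y_train), key=lambda pair: euclidean(pair[0], x_new))
    k_labels = [label for _, label in neighbors[:k]]
    if task == "classification":
        # Majority class among the k nearest neighbors
        return Counter(k_labels).most_common(1)[0][0]
    # Regression: average of the k nearest target values
    return sum(k_labels) / k

# Toy usage with made-up data
X_train = [[1.0, 1.1], [1.2, 0.9], [6.0, 6.2], [5.8, 6.1]]
y_train = ["A", "A", "B", "B"]
print(knn_predict(X_train, y_train, [1.1, 1.0], k=3))  # prints "A"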
Key characteristics of KNN

Non-parametric: KNN does not make any assumptions about the underlying data distribution, making it suitable for diverse datasets.

Easy to implement: The algorithm's concept and implementation are relatively straightforward, making it a popular choice for beginners.

Interpretable: KNN predictions are easily interpretable by examining the nearest neighbors used for the prediction.

Lazy learning: KNN doesn't explicitly build a model during the training phase. Instead, it stores the entire training data and performs computations only when making predictions.
KNN: Strengths & Weaknesses
Strengths:
• Simple and intuitive concept
• Versatile for both classification and regression
• Works well with small datasets
• Handles complex relationships between features
• Fairly robust to noisy data when k is large enough to average out individual outliers

Weaknesses:
• Computationally expensive for large datasets
• Sensitive to irrelevant features
• Can suffer from the "curse of dimensionality"
• Requires careful selection of the k value
KNN: Applications
❖Image recognition
❖Spam filtering
❖Customer segmentation
❖Recommendation systems
❖Anomaly detection
❖Time series forecasting
Why k-NN?
What is the k-NN algorithm?
Eager vs. Lazy Learners

In machine learning, a lazy learner is an algorithm that delays the learning process until a query is made to the system. This contrasts with eager learners, which build a model upfront during training.
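
To make the lazy/eager contrast concrete, here is a short scikit-learn sketch (assuming scikit-learn is installed; the toy arrays are made up). For KNN, fit() essentially just memorizes the training set, and the distance computations happen only at predict() time:

import numpy as np
from sklearn.neighbors import KNeighborsClassifier

X = np.array([[0, 0], [0, 1], [5, 5], [6, 5]])
y = np.array([0, 0, 1, 1])

# "Training" is cheap: brute-force KNN simply stores X and y
clf = KNeighborsClassifier(n_neighbors=3, algorithm="brute")
clf.fit(X, y)

# The real work (distances, voting) happens at query time
print(clf.predict([[5, 6]]))  # -> [1]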
How do we choose the factor 'k'?

Choosing the optimal k value in the K-Nearest Neighbors (KNN) algorithm is crucial for its performance. A good k value leads to accurate predictions, while a bad choice can result in underfitting or overfitting.
To choose the value of k:

Small k: A very small k can lead to overfitting, where the KNN algorithm becomes too sensitive to noise and local variations in the training data, resulting in poor generalization to unseen data.

Large k: A very large k can lead to underfitting, where the KNN algorithm becomes insensitive to local patterns and fails to learn the underlying relationships between features and target variables.

Odd k: Choosing an odd k value helps to break voting ties in binary classification. In practice, candidate k values are often compared with cross-validation, as in the sketch after this list.
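
The sketch below picks k by cross-validation using scikit-learn. The synthetic dataset and the candidate range are illustrative assumptions, not from the slides:

from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Synthetic data, just to have something to score against
X, y = make_classification(n_samples=200, n_features=8, random_state=0)

# Try odd k values only, to avoid voting ties in binary classification
scores = {}
for k in range(1, 20, 2):
    model = KNeighborsClassifier(n_neighbors=k)
    scores[k] = cross_val_score(model, X, y, cv=5).mean()  # mean accuracy over 5 folds

best_k = max(scores, key=scores.get)
print("best k:", best_k, "accuracy:", round(scores[best_k], 3))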
When do we use k-NN?
How does the k-NN algorithm work?
Euclidean distance to find nearest neighbors
Euclidean distance formula
Distance calculation using Euclidean distance
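
The formula itself (presumably shown on the original slide) is, for two n-dimensional points p and q:

d(p, q) = \sqrt{\sum_{i=1}^{n} (p_i - q_i)^2}

i.e., the square root of the summed squared differences across all features.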
k = 3
Majority of neighbors
Use case 2: apply the nearest neighbor algorithm from node B
K-NN solution
Recap of k-NN
Use case 3: k-NN to predict diabetes
Diabetes dataset
Diabetes prediction
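
The slides' worked solution is not reproduced in this extract; the following is a hedged reconstruction of the typical workflow, assuming a Pima-style diabetes.csv with numeric feature columns and a binary Outcome label (the file name and column name are assumptions):

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler

# Hypothetical file: Pima-style data with an "Outcome" column (0/1)
df = pd.read_csv("diabetes.csv")
X, y = df.drop(columns="Outcome"), df["Outcome"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Scale features: otherwise KNN distances are dominated by large-valued columns
scaler = StandardScaler().fit(X_train)
clf = KNeighborsClassifier(n_neighbors=5)
clf.fit(scaler.transform(X_train), y_train)
print(clf.score(scaler.transform(X_test), y_test))  # mean test accuracy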
K-NN classification recap
Use case 4: similarity
K-NN use case solution (contd.)
Rank these attributes
k = 1
k = 2
k = 3
End of Week 7 (Part 1)

KNN: K-Nearest Neighbor Classification
Solved numerical example on KNN classification
3 nearest values: N1, N2, N3
KNN solved example

End of Solved Use Cases of Week 7 (Part 1)
