W3M3-KNN Regression
Anubha Gupta, PhD.
Professor
SBILab, Dept. of ECE,
IIIT-Delhi, India
Contact: [email protected]; Lab: http://sbilab.iiitd.edu.in
Machine Learning in Hindi
Motivation
Parametric ML Models
• Assume a specific functional form for the relationship between the features and the target variable
• Estimate a fixed set of parameters that describe this functional form, typically using maximum likelihood estimation (MLE) or other optimization techniques
• Once the parameters are estimated, the model can make predictions on new data without having to reprocess the training data
• Examples: Simple Linear Regression, Logistic Regression, etc.
Non-Parametric ML Models
• Do not assume a specific functional form for the relationship between the features and the target variable
• Estimate the relationship between the features and the target variable directly from the training data
• The training data may be required each time a new prediction is to be made
• Examples: Decision Trees, k-Nearest Neighbors, etc.
Motivation
Linear Regression is a parametric ML model, while k-NN is a non-parametric model.
Non-parametric ML models
• are more flexible than parametric models
• can handle complex relationships between the features and the target variable, making them particularly useful for high-dimensional and non-linear datasets.
k-Nearest Neighbors (k-NN) is a popular non-parametric algorithm that can be used for both regression and classification.
Learning Objectives
• k-Nearest Neighbor (k-NN) Regression
o Description
o Advantages
o Limitations
Description
A supervised non-parametric regression algorithm used for predicting continuous values.
o Lazy learning algorithm: it defers the learning process until a new data sample needs to be predicted because it does not learn a model explicitly. Instead, it stores the training samples and uses them to make predictions on a new data sample.
o The value of k denotes the number of nearest neighbors that are considered while making a prediction.
Steps (a code sketch follows this list):
1. Calculate the distance between the new data sample to be predicted and every sample of the training dataset.
2. Select the k training samples with the smallest distances (the k nearest neighbors).
3. Predict the target value of the new sample as the average of the target values of these k neighbors.
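A minimal from-scratch sketch of these steps, assuming NumPy and Euclidean distance; the function name knn_regress and the array layout are illustrative, not from the slides:

import numpy as np

def knn_regress(X_train, y_train, x_new, k=3):
    """Predict a continuous target for x_new by averaging the targets
    of its k nearest training samples (illustrative sketch)."""
    X_train = np.asarray(X_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    x_new = np.asarray(x_new, dtype=float)

    # Step 1: distance from the new sample to every training sample
    distances = np.linalg.norm(X_train - x_new, axis=1)

    # Step 2: indices of the k nearest neighbors
    nearest = np.argsort(distances)[:k]

    # Step 3: prediction = mean of the neighbors' target values
    return y_train[nearest].mean()

# For the worked example later in this module (x = 7, k = 3):
print(knn_regress([[2], [5], [8], [11]], [4, 8, 12, 16], [7], k=3))  # 12.0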
Advantages
1. Simplicity: Does not require any assumptions about the underlying
distribution of the data or any complex mathematical calculations
2. Versatility: Can work with both continuous and categorical target variables
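As one illustration of this versatility, the sketch below assumes scikit-learn (not mentioned in the slides) and applies the same neighborhood idea to a continuous and to a categorical target:

from sklearn.neighbors import KNeighborsClassifier, KNeighborsRegressor

X = [[2], [5], [8], [11]]  # toy 1-D feature values from the example later on

# Continuous target: prediction is the mean of the k nearest targets
reg = KNeighborsRegressor(n_neighbors=3).fit(X, [4.0, 8.0, 12.0, 16.0])
print(reg.predict([[7]]))  # -> [12.]

# Categorical target: prediction is the majority class among the k nearest
clf = KNeighborsClassifier(n_neighbors=3).fit(X, ["low", "low", "high", "high"])
print(clf.predict([[7]]))  # -> ['high']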
Limitations
1. Selection of k: The value of k is an important hyperparameter that needs to be chosen carefully. A small value of k can lead to overfitting, while a large value of k can lead to underfitting (a cross-validation sketch follows this list).
2. Sensitivity to outliers and noise: k-NN relies on the distance between data samples to make predictions. Outliers and noise can distort the distance calculation and affect the accuracy of the predictions.
3. Curse of dimensionality: The distance between data samples becomes less meaningful as the number of features increases. This can lead to overfitting and poor performance.
4. Class imbalance: k-NN can be biased towards the majority class in imbalanced datasets because it tends to predict the class having the majority of samples in the training dataset. This can lead to poor performance for the minority class.
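For the selection-of-k limitation above, a common remedy is to tune k with cross-validation. The sketch below is an illustration only: it assumes scikit-learn and a synthetic toy dataset, neither of which appears in the slides; scaling the features also softens the distance-distortion issue.

import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic toy data standing in for a real regression dataset
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 1))
y = 2 * X[:, 0] + rng.normal(scale=1.0, size=100)

# Scale the features, then search over candidate values of k with 5-fold CV
model = make_pipeline(StandardScaler(), KNeighborsRegressor())
search = GridSearchCV(
    model,
    param_grid={"kneighborsregressor__n_neighbors": [1, 3, 5, 7, 9, 11]},
    cv=5,
    scoring="neg_mean_squared_error",
)
search.fit(X, y)
print(search.best_params_)  # the k with the lowest cross-validated MSE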
Let us Try!
Question: Given the following training data, use KNN regression to predict the target value for a new
data sample with x=7 and k=3.
Training Data
x (feature)    y (output/target)
2              4
5              8
8              12
11             16
You may pause the video and try.
Let us Try!
Answer: Let us calculate the distance of the test sample (𝑥𝑡 = 7) from each sample of the training dataset.
x (feature)    y (output/target)    Distance from the test sample
2              4                    5
5              8                    2
8              12                   1
11             16                   4
The 3 nearest neighbors of the test sample are the training samples at x = 8, 5, and 11 (distances 1, 2, and 4).
Prediction = (8 + 12 + 16) / 3 = 12
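The same answer can be verified numerically; a small sketch assuming NumPy, with illustrative variable names:

import numpy as np

x_train = np.array([2, 5, 8, 11])
y_train = np.array([4, 8, 12, 16])
x_test, k = 7, 3

distances = np.abs(x_train - x_test)   # [5, 2, 1, 4]
nearest = np.argsort(distances)[:k]    # samples at x = 8, 5, 11
print(y_train[nearest].mean())         # (12 + 8 + 16) / 3 = 12.0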
Summary
In this module, we learned:
• Parametric vs. non-parametric models
• Important considerations while building an ML model
• k-Nearest Neighbor (k-NN) Regression
o Definition
o Advantages
o Limitations
• Example