Assignment 2 Solution


3. K-Nearest Neighbor Classifier
3.1 Lazy Classifier
a. When a new training example becomes available, among SVM, Naive Bayes and KNN,
which classifier(s) have to be re-trained from scratch?
SVM has to be re-trained from scratch. For KNN, we just add the new example to the
training set, and it is immediately available for prediction; nothing else has to be done.
Naive Bayes can also be updated easily: only the counts of data points change, and the
probability estimates are adjusted accordingly. For SVM, however, the new example might
change the set of support vectors entirely, so the model has to be re-trained from scratch.
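As an illustration, here is a minimal sketch of the incremental Naive Bayes update (the class name, helper names, and categorical counting scheme are assumptions for illustration, not part of the original solution):

from collections import defaultdict

class IncrementalNaiveBayes:
    # Categorical Naive Bayes kept as raw counts; probabilities are
    # derived from the counts, so adding an example is just incrementing.
    def __init__(self):
        self.n = 0
        self.class_counts = defaultdict(int)    # N(y)
        self.feature_counts = defaultdict(int)  # N(feature j = value v, class y)

    def add_example(self, x, y):
        # No re-training: only the counts touched by the new example change.
        self.n += 1
        self.class_counts[y] += 1
        for j, v in enumerate(x):
            self.feature_counts[(j, v, y)] += 1

KNN is even simpler: "training" on a new example is a single append to the stored data.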
b. When a new test example becomes available, among SVM, Naive Bayes and KNN, which
classifier needs the most computation to infer the class label for this example, and what
is the time complexity of this inference, assuming that we have n training examples and
the number of features is significantly smaller than n?
KNN.
In KNN, we first need to calculate the distance from the new test example to each of the
n training examples. Since the number of features is negligible compared to n, it takes
O(1) time to calculate the distance from the test example to one training example, so
O(n) time is required to calculate the distances to all n training examples.
Selecting the k closest points then takes O(n log k), assuming a max-heap of size k is
used to keep track of the k smallest distances seen so far.
Selecting the label by majority vote over those k points takes O(k).
Hence the total complexity is O(n) + O(n log k) + O(k), which reduces to O(n) when k is
negligible compared to n.
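A minimal sketch of the O(n log k) selection step (heapq is Python's standard min-heap, so negated distances are stored to emulate a max-heap; the function name is illustrative):

import heapq

def k_closest(distances, k):
    # Keep a max-heap of size k over the distances seen so far. With negated
    # values, the heap root is the largest of the current k smallest distances,
    # and every push/replace costs O(log k), for O(n log k) overall.
    heap = []  # entries are (-distance, training index)
    for i, d in enumerate(distances):
        if len(heap) < k:
            heapq.heappush(heap, (-d, i))
        elif d < -heap[0][0]:
            heapq.heapreplace(heap, (-d, i))  # evict the current k-th closest
    return [i for _, i in heap]  # indices of the k closest training examples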
3.2 Implementation of KNN Classifier
a. Pseudocode
1. Download the 'mnist_train.csv' and 'mnist_test.csv' files from the site mentioned.
2. Load the first 6000 samples from the training set into X_train (samples) and y_train (labels).
3. Load the last 1000 samples from the test set into X_test (samples) and y_test (labels).
4. Calculate the Euclidean distance from each test sample to all the training samples and
store the distances in a matrix of dimension 1000 x 6000.
5. Predict labels for the test set from the distance matrix using the KNN classifier
algorithm with different values of k, and calculate the error for each k.
6. Plot the graph of error vs. the value of k. (An end-to-end sketch of these steps
follows below.)
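A minimal end-to-end sketch of steps 1-6 in Python, assuming NumPy, pandas and matplotlib, the two functions defined after the pseudocode below, and a CSV layout with the label in the first column (the file layout and the tested values of k are assumptions):

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# Steps 2-3: label assumed to be in column 0, pixel values in the rest.
train = pd.read_csv('mnist_train.csv', header=None).values
test = pd.read_csv('mnist_test.csv', header=None).values
X_train, y_train = train[:6000, 1:].astype(float), train[:6000, 0]
X_test, y_test = test[-1000:, 1:].astype(float), test[-1000:, 0]

# Step 4: 1000 x 6000 Euclidean distance matrix.
distance_matrix = calculate_distance_matrix(X_test, X_train)

# Steps 5-6: error for each k, then plot.
ks = [1, 3, 5, 7, 9, 11]
errors = [predict(distance_matrix, y_train, y_test, k) for k in ks]
plt.plot(ks, errors, marker='o')
plt.xlabel('k')
plt.ylabel('error')
plt.show()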
Function calculate_distance_matrix
For i = 0 to NumberOfTestSamples - 1
    difference = X_test[i] - X_train        (broadcast over all training samples)
    squared = difference^2                  (element-wise)
    summed = sum of squared over the feature axis j
    squareRooted = sqrt(summed)
    distance_matrix[i] = squareRooted
return distance_matrix
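A runnable version of this function as a NumPy sketch (the loop mirrors the pseudocode row by row):

import numpy as np

def calculate_distance_matrix(X_test, X_train):
    # Returns a (num_test x num_train) matrix of Euclidean distances.
    distance_matrix = np.zeros((X_test.shape[0], X_train.shape[0]))
    for i in range(X_test.shape[0]):
        difference = X_test[i] - X_train   # broadcast: (num_train, num_features)
        squared = difference ** 2
        summed = squared.sum(axis=1)       # sum over the feature axis
        distance_matrix[i] = np.sqrt(summed)
    return distance_matrix

The same matrix can also be computed without the loop from the expansion ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b, at the cost of more memory.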

Function predict
For i = 0 to NumberOfTestSamples - 1
    distance_from_i = distance_matrix[i]
    sort distance_from_i, keeping track of the original training indices
    select the k closest points from distance_from_i
    obtain the classes of those k points from y_train
    y_pred[i] = majority label among those k classes
accuracy = (# of test samples with y_pred == y_test) / (# of test samples)
error = 1 - accuracy
return error
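A runnable sketch of this function (np.argsort stands in for "sort while keeping track of indices", and collections.Counter performs the majority vote; a full sort is O(n log n), slightly more than the heap-based O(n log k) bound discussed in 3.1b):

import numpy as np
from collections import Counter

def predict(distance_matrix, y_train, y_test, k):
    # Returns the test error of KNN for a given k, using a precomputed distance matrix.
    y_pred = np.empty(len(y_test), dtype=y_train.dtype)
    for i in range(distance_matrix.shape[0]):
        nearest = np.argsort(distance_matrix[i])[:k]      # k closest training indices
        labels = y_train[nearest]                         # their classes
        y_pred[i] = Counter(labels).most_common(1)[0][0]  # majority vote
    accuracy = np.mean(y_pred == y_test)
    return 1 - accuracy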
b. Curve of Error vs. Value of k
