Final Assignment

This document contains a multi-part assignment involving KNN classification, decision tree classification, and clustering on a small dataset with variables X1, X2, and a binary outcome Y. For KNN, the student is asked to make predictions for different values of k and distance metrics. For decision trees, the student computes a split's Gini index and argues for different split points. Finally, the student is tasked with clustering the records based only on X1 and X2.

Uploaded by

hyperloke
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views7 pages

Final Assignment

This document contains a multi-part assignment involving KNN classification, decision tree classification, and clustering on a small dataset with variables X1, X2, and a binary outcome Y. For KNN, the student is asked to make predictions for different values of k and distance metrics. For decision trees, the student computes a split's Gini index and argues for different split points. Finally, the student is tasked with clustering the records based only on X1 and X2.

Uploaded by

hyperloke
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Final Assignment (60 points)

Problem 1 (15 points)

Here is a short dataset consisting of a binary outcome variable Y and two independent variables, X1 and X2. The independent variables are already normalized, so no further normalization is needed.

X1 X2 Y
3 5 1
1 4 0
3 2 0
2 2 1
4 1 1

a) Suppose you are asked to predict the outcome for (X1, X2) = (4, 4). Use KNN with k = 3 to
predict this outcome. You can use Euclidean distance as the distance measure.
b) Predict the outcome with k = 5.

c) Use k = 3 with Manhattan distance and re-evaluate the prediction. Should the prediction
with k = 5 change? Why or why not? Is there another name for the prediction made with
k = 5?
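The KNN steps above (compute distances, take the k nearest, majority-vote) can be sanity-checked with a short Python sketch. The function and variable names here are illustrative, not part of the assignment:

```python
from collections import Counter

# Dataset from Problem 1: (X1, X2, Y)
data = [(3, 5, 1), (1, 4, 0), (3, 2, 0), (2, 2, 1), (4, 1, 1)]
query = (4, 4)

def euclidean(p, q):
    return ((p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2) ** 0.5

def manhattan(p, q):
    return abs(p[0] - q[0]) + abs(p[1] - q[1])

def knn_predict(query, data, k, dist):
    # Sort records by distance to the query; ties keep dataset order.
    neighbors = sorted(data, key=lambda r: dist((r[0], r[1]), query))[:k]
    votes = Counter(r[2] for r in neighbors)
    return votes.most_common(1)[0][0]

print(knn_predict(query, data, 3, euclidean))  # part (a): predicts 1
print(knn_predict(query, data, 5, euclidean))  # part (b): all five records vote
print(knn_predict(query, data, 3, manhattan))  # part (c): watch for distance ties
```

Note that with Manhattan distance several records are tied at distance 3 from the query, so the k = 3 answer depends on how you break ties; state your tie-breaking rule in your write-up.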
Problem 2 (30 points)

Consider the same dataset as in Problem 1.

X1 X2 Y
3 5 1
1 4 0
3 2 0
2 2 1
4 1 1

a) We would now like to fit a classification tree to the above dataset. Consider the split X1 =
2.5. Compute the weighted Gini index of this split.
b) Suppose we change the split to X1 = 3.5. Provide an argument for why this split is better or
worse.
c) Based on your judgment, introduce a split on X2 in addition to the previous split on X1
(either 2.5 or 3.5). Show that this split improves fit and draw the corresponding decision tree.
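The weighted Gini calculation in parts (a) and (b) can be checked with a small sketch: compute each child node's impurity 1 − Σ p², then weight by the fraction of records in each node (helper names are illustrative):

```python
from collections import Counter

# Dataset from Problem 2: (X1, X2, Y)
data = [(3, 5, 1), (1, 4, 0), (3, 2, 0), (2, 2, 1), (4, 1, 1)]

def gini(labels):
    # Gini impurity: 1 - sum of squared class proportions.
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def weighted_gini(data, threshold):
    # Split on X1 at the given threshold; weight each child's
    # impurity by its share of the records.
    left = [y for x1, x2, y in data if x1 <= threshold]
    right = [y for x1, x2, y in data if x1 > threshold]
    n = len(data)
    return (len(left) / n) * gini(left) + (len(right) / n) * gini(right)

print(weighted_gini(data, 2.5))  # 7/15 ≈ 0.467
print(weighted_gini(data, 3.5))  # 0.4, the lower (purer) of the two splits
```

A lower weighted Gini index means purer child nodes, which is the basis for the comparison asked for in part (b).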
Problem 3 (15 points)

For the same dataset, ignore the Y variable and simply consider the X variables:

Record # X1 X2
R1 3 5
R2 1 4
R3 3 2
R4 2 2
R5 4 1
We are now interested in a clustering exercise.

a) Fill in a distance matrix giving the distance from each record to every other record.
Use either Euclidean or Manhattan distance, whichever is more convenient.

R1 R2 R3 R4 R5
R1
R2
R3
R4
R5
b) Construct two clusters based on the distance matrix above. Can you improve these clusters?
How would you measure this improvement?
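The distance matrix in part (a), and one possible way to score a clustering for part (b), can be sketched as follows. The choice of Manhattan distance and the example cluster assignment are illustrative, not the required answer:

```python
# Records from Problem 3: (X1, X2)
points = {"R1": (3, 5), "R2": (1, 4), "R3": (3, 2), "R4": (2, 2), "R5": (4, 1)}

def manhattan(p, q):
    return abs(p[0] - q[0]) + abs(p[1] - q[1])

# Pairwise Manhattan distance matrix (symmetric, zero diagonal).
names = sorted(points)
matrix = {a: {b: manhattan(points[a], points[b]) for b in names} for a in names}
for a in names:
    print(a, [matrix[a][b] for b in names])

def within_cluster_distance(cluster):
    # Sum of pairwise distances inside a cluster: one simple way
    # to measure cluster tightness (smaller is better).
    members = list(cluster)
    return sum(manhattan(points[members[i]], points[members[j]])
               for i in range(len(members)) for j in range(i + 1, len(members)))

# One plausible 2-cluster split suggested by the matrix (illustrative):
clusters = [{"R1", "R2"}, {"R3", "R4", "R5"}]
print(sum(within_cluster_distance(c) for c in clusters))
```

Comparing the total within-cluster distance before and after moving a record between clusters is one concrete way to demonstrate the "improvement" part (b) asks about.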