0% found this document useful (0 votes)

5 views6 pages

Dda3020 2024F HW1

The DDA3020 Homework 1 is due on October 14, 2024, and constitutes 20% of the final grade. It includes written problems on linear regression and support vector machines, as well as programming tasks involving linear regression and SVM using the iris dataset. Submissions must be made electronically via Blackboard, and late submissions will incur score penalties.

Uploaded by

1620040208

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views6 pages

Dda3020 2024F HW1

Uploaded by

1620040208

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

DDA3020 Homework 1

Due date: Oct 14, 2024

Instructions

• The deadline is 23:59, Oct 14, 2024.

• The weight of this assignment in the final grade is 20%.

• Electronic submission: Turn in solutions electronically via Blackboard. Be sure to submit

your homework as a single file. Please name your solution file as DDA3020HW 1 studentID name

• Note that late submissions will result in discounted scores: 0-24 hours → 80%, 24-120 hours
→ 50%, 120 or more hours → 0%.

• Answer the questions in English. Otherwise, you’ll lose half of the points.

• Collaboration policy: You need to solve all questions independently and collaboration between
students is NOT allowed.

1 Written Problems (50 points)

1.1. (Learning of Linear Regression, 25 points) Suppose we have training data:

{(x1 , y1 ), (x2 , y2 ), . . . , (xN , yN )},

where xi ∈ Rd and yi ∈ Rk , i = 1, 2, . . . , N .

i) (9 pts) Find the closed-form solution of the following problem.

N
X
min ∥yi − W xi − b∥22 ,
W ,b
i=1

ii) (8 pts) Show how to use gradient descent to solve the problem. (Please state at least one
possible Stopping Criterion)

iii) (8 pts) We further suppose that x1 , x2 , . . . , xN are drawn from N (µ, σ 2 ). Show that the
1 PN
maximum likelihood estimation (MLE) of σ 2 is σ̂M 2
LE = N
2
n=1 (xn − µM LE ) .

1
DDA3020 Machine Learning Autumn 2024, CUHKSZ

1.2. (Support Vector Machine, 25 points) Given two positive samples x1 = (3, 3)T , x2 =
(4, 3)T , and one negative sample x3 = (1, 1)T , find the maximum-margin separating hyperplane and
support vectors.
Solution steps:

i) Formulating the Optimization Problem (5 pts)

ii) Constructing the Lagrangian (5 pts)

iii) Using KKT Conditions (5 pts)

iv) Solving the Equations (5 pts)

v) Determining the Hyperplane Equation and Support Vectors (5 pts)

2 Programming (50 points)

2.1. (Linear regression, 25 points) We have a labeled dataset D = {(x1 , y1 ), (x2 , y2 ),

· · · , (xn , yn )}, with xi ∈ Rd being the d-dimensional feature vector of the i-th sample, and yi ∈ R
being real valued target (label).

A linear regression model is give by

fw0 ,...,wd (x) = w0 + w1 x1 + w2 x2 + · · · + wd xd , (1)

where w0 is often called bias and w1 , w2 , . . . , wd are often called coefficients.

Now, we want to utilize the dataset D to build a linear model based on linear regression.
We provide a training set Dtrain that includes 2024 labeled samples with 11 features (See lin-
ear regression train.txt) to fit model, and a test set Dtest that includes 10 unlabeled samples with
11 features (see linear regression test.txt) to estimate model.

1. Using the LinearRegression class from Sklearn package to get the bias w0 and the coefficients
w1 , w2 , . . . , w11 , then computing the ŷ = f (x) of test set Dtest by the model trained well. (Put
the estimation of w0 , w1 , . . . , w11 and these ŷ in your answers.)

2. Implementing the linear regression by yourself to obtain the bias w0 and the coefficients
w1 , w2 , . . . , w11 , then computing the ŷ = f (x) of test set Dtest . (Put the estimation of
w0 , w1 , . . . , w11 and these ŷ in your answers. It is allowed to compute the inverse of a matrix
using the existing python package.)

(Hint: Note that for linear regression train.txt, there are 2024 rows with 12 columns where the
first 11 columns are features x and the last column is target y and linear regression test.txt
only contains 10 rows with 11 columns (features). Both of two tasks require the submission of

2
DDA3020 Machine Learning Autumn 2024, CUHKSZ

code and results. Put all the code in a “HW1 yourID Q1.ipynb” Jupyter notebook. file.(”.py”
file is also acceptable))

2.2. (SVM, 25 points)

Task Description You are asked to write a program that constructs support vector machine
models with different kernel functions and slack variables.

Datasets You are provided with the iris dataset. The data set contains 3 classes of 50 instances
each, where each class refers to a type of iris plant. There are four features: 1. sepal length in cm;
2. sepal width in cm; 3. petal length in cm; 4. petal width in cm. You need to use these features
to classify each iris plant as one of the three possible types.

What you should do You should use the SVM function from python sklearn package, which
provides various forms of SVM functions. For multiclass SVM you should use the one vs rest
strategy. You are recommended to use sklearn.svm.svc() function. You can use numpy for vector
manipulation. For technical report, you should report the results required as mentioned below (e.g.
training error, testing error, and so on).

1. (2 points) Split training set and test set. Split the data into a training set and a test set.
The training set should contain 70% of the samples, while the test set should include 30%.
The number of samples from each category in both the training and test sets should reflect
this 70-30 split; for each category, the first 70% of the samples will form the training set, and
the remaining 30% will form the test set. Ensure that the split maintains the original order
of the data. You should report instance ids in the split training set and test set. The output
format is as follows:

Q2.2.1 Split training set and test set:

Training set: xx

Test set: xx

You should fill up xx in the template. You should write ids for each set in the same line with
comma separated, e.g. Training set:[1, 4, 19].

2. (10 points) Calculation using Standard SVM Model (Linear Kernel). Employ the
standard SVM model with a linear kernel. Train your SVM on the split training dataset and
validate it on the testing dataset. Calculate the classification error for both the training and
testing datasets, output the weight vector w, the bias b, and the indices of support vectors

3
DDA3020 Machine Learning Autumn 2024, CUHKSZ

(start with 0). Note that the scikit-learn package does not offer a function with hard margin,
so we will simulate this using C = 1e5. You should first print out the total training error
wrong prediction
and testing error, where the error is number of data . Then, print out the results for each class
separately (note that you should calculate errors for each class separately in this part). You
should also mention in your report which classes are linear separable with SVM without slack.
The output format is as follows:

Q2.2.2 Calculation using Standard SVM Model:

total training error: xx, total testing error: xx,

class setosa:
training error: xx, testing error: xx,
w: xx, b: xx,
support vector indices: xx,

class versicolor:
training error: xx, testing error: xx,
w: xx, b: xx,
support vector indices: xx,

class virginica:
training error: xx, testing error: xx,
w: xx, b: xx,
support vector indices: xx,

Linear separable classes: xx

If we view the one vs all strategy as combining the multiple different SVM, each one being
a separating hyperplane for one class and the rest of the points, then the w, b and support
vector indices for that class is the corresponding parameters for the SVM
 separating this class
1
 
and the rest of the points. If a variable is of vector form, say a = 
2, then you should write

3
each entry in the same line with comma separated e.g. [1,2,3].

3. (6 points) Calculation using SVM with Slack Variables (Linear Kernel). For each
C = 0.25 × t, where t = 1, 2, . . . , 4, train your SVM on the training dataset, and subsequently
validate it on the testing dataset. Calculate the classification error for both the training and
testing datasets, the weight vector w, the bias b, and the indices of support vectors, and the
slack variable ζ of support vectors (you may compute it as max(0, 1 − y · f (X)). The output
format is as follows:

Q2.2.3 Calculation using SVM with Slack Variables (C = 0.25 × t, where t = 1, . . . , 4):
-------------------------------------------
C=0.25,

4
DDA3020 Machine Learning Autumn 2024, CUHKSZ

total training error: xx, total testing error: xx,

class setosa:
training error: xx, testing error: xx,
w: xx, b: xx,
support vector indices: xx,
slack variable: xx,

class versicolor:
training error: xx, testing error: xx,
w: xx, b: xx,
support vector indices: xx,
slack variable: xx,

class virginica:

training error: xx, testing error: xx,

w: xx, b: xx,

support vector indices: xx,

slack variable: xx,

-------------------------------------------

C=0.5,

<... results for (C=0.5) ...>

-------------------------------------------

C=0.75,

<... results for (C=0.75) ...>

-------------------------------------------

C=1,

<... results for (C=1) ...>

4. (7 points) Calculation using SVM with Kernel Functions. Conduct experiments with
different kernel functions for SVM without slack variable. Calculate the classification error
for both the training and testing datasets, and the indices of support vectors for each kernel
type:

(a) 2nd-order Polynomial Kernel

(b) 3nd-order Polynomial Kernel
(c) Radial Basis Function Kernel with σ = 1
(d) Sigmoidal Kernel with σ = 1

The output format is as follows:

Q2.2.4 Calculation using SVM with Kernel Functions:

-------------------------------------------

5
DDA3020 Machine Learning Autumn 2024, CUHKSZ

(a) 2nd-order Polynomial Kernel,

total training error: xx, total testing error: xx,

class setosa:
training error: xx, testing error: xx,
w: xx, b: xx,
support vector indices: xx,

class versicolor:
training error: xx, testing error: xx,
w: xx, b: xx,
support vector indices: xx,

class virginica:

training error: xx, testing error: xx,

w: xx, b: xx,

support vector indices: xx,

-------------------------------------------

(b) 3nd-order Polynomial Kernel,

<... results for (b) ...>

-------------------------------------------

(c) Radial Basis Function Kernel with σ = 1,

<... results for (c) ...>

-------------------------------------------

(d) Sigmoidal Kernel with σ = 1,

<... results for (d) ...>

Submission Submit your executable code in a “HW1 yourID Q2.ipynb” Jupyter notebook(”.py”
file is also acceptable). Indicate the corresponding question number in the comment for each cell,
and ensure that your code can logically produce the required results for each question in the required
format. Please note that you need to write clear comments and use appropriate function/variable
names. Excessively unreadable code may result in point deductions.

2nd Exam Question Paper 2
No ratings yet
2nd Exam Question Paper 2
16 pages
Luciano M Barone, Enzo Marinari, Giovanni Organtini, Federico Ricci Tersenghi-Scientific Programming - C-Language, Algorithms and Models in Science-World Scientific Publishing Company (2013)
No ratings yet
Luciano M Barone, Enzo Marinari, Giovanni Organtini, Federico Ricci Tersenghi-Scientific Programming - C-Language, Algorithms and Models in Science-World Scientific Publishing Company (2013)
718 pages
Prolog - Unification - Backtracking - Recursion - Lists - Cut
No ratings yet
Prolog - Unification - Backtracking - Recursion - Lists - Cut
78 pages
Fourier 4
No ratings yet
Fourier 4
73 pages
Static Indeterminacy PDF
No ratings yet
Static Indeterminacy PDF
5 pages
Tính Toán Phân Tán
No ratings yet
Tính Toán Phân Tán
79 pages
First Order Open Loop System: Che 529 Process Dynamics and Control
No ratings yet
First Order Open Loop System: Che 529 Process Dynamics and Control
5 pages
DATA STRUCTURE Update 2
No ratings yet
DATA STRUCTURE Update 2
118 pages
IR - Lecture 2
No ratings yet
IR - Lecture 2
35 pages
Power BI Interview Questions
No ratings yet
Power BI Interview Questions
5 pages
Grammar and Language: Grammar: It Is System That Specifies
No ratings yet
Grammar and Language: Grammar: It Is System That Specifies
40 pages
Deadlock Detection and Its Algorithm
No ratings yet
Deadlock Detection and Its Algorithm
9 pages
Dissertacao Mest XuYang
No ratings yet
Dissertacao Mest XuYang
67 pages
Machine Learning: Engr. Ejaz Ahmad
No ratings yet
Machine Learning: Engr. Ejaz Ahmad
54 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
12 pages
IEEE2023 Data Secure De-Duplication and Recovery Based On Public Key Encryption With Keyword Search
No ratings yet
IEEE2023 Data Secure De-Duplication and Recovery Based On Public Key Encryption With Keyword Search
11 pages
Artificial Neural Network - Genetic Algorithm - Tutorialspoint
No ratings yet
Artificial Neural Network - Genetic Algorithm - Tutorialspoint
2 pages
Posts Theorem PDF
No ratings yet
Posts Theorem PDF
10 pages
OS PPT 3-4
No ratings yet
OS PPT 3-4
18 pages
Algorithm-Lab Updated
No ratings yet
Algorithm-Lab Updated
125 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
11 pages
L 11 Circle Drawing Algorithims 2
No ratings yet
L 11 Circle Drawing Algorithims 2
6 pages
Assignment 2 Specification
No ratings yet
Assignment 2 Specification
3 pages
Numerical Methods Test
No ratings yet
Numerical Methods Test
1 page
Machine Learning LAB: Practical-1
100% (2)
Machine Learning LAB: Practical-1
24 pages
Assignment 4
No ratings yet
Assignment 4
3 pages
Find The Optimal Solution To The Linear Programming Model With He Integer Restrictions Relaxed
No ratings yet
Find The Optimal Solution To The Linear Programming Model With He Integer Restrictions Relaxed
10 pages
AI Lec 1 Introduction, Foundation, History and State of The Art
No ratings yet
AI Lec 1 Introduction, Foundation, History and State of The Art
7 pages
Assignment II Machine Learning
No ratings yet
Assignment II Machine Learning
8 pages
Fundamentals of Machine Learning Support Vector Machines, Practical Session
No ratings yet
Fundamentals of Machine Learning Support Vector Machines, Practical Session
4 pages
Assignment 3
No ratings yet
Assignment 3
2 pages
A1388404476 - 64039 - 23 - 2023 - Machine Learning II
No ratings yet
A1388404476 - 64039 - 23 - 2023 - Machine Learning II
10 pages
Revision F3
No ratings yet
Revision F3
2 pages
Lower-Upper Symmetric-Gauss-Seidel Method For The Euler and Navier-Stokes Equations
No ratings yet
Lower-Upper Symmetric-Gauss-Seidel Method For The Euler and Navier-Stokes Equations
2 pages
HW 3
No ratings yet
HW 3
5 pages
Linear Algebra
No ratings yet
Linear Algebra
20 pages
B24 ML Exp-3
No ratings yet
B24 ML Exp-3
10 pages
Ma 3 H0
No ratings yet
Ma 3 H0
2 pages
Syllabus Template MSc-IMCA
No ratings yet
Syllabus Template MSc-IMCA
4 pages
CIS 419/519 Introduction To Machine Learning Assignment 2: Instructions
No ratings yet
CIS 419/519 Introduction To Machine Learning Assignment 2: Instructions
12 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
10 pages
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
No ratings yet
Exam in Statistical Machine Learning Statistisk Maskininlärning (1RT700)
12 pages
Machine Learnin
100% (2)
Machine Learnin
23 pages
Matlab Homework Experts 2
No ratings yet
Matlab Homework Experts 2
10 pages
TD2345
No ratings yet
TD2345
3 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
No ratings yet
1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
5 pages
Machine Learning With SQL
100% (1)
Machine Learning With SQL
12 pages
Ass 1
No ratings yet
Ass 1
3 pages
ML PG Assignment 3
No ratings yet
ML PG Assignment 3
3 pages
Epfl Machine Learning Final Exam 2021 Solutions
No ratings yet
Epfl Machine Learning Final Exam 2021 Solutions
21 pages
178 hw3
No ratings yet
178 hw3
3 pages
C2 W3 Assignment
No ratings yet
C2 W3 Assignment
437 pages
07au Midterm
No ratings yet
07au Midterm
17 pages
Lab 07
No ratings yet
Lab 07
2 pages
CS4100 CS5100 CW1 20241001
No ratings yet
CS4100 CS5100 CW1 20241001
10 pages
SVM Implementation
No ratings yet
SVM Implementation
8 pages
CMU 2018s NinaBALCAN HW3
No ratings yet
CMU 2018s NinaBALCAN HW3
7 pages
OEE Templet
No ratings yet
OEE Templet
2 pages
Computing Key Stage 4 Lesson COMy11u1L4
No ratings yet
Computing Key Stage 4 Lesson COMy11u1L4
9 pages
Machine Learning Hands-On
100% (1)
Machine Learning Hands-On
18 pages
Midterm Practice Questions
No ratings yet
Midterm Practice Questions
14 pages
Data Mining Practicals
No ratings yet
Data Mining Practicals
22 pages
IML-IITKGP - Assignment 5 Solution
No ratings yet
IML-IITKGP - Assignment 5 Solution
7 pages
Exercise - 3: DS203-2024-S1 Roll Number: 23B2215
No ratings yet
Exercise - 3: DS203-2024-S1 Roll Number: 23B2215
25 pages
Module 1
No ratings yet
Module 1
50 pages
C2W3 Lab 01 Model Evaluation and Selection
No ratings yet
C2W3 Lab 01 Model Evaluation and Selection
21 pages
1st Exam Question Paper 2
No ratings yet
1st Exam Question Paper 2
16 pages
CS6301 Homework2 KR
No ratings yet
CS6301 Homework2 KR
13 pages
Wa0030.
No ratings yet
Wa0030.
36 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
ML 8,9,10
No ratings yet
ML 8,9,10
3 pages
Stream Cipher
No ratings yet
Stream Cipher
21 pages
C2W3 Lab 01 Model Evaluation and Selection
No ratings yet
C2W3 Lab 01 Model Evaluation and Selection
21 pages
hw3 Red
No ratings yet
hw3 Red
4 pages
Assignment 1
No ratings yet
Assignment 1
6 pages
Lokesh T00691325
No ratings yet
Lokesh T00691325
5 pages
Midterm2008f Sol
No ratings yet
Midterm2008f Sol
12 pages
ML Cyber Lab
No ratings yet
ML Cyber Lab
16 pages
AML Assignment 1
No ratings yet
AML Assignment 1
3 pages
Message
No ratings yet
Message
2 pages
Assgmt 1
No ratings yet
Assgmt 1
7 pages
Btech1007022 Lab5.1
No ratings yet
Btech1007022 Lab5.1
9 pages
HW 3
No ratings yet
HW 3
7 pages
Shobit Sharma (2124399) ML Lab File PDF
No ratings yet
Shobit Sharma (2124399) ML Lab File PDF
19 pages
S&UL Subjective Question Bank
No ratings yet
S&UL Subjective Question Bank
7 pages
Sheet1 1
No ratings yet
Sheet1 1
2 pages
CSE455/CSE552 Machine Learning (Spring 2024) Homework #1: Hand-In Policy Collaboration Policy Grading
No ratings yet
CSE455/CSE552 Machine Learning (Spring 2024) Homework #1: Hand-In Policy Collaboration Policy Grading
2 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
A Brief Introduction to MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
2.5/5 (2)

Dda3020 2024F HW1

Uploaded by

Dda3020 2024F HW1

Uploaded by

DDA3020 Homework 1

Due date: Oct 14, 2024

• The deadline is 23:59, Oct 14, 2024.

• The weight of this assignment in the final grade is 20%.

• Electronic submission: Turn in solutions electronically via Blackboard. Be sure to submit

1 Written Problems (50 points)

1.1. (Learning of Linear Regression, 25 points) Suppose we have training data:

{(x1 , y1 ), (x2 , y2 ), . . . , (xN , yN )},

i) (9 pts) Find the closed-form solution of the following problem.

i) Formulating the Optimization Problem (5 pts)

ii) Constructing the Lagrangian (5 pts)

iii) Using KKT Conditions (5 pts)

iv) Solving the Equations (5 pts)

v) Determining the Hyperplane Equation and Support Vectors (5 pts)

2 Programming (50 points)

2.1. (Linear regression, 25 points) We have a labeled dataset D = {(x1 , y1 ), (x2 , y2 ),

A linear regression model is give by

fw0 ,...,wd (x) = w0 + w1 x1 + w2 x2 + · · · + wd xd , (1)

where w0 is often called bias and w1 , w2 , . . . , wd are often called coefficients.

2.2. (SVM, 25 points)

Q2.2.1 Split training set and test set:

Q2.2.2 Calculation using Standard SVM Model:

Linear separable classes: xx

total training error: xx, total testing error: xx,

training error: xx, testing error: xx,

support vector indices: xx,

slack variable: xx,

<... results for (C=0.5) ...>

<... results for (C=0.75) ...>

<... results for (C=1) ...>

(a) 2nd-order Polynomial Kernel

The output format is as follows:

Q2.2.4 Calculation using SVM with Kernel Functions:

(a) 2nd-order Polynomial Kernel,

training error: xx, testing error: xx,

support vector indices: xx,

(b) 3nd-order Polynomial Kernel,

<... results for (b) ...>

(c) Radial Basis Function Kernel with σ = 1,

<... results for (c) ...>

(d) Sigmoidal Kernel with σ = 1,

<... results for (d) ...>

You might also like