COMP3020 Machine Learning Fall 2024

Homework 2
Tran Anh Vu - V202100569
October 21, 2024

1 Perceptron
1.1 Exercise 1a
- Based on the diagram of the data distribution, the dataset is linearly separable, so a Perceptron can be trained to classify it perfectly.
- Initially, w_0 = [0, −1] and b_0 = 1/2. We iterate through the samples in order.
- Iteration 1:
• x_1 = [0, 0] ⇒ a = w_0^T x_1 + b_0 = 1/2. Since a · y < 0, the prediction is incorrect, so we update w_1 = w_0 + y · x_1 = [0, −1] and b_1 = b_0 + y = −0.5.
• x_2 = [0, 1] ⇒ a = w_1^T x_2 + b_1 = −3/2. Since a · y < 0, the prediction is incorrect, so we update w_2 = w_1 + y · x_2 = [0, 0] and b_2 = b_1 + y = 0.5.
• x_3 = [1, 0] ⇒ a = w_2^T x_3 + b_2 = 1/2. Since a · y > 0, the prediction is correct.
• x_4 = [1, 1] ⇒ a = w_2^T x_4 + b_2 = 1/2. Since a · y > 0, the prediction is correct.
- Iteration 2:
• x_1 = [0, 0] ⇒ a = w_2^T x_1 + b_2 = 1/2. Since a · y < 0, the prediction is incorrect, so we update w_3 = w_2 + y · x_1 = [0, 0] and b_3 = b_2 + y = −0.5.
• x_2 = [0, 1] ⇒ a = w_3^T x_2 + b_3 = −1/2. Since a · y < 0, the prediction is incorrect, so we update w_4 = w_3 + y · x_2 = [0, 1] and b_4 = b_3 + y = 0.5.
• x_3 = [1, 0] ⇒ a = w_4^T x_3 + b_4 = 1/2. Since a · y > 0, the prediction is correct.
• x_4 = [1, 1] ⇒ a = w_4^T x_4 + b_4 = 3/2. Since a · y > 0, the prediction is correct.
- Iteration 3:
• x_1 = [0, 0] ⇒ a = w_4^T x_1 + b_4 = 1/2. Since a · y < 0, the prediction is incorrect, so we update w_5 = w_4 + y · x_1 = [0, 1] and b_5 = b_4 + y = −0.5.
• x_2 = [0, 1] ⇒ a = w_5^T x_2 + b_5 = 1/2. Since a · y > 0, the prediction is correct.
• x_3 = [1, 0] ⇒ a = w_5^T x_3 + b_5 = −1/2. Since a · y < 0, the prediction is incorrect, so we update w_6 = w_5 + y · x_3 = [1, 1] and b_6 = b_5 + y = 0.5.
• x_4 = [1, 1] ⇒ a = w_6^T x_4 + b_6 = 5/2. Since a · y > 0, the prediction is correct.
- Iteration 4:
• x_1 = [0, 0] ⇒ a = w_6^T x_1 + b_6 = 1/2. Since a · y < 0, the prediction is incorrect, so we update w_7 = w_6 + y · x_1 = [1, 1] and b_7 = b_6 + y = −0.5.
• x_2 = [0, 1] ⇒ a = w_7^T x_2 + b_7 = 1/2. Since a · y > 0, the prediction is correct.
• x_3 = [1, 0] ⇒ a = w_7^T x_3 + b_7 = 1/2. Since a · y > 0, the prediction is correct.
• x_4 = [1, 1] ⇒ a = w_7^T x_4 + b_7 = 3/2. Since a · y > 0, the prediction is correct.
In a fifth pass, no sample is misclassified (in particular, x_1 now gives a = w_7^T x_1 + b_7 = −1/2 with y = −1, so a · y > 0), so training stops. Therefore, the perfect classifier for the dataset is the Perceptron with w* = [1, 1] and b* = −1/2.
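To double-check the trace, here is a minimal Python/NumPy sketch of the same update rule. The labels y_1 = −1 and y_2 = y_3 = y_4 = +1 are inferred from the sign checks above rather than stated explicitly, so treat them as an assumption.

import numpy as np

# Minimal perceptron sketch reproducing the trace above.
# Assumption: labels inferred from the sign checks are y = [-1, +1, +1, +1].
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([-1.0, 1.0, 1.0, 1.0])

w = np.array([0.0, -1.0])  # w_0
b = 0.5                    # b_0

for epoch in range(10):
    mistakes = 0
    for x_i, y_i in zip(X, y):
        a = w @ x_i + b
        if a * y_i < 0:        # incorrect prediction
            w = w + y_i * x_i  # perceptron update
            b = b + y_i
            mistakes += 1
    if mistakes == 0:          # a full error-free pass: converged
        break

print(w, b)  # [1. 1.] -0.5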

1.2 Exercise 1b
Assume that there exists a Perceptron that perfectly classifies the dataset, with parameters w* = [w_1, w_2] and b* = b. These parameters would have to satisfy the following system of inequalities:

b < 0                      (1)
w_1 + b ≥ 0                (2)
w_2 + b ≥ 0                (3)
w_1 + w_2 + b < 0          (4)

Adding (1) and (4) gives w_1 + w_2 + 2b < 0, while adding (2) and (3) gives w_1 + w_2 + 2b ≥ 0, which is a contradiction. Therefore, no Perceptron can perfectly classify this dataset.
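As an empirical illustration (not a substitute for the proof), the same update rule run on this dataset never completes an error-free pass. A minimal sketch, assuming the labels implied by inequalities (1)–(4): y = [−1, +1, +1, −1] for x = [0,0], [0,1], [1,0], [1,1].

import numpy as np

# Empirical illustration: the perceptron never converges on this dataset.
# Assumption: labels inferred from inequalities (1)-(4) are y = [-1, +1, +1, -1].
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([-1.0, 1.0, 1.0, -1.0])

w = np.zeros(2)
b = 0.0
for epoch in range(1000):
    mistakes = 0
    for x_i, y_i in zip(X, y):
        a = w @ x_i + b
        pred = 1.0 if a >= 0 else -1.0   # same sign convention as (1)-(4)
        if pred != y_i:
            w += y_i * x_i
            b += y_i
            mistakes += 1
    if mistakes == 0:
        break

print(mistakes)  # stays >= 1 on every pass: no convergence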

2 Linear Regression
2.1 Exercise 2a
The loss function is given as:
L(w) = ∥Xw − y∥^2 = (Xw − y)^T (Xw − y) = w^T X^T X w − 2 y^T X w + y^T y

Now, take the gradient of this with respect to w:

∂L(w)/∂w = 2 X^T (Xw − y)
This is the derivative of the loss function with respect to w.
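As a quick sanity check (not part of the derivation), the analytic gradient can be compared against a finite-difference approximation; the data, shapes, and seed below are arbitrary and purely illustrative.

import numpy as np

# Finite-difference check of the gradient 2 X^T (Xw - y).
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 3))
y = rng.normal(size=20)
w = rng.normal(size=3)

loss = lambda v: np.sum((X @ v - y) ** 2)         # L(w) = ||Xw - y||^2
analytic = 2 * X.T @ (X @ w - y)                  # derived gradient
numeric = np.array([                              # central differences
    (loss(w + 1e-6 * e) - loss(w - 1e-6 * e)) / 2e-6
    for e in np.eye(3)
])

print(np.allclose(analytic, numeric, atol=1e-4))  # True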

2.2 Exercise 2b
From Exercise 2a, we have the derivative as:
∂L(w)/∂w = 2 X^T (Xw − y)
Setting this equal to zero:

X^T (Xw* − y) = 0

⇒ X^T X w* = X^T y

Since X has full column rank, X^T X is invertible. Thus, we can solve for w*:

w* = (X^T X)^{-1} X^T y
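A brief numerical check of this closed form on synthetic data (names, shapes, and seed are illustrative; X is full column rank with overwhelming probability): solving the normal equations should agree with NumPy's least-squares routine.

import numpy as np

# Closed-form OLS vs. numpy's least-squares solver on synthetic data.
rng = np.random.default_rng(1)
X = rng.normal(size=(50, 4))   # 50 samples, 4 features
y = rng.normal(size=50)

# w* = (X^T X)^{-1} X^T y, computed via solve() instead of an explicit inverse
w_closed = np.linalg.solve(X.T @ X, X.T @ y)
w_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)

print(np.allclose(w_closed, w_lstsq))  # True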

2.3 Exercise 2c
We have:
L_ridge(w) = ∥Xw − y∥^2 + λ∥w∥^2 = (Xw − y)^T (Xw − y) + λ w^T w

Taking the derivative with respect to w:


∂L_ridge(w)/∂w = 2 X^T (Xw − y) + 2λw

Setting the derivative equal to zero, we have:

X^T (Xw* − y) + λw* = 0

⇒ X^T X w* + λw* = X^T y
⇒ (X^T X + λI) w* = X^T y
For λ > 0, X^T X + λI is positive definite: X^T X is always positive semidefinite (for any v, v^T X^T X v = ∥Xv∥^2 ≥ 0), and adding λI shifts every eigenvalue up by λ > 0. Therefore, X^T X + λI is invertible, and the solution is:

w* = (X^T X + λI)^{-1} X^T y
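The same kind of numerical check works for the ridge solution; a minimal sketch with synthetic data and an arbitrary λ > 0, verifying that the gradient derived above vanishes at w*.

import numpy as np

# Ridge closed form: w* = (X^T X + lam * I)^{-1} X^T y.
rng = np.random.default_rng(2)
X = rng.normal(size=(50, 4))
y = rng.normal(size=50)
lam = 0.1                      # arbitrary regularization strength

m = X.shape[1]
w_ridge = np.linalg.solve(X.T @ X + lam * np.eye(m), X.T @ y)

grad = 2 * X.T @ (X @ w_ridge - y) + 2 * lam * w_ridge
print(np.allclose(grad, 0))    # True: the ridge gradient vanishes at w*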

2.4 Exercise 2d
Let X* and y* be the feature matrix and label vector of a new dataset such that ordinary least squares regression on this new dataset yields the same objective as L2-regularized regression on the original dataset. In other words:

∥X* w − y*∥^2 = ∥Xw − y∥^2 + λ∥w∥^2        (1)


 
Besides that, the squared norm of a stacked vector [v_1; v_2] is the sum of the squared norms of v_1 and v_2:

∥[v_1; v_2]∥^2 = ∥v_1∥^2 + ∥v_2∥^2
Therefore, we can 'compress' the expression ∥Xw − y∥^2 + λ∥w∥^2 into:

L(w) = ∥[Xw − y; √λ I w]∥^2 = ∥[X; √λ I] w − [y; 0]∥^2        (2)

where I is the m × m identity matrix and m is the number of features (columns of X). Note that the added block must be scaled by √λ rather than λ, since ∥√λ I w∥^2 = λ∥w∥^2.
Comparing (2) to (1), we can set X* = [X; √λ I] and y* = [y; 0] so that (1) is satisfied. To do this, we add m artificial samples to the dataset, constructed so that their input features form the matrix √λ I and their corresponding outputs are all zero:

X_artificial = √λ I,   y_artificial = 0

Hence, we can augment the original dataset (X, y) into the dataset (X*, y*) to achieve the same effect as L2 regularization while using ordinary least squares regression.
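A minimal sketch of this augmentation on synthetic data with an arbitrary λ: ordinary least squares on (X*, y*) should return the same weights as the ridge closed form.

import numpy as np

# OLS on the augmented dataset reproduces the ridge solution.
rng = np.random.default_rng(3)
X = rng.normal(size=(50, 4))
y = rng.normal(size=50)
lam = 0.1
m = X.shape[1]

# m artificial samples: features sqrt(lam) * I, targets 0.
X_aug = np.vstack([X, np.sqrt(lam) * np.eye(m)])
y_aug = np.concatenate([y, np.zeros(m)])

w_ols_aug, *_ = np.linalg.lstsq(X_aug, y_aug, rcond=None)
w_ridge = np.linalg.solve(X.T @ X + lam * np.eye(m), X.T @ y)

print(np.allclose(w_ols_aug, w_ridge))  # True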

3 Coding Questions
My code is documented with inline comments.
