
CS771: Practice Set 2

Problem 1
(A More Fancy Version of?) Consider a classification model where we are given training data {xn, yn}_{n=1}^N from K classes. Each input xn ∈ R^D, and each class c is defined by two parameters, wc ∈ R^D and a D × D positive definite (PD) matrix Mc, for c = 1, 2, . . . , K. Assume Nc denotes the number of training examples from class c. Suppose we estimate wc and Mc by solving the following optimization problem

(ŵc, M̂c) = arg min_{wc, Mc}  Σ_{xn : yn = c} (1/Nc) (xn − wc)⊤ Mc (xn − wc) − log |Mc|

(note that, in the above objective, the log |Mc| term helps ensure positive definiteness of Mc, since the determinant of a PD matrix is always strictly positive)
For the given objective/loss function, find the optimal values of wc and Mc using first-order optimality (you may use standard results on derivatives of functions w.r.t. vectors and matrices from the Matrix Cookbook, https://www.math.uwaterloo.ca/~hwolkowi/matrixcookbook.pdf).
Also, what will this model reduce to as a special case when Mc is an identity matrix?
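
The following is a minimal NumPy sketch, not part of the original problem, that simply evaluates the objective above for one class on synthetic data, using the class mean as a convenient candidate for wc and the identity as a PD candidate for Mc. The data, seed, and candidate parameters are illustrative assumptions; deriving the actual minimizers is the exercise.

```python
# Illustrative only: evaluate the Problem 1 objective for one class c on
# synthetic data, for arbitrary candidate values of w_c and M_c.
import numpy as np

rng = np.random.default_rng(0)
D, N_c = 3, 50
X_c = rng.normal(size=(N_c, D))      # the x_n with y_n = c (synthetic)

w_c = X_c.mean(axis=0)               # a convenient candidate for w_c
M_c = np.eye(D)                      # an arbitrary PD candidate for M_c

diffs = X_c - w_c                    # rows are (x_n - w_c)
quad = np.einsum('ni,ij,nj->n', diffs, M_c, diffs)   # (x_n - w_c)^T M_c (x_n - w_c)
objective = quad.sum() / N_c - np.log(np.linalg.det(M_c))
print(objective)
```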

Problem 2
(Corrective Updates) Consider the weight vector update equation of the Perceptron algorithm for binary classification: w(t+1) = w(t) + yn xn. Assume yn ∈ {−1, +1}.
Prove that these updates are “corrective” in nature, i.e., if the current weight vector w(t) mispredicts (xn, yn) (i.e., if yn w(t)⊤ xn < 0), then after this update the new weight vector w(t+1) will mispredict this example to a “lesser extent” (i.e., yn w(t+1)⊤ xn will be less negative than yn w(t)⊤ xn).
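
Below is a small numerical illustration (not a proof, and not part of the original problem): it constructs a synthetic example that the current weight vector mispredicts, applies the stated update once, and checks that yn w⊤ xn becomes less negative.

```python
# Numerical illustration of Problem 2: one Perceptron update on a mispredicted
# example should make y_n * w^T x_n less negative. Synthetic data only.
import numpy as np

rng = np.random.default_rng(1)
x_n = rng.normal(size=5)
y_n = 1.0
w_t = -x_n + rng.normal(scale=0.1, size=5)   # chosen so that y_n * w_t^T x_n < 0

score_before = y_n * (w_t @ x_n)
assert score_before < 0                      # the example is currently mispredicted

w_next = w_t + y_n * x_n                     # the update from the problem statement
score_after = y_n * (w_next @ x_n)

print(score_before, score_after)
assert score_after > score_before            # the misprediction is less severe
```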

Problem 3
(Arbitrary Choice?) Formally, show that changing the condition yn (w⊤ xn + b) ≥ 1 in SVM to a different condition yn (w⊤ xn + b) ≥ m does not change the effective separating hyperplane that is learned by the SVM.
Assume the hard-margin SVM for simplicity.
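
As a numerical companion (assuming the cvxpy package is available; the data, seed, and the two margin values are illustrative), the sketch below solves the hard-margin problem with the right-hand side of the constraint set to m = 1 and to m = 5 and prints the normalized hyperplane parameters, which should coincide.

```python
# Numerical companion to Problem 3 (illustrative; assumes cvxpy is installed):
# solve the hard-margin SVM with the constraint right-hand side set to m = 1
# and m = 5, then compare the normalized hyperplanes.
import cvxpy as cp
import numpy as np

rng = np.random.default_rng(2)
X = np.vstack([rng.normal(loc=[3, 3], size=(20, 2)),
               rng.normal(loc=[-3, -3], size=(20, 2))])   # linearly separable data
y = np.hstack([np.ones(20), -np.ones(20)])

def hard_margin_svm(m):
    w, b = cp.Variable(2), cp.Variable()
    constraints = [cp.multiply(y, X @ w + b) >= m]
    cp.Problem(cp.Minimize(cp.sum_squares(w)), constraints).solve()
    return w.value, b.value

for m in (1.0, 5.0):
    w, b = hard_margin_svm(m)
    # The direction w/||w|| and offset b/||w|| define the separating hyperplane.
    print(m, w / np.linalg.norm(w), b / np.linalg.norm(w))
```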

Problem 4
(Recover the Bias) Assuming hard-margin SVM, show that, given the solution for the dual variables αn ’s, the
bias term b ∈ R can be computed as b = ys − ts where s can denote the index of any of the support vectors,
and ts is a term that requires computing a summation defined over all the support vectors. (Hint: Use KKT
conditions)
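
For readers who want to sanity-check their derivation afterwards, here is a sketch (assuming scikit-learn is available; the data and seed are synthetic) that fits a linear SVM with a very large C to approximate the hard-margin solution and recomputes b from one support vector using the standard dual-variable summation. Deriving that summation from the KKT conditions is the actual exercise.

```python
# Sanity check for Problem 4 (not a derivation). Fits a nearly hard-margin linear
# SVM (large C) with scikit-learn, then recomputes the bias from one support
# vector using the standard dual summation; compare against clf.intercept_.
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(3)
X = np.vstack([rng.normal(loc=[3, 3], size=(20, 2)),
               rng.normal(loc=[-3, -3], size=(20, 2))])
y = np.hstack([np.ones(20), -np.ones(20)])

clf = SVC(kernel="linear", C=1e6).fit(X, y)   # large C approximates hard margin
sv = clf.support_                             # indices of the support vectors
s = sv[0]                                     # any support vector will do
# clf.dual_coef_ holds alpha_i * y_i for each support vector i.
t_s = (clf.dual_coef_ @ (X[sv] @ X[s])).item()
b = y[s] - t_s
print(b, clf.intercept_[0])                   # these should (approximately) agree
```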

Problem 5
(Look Ma, No Subgradients!) Show that we can rewrite regression with the absolute loss function |yn − w⊤ xn| as a reweighted least squares objective where the squared loss term for each example (xn, yn) is multiplied by an importance weight sn > 0. Write down the expression for sn, and briefly explain why this expression for sn makes intuitive sense. Given N examples {(xn, yn)}_{n=1}^N, briefly outline the steps of an optimization algorithm that estimates the unknowns (w and the importance weights {sn}_{n=1}^N) for this reweighted least squares problem.
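
As a scaffold (an assumption-laden sketch, not the solution), the skeleton below shows the general shape such an alternating procedure could take: fix the importance weights, solve the resulting weighted least-squares problem for w in closed form, then recompute the weights. The function importance_weight is a deliberately unimplemented placeholder for the expression sn that the problem asks you to derive.

```python
# A scaffold (not the solution) for Problem 5: alternate between fixing the
# importance weights s_n and solving the resulting weighted least-squares
# problem for w in closed form. importance_weight is a placeholder for the
# expression the problem asks you to derive.
import numpy as np

def importance_weight(residual):
    """Placeholder: return s_n as a function of the residual y_n - w^T x_n."""
    raise NotImplementedError

def reweighted_least_squares(X, y, n_iters=20):
    N, D = X.shape
    w = np.zeros(D)
    for _ in range(n_iters):
        r = y - X @ w
        s = np.array([importance_weight(r_n) for r_n in r])   # all s_n > 0
        S = np.diag(s)
        # closed-form minimizer of sum_n s_n (y_n - w^T x_n)^2
        w = np.linalg.solve(X.T @ S @ X, X.T @ S @ y)
    return w
```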

Problem 6
(Linear Regression viewed as Nearest Neighbors) Show that, for the unregularized linear regression model, where the solution is ŵ = (X⊤X)⁻¹ X⊤y, the prediction at a test input x∗ can be written as a weighted sum of all the training responses, i.e.,

f(x∗) = Σ_{n=1}^{N} wn yn

Give the expression for the weights wn ’s in this case and briefly discuss (<50 words) in what way these weights
are different from the weights in a weighted version of K nearest neighbors where each wn typically is the
inverse distance of x∗ from the training input xn . Note: You do not need to give a very detailed expression for
wn (if it makes algebra messy) but you must give a precise meaning as to what wn depends on and how it is
different from the weights in the weighted K nearest neighbors.
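
The snippet below (synthetic data) numerically confirms that the prediction at x∗ really can be written as such a weighted sum of the training responses; the candidate weight vector h it uses is stated without derivation, and deriving and interpreting those weights is the exercise.

```python
# Numerical check for Problem 6 (synthetic data): the unregularized linear
# regression prediction at x_* equals a weighted sum of the training responses.
# The weight vector h below is stated, not derived.
import numpy as np

rng = np.random.default_rng(4)
N, D = 30, 4
X = rng.normal(size=(N, D))
y = rng.normal(size=N)
x_star = rng.normal(size=D)

w_hat = np.linalg.solve(X.T @ X, X.T @ y)    # (X^T X)^{-1} X^T y
pred_direct = x_star @ w_hat

h = X @ np.linalg.solve(X.T @ X, x_star)     # one candidate weight per training example
pred_weighted = h @ y                        # sum_n w_n * y_n
print(pred_direct, pred_weighted)            # should agree up to numerical precision
```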

Problem 7
(Feature Masking as Regularization) Consider a linear regression model trained by minimizing the squared loss function Σ_{n=1}^{N} (yn − w⊤ xn)². Suppose we decide to mask out or “drop” each feature xnd of each input xn ∈ R^D,
independently, with probability 1 − p (equivalently, retaining the feature with probability p). Masking or drop-
ping out basically means that we will set the feature xnd to 0 with probability 1 − p. Essentially, it would be
equivalent to replacing each input xn by x̃n = xn ◦ mn , where ◦ denotes elementwise product and mn denotes
the D × 1 binary mask vector with mnd ∼ Bernoulli(p) (mnd = 1 means the feature xnd was retained; mnd = 0
means the feature xnd was masked/zeroed).
Let us now define a new loss function using these masked inputs as follows: Σ_{n=1}^{N} (yn − w⊤ x̃n)². Show that
minimizing the expected value of this new loss function (where the expectation is used since the mask vectors
mn are random) is equivalent to minimizing a regularized loss function. Clearly write down the expression of
this regularized loss function. Note that showing this would require some standard results related to expectation
of random variables, such as linearity of expectation, and expectation and variance of a Bernoulli random
variable. Note that, so far in the course, we haven’t talked much about probability ideas but, with this much
information, you should be able to attempt this problem.
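
To make the setup concrete, here is a small Monte Carlo sketch (synthetic data and a fixed, arbitrary w; purely illustrative, not the requested derivation) that estimates the expected value of the new masked loss by averaging it over many sampled Bernoulli(p) mask vectors.

```python
# Monte Carlo illustration of the Problem 7 setup (synthetic data, fixed w):
# estimate the expected masked loss by averaging over sampled Bernoulli(p) masks.
import numpy as np

rng = np.random.default_rng(5)
N, D, p = 50, 10, 0.8
X = rng.normal(size=(N, D))
y = rng.normal(size=N)
w = rng.normal(size=D)

n_samples = 5000
losses = np.empty(n_samples)
for t in range(n_samples):
    M = rng.binomial(1, p, size=(N, D))   # one mask vector m_n per input x_n
    X_tilde = X * M                       # x~_n = x_n o m_n (elementwise product)
    losses[t] = np.sum((y - X_tilde @ w) ** 2)

print(losses.mean())                      # estimate of the expected new loss at this w
```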
