
Elements of Machine Learning, WS 2024/2025

Prof. Dr. Isabel Valera and Dr. Kavya Gupta


Assignment Sheet #3: Generalization, Regularization and Beyond Linearity

Deadline: Wednesday, December 11th, 2024 23:59 hrs

This problem set is worth a total of 50 points, consisting of 3 theory questions and 1 programming
question. Please carefully follow the instructions below to ensure a valid submission:

• You are encouraged to work in groups of two students. Register your team (of 1 or 2 members) on
the CMS at least ONE week before the submission deadline. You have to register your team for
each assignment.
• All solutions, including coding answers, must be uploaded individually to the CMS under the
corresponding assignment and problem number. On CMS you will find FOUR problems under each
assignment. Make sure you upload each of your solutions correctly under
Assignment #X – Problem Y (where X is the assignment number and Y is the problem number) on
CMS. In total you have to upload THREE PDFs (theoretical problems) and ONE ZIP file
(programming problem).
• For each theoretical question, we encourage using LaTeX or Word to write your solutions for
clarity and readability. Scanned handwritten solutions will be accepted as long as they are clean
and easily legible. The final submission must always be a single PDF file per theoretical
problem. Ensure your name, your team member's name (if applicable), and matriculation numbers are
clearly listed at the top of each PDF.
• For the programming question, you need to upload a ZIP file to CMS under
Assignment #X – Problem 4. Each ZIP file must contain a PDF or HTML exported from the Jupyter
notebook and the .ipynb file with your solutions. Make sure all cells in your Jupyter notebook contain
your final answers. To create the PDF/HTML, use the export function of the Jupyter notebook. Before
exporting, ensure that all cells have been computed. To do this:

– Go to the “Cell” menu at the top of the Jupyter interface.


– Select “Run All” to execute every cell in your notebook.
– Once all cells are executed, export the notebook: Click on “File” in the top menu.
– Choose “Export As” and select either PDF or HTML.

The submission should include your name, your team member's name, and matriculation numbers at the
top of both the PDF/HTML and the .ipynb file.

• Finally, ensure academic integrity is maintained. Cite any external resources you use for your
assignment.
• If you have any questions, follow the instructions here.


Problem 1 (Generalization). (10 Points)


1. Assume you are only given training points for a binary classification problem and a small validation
set. Does it make sense to compute the validation error for all classification methods (Logistic
Regression, LDA, QDA) and report the minimal validation error over all methods as an estimate of the
test error? Justify your answer (a minimal sketch of this setup is shown below). (3 Points)
2. Is it possible that model selection using cross-validation overfits? If yes, describe with an example;
if no, explain why overfitting is impossible. (4 Points)
3. Why does K-fold CV result in a higher bias than LOOCV? (3 Points)
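A minimal sketch of the setup in part 1, assuming scikit-learn; the data and split here are illustrative placeholders, not part of the assignment:

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.discriminant_analysis import (LinearDiscriminantAnalysis,
                                           QuadraticDiscriminantAnalysis)

# Placeholder data standing in for the training points mentioned in the problem.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = (X[:, 0] + X[:, 1] ** 2 > 0.5).astype(int)

# Hold out a small validation set, as in the problem statement.
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=20, random_state=0)

models = {"LogReg": LogisticRegression(),
          "LDA": LinearDiscriminantAnalysis(),
          "QDA": QuadraticDiscriminantAnalysis()}
# Validation error (1 - accuracy) for each method on the same small validation set.
val_errors = {name: 1 - m.fit(X_tr, y_tr).score(X_val, y_val)
              for name, m in models.items()}
print(val_errors)  # part 1 asks whether min(val_errors.values()) estimates the test error
```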

Problem 2 (Regularization). (15 Points)


1. Lasso and ridge regression are used to predict a target Y from X, as shown in Equations 2.1 and
2.2, respectively. To understand which of the two models is better suited for a task, their objectives
are written as follows (a short numerical sketch evaluating both objectives follows part (b)):
 2
n
X p
X p
X
yi − β0 − βj xij  + λ |βj | (2.1)
i=1 j=1 j=1

 2
n
X p
X p
X
yi − β0 − βj xij  + λ βj2 (2.2)
i=1 j=1 j=1

(a) Discuss how the model coefficients βj change as λ → 0 and as λ → ∞ in both Equations 2.1
and 2.2. (4 Points)
(b) If we have significantly more independent features than observations and want to perform
feature selection, which type of regularization method should we use? (Hint: L1 or L2?) What
value of λ should be considered, i.e., small or large? (3 Points)
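A minimal sketch that evaluates the two objectives numerically, assuming NumPy; the data and coefficient values are illustrative placeholders, not part of the problem:

```python
import numpy as np

def lasso_objective(y, X, beta0, beta, lam):
    # Equation 2.1: RSS + lambda * sum_j |beta_j|
    residuals = y - beta0 - X @ beta
    return np.sum(residuals ** 2) + lam * np.sum(np.abs(beta))

def ridge_objective(y, X, beta0, beta, lam):
    # Equation 2.2: RSS + lambda * sum_j beta_j^2
    residuals = y - beta0 - X @ beta
    return np.sum(residuals ** 2) + lam * np.sum(beta ** 2)

# Illustrative data and coefficients (assumed for the sketch only).
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 4))
y = X @ np.array([1.0, 0.0, -2.0, 0.0]) + rng.normal(size=50)
beta = np.array([0.9, 0.1, -1.8, 0.0])

for lam in (0.0, 1.0, 100.0):  # compare how each penalty scales with lambda
    print(lam, lasso_objective(y, X, 0.0, beta, lam), ridge_objective(y, X, 0.0, beta, lam))
```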
2. Suppose that $y_i = \beta_0 + \sum_{j=1}^{p} x_{ij} \beta_j + \epsilon_i$, where $\epsilon_1, \ldots, \epsilon_n$ are independent and identically
distributed from a $N(0, \sigma^2)$ distribution.
(a) Write out the likelihood for the data (see the hint after part (c)). (2 Points)
(b) Assume the prior for $\beta$: $\beta_1, \ldots, \beta_p$ are independent and identically distributed according to a
double-exponential distribution with mean 0 and common scale parameter $b$, written as

$$p(\beta) = \frac{1}{2b} \exp\left( -\frac{|\beta|}{b} \right).$$

Write out the posterior for $\beta$ in this setting. (2 Points)
(c) Show that the lasso estimate is the mode for β under this posterior distribution. (4 Points)
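Hint: As a reminder, the density of a single $N(0, \sigma^2)$ error term $\epsilon_i$ is

$$p(\epsilon_i) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left( -\frac{\epsilon_i^2}{2\sigma^2} \right),$$

and, by independence, the likelihood of the data is the product of these densities over $i = 1, \ldots, n$.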

Problem 3 (Beyond linearity: Polynomial and Splines). (15 Points)


1. A cubic regression spline with one knot at $\xi$ can be obtained using a basis of the form
$x, x^2, x^3, (x - \xi)^3_+$, where $(x - \xi)^3_+ = (x - \xi)^3$ if $x > \xi$ and equals 0 otherwise. We can show that a
function of the form

$$f(x) = \beta_0 + \beta_1 x + \beta_2 x^2 + \beta_3 x^3 + \beta_4 (x - \xi)^3_+$$

is indeed a cubic regression spline, regardless of the values of $\beta_0, \beta_1, \beta_2, \beta_3, \beta_4$.

(a) Find a cubic polynomial (2 Points)

$$f_1(x) = a_1 + b_1 x + c_1 x^2 + d_1 x^3$$

such that $f(x) = f_1(x)$ for all $x \le \xi$. Express $a_1, b_1, c_1, d_1$ in terms of $\beta_0, \beta_1, \beta_2, \beta_3, \beta_4$.


(b) Find a cubic polynomial (2 Points)

$$f_2(x) = a_2 + b_2 x + c_2 x^2 + d_2 x^3$$

such that $f(x) = f_2(x)$ for all $x > \xi$. Express $a_2, b_2, c_2, d_2$ in terms of $\beta_0, \beta_1, \beta_2, \beta_3, \beta_4$. We
have now established that $f(x)$ is a piecewise polynomial.
(c) Show that $f_1(\xi) = f_2(\xi)$. That is, $f(x)$ is continuous at $\xi$. (2 Points)
(d) Show that $f_1'(\xi) = f_2'(\xi)$. That is, $f'(x)$ is continuous at $\xi$. (2 Points)
(e) Show that $f_1''(\xi) = f_2''(\xi)$. That is, $f''(x)$ is continuous at $\xi$. (2 Points)
Therefore, $f(x)$ is indeed a cubic spline. (A numerical sanity check of these continuity
conditions is sketched after the hint below.)
Hint: Parts (d) and (e) of this problem require knowledge of single-variable calculus. As a
reminder, given a cubic polynomial

$$f_1(x) = a_1 + b_1 x + c_1 x^2 + d_1 x^3,$$

the first derivative takes the form

$$f_1'(x) = b_1 + 2 c_1 x + 3 d_1 x^2.$$
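To sanity-check the continuity results of parts (c)–(e) numerically, a minimal sketch assuming NumPy (the coefficients and knot below are arbitrary illustrations):

```python
import numpy as np

# Arbitrary illustrative coefficients and knot; any values work for the check.
b0, b1, b2, b3, b4 = 1.0, -2.0, 0.5, 3.0, -1.5
xi = 0.7

def f(x):
    # f(x) = b0 + b1*x + b2*x^2 + b3*x^3 + b4*(x - xi)_+^3
    return b0 + b1 * x + b2 * x ** 2 + b3 * x ** 3 + b4 * np.maximum(x - xi, 0.0) ** 3

def d1(x, h=1e-5):
    # central finite difference for f'
    return (f(x + h) - f(x - h)) / (2 * h)

def d2(x, h=1e-4):
    # central finite difference for f''
    return (f(x + h) - 2 * f(x) + f(x - h)) / h ** 2

eps = 1e-3  # evaluate just left and just right of the knot
for name, g in [("f", f), ("f'", d1), ("f''", d2)]:
    print(f"{name}: left={g(xi - eps):.5f}  right={g(xi + eps):.5f}")
# The left/right values should match up to O(eps), reflecting continuity at xi.
```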

2. Consider two curves, $\hat{g}_1$ and $\hat{g}_2$, defined by (5 Points)

$$\hat{g}_1 = \arg\min_{g} \left( \sum_{i=1}^{n} \big( y_i - g(x_i) \big)^2 + \lambda \int \big[ g^{(3)}(x) \big]^2 \, dx \right),$$

$$\hat{g}_2 = \arg\min_{g} \left( \sum_{i=1}^{n} \big( y_i - g(x_i) \big)^2 + \lambda \int \big[ g^{(4)}(x) \big]^2 \, dx \right),$$

where $g^{(m)}$ represents the $m$-th derivative of $g$.

(a) As λ → ∞, will ĝ1 or ĝ2 have the smaller training RSS?


(b) As λ → ∞, will ĝ1 or ĝ2 have the smaller test RSS?
(c) For λ = 0, will ĝ1 or ĝ2 have the smaller training and test RSS?

Problem 4 (Coding Generalization, Regularization and Beyond Linearity). (10 Points)

In this assignment, you will work on selecting the best model using K-fold cross-validation. You will also
explore methods for selecting hyperparameters to enhance the generalizability of your trained models.

Please refer to the file assignment_3_handout.ipynb and complete only the sections marked in red and
the missing code denoted with #TODO. Once you have filled in the required parts, revisit the submission
instructions above to check how to submit. A generic sketch of the K-fold workflow is shown below.
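The notebook defines the actual tasks; the following is only a generic, minimal sketch of K-fold hyperparameter selection, assuming scikit-learn and placeholder data:

```python
import numpy as np
from sklearn.model_selection import KFold
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error

# Placeholder data; the notebook provides its own dataset.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
y = X @ rng.normal(size=5) + rng.normal(scale=0.5, size=100)

lambdas = [0.01, 0.1, 1.0, 10.0]  # candidate regularization strengths
kf = KFold(n_splits=5, shuffle=True, random_state=0)

cv_errors = {}
for lam in lambdas:
    fold_errors = []
    for train_idx, val_idx in kf.split(X):
        model = Ridge(alpha=lam).fit(X[train_idx], y[train_idx])
        fold_errors.append(mean_squared_error(y[val_idx], model.predict(X[val_idx])))
    cv_errors[lam] = np.mean(fold_errors)  # average validation MSE over the K folds

best_lam = min(cv_errors, key=cv_errors.get)
print(f"best lambda by 5-fold CV: {best_lam}")
```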
