
Aprendizagem 2023

Lab 5: Linear and kernel regression

Practical exercises

1. Consider the following training data:

        y1   y2   output
   x1    1    1    1.4
   x2    2    1    0.5
   x3    1    3    2
   x4    3    3    2.5

a) Find the closed form solution for a linear regression minimizing the sum of squared errors

   w = (XᵀX)⁻¹ Xᵀz = (0.275, 0.02, 0.645)ᵀ

b) Predict the target value for x_new = [2, 3]ᵀ

   output(x_new) = w₀ + w₁·2 + w₂·3 = 0.275 + 0.04 + 1.935 = 2.25

c) Sketch the predicted three-dimensional hyperplane

   ẑ = 0.275 + 0.02 y₁ + 0.645 y₂

d) Compute the MSE and MAE produced by the linear regression

   ẑ = (0.94, 0.96, 2.23, 2.27)

   MSE(z, ẑ) = 0.13225,   MAE(z, ẑ) = 0.345
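
The closed-form computations in a), b) and d) can be reproduced with a short NumPy sketch (the leading column of ones in X is the intercept term added to the data above):

    import numpy as np

    # Training data of exercise 1; the leading column of ones is the intercept term
    X = np.array([[1, 1, 1],
                  [1, 2, 1],
                  [1, 1, 3],
                  [1, 3, 3]], dtype=float)
    z = np.array([1.4, 0.5, 2.0, 2.5])

    # a) closed-form ordinary least squares solution
    w = np.linalg.inv(X.T @ X) @ X.T @ z
    print(w)                              # approx [0.275, 0.02, 0.645]

    # b) prediction for x_new = [2, 3] (prepend 1 for the intercept)
    print(np.array([1, 2, 3]) @ w)        # approx 2.25

    # d) training MSE and MAE
    z_hat = X @ w
    print(np.mean((z - z_hat) ** 2))      # approx 0.13225
    print(np.mean(np.abs(z - z_hat)))     # approx 0.345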

e) Are there biases on the residuals against y1? And against y2?

   residuals = z − ẑ = (0.46, −0.46, −0.23, 0.23)


   [Two scatter plots: residuals against y1 (left) and residuals against y2 (right), with residuals ranging roughly between −0.6 and 0.6.]

There is no evidence of bias: the residuals appear to be randomly distributed against both y1 and y2.
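
One possible way to obtain the two residual plots above, as a self-contained matplotlib sketch (the plot styling is an arbitrary choice):

    import numpy as np
    import matplotlib.pyplot as plt

    X = np.array([[1, 1, 1], [1, 2, 1], [1, 1, 3], [1, 3, 3]], dtype=float)
    z = np.array([1.4, 0.5, 2.0, 2.5])
    w = np.linalg.inv(X.T @ X) @ X.T @ z
    residuals = z - X @ w                 # approx [0.46, -0.46, -0.23, 0.23]

    # Scatter the residuals against each feature to check for systematic patterns
    fig, axes = plt.subplots(1, 2, figsize=(8, 3))
    for ax, col, name in zip(axes, (1, 2), ("y1", "y2")):
        ax.scatter(X[:, col], residuals)
        ax.axhline(0, color="gray", linewidth=0.5)
        ax.set_xlabel(name)
        ax.set_ylabel("residual")
    plt.show()
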
f) Compute the closed form solution considering a Ridge regularization term with λ = 0.2.

   w = (XᵀX + λI)⁻¹ Xᵀz = (0.24, 0.05, 0.63)ᵀ
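
The Ridge solution can be checked the same way; note that this closed form regularizes the intercept weight together with the others (scikit-learn's Ridge, by contrast, leaves the intercept unpenalized, so its coefficients would differ slightly):

    import numpy as np

    X = np.array([[1, 1, 1], [1, 2, 1], [1, 1, 3], [1, 3, 3]], dtype=float)
    z = np.array([1.4, 0.5, 2.0, 2.5])

    lam = 0.2
    w_ridge = np.linalg.inv(X.T @ X + lam * np.eye(X.shape[1])) @ X.T @ z
    print(w_ridge)                        # approx [0.24, 0.05, 0.63]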

g) Compare the hyperplanes obtained using ordinary least squares and Ridge regression.

As expected, the weight vector describing the Ridge hyperplane has a smaller norm than the one obtained with ordinary least squares.

h) Why is Lasso regression suggested for data spaces of higher dimensionality?


Lasso provides an elegant way of regularizing predictors by producing a sparse weight vector w: zero entries correspond to variables that do not affect the regression. In high-dimensional spaces it can therefore be seen as an alternative to feature selection, supporting the learning convergence and generalization ability of the regression model.
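
As an illustration of this sparsity effect, fitting scikit-learn's Lasso on the data of exercise 1 already zeroes out the weakly informative y1 coefficient (alpha = 0.1 is an arbitrary choice for the sketch):

    import numpy as np
    from sklearn.linear_model import Lasso

    X = np.array([[1, 1], [2, 1], [1, 3], [3, 3]], dtype=float)
    z = np.array([1.4, 0.5, 2.0, 2.5])

    lasso = Lasso(alpha=0.1).fit(X, z)
    print(lasso.coef_)        # approx [0.0, 0.55]: the y1 coefficient is exactly zero
    print(lasso.intercept_)   # approx 0.5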

2. Consider the following training data, where output is an ordinal variable:

        y1   y2   output
   x1    1    1     1
   x2    2    1     1
   x3    1    3     0
   x4    3    3     0

a) Find a linear regression using the closed form solution

   w = (XᵀX)⁻¹ Xᵀz = (1.5, 0, −0.5)ᵀ

b) Assuming the output threshold θ=0.5, use the regression to classify 𝐱 new = [2 2.5]𝑇

The input is classified as 0, since output(x_new) = 1.5 + 0 × 2 − 0.5 × 2.5 = 0.25 < 0.5.
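
A sketch of both steps, reusing the closed form and then thresholding the regression output at θ = 0.5:

    import numpy as np

    # Training data of exercise 2 (intercept column prepended)
    X = np.array([[1, 1, 1], [1, 2, 1], [1, 1, 3], [1, 3, 3]], dtype=float)
    z = np.array([1, 1, 0, 0], dtype=float)

    # a) closed-form linear regression
    w = np.linalg.inv(X.T @ X) @ X.T @ z
    print(w)                              # approx [1.5, 0.0, -0.5]

    # b) classify x_new = [2, 2.5] with threshold 0.5
    score = np.array([1, 2, 2.5]) @ w     # 0.25
    print(1 if score >= 0.5 else 0)       # 0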

3. Consider the following data to learn a model z = w₁x₁ + w₂x₂ + ε, where ε ∼ N(0, 0.1):

        y1   y2   output
   x1    3   -1     2
   x2    4    2     1
   x3    2    2     1

Compare:

a) w = [w₁ w₂]ᵀ using the maximum likelihood approach

   The maximum likelihood estimate is given by (proof on the slides): w = (XᵀX)⁻¹ Xᵀz.
   Solve similarly to the previous exercises; note that the model has no intercept term.

b) w using the Bayesian approach, assuming p(w) = N(w | μ = [0 0]ᵀ, Σ = [0.2 0; 0 0.2])

   The maximum a posteriori estimate is given by (proof on the slides): w = (XᵀX + λI)⁻¹ Xᵀz,
   with λ = σ²_noise / σ²_prior = 0.1² / 0.2² = 0.25. Solve similarly to 1.f).
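
A sketch for both estimates, assuming (as above) λ = 0.1²/0.2² and no intercept column, since the model has no bias term:

    import numpy as np

    # Training data of exercise 3; the model z = w1*x1 + w2*x2 + eps has no intercept
    X = np.array([[3, -1], [4, 2], [2, 2]], dtype=float)
    z = np.array([2, 1, 1], dtype=float)

    # a) maximum likelihood estimate (equals ordinary least squares)
    w_ml = np.linalg.inv(X.T @ X) @ X.T @ z
    print(w_ml)                           # approx [0.5, -0.278]

    # b) maximum a posteriori estimate with the zero-mean Gaussian prior
    lam = 0.1 ** 2 / 0.2 ** 2             # 0.25, the ratio used in these notes
    w_map = np.linalg.inv(X.T @ X + lam * np.eye(2)) @ X.T @ z
    print(w_map)                          # approx [0.491, -0.261]
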
4. Identify a transformation to aid the linear modelling of the following data points. Sketch the predicted surface.

        y1      y2     output
   x1  -0.95   0.62      0
   x2   0.63   0.31      0
   x3  -0.12  -0.21      1
   x4  -0.24  -0.5       0
   x5   0.07  -0.42      1
   x6   0.03   0.91      0
   x7   0.05   0.09      1
   x8  -0.83   0.22      0

Plotting the data points, we see that the labels seem to change with the distance from the origin. One way to capture this is to apply a quadratic feature transform

   φ(y₁, y₂) = (y₁², y₂²)

   Φ = ( 1   (−0.95)²   0.62²   )        ( 0 )
       ( 1    0.63²     0.31²   )        ( 0 )
       ( 1   (−0.12)²  (−0.21)² )        ( 1 )
       ( 1   (−0.24)²  (−0.5)²  )    z = ( 0 )
       ( 1    0.07²    (−0.42)² )        ( 1 )
       ( 1    0.03²     0.91²   )        ( 0 )
       ( 1    0.05²     0.09²   )        ( 1 )
       ( 1   (−0.83)²   0.22²   )        ( 0 )

   w = (ΦᵀΦ)⁻¹ Φᵀz = (0.817, −0.865, −0.95)ᵀ

   The predicted surface is ẑ = 0.817 − 0.865 y₁² − 0.95 y₂².
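
A sketch of the transform and the closed-form fit on the transformed features:

    import numpy as np

    Y = np.array([[-0.95, 0.62], [0.63, 0.31], [-0.12, -0.21], [-0.24, -0.5],
                  [0.07, -0.42], [0.03, 0.91], [0.05, 0.09], [-0.83, 0.22]])
    z = np.array([0, 0, 1, 0, 1, 0, 1, 0], dtype=float)

    # Quadratic feature transform with an intercept column: phi(y1, y2) = (1, y1^2, y2^2)
    Phi = np.column_stack([np.ones(len(Y)), Y[:, 0] ** 2, Y[:, 1] ** 2])

    w = np.linalg.inv(Phi.T @ Phi) @ Phi.T @ z
    print(w)                              # approx [0.817, -0.865, -0.95]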

5. Consider the logarithmic and quadratic transformations φ₁(x) = log(x), φ₂(x) = x²:

        input   output
   x1     3      1.5
   x2     4      9.3
   x3     6     23.4
   x4    10     45.8
   x5    12     60.1

a) Plot both of the closed form regressions.
   Φ₁ = ( 1  1.0986 )        (  1.5 )
        ( 1  1.3863 )        (  9.3 )
        ( 1  1.7918 )    z = ( 23.4 )
        ( 1  2.3026 )        ( 45.8 )
        ( 1  2.4849 )        ( 60.1 )

   w₁ = (Φ₁ᵀΦ₁)⁻¹ Φ₁ᵀz = (−47.02, 41.395)ᵀ

   ẑ = −47.02 + 41.395 log(x)

   Φ₂ = ( 1    9 )        (  1.5 )
        ( 1   16 )        (  9.3 )
        ( 1   36 )    z = ( 23.4 )
        ( 1  100 )        ( 45.8 )
        ( 1  144 )        ( 60.1 )

   w₂ = (Φ₂ᵀΦ₂)⁻¹ Φ₂ᵀz = (2.7895, 0.4136)ᵀ

   ẑ = 2.7895 + 0.4136 x²

b) Which one minimizes the sum of squared errors on the original training data?

   MSE_log = 9.7618, MSE_quadratic = 13.1273, so the logarithmic transformation achieves the lower squared error on the training data.
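
Both fits and their training MSEs can be reproduced with a short sketch:

    import numpy as np

    x = np.array([3, 4, 6, 10, 12], dtype=float)
    z = np.array([1.5, 9.3, 23.4, 45.8, 60.1])

    def fit_and_mse(transform):
        # Closed-form fit on the transformed input (with intercept) and training MSE
        Phi = np.column_stack([np.ones(len(x)), transform(x)])
        w = np.linalg.inv(Phi.T @ Phi) @ Phi.T @ z
        return w, np.mean((z - Phi @ w) ** 2)

    print(fit_and_mse(np.log))            # approx [-47.02, 41.395], MSE approx 9.76
    print(fit_and_mse(np.square))         # approx [2.79, 0.414],   MSE approx 13.13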

6. Select the criteria promoting a smoother regression model:


• Applying Lasso and Ridge regularization to linear regression models True
• Increasing the depth of a decision tree regressor False
• Increasing the k of a kNN regressor True
• Parameterizing a kNN regressor with uniform weights instead of distance-based weights False

Programming quests
7. Consider the housing dataset available at https://web.ist.utl.pt/~rmch/dscience/data/housing.arff and the Regression notebook available at the course's webpage:

a) Compare the determination coefficient of the non-regularized, Lasso and Ridge linear regression

b) Compare the MAE and RMSE of linear, kNN and decision tree regressors on housing
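
A possible starting point with scikit-learn, assuming housing.arff has been downloaded locally, that its last attribute is the numeric target and all remaining attributes are numeric; the train/test split, neighbour count and regularization strengths are arbitrary choices for the sketch (the course's Regression notebook may set these up differently):

    import numpy as np
    import pandas as pd
    from scipy.io import arff
    from sklearn.model_selection import train_test_split
    from sklearn.linear_model import LinearRegression, Lasso, Ridge
    from sklearn.neighbors import KNeighborsRegressor
    from sklearn.tree import DecisionTreeRegressor
    from sklearn.metrics import r2_score, mean_absolute_error, mean_squared_error

    # Load the ARFF file and split features/target (assumes the last column is the target)
    data, _ = arff.loadarff("housing.arff")
    df = pd.DataFrame(data)
    X, y = df.iloc[:, :-1].values, df.iloc[:, -1].values
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

    # a) determination coefficient (R^2) of non-regularized, Lasso and Ridge regression
    for model in (LinearRegression(), Lasso(alpha=1.0), Ridge(alpha=1.0)):
        r2 = r2_score(y_test, model.fit(X_train, y_train).predict(X_test))
        print(type(model).__name__, "R^2 =", round(r2, 3))

    # b) MAE and RMSE of linear, kNN and decision tree regressors
    for model in (LinearRegression(), KNeighborsRegressor(n_neighbors=5),
                  DecisionTreeRegressor(random_state=0)):
        pred = model.fit(X_train, y_train).predict(X_test)
        print(type(model).__name__,
              "MAE =", round(mean_absolute_error(y_test, pred), 3),
              "RMSE =", round(np.sqrt(mean_squared_error(y_test, pred)), 3))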
