CMPT 726: Assignment 1 (Fall 2024)
Instructor: Steven Bergner
Important Note: The university policy on academic dishonesty (cheating) will be taken very seriously in this course.
You are encouraged to discuss the concepts involved in the questions with other students. If you are in doubt as to what constitutes acceptable discussion, please ask! Further, please take advantage of office hours offered by the instructor and the TA if you are having difficulties with this assignment.
DO NOT:
• Provide or use any solution, in whole or in part, to or by another student.
DO:
• Meet with other students to discuss the assignment (it is best not to take any notes during such meetings, and to re-work the assignment on your own).
• Use online resources (e.g., Wikipedia) to understand the concepts needed to solve the assignment.
The assignment must be submitted online on Coursys. You must submit a report in PDF format. You may typeset your assignment in LaTeX or Word, or submit neatly handwritten and scanned solutions. We will not be able to give credit to solutions that are not legible.
For the last question you are also required to submit a Python script (linreg_submission.py) containing your complete code for each of the linear regression tasks. You do not need to include the actual Python code in your report, but please provide a clear discussion of your results.
1. Linear Algebra
a) Let $U$ be a subspace of $\mathbb{R}^5$ defined by
$$U = \left\{ \begin{pmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \\ x_5 \end{pmatrix} \in \mathbb{R}^5 : x_1 = 3x_2 \text{ and } x_4 = 2x_5 \right\}$$
Figure 1: In this case, $x = (1, 0)$ and $y = (0, 1)$ are both rotated by $\theta = \pi/4$.
Show that $U$ and $V^\top$ are both rotation matrices and find their corresponding rotation angles $\theta_U$ and $\theta_{V^\top}$.
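For reference (this standard definition is supplied here for convenience), a $2 \times 2$ rotation matrix by angle $\theta$ has the form
$$R(\theta) = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}$$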
$$f(\vec{x}) = 5x_1^2 + 3x_2^2 + 2x_3^2 + 4x_1x_2 - 2x_1x_3 + 6x_2x_3$$
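It may help to note (a standard rewriting, supplied here for convenience) that $f$ can be expressed as a quadratic form $f(\vec{x}) = \vec{x}^\top A \vec{x}$ with the symmetric matrix
$$A = \begin{pmatrix} 5 & 2 & -1 \\ 2 & 3 & 3 \\ -1 & 3 & 2 \end{pmatrix}$$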
• $f(t\vec{x} + (1-t)\vec{y}) \le t f(\vec{x}) + (1-t) f(\vec{y})$
• If $f$ is differentiable: $f(\vec{y}) \ge f(\vec{x}) + (\nabla f(\vec{x}))^\top (\vec{y} - \vec{x})$
• If $f$ is twice differentiable: $H_f(\vec{x}) \succeq 0$
a) Given $x \in \mathbb{R}$ and only using the definition of convex functions given above, prove that the rectified linear unit function, $\mathrm{ReLU}(x) := \max(x, 0)$, is convex.
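As a sanity check on your argument (a numerical spot-check, not a proof), the first characterization can be tested on random points; the sketch below assumes NumPy.

import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

# Spot-check f(t*x + (1-t)*y) <= t*f(x) + (1-t)*f(y) on random inputs.
rng = np.random.default_rng(0)
for _ in range(10_000):
    x, y = rng.uniform(-10, 10, size=2)
    t = rng.uniform(0, 1)
    assert relu(t * x + (1 - t) * y) <= t * relu(x) + (1 - t) * relu(y) + 1e-12
print("No counterexample found in 10,000 random trials.")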
5. Linear Regression – House Price Prediction with Polynomial Features and Ridge Regression
You are working on predicting house prices in a real estate market using a dataset that consists of 500 examples,
each with multiple features: number of rooms, house age, area size, etc. The target variable is the house price. You
decide to apply linear regression and Ridge regression (a regularized form of linear regression) to build predictive
models. Your goal is to assess the generalization of these models using cross-validation and to experiment with
different polynomial degrees and regularization strengths to improve the model’s performance.
Data Loading: Use the following code to load the Boston Housing dataset from GitHub and initialize the DataFrame (the URL below is a stand-in; use the link provided with the assignment if it differs). For the model, only use the variables rm (number of rooms) and lstat (lower status population, percentage), and the target variable medv (median house value in $1,000s).

import pandas as pd

# Load the Boston Housing dataset from GitHub (URL is a stand-in; see note above)
url = "https://raw.githubusercontent.com/selva86/datasets/master/BostonHousing.csv"
df = pd.read_csv(url)

# Features: 'rm' (number of rooms) and 'lstat' (lower status population, percentage)
X = df[['rm', 'lstat']]
y = df['medv']  # Target: 'medv' (median house value in $1,000s)
Implementation Hints: For managing your dataset, use pandas DataFrames; for the models and training tools, use scikit-learn. Plotting can be done with pandas' built-in plotting functions or with matplotlib. You can refer to the official scikit-learn documentation for functions such as train_test_split, cross_val_score, LinearRegression, and Ridge.
For coding environments:
• You can work in a Jupyter notebook in Google Colab, and export it as a .py script for final submission.
• If you are already using VS Code, consider using #%% cell separators in your Python script, allowing you
to run parts of your script like Jupyter notebook cells.
Ensure your script includes the code for each task, such as the MSE computation and the best parameter choices, and include the resulting outputs in your PDF report.
a. Train-Test Split and Cross-Validation for Linear Regression with Polynomial Features
Split the dataset into a training set (80%) and a test set (20%). Use 5-fold cross-validation on the training set
to evaluate the performance of linear regression models with polynomial features. For each polynomial degree
(from 1 to 5), compute the average mean squared error (MSE) over the five folds and report your results.
Hint: Use PolynomialFeatures from scikit-learn to create polynomial features of different degrees. When using the cross_val_score function, set the scoring parameter to 'neg_mean_squared_error'.
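A minimal sketch of this step (the loop structure and variable names are illustrative; X and y come from the loading code above):

from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

# 80/20 train-test split; fixed random_state for reproducibility
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

# 5-fold CV of linear regression with polynomial features, degrees 1 to 5
for degree in range(1, 6):
    model = make_pipeline(PolynomialFeatures(degree=degree), LinearRegression())
    scores = cross_val_score(model, X_train, y_train, cv=5,
                             scoring='neg_mean_squared_error')
    print(f"degree={degree}: average CV MSE = {-scores.mean():.3f}")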
b. Cross-Validation for Ridge Regression with Grid Search
Ridge regression introduces a regularization term controlled by a hyperparameter α. Perform 5-fold cross-validation on the training set with Ridge regression, using polynomial features (degrees 1 to 5) and different values of α (e.g., 0.1, 1, 10, 100). Use grid search to find the best α and the optimal degree, and report the values that minimize the cross-validation error along with the corresponding average MSE.
Hint: Use GridSearchCV from scikit-learn to automate the search over the regularization parameter α and the polynomial degree.
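One way to set this up (a sketch; the poly__/ridge__ prefixes follow scikit-learn's Pipeline step-naming convention):

from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import Ridge

# Pipeline: polynomial feature expansion followed by Ridge regression
pipe = Pipeline([('poly', PolynomialFeatures()), ('ridge', Ridge())])

# Search jointly over the polynomial degree and the regularization strength
param_grid = {'poly__degree': [1, 2, 3, 4, 5],
              'ridge__alpha': [0.1, 1, 10, 100]}
search = GridSearchCV(pipe, param_grid, cv=5,
                      scoring='neg_mean_squared_error')
search.fit(X_train, y_train)

print("Best parameters:", search.best_params_)
print("Best average CV MSE:", -search.best_score_)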
c. Final Training and Test-Set Evaluation
Now that the best polynomial degree and regularization strength have been identified in part (b), train both the linear regression model and the Ridge regression model on the training set, and evaluate their MSE on the test set. Compare the performance of the two models, and discuss which model generalizes better to unseen data and why.
Hint: Use mean_squared_error from scikit-learn to evaluate the models on the test set.
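A sketch of the final evaluation (best_degree and best_alpha are hypothetical names for the values found by the grid search in part (b)):

from sklearn.metrics import mean_squared_error
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression, Ridge

# best_degree and best_alpha: hypothetical names for the grid-search results
models = {
    'Linear': make_pipeline(PolynomialFeatures(best_degree), LinearRegression()),
    'Ridge': make_pipeline(PolynomialFeatures(best_degree), Ridge(alpha=best_alpha)),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    mse = mean_squared_error(y_test, model.predict(X_test))
    print(f"{name} regression test MSE: {mse:.3f}")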
d. Theoretical Considerations
Explain why the regularization in Ridge regression helps prevent overfitting, especially when using polynomial
features. How does the choice of the regularization parameter α and the polynomial degree influence the model?
What might happen if α is too small or too large, or if the degree is too high?
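For reference in your discussion, Ridge regression (in the form scikit-learn uses) minimizes the least-squares loss plus a squared-$\ell_2$ penalty on the weights:
$$\min_{\vec{w}} \; \|\vec{y} - X\vec{w}\|_2^2 + \alpha \|\vec{w}\|_2^2$$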