
DELFT UNIVERSITY OF TECHNOLOGY

Faculty of Electrical Engineering, Mathematics and Computer Science

EXERCISE SHEET 8 – WI4635 LINEAR ALGEBRA AND OPTIMIZATION FOR MACHINE LEARNING

Exercise 1
Show that the single-layer perceptron model

p (x) = a⊤ σ (bx + c) (1)

with σ(x) = max{0, x} and a, b, c ∈ Rn for some n ∈ N can represent any continuous
piecewise linear function. You may proceed as follows:

(a) Let 0 = x0 < . . . < xN = 1 be a set of grid points, and let C(I) and P1(I) be the spaces
of continuous and linear functions on the interval I, respectively. Show that any function

f ∈ C([0, 1]) with f ∈ P1([xi, xi+1]) ∀i ∈ {0, . . . , N − 1}

can be written as a linear combination of the hat functions (Figure 1) defined as

    p_i(x) = \begin{cases}
        0, & x \le x_{i-1}, \\
        \frac{x - x_{i-1}}{x_i - x_{i-1}}, & x \in [x_{i-1}, x_i], \\
        \frac{x - x_{i+1}}{x_i - x_{i+1}}, & x \in [x_i, x_{i+1}], \\
        0, & x \ge x_{i+1}.
    \end{cases}

Figure 1: Hat function p_i (piecewise linear, equal to 1 at x_i and 0 at x_{i−1} and x_{i+1}).

(b) Construct the hat function pi using functions of the form eq. (1).

Exercise 2
Show that a single-layer perceptron is an over-parametrized model. That is, consider models
of the form
p (x) = a⊤ σ (bx + c) (2)

with x ∈ R, σ(y) = max{0, y}, and a, b, c ∈ Rn; we concatenate the parameter vectors as
follows:

    \begin{pmatrix} a \\ b \\ c \end{pmatrix} \in \mathbb{R}^{3n}.
In the lecture, we have already noted that a simultaneous permutation of the entries of the
parameter vectors a, b, and c leads to the same model. In this exercise, construct a subspace (dimension > 0) of the space
of concatenated parameter vectors, R3n , that yields the model

p(x) = σ(x)

using the representation eq. (2); to simplify the discussion, you may choose any specific n.

Exercise 3
Consider the following theorem from the book Linear Algebra and Learning from Data by
Gilbert Strang (Wellesley-Cambridge Press, 2019):

Theorem 1 For v ∈ Rm, suppose the graph of F(v) has folds along N hyperplanes H1, . . . , HN.
Those come from N linear equations a_i^⊤ v + b_i = 0, in other words from ReLU at N neurons.
Then the number of linear pieces of F and regions bounded by the N hyperplanes is r(N, m):

    r(N, m) = \sum_{i=0}^{m} \binom{N}{i} = \binom{N}{0} + \dots + \binom{N}{m}.

The binomial coefficients are

    \binom{N}{i} = \frac{N!}{i!\,(N - i)!}

with 0! = 1, \binom{N}{0} = 1, and \binom{N}{i} = 0 for i > N.

(a) Prove the recursion formula

r(N, m) = r(N − 1, m) + r(N − 1, m − 1).

(b) Using the theorem, compute the numbers r(N, m) for one-, two-, and three-dimensional
input vectors.
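
As a quick numerical check (not part of the proof), here is a minimal Python sketch; the helper name r is an arbitrary choice, and math.comb already returns 0 for i > N:

    from math import comb

    def r(N: int, m: int) -> int:
        # r(N, m) = sum_{i=0}^{m} binomial(N, i); comb(N, i) = 0 for i > N.
        return sum(comb(N, i) for i in range(m + 1))

    # Check the recursion from part (a) for a few values.
    for N in range(1, 9):
        for m in range(1, 4):
            assert r(N, m) == r(N - 1, m) + r(N - 1, m - 1)

    # Numbers of linear pieces for one-, two-, and three-dimensional inputs.
    for m in (1, 2, 3):
        print(f"m = {m}:", [r(N, m) for N in range(1, 9)])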

Exercise 4

(a) Using Python, verify numerically that


    \frac{1}{N} \sum_{i=0}^{N} f(x_i) \;\longrightarrow\; \int_0^1 f(x)\,dx

for N → ∞, where the xi are sampled from a uniform distribution U(0, 1). Choose a
fourth-order polynomial and compare the numerical approximation with the exact value of
the integral.
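
A minimal sketch of part (a), assuming NumPy; the particular fourth-order polynomial below is an arbitrary choice:

    import numpy as np

    rng = np.random.default_rng(0)

    def f(x):
        # An arbitrary fourth-order polynomial.
        return x**4 - 2 * x**2 + 3 * x

    exact = 1 / 5 - 2 / 3 + 3 / 2  # integral of f over [0, 1], computed by hand

    for N in (10, 100, 1_000, 10_000, 100_000):
        x = rng.uniform(0.0, 1.0, size=N)
        estimate = f(x).mean()
        print(f"N = {N:6d}, estimate = {estimate:.6f}, error = {abs(estimate - exact):.2e}")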
(b) Use the same approach to compute an approximation of π by approximating
    \int_{[0,1]^2} \chi_C(x) \, dx,

where

    \chi_C(x) = \begin{cases} 1, & \text{for } \|x\|_2 \le 1, \\ 0, & \text{otherwise}. \end{cases}
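
For part (b), the same Monte Carlo idea applies: the integral of χ_C over [0, 1]^2 equals π/4, so four times the fraction of samples falling into the quarter disk approximates π. A minimal sketch, again assuming NumPy:

    import numpy as np

    rng = np.random.default_rng(0)
    N = 1_000_000
    x = rng.uniform(0.0, 1.0, size=(N, 2))      # samples in [0, 1]^2
    inside = np.linalg.norm(x, axis=1) <= 1.0   # indicator chi_C(x)
    print("pi ≈", 4.0 * inside.mean())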

Exercise 5
Prove a sub-optimal approximation result, which uses a stronger assumption on f than just
continuity:

(a) Show that, for f ∈ C^2(Ii) and the linear interpolant p on Ii = [xi, xi+1] with

p(xi ) = f (xi ) and p(xi+1 ) = f (xi+1 ),

there exists some ξ ∈ Ii such that

    \max_{x \in I_i} |f(x) - p(x)| \le \frac{h_i^2}{8} \, |f''(\xi)|,
with hi = xi+1 − xi .

(b) Let f ∈ C^2([0, 1]) be arbitrary and σ(x) = max{0, x}. Prove that, for every ε > 0,
there exists some n ∈ N and a function

P (x) = a⊤ σ (bx + c) ,

with a, b, c ∈ Rn , such that


\max_{x \in [0,1]} |f(x) - P(x)| < \varepsilon.

Hint:

(a) Construct the function

F (x) = f (x) − p(x) + K(x − xi )(x − xi+1 )

with K ∈ R, and choose K such that F(x̄) = 0 for an arbitrary x̄ ∈ (xi, xi+1). Then, apply Rolle’s theorem repeatedly.

Exercise 6
Implement a multi-layer perceptron model with two hidden layers, sigmoid activation func-
tion, and the following architecture using Python:

• input dimension: 2

• dimension of each hidden layer: 4

• output dimension: 1

You may use any deep learning library to implement the model or implement it yourself.

In order to initialize the weights, compare the following two strategies discussed in the
lecture:
 
• Uniform distribution: U(−1/√n_i, 1/√n_i),

• Glorot and Bengio: U(−√(6/(n_i + n_{i+1})), √(6/(n_i + n_{i+1}))).

Initialize the bias vectors with zero. Without training the network, perform the following
tasks:

(a) Visualize the output of the neural networks after initialization for five different random
seeds.

(b) Visualize the value of the loss function depending on the diagonal entries of the weight
matrix in the first hidden layer. To this end, use the loss functions

    Mean squared error (MSE):  L_{\mathrm{MSE}}\big(NN^{\sigma}_{W,b}(x_i), y_i\big) = \frac{1}{N} \sum_{i=1}^{N} \big\| NN^{\sigma}_{W,b}(x_i) - y_i \big\|_2^2,

    Mean absolute error (MAE):  L_{\mathrm{MAE}}\big(NN^{\sigma}_{W,b}(x_i), y_i\big) = \frac{1}{N} \sum_{i=1}^{N} \big\| NN^{\sigma}_{W,b}(x_i) - y_i \big\|_1.

In order to compute the input and output data, take 100 points randomly sampled from
{x ∈ R^2 | x_1^2 + x_2^2 ≤ 1} as input and the squared norm as corresponding output:

    y = x_1^2 + x_2^2.
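
A minimal NumPy sketch of the untrained 2-4-4-1 network and the two initializations (the helper names init_uniform, init_glorot, and mlp are illustrative choices; keeping the output layer linear is an assumption, and any deep learning library would do equally well):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def init_uniform(n_in, n_out, rng):
        # U(-1/sqrt(n_i), 1/sqrt(n_i)), with n_i the input dimension of the layer.
        limit = 1.0 / np.sqrt(n_in)
        return rng.uniform(-limit, limit, size=(n_out, n_in))

    def init_glorot(n_in, n_out, rng):
        # Glorot and Bengio: U(-sqrt(6/(n_i + n_{i+1})), sqrt(6/(n_i + n_{i+1}))).
        limit = np.sqrt(6.0 / (n_in + n_out))
        return rng.uniform(-limit, limit, size=(n_out, n_in))

    def mlp(x, weights, biases):
        # Sigmoid activations in the hidden layers, linear output layer.
        a = x
        for W, b in zip(weights[:-1], biases[:-1]):
            a = sigmoid(a @ W.T + b)
        return a @ weights[-1].T + biases[-1]

    layers = [2, 4, 4, 1]                    # input, two hidden layers, output
    rng = np.random.default_rng(0)

    # 100 random points in the unit disk as input, squared norm as output.
    angle = rng.uniform(0.0, 2.0 * np.pi, 100)
    radius = np.sqrt(rng.uniform(0.0, 1.0, 100))
    X = np.stack([radius * np.cos(angle), radius * np.sin(angle)], axis=1)
    y = (X**2).sum(axis=1, keepdims=True)

    for name, init in (("uniform", init_uniform), ("Glorot", init_glorot)):
        weights = [init(m, n, rng) for m, n in zip(layers[:-1], layers[1:])]
        biases = [np.zeros(n) for n in layers[1:]]
        pred = mlp(X, weights, biases)
        print(name, "MSE after initialization:", np.mean((pred - y) ** 2))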

Exercise 7
For the two functions

(a)
f (x, y, θ, r) = x + iy + r(cos(θ) + i sin(θ))

(b) A multi-layer perceptron model with two hidden layers

• input dimension: 4
• dimension of each hidden layer: 2
• output dimension: 1

visualize the computational graph. Then, perform forward and backward propagation for
the input  
    \begin{pmatrix} 1 \\ 1 \\ \pi/2 \\ 2 \end{pmatrix}.
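
A minimal sketch of the forward and backward pass for the function in (a), assuming PyTorch; the real and imaginary parts are treated as two separate outputs, and summing them before calling backward() is just one convenient choice of scalar for illustrating backpropagation:

    from math import pi
    import torch

    x = torch.tensor(1.0, requires_grad=True)
    y = torch.tensor(1.0, requires_grad=True)
    theta = torch.tensor(pi / 2, requires_grad=True)
    r = torch.tensor(2.0, requires_grad=True)

    # Forward propagation: f = (x + r*cos(theta)) + i*(y + r*sin(theta)).
    real = x + r * torch.cos(theta)
    imag = y + r * torch.sin(theta)

    # Backward propagation through the scalar real + imag.
    (real + imag).backward()
    print("forward: ", real.item(), imag.item())
    print("backward:", x.grad.item(), y.grad.item(), theta.grad.item(), r.grad.item())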

Exercise 8

(a) Show that the discrete convolution operation is linear.


(b) Let

    D = \begin{pmatrix}
        d_{11} & d_{12} & d_{13} & d_{14} \\
        d_{21} & d_{22} & d_{23} & d_{24} \\
        d_{31} & d_{32} & d_{33} & d_{34} \\
        d_{41} & d_{42} & d_{43} & d_{44}
    \end{pmatrix}
    \quad \text{and} \quad
    d = \begin{pmatrix} d_{11} \\ \vdots \\ d_{14} \\ d_{21} \\ \vdots \\ d_{44} \end{pmatrix}
be the matrix and vector representations of the same data. Derive the sparse matrix A
corresponding to the discrete convolution with the kernel
 
    K = \begin{pmatrix} 0 & 1 & 0 \\ 1 & -4 & 1 \\ 0 & 1 & 0 \end{pmatrix},
such that
Ad and D∗K
are equivalent. How sparse is the matrix? How does the number of nonzeros relate to
the size of the kernel? How would the sparsity change if D ∈ R^{100×4} instead?
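
A minimal sketch for checking the construction numerically, assuming NumPy/SciPy, row-by-row (row-major) vectorization as in the vector d above, and zero padding at the boundary ("same" convolution); since K is symmetric under a 180° rotation, convolution and correlation coincide here:

    import numpy as np
    from scipy.signal import convolve2d
    from scipy.sparse import lil_matrix

    n_rows, n_cols = 4, 4
    K = np.array([[0.0, 1.0, 0.0],
                  [1.0, -4.0, 1.0],
                  [0.0, 1.0, 0.0]])

    # Build A row by row: one row of A per output pixel (row-major ordering).
    A = lil_matrix((n_rows * n_cols, n_rows * n_cols))
    for i in range(n_rows):
        for j in range(n_cols):
            row = i * n_cols + j
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ii, jj = i + di, j + dj
                    if 0 <= ii < n_rows and 0 <= jj < n_cols:
                        A[row, ii * n_cols + jj] = K[di + 1, dj + 1]

    D = np.arange(1.0, 17.0).reshape(n_rows, n_cols)   # example data
    d = D.reshape(-1)                                   # row-major vectorization

    assert np.allclose((A @ d).reshape(n_rows, n_cols),
                       convolve2d(D, K, mode="same"))
    print("nonzeros in A:", A.nnz, "out of", A.shape[0] * A.shape[1], "entries")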

Exercise 9

(a) Find a solution of the boundary value problem: find u such that

    \frac{\partial}{\partial x} \left( \frac{1}{x + 1} \frac{\partial u}{\partial x} \right) = 1 \quad \text{in } [0, 1],
    \qquad u(0) = 0,
    \qquad u(1) = \frac{7}{3}.

(b) Give an example for the functions a and b in the ansatz

    u(x) = a(x) + b(x) \, NN^{\sigma}_{W,b}(x)

and the loss function that allow for solving the BVP using the method of Lagaris et al.
(c) Replace the ansatz by a data loss and formalize the corresponding PINN loss function.

Exercise 10
Test the example examples/pinn_forward/Poisson_Dirichlet_1d.py from the DeepXDE
library (https://github.com/lululxvi/deepxde) for solving a one-dimensional Poisson problem
using PINNs.
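
Before (or instead of) installing DeepXDE, the mechanics can be reproduced with a library-free PINN sketch, assuming PyTorch; the concrete problem below, −u''(x) = 1 on (0, 1) with u(0) = u(1) = 0, is an arbitrary choice and not necessarily the problem solved in the repository example:

    import torch

    torch.manual_seed(0)
    net = torch.nn.Sequential(
        torch.nn.Linear(1, 32), torch.nn.Tanh(),
        torch.nn.Linear(32, 32), torch.nn.Tanh(),
        torch.nn.Linear(32, 1),
    )
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)

    x_int = torch.rand(64, 1)                 # collocation points in (0, 1)
    x_bnd = torch.tensor([[0.0], [1.0]])      # Dirichlet boundary points

    for step in range(5000):
        opt.zero_grad()
        x = x_int.clone().requires_grad_(True)
        u = net(x)
        du = torch.autograd.grad(u, x, torch.ones_like(u), create_graph=True)[0]
        d2u = torch.autograd.grad(du, x, torch.ones_like(du), create_graph=True)[0]
        loss_pde = ((-d2u - 1.0) ** 2).mean()  # residual of -u'' = 1
        loss_bc = (net(x_bnd) ** 2).mean()     # u(0) = u(1) = 0
        (loss_pde + loss_bc).backward()
        opt.step()

    # Exact solution of this toy problem is u(x) = x(1 - x)/2.
    x_test = torch.linspace(0.0, 1.0, 5).reshape(-1, 1)
    print(net(x_test).detach().flatten())
    print((x_test * (1.0 - x_test) / 2.0).flatten())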
