
CS229 Problem Set #0 1

CS 229, Autumn 2016


Problem Set #0 Solutions: Linear Algebra and
Multivariable Calculus

Notes: (1) These questions require thought, but do not require long answers. Please be as
concise as possible. (2) If you have a question about this homework, we encourage you to post
your question on our Piazza forum, at https://piazza.com/stanford/autumn2016/cs229. (3)
If you missed the first lecture or are unfamiliar with the collaboration or honor code policy, please
read the policy on Handout #1 (available from the course website) before starting work. (4)
This specific homework is not graded, but we encourage you to solve each of the problems to
brush up on your linear algebra. Some of them may even be useful for subsequent problem sets.
It also serves as your introduction to using Gradescope for submissions.
If you are scanning your document by cellphone, please check the Piazza forum for recommended
cellphone scanning apps and best practices.

1. [0 points] Gradients and Hessians


Recall that a matrix A ∈ Rⁿˣⁿ is symmetric if Aᵀ = A, that is, Aᵢⱼ = Aⱼᵢ for all i, j. Also
recall the gradient ∇f(x) of a function f : Rⁿ → R, which is the n-vector of partial derivatives
\[
\nabla f(x) = \begin{bmatrix} \frac{\partial}{\partial x_1} f(x) \\ \vdots \\ \frac{\partial}{\partial x_n} f(x) \end{bmatrix},
\qquad \text{where} \quad
x = \begin{bmatrix} x_1 \\ \vdots \\ x_n \end{bmatrix}.
\]

The Hessian ∇²f(x) of a function f : Rⁿ → R is the n × n symmetric matrix of second partial
derivatives,
\[
\nabla^2 f(x) = \begin{bmatrix}
\frac{\partial^2}{\partial x_1^2} f(x) & \frac{\partial^2}{\partial x_1 \partial x_2} f(x) & \cdots & \frac{\partial^2}{\partial x_1 \partial x_n} f(x) \\
\frac{\partial^2}{\partial x_2 \partial x_1} f(x) & \frac{\partial^2}{\partial x_2^2} f(x) & \cdots & \frac{\partial^2}{\partial x_2 \partial x_n} f(x) \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial^2}{\partial x_n \partial x_1} f(x) & \frac{\partial^2}{\partial x_n \partial x_2} f(x) & \cdots & \frac{\partial^2}{\partial x_n^2} f(x)
\end{bmatrix}.
\]

(a) Let f(x) = ½ xᵀAx + bᵀx, where A is a symmetric matrix and b ∈ Rⁿ is a vector. What
is ∇f(x)?
Answer: In short, we know that ∇(½ xᵀAx) = Ax for a symmetric matrix A, while
∇(bᵀx) = b, so ∇f(x) = Ax + b when A is symmetric. In more detail, we have
\[
\frac{1}{2} x^T A x = \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} A_{ij} x_i x_j,
\]

so for each k = 1, …, n, we have
\[
\begin{aligned}
\frac{\partial}{\partial x_k} \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} A_{ij} x_i x_j
&\stackrel{(i)}{=} \frac{\partial}{\partial x_k} \frac{1}{2} \sum_{i=1, i \neq k}^{n} A_{ik} x_i x_k
 + \frac{\partial}{\partial x_k} \frac{1}{2} \sum_{j=1, j \neq k}^{n} A_{kj} x_k x_j
 + \frac{\partial}{\partial x_k} \frac{1}{2} A_{kk} x_k^2 \\
&\stackrel{(ii)}{=} \frac{1}{2} \sum_{i=1, i \neq k}^{n} A_{ik} x_i
 + \frac{1}{2} \sum_{j=1, j \neq k}^{n} A_{kj} x_j + A_{kk} x_k \\
&= \sum_{i=1}^{n} A_{ki} x_i,
\end{aligned}
\]
where step (i) follows because ∂/∂xₖ Aᵢⱼxᵢxⱼ = 0 if i ≠ k and j ≠ k, step (ii) by the definition
of a partial derivative, and the final equality because Aᵢⱼ = Aⱼᵢ for all pairs i, j. Thus
∇(½ xᵀAx) = Ax. To see that ∇(bᵀx) = b, note that
\[
\frac{\partial}{\partial x_k} b^T x = \frac{\partial}{\partial x_k} \sum_{i=1}^{n} b_i x_i = \frac{\partial}{\partial x_k} b_k x_k = b_k.
\]
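As a quick numerical sanity check of the closed form ∇f(x) = Ax + b, one can compare it against a central finite-difference gradient. The sketch below assumes NumPy is available; the particular matrix and vectors are arbitrary random test values, not part of the problem set.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4
# Random symmetric A and random b, x (illustrative test values only)
A = rng.standard_normal((n, n))
A = (A + A.T) / 2
b = rng.standard_normal(n)
x = rng.standard_normal(n)

f = lambda x: 0.5 * x @ A @ x + b @ x

# Central finite-difference approximation of each partial derivative
eps = 1e-6
grad_fd = np.array([(f(x + eps * e) - f(x - eps * e)) / (2 * eps)
                    for e in np.eye(n)])

grad_closed = A @ x + b  # the closed form derived above
assert np.allclose(grad_fd, grad_closed, atol=1e-5)
```

Since f is quadratic, the central difference agrees with Ax + b up to floating-point rounding.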

(b) Let f(x) = g(h(x)), where g : R → R is differentiable and h : Rⁿ → R is differentiable.
What is ∇f(x)?
Answer: In short, if g′ is the derivative of g, then the chain rule gives
\[
\nabla f(x) = g'(h(x)) \nabla h(x).
\]

Expanding this by components, we have for each i = 1, …, n that
\[
\frac{\partial}{\partial x_i} f(x) = \frac{\partial}{\partial x_i} g(h(x)) = g'(h(x)) \frac{\partial}{\partial x_i} h(x)
\]
by the chain rule. Stacking each of these in a column vector, we obtain
\[
\nabla f(x) = \begin{bmatrix}
g'(h(x)) \frac{\partial}{\partial x_1} h(x) \\ \vdots \\ g'(h(x)) \frac{\partial}{\partial x_n} h(x)
\end{bmatrix} = g'(h(x)) \nabla h(x).
\]
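The chain-rule formula can also be checked numerically. In the sketch below (assuming NumPy), g(t) = sin(t) and h(x) = ‖x‖² are arbitrary smooth choices made purely for illustration.

```python
import numpy as np

# Arbitrary smooth test functions: g(t) = sin(t), h(x) = ||x||^2
g, g_prime = np.sin, np.cos
h = lambda x: x @ x          # h : R^n -> R
grad_h = lambda x: 2 * x     # the known gradient of ||x||^2

f = lambda x: g(h(x))
x = np.array([0.3, -1.2, 0.5])

# Compare a finite-difference gradient against g'(h(x)) * grad h(x)
eps = 1e-6
grad_fd = np.array([(f(x + eps * e) - f(x - eps * e)) / (2 * eps)
                    for e in np.eye(x.size)])
grad_chain = g_prime(h(x)) * grad_h(x)
assert np.allclose(grad_fd, grad_chain, atol=1e-5)
```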

(c) Let f(x) = ½ xᵀAx + bᵀx, where A is symmetric and b ∈ Rⁿ is a vector. What is ∇²f(x)?
Answer: We have ∇²f(x) = A. To see this more formally, note that ∇²(bᵀx) = 0,
because the second derivatives of the terms bᵢxᵢ are all zero. Write A = [a⁽¹⁾ ⋯ a⁽ⁿ⁾], where
a⁽ⁱ⁾ ∈ Rⁿ is the ith column of A (because A is symmetric, the rows equal the columns, so
we also have A = [a⁽¹⁾ a⁽²⁾ ⋯ a⁽ⁿ⁾]ᵀ). Then we use part (1a) to obtain
\[
\frac{\partial}{\partial x_k} \Big( \frac{1}{2} x^T A x \Big) = (a^{(k)})^T x = \sum_{i=1}^{n} A_{ik} x_i,
\]
and thus
\[
\frac{\partial^2}{\partial x_k \, \partial x_i} \Big( \frac{1}{2} x^T A x \Big) = \frac{\partial}{\partial x_i} (a^{(k)})^T x = A_{ik}.
\]
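The identity ∇²f(x) = A can be verified with central second differences; since f is quadratic, the approximation is exact up to rounding. A sketch with arbitrary random test values, assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 3
A = rng.standard_normal((n, n))
A = (A + A.T) / 2                 # symmetric test matrix
b = rng.standard_normal(n)
x = rng.standard_normal(n)
f = lambda x: 0.5 * x @ A @ x + b @ x

# Central second differences: H[i,j] approximates d^2 f / dx_i dx_j
eps = 1e-4
I = np.eye(n)
H = np.empty((n, n))
for i in range(n):
    for j in range(n):
        H[i, j] = (f(x + eps * (I[i] + I[j])) - f(x + eps * (I[i] - I[j]))
                   - f(x - eps * (I[i] - I[j])) + f(x - eps * (I[i] + I[j]))) / (4 * eps**2)

assert np.allclose(H, A, atol=1e-4)
```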
(d) Let f(x) = g(aᵀx), where g : R → R is continuously differentiable and a ∈ Rⁿ is a vector.
What are ∇f(x) and ∇²f(x)? (Hint: your expression for ∇²f(x) may have as few as 11
symbols, including ′ and parentheses.)

Answer: We use the chain rule (part (1b)) to see that ∇f(x) = g′(aᵀx)a, because
∇(aᵀx) = a. Taking second derivatives, we have
\[
\frac{\partial}{\partial x_i} \frac{\partial}{\partial x_j} f(x) = \frac{\partial}{\partial x_i} \, g'(a^T x) a_j = g''(a^T x) a_i a_j.
\]
Expanding this in matrix form, we have
\[
\nabla^2 f(x) = g''(a^T x) \begin{bmatrix}
a_1^2 & a_1 a_2 & \cdots & a_1 a_n \\
a_2 a_1 & a_2^2 & \cdots & a_2 a_n \\
\vdots & \vdots & \ddots & \vdots \\
a_n a_1 & a_n a_2 & \cdots & a_n^2
\end{bmatrix} = g''(a^T x) \, a a^T.
\]
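As a numerical check of ∇²f(x) = g″(aᵀx)aaᵀ, the sketch below takes g(t) = eᵗ (so g″ = exp as well); a, x, and this choice of g are arbitrary illustrative values, assuming NumPy is available.

```python
import numpy as np

# g(t) = e^t, so g''(t) = e^t; a and x are arbitrary test values
a = np.array([1.0, -2.0, 0.5])
x = np.array([0.2, 0.1, -0.3])
f = lambda x: np.exp(a @ x)

# Central second differences of f
eps = 1e-4
n = a.size
I = np.eye(n)
H = np.empty((n, n))
for i in range(n):
    for j in range(n):
        H[i, j] = (f(x + eps * (I[i] + I[j])) - f(x + eps * (I[i] - I[j]))
                   - f(x - eps * (I[i] - I[j])) + f(x - eps * (I[i] + I[j]))) / (4 * eps**2)

H_closed = np.exp(a @ x) * np.outer(a, a)   # g''(a^T x) a a^T
assert np.allclose(H, H_closed, atol=1e-4)
```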

2. [0 points] Positive definite matrices


A matrix A ∈ Rⁿˣⁿ is positive semidefinite (PSD), denoted A ⪰ 0, if A = Aᵀ and xᵀAx ≥ 0
for all x ∈ Rⁿ. A matrix A is positive definite, denoted A ≻ 0, if A = Aᵀ and xᵀAx > 0 for
all x ≠ 0, that is, all non-zero vectors x. The simplest example of a positive definite matrix is
the identity I (the diagonal matrix with 1s on the diagonal and 0s elsewhere), which satisfies
\[
x^T I x = \|x\|_2^2 = \sum_{i=1}^{n} x_i^2.
\]
(a) Let z ∈ Rⁿ be an n-vector. Show that A = zzᵀ is positive semidefinite.
Answer: Take any x ∈ Rⁿ. Then xᵀAx = xᵀzzᵀx = (xᵀz)² ≥ 0, since (xᵀz) is a real number.
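The argument xᵀ(zzᵀ)x = (xᵀz)² ≥ 0 can be observed numerically; the sketch below uses random test vectors and assumes NumPy.

```python
import numpy as np

rng = np.random.default_rng(2)
z = rng.standard_normal(5)
A = np.outer(z, z)               # A = z z^T

# x^T A x = (x^T z)^2 >= 0 for many random x, up to rounding
for _ in range(100):
    x = rng.standard_normal(5)
    q = x @ A @ x
    assert q >= -1e-9
    assert np.isclose(q, (x @ z) ** 2)

# Equivalently, every eigenvalue of A is nonnegative
assert np.all(np.linalg.eigvalsh(A) >= -1e-9)
```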
(b) Let z ∈ Rⁿ be a non-zero n-vector. Let A = zzᵀ. What is the null space of A? What is
the rank of A?
Answer: The null space of A is the set of vectors orthogonal to z: if zᵀx = 0, then
x ∈ Null(A), because Ax = zzᵀx = 0; conversely, if zᵀx ≠ 0, then Ax = z(zᵀx) ≠ 0 because
z is non-zero. Thus the null space of A has dimension n − 1 and the rank of A is 1. (In the
edge case n = 1, the null space is the trivial subspace {0}.)
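These rank and null-space facts are easy to confirm for a concrete vector; the values of z and x below are arbitrary, chosen so that zᵀx = 0, and the sketch assumes NumPy.

```python
import numpy as np

z = np.array([1.0, 2.0, -1.0, 0.5])  # arbitrary non-zero test vector
A = np.outer(z, z)

# rank(z z^T) = 1
assert np.linalg.matrix_rank(A) == 1

# Any x orthogonal to z lies in the null space: here z^T x = 1*2 + 2*(-1) = 0
x = np.array([2.0, -1.0, 0.0, 0.0])
assert np.isclose(z @ x, 0.0)
assert np.allclose(A @ x, 0.0)

# Null space dimension is n - rank = n - 1
n = z.size
assert n - np.linalg.matrix_rank(A) == n - 1
```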
(c) Let A ∈ Rⁿˣⁿ be positive semidefinite and B ∈ Rᵐˣⁿ be arbitrary, where m, n ∈ N. Is
BABᵀ PSD? If so, prove it. If not, give a counterexample with explicit A, B.
Answer: Yes, BABᵀ is positive semidefinite. For any x ∈ Rᵐ, we may define v = Bᵀx ∈
Rⁿ. Then
\[
x^T B A B^T x = (B^T x)^T A (B^T x) = v^T A v \ge 0,
\]
where the inequality follows because vᵀAv ≥ 0 for any vector v, as A is PSD.
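The conclusion can be spot-checked numerically: generate a PSD A (CᵀC is a convenient construction), an arbitrary B, and inspect the eigenvalues of BABᵀ. All values below are random test data, assuming NumPy.

```python
import numpy as np

rng = np.random.default_rng(3)
m, n = 3, 5
C = rng.standard_normal((n, n))
A = C.T @ C                       # C^T C is always PSD
B = rng.standard_normal((m, n))   # arbitrary m x n matrix

M = B @ A @ B.T
assert np.allclose(M, M.T)                      # symmetric
assert np.all(np.linalg.eigvalsh(M) >= -1e-9)   # eigenvalues >= 0, so PSD
```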
3. [0 points] Eigenvectors, eigenvalues, and the spectral theorem
The eigenvalues of a matrix A ∈ Rⁿˣⁿ are the roots of the characteristic polynomial
p_A(λ) = det(λI − A), which may (in general) be complex. They are also defined as the
values λ ∈ C for which there exists a non-zero vector x ∈ Cⁿ such that Ax = λx. We call such
a pair (x, λ) an eigenvector, eigenvalue pair. In this question, we use the notation diag(λ₁, …, λₙ)
to denote the diagonal matrix with diagonal entries λ₁, …, λₙ, that is,
\[
\mathrm{diag}(\lambda_1, \ldots, \lambda_n) = \begin{bmatrix}
\lambda_1 & 0 & 0 & \cdots & 0 \\
0 & \lambda_2 & 0 & \cdots & 0 \\
0 & 0 & \lambda_3 & \cdots & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
0 & 0 & 0 & \cdots & \lambda_n
\end{bmatrix}.
\]

(a) Suppose that the matrix A ∈ Rⁿˣⁿ is diagonalizable, that is, A = TΛT⁻¹ for an invertible
matrix T ∈ Rⁿˣⁿ, where Λ = diag(λ₁, …, λₙ) is diagonal. Use the notation t⁽ⁱ⁾ for the
columns of T, so that T = [t⁽¹⁾ ⋯ t⁽ⁿ⁾], where t⁽ⁱ⁾ ∈ Rⁿ. Show that At⁽ⁱ⁾ = λᵢt⁽ⁱ⁾, so
that the eigenvector/eigenvalue pairs of A are (t⁽ⁱ⁾, λᵢ).
Answer: The matrix T is invertible, so if we let t⁽ⁱ⁾ be the ith column of T, we have
\[
I_{n \times n} = T^{-1} T = T^{-1} \begin{bmatrix} t^{(1)} & t^{(2)} & \cdots & t^{(n)} \end{bmatrix}
= \begin{bmatrix} T^{-1} t^{(1)} & T^{-1} t^{(2)} & \cdots & T^{-1} t^{(n)} \end{bmatrix},
\]
so that
\[
T^{-1} t^{(i)} = \begin{bmatrix} \underbrace{0 \cdots 0}_{i-1 \text{ times}} & 1 & \underbrace{0 \cdots 0}_{n-i \text{ times}} \end{bmatrix}^T \in \{0, 1\}^n,
\]
the ith standard basis vector, which we denote by e⁽ⁱ⁾ (that is, the vector of all zeros except
for a 1 in its ith position). Thus
\[
\Lambda T^{-1} t^{(i)} = \Lambda e^{(i)} = \lambda_i e^{(i)}, \qquad \text{and} \qquad
T \Lambda T^{-1} t^{(i)} = \lambda_i T e^{(i)} = \lambda_i t^{(i)}.
\]
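The conclusion At⁽ⁱ⁾ = λᵢt⁽ⁱ⁾ can be verified by constructing a diagonalizable matrix explicitly. In the sketch below (assuming NumPy), T is a random matrix, invertible with probability 1, and the eigenvalues are arbitrary test values.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 4
# Build A = T Lambda T^{-1} from a random (almost surely invertible) T
T = rng.standard_normal((n, n))
lam = np.array([1.0, -2.0, 0.5, 3.0])     # arbitrary eigenvalues
A = T @ np.diag(lam) @ np.linalg.inv(T)

# Each column t^(i) of T is an eigenvector of A with eigenvalue lam[i]
for i in range(n):
    assert np.allclose(A @ T[:, i], lam[i] * T[:, i], atol=1e-6)
```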

A matrix U ∈ Rⁿˣⁿ is orthogonal if UᵀU = I. The spectral theorem, perhaps one of the most
important theorems in linear algebra, states that if A ∈ Rⁿˣⁿ is symmetric, that is, A = Aᵀ,
then A is diagonalizable by a real orthogonal matrix. That is, there are a diagonal matrix
Λ ∈ Rⁿˣⁿ and an orthogonal matrix U ∈ Rⁿˣⁿ such that UᵀAU = Λ, or, equivalently,
\[
A = U \Lambda U^T.
\]

Let λᵢ = λᵢ(A) denote the ith eigenvalue of A.


(b) Let A be symmetric. Show that if U = [u⁽¹⁾ ⋯ u⁽ⁿ⁾] is orthogonal, where u⁽ⁱ⁾ ∈
Rⁿ and A = UΛUᵀ, then u⁽ⁱ⁾ is an eigenvector of A and Au⁽ⁱ⁾ = λᵢu⁽ⁱ⁾, where Λ =
diag(λ₁, …, λₙ).
Answer: Once we see that U⁻¹ = Uᵀ because UᵀU = I, this is simply a repeated
application of part (3a).
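The spectral theorem is exactly what NumPy's `eigh` computes for a symmetric matrix: an orthogonal U and the diagonal of Λ with A = UΛUᵀ. A sketch with a random symmetric test matrix, assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(5)
n = 4
S = rng.standard_normal((n, n))
A = (S + S.T) / 2                     # symmetric test matrix

# eigh returns eigenvalues (diagonal of Lambda) and orthogonal U
lam, U = np.linalg.eigh(A)
assert np.allclose(U.T @ U, np.eye(n), atol=1e-10)        # U is orthogonal
assert np.allclose(A, U @ np.diag(lam) @ U.T, atol=1e-10) # A = U Lambda U^T
for i in range(n):
    assert np.allclose(A @ U[:, i], lam[i] * U[:, i], atol=1e-10)
```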
(c) Show that if A is PSD, then λᵢ(A) ≥ 0 for each i.
Answer: Since A = Aᵀ, we have A = UΛUᵀ for an orthogonal matrix U ∈ Rⁿˣⁿ by the
spectral theorem. Take the ith eigenvector u⁽ⁱ⁾ (the ith column of U). Because U is
orthogonal, Uᵀu⁽ⁱ⁾ = e⁽ⁱ⁾, the ith standard basis vector. Using this and the fact that A is
PSD, we have
\[
0 \le (u^{(i)})^T A u^{(i)} = (U^T u^{(i)})^T \Lambda \, U^T u^{(i)} = (e^{(i)})^T \Lambda e^{(i)} = \lambda_i(A).
\]
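The implication "A PSD ⇒ all eigenvalues nonnegative" is easy to observe numerically; the sketch below builds a PSD matrix as CᵀC from random test data, assuming NumPy.

```python
import numpy as np

rng = np.random.default_rng(6)
n = 4
C = rng.standard_normal((n, n))
A = C.T @ C                      # C^T C is PSD: x^T C^T C x = ||Cx||^2 >= 0

lam = np.linalg.eigvalsh(A)      # eigenvalues of the symmetric matrix A
assert np.all(lam >= -1e-10)     # all nonnegative, up to rounding
```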
