
Mathematical Toolkit Autumn 2021

Lecture 8: October 21, 2021


Lecturer: Madhur Tulsiani

1 Applications of SVD: least squares approximation

We discuss another application of singular value decomposition (SVD) of matrices. Let
a_1, . . . , a_n ∈ R^d be points which we want to fit to a low-dimensional subspace. The goal
is to find a subspace S of R^d of dimension at most k to minimize ∑_{i=1}^n (dist(a_i, S))^2, where
dist(a_i, S) denotes the distance of a_i from the closest point in S. We first prove the following.

Claim 1.1 Let u_1, . . . , u_k be an orthonormal basis for S. Then

    (dist(a_i, S))^2 = ∥a_i∥_2^2 − ∑_{j=1}^k ⟨a_i, u_j⟩^2 .

Proof: Complete u_1, . . . , u_k to an orthonormal basis u_1, . . . , u_d of R^d by adding vectors
u_{k+1}, . . . , u_d. For any point v ∈ R^d, there exist c_1, . . . , c_d ∈ R such that v = ∑_{j=1}^d c_j · u_j.
To find the distance dist(v, S) = min_{u∈S} ∥v − u∥, we need to find the point u ∈ S which is closest to v.
Let u = ∑_{j=1}^k b_j · u_j be an arbitrary point in S (any u ∈ S can be written in this form, since
u_1, . . . , u_k form a basis for S). We have that
    ∥v − u∥^2 = ∥ ∑_{j=1}^k (c_j − b_j) · u_j + ∑_{j=k+1}^d c_j · u_j ∥^2 = ∑_{j=1}^k (c_j − b_j)^2 + ∑_{j=k+1}^d c_j^2 ,

which is minimized when b_j = c_j for all j ∈ [k]. Thus, the closest point u ∈ S to v =
∑_{j=1}^d c_j · u_j is given by u = ∑_{j=1}^k c_j · u_j, with v − u = ∑_{j=k+1}^d c_j · u_j. Moreover, since u_1, . . . , u_d
form an orthonormal basis, we have c_j = ⟨u_j, v⟩ for all j ∈ [d], which gives

    ∥v − u∥_2^2 = ∑_{j=k+1}^d c_j^2 = ∑_{j=1}^d c_j^2 − ∑_{j=1}^k c_j^2 = ∥v∥_2^2 − ∑_{j=1}^k ⟨u_j, v⟩^2 .

Using the above for each a_i (as the point v) completes the proof.
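As a quick numerical sanity check of Claim 1.1 (an illustrative sketch added here, not part of the original notes; all names and parameter choices below are arbitrary), one can compare the distance computed via orthogonal projection with the right-hand side of the claim, e.g. using numpy:

    # Sanity check of Claim 1.1: dist(a, S)^2 = ||a||^2 - sum_j <a, u_j>^2
    import numpy as np

    rng = np.random.default_rng(0)
    d, k = 10, 3

    a = rng.standard_normal(d)
    # Orthonormal basis u_1, ..., u_k for a random k-dimensional subspace S of R^d.
    U, _ = np.linalg.qr(rng.standard_normal((d, k)))   # U has orthonormal columns, shape (d, k)

    proj = U @ (U.T @ a)                               # closest point to a in S
    dist_sq_direct = np.sum((a - proj) ** 2)           # dist(a, S)^2 computed directly
    dist_sq_claim = np.sum(a ** 2) - np.sum((U.T @ a) ** 2)

    print(dist_sq_direct, dist_sq_claim)               # the two values agree up to rounding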

Thus, the goal is to find a set of k orthonormal vectors u_1, . . . , u_k to maximize the quantity
∑_{i=1}^n ∑_{j=1}^k ⟨a_i, u_j⟩^2. Let A ∈ R^{n×d} be the matrix with ith row equal to a_i^T. Then, we need
to find orthonormal vectors u_1, . . . , u_k to maximize ∥Au_1∥_2^2 + · · · + ∥Au_k∥_2^2. We will prove
the following.

Proposition 1.2 Let v_1, . . . , v_r be the right singular vectors of A corresponding to singular values
σ_1 ≥ · · · ≥ σ_r > 0. Then, for all k ≤ r and all orthonormal sets of vectors u_1, . . . , u_k,

    ∥Au_1∥_2^2 + · · · + ∥Au_k∥_2^2 ≤ ∥Av_1∥_2^2 + · · · + ∥Av_k∥_2^2 .

Thus, the optimal solution is to take S = Span(v_1, . . . , v_k). We prove the above by induction
on k. For k = 1, we note that

    ∥Au_1∥_2^2 = ⟨A^T A u_1, u_1⟩ ≤ max_{v ∈ R^d \ {0}} R_{A^T A}(v) = σ_1^2 = ∥Av_1∥_2^2 ,

where R_{A^T A} denotes the Rayleigh quotient of A^T A.

To prove the induction step for a given k ≤ r, define

    V_{k−1}^⊥ = { v ∈ R^d | ⟨v, v_i⟩ = 0 ∀i ∈ [k − 1] } .

We first prove the following claim.

Claim 1.3 Given an orthonormal set u_1, . . . , u_k, there exist orthonormal vectors u_1', . . . , u_k' such
that

- u_k' ∈ V_{k−1}^⊥ .

- Span(u_1, . . . , u_k) = Span(u_1', . . . , u_k') .

- ∥Au_1∥_2^2 + · · · + ∥Au_k∥_2^2 = ∥Au_1'∥_2^2 + · · · + ∥Au_k'∥_2^2 .

Proof: We only provide a sketch of the proof here. Let S = Span({u_1, . . . , u_k}). Note that
dim(V_{k−1}^⊥) = d − k + 1 (why?) and dim(S) = k. Thus,

    dim(V_{k−1}^⊥ ∩ S) ≥ k + (d − k + 1) − d = 1 .

Hence, there exists u_k' ∈ V_{k−1}^⊥ ∩ S with ∥u_k'∥ = 1. Completing this to an orthonormal basis
of S gives orthonormal u_1', . . . , u_k' with the first and second properties. We claim that this
already implies the third property (why?).

Thus, we can assume without loss of generality that the given vectors u_1, . . . , u_k are such
that u_k ∈ V_{k−1}^⊥. Hence,

    ∥Au_k∥_2^2 ≤ max_{v ∈ V_{k−1}^⊥, ∥v∥=1} ∥Av∥_2^2 = σ_k^2 = ∥Av_k∥_2^2 .

Also, by the inductive hypothesis, we have that

    ∥Au_1∥_2^2 + · · · + ∥Au_{k−1}∥_2^2 ≤ ∥Av_1∥_2^2 + · · · + ∥Av_{k−1}∥_2^2 ,

which completes the proof. The above proof can also be used to prove that the SVD gives the
best rank-k approximation to the matrix A in Frobenius norm. We will see this in the next
homework.
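The following is a minimal illustrative sketch of Proposition 1.2 (added here, not part of the original notes; the setup and parameters are arbitrary): using numpy's SVD, the top-k right singular vectors achieve at least as large a value of ∥Au_1∥_2^2 + · · · + ∥Au_k∥_2^2 as a random orthonormal set.

    # Proposition 1.2: the top-k right singular vectors maximize sum_j ||A u_j||^2.
    import numpy as np

    rng = np.random.default_rng(1)
    n, d, k = 50, 8, 2

    A = rng.standard_normal((n, d))
    _, _, Vt = np.linalg.svd(A, full_matrices=False)   # rows of Vt are v_1, ..., v_r
    V_k = Vt[:k].T                                     # top-k right singular vectors as columns

    def objective(U):
        """sum_j ||A u_j||^2 for the orthonormal columns u_j of U."""
        return np.sum((A @ U) ** 2)

    # Compare against a random orthonormal set u_1, ..., u_k.
    U_rand, _ = np.linalg.qr(rng.standard_normal((d, k)))
    print(objective(V_k) >= objective(U_rand) - 1e-9)  # True: the singular vectors do at least as well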

2 Bounding the eigenvalues: Gershgorin Disc Theorem

We will now see a simple but extremely useful bound on the eigenvalues of a matrix, given
by the Gershgorin disc theorem. Many useful variants of this bound can also be derived
from the observation that for any invertible matrix S, the matrices S^{-1}MS and M have the
same eigenvalues (prove it!).
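As a quick numerical illustration of this observation (a sketch added here, not part of the notes), one can check with numpy that S^{-1}MS and M have the same spectrum for a random (hence almost surely invertible) S:

    # Similar matrices S^{-1} M S and M have the same eigenvalues.
    import numpy as np

    rng = np.random.default_rng(4)
    n = 4
    M = rng.standard_normal((n, n))
    S = rng.standard_normal((n, n))             # a random S is invertible with probability 1

    eig_M = np.sort_complex(np.linalg.eigvals(M))
    eig_sim = np.sort_complex(np.linalg.eigvals(np.linalg.inv(S) @ M @ S))
    print(np.allclose(eig_M, eig_sim))          # True up to floating-point error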
Theorem 2.1 (Gershgorin Disc Theorem) Let M ∈ C^{n×n}. Let R_i = ∑_{j≠i} |M_{ij}|. Define the
set

    Disc(M_{ii}, R_i) := { z ∈ C | |z − M_{ii}| ≤ R_i } .

If λ is an eigenvalue of M, then

    λ ∈ ⋃_{i=1}^n Disc(M_{ii}, R_i) .

Proof: Let x ∈ C^n be an eigenvector corresponding to the eigenvalue λ. Let i_0 =
argmax_{i∈[n]} {|x_i|}, and note that x_{i_0} ≠ 0 since x ≠ 0. Since x is an eigenvector, we have

    Mx = λx  ⇒  ∀i ∈ [n]  ∑_{j=1}^n M_{ij} x_j = λ x_i .

In particular, we have that for i = i_0,

    ∑_{j=1}^n M_{i_0 j} x_j = λ x_{i_0}  ⇒  ∑_{j=1}^n M_{i_0 j} (x_j / x_{i_0}) = λ  ⇒  ∑_{j≠i_0} M_{i_0 j} (x_j / x_{i_0}) = λ − M_{i_0 i_0} .

Thus, since |x_j| ≤ |x_{i_0}| for all j by the choice of i_0, we have

    |λ − M_{i_0 i_0}| ≤ ∑_{j≠i_0} |M_{i_0 j}| · |x_j / x_{i_0}| ≤ ∑_{j≠i_0} |M_{i_0 j}| = R_{i_0} .
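The following is an illustrative numerical sketch of the theorem (added here, not part of the notes): for a random complex matrix, every eigenvalue lies in the union of the Gershgorin discs.

    # Gershgorin: every eigenvalue lies in some disc Disc(M_ii, R_i) with R_i = sum_{j != i} |M_ij|.
    import numpy as np

    rng = np.random.default_rng(2)
    n = 5
    M = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))

    centers = np.diag(M)
    radii = np.sum(np.abs(M), axis=1) - np.abs(centers)   # R_i = sum_{j != i} |M_ij|

    for lam in np.linalg.eigvals(M):
        in_some_disc = np.any(np.abs(lam - centers) <= radii + 1e-9)
        print(lam, in_some_disc)                           # every eigenvalue reports True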

2.1 An application to compressed sensing

The Gershgorin disc theorem is quite useful in compressed sensing, to ensure what is
known as the “Restricted Isometry Property” for the measurement matrices.

Definition 2.2 A matrix A ∈ R^{k×n} is said to have the restricted isometry property with parameters (s, δ_s) if

    (1 − δ_s) · ∥x∥_2^2 ≤ ∥Ax∥_2^2 ≤ (1 + δ_s) · ∥x∥_2^2

for all x ∈ R^n which satisfy |{i | x_i ≠ 0}| ≤ s.

Thus, we want the transformation A to be approximately norm preserving for all sparse
vectors x. This can of course be ensured for all x by taking A = id, but we require k ≪ n
for the applications in compressed sensing. In general, the restricted isometry property
is NP-hard to verify and can thus also be hard to reason about for a given matrix. The
Gershgorin Disc Theorem lets us derive a much easier condition which is sufficient to
ensure the restricted isometry property.

Definition 2.3 Let A ∈ R^{k×n} be such that ∥A^{(i)}∥_2 = 1 for each column A^{(i)} of A. Define the
coherence of A as

    µ(A) = max_{i≠j} |⟨A^{(i)}, A^{(j)}⟩| .

We will prove the following.

Proposition 2.4 Let A ∈ R^{k×n} be such that ∥A^{(i)}∥_2 = 1 for each column A^{(i)} of A. Then, for any
s, the matrix A has the restricted isometry property with parameters (s, (s − 1)µ(A)).

Note that the bound becomes meaningless if s ≥ 1 + 1/µ(A). However, the above proposition
shows that it may be sufficient to bound µ(A) (which is also easier to check in practice).

Proof: Consider any x such that |{i | x_i ≠ 0}| ≤ s. Let S denote the support of x, i.e.,
S = {i | x_i ≠ 0}. Let A_S denote the k × |S| submatrix where we only keep the columns
corresponding to indices in S. Let x_S denote x restricted to the non-zero entries. Then

    ∥Ax∥_2^2 = ∥A_S x_S∥_2^2 = ⟨A_S^T A_S x_S, x_S⟩ .

Thus, it suffices to bound the eigenvalues of the matrix A_S^T A_S. Note that (A_S^T A_S)_{ij} = ⟨A^{(i)}, A^{(j)}⟩
for i, j ∈ S. Thus the diagonal entries are 1 and the off-diagonal entries are bounded by µ(A) in
absolute value. By the Gershgorin Disc Theorem, for any eigenvalue λ of A_S^T A_S, we have

    |λ − 1| ≤ (|S| − 1) · µ(A) ≤ (s − 1) · µ(A) .

Thus, since ∥x_S∥_2 = ∥x∥_2, we have

    (1 − (s − 1) · µ(A)) · ∥x∥_2^2 ≤ ∥Ax∥_2^2 ≤ (1 + (s − 1) · µ(A)) · ∥x∥_2^2 ,

as desired.
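As an illustrative sketch (added here, not part of the notes; all parameter choices are arbitrary), the coherence µ(A) and the resulting bound of Proposition 2.4 can be checked numerically for a random matrix with normalized columns and a random s-sparse x:

    # Coherence-based RIP bound: (1 - (s-1)mu)||x||^2 <= ||Ax||^2 <= (1 + (s-1)mu)||x||^2.
    import numpy as np

    rng = np.random.default_rng(3)
    k, n, s = 200, 400, 3

    A = rng.standard_normal((k, n))
    A /= np.linalg.norm(A, axis=0)                       # normalize columns: ||A^(i)||_2 = 1

    G = A.T @ A
    mu = np.max(np.abs(G - np.eye(n)))                   # mu(A) = max_{i != j} |<A^(i), A^(j)>|

    # Random s-sparse vector x.
    x = np.zeros(n)
    support = rng.choice(n, size=s, replace=False)
    x[support] = rng.standard_normal(s)

    lhs = (1 - (s - 1) * mu) * np.sum(x ** 2)
    rhs = (1 + (s - 1) * mu) * np.sum(x ** 2)
    # Prints True; the lower bound is only meaningful when (s-1)*mu < 1.
    print(lhs <= np.sum((A @ x) ** 2) <= rhs)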

The theorem is also very useful for bounding how much the eigenvalues of a matrix change
due to a perturbation. We will see an example of this in the homework.
