
ℓ1-norm Methods for Convex-Cardinality Problems
Part II

• total variation
• iterated weighted ℓ1 heuristic
• matrix rank constraints



Total variation reconstruction

• fit xcor with piecewise constant x̂, no more than k jumps

• convex-cardinality problem: minimize ‖x̂ − xcor‖_2 subject to card(Dx̂) ≤ k (D is the first-order difference matrix)

• heuristic: minimize ‖x̂ − xcor‖_2 + γ‖Dx̂‖_1; vary γ to adjust the number of jumps (sketch below)

• ‖Dx̂‖_1 is the total variation of the signal x̂

• method is called total variation reconstruction

• unlike ℓ2-based reconstruction, TV reconstruction filters out high-frequency noise while preserving sharp jumps
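A minimal CVXPY sketch of the heuristic (CVXPY is not part of the slides; xcor and gamma below are stand-ins for the actual data):

    import cvxpy as cp
    import numpy as np

    # stand-in data: the real example uses the corrupted signal from BV §6.3.3
    n = 2000
    xcor = np.random.randn(n)
    gamma = 1.0                     # vary to trade off fit vs. number of jumps

    xhat = cp.Variable(n)
    # objective ‖x̂ − xcor‖_2 + γ‖Dx̂‖_1; cp.diff applies the difference matrix D
    obj = cp.norm(xhat - xcor, 2) + gamma * cp.norm(cp.diff(xhat), 1)
    cp.Problem(cp.Minimize(obj)).solve()
    # xhat.value is the piecewise-constant estimate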



Example (§6.3.3 in BV book)

signal x ∈ R^2000 and corrupted signal xcor ∈ R^2000

[Figure: x (top) and xcor (bottom) plotted against t, 0 ≤ t ≤ 2000]


Total variation reconstruction for three values of γ

[Figure: TV reconstructions x̂ versus t for three values of γ]


ℓ2 reconstruction for three values of γ

[Figure: ℓ2 reconstructions x̂ versus t for three values of γ]


Example: 2D total variation reconstruction

• x ∈ R^n are values of pixels on an N × N grid (N = 31, so n = 961)

• assumption: x has relatively few big changes in value (i.e., boundaries)

• we have m = 120 linear measurements, y = F x (Fij ∼ N(0, 1))

• as convex-cardinality problem:

   minimize   Σ_{i,j} card(x_{i,j} − x_{i+1,j}) + Σ_{i,j} card(x_{i,j} − x_{i,j+1})
   subject to y = F x

• ℓ1 heuristic (objective is a 2D version of total variation):

   minimize   Σ_{i,j} |x_{i,j} − x_{i+1,j}| + Σ_{i,j} |x_{i,j} − x_{i,j+1}|
   subject to y = F x
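A CVXPY sketch of the ℓ1 heuristic above (F and y are randomly generated stand-ins for the actual measurement data):

    import cvxpy as cp
    import numpy as np

    N, m = 31, 120
    n = N * N
    F = np.random.randn(m, n)               # Fij ~ N(0, 1)
    y = F @ np.random.randn(n)              # stand-in measurements

    X = cp.Variable((N, N))
    # anisotropic 2D TV: sums of |x_{i,j} - x_{i+1,j}| and |x_{i,j} - x_{i,j+1}|
    tv = cp.sum(cp.abs(cp.diff(X, axis=0))) + cp.sum(cp.abs(cp.diff(X, axis=1)))
    cp.Problem(cp.Minimize(tv), [F @ cp.vec(X) == y]).solve()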



TV reconstruction

[Figure: original signal (left) and TV reconstruction (right), shown as surfaces over the 31 × 31 pixel grid]

. . . not bad for 8× more variables than measurements!


ℓ2 reconstruction

[Figure: original signal (left) and ℓ2 reconstruction (right), shown as surfaces over the 31 × 31 pixel grid]

. . . this is what you’d expect with 8× more variables than measurements


Iterated weighted ℓ1 heuristic

• to minimize card(x) over x ∈ C (code sketch below):

   w := 1
   repeat
      minimize ‖diag(w) x‖_1 over x ∈ C
      w_i := 1/(ε + |x_i|)

• first iteration is the basic ℓ1 heuristic

• increases relative weight on small x_i

• typically converges in 5 or fewer steps

• often gives a modest improvement (i.e., reduction in card(x)) over the basic ℓ1 heuristic
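A CVXPY sketch of the loop; the constraint set C is passed as a function returning a constraint list (names are illustrative, not from the slides):

    import cvxpy as cp
    import numpy as np

    def iterated_weighted_l1(make_constraints, n, eps=1e-3, iters=5):
        # iterated weighted l1 heuristic for minimizing card(x) over x in C
        x = cp.Variable(n)
        w = np.ones(n)                          # first pass = basic l1 heuristic
        for _ in range(iters):
            obj = cp.Minimize(cp.norm(cp.multiply(w, x), 1))
            cp.Problem(obj, make_constraints(x)).solve()
            w = 1.0 / (eps + np.abs(x.value))   # upweight small entries
        return x.value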



Interpretation

• wlog we can take x ⪰ 0 (by writing x = x₊ − x₋, with x₊, x₋ ⪰ 0, and replacing card(x) with card(x₊) + card(x₋))

• we’ll use the approximation card(z) ≈ log(1 + z/ε), where ε > 0, z ∈ R₊

• using this approximation, we get the (nonconvex) problem

   minimize   Σ_{i=1}^n log(1 + x_i/ε)
   subject to x ∈ C, x ⪰ 0

• we’ll find a local solution by linearizing the objective at the current point x^(k),

   Σ_{i=1}^n log(1 + x_i/ε) ≈ Σ_{i=1}^n log(1 + x_i^(k)/ε) + Σ_{i=1}^n (x_i − x_i^(k))/(ε + x_i^(k))



and solving the resulting convex problem

   minimize   Σ_{i=1}^n w_i x_i
   subject to x ∈ C, x ⪰ 0

with w_i = 1/(ε + x_i^(k)) (the derivative of log(1 + z/ε) at z = x_i^(k)), to get the next iterate

• repeat until convergence to get a local solution



Sparse solution of linear inequalities

• minimize card(x) over polyhedron {x | Ax ⪯ b}, A ∈ R^{100×50}

• ℓ1 heuristic finds x ∈ R^50 with card(x) = 44

• iterated weighted ℓ1 heuristic finds x with card(x) = 36
  (global solution, via branch & bound, is card(x) = 32)
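Reproducing this setup with random stand-in data (the actual A and b are not given), using the iterated_weighted_l1 sketch defined earlier:

    import numpy as np

    A = np.random.randn(100, 50)
    b = A @ np.random.randn(50) + 1.0    # offset keeps the polyhedron nonempty
    x = iterated_weighted_l1(lambda x: [A @ x <= b], n=50)
    print((np.abs(x) > 1e-5).sum())      # estimated card(x)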
[Figure: card(x) versus iteration (1–6) for the basic ℓ1 and iterated ℓ1 heuristics]



Detecting changes in time series model

• AR(2) scalar time-series model

   y(t + 2) = a(t)y(t + 1) + b(t)y(t) + v(t),   v(t) IID N(0, 0.5²)

• assumption: a(t) and b(t) are piecewise constant, change infrequently

• given y(t), t = 1, . . . , T, estimate a(t), b(t), t = 1, . . . , T − 2

• heuristic: minimize over variables a(t), b(t), t = 1, . . . , T − 2

   Σ_{t=1}^{T−2} (y(t + 2) − a(t)y(t + 1) − b(t)y(t))²
   + γ Σ_{t=1}^{T−3} (|a(t + 1) − a(t)| + |b(t + 1) − b(t)|)

• vary γ to trade off fit versus number of changes in a, b (sketch below)
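A CVXPY sketch of the heuristic (y is a stand-in series; the plots suggest T = 300):

    import cvxpy as cp
    import numpy as np

    T = 300
    y = np.random.randn(T)               # stand-in for the observed series
    gamma = 10.0

    a = cp.Variable(T - 2)
    b = cp.Variable(T - 2)
    # residuals y(t+2) - a(t)y(t+1) - b(t)y(t) for t = 1, ..., T-2
    resid = y[2:] - cp.multiply(a, y[1:-1]) - cp.multiply(b, y[:-2])
    fit = cp.sum_squares(resid)
    tv = cp.norm(cp.diff(a), 1) + cp.norm(cp.diff(b), 1)
    cp.Problem(cp.Minimize(fit + gamma * tv)).solve()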



Time series and true coefficients

[Figure: left, the series y(t); right, the true coefficients a(t) and b(t), for t = 0, . . . , 300]


TV heuristic and iterated TV heuristic

left: TV with γ = 10; right: iterated TV, 5 iterations, ε = 0.005

[Figure: estimated coefficients a(t) and b(t) versus t for each method]


Extension to matrices

• Rank is natural analog of card for matrices

• convex-rank problem: convex, except for Rank in objective or constraints

• rank problem reduces to card problem when matrices are diagonal: Rank(diag(x)) = card(x)

• analog of ℓ1 heuristic: use nuclear norm, ‖X‖_* = Σ_i σ_i(X) (sum of singular values; dual of spectral norm); example sketch below

• for X ⪰ 0, ‖X‖_* reduces to Tr X (for x ⪰ 0, ‖x‖_1 reduces to 1^T x)
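A minimal CVXPY sketch of the nuclear-norm heuristic for a rank problem with affine constraints (all data here are illustrative):

    import cvxpy as cp
    import numpy as np

    m, n, p = 10, 10, 40
    A = np.random.randn(p, m * n)        # linear measurement operator
    Xtrue = np.outer(np.random.randn(m), np.random.randn(n))   # rank-1 target
    b = A @ Xtrue.flatten(order='F')     # 'F' matches cp.vec column ordering

    X = cp.Variable((m, n))
    # minimize ‖X‖_* subject to the affine measurements
    cp.Problem(cp.Minimize(cp.normNuc(X)), [A @ cp.vec(X) == b]).solve()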



Factor modeling

• given matrix Σ ∈ S^n_+, find approximation of form Σ̂ = F F^T + D, where F ∈ R^{n×r}, D is diagonal nonnegative

• gives underlying factor model (with r factors)

   x = F z + v,   v ∼ N(0, D),   z ∼ N(0, I)

• model with fewest factors:

   minimize   Rank X
   subject to X ⪰ 0, D ⪰ 0 diagonal
              X + D ∈ C

with variables X, D ∈ S^n; C is a convex set of acceptable approximations to Σ



• e.g., via KL divergence

   C = {Σ̂ | − log det(Σ^{−1/2} Σ̂ Σ^{−1/2}) + Tr(Σ^{−1/2} Σ̂ Σ^{−1/2}) − n ≤ ε}

• trace heuristic:

   minimize   Tr X
   subject to X ⪰ 0, D ⪰ 0 diagonal
              X + D ∈ C

with variables X, D ∈ S^n



Example

• x = F z + v, z ∼ N(0, I), v ∼ N(0, D), D diagonal; F ∈ R^{20×3}

• Σ is empirical covariance matrix from N = 3000 samples

• set of acceptable approximations

   C = {Σ̂ | ‖Σ^{−1/2}(Σ̂ − Σ)Σ^{−1/2}‖ ≤ β}

• trace heuristic (CVXPY sketch below):

   minimize   Tr X
   subject to X ⪰ 0, d ⪰ 0
              ‖Σ^{−1/2}(X + diag(d) − Σ)Σ^{−1/2}‖ ≤ β
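A CVXPY sketch of this trace heuristic, with Σ built from simulated data matching the setup above (the true F and D here are random stand-ins):

    import cvxpy as cp
    import numpy as np

    n, r, Nsamp = 20, 3, 3000
    F = np.random.randn(n, r)
    Dtrue = np.diag(np.random.rand(n))
    samples = (np.random.randn(Nsamp, r) @ F.T
               + np.random.randn(Nsamp, n) @ np.sqrt(Dtrue))
    Sigma = np.cov(samples, rowvar=False)    # empirical covariance

    w, V = np.linalg.eigh(Sigma)
    Sih = V @ np.diag(w ** -0.5) @ V.T       # Sigma^{-1/2}
    beta = 0.1357

    X = cp.Variable((n, n), PSD=True)
    d = cp.Variable(n, nonneg=True)
    resid = Sih @ (X + cp.diag(d) - Sigma) @ Sih
    cp.Problem(cp.Minimize(cp.trace(X)),
               [cp.sigma_max(resid) <= beta]).solve()
    # np.linalg.matrix_rank(X.value, tol=1e-4) estimates the recovered rank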



Trace approximation results

[Figure: Rank(X) (left) and eigenvalues λ_i(X), log scale (right), versus β for 10^{−2} ≤ β ≤ 10^0]


• for β = 0.1357 (knee of the tradeoff curve) we find

   – ∠(range(X), range(F F^T)) = 6.8°
   – ‖d − diag(D)‖/‖diag(D)‖ = 0.07

• i.e., we have recovered the factor model from the empirical covariance
