
Canonical Problem Forms

Ryan Tibshirani
Convex Optimization 10-725
Last time: optimization basics

• Optimization terminology (e.g., criterion, constraints, feasible points, solutions)
• Properties and first-order optimality
• Equivalent transformations (e.g., partial optimization, change of variables, eliminating equality constraints)
Outline

Today:
• Linear programs
• Quadratic programs
• Semidefinite programs
• Cone programs

Linear program
A linear program or LP is an optimization problem of the form

    min_x   c^T x
    subject to   Dx ≤ d
                 Ax = b

Observe that this is always a convex optimization problem

• First introduced by Kantorovich in the late 1930s and Dantzig in the 1940s
• Dantzig's simplex algorithm gives a direct (noniterative) solver for LPs (later in the course we'll see interior point methods)
• Fundamental problem in convex optimization. Many diverse applications, rich history
Example: diet problem

Find the cheapest combination of foods that satisfies some nutritional requirements (useful for graduate students!)

    min_x   c^T x
    subject to   Dx ≥ d
                 x ≥ 0

Interpretation:
• c_j : per-unit cost of food j
• d_i : minimum required intake of nutrient i
• D_ij : content of nutrient i per unit of food j
• x_j : units of food j in the diet
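As a quick sketch, the diet LP can be handed directly to an off-the-shelf LP solver. The numbers below are made up for illustration; `scipy.optimize.linprog` minimizes c^T x subject to A_ub x ≤ b_ub, so the nutrient constraints Dx ≥ d are passed as −Dx ≤ −d:

```python
# Toy diet problem as an LP (illustrative numbers, not from the slides).
import numpy as np
from scipy.optimize import linprog

c = np.array([2.0, 3.5, 1.0])            # per-unit cost of 3 foods
D = np.array([[70, 120, 40],             # content of nutrient i per unit of food j
              [10, 30, 50]])
d = np.array([300.0, 90.0])              # minimum required intake per nutrient

# Dx >= d becomes -Dx <= -d in linprog's convention; x >= 0 via bounds
res = linprog(c, A_ub=-D, b_ub=-d, bounds=[(0, None)] * 3)
assert res.success
print("diet:", np.round(res.x, 3), "cost:", round(res.fun, 3))
assert np.all(D @ res.x >= d - 1e-8)     # nutritional requirements met
```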
Example: transportation problem
Ship commodities from given sources to destinations at min cost
    min_x   Σ_{i=1}^m Σ_{j=1}^n c_ij x_ij
    subject to   Σ_{j=1}^n x_ij ≤ s_i ,  i = 1, ..., m
                 Σ_{i=1}^m x_ij ≥ d_j ,  j = 1, ..., n
                 x ≥ 0

Interpretation:
• s_i : supply at source i
• d_j : demand at destination j
• c_ij : per-unit shipping cost from i to j
• x_ij : units shipped from i to j
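The same pattern works here once the m×n variables x_ij are flattened into a single vector. A sketch with made-up supplies, demands, and costs:

```python
# Toy transportation LP (illustrative numbers). The m*n variables x_ij are
# flattened row-major, so x_ij lives at index i*n + j.
import numpy as np
from scipy.optimize import linprog

m, n = 2, 3                               # sources, destinations
c = np.array([[4., 6., 9.],
              [5., 3., 7.]])              # per-unit shipping costs c_ij
s = np.array([50., 40.])                  # supplies s_i
d = np.array([20., 30., 25.])             # demands d_j

# Supply rows: sum_j x_ij <= s_i; demand rows: -sum_i x_ij <= -d_j
A_ub = np.zeros((m + n, m * n))
for i in range(m):
    A_ub[i, i * n:(i + 1) * n] = 1.0
for j in range(n):
    A_ub[m + j, j::n] = -1.0
b_ub = np.concatenate([s, -d])

res = linprog(c.ravel(), A_ub=A_ub, b_ub=b_ub, bounds=[(0, None)] * (m * n))
assert res.success
x = res.x.reshape(m, n)
assert np.all(x.sum(axis=1) <= s + 1e-8) and np.all(x.sum(axis=0) >= d - 1e-8)
```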
Example: basis pursuit
Given y ∈ R^n and X ∈ R^{n×p}, where p > n. Suppose that we seek the sparsest solution to the underdetermined linear system Xβ = y

Nonconvex formulation:

    min_β   ||β||_0
    subject to   Xβ = y

where recall ||β||_0 = Σ_{j=1}^p 1{β_j ≠ 0}, the ℓ_0 "norm"

The ℓ_1 approximation, often called basis pursuit:

    min_β   ||β||_1
    subject to   Xβ = y
Basis pursuit is a linear program. Reformulation:

    min_β   ||β||_1          ⟺      min_{β,z}   1^T z
    subject to   Xβ = y              subject to   z ≥ β
                                                  z ≥ −β
                                                  Xβ = y

(Check that this makes sense to you)
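The reformulation above is concrete enough to solve directly. A sketch on random toy data: stack the variables as v = (β, z) ∈ R^{2p}, minimize 1^T z subject to β − z ≤ 0, −β − z ≤ 0, and Xβ = y:

```python
# Basis pursuit via its LP reformulation (random toy data).
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
n, p = 5, 12
X = rng.standard_normal((n, p))
beta_true = np.zeros(p); beta_true[[1, 7]] = [2.0, -1.5]   # a sparse signal
y = X @ beta_true

I = np.eye(p)
cost = np.concatenate([np.zeros(p), np.ones(p)])           # objective 1^T z
A_ub = np.block([[I, -I], [-I, -I]])                       # z >= beta, z >= -beta
A_eq = np.hstack([X, np.zeros((n, p))])                    # X beta = y
res = linprog(cost, A_ub=A_ub, b_ub=np.zeros(2 * p),
              A_eq=A_eq, b_eq=y, bounds=[(None, None)] * (2 * p))
assert res.success
beta = res.x[:p]
assert np.allclose(X @ beta, y, atol=1e-6)                 # feasible solution
```

Note the explicit `bounds=(None, None)`: β must be a free variable, while linprog's default would clamp it to be nonnegative.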
Example: Dantzig selector

Modification of the previous problem, where we allow Xβ ≈ y (we don't require exact equality), the Dantzig selector:¹

    min_β   ||β||_1
    subject to   ||X^T(y − Xβ)||_∞ ≤ λ

Here λ ≥ 0 is a tuning parameter

Again, this can be reformulated as a linear program (check this!)

¹ Candès and Tao (2007), "The Dantzig selector: statistical estimation when p is much larger than n"
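One way to carry out that check: with v = (β, z), minimize 1^T z subject to z ≥ ±β and −λ1 ≤ X^T(y − Xβ) ≤ λ1, which is linear in (β, z). A sketch on made-up data:

```python
# Dantzig selector via an LP reformulation (toy data).
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(1)
n, p, lam = 30, 10, 0.5
X = rng.standard_normal((n, p))
beta_true = np.zeros(p); beta_true[:2] = [1.0, -2.0]
y = X @ beta_true + 0.1 * rng.standard_normal(n)

I, G, g = np.eye(p), X.T @ X, X.T @ y
cost = np.concatenate([np.zeros(p), np.ones(p)])    # objective 1^T z
A_ub = np.block([[I, -I], [-I, -I],                 # z >= beta, z >= -beta
                 [G, np.zeros((p, p))],             #  X^T X beta <= X^T y + lam
                 [-G, np.zeros((p, p))]])           # -X^T X beta <= -X^T y + lam
b_ub = np.concatenate([np.zeros(2 * p), g + lam, -g + lam])
res = linprog(cost, A_ub=A_ub, b_ub=b_ub, bounds=[(None, None)] * (2 * p))
assert res.success
beta = res.x[:p]
assert np.max(np.abs(X.T @ (y - X @ beta))) <= lam + 1e-6   # constraint holds
```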
Standard form

A linear program is said to be in standard form when it is written as

    min_x   c^T x
    subject to   Ax = b
                 x ≥ 0

Any linear program can be rewritten in standard form (check this!)
Convex quadratic program

A convex quadratic program or QP is an optimization problem of the form

    min_x   c^T x + (1/2) x^T Q x
    subject to   Dx ≤ d
                 Ax = b

where Q ⪰ 0, i.e., positive semidefinite

Note that this problem is not convex when Q ⋡ 0

From now on, when we say quadratic program or QP, we implicitly assume that Q ⪰ 0 (so the problem is convex)
Example: portfolio optimization

Construct a financial portfolio, trading off performance and risk:

    max_x   μ^T x − (γ/2) x^T Q x
    subject to   1^T x = 1
                 x ≥ 0

Interpretation:
• μ : expected assets' returns
• Q : covariance matrix of assets' returns
• γ : risk aversion
• x : portfolio holdings (percentages)
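A small numerical sketch, with made-up returns and covariance (a dedicated QP solver would be the usual choice; here `scipy`'s general-purpose SLSQP suffices, maximizing by minimizing the negated objective):

```python
# Toy portfolio QP (illustrative data) solved with scipy's SLSQP.
import numpy as np
from scipy.optimize import minimize

mu = np.array([0.08, 0.12, 0.10])                 # expected returns
Q = np.array([[0.10, 0.02, 0.01],
              [0.02, 0.20, 0.03],
              [0.01, 0.03, 0.15]])                # return covariance (PSD)
gamma = 4.0                                       # risk aversion

obj = lambda x: -(mu @ x - 0.5 * gamma * x @ Q @ x)   # negate to maximize
res = minimize(obj, x0=np.ones(3) / 3, method="SLSQP",
               bounds=[(0, None)] * 3,                 # x >= 0
               constraints={"type": "eq",              # 1^T x = 1
                            "fun": lambda x: x.sum() - 1})
assert res.success
x = res.x
assert abs(x.sum() - 1) < 1e-6 and np.all(x >= -1e-8)  # feasible portfolio
```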
Example: support vector machines

Given y ∈ {−1, 1}^n and X ∈ R^{n×p} having rows x_1, ..., x_n, recall the support vector machine or SVM problem:

    min_{β, β_0, ξ}   (1/2)||β||_2^2 + C Σ_{i=1}^n ξ_i
    subject to   ξ_i ≥ 0,  i = 1, ..., n
                 y_i (x_i^T β + β_0) ≥ 1 − ξ_i ,  i = 1, ..., n

This is a quadratic program
Example: lasso

Given y ∈ R^n, X ∈ R^{n×p}, recall the lasso problem:

    min_β   ||y − Xβ||_2^2
    subject to   ||β||_1 ≤ s

Here s ≥ 0 is a tuning parameter. Indeed, this can be reformulated as a quadratic program (check this!)

Alternative parametrization (called Lagrange, or penalized form):

    min_β   (1/2)||y − Xβ||_2^2 + λ||β||_1

Now λ ≥ 0 is a tuning parameter. And again, this can be rewritten as a quadratic program (check this!)
Standard form

A quadratic program is in standard form if it is written as

    min_x   c^T x + (1/2) x^T Q x
    subject to   Ax = b
                 x ≥ 0

Any quadratic program can be rewritten in standard form
Motivation for semidefinite programs
Consider linear programming again:

    min_x   c^T x
    subject to   Dx ≤ d
                 Ax = b

Can generalize by changing ≤ to a different (partial) order. Recall:
• S^n is the space of n × n symmetric matrices
• S^n_+ is the space of positive semidefinite matrices, i.e.,

    S^n_+ = {X ∈ S^n : u^T X u ≥ 0 for all u ∈ R^n}

• S^n_++ is the space of positive definite matrices, i.e.,

    S^n_++ = {X ∈ S^n : u^T X u > 0 for all u ∈ R^n \ {0}}
Facts about S^n, S^n_+, S^n_++

• Basic linear algebra facts, here λ(X) = (λ_1(X), ..., λ_n(X)):

    X ∈ S^n     =⇒  λ(X) ∈ R^n
    X ∈ S^n_+   ⟺  λ(X) ∈ R^n_+
    X ∈ S^n_++  ⟺  λ(X) ∈ R^n_++

• We can define an inner product over S^n: given X, Y ∈ S^n,

    X • Y = tr(XY)

• We can define a partial ordering over S^n: given X, Y ∈ S^n,

    X ⪰ Y  ⟺  X − Y ∈ S^n_+

Note: for x, y ∈ R^n, diag(x) ⪰ diag(y) ⟺ x ≥ y (recall, the latter is interpreted elementwise)
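These characterizations are easy to check numerically, e.g., a symmetric matrix is in S^n_+ exactly when all its eigenvalues are nonnegative, and X ⪰ Y exactly when X − Y is PSD:

```python
# Spot-checking the eigenvalue characterizations and the Loewner order.
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((4, 4))
X = A @ A.T                       # PSD by construction (X = A A^T)
Y = X - 0.1 * np.eye(4)          # X - Y = 0.1 I is PSD, so X >= Y

assert np.all(np.linalg.eigvalsh(X) >= -1e-10)        # X in S^4_+
assert np.all(np.linalg.eigvalsh(X - Y) >= -1e-10)    # X >= Y (Loewner order)

# diag(x) >= diag(y) in the Loewner order iff x >= y elementwise
x, y = np.array([3., 1., 2.]), np.array([1., 0., 2.])
assert np.all(np.linalg.eigvalsh(np.diag(x) - np.diag(y)) >= -1e-10)
assert np.all(x >= y)
```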
Semidefinite program

A semidefinite program or SDP is an optimization problem of the form

    min_x   c^T x
    subject to   x_1 F_1 + ··· + x_n F_n ⪯ F_0
                 Ax = b

Here F_j ∈ S^d, for j = 0, 1, ..., n, and A ∈ R^{m×n}, c ∈ R^n, b ∈ R^m.

Observe that this is always a convex optimization problem

Also, any linear program is a semidefinite program (check this!)
Standard form

A semidefinite program is in standard form if it is written as

    min_X   C • X
    subject to   A_i • X = b_i ,  i = 1, ..., m
                 X ⪰ 0

Any semidefinite program can be written in standard form (for a challenge, check this!)
Example: theta function
Let G = (N, E) be an undirected graph, N = {1, ..., n}, and
• ω(G) : clique number of G
• χ(G) : chromatic number of G

The Lovász theta function:²

    ϑ(G) = max_X   11^T • X
    subject to   I • X = 1
                 X_ij = 0,  (i, j) ∉ E
                 X ⪰ 0

The Lovász sandwich theorem: ω(G) ≤ ϑ(Ḡ) ≤ χ(G), where Ḡ is the complement graph of G

² Lovász (1979), "On the Shannon capacity of a graph"
Example: trace norm minimization
Let A : R^{m×n} → R^p be a linear map,

    A(X) = ( A_1 • X , ... , A_p • X )

for A_1, ..., A_p ∈ R^{m×n} (and where A_i • X = tr(A_i^T X)). Finding the lowest-rank solution to an underdetermined system, nonconvex:

    min_X   rank(X)
    subject to   A(X) = b

Trace norm approximation:

    min_X   ||X||_tr
    subject to   A(X) = b

This is indeed an SDP (but harder to show, requires duality ...)
Conic program

A conic program is an optimization problem of the form:

    min_x   c^T x
    subject to   Ax = b
                 D(x) + d ∈ K

Here:
• c, x ∈ R^n, and A ∈ R^{m×n}, b ∈ R^m
• D : R^n → Y is a linear map, d ∈ Y, for a Euclidean space Y
• K ⊆ Y is a closed convex cone

Both LPs and SDPs are special cases of conic programming. For LPs, K = R^n_+; for SDPs, K = S^n_+
Second-order cone program
A second-order cone program or SOCP is an optimization problem of the form:

    min_x   c^T x
    subject to   ||D_i x + d_i||_2 ≤ e_i^T x + f_i ,  i = 1, ..., p
                 Ax = b

This is indeed a cone program. Why? Recall the second-order cone

    Q = {(x, t) : ||x||_2 ≤ t}

So we have

    ||D_i x + d_i||_2 ≤ e_i^T x + f_i  ⟺  (D_i x + d_i , e_i^T x + f_i) ∈ Q_i

for second-order cones Q_i of appropriate dimensions. Now take

    K = Q_1 × ··· × Q_p
Observe that every LP is an SOCP. Further, every SOCP is an SDP

Why? Turns out that

    ||x||_2 ≤ t  ⟺  [ tI   x ]
                     [ x^T  t ]  ⪰ 0

Hence we can write any SOCP constraint as an SDP constraint

The above is a special case of the Schur complement theorem:

    [ A    B ]
    [ B^T  C ]  ⪰ 0  ⟺  A − B C^{-1} B^T ⪰ 0

for A, C symmetric and C ≻ 0
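The first equivalence is easy to check numerically, by building the block matrix [tI, x; x^T, t] and testing its eigenvalues:

```python
# Checking ||x||_2 <= t  <=>  [[t I, x], [x^T, t]] is PSD, on two examples.
import numpy as np

def arrow(x, t):
    """Build the (n+1) x (n+1) block matrix [[t I, x], [x^T, t]]."""
    n = len(x)
    M = t * np.eye(n + 1)
    M[:n, n] = x
    M[n, :n] = x
    return M

x = np.array([3.0, 4.0])                                      # ||x||_2 = 5
assert np.all(np.linalg.eigvalsh(arrow(x, 5.0)) >= -1e-10)    # ||x|| <= t: PSD
assert np.linalg.eigvalsh(arrow(x, 4.0)).min() < 0            # ||x|| >  t: not PSD
```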
Hey, what about QPs?

Finally, our old friend QPs "sneak" into the hierarchy. Turns out QPs are SOCPs, which we can see by rewriting a QP as

    min_{x,t}   c^T x + t
    subject to   Dx ≤ d,  (1/2) x^T Q x ≤ t
                 Ax = b

Now write (1/2) x^T Q x ≤ t  ⟺  ||( Q^{1/2} x / √2 , (1 − t)/2 )||_2 ≤ (1 + t)/2

Take a breath (phew!). Thus we have established the hierarchy

    LPs ⊆ QPs ⊆ SOCPs ⊆ SDPs ⊆ Conic programs

completing the picture we saw at the start
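The norm identity in that rewriting can be verified by squaring both sides: for any square root with Qh^T Qh = Q, ||(Qh x / √2, (1 − t)/2)||² − ((1 + t)/2)² = (1/2) x^T Q x − t, so the two inequalities hold or fail together. A quick numerical spot-check on random data:

```python
# Spot-checking the QP-to-SOCP norm identity on random (x, t).
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((3, 3))
Q = A @ A.T                                  # a PSD matrix
Qh = np.linalg.cholesky(Q).T                 # square root: Qh^T Qh = Q

for _ in range(100):
    x = rng.standard_normal(3)
    t = 5 * rng.standard_normal()
    v = np.concatenate([Qh @ x / np.sqrt(2), [(1 - t) / 2]])
    # ||v||^2 - ((1+t)/2)^2 should equal (1/2) x^T Q x - t exactly
    assert np.isclose(np.linalg.norm(v) ** 2 - ((1 + t) / 2) ** 2,
                      0.5 * x @ Q @ x - t)
```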
References and further reading

• D. Bertsimas and J. Tsitsiklis (1997), "Introduction to linear optimization," Chapters 1, 2
• S. Boyd and L. Vandenberghe (2004), "Convex optimization," Chapter 4
• A. Nemirovski and A. Ben-Tal (2001), "Lectures on modern convex optimization," Chapters 1–4
