Lecture 02 - Convexity
Lecture 02 - Convexity
2. Convex sets
• generalized inequalities
2–1
Affine set
x = θx1 + (1 − θ)x2 (θ ∈ R)
θ = 1.2 x1
θ=1
θ = 0.6
x2
θ=0
θ = −0.2
affine set: contains the line through any two distinct points in the set
x = θx1 + (1 − θ)x2
with 0 ≤ θ ≤ 1
convex set: contains line segment between any two points in the set
x = θ1 x1 + θ2 x2 + · · · + θk xk
with θ1 + · · · + θk = 1, θi ≥ 0
x = θ1 x1 + θ2 x2
with θ1 ≥ 0, θ2 ≥ 0
x1
x2
0
convex cone: set that contains all conic combinations of points in the set
x0
x
aT x = b
x0 aT x ≥ b
aT x ≤ b
xc
Ax ! b, Cx = d
a1 a2
P
a5
a3
a4
! "
x y 0.5
z
example: ∈ S2+
y z
0
1
1
0
0.5
y −1 0 x
1. apply definition
example:
S = {x ∈ Rm | |p(t)| ≤ 1 for |t| ≤ π/3}
for m = 2:
2
1
1
p(t)
0
x2 0 S
−1
−1
−2
0 π/3 2π/3 π −2 −1
t x01 1 2
examples
• scaling, translation, projection
• solution set of linear matrix inequality {x | x1A1 + · · · + xmAm % B}
(with Ai, B ∈ Sp)
• hyperbolic cone {x | xT P x ≤ (cT x)2, cT x ≥ 0} (with P ∈ Sn+)
images and inverse images of convex sets under perspective are convex
Ax + b
f (x) = T , dom f = {x | cT x + d > 0}
c x+d
1
f (x) = x
x1 + x2 + 1
1 1
x2
x2
0 C 0
f (C)
−1 −1
−1 0 1 −1 0 1
x1 x1
examples
• nonnegative orthant K = Rn+ = {x ∈ Rn | xi ≥ 0, i = 1, . . . , n}
• positive semidefinite cone K = Sn+
• nonnegative polynomials on [0, 1]:
x !K y ⇐⇒ y − x ∈ K, x ≺K y ⇐⇒ y − x ∈ int K
examples
• componentwise inequality (K = Rn+)
x !Rn+ y ⇐⇒ xi ≤ yi , i = 1, . . . , n
x !K y, u !K v =⇒ x + u !K y + v
3. Convex functions
• quasiconvex functions
3–1
Definition
f : Rn → R is convex if dom f is a convex set and
(y, f (y))
(x, f (x))
• f is concave if −f is convex
• f is strictly convex if dom f is convex and
convex:
• affine: ax + b on R, for any a, b ∈ R
• exponential: eax, for any a ∈ R
• powers: xα on R++, for α ≥ 1 or α ≤ 0
• powers of absolute value: |x|p on R, for p ≥ 1
• negative entropy: x log x on R++
concave:
• affine: ax + b on R, for any a, b ∈ R
• powers: xα on R++, for 0 ≤ α ≤ 1
• logarithm: log x on R++
extended-value extension f˜ of f is
• dom f is convex
• for x, y ∈ dom f ,
f (y)
f (x) + ∇f (x)T (y − x)
(x, f (x))
f is twice differentiable if dom f is open and the Hessian ∇2f (x) ∈ Sn,
2 ∂ 2f (x)
∇ f (x)ij = , i, j = 1, . . . , n,
∂xi∂xj
convex if P # 0
least-squares objective: f (x) = $Ax − b$22
f (x, y)
! "! "T 1
2 y y
∇2f (x, y) = 3 #0
y −x −x 0
2 2
1 0
convex for y > 0 y 0 −2 x
1 1
∇2f (x) = diag(z) − zz T
(zk = exp xk )
1T z (1T z)2
to show ∇2f (x) # 0, we must verify that v T ∇2f (x)v ≥ 0 for all v:
! 2
! !
T 2 ( k zk vk )( k zk ) − ( k v k zk ) 2
v ∇ f (x)v = ! ≥0
( k zk ) 2
! 2
! 2
!
since ( k v k zk ) ≤( k zk vk )( k zk ) (from Cauchy-Schwarz inequality)
"n 1/n n
geometric mean: f (x) = ( k=1 x k ) on R ++ is concave
α-sublevel set of f : Rn → R:
Cα = {x ∈ dom f | f (x) ≤ α}
epigraph of f : Rn → R:
epi f
f (E z) ≤ E f (z)
prob(z = x) = θ, prob(z = y) = 1 − θ
examples
m
!
f (x) = − log(bi − aTi x), dom f = {x | aTi x < bi, i = 1, . . . , m}
i=1
examples
is convex
examples
• support function of a set C: SC (x) = supy∈C y T x is convex
• distance to farthest point in a set C:
λmax(X) = sup y T Xy
"y"2 =1
composition of g : Rn → R and h : R → R:
f (x) = h(g(x))
examples
• exp g(x) is convex if g is convex
• 1/g(x) is convex if g is concave and positive
composition of g : Rn → Rk and h : Rk → R:
examples
!m
• i=1 log gi (x) is concave if gi are concave and positive
!m
• log i=1 exp gi(x) is convex if gi are convex
is convex
examples
• f (x, y) = xT Ax + 2xT By + y T Cy with
! "
A B
! 0, C"0
BT C
g is convex if f is convex
examples
• f (x) = xT x is convex; hence g(x, t) = xT x/t is convex for t > 0
• negative logarithm f (x) = − log x is convex; hence relative entropy
g(x, t) = t log t − t log x is convex on R2++
• if f is convex, then
T
! T
"
g(x) = (c x + d)f (Ax + b)/(c x + d)
f (x)
xy
(0, −f ∗(y))
Sα = {x ∈ dom f | f (x) ≤ α}
a b c
• f is quasiconcave if −f is quasiconvex
• f is quasilinear if it is quasiconvex and quasiconcave
is quasiconvex