Chapter - 2 - Convex Function
Convex Functions
TANN Chantara
October 9, 2022
1 Definitions
2 Checking Convexity
4 Schur Lemma
5 Generalized Inequalities
6 Summary
dom(f) = {x ∈ Rn : f(x) < ∞}
Convex Functions
Definition: A function f : Rn → (−∞, ∞] is called convex if its epigraph
is a convex set.
Sublevel Sets
Definition: The α-sublevel set of a function f : Rn → (−∞, ∞] is defined
as Cα = {x : f(x) ≤ α}
f(X) = tr(A⊤ X) = Σ_{i=1}^{m} Σ_{j=1}^{n} Aij Xij , (A ∈ Rm×n )
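The identity tr(A⊤ X) = Σ Aij Xij can be checked numerically. The sketch below assumes NumPy; the matrix sizes, the seed, and the helper names trace_form and double_sum are illustrative choices, not part of the slides.

```python
import numpy as np

def trace_form(A, X):
    """f(X) = tr(A^T X)."""
    return np.trace(A.T @ X)

def double_sum(A, X):
    """Sum over i, j of A_ij * X_ij."""
    return np.sum(A * X)

rng = np.random.default_rng(0)       # fixed seed, illustrative only
A = rng.standard_normal((3, 4))      # hypothetical sizes m = 3, n = 4
X = rng.standard_normal((3, 4))
Y = rng.standard_normal((3, 4))

# Both expressions for f(X) should agree up to rounding.
print(np.isclose(trace_form(A, X), double_sum(A, X)))

# f is linear in X, so the convexity inequality holds with equality.
theta = 0.3
mixed = trace_form(A, theta * X + (1 - theta) * Y)
combo = theta * trace_form(A, X) + (1 - theta) * trace_form(A, Y)
print(np.isclose(mixed, combo))
```

Since f is linear, it is both convex and concave.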
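Proposition: A function f : Rn → (−∞, ∞] is convex if and only if, for all x, y ∈ Rn , the function g(t) = f(x + ty) is convex in t.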
Proof:
⇒: For any x, y ∈ Rn and θ ∈ (0, 1), consider t = θa + (1 − θ)b for
arbitrary a, b ∈ R
g(t) = g(θa + (1 − θ)b) = f(x + (θa + (1 − θ)b)y)
= f(θ(x + ay) + (1 − θ)(x + by))
≤ θf(x + ay) + (1 − θ)f(x + by) = θg(a) + (1 − θ)g(b)
Proof: Cont’d
⇐: For any x, y ∈ Rn , θ ∈ (0, 1) and t1 , t2 ∈ R
f(x + (θt1 + (1 − θ)t2 )y) = g(θt1 + (1 − θ)t2 )
≤ θg(t1 ) + (1 − θ)g(t2 ) = θf(x + t1 y) + (1 − θ)f(x + t2 y)
For t1 = 0, t2 = 1, x = x′ , y = y′ − x′ we get
f(θx′ + (1 − θ)y′ ) ≤ θf(x′ ) + (1 − θ)f(y′ )
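The restriction g(t) = f(x + ty) used in this proof also gives a cheap numerical sanity test: sample random lines and look for violations of the one-dimensional convexity inequality. A minimal sketch, assuming log-sum-exp as the convex test function (an illustrative choice; the helper names are hypothetical):

```python
import math
import random

def f(x):
    """Log-sum-exp, a standard convex function on R^n (illustrative choice)."""
    m = max(x)
    return m + math.log(sum(math.exp(xi - m) for xi in x))

def restrict_to_line(f, x, y):
    """Return g(t) = f(x + t*y)."""
    return lambda t: f([xi + t * yi for xi, yi in zip(x, y)])

def violates_convexity(g, a, b, theta, tol=1e-9):
    """True if g(theta*a + (1-theta)*b) > theta*g(a) + (1-theta)*g(b) + tol."""
    lhs = g(theta * a + (1 - theta) * b)
    rhs = theta * g(a) + (1 - theta) * g(b)
    return lhs > rhs + tol

random.seed(0)  # fixed seed, illustrative only
violations = 0
for _ in range(1000):
    x = [random.uniform(-2, 2) for _ in range(3)]
    y = [random.uniform(-2, 2) for _ in range(3)]
    g = restrict_to_line(f, x, y)
    a, b = random.uniform(-3, 3), random.uniform(-3, 3)
    violations += violates_convexity(g, a, b, random.random())
print(violations)
```

Finding no violations does not prove convexity; a single violation, however, disproves it.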
Univariate Functions
Proof:
⇒: If x, y ∈ R, 0 < t ≤ 1, then
f(x + t(y − x)) ≤ (1 − t)f(x) + tf(y) (convexity)
f(y) − f(x) ≥ [f(x + t(y − x)) − f(x)]/t (subtract f(x), divide by t)
f(y) − f(x) ≥ f′ (x)(y − x) (limit t → 0)
⇐: For any x, y ∈ R, 0 < t ≤ 1, let z = tx + (1 − t)y
t(f(x) − f(z)) ≥ tf′ (z)(x − z) (by assumption)
(1 − t)(f(y) − f(z)) ≥ (1 − t)f′ (z)(y − z) (by assumption)
tf(x) + (1 − t)f(y) ≥ f(z) (sum of above, since t(x − z) + (1 − t)(y − z) = 0)
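The two steps of the ⇒ direction can be observed numerically: the difference quotient [f(x + t(y − x)) − f(x)]/t stays below f(y) − f(x) and approaches f′(x)(y − x) as t → 0. A small sketch, assuming f(x) = exp(x) as the convex example (an illustrative choice; the helper names are hypothetical):

```python
import math

def f(x):
    return math.exp(x)

def df(x):
    """Derivative of exp is exp."""
    return math.exp(x)

def difference_quotient(f, x, y, t):
    """[f(x + t(y - x)) - f(x)] / t for 0 < t <= 1."""
    return (f(x + t * (y - x)) - f(x)) / t

def tangent_gap(f, df, x, y):
    """f(y) - f(x) - f'(x)(y - x); nonnegative when f is convex."""
    return f(y) - f(x) - df(x) * (y - x)

x, y = 0.5, 2.0
for t in (1.0, 0.1, 0.001):
    print(t, difference_quotient(f, x, y, t))
print("limit:", df(x) * (y - x))
print("gap:", tangent_gap(f, df, x, y))
```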
Proof:
⇒: g(t) = f(tx + (1 − t)y) is convex in t for any x, y ∈ Rn
g′ (t) = ∇f(tx + (1 − t)y)⊤ (x − y) (definition of g)
g(1) ≥ g(0) + g′ (0) (convexity of g)
f(x) ≥ f(y) + ∇f(y)⊤ (x − y) (substitution)
⇐: x, y ∈ Rn , t, t̃ ∈ R, z = tx + (1 − t)y, z̃ = t̃x + (1 − t̃)y
f(z) ≥ f(z̃) + ∇f(z̃)⊤ (z − z̃) (by assumption)
g(t) ≥ g(t̃) + g′ (t̃)(t − t̃) (definition of g, z, z̃)
Hence g is convex by the univariate first-order condition, and so f is convex.
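The first-order inequality f(x) ≥ f(y) + ∇f(y)⊤ (x − y) can be spot-checked on random pairs of points. The sketch below assumes NumPy and uses log-sum-exp, whose gradient is the softmax vector; the function choice, dimension, and seed are illustrative assumptions.

```python
import numpy as np

def f(x):
    """Log-sum-exp, computed stably by shifting with the maximum."""
    m = np.max(x)
    return m + np.log(np.sum(np.exp(x - m)))

def grad_f(x):
    """Gradient of log-sum-exp: the softmax of x."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def first_order_gap(x, y):
    """f(x) - f(y) - grad f(y)^T (x - y); nonnegative when f is convex."""
    return f(x) - f(y) - grad_f(y) @ (x - y)

rng = np.random.default_rng(0)  # fixed seed, illustrative only
gaps = [first_order_gap(rng.normal(size=4), rng.normal(size=4))
        for _ in range(1000)]
print(min(gaps) >= -1e-12)
```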
∇²f(x) ⪰ 0 ∀x ∈ dom(f)
Univariate Functions∗
2nd-Order Conditions∗
Examples
Negative Log-Determinant
Proposition: The log-determinant function f(X) = − log det(X) is convex
on the set of positive definite matrices Sn++ .
Proof: Homework.
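Before attempting the proof, the claim can be tested numerically on random positive definite matrices. This is only a sanity check under stated assumptions (NumPy, 4×4 matrices, a fixed seed, hypothetical helper names), not a substitute for the proof.

```python
import numpy as np

def neg_log_det(X):
    """f(X) = -log det(X); the caller is assumed to pass a positive definite X."""
    sign, logabsdet = np.linalg.slogdet(X)
    if sign <= 0:
        raise ValueError("det(X) must be positive")
    return -logabsdet

def random_pd(rng, n):
    """Random positive definite matrix: B B^T is PSD, adding I makes it PD."""
    B = rng.standard_normal((n, n))
    return B @ B.T + np.eye(n)

def convexity_gap(f, X, Y, theta):
    """f(theta X + (1-theta) Y) - [theta f(X) + (1-theta) f(Y)]; <= 0 if f is convex."""
    return f(theta * X + (1 - theta) * Y) - (theta * f(X) + (1 - theta) * f(Y))

rng = np.random.default_rng(0)  # fixed seed, illustrative only
worst_gap = max(
    convexity_gap(neg_log_det, random_pd(rng, 4), random_pd(rng, 4), rng.uniform())
    for _ in range(500)
)
print(worst_gap <= 1e-9)
```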
Affine Transformations
is also convex
TANN Chantara (ITC) Convex Functions October 9, 2022
Convexity Preserving Transformations
Examples
Examples Cont’d
4 Maximum eigenvalue f(X) = λmax (X) for X ∈ Sn
Write X = RDR⊤ , with R orthogonal and D = diag(λ1 , ..., λn )
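The decomposition X = RDR⊤ and the convexity of λmax can both be illustrated numerically. A minimal sketch, assuming NumPy; np.linalg.eigh returns eigenvalues in ascending order together with an orthogonal matrix of eigenvectors, and the matrix size and seed are illustrative.

```python
import numpy as np

def lambda_max(X):
    """Largest eigenvalue of a symmetric matrix (eigvalsh sorts ascending)."""
    return np.linalg.eigvalsh(X)[-1]

def random_symmetric(rng, n):
    B = rng.standard_normal((n, n))
    return (B + B.T) / 2

rng = np.random.default_rng(0)  # fixed seed, illustrative only

# Spectral decomposition X = R D R^T with R orthogonal.
X = random_symmetric(rng, 5)
eigvals, R = np.linalg.eigh(X)
reconstructed = R @ np.diag(eigvals) @ R.T
print(np.allclose(reconstructed, X))

# Convexity inequality for lambda_max on random pairs.
worst_gap = -np.inf
for _ in range(500):
    U, V = random_symmetric(rng, 5), random_symmetric(rng, 5)
    theta = rng.uniform()
    gap = lambda_max(theta * U + (1 - theta) * V) - (
        theta * lambda_max(U) + (1 - theta) * lambda_max(V))
    worst_gap = max(worst_gap, gap)
print(worst_gap <= 1e-9)
```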
Composition
Thus f is convex.
Example: f(x) = exp(g(x)) is convex if g is convex.
Generalizations
Minimization
is convex
Proof: Assume that the inner problem is solvable, i.e., the minimum over y is
attained for every x ∈ dom(h). Choose x1 , x2 ∈ dom(h) and let y1 , y2 ∈ C be the
corresponding minimizers, i.e., h(xi ) = f(xi , yi ) for i = 1, 2. Then, for any
θ ∈ [0, 1],
Minimization Cont’d
h(θx1 + (1 − θ)x2 ) = min_y {f(θx1 + (1 − θ)x2 , y) : g(θx1 + (1 − θ)x2 , y) ≤ 0}
≤ f(θx1 + (1 − θ)x2 , θy1 + (1 − θ)y2 )
≤ θf(x1 , y1 ) + (1 − θ)f(x2 , y2 )
= θh(x1 ) + (1 − θ)h(x2 )
Thus, h is convex. If the problem is not solvable, one can use a similar
argument with ε-optimal solutions and let ε → 0.
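A small worked instance of partial minimization: for the jointly convex f(x, y) = x² + xy + y² and C = [−10, 10], the inner minimizer is y = −x/2 (for |x| ≤ 20), so h(x) = (3/4)x². The sketch below is a hypothetical example, not taken from the slides; it computes h by ternary search over y, which is valid here because f(x, ·) is convex.

```python
import random

def f(x, y):
    """Jointly convex quadratic (Hessian [[2, 1], [1, 2]] is positive definite)."""
    return x * x + x * y + y * y

def h(x, lo=-10.0, hi=10.0, iters=200):
    """h(x) = min over y in [lo, hi] of f(x, y), via ternary search on y."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if f(x, m1) <= f(x, m2):
            hi = m2   # a minimizer lies in [lo, m2]
        else:
            lo = m1   # a minimizer lies in [m1, hi]
    return f(x, (lo + hi) / 2)

random.seed(0)  # fixed seed, illustrative only
worst_gap = float("-inf")
for _ in range(500):
    x1, x2 = random.uniform(-4, 4), random.uniform(-4, 4)
    theta = random.random()
    gap = h(theta * x1 + (1 - theta) * x2) - (theta * h(x1) + (1 - theta) * h(x2))
    worst_gap = max(worst_gap, gap)
print(worst_gap <= 1e-9)
```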
Schur Lemma
Lemma (Schur): Consider X ∈ Sn partitioned as
X = [ A   B ]
    [ B⊤  C ],
where C ≻ 0. Then
X ⪰ 0 ⇐⇒ A − BC⁻¹B⊤ ⪰ 0
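The equivalence in the lemma can be checked numerically by comparing the positive semidefiniteness of X with that of the Schur complement A − BC⁻¹B⊤ on random blocks. A sketch assuming NumPy; block sizes, the eigenvalue tolerance, the seed, and the helper names are illustrative assumptions.

```python
import numpy as np

def schur_complement(A, B, C):
    """A - B C^{-1} B^T, using a linear solve instead of an explicit inverse."""
    return A - B @ np.linalg.solve(C, B.T)

def is_psd(M, tol=1e-10):
    """PSD test via the smallest eigenvalue of the symmetric part of M."""
    return bool(np.linalg.eigvalsh((M + M.T) / 2)[0] >= -tol)

def assemble(A, B, C):
    """Build X = [[A, B], [B^T, C]]."""
    return np.block([[A, B], [B.T, C]])

rng = np.random.default_rng(0)  # fixed seed, illustrative only
agree = True
for _ in range(300):
    G = rng.standard_normal((3, 3))
    A = (G + G.T) / 2 + rng.uniform(0, 4) * np.eye(3)  # symmetric, sometimes PSD
    B = rng.standard_normal((3, 2))
    H = rng.standard_normal((2, 2))
    C = H @ H.T + np.eye(2)                            # positive definite
    agree &= is_psd(assemble(A, B, C)) == is_psd(schur_complement(A, B, C))
print(agree)
```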
Distance function
Perspective function
is convex in (x, t)
Proof: Choose (x1 , t1 ), (x2 , t2 ) ∈ dom(g) and θ ∈ [0, 1], then
g(θ(x1 , t1 ) + (1 − θ)(x2 , t2 )) = (θt1 + (1 − θ)t2 ) f((θx1 + (1 − θ)x2 )/(θt1 + (1 − θ)t2 ))
= (θt1 + (1 − θ)t2 ) f((θt1 (x1 /t1 ) + (1 − θ)t2 (x2 /t2 ))/(θt1 + (1 − θ)t2 ))
≤ θt1 f(x1 /t1 ) + (1 − θ)t2 f(x2 /t2 )
= θg(x1 , t1 ) + (1 − θ)g(x2 , t2 )
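The inequality chain above can be illustrated with a concrete perspective: for f(x) = x², the perspective is g(x, t) = t f(x/t) = x²/t on t > 0. A minimal sketch with this illustrative choice of f (the helper names and sampling ranges are assumptions):

```python
import random

def f(x):
    """A convex function chosen for illustration: f(x) = x^2."""
    return x * x

def perspective(f, x, t):
    """g(x, t) = t * f(x / t), defined for t > 0."""
    if t <= 0:
        raise ValueError("t must be positive")
    return t * f(x / t)

random.seed(0)  # fixed seed, illustrative only
worst_gap = float("-inf")
for _ in range(1000):
    x1, x2 = random.uniform(-3, 3), random.uniform(-3, 3)
    t1, t2 = random.uniform(0.1, 5), random.uniform(0.1, 5)
    theta = random.random()
    lhs = perspective(f, theta * x1 + (1 - theta) * x2,
                      theta * t1 + (1 - theta) * t2)
    rhs = theta * perspective(f, x1, t1) + (1 - theta) * perspective(f, x2, t2)
    worst_gap = max(worst_gap, lhs - rhs)
print(worst_gap <= 1e-9)
```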
Relative Entropy
is convex.
Proof: The negative logarithm f(x) = − log(x) is convex on R++ . We
therefore conclude that its perspective function
Summary