CS 726: Nonlinear Optimization 1 Lecture 04: Convexity and Continuity
CS 726: Nonlinear Optimization 1 Lecture 04: Convexity and Continuity
Michael C. Ferris
February 1 2021
By substituting this bound into the previous integral, we obtain the following
result.
kF (y ) F (x)k ⇢ ky xk
Example
p
F (x) = x 8x 2 [0, 1]
Definition (Continuity)
A function F : Rn ! Rm is said to be continuous if 8✏, 8x, 9 , 8y :
Example
F (x) = x 1 8x 2 (0, 1)
Fact
Lipschitz continuity =) Uniform continuity =) Continuity
D E
2 2 2
r f (z) = sup r f (z)y = sup x, r f (z)y
ky k=1 kxk=1,ky k=1
However,
D E 1 D E 1 D E 1 D E
2 2 2 2
x, r f (z)y = x y , r f (z)(y x) + x, r f (z)x + y , r f (z)y
2 2 2
1 2 2 2
{ kx y k + ⇢ kxk + ky k }
2
1 2 2 2 2
{max{ , 0}(kxk + 2 kxk ky k + ky k ) + ⇢(kxk + ky k )}
2
Hence, r2 f (z) 2 max{ , 0} + ⇢, as required. The Lipschitz continuity now follows easily since
Z 1
2
krf (y ) rf (x)k = r f (x + t(y x))(y x)dt
0
ky xk
Lemma
If F : Rn ! Rm is such that DF is Lipschitz continuous w.r.t. convex set
⌦, then,
L
kF (y ) F (x) DF (x)(y x)k2 ky xk22
2
8x, y 2 ⌦ where L is a Lipschitz constant for DF on ⌦.
Proof.
Problem in homework and also [Wright and Recht(2020), Lemma 2.2].
Example
f (x) = x 2 is a nice example of this proposition.
Example
Note that while f (x) = x12 has a psd (and in fact pd) second derivative,
domf is not convex (domf = R \ {0}) and thus f is not convex.
1
A visualization of the function f (x) = x2 , where the gap at x = 0 is clear.
Lemma
Suppose f is twice continuously di↵erentiable on Rn and convex. Then
(a) f is strongly convex with modulus of convexity m if and only if
r2 f (x) ⌫ mI for all x.
(b) rf is Lipschitz continuous with Lipschitz constant L if and only if
r2 f (x) LI for all x.
Proof.
Statement (a) is proven as [Wright and Recht(2020), Lemma 2.5 (a)].
The statement (b) follows in a similar manner (to generate a lower bound
on eigenvalues) and the implication (d) implies (a) in Lemma 7 (see
above).
Theorem
f (x) = 12 x T Qx + P T x. f is strictly convex () f is strongly convex
() f is coercive.
Proof.
Exercise(put the pieces together from the last two lectures)
Proof.
See [Wright and Recht(2020), Theorem 2.6]. The di↵erentiability
assumption is actually not needed, see for example Wikipedia entry on
Convex Analysis/Strong convexity.