Convex Optimization Cheatsheet
Convex Optimization Cheatsheet
MARK SCHMIDT
2 f (x) 0.
For a differentiable convex f , the following conditions are equivalent to the condition that the gradient f 0 is
L-Lipschitz continuous:
2 f (x) LI.
The following conditions are equivalent to the condition that a differentiable f is -strongly convex:
x 7 f (x) kxk2 is convex
2
f (y) f (x) + hf 0 (x), y xi + kx yk2
2
hf 0 (x) f 0 (y), x yi kx yk2
(1 )
f (x + (1 )y)) f (x) + (1 )f (y) kx yk2 .
2
1
The following are not equivalent to -strong convexity but are implied by it:
kf 0 (x) f 0 (y)k kx yk
1
f (y) f (x) + hf 0 (x), y xi + kf 0 (x) f 0 (y)k2
2
1 0
hf 0 (x) f 0 (y), x yi
kf (x) f 0 (y)k2 .
For a twice-differentiable f strong-convexity is equivalent to:
2 f (x) I.
If f is -strongly convex and f 0 is L-Lipschitz continuous then we have
L 1
hf 0 (x) f 0 (y), x yi kx yk2 + kf 0 (x) f 0 (y)k2 .
+L +L