Optimization Lectures Formal Note
Optimization Lectures Formal Note
4. Over what domain is the function f (x) = x2 (x2 − 1) convex? Is it strictly convex over the
region(s) specified? Justify your answer.
5. Show that a function f : Rn → R is affine if and only if f is both convex and concave. (A
function f is affine if it is of the form f (x) = α + c> x, where α is a scalar c is an n-vector.)
n
2 , . . . , fk : R → R be convex functions. Consider the function f defined by
6. Let f1 , fP
k
f (x) = j=1 αj fj (x), where αj > 0 for j = 1, 2, . . . , k. Show that f is convex. State and
prove a similar result for concave functions.
1
f (y) = inf{ky − xk : x ∈ S}.
Note that f (y) gives the distance from y to the set S and is called the distance function. Prove
that f is convex.
10. Let S = {(x1 , x2 ) : x21 + x22 ≤ 4}. Let f be the distance function defined in Problem 9. Find
the function f explicitly.
12. Let F be a cumulative Rdistribution function for a random variable b, that is, F (y) = Prob(b ≤
z
y). Show that φ(z) = −∞ F (y)dy is a convex function. Is φ convex for any nondecreasing
function F ?
(α> x)2
f (x) = β>x
,
where S is a convex subset of Rn , α and β are vectors in Rn , and where β t x > 0 for all x ∈ S.
Derive an explicit expression for the Hessian of f , and hence verify that f is convex over S.
2 2
14. Let f (x1 , x2 ) = e2x1 −x2 − 3x1 + 5x2 . Give the linear and quadratic approximations of f at
(1, 1). Are these approximations convex, concave, or neither? Why?
Write a necessary condition for optimality and verify that it is satisfied by the point (2, 4). Is
this the optimal point? Why?
17. (a) Give an example of strictly convex function that is not bounded below and not strongly
convex.
(b) Give an example of convex function that is neither strictly convex nor affine.
(c) Give an example of strictly convex function but its strict convexity cannot be checked
by second order derivative test.
2
18. Let S be a nonempty open convex set in R, and f : S → R be infinitely many times
differentiable on S. Prove that f is strictly convex iff for each x̄ ∈ S, there exists an even n
such that f (n) (x̄) > 0, while f (j) (x̄) = 0 for any 1 < j < n.
x21
19. Show that the function f : R × R+ → R given by f (x1 , x2 ) = x2
is convex.
(
0, if x ∈ C
20. Prove that a nonempty set C ⊆ Rn is convex iff its indicator function fC (x) =
+∞, if x ∈
/C
is convex.
Note: Indicator function allows to pass constraint to objective:
21. Given an arbitrary set S ⊆ Rn , not necessarily convex, the support function associated with
S is σS : Rn → R ∪ {+∞} given by σS (z) = sup x> z. Prove that σS is a convex function
x∈S
on Rn .
Note:
22. Let S be a nonempty convex set in Rn . Prove that the distance function f (x) = inf{ky − xk :
x ∈ S} is a convex function.
24. Let S be a nonempty convex subset of Rn , and f : S → R. Prove that the following three are
equivalent:
3
Solutions
Solution to Problem 5.
for α ∈ (0, 1)
for α = 0
for α = 1
g(αx) = αg(x).
4
⇒ g(x) + g(−x) = 0
⇒ g(−x) = −g(x).
Step 3: Claim g(αx) = αg(x) ∀ α < 0.
Therefore, g is a linear
Pn function.
n
Take x ∈ R . x = i=1 xi ei
n
X
x= xi ei , ei → i-th unit vector
i=1
n
X
⇒ g(x) = xi g(ei ) = a1 x1 + a2 x2 + · · · + an xn , ai = g(ei )
i=1
⇒ g(x) = at x, where a = [a1 a2 · · · an ]t
Therefore,
Solution to Problem 6.
Let x1 , x2 ∈ Rn and λ ∈ (0, 1). Hence
k
X
h(λx1 + (1 − λ)x2 ) = αj fj (λx1 + (1 − λ)x2 )
j=1
k
X
≤ αj λfj (x1 ) + (1 − λ)fj (x2 )
j=1
k
! k
!
X X
= λ αj fj (x1 ) + (1 − λ) αj fj (x2 )
j=1 j=1
= λh(x1 ) + (1 − λ)h(x2 ).
Solution to Problem 7.
Let x1 , x2 ∈ Rn and λ ∈ (0, 1). Thus,
5
Solution to Problem 8.
Let x1 , x2 ∈ Rn and λ ∈ (0, 1). Thus,
(g ◦ h)(λx1 + (1 − λ)x2 ) = g h(λx1 + (1 − λ)x2 )
≤ g λh(x1 ) + (1 − λ)h(x2 )
= λg h(x1 ) + (1 − λ)g h(x2 )
= λ(g ◦ h)(x1 ) + (1 − λ)(g ◦ h)(x2 ).
6
Solution to Problem 13.
(αt x)2
f (x) =
β tx
αt x 2 αt x
f x1 −β (
1 βtx ) + 2( βtx
)α 1
αt x 2 αt x
fx −β ( ) + 2( )α
2 2 βtx βtx 2
∇f (x) = .. = ..
. .
f xn αt x 2 αt x
−βn ( β t x ) + 2( β t x )αn
(∇fx1 )t
(∇fx )t
2
H(f ) = ..
.
(∇fxn )t
t t t t t
−2β1 αβ t xx α1 β(βx−β
t x)2
1α x
+ 2α1 α1 β(βx−β 1α x
t x)2
t t t t t
−2β1 αβ t xx α2 β(βx−β 2α x
+ 2α1 α2 β(βx−β 2α x
t x)2 t x)2
∇f (x1 ) = ..
7
.
t t t t t
−2β1 αβ t xx αn β(βx−β
t x)2
nα x
+ 2α1 αn β(βx−β nα x
t x)2
−β1 (αt x)(α1 β t x − β1 αt x) + α1 (α1 (β t x)2 − β1 (αt x)(β t x))
t t t t 2 t t
2 −β1 (α x)(α2 β x − β2 α x) + α1 (α2 (β x) − β2 (α x)(β x))
= t 3 ..
(β x)
.
t t t t 2 t t
−β1 (α x)(αn β x − βn α x) + α1 (αn (β x) − βn (α x)(β x))
(β1 αt x − α1 β t x)2
t t t t
2 (β1 α x − α1 β x)(β2 α x − α2 β x)
= t 3 ..
(β x)
.
t t t t
(β1 α x − α1 β x)(βn α x − αn β x)
(β1 αt x − α1 β t x)2 (β1 αt x − α1 β t x)(β2 αt x − α2 β t x) ··· (β1 αt x − α1 β t x)(βn αt x − αn β t x)
t t t t
2 (β2 α x − α2 β x)(β1 α x − α1 β x)
(β2 αt x − α2 β t x)2 ··· (β2 αt x − α2 β t x)(βn αt x − αn β t x)
∴ H(f ) = t 3
.. .. ..
(β x)
. . ··· .
(βn α x − αn β x)(β1 α x − α1 β x) (βn α x − αn β x)(β2 αt x − α2 β t x)
t t t t t t
··· (βn αt x − αn β t x)2
2
=⇒ at H(f )a = {(β1 αt x − α1 β t x)2 a21 + (β2 αt x − α2 β t x)2 a22 + · · · + (βn αt x − αn β t x)2 a2n
(β t x)3
+ 2(β1 αt x − α1 β t x)(β2 αt x − α2 β t x)a1 a2
+ 2(β1 αt x − α1 β t x)(β3 αt x − α3 β t x)a1 a3
+ ···
+ 2(βn−1 αt x − αn−1 β t x)(βn αt x − αn β t x)an−1 an }
2
= (β t x)3
((β1 αt x − α1 β t x)a1 + (β2 αt x − α2 β t x)a2 + · · · + (βn αt x − αn β t x)an )2
≥ 0, ∀ a ∈ Rn