Lecture 3 SI 416 2025
Harsha Hutridurga
IIT Bombay
$x^*$ is a local minimizer of $f$ $\implies$ $\nabla f(x^*) = 0$ and $\langle \nabla^2 f(x^*) p, p \rangle \ge 0$ for all $p \in \mathbb{R}^n$
$\nabla f(x^*) = 0$ and $\langle \nabla^2 f(x^*) p, p \rangle > 0$ for all $p \in \mathbb{R}^n \setminus \{0\}$ $\implies$ $x^*$ is a strict local minimizer of $f$
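The two second-order tests above can be tried on small examples. The following is a minimal sketch for functions of two variables, assuming the gradient is already known to vanish at the point and the Hessian there is supplied by hand. The helper names `sym2x2_eigenvalues` and `classify_stationary_point` and the three sample Hessians are illustrative choices, not part of the lecture.

```python
import math

def sym2x2_eigenvalues(a, b, c):
    """Eigenvalues of the symmetric matrix [[a, b], [b, c]], smallest first."""
    mean = (a + c) / 2.0
    radius = math.hypot((a - c) / 2.0, b)
    return mean - radius, mean + radius

def classify_stationary_point(hessian):
    """Apply the second-order tests at a point where the gradient vanishes."""
    (a, b), (_, c) = hessian
    smallest, _ = sym2x2_eigenvalues(a, b, c)
    if smallest > 0:
        # <Hp, p> > 0 for all p != 0: the sufficient condition holds.
        return "strict local minimizer"
    if smallest < 0:
        # <Hp, p> < 0 for some p: the necessary condition fails.
        return "not a local minimizer"
    # Positive semidefinite but singular: the tests decide nothing.
    return "inconclusive"

# Hessian of x^2 + x*y + y^2 at the origin.
bowl = classify_stationary_point([[2.0, 1.0], [1.0, 2.0]])
# Hessian of x^2 - y^2 at the origin.
saddle = classify_stationary_point([[2.0, 0.0], [0.0, -2.0]])
# Hessian of x^2 + y^4 at the origin.
flat = classify_stationary_point([[2.0, 0.0], [0.0, 0.0]])
```

The third case shows the gap between the two statements: the necessary condition holds, the sufficient one does not, so the Hessian alone cannot settle the question.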
When $f$ is convex: $x^*$ is a local minimizer of $f$ $\iff$ $x^*$ is a global minimizer of $f$
When $f$ is convex and differentiable: $x^*$ satisfies $f(x^*) \le f(x)$ for all $x \in \mathbb{R}^n$ $\iff$ $x^*$ satisfies $\nabla f(x^*) = 0$
$f$ is convex $\iff$ for every $x, y \in \mathbb{R}^n$, $f(y) \ge f(x) + \langle \nabla f(x), y - x \rangle$
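The first-order inequality can be probed numerically. The sketch below samples random pairs $(x, y)$ and looks for a violation. The function names, the sampling box $[-5, 5]^3$, and the two test functions ($\|x\|^2$ and $-\|x\|^2$) are assumptions made for illustration. Passing the test does not prove convexity; a single violation does disprove it.

```python
import random

def first_order_gap(f, grad, x, y):
    """Return f(y) - f(x) - <grad f(x), y - x>; nonnegative when f is convex."""
    slope = sum(g * (yi - xi) for g, xi, yi in zip(grad(x), x, y))
    return f(y) - f(x) - slope

def passes_first_order_test(f, grad, dim=3, trials=1000, seed=0, tol=1e-9):
    """Sample random pairs and report whether the inequality held on all of them."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.uniform(-5, 5) for _ in range(dim)]
        y = [rng.uniform(-5, 5) for _ in range(dim)]
        if first_order_gap(f, grad, x, y) < -tol:
            return False
    return True

def sq_norm(x):
    return sum(xi * xi for xi in x)

def sq_norm_grad(x):
    return [2 * xi for xi in x]

def neg_sq_norm(x):
    return -sq_norm(x)

def neg_sq_norm_grad(x):
    return [-2 * xi for xi in x]

convex_ok = passes_first_order_test(sq_norm, sq_norm_grad)
concave_ok = passes_first_order_test(neg_sq_norm, neg_sq_norm_grad)
```

For $\|x\|^2$ the gap equals $\|y - x\|^2$, so no violation can occur; for $-\|x\|^2$ the gap equals $-\|y - x\|^2$ and the test fails on the first sampled pair.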
Theorem
A twice differentiable function $f : \mathbb{R}^n \to \mathbb{R}$ is convex if and only if the
Hessian matrix $\nabla^2 f(x)$ is positive semidefinite for all $x \in \mathbb{R}^n$.
Theorem
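A differentiable function $f : \mathbb{R}^n \to \mathbb{R}$ is strictly convex if and only if, for every $x, y \in \mathbb{R}^n$ with $x \neq y$, $f(y) > f(x) + \langle \nabla f(x), y - x \rangle$.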
♣ The proof of the above result is analogous to the convex case:
replace the inequalities by strict inequalities.
Theorem
A strictly convex function $f : \mathbb{R}^n \to \mathbb{R}$ has at most one global minimizer.
Theorem
Let $f : \mathbb{R}^n \to \mathbb{R}$ be a twice continuously differentiable function such that
its Hessian matrix $\nabla^2 f(x)$ is positive definite for all $x \in \mathbb{R}^n$.
Then, the function $f$ is strictly convex.
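A standard one-dimensional example (not taken from the slides) suggests that the converse of this theorem fails: strict convexity does not force the Hessian to be positive definite everywhere.

```latex
f(x) = x^4, \qquad f''(x) = 12x^2, \qquad f''(0) = 0 .
% f is strictly convex on \mathbb{R}, yet its second derivative vanishes at x = 0,
% so positive definiteness of the Hessian is sufficient but not necessary.
```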
♣ Here $\|x\|^2 := \sum_{i=1}^{n} x_i^2$
Lemma
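If $f : \mathbb{R}^n \to \mathbb{R}$ is a twice continuously differentiable strongly convex
function, then there exists a $\lambda > 0$ such that $\langle \nabla^2 f(x) p, p \rangle \ge 2\lambda \|p\|^2$ for all $x, p \in \mathbb{R}^n$.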
Lemma
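If $f : \mathbb{R}^n \to \mathbb{R}$ is a twice continuously differentiable strongly convex
function, then there exists a $\lambda > 0$ such that for all $x, y \in \mathbb{R}^n$, $f(y) \ge f(x) + \langle \nabla f(x), y - x \rangle + \lambda \|y - x\|^2$.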
i.e. $-\|u\| \, \|v\| \le \langle u, v \rangle \le \|u\| \, \|v\|$
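♣ Consider a function $f : \mathbb{R}^n \to \mathbb{R}$.
$f$ is strongly convex $\implies$ $f$ is strictly convex $\implies$ $f$ is convex
$f$ is convex $\iff$ for every $x, y \in \mathbb{R}^n$, $f(y) \ge f(x) + \langle \nabla f(x), y - x \rangle$
$f$ is convex $\iff$ for every $x \in \mathbb{R}^n$, $\langle \nabla^2 f(x) p, p \rangle \ge 0$ for all $p \in \mathbb{R}^n$
$f$ is strictly convex $\iff$ for every $x, y \in \mathbb{R}^n$, $x \neq y$, $f(y) > f(x) + \langle \nabla f(x), y - x \rangle$
$f$ is strictly convex $\Longleftarrow$ for every $x \in \mathbb{R}^n$, $\langle \nabla^2 f(x) p, p \rangle > 0$ for all $p \in \mathbb{R}^n \setminus \{0\}$
$f$ is strongly convex $\implies$ for every $x \in \mathbb{R}^n$, $\langle \nabla^2 f(x) p, p \rangle \ge 2\lambda \|p\|^2$ for all $p \in \mathbb{R}^n$
$f$ is strongly convex $\implies$ for every $x, y \in \mathbb{R}^n$, $f(y) \ge f(x) + \langle \nabla f(x), y - x \rangle + \lambda \|y - x\|^2$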
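Three one-dimensional examples (illustrative, not from the slides) suggest that neither implication in the chain strongly convex $\implies$ strictly convex $\implies$ convex can be reversed. The last line assumes the usual definition of strong convexity, namely that $f(x) - \lambda \|x\|^2$ is convex for some $\lambda > 0$.

```latex
% Convex but not strictly convex: the first-order inequality holds with equality.
f(x) = x: \qquad f(y) = f(x) + f'(x)(y - x) \quad \text{for all } x, y .

% Strictly convex but not strongly convex: f'' > 0 everywhere, yet
% f''(x) \ge 2\lambda fails for every \lambda > 0 as x \to -\infty.
f(x) = e^x: \qquad f''(x) = e^x > 0, \qquad \inf_{x \in \mathbb{R}} f''(x) = 0 .

% Strongly convex with \lambda = 1: f(x) - \lambda x^2 = 0 is convex.
f(x) = x^2: \qquad f''(x) = 2 = 2\lambda \quad \text{with } \lambda = 1 .
```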
strong convexity – further properties
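♣ Note that
$$\|\alpha x + (1-\alpha) y\|^2 = \alpha^2 \|x\|^2 + (1-\alpha)^2 \|y\|^2 + 2\alpha(1-\alpha) \langle x, y \rangle$$
♣ Hence we get
$$\begin{aligned}
&\|\alpha x + (1-\alpha) y\|^2 - \alpha \|x\|^2 - (1-\alpha) \|y\|^2 \\
&\quad = \alpha^2 \|x\|^2 + (1-\alpha)^2 \|y\|^2 + 2\alpha(1-\alpha) \langle x, y \rangle - \alpha \|x\|^2 - (1-\alpha) \|y\|^2 \\
&\quad = -\alpha(1-\alpha) \|x\|^2 - (1-\alpha)\alpha \|y\|^2 + 2\alpha(1-\alpha) \langle x, y \rangle \\
&\quad = -\alpha(1-\alpha) \|x - y\|^2
\end{aligned}$$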
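The identity just derived is easy to sanity-check in floating point. The sketch below evaluates both sides on random vectors and records the largest discrepancy. The dimension 4, the sampling range, and the helper names `dot` and `identity_residual` are arbitrary choices for illustration.

```python
import random

def dot(u, v):
    """Euclidean inner product <u, v>."""
    return sum(ui * vi for ui, vi in zip(u, v))

def identity_residual(x, y, alpha):
    """Left side minus right side of the identity; zero up to rounding."""
    z = [alpha * xi + (1 - alpha) * yi for xi, yi in zip(x, y)]
    diff = [xi - yi for xi, yi in zip(x, y)]
    lhs = dot(z, z) - alpha * dot(x, x) - (1 - alpha) * dot(y, y)
    rhs = -alpha * (1 - alpha) * dot(diff, diff)
    return lhs - rhs

rng = random.Random(0)
worst = 0.0
for _ in range(1000):
    x = [rng.uniform(-1, 1) for _ in range(4)]
    y = [rng.uniform(-1, 1) for _ in range(4)]
    alpha = rng.random()
    worst = max(worst, abs(identity_residual(x, y, alpha)))
```

After the loop, `worst` should be on the order of machine rounding error. The algebra does not use $\alpha \in [0, 1]$, so the identity holds for every real $\alpha$; the restriction matters only when it is combined with convexity.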
Lemma
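If $f : \mathbb{R}^n \to \mathbb{R}$ is a strongly convex function, then there exists a $\lambda > 0$
such that for all $x, y \in \mathbb{R}^n$ and $\alpha \in [0, 1]$, $f(\alpha x + (1-\alpha) y) \le \alpha f(x) + (1-\alpha) f(y) - \lambda \alpha (1-\alpha) \|x - y\|^2$.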
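A sketch of how the norm identity above leads to an inequality for convex combinations, assuming strong convexity means that $g(x) := f(x) - \lambda \|x\|^2$ is convex for some $\lambda > 0$ (the symbol $g$ is introduced here only for the derivation):

```latex
\begin{aligned}
% convexity of g along the segment from y to x
g(\alpha x + (1-\alpha) y) &\le \alpha g(x) + (1-\alpha) g(y) \\
% substitute g = f - \lambda \|\cdot\|^2 and move the norm terms to the right
f(\alpha x + (1-\alpha) y) &\le \alpha f(x) + (1-\alpha) f(y)
  + \lambda \left( \|\alpha x + (1-\alpha) y\|^2 - \alpha \|x\|^2 - (1-\alpha) \|y\|^2 \right) \\
% apply the identity for the bracketed term
&= \alpha f(x) + (1-\alpha) f(y) - \lambda \alpha (1-\alpha) \|x - y\|^2 .
\end{aligned}
```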