Lecture14 KKT
Lecture14 KKT
MFDS Team
Introduction
maxλ∈Rm − λT b
subject to c + AT λ = 0
λ≥0
Linear programming
minx ∈Rd
2
x Qx + c T x subject to
1 T
Ax ≤ b
▶ Here A ∈ Rm×d , b ∈ Rm , c ∈ Rd
Quadratic programming
minimize f (x ) subject to
gi (x ) ≤ 0 ∀i ∈ [m]
hj (x ) = 0 ∀j ∈ [p]
i=1 j=1
max F (λ, ν)
subject to λ ≥ 0
Slater’s condition
(u , v , t) ∈ A =⇒ λ̃T u + ν̃ T v + µt ≥ α
(0, 0, t) ∈ B =⇒ λ̃T u + ν̃ T v + µt ≤ α
We can see from the above that λ̃ ≥ 0 and µ > 0. This is because
if (u , v , t) ∈ A then (k u , v , kt), k > 1 ∈ A, and a negative λ̃, µ will
make the left hand-side of the inequality arbitrarily small, so that it
cannot be lower-bounded by α.
The bottom condition means that µt ≤ α for t < p ∗ which means
that µp ∗ ≤ α.
Proof of Slater’s condition
For any x ∈D
i=m
λ̃i gi (x ) + ν T (Ax − b) + µf (x ) ≥ α ≥ µp ∗
X
i=1
There are now two cases: µ > 0 and µ = 0. When µ > 0, we can
divide the left and right-hand sides to get
f (x ∗ ) = F (λ∗ , ν∗)
= infx (f (x ) + λ∗i gi (x ) + νi∗ hi (x ))
X X
i∈[m] i∈[p]
≤ f (x ∗ ) + λ∗i gi (x ∗ ) + νi∗ hi (x ∗ )
X X
i∈[m] i∈[p]
≤ f (x ∗)
Proof of complementary slackness
The previous claim we proved establishes that for the primal and
dual feasible solutions, the fourth KKT condition must hold, i.e
λ∗i gi (x ∗ ) = 0 ∀i ∈ [m]. The previous claim also establishes that
x ∗ ∈ argminx L(x , λ∗ , ν ∗ ), which means that the gradient of L must
vanish at x ∗ . Thus the last KKT condition must hold true.
Sufficiency - KKT conditions for strong duality
Now we will show that if we assume the KKT conditions and the
problem is convex, we have strong duality. The first two conditions
gi (x ∗ ) ≤ 0 ∀i ∈ [m] and hi (x ∗ ) = 0 ∀i ∈ [p]
The condition λ∗i ≥ 0 ∀i ∈ [m] together with the information that
f and the constraints gi are convex and the constraints hi are
affine enable us to establish
Pi=m that
∗ ∗
i=1 νi hi (x ) is a convex
L(x , λ , ν ) = f (x ) + i=1 λ∗i gi (x ) + i=m ∗
P
function.
By the last condition we see that the gradient of this convex
function vanishes at x ∗ which means x ∗ is a local and global
minimum.
Sufficiency - KKT conditions for strong duality
Thus we have
F (λ∗ , ν ∗ ) = L(x ∗ , λ∗ , ν ∗ )
i=m i=p
= f (x ) + x νi∗ hi (x ∗ )
X X
∗
λ∗i gi ( ∗ ) +
i=1 i=1
= f (x ∗ )