Convex Sets and Jensen's Inequality
Convex Sets and Jensen's Inequality
ANDREW D SMITH
School of Mathematics and Statistics
University College Dublin
3
This says that if two points, a and b lie in the set, then so does the
straight line segment connecting a to b.
Which of these sets are convex?
4
y y
x x
y
y
x x
Proof We prove this in two stages. Firstly, we show that this def-
inition implies that the epigraph is convex (the ‘if’ part), and then
that a convex epigram implies this inequality (the ‘only if’ part).
If. Suppose that the inequality holds. We need to show that the
epigraph is convex.
Suppose then that the vectors (x, a) and (y, b) are in the epigraph,
which is equivalent to:
a ≥ f (x)
b ≥ f (y)
≥ f (λx + (1 − λ)y)
4 100
3.5
80
3
2.5 60
2
1.5 40
1
20
0.5
0 0
-2 -1.5 -1 -0.5 0 0.5 1 1.5 2 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2
4 2.5
3.5
3 2
2.5 1.5
2
1.5 1
1 0.5
0.5
0 0
0 0.5 1 1.5 2 2.5 3 3.5 4 -2 -1.5 -1 -0.5 0 0.5 1 1.5 2
8
Jensen’s Inequality:
Let f (x) be a convex function and let w1, w2, . . . wn be weights with
• wj ≥ 0
• nj=1 wj = 1
P
• wj ≥ 0
• kj=1 wj = 1
P
w1 w2 wk−1
f x1 + x2 + . . . + xk−1
1 − wk 1 − wk 1 − wk
w1f (x1) + w2f (x2) + . . . wk−1f (xk−1)
≤
1 − wk
9
f (xk ) ≤ f (xk )
f (w1x1 + w2x2 + . . . wk xk ) ≤
w1 w2 wk−1
(1 − wk )f x1 + x2 + . . . + xk−1 + wk f (xk )
1 − wk 1 − wk 1 − wk
Combining these last two inequalities, we finally have proved the
inductive hypothesis when n = k:
f (w1x1 + w2x2 + . . . wk xk )
Example Problem
Show that:
√ √ √ n√ 2
12 + 1 + 22 + 1 + . . . + n2 + 1 ≥ n + 2n + 5
2
Solution: Apply Jensen’s inequality to the convex function f (x) =
√
1 + x2 at the points xn = n with weight 1/n. Then
√ √ √
1 + 1 + 2 + 1 + . . . + n2 + 1
2 2
n
s 2
1 + 2 + ... + n
≥ 1+
n
r
(n + 1)2
= 1+
4
1p
= (n + 1)2 + 4
2
Multiplying by n, we obtain the result we set out to prove.
12
Therefore: n n
X √ X
xi ≥ xi = 1
i=1 i=1
Equality holds when one of the xi = 1 and all the others are zero.
14
In this expression, the left hand side is the geometric mean and the
right hand side is the arithmetic mean.
Proof If any of the aj are zero then the result holds trivially setting
the left hand side to zero.
So let us suppose all the aj are strictly positive. Then we can write
aj = 10xj for some (positive or negative) xj .
Then applying Jensen’s inequality to the convex function 10x, with
weights equal to 1/n, we have:
n
(x1 +x2 +...xn )/n 1 X xj
10 ≤ 10
n j=1
Applications of AM ≥ GM.
Problem AMGM #1
If {b1, b2, . . . bn} is a permutation of the sequence {a1, a2, . . . an}
of positive real numbers, then show that:
a1 a2 an
+ + ... + ≥n
b1 b2 bn
Example Problem Let x1, x2, . . . xn and y1, y2, . . . yn be real se-
quences, satisfying:
n
X
|xi|p = 1
i=1
Xn
|yi|q = 1
i=1
wi = yiq
xi
zi = q−1
yi
Jensen’s inequality then implies:
" n #p p p
p
q xi
X X X
xi yi ≤ yi p(q−1) = xpi = 1
i=1 i=1 yi i=1
B 0 = {y ∈ Rn : x.y ≤ 1, ∀x ∈ B}
Hölder’s inequality identifies the dual ball in our example. These are
shown in R2 for p = 5 and q = 1.25.
1.5 1.5
1 1
0.5 0.5
0 0
-1.5 -1 -0.5 0 0.5 1 1.5 -1.5 -1 -0.5 0 0.5 1 1.5
-0.5 -0.5
-1 -1
-1.5 -1.5
20
0! = 1
1! = 1
2! = 2
n! = n × (n − 1)!
100
10
1
-1 0 1 2 3 4 5
0.1
21
• x! = x × (x − 1)!
• x! = 10f (x) where f (x) is a convex function.
f (n + x) ≤ (1 − x)f (n) + xf (n + 1)
f (n) ≤ xf (n − 1 + x) + (1 − x)f (n + x)
(n + x)! ≤ (n + 1)x × n!
n! ≤ (n + x)−x × (n + x)!
Putting these together, we have upper and lower bounds for (n+x)!:
This gives upper and lower bounds for x!. The chart below shows
the upper and lower bounds for 0 ≤ x ≤ 1 and n = 1 (red), n = 5
(orange), n = 10 (green) and the limit of large n (black):
0.95
0.9
0.85
0.8
0 0.2 0.4 0.6 0.8 1
There is much more that can be said about this function, including
√
1 π
that 2! = 2 , but this will have to wait for another time!