Jensen
Jensen
Note that the definition of convexity is simply the statement that Jensen’s
inequality holds for two point distributions. We prove Jensen’s inequality for
finite M by induction on the number of elements of M . Suppose M contains
k elements and assume that Jensen’s inequality holds for distributions on
k − 1 points. We now have the following where the fourth line follows from
the induction hypothesis.
Em∼P [f(X(m))]
= p1 f (x1 ) + p2 f (x2 ) + p3 f (x3 ) + · · · + pk f (xk )
! ! !
p1 p2
= (p1 + p2 ) f (x1 ) + f (x2 ) + p3 f (x3 ) + · · · + pk f (xk )
p1 + p2 p1 + p2
! ! !
p1 p2
≤ (p1 + p2 )f x1 + x2 + p3 f (x3 ) + · · · + pk f (xk )
p1 + p2 p1 + p2
! !
p1 x1 p2 x2
≤ f (p1 + p2 ) + + p3 x3 + · · · pk xk
p1 + p2 p1 + p2
= f (p1 x1 + p2 x2 + p3 x3 + · · · pk xk )
= f (Em∼M [X] (m))
1 Problems
1. Which of the following functions are convex. (Hint: compute the second
derivative.)
• x3 (on all reals)