Max-Min Problems in $\mathbb{R}^n$ and the Hessian Matrix: Taylor’s Theorem in $\mathbb{R}^n$
Taylor’s Theorem in $\mathbb{R}^n$
Let $f \in C^2(\mathbb{R}^n)$, where $C^2(\mathbb{R}^n)$ is the set of real-valued functions defined on $\mathbb{R}^n$
having continuous second partial derivatives. The method for solving for local
extreme points of $f$ relies upon Taylor’s Theorem with second degree remainder
terms, which we state here without proof. (In the following theorem, an open
hypersphere centered at $\mathbf{x}_0$ is a set of the form $\{\mathbf{x} \in \mathbb{R}^n \mid \|\mathbf{x} - \mathbf{x}_0\| < r\}$ for some
positive real number $r$.)
THEOREM 1
(Taylor’s Theorem in $\mathbb{R}^n$) Let $A$ be an open hypersphere centered at $\mathbf{x}_0 \in \mathbb{R}^n$,
let $\mathbf{u}$ be a unit vector in $\mathbb{R}^n$, and let $t \in \mathbb{R}$ be such that $\mathbf{x}_0 + t\mathbf{u} \in A$. Suppose
$f: A \to \mathbb{R}$ has continuous second partial derivatives throughout $A$; that is,
$f \in C^2(A)$. Then there is a $c$ with $0 \le c \le t$ such that
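$$f(\mathbf{x}_0 + t\mathbf{u}) = f(\mathbf{x}_0) + \sum_{i=1}^{n} \left.\frac{\partial f}{\partial x_i}\right|_{\mathbf{x}_0} (t u_i) + \frac{1}{2} \sum_{i=1}^{n} \left.\frac{\partial^2 f}{\partial x_i^2}\right|_{\mathbf{x}_0 + c\mathbf{u}} (t^2 u_i^2) + \sum_{i=1}^{n} \sum_{j=i+1}^{n} \left.\frac{\partial^2 f}{\partial x_i\,\partial x_j}\right|_{\mathbf{x}_0 + c\mathbf{u}} (t^2 u_i u_j).$$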
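in this expression is called the Hessian matrix for $f$. Thus, in the $\mathbb{R}^2$ case, with
$\mathbf{v} = t\mathbf{u}$, the formula in Taylor’s Theorem can be written as
$$f(\mathbf{x}_0 + \mathbf{v}) = f(\mathbf{x}_0) + \left.\nabla f\right|_{\mathbf{x}_0} \cdot \mathbf{v} + \frac{1}{2}\, \mathbf{v}^T \left.\mathbf{H}\right|_{\mathbf{x}_0 + k\mathbf{v}} \mathbf{v},$$
for some $k$ with $0 \le k \le 1$.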
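As a rough numerical illustration of this matrix form of Taylor’s Theorem, the following Python sketch solves for the value of $k$ that makes the formula hold exactly. The function $f(x, y) = x^3 + xy^2$, the point $\mathbf{x}_0 = [1, 2]$, and the step $\mathbf{v} = [0.5, -0.25]$ are assumptions chosen only for illustration; they do not come from the text. Because this particular $f$ is a cubic, the quadratic form evaluated at $\mathbf{x}_0 + k\mathbf{v}$ is linear in $k$, so $k$ can be recovered from two evaluations.

```python
# Illustrative sketch (hypothetical f, x0, and v; not taken from the text).

def f(x, y):
    return x**3 + x * y**2

def grad(x, y):
    # [df/dx, df/dy]
    return [3 * x**2 + y**2, 2 * x * y]

def hessian(x, y):
    # Matrix of second partial derivatives of f
    return [[6 * x, 2 * y],
            [2 * y, 2 * x]]

def quad_form(H, v):
    # Computes v^T H v for a 2 x 2 matrix H
    return sum(v[i] * H[i][j] * v[j] for i in range(2) for j in range(2))

def taylor_gap(x0, v, k):
    """Return f(x0) + grad(x0).v + (1/2) v^T H(x0 + k v) v - f(x0 + v)."""
    px, py = x0[0] + k * v[0], x0[1] + k * v[1]
    g = grad(*x0)
    model = f(*x0) + g[0] * v[0] + g[1] * v[1] + 0.5 * quad_form(hessian(px, py), v)
    return model - f(x0[0] + v[0], x0[1] + v[1])

x0 = [1.0, 2.0]
v = [0.5, -0.25]

# The gap is linear in k for a cubic f, so interpolate between k = 0 and k = 1.
g0 = taylor_gap(x0, v, 0.0)
g1 = taylor_gap(x0, v, 1.0)
k = -g0 / (g1 - g0)
print(k)  # approximately 1/3 for this cubic
```

For a general function in $C^2(A)$ the gap need not be linear in $k$, so this shortcut is specific to the cubic chosen here; the theorem only guarantees that some such $k$ exists.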
Critical Points
If $A$ is a subset of $\mathbb{R}^n$, then we say that $f: A \to \mathbb{R}$ has a local maximum at a point
$\mathbf{x}_0 \in A$ if and only if there is an open neighborhood $U$ of $\mathbf{x}_0$ such that $f(\mathbf{x}_0) \ge f(\mathbf{x})$
for all $\mathbf{x} \in U$. A local minimum for a function $f$ is defined analogously.
THEOREM 2
Let $A$ be an open hypersphere centered at $\mathbf{x}_0 \in \mathbb{R}^n$, and let $f: A \to \mathbb{R}$ have
continuous first partial derivatives on $A$. If $f$ has a local maximum or a local
minimum at $\mathbf{x}_0$, then $\nabla f(\mathbf{x}_0) = \mathbf{0}$.
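Proof If $\mathbf{x}_0$ is a local maximum, then $f(\mathbf{x}_0 + h\mathbf{e}_i) - f(\mathbf{x}_0) \le 0$ for small $h$. Then,
$\lim_{h \to 0^+} \frac{f(\mathbf{x}_0 + h\mathbf{e}_i) - f(\mathbf{x}_0)}{h} \le 0$. Similarly, $\lim_{h \to 0^-} \frac{f(\mathbf{x}_0 + h\mathbf{e}_i) - f(\mathbf{x}_0)}{h} \ge 0$. Hence, for
the limit to exist, we must have $\left.\frac{\partial f}{\partial x_i}\right|_{\mathbf{x}_0} = 0$. Since this is true for each $i$, $\left.\nabla f\right|_{\mathbf{x}_0} = \mathbf{0}$.
A similar proof works for local minimums. QED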
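Theorem 2 can be checked numerically on the function from Example 3 of this section, whose critical point is $[0, 2]$. The Python sketch below approximates each partial derivative with a central difference; the helper name `numerical_gradient` and the step size `h` are assumptions made for illustration, not part of the text.

```python
def f(x, y):
    # Function from Example 3 of this section
    return 2 * x**2 - 2 * x**2 * y**2 + 2 * y**2 + 24 * y - x**4 - y**4

def numerical_gradient(func, point, h=1e-5):
    """Approximate the gradient of func at point by central differences."""
    grad = []
    for i in range(len(point)):
        forward = list(point)
        backward = list(point)
        forward[i] += h
        backward[i] -= h
        grad.append((func(*forward) - func(*backward)) / (2 * h))
    return grad

# At the critical point both components should be close to zero.
g_crit = numerical_gradient(f, [0.0, 2.0])
# At a non-critical point the gradient is visibly nonzero.
g_other = numerical_gradient(f, [1.0, 1.0])
print(g_crit)
print(g_other)
```

A vanishing gradient is only a necessary condition: it locates candidates for local extreme points but does not by itself decide whether a candidate is a maximum, a minimum, or neither.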
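$$f(\mathbf{x}_0 + \mathbf{v}) = f(\mathbf{x}_0) + \frac{1}{2}\, \mathbf{v}^T \left.\mathbf{H}\right|_{\mathbf{x}_0 + k\mathbf{v}} \mathbf{v}.$$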
THEOREM 3
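Given the conditions of Taylor’s Theorem for a set $A$ and a function $f: A \to \mathbb{R}$,
$f$ has a local minimum at a critical point $\mathbf{x}_0$ if $\mathbf{v}^T \left.\mathbf{H}\right|_{\mathbf{x}_0} \mathbf{v} > 0$ for all
nonzero vectors $\mathbf{v}$. Similarly, $f$ has a local maximum at a critical point $\mathbf{x}_0$ if
$\mathbf{v}^T \left.\mathbf{H}\right|_{\mathbf{x}_0} \mathbf{v} < 0$ for all nonzero vectors $\mathbf{v}$.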
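$\mathbf{v}^T\mathbf{H}\mathbf{v} > 0$ for all nonzero $\mathbf{v}$ if and only if $\mathbf{w}^T\mathbf{D}\mathbf{w} > 0$ for all nonzero $\mathbf{w}$. Now, $\mathbf{D}$
is diagonal, and so $\mathbf{w}^T\mathbf{D}\mathbf{w} = d_{11}w_1^2 + d_{22}w_2^2 + \cdots + d_{nn}w_n^2$. But the $d_{ii}$’s are the
eigenvalues of $\mathbf{H}$. Thus, it follows that $\mathbf{w}^T\mathbf{D}\mathbf{w} > 0$ for all nonzero $\mathbf{w}$ if and only if
all of these eigenvalues are positive. (Set $\mathbf{w} = \mathbf{e}_i$ for each $i$ to prove the “only if”
part of this statement.) Similarly, $\mathbf{w}^T\mathbf{D}\mathbf{w} < 0$ for all nonzero $\mathbf{w}$ if and only if all
of these eigenvalues are negative. Hence,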
THEOREM 4
A symmetric matrix $\mathbf{A}$ defines a positive definite quadratic form $\mathbf{v}^T\mathbf{A}\mathbf{v}$ if and
only if all of the eigenvalues of $\mathbf{A}$ are positive. A symmetric matrix $\mathbf{A}$ defines a
negative definite quadratic form $\mathbf{v}^T\mathbf{A}\mathbf{v}$ if and only if all of the eigenvalues of $\mathbf{A}$
are negative.
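For a symmetric $2 \times 2$ matrix, the eigenvalue test in Theorem 4 can be carried out directly from the characteristic polynomial. The following Python sketch shows one way to do this. The function names, the label "neither", and the sample matrices $\begin{bmatrix} 2 & 1 \\ 1 & 2 \end{bmatrix}$ and $\begin{bmatrix} 1 & 2 \\ 2 & 1 \end{bmatrix}$ are assumptions for illustration; the remaining matrix is the Hessian found at $[0, 2]$ in Example 3 of this section.

```python
import math

def symmetric_2x2_eigenvalues(a, b, c):
    """Eigenvalues of [[a, b], [b, c]]: the roots of x^2 - (a + c)x + (ac - b^2)."""
    trace = a + c
    # The discriminant trace^2 - 4*det simplifies to (a - c)^2 + 4b^2, which is never negative.
    root = math.sqrt((a - c) ** 2 + 4 * b ** 2)
    return ((trace - root) / 2, (trace + root) / 2)

def classify_quadratic_form(a, b, c):
    """Apply the eigenvalue test of Theorem 4 to [[a, b], [b, c]]."""
    smallest, largest = symmetric_2x2_eigenvalues(a, b, c)
    if smallest > 0:
        return "positive definite"
    if largest < 0:
        return "negative definite"
    return "neither"

print(classify_quadratic_form(2, 1, 2))      # hypothetical matrix [[2, 1], [1, 2]]
print(classify_quadratic_form(-12, 0, -44))  # Hessian from Example 3
print(classify_quadratic_form(1, 2, 1))      # hypothetical matrix [[1, 2], [2, 1]]
```

The "neither" label lumps together every case that Theorem 4 does not cover, including matrices with a zero eigenvalue, for which this test gives no conclusion.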
But $p_{\mathbf{H}}(x) = x^2 - 28x + 160$, which has roots $x = 8$ and $x = 20$. Thus, $\mathbf{H}$ has all
eigenvalues positive, and hence, $\mathbf{v}^T\mathbf{H}\mathbf{v}$ is positive definite. Theorem 4 then tells us
that $\mathbf{x}_0 = [-1, 2]$ is a local minimum for $f$.
Example 3 Suppose $f(x, y) = 2x^2 - 2x^2y^2 + 2y^2 + 24y - x^4 - y^4$. First, we look for critical
points by solving the system
$$\begin{cases} \dfrac{\partial f}{\partial x} = 4x - 4xy^2 - 4x^3 = 4x\left(1 - (y^2 + x^2)\right) = 0 \\[1ex] \dfrac{\partial f}{\partial y} = -4x^2y + 4y + 24 - 4y^3 = -4y(x^2 + y^2) + 4y + 24 = 0 \end{cases}$$
Now $\frac{\partial f}{\partial x} = 0$ yields $x = 0$ or $y^2 + x^2 = 1$. If $x = 0$, then $\frac{\partial f}{\partial y} = 0$ gives $4y + 24 - 4y^3 = 0$.
The unique real solution to this equation is $y = 2$. Thus, $[0, 2]$ is a critical point.
If $x \ne 0$, then $y^2 + x^2 = 1$. From $\frac{\partial f}{\partial y} = 0$, we have $0 = -4y(1) + 4y + 24 = 24$,
a contradiction, so there is no critical point when $x \ne 0$.
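Next, we compute the Hessian matrix at the critical point $[0, 2]$.
$$\mathbf{H} = \left.\begin{bmatrix} \dfrac{\partial^2 f}{\partial x^2} & \dfrac{\partial^2 f}{\partial x\,\partial y} \\[1ex] \dfrac{\partial^2 f}{\partial y\,\partial x} & \dfrac{\partial^2 f}{\partial y^2} \end{bmatrix}\right|_{[0,2]} = \left.\begin{bmatrix} 4 - 4y^2 - 12x^2 & -8xy \\ -8xy & -4x^2 + 4 - 12y^2 \end{bmatrix}\right|_{[0,2]} = \begin{bmatrix} -12 & 0 \\ 0 & -44 \end{bmatrix}.$$
Since the $(1, 1)$ entry is negative and $|\mathbf{H}| > 0$, $\mathbf{H}$ defines a negative definite quadratic
form, and so $f$ has a local maximum at $[0, 2]$.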
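The computation in Example 3 can be double-checked with a short Python sketch. It evaluates the second partial derivatives of $f$ at $[0, 2]$, applies the test used in the example (negative $(1, 1)$ entry and positive determinant), and then compares $f(0, 2)$ with the values of $f$ at a few nearby points. The offsets are arbitrary choices made for illustration, and sampling a handful of points is only a sanity check, not a proof.

```python
def f(x, y):
    # Function from Example 3
    return 2 * x**2 - 2 * x**2 * y**2 + 2 * y**2 + 24 * y - x**4 - y**4

def hessian(x, y):
    # Second partial derivatives of f, as computed in Example 3
    fxx = 4 - 4 * y**2 - 12 * x**2
    fxy = -8 * x * y
    fyy = -4 * x**2 + 4 - 12 * y**2
    return [[fxx, fxy], [fxy, fyy]]

H = hessian(0, 2)
det = H[0][0] * H[1][1] - H[0][1] * H[1][0]

# Test used in the example: (1,1) entry negative and determinant positive.
is_negative_definite = H[0][0] < 0 and det > 0

# Sanity check: f should be smaller at nearby points than at [0, 2].
offsets = [(0.1, 0.0), (-0.1, 0.0), (0.0, 0.1), (0.0, -0.1), (0.1, 0.1), (-0.1, -0.1)]
nearby_lower = all(f(0 + dx, 2 + dy) < f(0, 2) for dx, dy in offsets)

print(H, det, is_negative_definite, nearby_lower)
```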
An Example in $\mathbb{R}^3$
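Using row reduction to solve this linear system yields the unique critical point
$[-9, 12, 16]$. The Hessian matrix at $[-9, 12, 16]$ is
$$\mathbf{H} = \left.\begin{bmatrix} \dfrac{\partial^2 g}{\partial x^2} & \dfrac{\partial^2 g}{\partial x\,\partial y} & \dfrac{\partial^2 g}{\partial x\,\partial z} \\[1ex] \dfrac{\partial^2 g}{\partial y\,\partial x} & \dfrac{\partial^2 g}{\partial y^2} & \dfrac{\partial^2 g}{\partial y\,\partial z} \\[1ex] \dfrac{\partial^2 g}{\partial z\,\partial x} & \dfrac{\partial^2 g}{\partial z\,\partial y} & \dfrac{\partial^2 g}{\partial z^2} \end{bmatrix}\right|_{[-9,12,16]} = \begin{bmatrix} 10 & 4 & 2 \\ 4 & 10 & -6 \\ 2 & -6 & 6 \end{bmatrix}.$$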
New Vocabulary
$C^2(\mathbb{R}^n)$ (functions from $\mathbb{R}^n$ to $\mathbb{R}$ having continuous second partial derivatives)
critical point (of a function)
gradient (of a function on $\mathbb{R}^n$)
Hessian matrix
local maximum (of a function on $\mathbb{R}^n$)
local minimum (of a function on $\mathbb{R}^n$)
negative definite quadratic form
Highlights
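The gradient of a function $f: \mathbb{R}^n \to \mathbb{R}$ is defined by $\nabla f = \left[\frac{\partial f}{\partial x_1}, \frac{\partial f}{\partial x_2}, \ldots, \frac{\partial f}{\partial x_n}\right]$.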
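$$f(\mathbf{x}_0 + t\mathbf{u}) = f(\mathbf{x}_0) + \sum_{i=1}^{n} \left.\frac{\partial f}{\partial x_i}\right|_{\mathbf{x}_0} (t u_i) + \frac{1}{2} \sum_{i=1}^{n} \left.\frac{\partial^2 f}{\partial x_i^2}\right|_{\mathbf{x}_0 + c\mathbf{u}} (t^2 u_i^2) + \sum_{i=1}^{n} \sum_{j=i+1}^{n} \left.\frac{\partial^2 f}{\partial x_i\,\partial x_j}\right|_{\mathbf{x}_0 + c\mathbf{u}} (t^2 u_i u_j).$$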
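In particular, we have $f(\mathbf{x}_0 + \mathbf{v}) = f(\mathbf{x}_0) + \left.\nabla f\right|_{\mathbf{x}_0} \cdot \mathbf{v} + \frac{1}{2}\, \mathbf{v}^T \left.\mathbf{H}\right|_{\mathbf{x}_0 + k\mathbf{v}} \mathbf{v}$,
for some $k$ with $0 \le k \le 1$.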
Let $A$ be an open hypersphere centered at $\mathbf{x}_0 \in \mathbb{R}^n$. If $f: A \to \mathbb{R}$ has continuous
second partial derivatives throughout $A$, then $f$ has a local
minimum at a critical point $\mathbf{x}_0$ if $\mathbf{v}^T \left.\mathbf{H}\right|_{\mathbf{x}_0} \mathbf{v} > 0$ for all nonzero vectors $\mathbf{v}$.
EXERCISES
1. In each part, solve for all critical points for the given function. Then, for
each critical point, use the Hessian matrix to determine whether the critical
point is a local maximum, a local minimum, or neither.
★ a) $f(x, y) = x^3 + x^2 + 2xy - 3x + y^2$
b) $f(x, y) = 6x^2 + 4xy + 3y^2 + 8x - 9y$
★ c) $f(x, y) = 2x^2 + 2xy + 2x + y^2 - 2y + 5$
d) $f(x, y) = x^3 + 3x^2y - x^2 + 3xy^2 + 2xy - 3x + y^3 - y^2 - 3y$ (Hint: To
solve for critical points, first set $\frac{\partial f}{\partial x} - \frac{\partial f}{\partial y} = 0$.)
a) Show that $f(x, y) = (x - 2)^4 + (y - 3)^2$ has a local minimum at $[2, 3]$,
but its Hessian matrix at $[2, 3]$ has 0 as an eigenvalue.
b) Show that $f(x, y) = -(x - 2)^4 + (y - 3)^2$ has a critical point at $[2, 3]$, its
Hessian matrix at $[2, 3]$ has all nonnegative eigenvalues, but $[2, 3]$ is not
a local extreme point for $f$.
c) Show that $f(x, y) = -(x + 1)^4 - (y + 2)^4$ has a local maximum at $[-1, -2]$,
but its Hessian matrix at $[-1, -2]$ is $\mathbf{O}$ and thus has all of its eigenvalues
equal to zero.
d) Show that $f(x, y, z) = (x - 1)^2 - (y - 2)^2 + (z - 3)^4$ does not have any local
extreme points. Then verify that its Hessian matrix has eigenvalues of
opposite sign at the function’s only critical point.
a) Prove that a symmetric $2 \times 2$ matrix $\mathbf{A} = \begin{bmatrix} a & b \\ b & c \end{bmatrix}$ defines a positive
definite quadratic form if and only if $a > 0$ and $|\mathbf{A}| > 0$. (Hint: Compute
$p_{\mathbf{A}}(x)$ and show that both roots are positive if and only if $a > 0$ and
$|\mathbf{A}| > 0$.)
b) Prove that a symmetric $2 \times 2$ matrix $\mathbf{A}$ defines a negative definite
quadratic form if and only if $a_{11} < 0$ and $|\mathbf{A}| > 0$.
★ 2. True or False: