Chap2 Lec1 Coercive Functions and Global Minimizers
Chap2 Lec1 Coercive Functions and Global Minimizers
MAT 419/519
Summer Session 2011-12
Lecture 4 Notes
That is, for any constant M > 0 there exists a constant RM > 0 such that kf (xk > M whenever
kxk > RM .
1
Example The function f (x, y) = x2 + y 2 is coercive, as
2
Example A linear function is never coercive. For instance, a linear function on R2 has the form
f (x, y) = ax + by + c,
for constants a, b and c, and is equal to c along the line defined by the equation ax + by = 0. Since
kxk → ∞ along this line, but f (x, y) = c along this line, f (x, y) is not coercive. 2
Example The function f (x, y) = x2 − 2xy + y 2 is not coercive, as f (x, y) = (x − y)2 , which means
that f (x, y) = 0 on the line y = x, along which kxk becomes infinite. 2
As these examples show, in order for a function to be coercive, it must approach +∞ along any
path within Rn on which kxk becomes infinite.
The following theorem indicates the usefulness of knowing whether a function is coercive.
Theorem Let f (x) be a continuous function defined on all of Rn . If f (x) is coercive, then f (x)
has a global minimizer. Furthermore, if the first partial derivatives of f (x) exist on all of Rn , then
any global minimizers of f (x) can be found among the critical points of f (x).
This theorem can be proved by using the fact that f (x) is coercive to find a compact subset of Rn
on which f (x) must have a global minimizer, by the preceding theorem.
To find the global minimizer of a coercive function f (x), it is sufficient to find the critical points
of f (x), and then evaluate f (x) at each of these points. The critical points for which f (x) assumes
the smallest values are then the global minimizers.
Example Let f (x, y) = x4 − 4xy + y 4 . Then we have
and therefore critical points satisfy y = x3 and x = y 3 . That is, x = x9 , which means x = 0, x = 1
or x = −1. This yields the critical points (0, 0), (1, 1) and (−1, −1). However, as the Hessian is
12x2
−4
Hf (x, y) = ,
−4 12y 2
2
As k(x, y)k → ∞, 4xy/(x4 + y 4 ) → 0, while x4 + y 4 → +∞. It follows that
Therefore, f (x, y) is coercive, and the three critical points are candidates for global minimizers.
Evaluating f (x, y) at (0, 0), (1, 1) and (−1, 1), we obtain
det(A − λI) = 0.
3
It follows that
QA (x) = x · Ax
= xT Ax
= xT P DP T x
= (P T x)T D(P T x)
= yT Dy
Xn
= λi yi2 ,
j=1
where y = P T x and
λ1 0 ··· 0
. .
λ2 . . ..
0
D=
.. .. ..
.
. . . 0
0 · · · 0 λn
This simple form of QA (x) yields the following conditions for classifying A.
Theorem Let A be a symmetric matrix. Then
5. A is indefinite if and only if at least one of its eigenvalues is positive and at least one of its
eigenvalues is negative.
4
This matrix has the characteristic polynomial
2 − λ −4 0
det(Hf (x, y, z) − λI) = det −4 2 − λ 0
0 0 2−λ
= (2 − λ)[(2 − λ)2 − 16]
= (2 − λ)(λ2 − 4λ − 12)
= (2 − λ)(λ + 2)(λ − 6).
Therefore, the eigenvalues are 2, −2 and 6, which means that the Hessian is indefinite. We conclude
that (0, 0, 0) is a saddle point, and there are no global maximizers or minimizers. 2
Exercises
1. Chapter 1, Exercise 8
2. Chapter 1, Exercise 12
3. Chapter 1, Exercise 13
4. Chapter 1, Exercise 14