Wendell H. Fleming - Functions of Several Variables-ADDISON-WESLEY (1965)
Wendell H. Fleming - Functions of Several Variables-ADDISON-WESLEY (1965)
39001006923919
seis
Ah
ie
Functions of
Several Variables
This book is in the
WENDELL H. FLEMING
Brown University
To Brown UNIVERSITY
1764-1964
Digitized by the Internet Archive
in 2022 with funding from
Kahle/Austin Foundation
httos://archive.org/details/functionsofseverO0O00wend
Preface
- on
(reed) Care (G0 Brin it O im ou Ih Se :
a! ees (27 ever hi Mite Mi iy weoth sare Ra . pee
pein elise ow ayaa tarts SH
il, Weasel) hues emcees pOrepirn marr =j
ie? eleicee | incient’ lata @. O48 sete Pet
MAN ei) kee JOG t ; iohant tient yee .
a — eee Ww a ¥ : ’ a” ‘ sana a
, ~ - : j : 4 9 a oe ee | <a
e
a ' aif ;
eS od| ;
| 7
— —
,
ve i
Contents
i= 1 Euclidean E£”
te 2 Sets, functions .
ibs 3 Linear functions
1 —4 Convex sets .
ths 5 Convex and concave faetions
ce1 6 Noneuclidean norms
3-1 Derivatives .
3-2 Curves in E”
3-3 Line integrals
*3-4 Gradient method
4-1 Transformations
4-2 Linear and affine Prennformations
4-3 Differentiable transformations
4-4 Composition. ,
4-5 The inverse function Ecorert
4-6 The implicit function theorem
4-7 Manifolds
4-8 The multiplier rule
INTEGRATION
5-1 Intervals
5-2 Measure .
5-3 Integrals over En
Contents
INTEGRATION ON MANIFOLDS
APPENDIX
1-1 EUCLIDEAN E”
While calculus has been motivated in large part by problems from geom-
etry and physics, its foundations rest upon the idea of number. Therefore a
thorough treatment of calculus should begin with a study of the real numbers.
The real number system satisfies a list of axioms about arithmetic and order,
which express properties of numbers with which everyone is familiar from
elementary mathematics. To be more precise, the real number system is what
is called in algebra an ordered field. To this list of axioms must be added one
further axiom which expresses the completeness of the real number system.
The completeness axiom can be introduced in several different forms. Of these
we shall take the property that any nonempty set of real numbers which is
bounded above has a least upper bound. This axiom is more subtle than the
1
2 Euclidean Spaces, Convexity 1-1
others and is the foundation stone for some of the most important theorems in
calculus. The axioms for the real number system are listed in Section 1 of the
Appendix.
Scalars and vectors. By scalar we shall mean a real number. In ele-
mentary mathematics a vector is described as a quantity which has both diree-
tion and length. Vectors are illustrated by drawing arrows issuing from a given
point 0. The point at the head of the arrow specifies the vector. Therefore we
may (and shall) say that this point 7s the vector. Thus in two dimensions a
vector is just a point (x, y) of the plane E?. Vectors in E® are added by the
parallelogram law, which amounts to adding corresponding components. Thus
With these definitions £” satisfies the axioms for a vector space (Appendix A-2).
The term “vector” will be reserved for elements of E” rather than those of any
space satisfying these axioms. F
_ The superscripts should not be confused with powers of x. For instance,
(x*)? means the square of the 7th entry 2x‘ of the n-tuple (x!,..., 2”).
If n = 1 we identify the i-tuple x = (x) with the scalar x. In this ease
addition and scalar multiplication reduce to ordinary addition and multipli-
cation of real numbers. If nm = 2 or 3 we usually write (zx, y) or (z, y, 2) as is
commonly done in elementary analytic geometry, rather than (x!,2?) or
(x', x*, 2°). Practically all of the theorems will be stated and proved for arbi-
trary dimension n. However, the special cases n = 2, 3 will frequently appear
in the examples and homework problems.
1-1 Euclidean E” 3
vec \ Pees ya ey
fat
The vector space E” with this inner product is called euclidean n-space. Other
inner products in #” will be considered later in Section 1-6.
The euclidean norm (or length) of a vector x is
ba) See
It is positive except when x = 0, and satisfies the following two important
inequalities: For every x, y € EH”,
*The problem number, for example, Problem 2, refers to the end of the section in
which it is cited, unless stated otherwise.
4 Euclidean Spaces, Convexity 1-1
Ix-y| = |x| |y| ¢f and only if x is a scalar multiple of y. If x-y = |x| ly|, then
x is a nonnegative scalar multiple of y (and conversely).
Proof of (1-2). We write, as before, Se
(aghe
Y)ato Vie Xue iah2X1SR GY ys
From Cauchy’s inequality,
yl alae) 2
This is equivalent to the triangle inequality. J
ly—2|
x
lx—y| y
0
FicureE 1-1 Figure 1-2
[Xatd ZS XG RY aly eZ
which justifies the name “triangle inequality.” See Fig. 1-1.
If x and y are nonzero vectors, the angle 6 between x and y is defined
by the formula
60s Oyu O<@0 5
[x| ly|
1-1 Euclidean E” 5
ni og fe:
ogee L eeL DGS AW 6 ale
LA ge
The symbol 6;; was first introduced by the mathematician Kronecker, and
consequently is called “Kronecker’s delta.” The unit coordinate vectors
ey = (ls O aaa. Oe
1p
Th osalb
form the standard orthonormal basis for H”. We have, for each x € EH”,
n
Taking the inner product of each side with v; and using the formula v;-v; =
6;;, we obtain
x:V; = Ge
The coefficients c’ in (1-4) are just the components of x with respect to the
orthonormal basis vectors.
6 Euclidean Spaces, Convexity 1-2
PROBLEMS
1. Let n = 4, x = e1 — + 2e4 =
e2 2), y= 3e1 —
(1, —1,0, e2 + e3 + 4 =
(3,—1,1,1). Find x+y, x—y, [x+ yl, Ix — yl, |x|, lyl, x-y. Verify (1-1)
and (1-oe in this example.
2. Prove that the standard euclidean inner prone in E” has the following five
properties:
(a) x-y = y°x. (b) &+y)-2=x-z+y-z.
(OVI CX) Vu _c(X“
y)- (d) O-x = 0. (@) seooe SO th be 2 ©
3. Using Problem 2, show that
(w+
cx)- (y+ dz) =w:y+cx:y+dw-z-+ cdx:z.
4. Show that 2|x|?+ 2ly|? = |x+ y|?-+ |x — y|?. What does this say about
parallelograms? See Fig. 1-3.
Figure 1-3
5. Show that |x -+ y| |x — y| < |x|? + |y|? with equality if and only if x-y = 0.
What does this say about parallelograms?
6. Prove (1-3), using (1-2) and induction on m.
7. Let n = 4, and v1 = 2(3e1 + 4e3), vo = $(4e2 — 3e4), v3 = (/2/10)(—4e;, +
3e2 + 3e3 + 4e4). Show that vi, ve, v3 are mutually orthogonal unit vectors.
Find a unit vector v4 such that vi, v2, v3, v4 form an orthonormal basis for E?.
8. Show that the distance between any two elements of an orthonormal basis for
Bris s/ 2.
9. (Gram-Schmidt process.) Let {x1,...,Xn} be a basis for EH”. Let v1 = |x1|~1x1,
y2 = X2 — (2-vi)v1, V2 = ly2|~"y2, ys = x3 — (K3+ v1)v1 — (x3-V2)v2, V3 =
lys|—tys,..-,Vn = l|yn|—lyn. Show that {vi,...,v,} is an orthonormal basis
for E”,
Note: In this book, “Show that...” and “Prove that...” both mean “give a
valid mathematical proof.”
€, Z, BPR ar
are
stand, respectively, for 2s an element of, is not an element of, union, intersection,
difference, and inclusion. Sets will ordinarily be denoted by capital italicized
]-2 Sets, Functions 7h
letters. A set will be described either by listing its elements or by some property
characterizing them. Thus {2, 5,7} is the set whose elements are the three
numbers 2, 5, and 7. If S is a set and 7 a property pertaining to elements of S,
then {p © S:7} denotes the set of all p © S with property 7. For example,
{(x, y) € BE? : x? + y® = 1} is the circle with center (0,0) and radius 1. The
set {(x,y) € E?: 27 + y? = —1} is the empty set. The set {(z, y) € E?:
x? + y” > 0} is all of E?.
When the set S in question is clear from the context, we write simply
oem
Cartesian product sets. If S and 7 are sets, then the cartesian product set
S x T is formed by taking all ordered pairs (p,q) where p € S and qe T.
For example, if S = {1,2,...,n} and T = {1,2,...,m}, then the elements
of S X T are pairs (7,7) of positive integers with 1 <1 <n,t< 7 < m. In
the same way, the plane E£? is the cartesian product H' x E’.
If S,,...,S, are sets, then the n-fold cartesian product Sy X +++ X Sy 1s
formed by taking all (ordered) n-tuples (p1,..., Dn), where p; € S; for each
4 =1,...,n. In particularE” = E1 x --- X E’.
Functions. A function f assigns to each element p of some set S an element
f(p) of another set 7. The element f(p) is called the value of f at p.
This is not a satisfactory definition of “function” because of the ambiguity
of the word “assigns.” A more careful definition is the following. Any subset
f of the cartesian product S x T is called a relation between S and T. A rela-
tion f is called a function if for every p © S there is exactly one g € T' such
that (p,q) € f. This element q is denoted by f(p).
The set S is the domain of f. We shall sometimes say that f is a function
from S into T. If for every g € T there is some p € S such that q = f(p),
Euclidean Spaces, Convexity 1-2
8
TA — Ap i(p)) pe At.
For instance, if a real-valued function f is integrated over an interval J c EZ’,
then it is only f|J which is important. The values of f outside J do not affect
the integral.
1-3 Linear Functions a
for every m,X1,.--,Xm © E”, and scalars c',...,c”. In words this states
that “L of a linear combination of x,,...,Xm is the corresponding linear
combination of L(x,),..., L(xn).”
If a,,..., 0, are real numbers, then the function L defined by
a; = L(e,), t= 1,..
Definition. Leta ~ 0. A closed half-space is a set of the form {x:a-x > c},
and an open half-space is a set of the form {x:a-x > c}.
SA = {x:a-x>0}
— Figures 1-4
P= {x:ax=c}
See Fig. 1-4. A set H of the form {x:a-x < c},a ¥ 0, is a closed half-
space, since H is also {x:(—a)-x > —c}. The same remark applies to
open half-spaces. A hyperplane P = {x:a-x = c}divides E” into two half-
spaces. More precisely, H” — P is the union of the open half-spaces
The dual space of E”. Let us now give a more thorough description of the
space of covectors, dual to H”. The reader may postpone this discussion and
study it together with later chapters.
According to Section A—2 every vector space U has a dual 0*, whose ele-
ments are the real-valued linear functions with domain 0. If VU has finite
dimension n, then U* also has dimension n. Moreover, given any basis for U,
there is a dual basis for 0*.
Now let 0 = £”.
The functions X!,..., X” are called the standard cartesian coordinate functions.
In order to emphasize the duality between vectors and covectors, it is
desirable to change the notation for covector. From now on we shall ordinarily
denote covectors by a, b,... rather than by, say, Z. As in Proposition 1, we
write a-x for L(x), and call a-x the scalar product. The basis dual to
fe,,...,@n} will be denoted by {e',...,e”} rather than {X’,..., X”}.
The notation is chosen so that for every formula about vectors there will
be a corresponding formula about covectors obtained by interchanging sub-
scripts and superscripts. For instance, the components of a vector x satisfy
xz’ = X'(x) = e’-x. The corresponding formula for the components of a
covector a is a; = a-e;. In (#")* a euclidean inner product and norm are
defined in the same way as in E”.
These facts are summarized in the table below.
Vectors Covectors
@ .
In the table, 5; = 6,; is Kronecker’s delta (p. 5). The scalar product
of a vector and a covector involves only the vector space structure of ZH” and
its dual. It does not depend on the fact that we chose the euclidean inner
product rather than some other inner product. If H” is given a noneuclidean
inner product, then the appropriate formula for changing covectors into vec-
tors is (1-14b) in Section 1-6.
_ Qne important fact about vectors and covectors is that their components
x’ and a; change oppositely with respect to linear transformations. This will
be seen in Section 4-2.
1-4 Convex Sets 13
PROBLEMS
1. Let n = 3. Find the plane which contains the three points e1, eg, and e3 — 3e).
Sketch its intersection with the first octant in E°.
2. (a) Find the hyperplane in #* containing the four points 0, e, + e2, eg — eg +
2e3, 3e4 — eo.
(b) Find the value of ¢ for which t(e; — e2) + (1 — Z)eg is in this hyperplane.
oo. Prove that any hyperplane is a closed set.
4. Prove that any linear function is continuous. [Hint: |ja-x —a-y| = |a-(x—y)|.
Use Cauchy’s inequality.]
5. Let {x1,...,Xn} be a basis for H”. Define L by the formula
eX ite, me ei CeXn— CP
for every c!,...,c”. Show that:
(a) ZL is a linear function.
(b) The set P = {x: L(x) = 0} is a hyperplane containing 0, xi,..., Xn—1.
(ec) P is the only hyperplane containing these n points.
6. Let xo, X1,...,Xnz—1 be such that x1 — xo0,...,Xn—1 — Xo are linearly inde-
pendent. Prove that there is exactly one hyperplane containing xo, Xi1,...,Xn—1.
where [a, b] denotes the set of real numbers ¢ such thata < t < 6 (Section A-1).
For example, if t = 4, then x is the midpoint of the line segment joining
x, and x.(Fig. 1-5). The points corresponding to t = 3, 2 trisect the line
segment.
14 Euclidean Spaces, Convexity 1-4
Definition. Let K C E”. Then K is a convex set if the line segment joining
any two points of K is contained in K (Fig. 1-6).
a a
x, Xx XK
E” itself is a convex set. The empty set and sets with just one point trivially
satisfy the definition; hence they are convex. The reader should be able to
think of several kinds of geometric objects such as lines, planes, spherical balls,
regular solids, and so on, which appear to be convex sets. However, geometric
intuition is not always a reliable guide, especially in four or more dimensions.
In any case, intuition is no substitute for a proof that the set in question is
actually convex.
The convex subsets of H” have many remarkable geometric properties.
There is an extensive mathematical literature devoted to them [7],* [10], [13].
However, in the present section we shall go no further than to obtain a few
basic facts about convex sets which are useful in calculus. The main result
(Theorem 1) will be the characterization of closed convex sets as intersections
of closed half-spaces.
The definition of convex set makes sense in any vector space. During
recent years the study of convexity in infinite-dimensional vector spaces has
furnished powerful new tools in such diverse branches of mathematical analysis
as complex function theory, differential equations, and calculus of variations.
Let us consider some familiar subsets of HE” and prove that they are convex.
To show that a set K is convex directly from the definition, we must verify
that for every x,, Xo © K and ¢ € (0, 1], the point x = tx, + (1 — £)x2 also
belongs to K. In the definition, we assumed that x; ¥ xX. But if xy = Xo, it
is trivial that x € K, since x = x, = Xp.
Example 1. Any closed half-space is a convex set. Let H = {x:a-x> ct},
a ~ 0. Let x1, x2 © H and x = tx; + (1 — #)xe, where t € [0,1]. Thena-x; > c
and a-:x2 > c. Since ¢ > 0, ta-x1 > tc; and since 1 —t > 0, (1 — t)a-xo >
(1 — #)c. Consequently,
Given y € fr B, let
Xeic Xa oe be
Py = 4x2y>x = 1}, Figure 1-7
so that Py is the hyperplane bounding Hy. By Cauchy’s inequality and the fact
ony Saale
VOR WINS Shah
Equality holds if and only if x is a positive scalar multiple of y. Hence BC Hy
and B 1 Py consists only of y. The supporting hyperplane to B at y is Py (Fig. 1-8).
Again let K be any nonempty, closed convex set which is a proper subset
of KE” (K # EH”). Let 3Cx denote the collection of all closed half-spaces H such
that K CH and the hyperplane P bounding H is supporting for K. For
instance, the collection 3Cg in the above example consists of the various half-
spaces H, for all possible choices of y € fr B.
The notation
(a
HERK
stands for the intersection of all half-spaces H € 3Cx (see Section A-3).
Theorem 1. K = O) H.
HERR
eS |S
YU ,
Convex Sets 17
]/ ii
Z,
Figure 1-8 Figure 1-9
Example 6. Let f be a real-valued function with domain H!, which has everywhere a
derivative f’(x). Assume that f’ is an increasing function. Let
A = {@,y):y = f@)}.
For each s consider the closed half-space H, above the tangent line to f at (s, f(s)),
H, = {(2,y):y — f(s) > f’(s)\(@ — s)}. (See Fig. 1-10.) Consider (a, y) € A and
suppose first that x > s. By the mean value theorem, f(x) — f(s) = f’()(@ — 8),
where s < t < x. Since f’ is increasing, f’(t) > f’(s). Hence
x, y)
// ye
ss (s, f(s))
Figure 1-10
A sufficient condition that f’ be an increasing function is that f(x) > @ for every
z. More generally, f’ is increasing if f’’(z) > 0 for every x and each point where
f(x) = 0 is isolated.
ie hs is >0 forj=1,...,m
j=1 j=l
If ¢+* = 1, then ? = 0 forj < mand x = Xm+1 isin K. If t”*! < 1, let
Shea eae eee MOL pie Jt 1,
m .
y= De SX
j=l
Then y is a convex combination of x1,...,Xm. By the induction hypothesis,
y € K. But
2.5 ly = (1 co t)Xm+1)
(u, v) *1
of the vertices Xo, X1,... , X, (Problem 6). The numbers 2°, ¢',... , ¢” are called
the barycentric coordinates of x. The (r — 1)-dimensional face opposite the
vertex x; is the set of points of the r-simplex with ¢' = 0.
For example, the vertices xo, X1, X2 of a triangle have barycentric coordi-
nates (1, 0,0), (0, 1,0), (0, 0, 1), respectively. The midpoint of the face oppo-
site Xo has barycentric coordinates (0, $, 3). The interior points of the triangle
have barycentric coordinates (¢°, ¢', t”), all of which are strictly positive. In
each case 1? + ¢} + #2? = 1.
The simplex with vertices 0,e;,...,¢€, is called the standard n-simplez.
It will be denoted by 2, and will be of use later in Section 5-7 in connection with
integration. The barycentric coordinates (¢°, t’,...,¢”) of a point x € = are
wivemby G22 tor? = bo...te) = 1i— (ess 4%),
*Further results about convex combinations. In the definition of convex
combination, no upper bound was put on the positive integer m. However,
for most purposes one need consider only m < n+ 1. More precisely:
Proposition. Jf S C E” and x is a convex combination of points of S, then
X 1s a convex combination of n + 1 or fewer points of S.
PROBLEMS
“I . Prove that a supporting hyperplane for a closed convex set A can contain no
interior point of K.
. Let y be any boundary point of a closed convex set AK. Show that A has a sup-
porting hyperplane P which contains y. [Hint: Let {y} be a sequence of points
exterior to AK such that y» tends to yas m — ©. Let x» be a point of K nearest
tO Ym and Um = (Ym — Xm)/|Ym — Xml. Then |u,| = 1 and xm tends to y as
m — ©. By the proof of Theorem 1 there is a supporting hyperplane of the form
{XiUm*(X — Xm) = 0}. Let u be an accumulation point of the bounded set
{Uj, Us, ...p and P = {rru+ (x — y) = 0}.]
. The barycenter of an r-simplex is the point at which the barycentric coordinates
are equal, 4 = g! = --. = 7f,
(a) Show that the barycenter of a triangle is at the intersection of the medians.
(b) State and prove a corresponding result for r > 3.
10. Let x be a convex combination of xi,..., Xm and let x; be a convex combination
Of yj1,---+,¥Yimj J = 1,..., m. Show thatx is a convex combination ofz1, ... , Zp,
which are the distinct elements of the set {yj,:k = 1,...,mj,j = 1,..., mb}.
1-5 Convex and Concave Functions 23
11. Let S be any subset of E". The set S of all convex combinations of points of S
is the convex hull of S.
(a) Using Problem 10, show that S§ is convex.
(b) Using Proposition 3, show that if K is convex and S C K, then SC K. Thus
the convex hull is the smallest convex set containing S.
12. Given xo and 6 > 0, let C = {x: |x‘ — 2o| < 6,7 = 1,...,n}, an n-cube with
center xo and side length 26. The vertices of C are those x with |a’ — zo| = 6
fora = 1,...,n”. Show that C is the convex hull of its set of vertices. [Hint:
Use induction on n.]
13. Let K be a closed subset of H” such that both K and its complement #” — K
are nonempty convex sets. Prove that K is a half-space.
14. Let K be any convex set. Prove that its interior and its closure are also convex
sets.
15. Let A and B be convex subsets of #”. The join of A and B is the set of all x such
that x lies on a line segment with one endpoint in A and the other in B. Show
that the join of A and B is a convex set.
ae Xe Ke (xX) = C)
The same proof shows that {x : f(x) < c} is also convex. The converse to
Proposition 6 is false; for example, let f be any increasing function with domain
E'. Then K, is either all of E', a semi-infinite interval, or the empty set. In
each case K, is convex. However, f need not be a convex function; for instance,
if f(z) = x3, then f is not convex on E!.
Example 1. Let A be any nonempty closed subset of H”. For every x, let f(x) be
the distance from x to A, namely,
Proof.+ Let Xo be any point of K, and d the distance from x9 to the boundary
of K (d = +a if K = EH"). Let C be an n-cube with center xo and side length
26, where n"!?5 < d. Let V denote the set of vertices of C (see Problem 12,
Section 1-4). V is a finite set. Let
Let x be any point such that 0 < |x — xo| < 6, and define xp + u, x9 — u
on the line through xp and x as in Fig. 1-15. Let us write x as a convex com-
bination of x9 + u and Xo, and Xp as a convex combination of x and Xo — u.
Tet 6~"Ix — xp|,then
x = UX+ u) + (1 — £)Xo,
1 t
Roker = hepep Xo Us
Since f is convex,
PROBLEMS
1. In each case find those intervals of ZH! on which f is convex and those on which
it is concave. Illustrate with a sketch.
(20 fe) i= 127: (b) f(z) = exp (—2z).
(c) f(z) = 2/(1 — |), || ¥ 1. (d) f(z) = log (w?+ 1).
2. (a) Show that no polynomial of odd degree (> 8) is a convex function on H!.
(b) Which fourth degree polynomials are convex functions on 11?
(c) Why must a polynomial (of degree >2) which is a convex function on EH!
be a strictly convex function?
3. Let f(x) = x(x) and g(x) = $(1/x), where ¢ has a second derivative $’’(x) for
every x > 0. Show thatf is convex on (0, ©) if and only if g is convex on (0, ©).
AviLety > 0.7 > 0,0 =< < 1. Show that i+ (1 — dy = z'y'—. (Hint: Log
is an increasing, concave function.]
5. Prove by induction on m that if f is concave on K, then
(X ‘) > Ss t’f(x;)
j=1 all
28 Euclidean Spaces, Convexity 1-6
for every X1,...,Xm€K and scalars ¢t!,...,¢" such that each ¢#/ > 0 and
ti ----+t™ = 1. [For convex functions the sense of the inequality is reversed.]
6. (a) Generalizing Problem 4, show that if %1,...,%m are positive numbers,
OS forg = ale. meando) = Paten
mr 1 ¢™
LIPTas
aieemek
Sulton 1
triangle inequality. Just as for the euclidean norm one can easily prove using
(2), (3), and induction on m that
xk
— "xo 33).
As in the euclidean case, they are convex sets.
Example 1. Let
Ix = D> i
The n-balls with respect to this norm are convex polytopes. For example, if n = 2,
the closed unit 2-ball {x: ||x|| < 1} is the square with vertices e1, e2, —e1, —ee.
Compare with Problem 1(a), Section 1-4.
Example 2. Inner products, quadratic norms. A real-valued function B on the
cartesian product HL” X E” is bilinear if for each x, y € H” the functions B(x, ) and
B( ,y) are linear. An inner product in £” is a bilinear function such that: (a) B(x, y) =
B(y, x) for every x, y, and (b) B(x,x) > 0 for every x # 0. With any inner product
B is associated a quadratic norm given by
In fact, if n aS
x = De Ties, Yis Ss y'e;,
i=1 eh!
then n n
and we set ci; = B(e;,e;). Condition (a) states that the matrix (c;;) is symmetric,
Cij= cj; for 1, 7 = 1,...,n. For the standard euclidean inner product, ci; = 6;;
and the matrix is the identity.
The n-balls with respect to any quadratic norm are n-dimensional ellipsoids.
This will be proved later (Section 4-8) by finding a new orthonormal basis for H”
for which the matrix associated with B is diagonal.
If y is a vector, then there is a covector a such that
On the other hand, given a covector a there is a vector y such that (1-13) holds. Its
components are
n
Ua ih cee) (1-14b)
j=1
where (c’) is the inverse of the matrix (c;;). For the standard euclidean inner product,
y’ = a;, and formula (1-13) becomes (1-7).
Proposition. Corresponding to any norm || || on E” there exist positive num-
bers m and M such that for every x € E”,
Ix < Mix.
From this |x — y|| < M|x — y| for every x and y, which implies that
|| || is a continuous function. Therefore it has a minimum value m on the com-
pact set {x : |x| = 1},
m = min {|[x||> |x) = 1},
By Axiom (1) m > 0. If x = 0, all terms in (1-15) are 0. Given any x # 0,
let ¢ = |x|~*. Then |cx| = c|x| = 1, and hence ||cx|| > m. By Axiom (2)
1-6 Noneuclidean Norms 31
|x|] = mlx].a
Irom the proposition,
Proof. By (1) and (4) there exist r; > 0 and rg > O such that y € K if
ly] <r, andy ¢ K if |y| > ra. Hence given x + 0, tx EK if |t) < r1/|x|,
and tx ¢ K if |t| > re/|x|. (See Fig. 1-16.) Let
\
Ss; = {t 2b S KE
=1 = 245
1 1
(lel + Iv. Frere 1-16;
A little manipulation shows that su = (1 — u)t. Consequently,
Axioms (1), (2), and (3). Ifa # 0, thena-x ¥ 0 for some x. We can assume
that a-x > 0, since if not, x can be replaced by —x. If y = (1/||x||)x, then
a-y> 0 and |ly|| = 1. Thus {a-x:||x|| = 1} contains some positive num-
ber, and its maximum |lal| is positive. The reader should verify that ||ca|] =
|c| |lal|. Given a, b € (E”)*, we have whenever ||x|| = 1,
Hence the number |la|| + ||b|| is an upper bound for {(a + b)-x:||x|| = 1}.
The least upper bound of this set is ||a + b||. Thus
From 1-19, a- x < ||x|| for every covector a such that |/a|| = 1. Hence
To prove the opposite inequality, consider first any y with |/y|| = 1. Then y
is a boundary point of the closed convex set K defined in (1-16). By a corollary
to Theorem 1 (Problem 8, Section 1-4), K has a supporting hyperplane at y.
Thus there exists a covector b such that
Thus |ly|| < max {a-y:|lal|/ = 1}. We have already proved the opposite
inequality. Hence (1-20) is true for elements of norm 1. Since ||ey|! = |c| ||y|)
and
max {a- (cy) : |lal] = 1} = |c| max {a-y:]a|| = Lf,
(1-20) then follows for elements of arbitrary norm.
34 Euclidean Spaces, Convexity 1-6
PROBLEMS
Pere ix| =sraax tot 8, ey:
(a) Show that this is a norm. x
(b) Describe the neighborhoods with respect to it.
(c) Show that the triangle with vertices 0, e1, e2 is equilateral with respect to the
distance which it defines.
2. The ellipse K = {(z, y): 2? -+ ry + 4y? < 1} has Properties (1)—-(4). (See p. 31.)
(a) For what (quadratic) norm is it the closed unit 2-ball?
(b) Find |le1 — eg||.
Se letro = Jand let |x| = (> 721 (a1?) "2. [For p = 1 this is Example labove,
and for p = 2 this is the euclidean norm.] Show that this is a norm, in the follow-
ing steps:
(a) Let f(x) = 0%; |z'|?. Show thatf is convex on EH”. (Hint: In Section 1-5
we showed this for n = 1. Hence if 0 < ¢ < 1, |t2*-+ (1 — Oy'|? < tx]? +
(ily?
(b) Let K = {x: f(x) < 1}. Show that K satisfies Properties (1)—(4) on p. 31.
(c) Show that ||x|| is given by (1-17). [Note: For this norm the inequality
Ix+ yl] < ||x|] + lly|| is called Minkowski’s inequality. There is a related
inequality for integrals which we shall prove in Section 5-12.]
4. Show that if p = 1 in Problem 3, the dual norm is given by
Differentiation of
Real-Valued Functions
2 f'(o) = h-0
lim
feo +8) = feo)”
h
provided the limit exists. The corresponding expression for functions of several
variables does not make sense, since h is then a vector and division by h is
undefined. Therefore we must find an acceptable substitute for it. Let us first
consider the derivative of f in various directions.
Let us call any unit vector v (that is, vector with |v| = 1) a direction
in #”. The directions are just the points of the (n — 1)-dimensional sphere
which bounds the unit n-ball. If n = 1, the only directions are e; and —e,,
which we have identified with the scalars 1 and —1. If n = 2, every direction
can be written (cos 6, sin 6) where 0 < 6 < 27. The angle @ determines the
direction. For any n > 2 the components of a direction v satisfy v’ = cos 6;,
1 = 1,...,n, where 9; is the angle between v and e;.
Given X and a direction v, the line through x9 + v and Xo is called the
line through Xo with direction v. According to the definition on p. 13, this line is
Quy
f(z, y) = puny fey) (0, 0):
Since the domain D is to be HE? we must give f some value at (0,0). More or less
arbitrarily, we let f(0,0) = 1. Let us find the directional derivatives of f at (0, 0).
Given a direction (cos 9, sin 6),
or d(t) = sin 26, foreveryt ~ 0. But ¢(0) = 1. If sin 26 = 1, then ¢ is the constant
function with value 1 everywhere, and ¢’(0) = 0. Thus if 6 = 2/4 or 5/4, the
directional derivative exists and is 0. For all other values of 6 the function ¢ is dis-
continuous at 0, and consequently ¢’(0) does not exist. Thus f has a directional
derivative at (0,0) only in the directions (2/2, V/2/2) and (—+/2/2, — V/2/2).
2-1 Directional and Partial Derivatives 37
In the next section we shall see that if f is differentiable at xo, then the de-
rivative in every direction exists and is easily calculated. Thus the unpleasant
phenomenon illustrated by the example cannot occur for differentiable functions.
The partial derivatives of f are defined as the derivatives in the directions
€1,..., €n, if these directional derivatives exist. There are several equivalent
notations in use for partial derivatives. Of these we shall adopt just two The
ath partial derivative of f at x,7 = 1,...,n, is denoted by
Of
Thus eS? sea Ox; St
Ce wee tae ad, ) 2°) = fl ) Aa)
) ;
Fi(X)
5 =
ba
| d ’
j ’
(2-3)
provided the limit exists. Stated in less precise terms, f;(x) is the derivative
taken with respect to the 7th variable while holding all other variables fixed.
fi (x, Y, z) = 2x,
The symbol f; will denote the real-valued function whose value at x is f;(x).
Its domain is the set of points where f has an 7th partial derivative.
For purposes of brevity, we shall occasionally abuse the notation by writing
f; for the value f;(x) at some particular x. In each such instance this abuse will
be indicated either explicitly or by the context.
Example 3. Let f(x) = Wlg(x)] for every x € D. Suppose that the 7th partial deriva-
tive of g at xo and the derivative of at g(xo) exist. By the composite function theorem
for functions of one variable
Filo) = ¥’[g(xo)]gi(Xo). (2-4)
This theorem will be proved in Section 4-4 as a special case of the composite function
theorem for transformations.
PROBLEMS
Unless otherwise stated, the domain D of f is £” for the particular n indicated
in the problem.
1. In each case find the partial derivatives of f.
(a) f(a, y) = xlog (zy), D = {(a,y): ay > 0}.
(Dray 2) a= (a2 2y2ct-2) 8.
(c) f(x) = x-x.
2. Let f(z, y) = (« — 1)? — y?. Find the derivative of f at eg in any direction v,
using the definition of directional derivative.
3. Let f(z) = 21/3. Show thatf has no derivative at 0.
38 Differentiation of Real-Valued Functions 2-2
4. Let f(x, y) = (xy)'/3. (a) Using the definition of directional derivative, show that
fi(0, 0) = fe(0,0) = 0, and that +e, +e2 are the only directions in which the
(b) Show thatf is continuous at (0, 0).
derivative at (0, 0) exists.
5. Let f(z, y,z) = |e + y+ 2]. Find those direction’. in which the derivative of f
at e; — eg exists. [Hint: The absolute value function, g(t) = |t| for every t € £!,
has no derivative at 0.]
6. Show that the derivative of f at xo in the direction —v is the negative of the deriva-
tive at Xo in the direction v.
epiibtti
= leo th) 0. (2-5)
h0 {h
Let us show that if f is differentiable at xo, then f has a derivative at xo
in every direction v. Taking h = tv, (2-5) implies that
and its value at a vector h is denoted by df(xo) -h rather than (Tae Lav
e;, then the number a; = L/(e,) is the ith partial derivative fi(Xo). Hence
the components of the covector df(xo) are the partial derivatives:
af(xo) = Dofalxo)e’,
1=1
(2-6)
(Xo, 29 +df(Xo)-h)
(Xo, 20)
|
X=Xjth
-e———————
——— Zo =J(Xo)
Figure 2-2
Example 1. Let f(z, y) = (zy)!/%. Find the tangent plane at (1,1,1). By ele-
mentary calculus
2=1+e-1)+4y—D.
The partial derivatives f1(0, 0) and fo(0, 0) are both 0, according to Problem 4,
Section 2-1. However, there is no tangent plane at (0,0, 0). If there were a tangent
plane at (0, 0, 0), then f would have to be differentiable at (0, 0). Since there is not a
derivative in every direction at (0, 0), f is not differentiable there.
Example 2. Let
Qry”
f(a, y) = if (x,y) ~ (0,0), and f(0,0) = 0.
oes
There are two cases to consider. If cos@ ¥ 0, then the derivative at (0,0) in the
direction (cos 6, sin @) is
If cos@ = 0, then f(écos 6, t sin 0) = 0 for every ¢ and the directional derivative
at (0, 0) is 0. However, f(y”, y) = 1 for every y ~ 0. Since f(0,0) = 0, f is not
continuous at (0,0). By Proposition 7, f is not differentiable at (0, 0).
Proposition 8. Let ¢(t) = f(Xo + th). Then for every t such that f is dif-
ferentiable at Xo + th,
T>0 oF
Mean Value Theorem. Let f be differentiable at every point of the line segment
joining Xo and Xy +h. Then there exists a number s € (0, 1) such that
dj(x) = )) fixe’
tent
whose components are the partial derivatives f;(x), it is sometimes more suit-
able to think instead of the vector with these same components. This is called
the gradient vector at x and is denoted by grad f(x). Thus
*Note: This definition of the gradient vector is correct only if we use the
euclidean inner product in HE”. If H#” is given some other inner product B,
then one should use formula (1-14b) for changing covectors into vectors.
The gradient vector in that case is
n
PROBLEMS
In Problems 1, 2, 3, and 8, assume that f is differentiable. In each case this
follows from Theorem 2 in the next section.
1. Let f(x, y) = 3x7y + 2zy?. Find the tangent plane at (1, —2, 2).
2. Using the formula df(xo) : v for directional derivative, find the derivative of f at
xo in the direction v.
df(x) +x = pf(x)
for every x ~ 0, and conversely. [Hint: Let g(t) = f(tx) and use Proposition 8
with xo = 0. For the converse, show that for fixed x, (t)t~? is a constant.]
8. Let Q(x) = Dotj=1 Cijv‘z’, where Ci; = Cj; and Q(x) > 0 for every x ~ 0. Let
f(x) = [Q(x)]*/*. Calculate df(x) and verify Euler’s formula for this function.
whenever \h| < 6,. Since f, is continuous, there exists 62,0 < 52 < 69, such
that |fn(y) — fn(Xo)| < €/2 whenever ly — xo| < 6. Let 6 = min {6}, 35}
and let |h| < 6. By the mean value theorem,
Using the above inequalities, the triangle inequality, and the fact that |h”| <
|h|, |b] < |h|, we get
Ifo +h) — (Ko) — af(xo) hl < 5 |A"| +5 [Al < lhl
whenever |h| < 6. This proves that f is differentiable at Xo. I
Example 1. If f and g are functions of class C” with the same domain D, then f + g
is of class C™. Using the product rule from elementary calculus, the partial deriva-
tives of the product are
+ foi:
(f9)i = fig ese its
Since sums and products of continuous functions are again continuous, (fg); is con-
tinuous for each i = 1,...,n. Hence fg is of class C™.
Example 2. The composite of two functions of class C“) is also of class C™. For
suppose that f = Y°g, where y and g are of class C. By formula (2-4), p. 37,
fi = (W’ °g)g:. Since y/ and g are continuous, their composite ¥ © g is continuous
46 Differentiation of Real-Valued Functions 2-3
(Proposition A-7). Since g; is continuous, the product (W’ © g)g; is continuous. Thus
f is of class C“.
IEAeS) tore
af (x)
axidx"
For example, if F(, y) aa rie then filz, y) ia Dae fir, y) = 2y°,
fia, y) = Gay’.
If all of the partial derivatives f;;(x), 1, 7 = 1,...,”, exist at every
x € D and each f;; is a continuous function, then f is called a function of class
Cc By the corollary to Theorem 2, if f is of class C then fi,...,f, are
continuous. Hence any function of class C is also of class C™.
The partial derivatives of f of order gq = 3,4,... are defined similarly,
wherever they exist. The notation for partial derivative at x, first in the direc-
tion e;,, second in the direction e;,, and so on, is
WE Aes) il << 11 << n, = il. ooo y Ue
It can happen that a function f has the second partial derivatives f;; and
fii, t # J, but that f;; # fj; See Problem 6. However, this undesirable phe-
nomenon cannot occur if f is of class C™. This is even true under the shghtly
weaker hypotheses of the following theorem in which no assumption is made
about the other second-order partial derivatives of f.
2-3 Functions of Class C'”’ 47
Proof. Suppose first that 7 = 2. We need to show that fig = fo1. Let
(Xo, Yo) be any point of D, and 59 > 0 such that the 69-neighborhood of
(Xo, Yo) is contained in D. For 0 < u < 69/+/2 let
i
A(u) = ee [F(zo + u, Yo + u) — f(%o, Yo + u) — f(to + u, Yo) + f(Zo, Yo)]-
h(y) = filé, y)
for every y such that (& y) € D. The domain of h is open and contains
[yo, Yo t+ ul. Moreover, h’(y) = fio(£, y), h is of class C\ since fi. is con-
tinuous, and
it
A(u) = U= [h(yo + u) — hlyo)].
for some
Since f;2 and fg, are continuous, given € > 0 there exists 6 € (0, 69) such
thateitOn<yu <0/~/2,
[tr2(Ee mp) == 7
too, Yo) ene, \far(E*, gia fo1(ton Yo) ||<ee:
for every (x, y) In some open set containing (x, x). Applying the theorem to
¢, we find that
Ga, De fe
1
Oneeet
n
t Be), 2-4)
5 °
wate ig-1=1
ee ee
Rae) =D) Farnig (Ko + sh)hit; «= Be) (2-10)
Example 5. Let f(x, y) = x7y and (xo, yo) = (1, —1). Then
and all other partial derivatives are 0. Here we have written f; for short in place of
fi(z, y), and so on. Ra(z, y) = 0, and Taylor’s formula becomes
where the partial derivatives on the right-hand side are evaluated at (1, —1). Thus
exist. Let
Lie ieee,
A= Mad) © BIE iy OS SNe.
Et oe Sa sy
For other extension theorems of Whitney, see Trans. Amer. Math. Soc.
36 (1934), and Bull. Amer. Math. Soc. 50 (1944).
*Functions of class C’”); real analytic functions. Let us say that f is of
class C? if f is of class C for every q. If f is of class” and lim,_,» Rq(x) = 0,
then in place of Taylor’s formula with remainder we may put the corresponding
infinite series. This infinite series is called the Taylor series for f(x) at Xo.
If K is a convex subset of D and xy € K, then the following is a sufficient
condition that f(x) be the sum of its Taylor series for every x € K. Suppose
that there is a positive number 1 whose gth power bounds every gth-order
partial derivative of f, namely,
Vapor x) a * (2-11)
TOteCVeLVEXiee age al Ore GNC) SS 41,. 125 tg < 1. Smee K is convex,
for each x € K the estimate (2-11) also holds at xo + sh. Since |h’| < |hl,
from (2-10) we have
[R@| < EM
SALete ealOnS
where C= nM |h|. Since C2/q! — Oasq — a,
lim R,(x) = 0
q-~
0 ie ge SO
Let us show that this function is of class C‘” and that f‘’(0) = 0 for every
g = 1,2,... Forx # 0 the derivatives f(x) can be computed by elementary
calculus, and each f‘” is continuous on H' — {0}. It is at the point 0 where
f must be examined. Now
lim u* exp (—u) = 0 foreach k = 0,1,2,..., (2-12)
uUu— +o
a fact which we shall prove immediately below. If x < 0, then f(x) = f(x) =
f’(z) = ++-=0. Using (2-12) with k = 0, exp (—1/z”) > 0 as x — OF.
Since f(0) = 0, f iscontinuous. If x > 0
or 2 i eo
ee) i ey | 1
ee x3 exp ( 5) v4 exp ( 4)
52 Differentiation of Real-Valued Functions 2-3
PROBLEMS
1. Expand f(a, y, z) = xyz by Taylor’s formula about xo = (1, —1,0), with g = 4.
2. Let f(z, y) = W(ax + by), where a and 6 are scalars and y is of class C in some
open set containing 0. Show that Taylor’s formula about (0, 0) becomes
q—1 _(m)
f@,y) = > ps(ee) (ax)’(by)"2+ Ra(2, y),
m=0 ;
where (”;) is the binomial coefficient (which equals the number of 7-element subsets
of a set with m elements).
3. Let f be continuous on an open set D and of class C on D — {xo}. Suppose
moreover that J; = limy_,x, fi(x) exists for each 7 = 1,...,n”. Prove that J; =
fi(xo), and consequently that f is of class C™ on D. State and prove a correspond-
ing result in case g > 1.
4. Prove the statement made in Example 3.
5. Let f@) = z* sin(1/z) if « ~ 0; and f(0) = 0. Show that:
(a) Ifk 0, then f is discontinuous at 0.
(b) If k = 1, then f is of class C but not differentiable at 0.
(c) If k = 2, then f is differentiable but not of class C™.
(d) What can you say fork > 3?
6. Let f(x, y) = zy? — y?)/(a? + y?), if @, y) 4 (0,0), and f(0, 0) = 0.
(a) If (z,y) A (0,0), find fie(x, y) and foi(z,y) by elementary calculus, and
verify that they are equal.
(b) Using Problem 3 show that f1(0,0) = f2(0,0) = 0 and f is of class C®.
(c) Using the definition of partial derivative, show that f12(0,0) and f21(0, 0)
exist but are not equal. Why does this not contradict Theorem 3?
2-4 Convex and Concave Functions (continued) 53
7. Given n and q, how many solutions of the equation 7; +--+ 7, = q are there
with 21,...,% nonnegative integers? With 71,...,7, positive integers? What
func-
does this say about the number of different gth-order partial derivatives of a
tion of class C(O?
Proof. Let f be convex on K, and let xo, x be any two points of K. Let
h = x — xp) andt e€ (0,1). By definition of convex function,
By (2-13a) we have
f(a) > fo) + df(ao) +h,
f(x2) > flo) + df (xo) C a4 h):
Multiplying by ¢/(1 — 4) in the first inequality and adding, we get
the) + san) > (FAG +1) fe0), or Yen) + (1 = flex) > Hex)
But this is just the inequality (1-8a) in the definition of convex function. We
assumed that ¢ € (0,1), but if t = 0 or 1, (1-8a) trivially holds. Therefore
f is convex on K. §
By sharpening the inequality in (2-13a) we get a necessary and sufficient
condition for strict convexity.
The proof of the converse is the same as for Proposition 9a, all inequalities
now being strict.
Let us next prove a theorem which provides a convenient test for con-
cavity or convexity of a function of class C’. Let f be of class C”’ on an open
set D. Let Q be the function with domain D x EH” defined by the formula
fx) = Do caz'a’,
i,j=1
2-4 Convex and Concave Functions (continued) 57
for each x € E”, where the n X n matrix (c;;) is symmetric. Then Hie) omen are|
Q(x, h) = 2f(h). Hence f is convex on HE” in case f(x) > 0 for every x, and concave
in case f(x) < 0 for every x. If f has values of both signs, then f is neither convex
nor concave.
Example 2. Let f(x) = exp [g(x)], where g is of class C®) and convex on K. Then
The last term on the right-hand side is nonnegative since g is convex. Moreover,
n n 2
and exp [g(x)] > 0. Hence Q(x, h) > 0 for every x € K and every h. Therefore
f is convex on K. This example is a special case of Problem 4, since the exponential
function is increasing and convex on E!.,
where we have written (h, k) for (h', h”) and f,; for f;;(z, y).
If the discriminant —(f11f22 — f?2) is negative, the equation Q(z, y, h, k) = 0
has no roots (h, k) except the trivial one (0,0). The sign of f1; and fo: deter-
mines whether Q(z, y, , ) > OorQ(z,y, , ) < 0.
If fisfes — fro < 0, then {(h, k) : Q(z, y, h, k) = 0} consists of two lines
intersecting at (0,0). They divide the (h, k)-plane into four parts, on two of
which Q(x, y, h, k) > 0 and on the other two of which Q(z, y,h,k) <0. In
this case, Q(z, y, , ) is indefinite. Thus
Example 3. Let f(x,y) = de? + y?) + ay. Then fir = 2, foo = y, fii fo2—fi2 =
ay — 1. Hence f is strictly convex on the part of the first quadrant above the hyper-
bola zy = 1, and strictly concave on the part of the third quadrant below this
hyperbola.
These are called the principal minor determinants of the matrix (f;;(x)). The
mth principal minor d,,(x) is the determinant of the matrix obtained by delet-
ing the last n-m rows and columns. The determinant d,(x) is called the Hessian
Off at x,
Let us state without proof the following criterion:
For a proof of the first of these two statements, see reference [3], especially
pp. 140, 147. The second follows from the first by considering —Q. Here
iff is an abbreviation for “if and only if.”
Criterion II is fairly convenient for small values of n, but becomes unwieldy
for larger ones. This is because of the very large number of operations required
to calculate the determinant of an m X m matrix even for moderately small m.
III. In linear algebra it is shown that any quadratic form can be written
as a linear combination of squares by suitably choosing a new orthonormal
basis for H”. This fact is also proved in Section 4-8 below. Therefore
h = >> n'(x)vi(x).
oa
2-4 Convex and Concave Functions (continued) 59
The numbers )1(x),..., n(x) are just the characteristic values of the matrix
CATES)
If \,(x) > 0 for each i = 1,...,n, then from (2-17) Q(x, h) > 0 unless
n'(x) = 0 for each 7 (that is, unless h = 0). In this case Q(x, ) is positive
definite. Conversely, if h = v,(x), then Q(x, h) = \,(x). Therefore, if Q(x, ) > 0,
then in particular Q(x, v,(x)) > 0, and A,(x) > 0. This proves the first of
the following statements:
OC es 0 Mate A(x)
> 0) fort = arn:
ieee OI Ne Xe ONS Ord == a1een,
The second is proved in the same way. Replacing on both sides “>0” by
“>0” we get a criterion for nonnegative semidefiniteness, and replacing “<0”
by “<0,” one for nonpositive semidefiniteness.
If n is fairly large it is better, instead of criterion II, to try some numerical
method for putting Q(x, ) in the form (2-17).
PROBLEMS
1. Use Theorem 4 to determine whether f is convex on K, concave on K, or neither.
Unless otherwise indicated, K = EH? or E?.
(Ah fGyy, 2 = a? iy? — 42?
(b) f(z, Yy; z) Se Te ane ee
(oy f{@,y) = (ea y+ 1)’,K = {@,y):2+ y+ 1 > 0}.
GG giz) = exp (0 ayy? 27).
(e) f@, y) = exp (zy).
In which cases is the convexity or concavity strict?
2. Let f(z, y) = o(x? + y?), where ¢ is of class C), increasing and concave. Show
thatf is convex on the circular disk x? + y? < a? if and only if ¢’(u) + 2ud’’(u) > 0
whenever 0 < u < a?.
3. Using Problem 2, find the largest a such that f is convex on x? + y? < a’.
(a) f(x,y) = log (1+ 2? + y?). (b) f(z, y) = sin @? + y?).
4, Let K be an open, convex set, and g a function which is convex and of class C)
on K. Let I C E! be an interval such that g(x) € J for every x € K. Let gd bea
function which is of class C‘?’, nondecreasing, and convex on J. Let f be the com-
posite of @ and g, f(x) = $[g(x)] for every x € K.
(a) Using Theorem 4, prove that f is convex on K.
(b) Prove the same result without the assumption that ¢ and g are of class C®,
by using directly the definitions of convex function and nondecreasing function.
5. Using Problem 4, show that each of the following functions is convex on H”:
(a) f(x) = |x|’,p = 1. (De Ryeae let XX) aie.
(eo) fxe=) (ie lx|7)?*, 9 > 1. [Hint: First consider p = 1.]
6. Let f be of class C“, decreasing, and convex on a semi-infinite interval (c, ©).
Prove that if f(z) > 0 for every x > c, then limz,+4.f'(z) = 0. [Hint: Let l =
sup {f’(c):2 > cl}. Either 1 = 0 or / < 0. Using the fundamental theorem of
calculus, show that if 1 < 0, then limz,4. f(x) = —%.]
60 Differentiation of Real-Valued Functions 2-5
7. Let K be a convex set with nonempty interior int K, and x* some point of int K.
Prove each of the following:
(a) For every x € K and s € (0, 1], sx* + (1 — s)x € int K.
(b) If f is continuous on K and convex on int KSthen f is convex on K. [Hzint:
Use (a).]
(c) Suppose that fr K contains no line segment, and that f is continuous on K
and strictly convex on int K. Then f is strictly convex on K.
f(Xo) =m 4f(x)ex Ee A}
is the minimum value of f on A. (Of course, there need not be any such
point Xo. However, if A is a compact set, then by the corollary to
Theorem A-6, any continuous function has an absolute minimum at some
point of A.) If f(xo) < f(x) for every x € A except xo, then f has a strict
absolute minimum at Xo.
Proof. Given a direction v, let ¢(t) = f(xo + tv) for every ¢ in some open
subset of H' containing 0. Then ¢ has a relative extremum at 0, and con-
sequently by elementary calculus ¢’(0) = 0. But ¢/(0) = df(xo)-v is the
derivative at Xo in the direction v. Hence df(xo) -v = 0 for every v, which
implies that df(xo) = 0. J
]
v(x) = grad f(x).
|grad f(x)|
This direction is called the direction of the gradient at x, and is the one which
maximizes the directional derivative. The maximum value of the directional
derivative is
(a) (b)
Figurr 2-5
62 Differentiation of Real-Valued Functions 2-5
concave function, then it appears that the summit will be reached by this
technique. However, if the mountain has a more complicated shape, the
climber may reach a false summit or a saddle as in Fig. 2-5(b). Once he reaches
any critical point, the gradient method tells him to stay there.
The gradient method will be defined more precisely later (Section 3-4).
For functions which are convex or concave, Proposition 10 has a converse.
Proof. Since df(xo) = 0, f(x) > f(Xo) for every x € A by Proposition 9a. §f
For functions which are neither convex nor concave, the theory of relative
extrema is more complicated. We shall consider only functions of class C on
A. The main result is:
Proof. Let f have a relative minimum at x9. Then there exists a neighbor-
hood U of xo such that
where h = x — Xo, x © U. Suppose that Q(xo, ho) < 0 for some ho. The
proof of Theorem 4 shows that f(x) < f(xo) for some x € U of the form
Xo + cho. This is a contradiction. Therefore Q(xo, ) > 0, which proves (a).
To prove (a’), suppose that Q(xo, ) > 0. Using Problem 8 and the fact
that the functions f;; are continuous at xo, there exists a neighborhood U of
Xo such that U C A and Q(y, ) > 0 for every y € U. Taking y = xo + sh,
we find from (2-18) that f(x) > f(xo) for every x € U, x ¥ Xo. This proves
that f has a strict relative minimum at x9. Statements (b), (b’) follow respec-
tively from (a), (a’) by considering —f.
Wi =) =
c=—7pes
C= at
27
Figure 2-6
Example. Let f(x,y) = 2y2 — x(x — 1)? for every (2, y) € H? and A = E?. This
function has two critical points, (4, 0) and (1, 0). We find that
The point (4, 0) of relative minimum is an isolated point of the level set con-
taining it. For —34 < c < 0 the level set has two parts. The one which encloses
(4, 0) resembles a small ellipse if ¢ is near —z. This can be attributed to the fact
that near (4, 0), f(z, y) is approximated by the first two nonzero terms in its Taylor
expansion about (4, 0), namely,
Examples. The extreme points of a simplex are the vertices. If K is a closed n-ball
then every point of fr K is extreme. A half-space has no extreme points.
a eet C=C
But f(x) = C, and since each t? > 0 we must have f(x;) = Cforj = 1,...,m.
Thus X1,...,Xm © K,. Conversely, if f(x;) = C for each7 and x is a convex
COMbinationOl x7, <5. , Xm, then f(x) = C. |
PROBLEMS
Their derivatives are given by the formulas sinh’ = cosh, cosh’ = sinh.]
. Let f(z, y,z) = x? + y? — 27. Show that f has one critical point, which does
not give a relative extremum. Describe the level sets.
. Let f(z, y, 2) = 2? + 3y? + 227 — 2Qry+ 2zrz. Show that 0 is the minimum
value of f.
. Given x1,...,Xm, find the point x where }>7); |x — x;|? has an absolute mini-
mum, and find the minimum value.
. (a) In Problem 1(a) find the (absolute) maximum and minimum values of f on
the circular disk xz? + y? < 1.
(b) Do the same for 1(c).
. (a) Show
ear that under the hypotheses of Theorem 5, {x € A: df(x) = 0} is a
convex set.
(b) Illustrate this result in case A = E? and f(z, y) = (x — y)?.
. Let g(h) = do?j=i cish'h?. Assume that g > 0, that is, that g(h) > 0 for
every h ~ 0.
(a) Show that there exists a number m > 0 such that g(h) > ml/h? for every h.
[Hint: The polynomial g is continuous, and has a positive minimum value
m on the unit (n — 1)-sphere.]
(b) Suppose that |Ci; — ci;| < en? for each 7, 7 = 1,...,n. Let G(h) =
Dij=1 Cijh‘h?. Show that G(h) > (m — ©)J|h|? for every h. Hence G > 0
ife < m.
. Let f(x) = ¥(a-x), where y is of class C@. Show that every critical point of
f is degenerate.
10. Let xo be a nondegenerate critical point of a function f of class C@’. Show that
Xo is isolated, that is, that xo has a neighborhood U containing no other critical
points of f. [Hint: Let x be another critical point in U. Apply the mean value
theorem to each of the functions f1,...,f, to find that
where each y; € U. Show that if U is small enough, det (f:;(y,)) + 0 and con-
sequently the system of equations (*) has only the solution x — xo = 0, a
contradiction. ]
fi Let A be closed and convex, and P a supporting hyperplane for K. Show that
any extreme point of K Q P is an extreme point of K.
2-6 Differential 1-Forms 67
*12. Let K be a closed convex polytope (not necessarily compact) and f be a linear
function such that f(x) is bounded above on K. Show that f has an absolute
maximum on K.
Let us first give a rough description of this notion and afterward be more
precise. A differential form w of degree 1 is supposed to be an “expression linear
in the differentials dr!,... , dx””:
w = w, dr’ +---+ w, dz”, (2-19)
vector space, the sum w + ¢ of two functions w and ¢ with the same domain D
and values in (£”)* is defined (p. 8). Similarly, the product fw is defined
if f is a real-valued function and w a 1-form, with the same domain Dain
particular, w; dz is the 1-form whose value at eath xis w;(x)e’. From (2-21),
w, dx! +---+ w, dx” is the 1-form whose value at each x is w(x). Therefore
formula (2-19) is correct.
ie
= T
OW; a7 a a7 00;
dx? dvidx’ —-axa? ax’
wo = Mdz+N dy.
We have shown that every exact 1-form is closed. The converse is false,
as Example 2 below shows. It is comparatively easy to check whether condi-
tions (2-22) are satisfied or not. Therefore it is very desirable to find some
additional condition which will guarantee that the converse holds. Such a con-
dition is that the domain D be simply connected. We shall prove in Chapter 7
that if D is simply connected, then every closed 1-form with domain D
is exact. We shall define the term “simply connected” in Section 7-7. For
the present, let us merely say that any convex set, and in particular E”, is
simply connected. For n = 2, an open, connected set D is simply connected
if and only if, roughly speaking, D has no holes.
We have been careful to distinguish notationally between functions and
their values. One can scarcely attain a sound knowledge of calculus until this
70 Differentiation of Real-Valued Functions 2-6
Example 1. Let w = 2xry dv + (x? + 2y) dy, D = E*. This is of course an abbrevia-
tion for wo = Mdz+ N dy, where M(x, y) = 2cy, N(x, y) = x? -+ 2y, for every
(x, y) € E?. In this example,
aM ON
Oy (x,y) = 2x = ae (ara)
for every (x, y). Hence w is a closed 1-form. Since H? is connected and simply con-
nected, w = df where f is determined up to the addition of a constant function. The
function f can be found by partial integration with respect to the first variable, as
follows:
Of 2 2 2 D
pt eh ea eee Yin Desi CYR eae
O )-
Of course these equations hold for every (x, y) € H?. Then ¢’(y) = 2y, and ¢(y) =
y + c for every y, where the “constant of integration” ¢ is a number which may be
chosen arbitrarily. Hence for every (a, y) € H?,
ACO) aI me
Example 2. Let D = H? — {(0,0)}. By removing (0, 0) we have made a hole, and
D is not simply connected. Let w = M dx + N dy, where for every (x, y) € D
Y
M(a, y) = ea
H
N(x, y) = 22 y2
Exampie 3. In some cases it can be seen by inspection that w is exact. For instance,
ifw = 2r!dz!4---+ 22" dx" and D = E”, then
PROBLEMS
1. Let n = 1. Give a precise interpretation of the formula df/dx = f’ from ele-
mentary calculus. [Hint: The quotient of two real-valued functions is defined
wherever the denominator does not have the value 0.]
2. Letn = 3 and w = Mdz+ N dy+Odz. What do conditions (2-22) become in
this case?
3. In each case determine whether or not w is exact. If exact, find all functions f such
LO A) Saye
(ayi @ = zy dr -+ (@2/2) dy, D = E?. (b) w = 2dxr-+ xz dy + zy dz, D = E?.
(cnc =" y det D = 8,
(d) @= (1/x2 + 1/y*)y dz — x dy), D = {(ryy) se ~ 0 and yy, = 0}.
4. Let w = dy + p(x)y dx and D be the vertical strip {(z, y):a < a < b}. Let p be
continuous on (a, b) and P be an antiderivative of p, that is, a function such that
P'(x) = p(x) for every x € (a,b). Let f(z) = exp [P(x)]. Show that fw is exact.
5. Show that
n 2 n
is an exact 1l-form. [Hint: Let h(u) = ug(u) for every u€ FE}. The function h
has an antiderivative.]
CHAPTER 3
Vector-Valued Functions
of One Variable
3-1 DERIVATIVES
Let g be a function from a set J C EL! into EH”. Let ¢ be an interior point
of J. Then the derivative of g at t is the vector
where ¢ = to + wu and |u| is small enough that t€ J. The ratio of the distances
x — y| and |x — xo| may be written, upon multiplying numerator and de-
nominator by 1/|u|, in the form
= ibaa ey 1
Betagieg dele henEU) 8) azaigey =e)
Hence
oe ail Le
eee oan laa]
This justifies calling vg a tangent vector at Xo, and the line through x9 and
Xo + Vo a tangent line at Xo (Fig. 3-1). Note that we have used the assumption
that g’(to) ¥ 0.
The number ¢ is often called a parameter. It need not have any geometric
or physical significance. However, if n = 3 and ¢t happens to denote time in
a physical problem, then g’(t) is the velocity vector.
A vector-valued function g has components g',...,g”, which are the
real-valued functions such that
g(t) = Ss g'(te:
4=1
for every te J. If g’(é) exists, then the 7th component u g(t + u) — g'(t)]
of the expression on the right side of (3-1) tends to g'’(t) as u — 0, by Propo-
sition A—4b, and
AO SS EOE (3-3)
i=l
Conversely, if g(t) exists for each 7 = 1,...,n, then g’(¢) exists and is given
by (3-3).
Example. Let n = 2, g(t) = t?e1 + (log t)ez. Find the tangent line at e;. In this
example g!(t) = #2, g(t) = logt, and to = 1, xo = g(l) = e1. Then g’(t) =
2te; + t—!e2, and vo = g’(1) = 2e1-+e2. The tangent line goes through e; and
3e,; + eg. Its equation is 2y = x — 1.
74 Vector-Valued Functions of One Variable 3-2
PROBLEMS
1. Find the tangent line at 2—!/2e; — 2'/?ee to the ellipse represented by g(t) =
(cos t)e; + (2 sin é)e2, J = [0, 27]. Illustrate with a sketch.
2. Find the tangent line at e: + e2 + e3’to the curve represented by
g(t) = tey + t'/eq+ t3e3, 4 < t < 2.
3. A particle moves along the parabola y” = 42 with constant speed 2 and so that
dy/dt = g?’(t) > 0. Find the velocity vector g’(t) at e1 — 2e2. [Note: The speed
is |g’ ()|.]
4. Give a proof of formulas (3-2):
(a) Using the corresponding formulas for derivatives of real-valued functions
and (3-8).
(hb) Directly from the definition (3-1).
5. Let g(t) = (3¢/(1 + #3)Jer + [8¢7/(1 + é)Jeo, t # —1.
(a) Sketch the curve traversed by g(¢) on the interval (—%, —1). On the interval
af) 2):
(b) cee ee {g(t):t ¥ —1} = {(2, y):23 + y® = 32y}. This set is called the
folium of Descartes.
3-2 CURVES IN E”
Let g be a function from an interval J C EH! into E”. Then g(t) traverses
a curve in #” as the “parameter” ¢ traverses J. It is better not to call g itself
a curve. Instead one should regard any vector-valued function f obtained from
g by a suitable change of parameter as representing the same curve as g. We
shall define a curve as an equivalence class of equivalent parametric represen-
tations. To simplify matters we shall at first consider only curves with con-
tinuously changing tangents.
Let us now be more precise. Let us for simplicity assume that J = [a, b],
a closed bounded interval, and that the components g',...,g” are of class
CY on [a,b]. By g(a) and g”(b) we mean respectively right-hand and left-
hand derivatives. They are equal to the derivatives at a and b of any class C‘”
extension of g' to an open set containing [a, b]. See p. 50.
Definition. If g’(t) # 0 for every ¢ € [a, b], then g is a parametric repre-
sentation of class C‘” on [a, b].
To motivate the definition of equivalence which we are going to make,
let us first consider an example.
Example 1. Let g(t) = te: + t7e2,1 <¢< 2. Then g!(t) = #, g(t) = #7, g’(t) =
e; + 2te2 ~ 0. Hence g is a parametric representation of class C“ on the interval
[1, 2]. In fact, it represents the arc of the parabola y = x? between (1, 1) and (2, 4),
traversed from left to right (Fig. 3-2). If we let f(r) = (exp r)e1 + (exp 27)ee,
0 < 7 < log 2, then f also represents this same parabolic arc. In effect, f is obtained
from g by the parameter change ¢ = expr. It is reasonable to regard f and g as
equivalent, and we shall do so.
3-2 Curves in E” 75
Example 3. Let g(t) = (cos mt)e1 + (sin mt)e2, 0 < t < 27, where m is an integer
not 0. The trace is the unit circle x? + y? = 1. The closed curve Y which g repre-
sents goes around the circle |m| times, counterclockwise if m > 0 and clockwise if
m <0. Iim = +1, then Y is a simple closed curve.
L= fo le'ol ae (3-6)
If f is equivalent to g, then
g(b)
7
of ap
g(b)
g(a) g(a) =g(b) g(a)
Simple are Simple closed curve Not simple
FIGurRE 3-4
by (3-5) and the theorem about change of variables in integrals (Section A-9).
Thus / does not depend on the particular representation chosen for Y.
Formula (3-6) is suggested by considering inscribed polygons. Let
a= to <i < +++ < tm_1 < th = D, and let p = max {t; — to, f2—t,...,
tm — tm—1}. The polygon which joins successively g(t;_1) with g(t;) has ele-
mentary length
m
yy le — eur). (+)
j=1
The length of the part of Y represented on {a, ¢] is S(t). Clearly S(a) = 0 and
S(b) = l. By the fundamental theorem of calculus,
for every t € [a,b]. In particular, if ¢ signifies time then S’(¢) is the length of
the velocity vector, that is, the speed of motion.
Since S’(t) > 0 the equation s = S(t) can be solved for ¢t. More precisely,
the function S has an inverse ¢ of class C‘” on [OJ]. Let G = g°¢. ThenG
is the standard representation of Y. From (3-5)
Example 3 (continued). Let m > 0. Then |g’(é)| = m, S(t) = mt. Solving the equation
s = S(t) for t, we obtain the standard representation
such that the restriction of g to each of the closed subintervals [t;_1, t;], 7 =
1,...,p, is a parametric representation of class C"”. In particular, g has at
each ¢; interior to [a, b] right- and left-hand derivatives which need not be
equal. Parameter changes which are piecewise of class C‘” are admitted. A
piecewise smooth curve is an equivalence class of parametric representations
which are piecewise of class C“”.
Example 4. Let m = 2 and g(t) = te; + |t — llex,0 <t< 2. This represents
the polygon from eg to e; to 2e; + eg with a corner at e;. Let d(7) = 72+ 1 and
f(r) = g(7? ++ 1) = (7? + 1)e1+ |r3le2, —1 < +r < 1. Then fl(7) = 73+ 1 and
f?(7) = |r|. The components f!, f? are of class C™, which might lead one to think
that there is no corner. However, ¢ does not define an admissible parameter change,
since ¢’(0) = 0 contrary to (3-4). Since f/(0) = 0, f is not a parametric representa-
tion of class C., This example emphasizes the importance of the restriction ¢’(r) > 0
in (3-4).
3=3 Line Integrals 79
PROBLEMS
1. Which of the following represent simple arcs? Simple closed curves? Illustrate
with a sketch.
(a) g(t) = (acost)e; + (bsint)e2,a > 0,b > 0, J = (0, 2z}.
(b) Same as (a) except J = [—7, 2z].
(c) g(t) = (—cosh t)e;+ (sinh t)e2, J = [—1,1] (see p. 66 for the definition
of cosh and sinh).
2. (a) Let Y be represented by f(x) = rei + f(x)e2, a < x < b, where f is of class
C™ on [a, b]. Show that
i= fSaye ae
(b) Find 7 in case f(x) = |z|?/2,a = —d.
3. Find the standard representation of the helical curve represented on [0, 27] by
g(t) = (cos te; + (sin t)eg + teg. Sketch the trace.
4. Sketch the trace of the curve Y represented on [0, 27] by g(t) = (cos t)e1 + (sin 2t)ee.
Find the tangent vectors to Y at the double point (0, 0).
5. Let g!(é) = cos (1/t) exp (—1/2), g2(t) = sin (1/é) exp (—1/2) if 0 < t < 1, and
g'(0) = 92(0) = 0.
(a) Show that g! and g? are of class C on [0,1]. [Hint: u*® exp (—u) > 0 as
> =o |
(b) Does g = g'e; + g7e2 represent a curve of class C“? Illustrate with a sketch.
6. Let us write f ~ g to mean f is equivalent to g. Prove that:
(a) g ~ g (reflexivity).
(b) If f ~ g, then g ~ f (symmetry).
(c) If gi ~ go and ge ~ gz, then gi ~ gz (transitivity).
7. Let Y be a curve of class C@. Prove that the multiplicity of any point x is finite.
8. Let Yo and Y; be curves represented on [a, 6] by go and gi, respectively. For every
u € [0,1] let Y, be the curve represented by gu(t) = ugi(t) + (1 — u)go(t),
a<t< 6b. Let l(u) be the length of Y,. Prove that / is a convex function on
[0, 1]. When is the convexity strict?
. s . . . . . . ) ~
The line integral exists, since if g is piecewise of class C‘" the integrand
in (3-8a) is bounded and has a finite number of discontinuities. If f is equivalent
to g, then using (3-5)
The formula for change of variables in integrals still holds if we agree as usual
in calculus that ff = — f?. The composite f = g-@ represents a curve,
which is denoted by —7 and is called the curve obtained by reversing the sense of
direction of Y. From the change of variables formula
le = fe (3-10)
Let %,...,Y%p, be piecewise smooth curves such that the final endpoint
of 7; is the initial endpoint of ¥;,, forj = 1,...,p — 1. Let 7 be obtained
by “joining together” the curves ¥;,...,7%,. More precisely, let us divide
(0, 1] into p subintervals [(7 — 1)/p,j/p] of the same length. Each curve 7;
has a representation on an interval [a;,b,;]. By a linear change of parameter
we may assume that a; = (7 — 1)/p, bj = j/p. Let g; be such a represen-
tation of 7; for each 7 = 1,...,p. Then g;(j/p) = gj41(j/p). Let g be the
function such that g(t) = g,(t) fort © [(7 — 1)/p,j/p). Then g is a para-
metric representation which is piecewise of class C'”’, and Y is the curve which
g represents. Let us call7 the swm of these curves and writeY = ¥; +---+ 7p.
Since an ordinary integral over [a, b] is the sum of the integrals over the sub-
intervals [(7 — 1)/p,j/p], we have
[ o= a ec od (3-11)
bstaseat ler i Vp
Let us now consider the case when w is an exact differential form, w = df,
where f is of class C‘” on an open set D containing the trace of 7. By the def-
inition (3-84), y
i)
i df = / dflg(t) - g(t) dt. ab
Let us use the formula ch
o= [ la Mhteal[e.:
“Ti—T2 “tr
every x € D let
{ay= [ w,
where Y is any curve lying in D with initial endpoint x9 and final endpoint x.
Since we are assuming (3) in Theorem 7, it does not matter which curve with
these properties is chosen. Let us show that df = w.
Given x € D, let U be a neighborhood of x contained in D and 6 the
radius of U. Let 0 < u < 6 and for each i= 1,...,n let ¥; be the line
segment from x to x + ue;. (See Fig. 3-7.) Then
f(x
+ wei) — f(x) = _ - oe [e
Let gi(t) = x + te;, Y(t) = w(x + te;). Then g; represents 7; on [0, uJ, and
hence 1 : pfs
7 Lt + wes) —f@)1 = 7u [- = mt W(t) dt.
is a displacement from g(f;—), which is small if » is small. The work done going
along Y from g(t;—1) to g(t;) should be approximately w/[g(¢;_1)]-h,;. This sug-
gests the following.
: p be i
Definition. The work w done in moving a particle along Y is
w= |o.
Uf
PROBLEMS (6, ¢)
1. Evaluate 4f, x dy — y dz in case: Y
(a) Y bounds the triangle shown in Fig. 3-8.
(b) Y is represented by g(t) = (acost)e; +
(bsin thes, O0O<t< 2x, where a,b > 0. 0, 0) (a, 0)
Your answer should be the area of the set
enclosed by Y. This is a very special case of Figure 3-8
Green’s theorem (Section 7-6).
2. Let gift) = te1+ (2t — les, 1 < t < 2, and got) = (t+ 1l)er+ (2? +#+4 leo,
0<t<1. Evaluate fy, and J;, for each of the 1-forms (a), (¢), and (d) in
Problem 3, Section 2-6, where Y; and Yg are the curves represented by gy and go,
respectively.
3-3 Line Integrals 85
3. Let n = 1,w = fda, and Y be the interval [a, b] directed from a to b. Show that
[e = [ fe ae.
[Note: These two first-order partial differential equations are called the Cauchy-
Riemann equations. They are fundamental to the theory of complex analytic
functions.]
Show that for any closed curve Y lying in D,
7. The moment of inertia of a curve Y about a point xo is fy |x — xo|? ds. Find the
moment of inertia about 0 of the line segment in H? joining e; and eg + 2e3.
8. The centroid x of a curve Y is the point such that
B= (f x'ds)/t 6 = My oo oy We
af
Show that:
(a) This integral does not depend on the particular representation g of Y.
(b) If W(x, h) = w(x)-h, then fy W = fw; and if W(x,h) = f(x)|hl, fy V =
Sy f ds. *s
(c) Let W(x,h) = |/hl|, where || || is any norm on #” (Section 1-6). Then f, WV
is called the length of Y with respect to this norm. Show that if Y is the line
segment joining x; and xo, then the length is ||x; — xo].
fo) = t+
lim ¢().
Suppose that grad f(xo) # 0. Since grad f is continuous there exists a neigh-
borhood U of x9 and m > 0 such that |grad f(x)| > m for every x € U. There
exists ¢; such that g(t) © U for every t > t,. By the fundamental theorem of
calculus, if t; < tg then
Proof. By Problem 8, Section 2-5, there exists m > O such that Q(xo, h) <
—m\h|? for every h. Since f is of class C”’ there is a neighborhood U of xo
such that |fi;(xo + sh) — fi;(Xo)| < m/2n? for i, 7 = 1,...,n, whenever
Xy) + h € Uands € (0, 1). Since f;(xo) = 0 there is by the mean value theorem
s; € (0, 1) such that
Now let ¥(é) = |g(t) — xo|”, the square of the distance from xo. Taking
x = g(t), we have provided g(t) € U
Figure 3-9 indicates the behavior of the level sets and gradient trajectories
near a nondegenerate maximum. The situation is similar near a nondegenerate
mininum. The gradient trajectories in that case are followed as t ~ —o.
Near a saddle point (n = 2) the behavior of the trajectories is indicated
in Fig. 3-10.
Example 1. Let f(z, y) = x* — y?. The equations of the gradient trajectories are
=dx = 2 dy
ae et)
Gin tard #
whose solutions are g!(t) = x1 exp (2¢), g?(t) = y1 exp (—2t). The trajectories lie
on the hyperbolas zy = k orthogonal to the level sets x? — y? = c, and on the co-
ordinate axes. Only trajectories starting from points (0, yi) lead to the saddle
point (0, 0).
PROBLEMS
1. Sketch the level sets and gradient trajectories.
(2) (Gy) = 12? — 2y?,
(b) f(z, y) = ry + y.
2. Let f be a strictly concave function on H” which has an absolute maximum at
xo. By Theorem 5, f has no other critical points. Consider any gradient trajectory g.
(a) Let Y(t) = |g(t) — xo|?. Show thatw is nonincreasing.
(b) Show that y(t) — 0, and hence g(t) — xo, ast ~ +.
3. Let G be the representation with arc length as parameter of a gradient trajectory g.
Show that G’(s) is the direction of the gradient at G(s). See pp. 61, 78.
CHAPTER 4
Vector-Valued Functions
of Several Variables
4-1 TRANSFORMATIONS
Let n and r be positive integers. Let g be a function with domain A C EH”
and values in #”. Such a vector-valued function g is called a transformation
from A into EZ”. Let points of A be denoted by t = (t',...,¢). The image
of a set B CA is the set {g(t):t © B}. It is denoted by g(B). The inverse
image of a set A C E” is the set {t C A: g(t) € A}. It is denoted by g~'(A).
These ideas are indicated schematically in Fig. 4-1.
We recall that if g is continuous then the image of any compact set is
compact and the image of any connected set is connected. Moreover, if g is
continuous then the inverse image of any open set is open relative to A. See
Section A-6. The inverse image of a compact set under a continuous transfor-
mation need not be compact, nor the inverse image of a connected set connected.
The image of an open set need not be open.
89
90 Vector-Valued Functions of Several Variables 4-1
Figure 4-1
Example 3. Let n = r = 2. Points of the domain A are denoted by (s, t) and those
of the image g(A) by (z, y). It is helpful to think of two copies of the plane H?. The
first contains A and will be called the st-plane. The second contains g(A) and will be
called the zy-plane.
In our example we let A be the whole st-plane, and
for every (s, t) © E?. If (z, y) € g(A), then x = s?+ #t?, y = 2st andz+y> 0,
x — y= 0. Therefore g(A) is contained in the quadrant Q shown in Fig. 4-2. In
fact, g(A) = Q. This is seen as follows: Let C be a circle with center (0, 0) and radius
a > 0. Points of C are given by s = acos¢,t = asing, where 0 < @ < 27. The
image g(C) is the trace of the curve represented on [0, 27] by
a (s 3)
y, “Yl
Z
YY.
Ui
YY;
A= EF? y = 2st
Figure 4—2
4-1 Transformations 91
This trace is the vertical line segment shown in Fig. 4-2. By letting a take all possible
nonnegative values, one gets a collection of line segments covering Q. If a = 0,
the line segment degenerates to the point (0, 0). This shows that Q = g(A).
SIO Sy a CLS ee
for every t € A and x € g(A). The transformation g ~~1 is called the inverse
of g. The notation g~‘(A) for inverse image of a set is consistent with this
one. If g is univalent and A C g(A), then g~'(A) is the image of A under g7’.
NESE t
_vaety—-Ve—y
a 2 a 2
Such points (s, é) belong to Q (regarded as a subset of the st-plane). Let g be the
restriction of g to Q. Then @ is univalent and g(Q) = g(4) = Q. The value of the
inverse £~! at any (z, y) € Q is
(eet, See)
gz (z,y) = 2 2
PROBLEMS
1. Letn = 1,A = KE’, and g(t) = t-t— 2Qit.
(a) Find the image g(H”’).
(b) For each c find the inverse image of the semi-infinite interval [c, ©).
2. In Example 8 find:
(a) The image of any vertical line s = c.
(b) The inverse image of any line y = mz through the origin.
(c) The image of the circular disk bounded by C in Fig. 4-2.
3. Let g(s,t) = |s — tler + |s+ tle2, A = H?. Find g(H?) and answer questions
(a), (b), and (c) in Problem 2 for this transformation.
4. Let g(s, t) = (t cos 2rs)e, + (tsin 2rs)eg + (1 — thes3,A = E”.
(a) Show that g(H?) is a cone with vertex e3.
(b) What is the image of the square {(s,t):0 << s<1,0<t< 1}?
(c) Find g~'({e3}) and g~*({e1}).
5. Let g(s, t) = 1/(s?-+ st+ t?)e1 + 1/(s?+ st+ t?)?e0, A = {(s, t):0 < s? +0? < 1}.
(a) Show that g(A) is part of the parabola y = 2”, and find it.
(b) Find g~"({(c, c?)}).
De ME e (4-1)
The matrix of L. Let us denote the standard basis vectors for H” by
€;,...,¢€, and those for H” by e;,...,@€n. With a linear transformation L
and these bases is associated a matrix (c}) with n rows and r columns, in the
following way. For each j = 1,...,7r let v; = L(e;), and let c be the ith
component of the vector v;:
n
v= Ds Cie, eh ae (4-2)
i=1
Then (c}) is the matrix of L, and v,,...,v, are the column vectors of this
matrix. Note that the superscript 7 indicates the row, and subscript 7 the
column, of the matrix.
Actually, for any pair of bases for HE” and E” there is a matrix associated
with L. It is shown in linear algebra that by suitable choice of bases the asso-
ciated matrix can be made to have some special form [for instance, the Jordan
canonical form if r = n ({12], p. 207)]. What we have called “the” matrix of
L is the matrix corresponding to the standard bases for #” and EH”.
Since L is linear,
j=l
w'| ci ¢3 c
w’| c? OP
w" cn Cr
By (4-3a) the column vectors v,,...,V, span L(#’). The rank p equals
the largest number of linearly independent column vectors of the matrix. Since
row rank equals column rank ({12], p. 105), p is also the largest number of
linearly independent row covectors of the matrix.
Composition. Let L be linear from EH” into #”, and M linear from E”
into H?. The composite M - L is linear. Its matrix is the product of the matrices
of M and L (Problem 4).
Lp LasikLe — LL
By (4-1) L is singular if and only if y > 0. But vy> 0 means that L(t) = 0
for some t ~ 0. Therefore, from (4-4a) the system of homogeneous linear
equations
OS a hee (4-5)
gat
has a nontrivial solution if and only if det L = 0.
4-2 Linear and Affine Transformations 95
In later chapters we shall see that the absolute value of the determinant
is the ratio of n-dimensional volumes, and the sign of the determinant deter-
mines an orientation.
( )
3 i!
The row covectors are w! = 2e! + e?, w2 = 3e! — e2. The column vectors are
Vi = 2e; + 3e2, v2 = e1 — eg. Since detL = —5 ¥ 0, L is nonsingular. (See
Fig. 4-3.)
N
y
cL \
&
\
p(x, y)
s fr
Vi
(x, y) =L(s, ¢) 4
x=2s+t vA
y=3s—t We
x Ii
Ne Figure 4-3
MS
Let M(az, y) = (2x — 5y)E; — rEg, where Ej, Ee denote the standard basis
vectors for the plane H? in which M has its values. The matrix of M is
( 2 Bi
—] 0
(ein).
As expected, det M>L = 25 = det M det L.
Example 2. Let n = r = 3, and let L be the linear transformation which takes the
standard basis vectors €1, €2, €3 respectively into
aL (thes (a)jeot
U7 (ee ws (a, 5 6. ie
EE WR oi? A on oe (4-4b)
=
This follows from (4-3b) since the jth component of w’* is ch.
WR ONG = Gp OS Or;
and hence
gis) ty = e(S)o ett) v5 0,
for each j= 1,...n. The vector g(s + t) — g(s) — g(t) has component 0
with respect to each basis vector v;. Hence
Proof. Let g be an isometry of EH”. Let f(t) = g(t) — Xo for every t € E”,
where xp = g(0). Then
From this equation, L is orthogonal if and only if s-t = (L‘~ L)(s) -t for
every s, t © Hk”. But this is equivalent to the statement that s = (L‘ > L)(s)
for every s, in other words, that I = L'. L. Hence L is orthogonal if and only if
Bs
If L is orthogonal, then
1 = det I det L‘ det L.
But det L' = detL ({12], p. 146). Hence 1 = (det L)?, and detL = +1.
If L is orthogonal and det L = 1, then L is called a rotation of E” about 0.
Example 4. Let S be the orthogonal transformation which takes each t = (é!,..., t”)
into S(t) = (t!,...,¢"—!, —t”). S is a reflection of E” about the hyperplane t” = 0.
Its matrix is
1 0
0 =I
Two such reflections take each t into itself; that is, So S = I. HenceS = S~! = S‘.
If M is any orthogonal transformation with det M = —1, then L = S-M is a
rotation of HE” about 0 and
cos @ —sin )
sin 0 cos 8
PROBLEMS
1. Letr = 3,n = 2, andL be the linear transformation such that L(e1) = e1 — 2e2,
L(€2) e1, L(e3) = 5e; + ee. Find the matrix of L, the rank, and the kernel.
2. Show that the kernel of a linear transformation is a vector subspace of its domain.
3. Let r = n, and let Li(t) = c’t‘ for every t © E”, where c!,...,c” are scalars.
(a) What is the matrix?
(b) Find L—! if it exists.
n, bee =P = A,
@ il
(a) Describe geometrically the linear transformation L with matrix ¢ )c
0
(b) Find S- L and L~S§, where S is the same as in Example 4. Show that both
are rotations of H? about (0, 0).
6. (a) Show that the vectorsvi = (i/V5)(e1 + 2e3), v2 = (1/v/
10)(Zen,
/5e2 + e3), v3 = (1/V10)(2e1 + V5e2 — es) form an orthonormal basis
for 1°.
(b) Let L be the orthogonal transformation whose matrix has vj, v2, v3 as column
vectors. Find L! and verify that L’o L = I. Is L a rotation?
7. Let Land M be rotations of H” about 0. Show that L~! and M © L are also rotations.
8. (a) Show that the composite of two affine transformations is also affine.
(b) Which affine transformations are univalent?
lim [kl7
xo [g(to +k) — g(to) — L(k)] = 0. (4-10a)
If we set t = to + k, then L(t — to) is the desired local approximation
to g(t) — g(to). If nm = 1, the definition agrees with the one in Section 2-2,
p. 388. Moreover, for n > 1 the expression in (4-10a) tends to 0 if and only if
each of its components tends to 0 as k — 0 (see Proposition A-4b). Thus
(4-10a) is equivalent to
: 1 t a
ive ra lg (tore KE agi(tor v
atl (k) a 20, (4-10b)
HOR.= Ih os a Hes
4-3 Differentiable Transformations 101
Proof. Since (4-10b) states that each component g’' is differentiable at to,
the first assertion follows at once. If g is differentiable at to, then L'(k) =
dg'(to) -k for every k € E”. Hence the row covectors are Cay. LER,
and the elements of the matrix are the partial derivatives gj(to). §
Definitions. 1. The column vectors of the matrix are -called the partial
derivatives of the transformation g at to, and are denoted by g;(to) or
(dg/dt’) (to). Thus
n
d i
Zi(to) = 1 (to) = S gi(to)e:, (4-11la)
=
for each 7 = 1,...,7. The jth partial derivative g;(to) can be regarded
as the derivative of g with respect to the jth variable while all of the other
variables are held fixed, in the sense described in Section 3-1.
The formula, dual to (4-lla), for the row covectors is
NGS
iv ae (t).
if)
ee Antie)
We shall see that the Jacobian often plays the same role in the calculus
of functions of several variables as the derivative does in the case r = n = 1.
In particular, this is so in the theorems about inverses (Section 4-5) and trans-
forming multiple integrals (Section 5-8). If the Jacobian is 0 at a point to,
102 Vector-Valued Functions of Several Variables 4-3
then Dg(to) is singular. This suggests some kind of irregularity in the behavior
of g near to. In order to exclude such irregularities we shall repeatedly have to
make the assumption that the Jacobian is not 0.
Example 1. If n = 1, then Dg(t) has the single row covector dg(t). We may identify
Dg with dg and Dg(t) with dg(t). If dg(t) # 0, the rank of Dg(t) is 1; otherwise it is 0.
It is 0 only at (0,0). If (s, t) ¥ (0, 0), the rank is 2. At (0, 0) the rank is 0.
Proof. Let L = Dg(to) and set g(t) = g(t) — L(t). Since DL(to) = L,
Dg(to) = 0 (the zero linear transformation). By (4—10a), in which g is replaced
by g, there is a neighborhood Qp of to such that
Proposition 13b. Let g be of class C‘? and to € A. Then given € > 0 there
exists a neighborhood Q of to such that Q C A and
Proof. Let g be as before. The row covectors dg‘(to) are all 0. Since the
partial derivatives of g are continuous, given € > 0 there is a neighborhood 2
104 Vector-Valued Functions of Several Variables 4-3
of to such that |dj‘(u)| < e/n for every u € Qand7 = 1,...,n. By Corollary 1,
p. 42, for every s, t € Q,
Proof. We have
PROBLEMS
3. Show that:
(a) ||L|| > 0 unless L has rank 0.
(b) |[cL|| = |e] ||LI).
(c) IL -+ L'| < [LI] + ILI.
(d) ||MeL|| < ||[M|| ||L}.
4. Another norm for linear transformations, which we denote by ||| |||, is defined
as follows:
LW [Wola = [wet
where w!,..., w” are the row covectors. Show that properties (a)—(d) of Prob-
lem 3 hold for this norm. Show that ||L]| < |||L]|].
5. Let r = n and g be a differentiable transformation. Then g is called conformal if
there exists a real-valued function yu such that u(t) > 0 and u(t) Dg(t) is a rotation
of H” for every tE A.
(a) Using Proposition 11 show that g is conformal if and only if, for every t € A,
Jg(t) > 0 and the partial derivatives of g satisfy:
4-4 COMPOSITION
We shall now derive a rule for differentiating the composite of two dif-
ferentiable transformations. As corollaries of the basic formula (4-19) we then
obtain a formula for Jacobians and the chain rule for partial derivatives.
Let g be a transformation from an open set A C #” into an open set
D Cc E”, and let f be a transformation from D into E?.
Proof. Let L = Dg(to), M = Df(xo), and f =f —M. Using the fact that
M is linear, we have
dF
[oh at
of ae
ee
ag’ = sie Te
af ag”
eee Ghtoeseles
at? =—sax at? ax” ot?
The chain rule is just a particular case of (4-4b) which describes how the
components of a covector change under the dual L* of a linear transformation.
In the present instance L = Dg(to), and dF (to) = L*[df(xo)].
If r = 1, then we may identify DF(to) with its column vector F’(to),
and Dg(to) with g’(to). From (4-19) we then get:
Fit) = oe Bait),
for every t€ Aandj = 1,...,r. Since all of the functions f;, g, g} are con-
tinuous, each partial derivative F; is continuous. Hence F is of class C"”.
If q > 2, then repeated application of the chain rule shows that F is of class
C® and gives formulas for calculating its partial derivatives of orders 1,
Deters Us
In case p > 1 and f, g are of class C™, the preceding discussion shows
that the components F!,...,F? of F are of class C, since F’ = f! og for
each | = 1,...,p. Therefore F is of class C. We have proved:
F(t) = Do file@lg"@,
=)
which can also be written
F’(t) = dflg(t)]- 2’.
If in addition n = 1, it becomes F’(t) = f’[g(t)]g’(t), which is the composite function
rule of elementary calculus.
Example 2. Let F(z) = f[z, g(x)], where fand g are of class C). In this case g'(x) =
z,g?(z) = g(x), and the formula in Example 1 becomes
Another application of the chain rule together with the formula for the derivative
of a product gives
The expression on the left-hand side is called the Laplacian of f. The partial dif-
ferential equation f11 + fez = 0 is called Laplace’s equation. Its solutions are called
harmonic functions. The formula above expresses the Laplacian in polar coordinates.
In this example
Fy, = cos O(f11 cos 6 + fig sin 8) + sin 0(fo1 cos 6 + fee sin 8),
F22 = —r sin 6[f11(—r sin 8) + fi2(r cos 6)] + r cos 6[f2i(—r sin 6) +- f22(r cos 8)]
—fir cos 6 — for sin 0.
Combining terms and using the fact that fo1 = fie, we get
1 1
Pi + 75 Fae = fir + fo2 — — (fr cos
6+ fesin 8),
I 1
eta
et eee eGh a
oApy: — olic
r r
PROBLEMS
Assume that all functions which occur in these problems are of class C).
1. Let F(x, y) = f(x, xy). Find the mixed partial derivative Fj2.
2. Let F(x, y) = fix, y, g(a, y)]. Express the partial derivatives of F of orders
1 and 2 in terms of those of f and g.
3. Letn = r = p = 2. Find the Jacobian 0(F!, F7)/0(s, t) at the indicated point
by means of Corollary 4.
(a) f(@,y) = zyEi + x7yEo, g(s, t) = (s+ ter + (s? — #?)ee, (so, fo) = (2, 1).
(b) f(z, y) = o@ + y)E1 + $(@ — y)Ez, g(s, t) = (exp t)e1 + exp (—s)ee,
(so, to) = (log 2)e1.
4. (a) Show that the chain rule is still true if p > 1, namely,
A ees
(b) Use it to find (OF /ds) (so, to) and (OF /dt) (so, to) in Problem 3(a).
5. Let G be the standard representation of a curve Y of class C?.
(a) Show that G’(s)-G’’(s) = 0. [Hint: Use the fact that |G’(s)|? = 1.]
(b) Let g be any parametric representation of class C®) of Y and define S(t) as
in Section 3-2. Show that S’’(t) = g’(t)- g’’(t)/S’(t).
(c) If G(s) # 0, then G’’(s) is called the principal normal vector and |G’’(s)|
the absolute curvature at G(s). Show that
|G"1S)]|
_ fe’OPle"OP — (2-2)
le"
6. Let f(z, y) = o(a — cy) + Ww + cy), where cis a scalar. Show that fez = cfi1.
7. Let n = 4 and p = [(x!)?2+ (a7)?+ (23)7]1/2. Let f(x) = [¢(o — cx*) +
¥(p + cx*)]/p. Show that faa = c?(fi1 + fo2 + f33). [Note: The partial differ-
ential equation fran = c7 (fir +--+ + fn—1,n—1) is called the wave equation in n
variables. Problem 6 gives D’Alembert’s solution for » = 2. Solutions of the
type in Problem 7 are called spherical waves.]
110 Vector-Valued Functions of Several Variables 4-5
8. Suppose that f satisfies the partial differential equation fe = fi1 + bf, where b
isa scalar. Let F(x, y) = exp (—by)f(z, y). Show that F2 = Fi1.
9. Let F = foL, where L is a linear transformation with matrix (c}). Show that
10. Using Problem 9 show that ifr = n and Lis orthogonal, then Fi; + +++ + Fan =
fir ++++-+fnn. In other words, the Laplacian is invariant under orthogonal
transformations of HE”. [Hint: L' = L—!]
11. Let n = r. A linear transformation L is a Lorentz transformation of #” if L~! =
S-L'~S, where S is as in Example 4, Section 4-2.
(a) Show that if M and L are Lorentz, then ML and L~! are also Lorentz.
[Hint: S2 = I]
(b) Show that L is Lorentz if and only if S = L’e SL.
(c) Show that L is Lorentz if and only if
n—1
12. Show that if L is Lorentz, then Fi, + --->—+ Pa—inn—1 — Fan = fir 27°
fn—1,n—-1 — fnn. In other words, the wave operator is invariant under Lorentz
transformations (c = 1).
condition than differentiability. Second, and more important, is the fact that
the Jacobian’s being of one sign implies only that g locally has an inverse. Any
point to € A has a neighborhood © such that the restriction g|Q of g to @ has
an inverse f. An example given below shows that g itself need not have an
inverse.
It is plausible that a local inverse exists. For t near to, g(t) is approxi-
mated by G(t) = L(t — to) + xo, where L = Dg(to), xo = g(to). Since
det L = Jg(to) ¥ 0, the affine transformation G has an inverse. This sug-
gests, but of course does not prove, the following.
(f has domain U)
Xo = g(t)
Figure 4—4
Step 2. Let U = g(Q). To show that U is open we shall show that any
x, € U has a neighborhood U, such that U; C U. From Step 1, x1 = g(t:)
for exactly one t; € 2. Let Q; be a neighborhood of t; whose closure cl {2; is
contained in Q, and let IT, denote the boundary of Q,. Since the restriction of
g to @ is univalent and t; €T;,x1 ¢ g(T'1). (See Fig. 4-5.) Since g is of class
C™, and therefore is continuous, the image g(I',) of the compact set I; 1s
compact. Let a, be one-half the distance from x; to g(['1), and let Uy be the
neighborhood of x, of radius ¢}.
112 Vector-Valued Functions of Several Variables
Figure 4-5
Hence the minimum value is less than 07, and must be attained at some interior
point tg € Q;. By Proposition 10, dy(t,) = 0. Since
0 = a c’ dg'(ts).
oH
4-5 The Inverse Function Theorem 113
tion 4-4, gi » f is of classC‘”. Hence fj is of class C‘” and f of class Cc. Repeat-
ing this argument, we find that if g is of class C@ then f is also of class C.
This completes the proof of the inverse function theorem. J
Corollary. Let g satisfy the hypotheses of the inverse function theorem. Then
the image of any open subset of A 1s open.
Proof. Let B C A be open, and let x9 € g(B). Then xo = g(to) for some
t) € B. (There may be several possible choices for to.) By the inverse function
theorem, applied to g|B, there exists a neighborhood Q of to such that g(Q)
is an open subset of g(B). Therefore some neighborhood of Xo is contained in
g(B). This shows that g(B) is an open set. §f
The Jacobian is sinh? s cos? t-+ cosh? s sin? t, which simplifies because cosh? s =
1 + sinh? s and cos? t + sin? t = 1 to
If we take for A the right half-plane s > 0, then sinhs > 0 and Jg(s, t) > 0. The
hypotheses of the inverse function theorem are satisfied, hence local inverses exist.
Since cos and sin are periodic, g(s,t-+ 27) = g(s,t). The transformation g is not
univalent, and consequently has no inverse. By the corollary, g(A) is an open set
which, as we shall soon see, is ZH? with a line segment removed.
Let A = {(s,t):s > 0,0 < t < 2m}, and let g be the restriction of g to A.
Let us show that g has an inverse. It is not easy to solve the equations
x = g'(s,t) = cosh
scost, y = g*(s,t) = sinhssint
explicitly for s and ¢. However, let us consider what happens on vertical straight
lines s = c. For each c > 0, g(c, t) represents on (0, 27] an ellipse with major semi-
The Inverse Function Theorem 115
Figure 4-6
axis of length coshe > 1 and minor semiaxis of length sinh c. Each of these ellipses
has +e, as foci, and g(c,0) = g(c, 27) = (coshc)ey. If s, ¥ se, then the points
&(si, t1) and &(sg, tz) lie on different ellipses. Moreover, &(s, ti) = &(s, t2) implies
i) = pe Hence &(s1, t1) = (se, te) implies that (si, t1) = (se, tg), and % is univalent.
The image of A is HE? with the semi-infinite line on the z-axis from —e; to © deleted.
The part of the boundary of A on the s-axis is transformed onto the part of the line
from e; to ©, and the vertical part of the boundary onto the part from —e, to e}.
Hence g(cl A) = E?. By periodicity each value which g takes on A is also taken
somewhere on A or its lower boundary. Hence g(A) is H? with the line segment join-
ing —e; and e; removed. (See Fig. 4-6.)
Regular transformations
PROBLEMS
1. Determine whether g satisfies the hypotheses of the inverse function theorem.
Find g(A). If g is univalent, find g~ explicitly.
«
(a) g(t) = t+ xo (a translation), A = E£”.
(b) g(¢,t) = (s+ er + (s — tes,A = BP.
(ec) g(s, t) (s? — s — 2)e, + 3tez, A = E?.
(d) g(s,t) = (s? — t)e1 + stes, A = EB? — {(0, 0)}.
(e) g(s, #) = (log st)e1 + 1/(s?+ t?)e2,A = {(s8,t):0 <t < 8}.
Dalet giye— ¢-2i*, A = (0,0), Find 7.
3. Let r = n = 38, and g(s,t, u) = (uwcos ste: + (usin st)eo + (s+ uje3. Then
g(e1 + €3) = e1 + 2e3. Let f be a local inverse of g such that f(e1 + 2e3) =
€, + 3. Find Df(e; + 2e3) using (4-23).
4. In the example on p. 114, what are the images of horizontal straight lines? Show
that g is a conformal transformation (Problem 5, Section 4-3), and hence that the
images of vertical and horizontal straight lines intersect at right angles. Illustrate
with a sketch.
5. Let g(s, t) = (exp scost)e; + (exp ssint)eg and A = E?.
(a) Show that g satisfies the conditions of the inverse function theorem, but is
not univalent.
(b) Let A = {(s, t):0 < t < 27}. Show that the restriction of g to A is univalent,
and find its inverse.
(c) Find g(£?).
(d) Show that g is conformal.
6. Let g1(s, t) = s+ f(t), 97(s, t) = t+ f(s), where
f is of class C and |f’(s)| < ¢ < 1
for every s € E!.
(a) Show that g(H?) = E?. [Hint: Given (z, y) define ¥(s, t) as in Step 2 of the
proof of the inverse function theorem. Prove that y has a minimum at some
point (s*, é*) and that g(s*, *) = (, y).]
(b) Show that g is univalent.
7. Let A be an open convex set and g a differentiable transformation such that
Dija1 9)(t)h'h’? > 0 for every t€ A andh ¥ 0. Show that g is univalent. [Hint:
Suppose that g(t1) = g(t2). Let h = te — t1, f(t) = [g(t) — g(ti)]-h, and
apply the mean value theorem to f.| This result is due to H. Nikaido.
The implicit function theorem guarantees the local validity of the principle
near any point Xo such that ®(xo) = 0 and D@(x,) has maximum rank m.
Before stating and proving this theorem, let us indicate what it asserts
when n = 3, m= 1. Let xo = (20, Yo, 20) be a point such that (x9) = 0
and d®(xo) 4 0. Since the components of d(x») are the partial derivatives
1 (Xo), B2(Xo), &3(Xo), at least one of the components is not 0. For instance,
suppose that ®3(x9) # 0. Then in some neighborhood U of xo, @3(x) ¥ 0
and the equation 6(x) = 0 “determines z as a function of x and y.” More
precisely, there is a function ¢ of class C such that for (z, y,z) € U,
P(x, y, z) = Oif and only if z = ¢(2, y). The domain R of ¢ is an open subset
of the xy—plane.
AC Aen se)
= 0) at Xo. (4-25)
Oa wae)
for the vectors obtained by taking only the first 7 components of x and Xp.
0 ee xlens () ee ee
ob} egal. Cag) Gh Ge
a ee ie ee oe
By properties of determinants, the Jacobian Jf(x) equals the determinant
of the m X m block in the lower right-hand corner. Since the latter is just the
Jacobian in (4-25), Jf(x) # 0. By the inverse function theorem, there is a
neighborhood U of xo such that f(U) is an open set and the restriction f|U has
an inverse g of class C”’. Note here that the roles of the symbols f and g in
Section 4-5 have been reversed.
Git r+] ener ot1)
{x : (x) =0}
Figure 4-7
For x € U, (x) = 0 if and only if € R and f(x) = (%, 0). Since f|U and
g are inverses, f(x) = (%, 0) if and only if x = g(%, 0). §
do + $362 = 0, 2 = — a
In the formulas (**) the partial derivatives of & are evaluated at (2, y, (x, y)).
To calculate the second-order partial derivatives $11, $12, ¢22, the chain
rule is applied again. For instance, taking the partial derivative with respect
to the second variable in the first of equations (**), we get
Substituting the expressions for ¢1, ¢2 obtained above and solving for ¢j9,
we get
a (63)"@19 — Bob313 — Bi b3Hzo + D1 P2%33
oie == (63)3 ;
b, + Sod’ + 3’ = 0,
Sa a Vod’ ie Wy’ a 0,
gi 30, — P1V3
@oVs — b3Vo
120 Vector-Valued Functions of Several Variables 4-6
The second derivatives ¢’’, ¥’’ can be found by another application of the chain
rule.
For convenience we assumed in (4-25) that the last m columns of the
m X n matrix of partial derivatives (€!(xo)) weréeslinearly independent. More
generally, one need merely suppose that some set of m columns is linearly inde-
pendent, in other words, that the linear transformation D@®(x9) has maximum
rank m. Let us suppose that columns J), j2, . - - , jm form a linearly independent
set, where we may suppose that j1 < je <--+- < jm. Let 2,...,1, be those
integers between 1 and n not included among j1,...,Jm, with 13 < +++ < 4,.
For brevity let us write \ for the r-tuple of integers (7,,...,7,) and x for the
ruupler(ae t,o").
The implicit function theorem now states, roughly speaking, that locally
the equation ®(x) = 0 determines x”1,..., 2’ as functions of x*. More pre-
cisely, U, R, and ¢',..., 6” exist as before such that
O(a eee
~ 0 ateveryxec U,
d(x7!,..., x)
and
AB", &
ee) ~ (0 at xo.
OG, av)
Example 3. Let
@(z, y,z) = 2? + y2 + 22 — Qrz — 4
Then
Pi (x,y, 2) = 2a — 2Qz,
Po(z, y, 2) = 2y,
@3(x, y, z) = 22 — Qa.
PROBLEMS
¢'(z) =
__ bile, $(2))
Holz, $(x)]
In this proof of the theorem, the rectangle J replaces the circular neighborhood
U, but this is unimportant. Can you extend this proof to the case m = 1,n > 2?
122 Vector-Valued Functions of Several Variables 4-7
4-7 MANIFOLDS
The word manifold is used in mathematics to describe a topological space
which locally is “like” euclidean E’, for some r called the dimension of the
manifold. For instance, a circle is locally like E*. Such geometric figures in
E® as ellipsoids, cylinders, and tori are locally like HZ”. A cone is not locally
like H? near its vertex.
We shall approach the idea of manifold from a rather concrete viewpoint.
For us, a manifold M is a subset of some euclidean #” which can locally be
described by an equation ®(x) = 0, where D@(x) must have maximum rank.
Another definition of manifold can be given abstractly in terms of coordinate
systems. It has the advantage that one need not presuppose that M is a subset
of some euclidean space. This will be discussed in Chapter 7.
MO ee xi Se UE (x)t = 02
f(MaU,)
Figure 4-8
4-7 Manifolds 123
ME ye y= 1 (a, y) y=}:
To show that © is a 1-manifold we must show that given (xo, yo) € M, there exist
U and ® such that d®(z,y) #0 in U and MN U = {(2,y) ECU: B(az, y) = O}.
If (vo, yo) € H, then we let B(x, y) = x? — y? — 1 as before, and let U be any
neighborhood of (xo, yo) which does not meet the asymptote y = x. However, if
(zo, yo) is on the asymptote, we take ®(z, y) = y — x and U any neighborhood
of (xo, yo) which does not meet H. In this example our choice of ® depends on (zo, yo).
Example 3. Let M = {(z, y):2? = y?}. This set consists of the two lines y = -£2,
and is not a l-manifold. Roughly speaking, / is not like H! near the crossing point
(0, 0). More precisely, if M were a 1-manifold, then by the implicit function theorem
the following would be true: Each (zo, yo) € M has a neighborhood U, such that
either MMU, = {(x, o()):2€ Ri} or MN Ui = {(¥y), y) : y © Re}, where
Ri, Rez are open. In the present example, (0, 0) has no such neighborhood.
Let ®(x) = F(x) — c. Then d®(x) = dF (x). If B. is not empty and contains no
critical points of F', then B, is the (n — 1)-manifold determined by ®. If B, contains
critical points, then the (n — 1)-manifold determined by ® is
unless M, happens to be empty. We saw in Section 2-5 that B. need not resemble
E”—! near a critical point contained in Be.
Example 6. Let F(z, y) = exp (ry). The partial derivatives are F'1(z, y) = y exp (zy)
and Fo(z, y) = xexp (zy). The only critical point is (0,0), and F(0,0) = 1. If
c < 0, then B, is empty. If c > 0,c ¥ 1, then B, is a 1-manifold. In fact, B- is the
hyperbola zy = loge. The level set By is the union of the xz- and y-axes, and is not
a l-manifold. M,; = By, — {(0, 0)} is a 1-manifold.
is called an r-dimensional plane. Let L be the linear transformation with row covec-
{Orssa set, ava pincers has rank m, L(#")3=h™. Hence 'P =i{x: L(iz)e=nc) ms
not empty. Moreover, DL(x) = L has rank m for every x. Thus, any r-dimensional
plane P is an r-manifold.
If we set y(t) = y; for 0 < |é < 6, and ¥(0) = Xo, then (*) states that
y’/(0) = h. (See Fig. 4-9.)
Let 7x0) denote the set of all tangent vectors at xo. It is called the tan-
gent space to M at Xo. If r is the dimension of M, then it is plausible that the
tangent space is a vector space of dimension r. Let us show that this is true.
Let U and @® be the same as in the definition of manifold.
Theorem 10. The tangent space T(xo) is the kernel of the linear transforma-
tion D®(xXo).
Proof of Theorem 10. Let L = D@(x9). We must show that his a tangent
vector if and only if L(h) = 0.
Let h € T(x). Let yw be as in the definition of tangent vector. Then
@[y(t)] = 0 for every t € (—6, 6). Calculating the derivative of ®> y by
Corollary 2, Section 4-4, we have
0 = LIy’(0)] = L(h).
Conversely, let L(h) = 0. For simplicity let us assume that (4-25) holds,
that is, that the last m columns of the matrix of L are linearly independent.
Betti — (XxX)... XX", ©). &”), and’ U, be a neighborhood of x9 such
that the restriction of f to U, is regular, as in the proof of the implicit function
theorem. There exists 5 > 0 such that %) + th € R, for every t € (—6, 5).
Let g = (f|U,)~! and Z
W(t) = g(Ko + th, 0).
Then y(t) € M and wis of class C‘? (Fig. 4-10). We must show that y/(0) = h.
Let A = Df(xo). By formula (4-23), A7! is the differential of g at the
point (9, 0) = f(x). Therefore
Since h is in the kernel of D(x), A’t"(h) = 0. Thus A(h) = (h, 0), and
tA gh, 0) — v0).
Normal vectors to a manifold. A vector n is called normal to M at Xo if
n-h = 0 for every h € 7T(X0). The normal vectors form a vector space of
126 Vector-Valued Functions of Several Variables 4-7
for every h € T(xo) and! =1,...,m. Hence grad ®'(xo)j2 4 grad © (Xp)
are normal vectors to M at Xo. Since D®(xo) has rank m, these vectors are linearly
independent, and they form a basis for the space of normal vectors.
In particular, if M is an (n — 1)-manifold, then m = 1; grad (xo) is
a normal vector, and all others are scalar multiples of it.
Xg+n
Xp th
Xo
Figure 4-11
Tangent r-plane
(See Fig. 4-11.) The terms tangent line, tangent plane, and tangent hyperplane
are used when r = 1, 2, and n — 1, respectively. The tangent 7-plane is
Let us find the spaces of tangent and normal vectors to M at (2, V3, 1).
the partial derivatives being evaluated at (xo, yo, zo). The tangent space T(x, yo, 20)
consists of all scalar multiples of d.
Mn Cf — {x € U, : (x) = 0}.
PROBLEMS
In Problems 1-6 and 11, one can show that the set in question is a manifold by
verifying (4-26) for suitable ®.
1. Let F(x, y) = exp (x? + 2y?-+ 2). Find the level sets and determine which are
1-manifolds.
2. (a) Show that if c ~ 0, then the hyperboloid x? + y? — 42? = cis a 2-manifold.
Is the cone x? + y? = 42? a 2-manifold?
(b) Find the tangent plane at 2e1 — e2-+ eg to the hyperboloid 2? + y? — 42° = 1.
3. Let M = {(a,y,2):2y = 0, 2?-+y?+ 2? = 1,2 ¥ +1}. Show that M is a
1-manifold. Sketch M.
128 Vector-Valued Functions of Several Variables 4-7
4, Letfbe of class on an open set A C E?. Let M = {(2, y, f(a, y)): (t, y) € A}.
(a) Show that M is a 2-manifold.
(b) Show that (fi(2, y), fe(x, y), — 1) is a normal vector to M at (2, y, f(a, y)).
(c) Show that the equation for a tangent plane agrees with the one in Section 2-2.
. Let A C E! be open, and f, g be real-valued functions of class C on A.
(a) Show that M = {(z, f(x), g(z)):a © A} is a 1-manifold.
(b) Show that (1, f’(x), g’(x)) is a tangent vector to M at (2, f(x), g(2)).
. Let M = {(z,y): 2% = y*,x > 0,y > 0, (z,y) ¥ (¢, e)}, where e is the base
for natural logarithms. Show that M is a 1-manifold. Make a sketch.
let iM = {@, y,2):2y = az = 0}. Is M a 1-manifolds
. In Example 10 show that d is a tangent vector, using Cramer’s rule.
. Let Mf and N be r-manifolds such that (clM) MN and Mf clN are empty,
MCE",N CE”. Prove that M U N is an r-manifold.
10. Let J/ be an r-manifold and A an open set such that 1M A is not empty. Prove
that JJM A is an r-manifold.
Wile Let M = {x:)-7j-1 cijv'x’ = 1}, where the matrix (c;;) has rank n and is
symmetric.
(a) Show that M is an (n — 1)-manifold.
(b) Show that the equation of the tangent hyperplane at xo € J is
n
Gg
> Gage any, = Me
1,j=1
where (c;) is the matrix of L. Let O(n) be the set of orthogonal transformations
On ae:
4-8 The Multiplier Rule 129
(a) Show that O(n) is a manifold of dimension $4n(n — 1) and class CO. [Hint:
Proposition 11.]
(b) Let I be the identity transformation of E”. Show that L is a tangent vector
to O(n) at Lif and only if L‘ = —L. (Such a linear transformation is called
skew symmetric.)
(c) Let SO(n) be the set of all rotations of E” about 0. Show that SO(n) is a
relatively open subset of O(n), and hence SO(n) is also a manifold of dimen-
sion 3n(n — 1). Show that SO(n) is the largest connected subset of O(n)
which contains I. [Hint: Using induction on n, show that any L € SO(n)
can be joined with I by a path in SO(n).]
and D@®(x) has maximum rank m for every x € U. We may assume that
UG Wo:
Roughly speaking, the multiplier rule states that by introducing suitable
multipliers o1,...,@m the constrained extremum problem can be treated as
one of ordinary (unconstrained) extremum. More precisely:
Example 1. Let f(z, y,z) = x — y+ 2z. Let us find the maximum and minimum
values of f on the ellipsoid
M = {(a,y,2):22
+y2+ 2? = 2.
Let ®(2, y,z) = 2 — (2? + y? + 227) and F = f+ 0%. The multiplier o is yet to
be determined. From the multiplier rule we get three equations
Py eS 1 — 2cx0
= 0,
Fo = —1
— 2cyo = 0;
F37=) +2) — 4oz9,
— 0:
no = 5 yo = — 52> St ee eS ep)
Therefore xo = +(/2/2)(e: — e2-+ es), depending on which of the two possible values
for ¢ is used. Since f is continuous and M is a compact set, f has a maximum and a
minimum value on Jf. One of the two critical points obtained by the multiplier rule
must give the maximum and the other the minimum. Since f[(+/2/2)(e1 — e2+ e3)] =
2/2 and f[—(V/2/2)(e1 — e2 + e3)] = —2/2, these numbers are the maximum
and minimum values, respectively.
Example 2. Let f(x) = > 07-1 b:(2‘)?, where b; # 0 for eachz = 1,...,n. Let M
be the hyperplane {x:a-x = 1}, and F(x) = f(x) + o(1 — a-x). If xq isa critical
point of F, then
0 = dF (xo) = df(xo) — oa.
Thus df(xo) = a, or
TCG) iss = tr Se IR te
From this and the equation a: xo = 1,
n 2
a 0a; 1 1 SS a;
provided the sum is not zero. To determine whether xo gives an extremum, we use
the formula
f(xo +h) = f(xo) + df(xo)-h-+ f(b),
4-8 The Muitiplier Rule 131
Ay = f(v1) = Li) - V1 = OV
- V1,
and vs € My» be a point such that f(v2) = 2. We have added another con-
132 Vector-Valued Functions of Several Variables 4-8
Ne AAR Tee NG
and {v,,..., Vn} is an orthonormal basis for #”. Let us show by induction
on k that ; is a characteristic value and v; a characteristic vector. This is
true if k = 1. Let k > 2. Applying the multiplier rule with
k—1
Oe ZL (Vp) V; Gs iy elena
a, = L(vx) + vi = f(a),
If k = n, the multiplier rule does not apply. However, we used the mul-
tiplier rule only to show that L(v;) is a linear combination of vy,..., vz. If
k = n, this is clear from the fact that {v,,...,v,} is a basis for EH”.
Theorem 11. The characteristic values of the symmetric matrix (c') are
A1,-++)An- For eachi = 1,...,n, Vv; is a characteristic vector correspond-
ing to d,. If &',..., & denote the components of x with respect to the ortho-
normal basis {v1,..., Vn}, then
Proof. The first two statements have already been proved. To prove the
third, we have
n
aS a DeLey;; faa
cl
im eeeLy.) SS Niki,
al 1=1
eX (Xe be
jn SO ine aa
C=)
PROBLEMS
1. Set M = {(z,y,2):4+y+2 = 1} and f(z,y,z) = 322+ 3y?+ 22. Show
that there is a constrained absolute minimum, and find the minimum value of
fon M.
2. Use the multiplier rule to find the distance to the parabola y? = x from the point
cey,c ~ 0. [Hint: Let f(x,y) = (« —c)?+ y?, which is the square of the
distance. |
3. Find the distance from the point ey — 2e2 — e3 to the line {(a, y,z):% = y = 2}.
4. (a) Use the multiplier rule to show that Jal] = max {a-x:|x| = 1}
(b) Deduce the same result from Cauchy’s inequality.
134 Vector-Valued Functions of Several Variables 4-8
[Note: For these norms the inequality |a- x] < |lal] ||x|| [formula (1-18), p. 32] is
called Hélder’s inequality. A related inequality for integrals is given in Section 5-12].
. Let f@, y, z) = 2xz + y?.
(a) Find the characteristic values \1, Ag, A3.
(b) Sketch the surface with equation 2xz-++ y? = 1. With equation 2xz-+ y? = 0.
10. Let ||L|| be defined as in Section 4-3. Show that ||L||? is the largest characteristic
value of L'e L. [Hint: Use (4-8) with y = L(t)].
Hil. (Second derivative test for constrained relative maxima.) Let f and ® be of
class C@), and let Q(x, h) = Sof j;=; Fi;(x)h*h? where F is as in the multiplier rule.
Show that:
(a) If f|M has a relative maximum at xo, then Q(xo, h) > 0 for every h € 7'(xo).
(b) If Q(xo,h) > O for every h € T(xo), h ¥ 0, then f|M has a strict relative
maximum at xo. [Hints: See the proof of Theorem 6. Set hy = t7!(y; — xo),
and show that lim;_,9 Q(xo, hz) = Q(xo, h).]
CHAPTER io
Integration
If A = £”, we write simply ffdV,. The symbols dV, and dV,(x) are used
after an integral sign merely for convenience and for traditional reasons. They
will have no significance by themselves. (On the other hand, df has a meaning
already explained in Section 2-6. In particular, dx’ stands for the differential
of the ith standard cartesian coordinate function X*.)
If n = 1, then we write, as is customary, f4 f(z) dx instead of
Sa f(x) dV;(z), and if A = [a, b], we write f? f(x) dz.
5-1 INTERVALS
What is the n-dimensional measure of a subset A of #”? To start with,
let us consider the simplest possible case—where A is an n-dimensional interval.
A 2-dimensional interval is a rectangle with sides parallel to the coordinate
axes (Fig. 5-1). Its area is the product of the lengths of its sides. Since H? is
the cartesian product H! x E', a 2-dimensional interval is just the cartesian
product of 1-dimensional intervals. Similarly, a set J C H” is called an n-dimen-
sional interval if I is the cartesian product of 1-dimensional intervals:
LK Se le
The n-dimensional measure V(I) of I is the product of the lengths of the inter-
eulStel
iy elem pad as
We shall next define the measure of a set which is a finite union of
n-dimensional intervals. For this purpose the idea of grid of hyperplanes is
introduced.
Grids. For each 7 = 1,...,n let us take a finite set of real numbers; let
the elements of these sets be denoted by x;, where
fh Kee
and m;+1 is the number of elements of the ith set. Let P} be the hyperplane
with equation x’ = 2}, and let II be the union of all of these hyperplanes.
Such a set II is called a grid of hyperplanes. A grid divides H” into a finite
number of n-dimensional intervals, called intervals of 1, and a finite number
5-1 Intervals 137
x2
Xie \
0 | |
| |
| |
| |
of unbounded sets. The latter could be called semi-infinite intervals of II, but
we shall have no occasion to do so. The intervals of I have the form J =
Ji X+++X Jn where J; = [2}, 2}41] and the integers j,,...,j, may be
chosen arbitrarily subject to 1 < 7; < mi4;. There are m,--+m, intervals
of the grid, and for convenience we have taken them to be closed. (See Fig. 5-2.)
Let us call a set Y a figure if Y is the union of certain intervals I,,..., I»
of some grid II. The measure of Y is
PROBLEMS
TeLet =) 1sand Y= (0,1) U [2,3], Z = 11; 3] U [4, 5).. Verify formula. (5-2) in
this example.
2a leiw =12 and Y= (0,2) < (0,1) U [1,3] x [Lb 2], Z = [—1, 2] x [—1, 3]. Find
a grid II such that both Y and Z are unions of intervals of II. Find the areas of
Y,Z, Y UZ, and Y f Z, and verify that V(Y) + V(Z) — V(Y UZ).= V(Y NZ).
138 Integration 5-2
3. Let Ix = [0,1] X [0,1] X [0,1] and Io = [%, 2] X [0, 2] X [—1, 2], Find the
volume of J; U Ig and of 11 N Ie.
4. Let m be a positive integer, f(z) = expz, and Y = I; U--:U Im, where for
i = Ipootg .
Ix = [(k — 1)/m, k/m] X [0, f(k/m)].
Find the area Vo(Y). Show that it is approximately e — 1 if mis large.
5. (a) Let I and Je be n-dimensional intervals. Show that I, Q J2 is also an interval
provided that it has nonempty interior.
(b) When is I; U J2 an interval?
5-2 MEASURE
We shall now define the measure of a bounded set A. This is done in two
stages. I irst, the measure of an open set @ is defined by approximating G
from within by figures, and that of a compact set K by approximating K from
without by figures. In the second stage, A is approximated from within by
compact sets and from without by open sets. This two-stage approximation
process is an important feature of the Lebesgue theory of measure.
There is an older theory of measure due to Jordan. In this theory A is
approximated simultaneously from within and without by figures. The Jordan
theory is unsatisfactory for several reasons. Among them is the fact that the
class of sets to which it applies is too small. For instance, there are compact
sets to which the Jordan theory does not assign any measure.
Let G be an open set. If Y is a figure contained in G, then the measure
of G must be more than V(Y). It is defined to be the least upper bound of the
set of all such numbers V(Y). (See Fig. 5-3.)
Figure 5-3
Example 1. Let G = intZ where Z is a figure (recall that any figure is a closed set).
Let us show that V(int Z) = V(Z). For any figure Y C int Z, V(Y) < V(Z); and
given e« > 0 one can find such a figure Y with V(Z) < V(Y) +. Thus V(Z) =
Supa VY) vee mt Z\.
We shall now establish a formula for the sum of the measures of two open
sets. For this purpose a topological lemma is needed.
VW ae OA) Vi Vz).
Since Y CcG, V(Y) < V(G); similarly, V(Z) < V(H). Hence
Vikan dV Oke mz
4
Example 2. Any figure Y is a compact set. The new and old definitions of V(Y)
agree. The proof is like the one in Example 1.
Proof. Let f(x) = dist (x, L). Since K ML is empty and L is closed,
f(x) > O for every x E K. Since K is compact and f is continuous, f has a
positive minimum value d on K.
Let W be any figure such that K UZ C int W. Then W is the union of
intervals J;,..., 7, with diameters less than d. Let Y be the union of those
intervals 7; such that (int J;) MN K is not empty, and Z the union of those
such that (int J;) QL is not empty. Then YUZCW, YNZ is empty,
and KC int Y, L CintZ. Hence
Now let A be any bounded set. Its outer measure, denoted by V(A), is
defined by approximating from without by open sets:
VADESV A):
It is easy to show that if B C A, then
If H is a bounded open set, then V(H) < V(GQ) for any open set @ con-
taining H, and there is equality when G= H. Hence V(H) = V(H). Given
€ > 0, there is a figure Y C H with V(H) — e < V(Y). Since Y is a com-
pact set, V(Y) < V(H), and hence V(H) — e < V(H). Since this is true
for every € > 0, V(H) < V(H). But V(H) < V(#), and therefore
We showed above that bounded open sets and compact sets are measur-
able and that the new definition of their measures agrees with the previous one.
Many sets which are neither open nor compact are also measurable. In fact,
the only examples of nonmeasurable sets are obtained in a quite nonconstruc-
tive way using the “axiom of choice” of set theory (see [15], p. 157).
We shall now show that finite unions, intersections, and differences of
bounded measurable sets are also measurable. [or this purpose let us first
prove the following.
If An B is empty, then
Since the last inequality is true for every € > 0, we must have (5-5).
Similar reasoning, using Lemma 3, gives (5-6). I
142 Integration 5-2
Proof. By Lemma 5,
LCA—BCH.
Ee GR Ge KG a)
By Lemma 2a,
and the three sets on the right-hand side are measurable and no two of them
intersect. By Corollary 1, A U B is measurable. §
Lemma 2b. Let Gi, Go, ... be open sets each of which has finite measure.
LNG = Gy U G2 U---, then
On the other hand, given e > 0 let G; be an open set such that A, C G; and
Vi GeV pa eo, 2, end lebe G et GUL oie letter:
A C @and therefore V(A) < V(G). By Lemma 2
and 5>2—* = 1. Since this is true for any € > 0, A is measurable and (5-9)
holds. §
Proof. Let
VGA\e ye (Bar
k=1
Since By C Az, V(Be) < V(Az), giving (5-10). If the sequence Aj, Ao,...
is disjoint, B, = A; and (5-10) becomes formula (5-9) which has already been
proved.
Lita GA Geen Bag) ee ee eri
Unbounded sets. Let U, = {x: |x| < r}. A set A is called measurable
if A m U, is measurable for every r > 0. The measure of A is
Example 3. Let Amn = im, 0). “Then V(A;,) = --o for each m = 1,2,..., but
V(A1N A2n::: ) = 0 since the intersection is empty.
KG oy VC).
k=1
The third possibility is that G contains some half-line. In that case V(@) = +.
The sets of measure 0 play a special role. They turn out to be negligible
in integration theory, and for that reason we shall call them null sets.
Corollary. Jf A;, Ao,... are null sets, then Ay U Ag U-::: 28 a null set.
If B C A and A is a null set, then B is a null set.
Proof. By (5-10)
Ona AU Aa Uc) 0 0,
k=1
which proves the first assertion. If A is a bounded null set and B C A, then
Example 6. A set A is countable if either A is a finite set or its elements can be arranged
in an infinite sequence, A = {x1,X2,...} where x, ~ x: fork #1. Any one point
set is a null set. Hence, taking A; = {x;}, we find that any countable set 1s a null set.
Example 7. Let A be the set of rational numbers in the interval (0,1). Then A is
countable. For instance, one can write
Hence A is a null set. Since Vi(A) + Vi[(0, 1) — A] = Vil, 1)] = 1, the set of
irrational numbers in (0, 1) has measure 1 and therefore must be an uncountable set.
PROBLEMS
In 1, 2, and 3 assume that the sets are bounded.
1. Let A and B be measurable. Show that:
(a) V(A — B) = V(A) — VAN B).
(Dye VARS By Vi A) Bye= Vi) = V(b):
2. Show that if A, B, and C are measurable, then
VAU B) = V4 — B) = VA).
0 Ag Ay Ag 1
I
——————
oe
As As ‘As As
Figure 5-4
(a) Show that Vi(@) = 1. Hence the compact set K = [0,1] — G@ is a null
set. K is called the Cantor discontinuum.
(b) Show that no connected subset of K contains more than one point.
(c) Show that « € K if and only ifx = )°72, a:3—‘ where a; = Oor2,i = 1,2,...
(d) Let f(z) = 214,;2-*—! for xe K. Show that f(K) = [0,1]. Hence K
is uncountable.
(e) For z in the kth interval of A; let f(x) have the constant value (2k — 1)2~/,
k = 1,2,...,27-!,7 = 1,2,... Show that f is continuous and nondecreas-
ing on [0, 1]. [Note: f is called the Cantor function.]
11. Show that any straight line in EH? has area 0.
12. Show that if A is an unbounded measurable set, then
V(A) = sup {V(K): KC A}.
(if V(A) = + ©, this means that for every C > 0 there is a compact set K C A
with Vici) w= C.)
The object of this section is to define the integral ff dV of f over HE” when
f is bounded and has compact support. This is done first for functions taking
only a finite number of values. In that case the integral is just a certain finite
sum. A function ¢ is called a step function if there exists a disjoint collection
{Ay,..., Am} of bounded measurable sets such that ¢(x) is constant on each
A; and ¢(x) = 0 for x € Ay U---UAm. If o(x) = cx for every x € Aj,
then the integral of ¢ over EH” is
JeeS AD (5-13)
k=
Example 2. Let (zx) = k/m for x € [(k — 1)/m,k/m), k = 1,2,...,m, and
¢o(x) = O fora ¢ [0,1). Then
1 2 mm 1
m2 | m2 ieee m Im
[oa-
We write dx instead of dV; in case n = 1.
148 Integration 5-3
SS Hien) = 0S eh aesy bo
t=1 t=1 k=1
Since d; = cz, whenever B; N Ax is not empty, the right-hand sides are equal.
This shows that {¢ dV does not depend on which collection is used.
Similar reasoning shows that the sum ¢ + wy of two step functions is also
a step function. Moreover,
Proof. Given € > 0 there exists a step function ¢, > f such that
If f and g are integrable, then the right-hand sides of (5-18a) and (5-18b)
are equal. The left-hand side of (5-18b) is not greater than the left-hand
side of (5-18a). Hence both upper and lower integrals of f + g equal ff dV +
fg dV. This proves that f + g is integrable and (5-17a). The rest of the proof
is left to the reader (Problem 4). §
PROBLEMS
3. Let a unit square be divided into a small square in the center and 2m annular
figures of equal width surrounding it, as shown in Fig. 5-5. Let ¢(z, y) = 0 for
(x, y) in the small square or outside the large square. Let o(a, y) = (—1)*k in
the kth annular figure, k = 1,...,2m. Show that
e| Figure 5-5
4. (a) Show that if f is integrable, then f(cf) dV = cffdV. [Hint: Show that this
is true if c > 0, and that Sg Wy = S(—9) dV for every g. If c < 0, set
g = cfandg = —<cf.]
(b) Show that ffdV < fgdVitf < g.
and the right-hand side is CV(A). Similarly, —f(x) < |f(x)| and
Let us next show that under quite mild assumptions about f, the integral
exists. It is for this purpose that the idea of measurable function is introduced.
Let f have domain E”.
Definition. If {x: f(x) > c} is a measurable set for every scalar c, then
f is a measurable function.
It will be shown in Section 5-10 that such operations as taking the sum of
two measurable functions or the limit of a sequence of measurable functions
lead again to measurable functions. Just as for nonmeasurable sets, the only
examples of nonmeasurable functions are obtained in a nonconstructive way
using the axiom of choice.
Ay = fan
{x: (k — 1)/m < f(x) < k/m}.
fe a= fv dV = V(EN)/m,
V(E,)/m < V(D)/m < e.
Proof. Let us first assume that f > 0. If c > 0, then the set
PROBLEMS
1. Let fs) = MU —2* if OS 2 < 2, f(z) = 0 otherwise. Using the notation in
the proof of the Jemma, describe the sets A1,..., An. Sketch the step functions
¢ and yp in the case m = 4,
2. In each case show that f is integrable over A.
(a) fiz) = z* expz, A = (0, a}.
(b) f(z) = sin (1/4) ifz # 0,f0) = 5, A = [—1, 1].
() fz,y = —/)/@ —»,A = ey, a] <1,\yl < 1,2"¥ y}.
(d) fiz) = 0 if & ia irrational, f(z) = 1/q if c = p/q where p and q are integers
with no cornmon factor, A = (0, 1).
(e) fiz) = 1 if & ie irrational, f(z) = 0 if z is rational, A = [a, BJ.
3. For each part of Problem 2 describe the sets {x © A: f(x) > c}.
4. Show that if f, g, and h are integrable over A and |f(x) — g(x)|< h(x) for every
x€ A, then |f,fdV — fagdV| < fahdv.
5, Let f be of dass C™ on (0, a) and b = max {|f’"(z)|:0 < ¢ < aj. Let g(z)=
SO) +f/O)z. Using Proble mn 4, show that \fo f dz — af(0) — a*f’(0)/2| < a*b/6.
Use this result to estimate f}/* exp(—z?/2) dz.
6. (Mean value theorem for integrals.) Let A be compact and connected. Let f be
continuous on A and g be integrable over A with g(x) > O foreveryx € A. Prove
that there exists x* © A such that
[9 qv = fx") gav.
[,$@
A
ane) = fae’)
R
ave). (5-28a)
Actually, Theorem 15 is true if A is merely measurable and f is integrable
over A. In that case there may be a subset of AR of s-dimensional measure 0
for which g(x’) is undefined. We shall not prove this general form of Theorem 15.
However, in the next section the theorem will be extended to the case when
A is a g-compact set.
In the proof the following fact about monotone sequences of functions
will be used. For each vy = 1, 2,... let F, be a bounded measurable function
with compact support such that
For each x’, the functions F,(x’, ) form a monotone sequence tending to
TixCes ae Let
G(x’) = ijF(x’, )dVa-, Gx)a i)F(x’, ) dVn—s-
Applying the monotone sequences theorem to the sequence [F,(x’, )],
for every x’. Moreover, G; > Gz > --- Applying the monotone sequences
theorem to this sequence,
lim [G,dV, = ilGdV..
y—o
if
F,dV, = iG, dVs.
Therefore
{|FdV, = iGdV.. (5-23b)
Corollary.
Vn(A) = ipVn—slA(x’)] dV,(x’). (5-24)
Proof. Take f(x) = 1 in (5-23a). J
In particular, let s = 1. Writing xz’ = x' = u, (5-24) becomes
” n—1 no n
(c — u) By (c — u) c
USN (n — 1)! a n(n — D!o nl
To evaluate this integral we have of course used the fundamental theorem of calculus.
A proof of this theorem is given in Section A-9.
a! Figure 5-9
where the iterated integrals are taken over the appropriate subsets of H'. Many
authors write the iterated integral as fff dy dx, but this notation would lead to
confusion when we come to the exterior differential calculus in later chapters.
The iterated integral is usually easier to evaluate when taken in one of the
two possible orders than in the other order.
160 Integration 5-5
Then f(a, y) = «2 -++ y and A is as shown (Fig. 5-10). Evaluating the inner integral
first, we get
1 1 4
[L,@a4)
1 2
2 y
a2
dex <4 fears —1
(.!| ae = ¢é.
Writing ff dV2 as an iterated integral in the opposite order, we get
we x ne [ov
[afc : + y) dx = ~
[ (E+a)/ a =8/ie y?? dy
== 28.
and A(z, y) is the interval (0, 7 ee y]. (See Fig. 5-11.) Writing the integral
over F& as an iterated integral, we get
2 Ae V4—22—y
[favs =f de [ dy [ f dz.
A 0 0 0
There are 5 other possible ways of writing {4 f dV3 as an iterated triple integral.
For instance, crs
4—z°—z
[favs
A
= fff
s Vo
f dy\ dV (a, 2)
2 V4—22 Lg ye
s / dz / dex| f dy,
0 0 0
In the same way, for any n the integral can be written in n! possible ways
as an n-fold iterated integral.
pe
3 0)
R
=; _ fav'p(x)
dVn(x)
Pomona oat
If p is constant, then ¥ = X.
162 Integration 5-5
Example 5. Find the centroid of the hemispherical n-ball H = {x:|x| < 1, Peele
(See Fig. 5-12.) Now Ve) = a,/2, where a, is the measure of the unit n-ball
(Problem 7). By symmetry,m' = 0 fori > 1, and
m} ie dV (x)
ll avai u(1
2\ (n—1)/2 _ On—1
wu) UT
Thus
x Jes 207-1 e
as an integral over a set A C E?, and then as an iterated integral in the opposite
order. Evaluate it.
3. Express as an iterated triple integral:
A S
1 1
(b) Evaluate iL dxf exp (a + y) dy.
T 1/2
(c) Evaluate il dy [ xy cos (c + y) dz.
0 0
7. Let a, be the measure of the unit n-ball {x:|x| < 1}. Prove by induction that
apr” is the measure of an n-ball of radius r, and a, = 2an—if§ (1 — u2)-D/2 dy,
Show that a4 = 17/2. [In Section 5-9 we give a general formula for an.]
5-6 The Unbounded Case 163
Figure 5-13
Proof. Note that |f + g9| < |f| + |g]. By Corollary 1 and (b) of the
lemma, |f| ++ |g| is integrable over A. Apply the comparison test.
The integral can be written as a limit. Let K,, Ko, ... be any nondecreas-
ing sequence of compact sets such that Ad= K, U Ky U--- Then
iL nee ak ¥. pce
Taking the limit as vy> o,
= dV.
[turf tv Last
Note: Not every measurable set is o-compact. For instance, it can be
shown that the set of irrational numbers in [0, 1] is not a-compact. However,
166 Integration 5-6
it turns out that any measurable set A has a o-compact subset B such that
A — Bisa null set.
Let us now consider some important particular cases.
Case 1. A is closed. For every r > 0 let A(r) = {x EA: |x| < 7}.
Each of these sets is compact. Let r ,7rg,... be any nondecreasing sequence
tending to +o, and K, = A(r,). Let f be integrable over A, and let
V(r) = ee fav.
By (5-29), ¥(r,) tends to the integral of f over A as vy> o. Since this is true
for every such sequence rj, 72,..., we have
~ 1
x ?de=
| p= I
Example 2. Let f(z) = exp (—bx),b > 0. Then f is integrable over [a, ©) for any a.
Note: If A C [a,b] and [a,b] — A is a null set, then we still use the
notation f? for f.4.
Case 3. Let f be continuous on A = K — {xo}, where K is closed but not
compact. Let A; = {x eK : |x — xo| > 1}, Ag = {xEK:0 < |x — xo| < 1}.
Then f4,fdV and f4,fdV can be treated respectively as in Cases 1 and 2.
Since A; M Ag is a null set, if both of these integrals exist their sum is fy f dV.
If either the integral of f over A, or the integral over Ay does not exist, then
f is not integrable over A. To show this, suppose for instance that f is not
integrable over A,. By Corollary 1, f4, |f| dV diverges to +o. Since A, C A,
so does f4 |f| dV. Hence f is not integrable over A.
Example 3 (continued). Let A = (0, ©) and f(x) = 2-?. Taking A, = [1, ~),
Ag = (0, 1], the integral over Aj exists only if p > 1, and over Ag only if p < 1.
Hence fo z~” dz diverges to + for every p.
iiix — xol~” aV
A
exists if p < n and diverges to-+™ if p > n. This is proved in Section 5-9 by intro-
ducing generalized spherical coordinates. Let f(x) = $(x)|x — xo|~”, where ¢ is
continuous and |¢(x)| < C for every x A. Let g(x) = C\x — xo|"”. By the
comparison test, f is integrable over A if p < n. If ¢(x) = ¢ > 0 for every xe A,
let h(x) = clx — xo|~”. If p > n, then f > h. The integral of h over A diverges
to +, and hence so does the integral of f.
In the same way
/ Ix — xol|? dV
E”"—K
Example 5. Let
The function I is called the gamma function. Let f(z) = 2“~1 exp (—2). Since
f(z) < 2-1 and p = 1— u <1, f is integrable over (0,1] by comparison with
a-?. For any 0, 2° exp (—xz) > 0 as x +--+ and therefore is bounded on [1, ~).
Letting b = u-+ 1 we see that there is a number C such that f(x) < C/x? for every
x €[l, ©). Thus f is integrable over [1, ©) and over (0, 1], therefore over (0, ©).
The gamma function generalizes the factorial. First of all,
The integration by parts over (0, ©) is justified by taking it first over intervals
[6, 1] and [1, 7] and letting 6 > 0+, r +--+. In particular, if m is an integer, then
IMG. == 1) = witGo). Siac 1)) = il,
[Fen = [Gav
for each y = 1,2,... The sequences F,, F2,... and Gi, Go,... are non-
decreasing. Using the monotone sequences theorem (p. 190) three times, just
as in the previous section,
[Fan = [@av,,
which is the formula (5-23b) for iterated integrals. If either integral in (5-23b)
diverges to +o, then so does the other. Therefore, if f > 0, one way to show
that f is integrable over A is to show that the iterated integral exists.
Example 6. Let f(z, y) = |z|/(1-+ «2+ y?)?, A = EE. By symmetry
AVE © Ox z geese
fs 2 a wv Genes 222 dy TSA
5-6 The Unbounded Case 169
Now let f be any continuous function integrable over A. The iterated in-
tegrals theorem applies to ft and f—, and hence by subtraction to f. Caution: If
f has both positive and negative values on A, then one cannot conclude that f
is integrable from the fact that the iterated integral exists.
1 7 1
i dy [ y "cose dx = f Ody = 0.
0 0 0
The iterated integral in the opposite order does not exist, hence f is not integrable
over A.
Note: In the discussion of iterated integrals above, the integral of f(x’, )
over A(x’) is in the sense of the present section. However, the function g in
(5-23a) need not be continuous on R. In fact, g(x’) may be +o for certain
values of x’. If g is discontinuous and unbounded on R, then its integral has
to be taken in the sense described in Section 5-10.
PROBLEMS
1. Determine whether the integral exists or is divergent to +.
2
us /2
dx
(d) / FS oer Tees
ees Eel)
Fae
(e) / mine daz.
Daechow thatet(2,4,2)00 <2 — (2-- y7)/ry, 0 < « < 1,0 < y X< 1} has in-
finite volume.
3. Show that the volume of {(z, y, z):0 < z < |zy| exp (—2? — y?)} is 1.
4. Show that ff(2, y) dVo(a, y) exists if f is bounded, continuous, and |f(z, y)| <
Care ie) aarp 2:
5. If lim, f", f(z) dx exists but f is not integrable on E’, then this limit is called
the Cauchy principal value. Find the Cauchy principal value:
(a) fa) = 2/7 2). (c) Any odd function f, f(—z) = —f(z)
(b) f(z) = x +1/(1+ 2”). for every «.
170 Integration 5-7
9. (a) Assume that for every real number c, both of the sets {x € A:f(x) < c}
and {x € A:f(x) > c} are open relative to A. Show that f is continuous
on A. [Hint: Show that {x € A:c < f(x) < d} is open relative to A.
(b) Using (a) and the Corollary to Proposition A-6 (Section A-6), prove: if f is
continuous on A, then f+ is continuous on A.
10. Let A be a bounded, o-compact set, and let f be bounded and continuous on A.
Show that fu fdV, as defined by (5-28), is the same as in Section 5-4. [Hint:
Suppose first that f > 0. For any compact set K C A, we have, taking the integral
over A in the previous sense, 0 < fafdV — fxfdV < C[V(A — K)] provided
f(z) < C for every xE A. But V(A — K) = V(A) — V(K) is arbitrarily
small. Apply this to ft, f-, and subtract.]
11. Let Ki, Ko,... be a nondecreasing sequence of compact sets, and A =
Ki U K2U-:: Assume that if K is any compact subset of A, then K C K, for
some v (sufficiently large). Prove (5-29). [Hint: Suppose first that f > 0, the
integral being then given by (5—-26).]
12. (Difficult.) Let f be continuous on an open set D. Assume that the integrals of
ft and f~ over D both diverge to-+%. Show that given any number / there is a
sequence of compact sets Ki; C Ko C:--: such that D = K, U Ke U--:> and
linn dea =.
g(t) = L(t) + xo
for every t € EH”.
Proof. First of all, (5-35) is true for some particular kinds of affine trans-
formations. Certain elementary details of the proofs are left to the reader
(Problem 4).
(1) g is a translation, L = I. Then V[g(A)] = V(A) for any figure A,
and hence for any compact set.
From (1) we may assume from now on that x» = 0, g = L.
(2) For some k and /, the transformation L merely interchanges the com-
ponents ¢* and ¢’ of t. Then V[L(A)] = V(A) for every figure, and hence for
every compact set.
(3) The matrix of L is diagonal. In this case L*(t) = c’t', where ci,..., c? nm
and (det M)(det N) = detM-N. Hence the theorem also holds for their
composite M-N.
If N has row covectors w',...,w” and M is of type (4), then the kth
row covector of M-N is w” + cw’ and the others are unchanged. The kth
172 Integration ms)
column vector of N-M is v;,-+ cv; and the others are unchanged, where
Vj,..-, V, are the column vectors of N. A transformation M of type (2) when
applied on the left interchanges the kth and Ith row covectors of N, and when
applied on the right interchanges the kth and /th column vectors. Moreover,
the inverse of a transformation of type (2) or (4) is of the same type.
Now let L be any linear transformation. Then
L = M,-:-->
Mi,
where M,,...,M, are of types (2), (3), and (4). Since Theorem 16 holds for
each M,,, it is true for L.
Pa pe aa a Ae ae NE a, go ia
—t
Xo FV]
Xp t+Vo
Figure 5-15
PROBLEMS
Ike Find the volume of the tetrahedron with vertices e1, —e2, e3, e1 + 2e2 + e3.
2. Find the area of the parallelogram with vertices e — e2, 2e1-+ e2, —2e1, —e1 + 2eo.
3. Let S be an n-simplex, with vertices xo, X1,...,Xn. Show that
he
x0 el
ok) el foine
1
aR
1
V(S) = — | det
n! B
° ZO 1 Xn
il il 1
[Hint: Subtract the first column from each of the other columns. This does not
change the determinant.]
. Complete the details of steps (1), (2), and (3) of the proof of Theorem 16.
. Let (cj) be a positive definite symmetric n X n matrix, and let \1,..., An be its
characteristic values. Show that
where a, is the measure of the unit n-ball. [Hint: See Theorem 11.]
. Show that if |v;| < C for each i = 1,...,n and P is a parallelepiped spanned by
Vi,--+,Vn, then V(P) < C”. [Hint: Use induction, a suitable rotation of E”,
and the method of slices.]
Figure 5-16
Xo = g(to), L = Dg(to).
The affine approximation G to g at to is given by
[sav a Dy (tk)
V Lk) ~ i oaV.
Hifav = I $ dV.
A B
The following theorem states that the formula is correct, and not merely for
compact sets.
5-8 Transformation of Integrals 175
[f@ave@)
A
=f,g (A) fetlivediavio, (5-38)
provided either integral exists.
The right-hand sides are equal. If either integral diverges to +, then so does
the other. Thus (5-38) holds if f > 0. In the general case, write f = ft — f7,
¢=¢' —¢. Since |Jg| > 0, we have
L(s — t). Let C = ||L7'|| (p. 103). Thens — t = L~’(@ — y), and
s—t] < Ck — yl. (*)
Let o = 7/(2V/nC). Since g is differentiable, to has a neighborhood
Q Cc A such that
lg(t) — G(t)| < o|t — to (**)
for every t € Q.
Let x € g(I) and s = G’/(x). Then x = G(s) = g(t) for some t € J.
By (*) and (**)
ls — t| < Co|t — tol.
c= got = [ov
[ew = | ow,
0
sav= > | fav.
gC) Gay g(J5)
Choose some such j and let J’ = J,;. By dividing J! into 2” congruent n-cubes
and repeating the argument, we obtain J*. Continuing, we obtain a sequence
of n-cubes I' D> I? D- ++ such that
of those meeting B, and so on. Then Z; > Z2 D--- and their intersection
is B. There exists vo such that Z, C A for all vy > v9.
Applying Lemma 3 to each of these congruent disjoint n-cubes comprising
Z,, and adding, we get
This fact will be proved in the section on convergence theorems [Corollary 5(b),
Section 5-10]. Thus
| fv < i ¢ dV.
A B
But g~’ is also a regular transformation. Interchanging the roles of A
and B,
[ev
B
sfA eee ela.
Bonen | |/e(t)|=" ix g(t), and
ole @)ilJen
(x)) — fG):
Therefore
if
B
gqV < iA fav,
which proves (5-38) for the case of compact sets. This completes the proof
of Theorem 17. J
Corollary 1.
VAy= fe, Vel avin. (5-39)
Corollary 2. /f B is a null set, then g(B) is a null set.
Figure 5-18
K is a null set. If A is any measurable set, then V(A) = sup {V(K) :K C A}.
If A C M, then A is a null set.
Example 1. Let 4 = {@, y):2 > 0,4 > 0,0 < ay < 3,4 < y <'2z}, f@, y) = y?,
g(s,t) = Vs/te: + Vste2fors > 0,t > 0. We show that g is univalent by solving
the equations
B= Wl, y = Vst
explicitly, and find that s = zy, t = y/z. Since Jg(s,t) = 1/2t ¥ 0, g is regular.
Moreover, the part in A of the hyperbola zy = c, 0 < ¢ < 38, corresponds to the
segment s = c, 1 < t < 2in B. Hence B is as shown in Fig. 5-18, and
ss Ss
Jy aVo(z, y) = 3] at sds = i.
A eo
Example 2. Let P be an n-parallelepiped. Then P = g(Jo) where, as on p. 172, g is
affine and Jo is the unit n-cube. Then
PROBLEMS
1. Letn = 1, g(t) = t? — 2+ 3, B = (0,1). What does (5-38) become?
2. Let g(s, t) = (s?-+t7)e1+ (s? — t?)eo,s>0,t >0,and A= {(2, y):2<a+y <4,
z —y > 0,y > 0}. Show that g is regular and evaluate [4 «—! dV 2(z, y).
3. Find the second moment about (0,0) of the parallelogram with vertices (0, 0),
e; + e2, —2e; + 3e2, —e; + 4e2. [Hint: Let g be the linear transformation L
with column vectors e; + e2, —2e; + 3e2.]
4. Let B be a compact set, X its centroid, and g an affine transformation. Show that
g(x) is the centroid of g(B).
5. Let A be symmetric about 0, that is, x € A implies —x € A. Letf be integrable
over A and f(—x) = —/f(x) for every x € A. Show that fafdV = 0.
6. Let g be of class OC) on A and K be a compact subset of A. Show that there is a
number C such that |g(s) — g(t)| < C|s — t| for every s, t © K. [Hint: Proposi-
tion 13b, Section 4-3.]
7. Let g be regular and K, C as in Problem 6. Show that if BC int K, then
V[g(B)] < C"V(B). (Hint: Fort € B the partial derivatives of g satisfy |gi(t)| < C.
Use Problem 6, p. 173 to see that |Jg(t)| < C”.]
5-9
180 Integration
x! = rcos 6!
x” = rsin 6! cos 6?
Example 1. Suppose that f(x) = (|x|), where ¢ is continuous on (ri, r2). Then
where a@,, is the measure of the unit n-ball. Hence Bn = nan, which turns out to be
the (n — 1)-dimensional measure of the unit (n — 1)-sphere.
*Gamma and beta functions. The gamma function was defined on p. 167.
If we let x = g(s) = s?/2, then
LEO) = | | So
exp (3772) ds{” t?°—! exp (—t?/2) dt.
0 0
Writing the iterated integral as an integral over the first quadrant Q and intro-
ducing polar coordinates,
T'(u)T(v)
Buu, v) = T(u +0) : (5-42)
proved earlier, '(m + 4) can be found explicitly for any positive integer m. For
instance,
He = iraah ae Ey (5-45)
5-9 Coordinate Systems in E” 183
A variety of integrals can be reduced to either (5-41) or (5-45) and hence can be
evaluated in terms of the gamma function. See Problem 10.
The measure a, of the unit n-ball. According to Problem 7, p. 162,
Sf)
An—1 0
is ee a
rar n (242)
1
An = An—] nN
r G a 1)
PROBLEMS
1. Let A = {(2,y):22 + y? < a?,2 > 0}. Evaluate fi, zy? dVo(a, y) by intro-
ducing polar coordinates.
2. Find the area of {(z, y):4 < y < 24,1 < 4+ 4y < 4} by introducing f'(z, y) =
-y/2, f2(z, y) = z+ 4y as coordinates of (x, y).
3. Let A = {(2,y): O< 2? +y? < 2,2?-—y?<1,2>0,y> 0}. Find fa rdVo(z, y)
by introducing the coordinates f'(z, y) = 2? + y?, f?(z,y) = a? — y?.
4. Write the iterated integral fj drfj7” dy ff?”
f(a, y, 2) dz as an iterated integral in cylindrical
coordinates.
5. Find V3({(z, y, 2): 2? + y? + 2? < a7, 27+
y? > b?}) where a > b.
6. (Solids of revolution.) Let S be a compact
subset of the right half-plane and
A= {(z, Y, z) : (r, z) ‘S Si
7. Find V3({(q, y, 2): exp (—22) > y?+ 27,2 > 0}). .
8. Let (f',...,f”) be a coordinate system for Dy C ee and (¢!,..., ”) a coordi-
nate eaten for Dg C E?. Show that (f!,...,f",¢',..., 6”), regarded as func-
tions on D; X De, form a coordinate system for\D; X De
9. (Bipolar coordinates in £4). In this system the coordinates of x are 7 cos 8, 7 sin 8,
p cos a, p sin a, where (r, 6) are polar coordinates in the z!x?-plane and (p, a) polar
coordinates in the a3z4-plane. Find, using bipolar coordinates,
ik (x!)? dV a(x)
KXK
Show that
numbers. However, for present purposes we agree that —x1 <a < +x
for every number a, and that
Lemma 1. Assume that Gn, > 0, Gmy < Amv+41, Amy S Ami, for every
ie = Un OP od aN eae Mat
Proof. Let
De wT aa (Cone
yo mo
These limits exist since dm, < Amo < +--+ and a), < do, <-+-- Moreover,
Dr oe ree andic ye Coe Let
For every m and v, dm, < c. Hence b» < c for every m, and it follows that
b < cc. Similarly, c < b. ff
This is (5-11).
If AG, A>)» are disjomt, let By, — Ay U-+-U A, Then bie Boies
A = B, U By U---, and by what we have just proved
This is (5-9). Since differences of measurable sets are measurable [Problem 1(a)],
the same reasoning as on p. 144 establishes (5-10). §f
is a measurable set for every real number c. Let us list and prove some proper-
ties of measurable functions.
(1) If f is measurable, then for every c the sets {x : f(x) < ch, {x: f(x) > ¢}ry
and {x : f(x) < c} are measurable sets.
then A is measurable.
Let yi, yo, ... be any sequence in #'. Let
Then z} < zo <--: The limit of the monotone sequence 2), 22,... is called
the lower limit of the sequence y1, y2,..., and is denoted by lim inf y,. Simi-
larly, if asl
p= SUDA Ys Uyeete ere} i Spel oe
then w; > wo >--:: The limit of the monotone sequence wy, we,... 1s the
upper limit lim sup y,. Since z, < w, for each vy= 1,2,..., we must have
yoo
Equality holds if and only if the sequence yj, ye,... has a limit.
(4) If fi, fo,... are measurable and
then f is measurable. i
2f00) me ObMCACn Vs lap): eum lew, (Xe mine, (X) 0)10q(X) eee ens,
(8b) each h, is measurable; hy < ha < ---, and by the definition of “lim inf,”
f(x) = lim,_.. h,(x) for every x. Therefore f is measurable. J
then f is measurable.
This is a particular case of (4). J
shins (6-9) Ss A
where 7 is an integer, and 1 — v? < j < v?. This construction is like one
on p. 153. Then
f(x) = lim f,(x)
vo
(a) Let f be bounded, measurable, and f > 0. For each r > 0, f is meas-
urable on the r-neighborhood U, of 0. By Theorem 14, f is integrable
over U,. Let $(r) equal the integral of fover U,. Since f > 0, ¢ is non-
decreasing. Hence ¢(7r) tends to a limit, finite or +x, as7r—>-+to. Let
If the limit is finite, then f is integrable (over HE”). Otherwise the left-hand
side of (5-49) equals +o. If0 < f < g, then ffdV < fgdV.
(b) Let f be measurable and f > 0. For any ¢t > 0 consider the function
:f such that
:f(x) = min {f(x),¢} for every x.
In case f is bounded and has compact support, then the new definition
agrees with the one in Section 5-3. For if the integral is taken in the
sense of Section 5-3, then formula (5-51) holds. (See Proposition 17 and
Theorem 14.) Since f* and f~ have the same integral in the new sense
as in Section 5-3, so does f.
Let us now give some general theorems about the validity of interchanging
the symbols “lim,_,..” and “f”.
lim |, dV ==" 0.
von
Proof. There exists C such that ¢,(x) < C for every x and vy = 1, hence
also for yi== 2,3,... Given e > 0 lev c= €/2V(K)> Let
Let vo be such that V(A,) < €/2C for every v > vo. Then
Proof. We have already shown that the limit f is measurable. First let us
assume that f is bounded and has compact support. Let ¢, = f — f,. Since
0 <f, < f, each function f, is also bounded and has compact support. Moreover
and apply Lemma 1. Finally, if f is unbounded observe that for each m = 1,2,...,
The double limiting process (5-49) and (5-50) can be replaced by a single
one. Let f be measurable and f > 0. Let
Then 0 < f, < fe < --- and f,(x) tends to f(x) for every x. Hence by the
theorem,
frav = lim [f, dV.
vo
0< (f+pns
fi +g".
Since ft and g* are integrable, so is (f + g)*. Similarly (f + g)~ is integrable.
Then
frt+gt=(f+t@t+4,
fog. (fg) Saree,
where ¢ > 0. Since the corollary is true for nonnegative functions,
[@ dV = effav.
Corollary 3. Let f,, fo,... be measurable,f, be integrable,f, > fe >--- = 0,
and let f(x) = lim,_,. f,(x) for every x. Then (5-52) holds.
For sequences which are not necessarily monotone, there is a result called:
Since (**) holds for each v, the limit is no more than the right-hand side of (**). §f
we have
lim inf Jg, dV = lim inf [f,dV — i¢ dV.
voo poo
Applying Fatou’s lemma to the sequence 91, ga, - and adding f¢dV to each
side, we again get (5—54a).
Similarly, if for each v, f, < ¢ where ¢ is integrable, then
Note: In the monotone sequences theorem and its corollary the hypothesis
f, = 0 can also be replaced by f, > $, where ¢ is integrable.
(b) There is an integrable function g such that |f,| < g forv =1,2,...
Then
[trav = lim [f, dV.
Proof. Since —g < f, < g and both —g and g are integrable, we can
apply (5-54a) and (5-54b). But
lim sup [f, dV < /fdV < lim inf [f, av.
VO yo
But lim inf < lm sup, and hence ff, dV tends to ff dV. §
The proofs of Fatou’s lemma and the dominated convergence theorem for
integrals over A are similar.
Corollary 4. Let A be a bounded measurable set of finite measure and f,, fo, .. -
measurable on A. Assume that:
(a) lim,_,. f,(x) = f(x) almost everywhere in A.
(b) There is a number C such that |f,(x)| < C for every x € A and vy =
Pe hen
ilfdV = lim [ fav.
A v—oo JA
Proof. Let g(x) = C for every x. Since V(A) is finite, g is integrable over
A. The corollary is then a special case of the dominated convergence theorem
for integrals over A.
Proof. Let s denote the right-hand side of (5-56). Since fx fdV < fafdV
whenever K C A, we must have s < fafdV. Let Ky, Ko,... be a non-
decreasing sequence of compact sets such that A = K,; UK2U-:-- Then
Sx,fdV < s for each y = 1,2,... Setting A, = K, in (5-55), we find that
Saf dV <s. Hence fafdV = s.f
The right-hand side of (5-56) was taken in Section 5-6 as the definition.
Corollary 6 shows that the definition there agrees with the one in the present
section in case f > 0. Since the procedure for defining the integral when f also
has negative values was the same in both sections [see (5-28) and (5-51)],
the two definitions agree in general.
PROBLEMS
1. Show that:
(a) If A and B are measurable, then A — B is measurable.
[Aint: (A 3B) Ty U, = AAU, = BU]
(b) If BC A and A has finite measure, then V(A — B) = V(A) — V(B).
(c) If Ai, Ag,... are measurable, then Ay M Az M--- is measurable.
ieee (Fd ed |) Ca
m=1 m=1
Kee eltak a
j,k=1
2. Let f and g be measurable real-valued functions. Show that their product is meas-
urable. [Hznt: Show that the square of a measurable function is measurable.
Phen: 28" = 9)? = f2=— 97
5-11 Differentiation Under the Integral Sign 197
3. Show that:
(a) If A is measurable, N is a null set, and d —-NCBCAUVN, then B is
eee and V(A) = V(B). [Hint: First consider the case of bounded
sets.
(b) If f is measurable and f(x) = g(x) almost everywhere, then g is measurable.
(c) Moreover, if f is integrable, then {[f dV = fg dV. [Hint: The result is known
from Section 5-4 to be true if f and g are bounded and have compact supports.]
4, Let f be measurable and f > 0. Show that if f[fdV = 0, then f(x) = 0 almost
everywhere.
5. Let f,(x) = sin vrx. Show that:
(a) lim inf f,(~) = —1 and lim sup f,(z) = 1 whenever z is irrational. [Hint: If
yo
x is irrational, then every arc of the circle s* + ¢? = 1 contains (cos yz, sin
vx) for infinitely many ».]
1 1
(b) (lim inf f,) dx = —1,liminf |] f,dzr = 0.
0 yo yo 0)
6. Let f,(z) = v if « € (0, v—') and f,(z) = 0 otherwise, y = 1,2,... Show that
lim,4. f(z) = 0 for every x, but ff,dzx = 1. Why does this not contradict
Lebesgue’s dominated convergence theorem?
eleta, a=") @2- y*)eye= ie 2... Show that0 < f,@) < 1, lim,..f,(2) = 0
for every x, and {f,dx = a. Why does this not contradict the dominated con-
vergence theorem?
8. (a) Let fi, fe,... be integrable over a measurable set A, and assume that
i e-1 Sa |fel dV is finite. Let G(x) = >?-1 |fx(x)|. Since the terms of this series
are nonnegative, it either converges or diverges to +-% for every x € A. Show
that G is integrable over A and hence G(x) is finite for almost every x € A.
[Hint: Apply the monotone sequences theorem to the sequence Gj, G2,...,
where G, = |(fi)al +> °° + [fal J
(b) By (a) the series )-#1 f;(x) converges absolutely for almost every x € A.
Let F(x) be the sum of the series. Show that fa
PFdV = %1 Sa fi dV.
[Hint: Apply the dominated convergence theorem to the sequence F, F2,...,
where F, = (fi)a t+: + (fp).
9. Let fi, fe,... be integrable over A, where A has finite measure. Assume that
\fx(x)| < Cy for every x A and k = 1,2,..., and that the series ))¥1 Cx
converges. Show that the series > f¢-1 fx(x) converges absolutely for almost every
x € A; and if F(x) is the sum of the series, then fa FdVn = Doki1 Tate EVR
[Hint: Use Problem 8.]
Similarly (**) is true when —6 < u < 0. Lemma 1 implies that y is con-
tinuous; hence by the fundamental theorem of calculus, ¥(to) = $/(to). I
If, instead of an open set, B is a closed interval [a, b], then the proof of
Lemma 2 shows that (5-59) is still true provided ¢/(t) means the one-sided
derivative at the endpoints.
Differentiation Under the Integral Sign 199
tics tle ee
of
es(s,1)| < h;(x) for everyxEeA, te B,
where g and hy,..., hi are integrable over A. Then the function ¢ in (5-58)
is of class C' on B and its partial derivatives are gwen by (5-57).
Pa eee
Example. Let ¢(f) = {7° 2—' exp (—2t) dz, t> 0. Find ¢’(t). Using (5-59)
If B is an interval (a, ©), a > 0, the hypotheses of Lemma 2 are satisfied with g(x) =
h(a) = exp (—ax). The formula for ¢’(¢) is correct for all ¢ in any such interval,
and hence for every t > 0.
PROBLEMS
1. Find ¢’(t) if @(@ is:
2. Let o(t)= fo log (2 — xt?) dx. Show that $’(0)= 0 and that ¢ is concave
on the interval (—1, 1).
3. For « € E' let o(x)= fo exp (—#”) cos zt dt. Show that ¢’(z) = — ad(x) and
find (a).
200 Integration S73
. Let o() = Jo f(a, t) dx, for t > 0, where f(x, #) = exp (—ai)z—'sinz. Show
that:
Qe Or 1/0).
(b) 6) ~ 0 as t>-+o. [Hint: Apply the dominated convergence theorem
with f,(x) = f(z, t,) where 1 < t1 < t2 < ---andt, ~ +0 asvy > ~ 1]
sin x
(c) tim aaah AF peetet @ f dx = +,
roe /0 x 2 0
Hints: From (a) and (b), #(t) = 7/2 — tan—' ¢. Integrate by parts to show
that
f sen ae ee ee ot | 2 anr
By the dominated convergence theorem,
im, fi) Gh =| a
(07 1.0 Ome
*5-12 L?-SPACES
Let A be a measurable set of positive measure, and let p be a number
such that p > 1. The collection of all measurable real-valued functions f
with domain A such that |f|? is integrable over A will be denoted by L?(A).
For instance, L*(A) is just the collection of all real-valued functions integrable
over A.
5-12 L?—Spaces 201
By inequality (1-9),
Since the series on the right converges, so does the one on the left. Therefore
F is integrable over A and B = {x € A: F(x) = +} is a null set [Prob-
lem 8(a), p. 197]. Now each term of a nonnegative series is no more than the
sum of the series. Applying this observation to («), we find that forx € A — B
Note: The idea of convergent sequence makes sense in any metric space S.
If every Cauchy sequence in S converges, then S is called a complete metric
space. A normed vector space U which is complete is called a Banach space.
The theorem above states that L?(A) is a Banach space.
If p > 1, then the number p’ such that
: '
~+o=1
P Pp
is called conjugate to p.
5-12 L?—Spaces 203
1 t 1
eas ((G1) ) aya.
Setting ¢ = u?v-”’, where v > 0, we find since 1 — p’ = —p'/p that
uP p'
w < 2s -+- aot (5-62)
|A eo)
%
aw@larwe)
-
<bP1 44Ppi. =1.
But the left-hand side is ||fg||1/||fl|pllg||>-. This proves Holder’s inequality. |
f Wille + oP
ol
av < Cf isPav)?
Pp 1/p
Cf lp t+ol? PP avy
(p—1)p’ 1/p’
204 Integration 5-12
PROBLEMS
1. Let f(x) = |x|~*. Show that if A is the unit n-ball, then f € L?(A) for p < n/a
but not for p > n/a. [Hint: See Section 5-6, Example (4).]
2. Let f(z) = 2 (log z)-2, A = (0,34). Show that
f € L?(A) only for p = 1.
3. Let 1 < q < p. Using Hélder’s inequality show that if A has finite measure, then
f © L*(A) implies that f € L(A). Give an example to show that it is necessary
to assume A has finite measure. [Hint: Apply Hélder’s inequality to the functions
|f|2 and 1 with p replaced by p/q.]
4. Prove properties (a) and (b) (see p. 201) of the p-norm.
CHAPTER 6
Conversely, given n(n — 1)/2 numbers w,;;, 7 < j, formula (6-3) defines an
alternating bilinear function.
Similarly, for any r > 2 Jet M be a real-valued function with domain the
r-fold cartesian product H” x --- X EH”. The elements of H” x --- x EH” are
r-tuples of vectors, denoted by (h;,...,h,).
Interchanges. Let S be some set. For our purposes we shall take either
ee eOtoe lene), eli (py, , py) and (p,,..., p.) are r-tuples
of elements of S, let us say that the second r-tuple is obtained from the first
by interchanging p, and p; if p, = p:, pi = ps, and p;j = pi forl $ sg, t.
Examples. The triple of vectors (h3, he, hi) is obtained from (hj, he, hs) by inter-
changing hg and hy. The 4-tuple of integers (1, 5, 3, 7) is obtained from the 4-tuple
(1, 7, 3, 5) by interchanging 5 and 7.
We know that the sum of two linear functions is a linear function. From
this fact and the definition of multilinear function, the sum M + N of two
r-linear functions M and N isr-linear. If M and N are alternating, then MZ ++ N
is alternating. Similarly, if c is a scalar then cM is r-linear when MM is r-linear
and alternating when M is alternating.
Let (H})* denote the set of all alternating, r-linear functions with domain
iE” x --- x H”. By the remarks just made, (#7)* satisfies the axioms for a
vector space. Let us now prove two propositions which enable us to find the
dimension of (/7')* and a basis for it.
An r-tuple (h,,...,h,) is called linearly dependent if there exist scalars
c},..., cc", not all 0, such that c’h, + ----+ c’h, = 0.
Proposition 18. Let M be r-linear and alternating. Jf (hy,...,h,) is a
linearly dependent r-tuple, then M(h,,...,h,) = 0.
Proof. First of all, the conclusion is true if some vector in the r-tuple is
repeated. For instance, suppose that h; = hg. Since M is alternating,
In the Ith term on the right-hand side, the vector h; is repeated, and hence
each term is 0. Thus M(h,,...,h,) = 0.1
For any r > 2 there is the trivial alternating r-linear function 0, which
has the value 0 for every r-tuple (hy,...,h,). If r > n, then (hi,...,h,)
must be linearly dependent and from the proposition we get the following.
= (Gin ee OP
where 1 < i <n for each Kk=1,...,r. There are n” such r-tuples of
integers. If 7; <--- < 7,, then 2 is called an increasing r-tuple. There are
(") increasing r-tuples, where (7) = n!/r!(n — r)!is a binomial coefficient. We
write >>), for a sum over all r-tuples and >',; for a sum over all increasing
r-tuples.
The following generalization of the Kronecker symbol 6; will be used.
Let X = (%j,...-,%), # = (1,---, 7,7) be r-tuples of, mtegers. Then dit is an
element of an r X r matrix; and is 1 if 2, = j;, 0 otherwise. Let
5, = det (65*).
The important properties of 5. are:
(1) If no integer is repeated in the r-tuple \ and w = X, then 5. == Ik
In this case 4, = 7, if and only if k = J. Hence 5. = det (sf) = 1.
(2) If no integer is repeated in the r-tuple u and d is obtained from pw by p
interchanges, then 5n =a)
Hach interchange of elements of « interchanges two column vectors of the
matrix (55*) and changes the sign of the determinant. Therefore (2) follows
from (1).
(3) In all other cases, 5. = (0)
If some integer is repeated in yw, then two column vectors of the matrix are
the same and the determinant is 0. If the integers 7,,..., 7, are distinct and
some 2; does not appear among them, then the kth row covector of the matrix
is 0 and the determinant is 0.
Now let M be an alternating r-linear function. For brevity let us set
oy, = Qi, --.dp
6-1 Alternating Multilinear Functions 209
Sometimes we will still write w,,...;, rather than w, particularly when r < 3
or r =n. If X is obtained from yu by one interchange, then w, = —w,. In
particular, w, = 0 if any integer is repeated. If \ is obtained from p by p
interchanges, then w, = (—1)?w, = dw).
If » has no repetitions, then exactly one increasing \ is obtained from pu
by interchanges. Hence for every p,
Oy = Dy ordy, (6-4)
[A]
Let us now consider some particular elements of the space (Z7)*. For each
r-tuple \ = (21, ...,%,) let e* be the function such that
since by formula (6-3) both sides have the same value for each pair of vectors
(h, k). This is a particular case of the following.
210 Exterior Algebra and Differential Calculus 6-2
But M and M are r-linear and have the same value w, at (e;,,..., €;,) for
each yp. By (6-2b) M = M.4
PROBLEMS
i, eng = H ihel
651, 851, 6214, 255, 425.
2. Letn = 4,r = 3, wi23 = 2, w134 = —1, and w, = 0 for every other increasing
triple \. Find M(e4, e1 — e3, e2 + e3).
3. Show that:
Ke 1 m
(@) SEX, = REE
6-2 MULTICOVECTORS
Let us now introduce a different name and a different notation for alter-
nating, multilinear functions.
We observed in the last section that the set (#”)* of all r-covectors satisfies
the axioms for a vector space. When r > n, its only element is 0 by Proposi-
tion 18. When 1 < r < n, Proposition 19 states that if w is any r-covector, then
wo
= Di ordy
[A]
and set |w|? = w-w. The standard basis elements are orthonormal with
respect to this inner product.
Another important product is the exterior product, denoted by the sym-
bol A. The exterior product of an r-covector and an s-covector is an
(r + s)-covector, defined as follows: If
CR eae
e Ae =e” (6-9)
212 Exterior Algebra and Differential Calculus 6-2
1) @+HAn=@An+E A yy).
(2) (cw) A § = cw A §).
(3) § A w= (—1)"w A §, of w has degree r and ¢ has degree s.
(4) GE Aw) An=o6A WA»).
Proof. The proof of (1) and (2) is almost immediate from the definition
and is left to the reader (Problem 8). To prove (3),
vr = (Ghia ew re eee
eh = et Ae Ae”. (6-11)
Since both sides of (6-9) change sign under interchanges in \ or v, formula
(6-9) is also true for nonincreasing r-tuples. Thus (6-11) is valid whether \
is Increasing or not.
2153 1235
e” A (3e! -— 2e”) Veer eee (3e7! — 2e”") Ne m= 3e = 3e
*Remarks. The exterior product has been defined in terms of the standard
bases. It is not clear that it is “coordinate free,” in other words, that the same
exterior product would be obtained starting from different bases. However,
let us add one additional property to the list (1)—(4):
(5) If w =a! A--- A a’, then w(hy,...,h,) = det (a*-h;) for every
T-i ple (Dye ap):
This property of the exterior product will be proved in Section 6-3 [see
(6-12), (6-14)]. Formula (6-11) is a special case of (5). This is seen by taking
a” — e* and recalling (6-5a). Moreover, (6-9) is a consequence of (6-11) and
the associative law (4). Once the product is known for basis elements, Proper-
ties (1) and (2) determine it in general. Thus A is the only product with
Properties (1)—(5). In fact, (3) can be omitted from the list since it follows
from the other four. Since none of these five properties refers to bases, the
exterior product is coordinate free.
The sum of an 7-covector and an s-covector has been defined only when
yr = s. However, one may form the direct sum
PROBLEMS
1. Write down the standard basis for (##)* for each r = 1, 2, 3, 4. Find all products
e* A e’ where = (2) and vy = (j,k, l) is an increasing triple.
Y, Ihe me = B, Sumyollnys
(a) (2e! — e?) A (8e?-+ e). (ay) CHIN Ce
(c) (et — e? + 3e?) A e?!. (d) (e73 + e3!) A (Se! — e?).
3. Let n = 5. Simplify:
(a) e253 Ix (el4 + er, (b) (e? + e°) A el A (e® sod e*).
aNbtbAc+cA
a= (a—b)A (b—c).
6. Show that if w=a IX b, then WijWer—E WinW1; WiIWj, = 0 for Dy De k, j=
1,...,n. [Hint: Using Problem 4,
since the first row is a linear combination of the second and third rows.]
7. Show that if }’;,) cye* = 0, thenc, = 0 for every increasing \. [Hint: See (6-6).]
8. Prove (1) and (2) of Proposition 20.
9. Prove the associative law (4) of Proposition 20, using (1), (2), and (6-10).
10. Show thatw A ¢ A 4» = —n A § A @ if w has degree 7, 7 has degree t, and
both r, t are odd.
6-3 MULTIVECTORS
If U is any vector space, then alternating r-linear functions on U X +--+ X U
can be defined just as in Section 6-1 where we took U = E”. Let us now take
=) (2)* the dualispace to £”.
Definition. A multivector of degree r is an alternating r-linear function with
domain the r-fold cartesian product (H”)* K +++ & (B")*.
6-3 Multivectors 215
For brevity, multivectors of degree r are called r-vectors. They will usually
be denoted by the Greek letters a or 8. When r = 1, the 1-linear functions
on (H”)* are identified with the elements of H” in the way explained in Section
A-2. Then a 1-vector is just a vector, and will be denoted as usual by x or h.
For every statement about multicovectors in Sections 6-1 and 6-2, there
is a dual statement about multivectors obtained by everywhere exchanging
the words “vector” and “covector.” For instance, if @ is an r-vector and
= (11, 6.0. 559 ty)’, let
a ==! oC e'r).
TaBLE 6-1
r-vectors r-covectors
n joe *
Elements of E, (E;)
@(Cj,,..-,€j,) = wey = a,
for every u. Hence & = w. §
Proposition 22. Leta = hy A --- A h,andw=a! A -:: A a’. Then
for every r-tuple \ = (11,..., 7),
The formulas (6—-13a), (6-13b), and (6-14) may not provide the easiest
way to compute the components and the scalar product in numerical examples.
For instance, see Examples (1) and (2) below. However, they are important
for various other reasons.
The only difference between the notions of basis and frame is that the
latter takes into account the order in which the basis vectors hy,...,h, are
written.
Theorem 19. (a) An r-tuple (hy,...,h,) 7s linearly dependent if and only
af hy IK 922 IX h, = 0.
w-a
= det (8%) = 1.
Hence a ¥ 0.
Proof of (b). Each hf is a linear combi- hj <7
nation oleh 7 R,,
= OS my) Cee i
m=1
where a’ = hi A --- A hy. The matrix on the right is the product of the
matrices (a* -h,,) and (c7"). Hence if c = det (.
Example 1. Show that 2e; + 3e2 — e3, e1 + 2e2, e; — 2e3 are linearly dependent.
Their exterior product is
Let us say that the frame (hy,...,h,) has orientation ao if c > 0, and
orientation —ay ifc < 0.
If two vectors in the frame (h,,..., h,) are interchanged, then the exterior
product h, A --- A h, changes sign. Thus the orientation of a frame changes
under interchanges.
In Example (2), (11)~/?(e12 — €13 — 3€93) is an orientation. The frame
(e, + 3e3, €2 — e3) has this orientation.
Let r= n. Then P = EH” and +e)..., are the two orientations. Let us
call e;..., the standard, or positive, orientation of H” and —e,..., the negative
orientation of EH”. When r < n we do not attempt to call one orientation of P
positive and the other negative. If (h,,...,h,) is a frame for H”, then by
(6-13a) :
The frame has positive orientation if det (kh?) > 0 and negative orientation
if det (hi) < 0.
Measure for r-parallelepipeds. It was shown in Section 5-7 that if K is
an n-parallelepiped spanned by hy,...,h, with xo as vertex, then
Ke xerxya—
ox, a be < i << L ke boatl
k=1
is the r-parallelepiped spanned by hy, ... , h; with xo as vertex.
Measure for r-simplices. Let S be an r-simplex with vertices xo, ... , X;.
Let hy, = xz, — Xo, k = 1,...,7r. Reasoning as in Section 5-7, we have
a 7
S={x:x=
xo + >) thy,t > 0 fork = Lae tees
k=1 k=1
6-3 Multivectors 221
il
VS) = Al Ihy A-+: A hy. (6-17)
Both (6-16) and (6-17) are very special cases of a general formula (7-5)
in Chapter 7 for r-dimensional measure.
the components are a1? = —3, a13 = 3, a3 = 1. Since |a|? = D1) (e)2, the
area is /19/2. The area can also be calculated from formula (6-18) below.
Taking square roots, we get a formula for V,(K). If h,,...,h, are mutually
orthogonal, then h,-h,; = 0 for k ¥ 1 and det (hy-h;) = |h,|?--- |h,|?.. In
this case V,(K) = |h,| --- |h,| as required.
PROBLEMS
1. Simplify (n = 6):
(a) es A es A e24.
(b) eg A e3 A e6e.
(c) e1 A (e14 + e684).
(d) (e; + 3e4 — e6) A (2e23 + e36) A e45.
(e) (e12 + e13) A (e34 + e25) A (e56 + e468).
2. Evaluate the indicated scalar products (n = 4), using (6-14).
(a) (et + e?) - (e1 + eg).
(b) e!? + e34.
(c) e!94- (e431 + 3e124).
(d) (e1 — e*) A (e?-+ e*) - (ex + 2e4) A (e2 — 2e4).
3. Using Theorem 19 show that (2e; + e3, e2 + e4, e1 + e4, e3 + e4) is a frame
for E*. What is its orientation?
222 Exterior Algebra and Differential Calculus 6-4
>, (-1)'8; = 0
i=0
—_
k
FigurEe 6-2
Since L, is to be linear, its value at any B is determined once the values at the
basis elements e, are known. For any B = Yo, BYe,
A OPS NS AG (6-21)
[H]
If any integer is repeated in the (7 + s)-tuple (u, v), then this is 0. Otherwise,
the right-hand side is (—1)”L,4s(e,) where 7 is the increasing (r + s)-tuple
obtained from (u,v) by p interchanges. Since e,, = (—1)?e, and L,4, is
linear, (—1)?L,4.(€,) = Lr+e(€,,). Thus
Therefore (6—22a) is correct for basis of elements of H;” and HY’. Since each of
these transformations is linear, (6-22a) then holds in general.
By induction there is a generalization of (6-22a) for products of any num-
ber of multivectors. In particular, in this way we get the required formula
(6-19) for products of vectors.
Let 6 € E” and aw = L,(8). Let us find a formula for the components a*
ineterme olathe componentsion 8. SIL \ =="(21,250a5 27) =) Gi,=- », Jr), and
(c}) is the matrix of L, let
= det (c}').
224 Exterior Algebra and Differential Calculus 6-4
By (6-138a), @ is the Ath component of vj, A --* A vj, By (6-20) and (6-21)
{HJ .
The left-hand side equals én, and the right-hand side is the wth component of
L*(e*). Since both sides of (6-20b) have the same components oF they are equal.
T’rom (6-20b) the formulas dual to (6-19)—(6—23a) follow in the same way
as before.
PROBLEMS
1. Let m = 2,n = 3, L(s, 2t) = (s — 2t)ey — seo -+ (28+ 3t)e3. Find:
(a) L*(a). (b) cf). (c) Le(@).
(d) L3(). Opi Gay
2. Prove the dual of (6—-22a):
= et ene: (6-23b)
IM
4. Let L be an orthogonal transformation of Z”. Show that |L,(@)| = lal:
(a) If w is decomposable. [Hint: (6-18).]
(b) For any r-vector @.
6-5 Differential Forms 225
The values of w are r-covectors. The same Greek letters w and ¢ used in
the last section to denote r-covectors are now used to denote differential forms.
The context will indicate clearly which is intended.
For brevity we say “r-form” instead of “differential form of degree r.”
It is convenient to call any real-valued function f a 0-form. If r > n, then the
only r-form is the one which has the value 0 for every x € D. We also use 0
to denote this r-form.
Let w be an r-form and ¢ be an s-form, with the same domain D. The
exterior product w /\ ¢ is the (r + s)-form defined by
(w A §)(x) = w(x) A F)
for every x € D. Similarly, fw is the r-form such that
(Strictly speaking, the 2-form dx’ A dz’ has domain EH”, and we mean here
its restriction to D.)
226 Exterior Algebra and Differential Calculus 6-5
Similarly, for any r-tuple \ = (i1,...,7%,) the r-form dx! A +++ A dc'*
has constant value e*. Hence if w is an 7-form, then
where the value of w, at x is w,(x). Using Problem 4, Section 6-1, one can
also write
‘= 4 X W) die Ns Ande
dw = dM A dx+dNA dy a ee dy.
Example 2. If r = n, then
1D he IN SOS IN ine
Let us first show that
dfEs A BY) =dfA EB A EB. (*)
If any integer is repeated in the (7 ++ s)-tuple \, v then both sides are 0. Other-
wise, E* A E” = (—1)?E’ where 7 is increasing. By definition, d(fE’) =
df A E’. Multiplying both sides by (—1)” we get (*). Now
Ot = DS CA
(AI)
dE a1) Baan de
The scalar-valued function ¢, commutes with any differential form. Hence
while
Similarly,
(-1)" >> (@E”) A dt A EB’) = (-1)o A a,
[Ally]
which proves (2).
If f is of class C, then from (6-27)
The form E* has constant coefficients and hence dE* = 0. Using the product
rule (2), d(df A E*) = 0. Taking f = , and using (1),
Let d’ also have these four properties. Then d’/E* = d'(d’x"! A E*), where
= (to,...,%,). But dx’ = d’x' by (4) since dz’ stands for the differential of
the coordinate function X*. Using (2), (3), and induction on r, d/E* = 0.
Using (2) and (4), d’(fE*) = df A E*. Using (1),
pido, wer
[A]
Thus d’w = dw for every w of class C. In particular, this proves that the
exterior differential d is “coordinate free.”
Transformation law for differential forms. Let g be a transformation of
class CY from an open set A C E™ into EB". Let D be an open set containing
6-5 Differential Forms 229
g(A)
g D
—_—_—_P
w # —__. @
Figure 6-3
the image g(A). If w is any r-form with domain D, then there is a corresponding
r-form denoted by w’ with domain A. Formally, w' is obtained by merely
substituting g(t) for x and dg’ for dx’. The precise definition of w’ is as follows.
Then (4) follows from this and (2). To prove (5) we have from (1)-(4)
GS ee)
The matrix of Dg(t) is (gi(t)), and the row covectors are dg'(t),...,dg"(t).
By (6-13b) the uth component of dg! A +--+ A dg’” is On: Therefore
Ge fh Gey
fowl= a(g', 9°)
FIC‘) ds /\ dt.
=feog ie
ne Sanh)5 dt) A+++ A at’
er:
Example 6. Let m = r = 1. Then w(t) = w[g(t)] - g’(t), and the definition (3-8a)
of the line integral can be rewritten
Does
acral
j= wo.
6-5 Differential Forms 231
PROBLEMS
Assume that all forms which appear are of class C).
. Find the exterior differential of:
(a) «2y dy — xy? dz.
(b) cos (ry?) dx A dz.
(c) f(a, z) dz.
(d) ady A dz+ydz fA dx-+2zdz A dy.
. Let P,Q, R have domain D C E%. Show that
. (a) Find an (n — 1)-form ¢ such that df = dz! A -++ A dx”. [Hint: Prob-
lem 1(d).]
(b) Find an (r — 1)-form ¢ such that d?* = E?.
(c) Show that if the coefficients w, in (6-25) are constant functions, then w is
exact.
. (a) Show that if w and ¢ are closed differential forms, then w A ¢ is closed.
(b) Show that if w is closed and ¢ is exact, thenw / ¢ is exact.
. Find the exterior differential of:
(a) "dw \ > —w i dg.
(b) dwA SEAn+w di A n+owA ¢ A dy, if w and § are of even degree,
. A function f is an integrating factor for a 1-form w if f(x) + 0 for every x € D
and fw is closed. Show that if w has an integrating factor thenw A dw = 0.
_.Letn = m=2,r=1, 0 = Mdzr+Ndy. Find explicitly dw and w? and
verify that (dw)? = dw’.
. Letn = m = 8, g(s, t, uw) = (scost)er + (ssin t)e2 + ue3. Find:
(a) (fdx A dy A dz)*. (b) (a dy A dz)*.
. Show that if w is a 2-form, then
0; Oox: , Iw
dw = >> (aay tea Beit) a’ A dx’ \ da’.
i<j<k
10. Let w!,...,w? be 1-forms such that w* = )°7_ it dg ate lee pe Assume
that the functions f} are of class C™, the g’ are of class C, and that the 1-covectors
ENG@ eset » 0? (x) are linearly independent for every x € D. Find 1-forms 0;
such that dw' = >?_,0; A wi. [Hint: The p X p matrix (fj(x)) must be
nonsingular.]
232 Exterior Algebra and Differential Calculus 6-6
Let us show that *a@ has Properties (1), (2), and (3) above. Given a frame
(hi,...,h, 1), these three properties determine a vector, which we denote
temporarily by h. Let (hj, ...,h),_,) be an orthogonal frame for P, hi,- hy = 0
if k ~ 1. Then @ is a scalar multiple 6 of hi A --- A hi_, and replacing
6-6 The Adjoint and Codifferential 233
h aC — (Hahje.s),
and hence h-h = c = [h| |h|. Equality holds in Cauchy’s inequality (Sec-
tion 1-1) and therefore h is a positive scalar multiple of h. By (3), |h| = {hl,
and hence h = h as required.
We can now show that every (n — 1)-vector @ is decomposable. Let
a ~ 0, and let h = *a. The vector h is normal to an (n — 1)-dimensional
subspace P. Let @ be an (n — 1)-vector of P whose orientation and norm
are chosen such that @ and h are related by (1)—(3). Then @ is decomposable
and h = *a@. Thus *& = *a, which by (6-32a) implies that @ = a@.
If w is an (n — 1)-covector, then *w is the 1-covector whose components
are given by the dual to (6-32a):
div ¢ =
da} Ox”
The remainder of this section will not be used in Chapter 7. Let us define
*a for any r-vector a when 0 <r <n. If r = 0 orn we set
OEE ae TAY) =O
€ = BN".
It is +1, depending on whether an odd or an even number of interchanges
puts )’,\ in increasing order. If @ is any r-vector, then its adjoint is the
(n — r)-vector *a whose components satisfy
kkgy = (Ae es
GA ey, 96 Co,
which is 0 if vy ¥ 2’ and is ee1..., if v = . If B is any (n — r)-vector, then
Example 2. *(f dx A --- A dx'r) = eyf dx’1 A +++ A dxin-r,whered’ = (j1,..., jn—r)-
For7 = 0 or n,
Notes. Many authors define the adjoint so that in (2) above (hy,...,h,,
h,ii,.--,h,) is a positively oriented frame for H”. When r(n — r) is odd,
according to that definition *a has opposite sign to the one here.
The definition of the adjoint involves the euclidean norm. Hence both
the adjoint and the codifferential depend on the euclidean structure inherited
by E” and (#”)* from the euclidean inner product in #”; while the notions
of A and d actually depend only on the vector space structure and not the inner
product.
In riemannian geometry one is provided at each point x with an inner
product B,, not necessarily the euclidean inner product. The definition of
adjoint must be modified accordingly. The codifferential is again defined by
(6-37a). However, the formula (6-33) for the divergence and its generalization
(Problem 5) must be modified. See [9] and Chap. V of [17].
236 Exterior Algebra and Differential Calculus 6-7
PROBLEMS
1. Let n = 2. Show that
(a) th = h?e, — hes.
(b) *(M dx + N dy) = N dx — M ay. \
(c) *d(N dx — M dy) = —(OM/dx + ON/dy).
2. (a) Let n = 3, and w = Pdy A dz+Qdz A dx+ Rdx A dy be a 2-form.
Show that
ty = Pde +Qdy+ R dz, dw = (0P/dx + 0Q/dy + OR/dz) dx A dy 4A dz.
i—1 n
w= Do wvde' A--- A de Ade” Aw Kae,
seat
and let ¢ = *w. Show that dw = div¢dz! A -:- A da”.
3. Show that: (a) div (df) is the Laplacian of f. (b) div(fw) = fdivw + df-o,
where (¢- w)(x) = ¢(x) - w(x) for 1-forms §, w.
See Fig. 6-4. The cross product distributes with vector addition and scalar
multiplication, and hy X hy = —h,; X hg. However, it is not associative.
6-7 Special Results for n = 3 237
The triple scalar product of three vectors is denoted by [hy, hs, hg]. It is
given by
[hi, ho, hg] = *(hy A he A hg). (6-39)
Its absolute value equals |hy A hy A hg], which is the volume of the paral-
lelepiped spanned by hy, he, hg with vertex 0. The sign of the triple scalar
product is positive if (hy, hy, hs) is a positively oriented frame for E* and
negative if this frame is negatively oriented.
When n = 3, r(m — r) is always even and (—1)"™-” = 1. Then
*(h; X he) = hy A he. Using Problem 4(a), Section 6-6,
Example. Show that div (curlw) = 0 for every 1-form w of class C). Using the fact
that d* = *d (formula 6-37b),
div (curlw) = d(*dw) = *d(dw) = *0 = 0.
PROBLEMS
Assume that all forms are of class C®.
1. Show that:
(a) h Xk = —k xX h. (b) h X (k1 + ko) = hX ki +h X ko.
2. Let w = Mdx+ Ndy+ Odz. Show that curl w = (00/dy — ON/0z) dx +
(0M /dz — 00/dx) dy + (ON /dx — OM /dy) dz.
3. Find e; X e; for all pairs 7,7 = 1, 2, 3.
4. With the aid of (6-38a) and (6-40), show that:
(a) div (¢ X w) = Oif ¢ and w are closed. 4 (b) curl (fw) = feurlw + df X w.
(c) curl (f df) = 0. (d) curl (§ X w) = dF A @).
(e) curl (curlw) = d(divw) — Lapl w, where Lapl (Mdz-+ N dy+ Odz) =
(Lapl M) dx + (Lapl N) dy + (Lap! O) dz and Laplf is the Laplacian of the
function f.
(f) ¢-curlw — w-curlg = div (w X ¢). (Hint: By the dual to (6-36), § - *dw =
#(€ A dw).J
238 Exterior Algebra and Differential Calculus 6-7
5. Show that:
(a) hy X (he X hg) = (hi -h3)he — (hy-he)hg. ([Hint: Since both sides are
trilinear in (hy, hg, hg) it suffices to prove this when hy, he, hg are standard
basis vectors. Use Problem 3.] .
(b) The cross product is not associative.
(c) (hi X he) X (h3 X ha) = [hi, he, haJh3 — [h1, ho, haha.
6. Let w = (#; dz! + Ee dx?+ E3 dx?) A dx*+ By dx? A dx? + Be dr? A dx! +
B3 dz! A dx?, where the functions B;, E; are of class C‘ on an open subset of
E*, Show that dw = 0 if and only if curl E + 0B/dz* = 0, divB = 0. Here
curl and div are taken in the variables (x!, x”, 23). [Note: The equation dw = 0
represents one-half of Maxwell’s equations for an electromagnetic field in free
space. The functions £;, H2, H3 represent the electrical components of the field
and B,, Be, B3 the components of a magnetic induction vector. There is a similar
equation which represents the other half of Maxwell’s equations. See [9], p. 45.]
CHAPTER ;
Integration on Manifolds
Then g is of class C‘ from R onto S and is univalent. The vectors 0g/dx and dg/dy
give a basis for the tangent space 7's[g(z, y)]. By (7-2), Jge(x,y) = |dg/dx A dg/dy|.
See Fig. 7-1.
Calculating these partial derivatives, we z
get
g(x,y)
Og dg Og 0p
Aen bale ny re wey
F = (X|S, Y|S).
Since X and Y are of class C“, F is of class C“ on S. Moreover, F is univalent; in
fact, F = g—!. Since Fog is the identity transformation, J(F°g) = 1. By (7-8),
1
JF (x) = >0, ifx = g(z,y).
Je(x, y)
Thus F is also a regular transformation.
g(x) = ot eae a eA 8
ately
22OC ae
where .
Xe (cr), 2).
Then g is of class C™ and univalent. The explicit formula for Jg(x*) is complicated.
However, we can show that Jg(x*) > 0 as follows. Since
n—r l
This r-vector is not 0 since its Ath component (the coefficient of e,) is 1. Hence
Jg(x,) > 0, which shows that g is regular.
244 Integration on Manifolds : U\
Proof. We know already that the theorem is true in the following two
particular cases:
(a) If @ is a flat regular transformation from an open set A C E’ into EH’,
then by the inverse function theorem $(A) is open and @~' is regular. (b) If g
is as in Example 3 and S = g(R) = M nO U,, where U, is a neighborhood of
some Xo € M, then g(R) is open relative to M and F = g™! is regular.
In the general case, let g be regular from N into M. Let to) € N and
Xo = g(to). By the implicit function theorem, there exist an increasing r-tuple
\ and a relative neighborhood S of xo such that F = X |S is regular. The
set R = F(S) is open and F~’ is regular from R onto S. (In Example 3, F~!
was denoted by g.) Similarly, there exist a relative neighborhood S’ of to and
F’ regular from S’ onto an open set R’ C E” such that (F’)~! is also regular.
Since g is continuous, we may arrange that g(S’) C S. Consider the transfor-
mation @ = Fo g- (F’)| from R’ into R. By the corollary to Proposition 28,
¢ is regular. Since ¢ is flat, @(R’) is open and $7! is continuous. Let S; be a
relative neighborhood of xo such that F(S;) C ¢(R’). If x € S81, then x = g(t)
for t = ((F’)~' > @ '-F)(x). Therefore S; C g(N). Moreover (see Fig. 7-2),
2 S
|
| Xo
|
z I \ M
! |
¢
Se
eensee
R
EigurRE 7—2
T1 Regular Transformations 245
Corollary 1. Jf g satisfies (1), (2), and (3) and g' is continuous, then
g(N) is an r-manifold and g 1s regular.
Proof. Let B CN be relatively open. Then g(B) = (g~')~'(B) is open
relative to g(N) by Proposition A-6. Therefore g is an open transformation. J
PROBLEMS
1. For each of the following transformations from A C E? into E?, find Jg(s, ¢) and
g(A). Show that g is univalent.
(a) g(s,t) = (s+ tei + (s — 3t)e2 + (—2s+ 2¢+ 2)e3,A = {(s,):0<s+t
Leib SA? SS OPS
(b) g(p, 0) = (p cos a)ex + (p sin @ cos A)eg + (p sin @ sin Aes, O< a < 1/2 (a
fixed), A = (0, 0) x (0) 2).
(c) g(s,t) = ste; + seg-+ te3,A = E?.
2. Let G(s, ¢, w) = ase; ++ bteg + cues3, where a, b, and c are positive. Let N be
the sphere s?-+ ¢?-+ u? = 1 and M the ellipsoid 2?/a? + y?/b? + 22/c? = 1.
(a) Find Ty(t), Tu[G(t)], t = (s,t,u). Verify that the image of 7'y(t) under
DG(t) is Tu[G(t)].
(b) Let g = G|N. Calculate Jjg(t) from (7-1) and show that g is regular from
N onto AM.
3. Let g(t) = (cos t)e1 + (sin 2f)e2, A = (0, 37/2). Sketch g(A). Show that g is
univalent, and that Jg(t) > 0, but that g(A) is not a 1-manifold. Why does this
not contradict Theorem 21?
4. Letn = 4,7 = 2, = (1, 2). Show that in Example 3, gg(z1, 2?) = [1 + |dg¢!|?+
|dp?|? + (Hibs — $i$3)7]”?.
5. Let A C H” be open and bounded. Let g be continuous and univalent on cl A.
Show that if g/A is of class C™ and Jgg(t) > 0 for every t € A, then g(A) is an
r-manifold and g|A is regular. [Hint: Problem 8(d), Section A-8, and Corollary 1.]
6. Let g be of class C from an r-manifold N into EZ”, and let Jg(to) > 0. Show that
to has a neighborhood Q such that g(N Q) is an r-manifold and g|(V NQ) is
regular. [Hints: First consider the case N C E*. By generalizing Proposition 14,
Section 4-3, find a neighborhood Qo such that g is univalent on NA clQo. Use
Problem 5.]
UD Coordinate Systems on Manifolds 247
By Theorem 20, the set A = F(S) is an open subset of #” and the trans-
formation g = F~! is regular from A onto S. It will be g rather than F which
is ordinarily used for calculations in the sections to follow.
Example 1. Let us return to the three examples in Section 7-1. In the first of them,
the function F = g~! is a coordinate for the open simple are S = g(A). The co-
ordinate of a point x = g(¢) is ¢ in this system.
In the second of those examples, F is a coordinate system for S. The coordinates
of a point (2, y, (2, y)) € S in this system are x, y. In the third example, F = X|S.
The coordinates of a point x in the coordinate system F are v4,...,a'. Such a
coordinate system will be called cartesian.
FIGuRE 7-3
248 Integration on Manifolds U7)
Example 3. Let (R, 0!,..., @”—!) be the spherical coordinate system for the open
set D in Example 5, p. 181. The complement of D is a null set. Setting R(x) =
we get, according to Example 2, a spherical coordinate system for the intersection
of D with the unit (n — 1)-sphere S"—!. It turns out that S*~! — Dis null in dimen-
sion n — 1, in the sense to be explained in the néxt section. Consequently, this
coordinate system can be used to evaluate integrals over S”—!.
=F.(g|Ao)
Figure 7-4
7-2 Coordinate Systems on Manifolds 249
PROBLEMS
1. Let M = {(a,y, 2): 22+ 24y+ 2? = 3,2 >0,y > |z|}. Let F = (x M, Y|M)
and F = (X|M,Z|M). Describe A, A, g, g, and @ (see Fig. 7-4).
2. Let Mo = {qy? + 27, y, z):y > 0} and let Fa, y, 2) = (y+ z, exp z) for
(x, y, 2) € M. Show that F is a coordinate system for MM and find F(M). [Hint:
First take y and z as coordinates on M and then find a suitable coordinate change
@ giving the system F.]
3. Stereographic projection. Let M be the sphere «7+ y?+ (¢ —1)? = 1. For
each x = (2, y, 2) € M except the “north pole” 2e3, let (s, t, 0) be the point where
the line through 2e3 and x meets the plane z = 0. Let F(x) = (s, é).
(a) Show that F is a coordinate system for MZ — {2e3}.
(b) Let hy, he be tangent vectors to M at x, and let k; = DF(x)(h)), / = 1, 2.
Show that the angle between k; and kg equals the angle between hy and hg.
250 Integration on Manifolds T=}
Xo = (to)
h, = agy(to)
Figure 7-5
In the exact formula Z is replaced by the set B= g—1(A), and the sum by
an integral.
The integral over B is taken in the sense of Section 5-6. By (7-2), Jg(t) =
lgi(t) A --: A g,(t)|. Since g is of class C"”, gg is continuous. Hence (f » g) gg
is continuous. If f > 0, then the integral over B either exists or diverges to
+c. When the latter occurs we agree that the integral of f over A also diverges
to +o.
We must show that the integral does not depend on the particular choice
of coordinate system. Let 8 be another coordinate patch such that A C Ss,
and let F be a coordinate system for S. Let us adopt the notation of Fig. 7-4.
Then g = £>° 9, and by (7-3)
iets en
Let B = F(A). Then B = ¢#(B) and by the transformation formula for
integrals (Theorem 17)
Example 2. Let J/ be a 1-manifold and B be a closed interval [a, 6]. Using the ter-
minology of Section 3-2, A is the trace of the simple arc Y represented on [a, 6] by g.
Formula (7-4) becomes
b
[fe
A
avi = [fle
a
le’) a.
The right-hand side is J, f ds, as defined on p. 85.
*Note. The line integral f,fds was defined in Section 3-2 without requiring that
Y be simple. If Y¥ is not simple, then its trace A = g({a, 6]) need not be contained
in a l-manifold. There is a more general notion of r-dimensional measure and integral
for sets which are not necessarily subsets of an r-manifold. The general formula which
becomes for simple ares the one in Example 2 is
b
[ f@N@) avi) = [lel |e") at
where V(x) is the multiplicity of the point x. For any r > 1 there is a similar formula
The image of a small square [¢,¢-+ a] X [6,@-+ a] in the (4, @) plane is a small
sector of the hemisphere which is approximately a rectangle of side lengths a and
asin ¢. Since |dg/dd A dg/d6|a? is approximately the area of the sector, this suggests
that Jg(¢, 0) = |dg/d¢ A dg/06| = sing. The reader should check this formula
(Problem 3). If we take B = (0, 7/2) X (0, 27) then H — g(B) is an arc of a great
circle corresponding to 6 = 0. This arc is 2-dimensionally null in the sense defined
below, and hence
[100
H
aVa(x) =fg(B) f0x) aac = f0hia i0~" fle(; 8] sin
o48.
Let us turn to the general case when A is not necessarily contained in some
coordinate patch. To simplify matters let us at first assume that J is a compact
manifold and f is continuous everywhere on M. The traditional way to proceed
As to dissect / into a finite number of nonoverlapping pieces S;,..., Sm each
of which has a coordinate system, with fr S; M fr S; contained in a finite union
of (r — 1)-manifolds and M = el 8; U--- UelS,. Then
Partition of unity. Let us recall from Section 5-3 that the support of a
function wy is the smallest closed set outside of which y(x) = 0. Let us first
find for every Xo and r > 0a function y of class C‘“’ on E” such that ¥(x) > 0
on the open r-ball with center xp and the support of y is the closed r-ball with
center Xo. In fact, let
—I]
h(a) = exp Cy — <a << Il.
x)= 0, |x|
>1.
From the example at the end of Section 2-3 and the composite function
theorem, h is of class C on E!. Let
yo) = n(Z—
A).
254 Integration on Manifolds V=8
Wi(X)
=; =
BiGhe seen ’
Somlees
ke ;
ee EM.
| Pa. = fdV;.
A ANK
In particular, if {¢1,..., @m} 1s a partition of unity, then for any f the support
of fo, is compact and lies in some coordinate patch. Hence the integral of
fx is defined.
/A jie = ae
> i]
A
fox dV, (7-7)
f\xx) = 8 Oo) ee
k=1
3 Measure and Integration on Manifolds 255
Since the support of fx; is contained in some coordinate patch, its integral
over A can be written according to (7-4) as an integral over a set BC EK’.
By Theorem 13, the integral over B of a finite sum is the sum of the integrals.
Hence
SS | raav, = 5 ss | rx dV,
— ti — i
PROBLEMS
V2(A)
= ago)? ,I aaa)
fi a(s, t) a(s, t)
| Eaten
a(s, f) dV2(s, t).
3. In Example 3 calculate dg/d¢ A 0g/00 and verify that its norm is sin @.
Moments, centroids. These are defined in the same way as forr = n. For example,
if A has positive r-dimensional measure then the components of its centroid are
te el i & 5
y= Vid) le GVEA
X) et — oseen
7--4 Orientations; Integrals of r-Forms 257
Fiaure 7-6
x x+v(x)
FIGurE 7-7
Example 1. The Mébius strip. This is a 2-manifold M C KE? which is not orientable.
It may be visualized by twisting a strip of paper and pasting together the ends
(Fig. 7-7). The edge of the strip must be omitted in order that M be locally like E?.
The fact that a unit normal cannot be chosen continuously may be expressed more
picturesquely by saying that the Mébius strip is a surface with “only one side.”
Example 2. The Mobius strip is not a compact 2-manifold. An example of a compact,
nonorientable 2-manifold is the Klein bottle, or twisted torus. It is obtained by also
joining together the lateral edges of the rectangle used to make the Mobius strip, as
indicated in Fig. 7-8. The Klein bottle cannot be realized as a submanifold of E3,
since it can be proved that any compact (n — 1)-manifold M C E” is the boundary
of an open set and hence is orientable. However, the Klein bottle can be realized as a
submanifold of H*.
q
M/F q
7-4 Orientations; Integrals of r-Forms 259
[02
A°
= [,A 2@ - 0) aV.(x), (7-8)
provided w - o is integrable over A.
In particular, if M is compact, then w-o is continuous, bounded, and
hence integrable over any g-compact subset of M. The integral has the follow-
ing elementary properties:
© fo — fu
A° Ae
In (1), one can take more generally a finite number of r-forms w', ... , w”, or
more generally an infinite sequence w!, w”,... provided ~~, Sa |w*(x)| dV (x)
converges. Similarly, the generalization of (5) is still true if A = A; UA2U-->-,
where A;, Ag,... are disjoint o-compact sets and > ¢—1 Sak |w(x)| dV,(x)
converges.
The case r= n. Let At denote A with the positive orientation e4...n
of BH”. Let w = fdx’ A --: A de” be ann-form. Then
[fee A dy = [fo
[fax A dy= [fw A dz = —f fae.
Example 4. Let H be the hemisphere in Example 3, p. 253, oriented so that 0!2(x) > 0
for every x © H. The vector n(x) = *o(x) is normal to H, and its third compo-
nent n3(x) equals 0!?(x). We have oriented H so that the normal “points upward.”
If (f, 8) are spherical coordinates of x, and g is as on p. 253, then
Cay Ye
x) =
a6, 8) =
Se
sngcos¢ > 0.
—— # fos
es Loe wy
provided either integral exists.
Example 6. Let w = fdzu A --+ A dx‘, and let AC EH". From Section 6-5
ER ae, eae
ag", Sra? b} g'”) 1 r
i1 Sod ‘r= o Oe
ee de
es
fsa AOA INES Wie ei fy”
provided o is the orientation induced by g.
Continuing Example 4, we have for instance
: a / 4 a(g', meee
9°) do
| sae dy yt e 0(¢, 8) oh
PROBLEMS
1. Let A C E” have the positive orientation and let g be a regular flat transformation
from A into EH”.
(a) Show that g induces the positive orientation en g(A) if and only if Jg(t) > 0
for every tE A.
(b) Show that (7-10) becomes
1 n 1 n
[aha mix cad IK, ake = Le ° gJgdt A ie) se A dt ;
Example 1. Let D = {x:|x| < 1 or 1 < |x| < 2}. Then fr D is the union of two
concentric (n — 1)-spheres of radii 1 and 2. However, D is on both sides of the
inner (n — 1)-sphere.
Actually this example is rather artificial. If D is the interior of its closure, then
using the implicit function theorem it can be shown that D is on one side of fr D.
From the definition, all exterior normals at x are positive scalar multiples
of any particular one. We shall be principally concerned with the unit exterior
normal, which will be denoted by v(x) (|»(x)| = 1).
Let D be on one side of its boundary in U.
The vector n(x) = grad #(x) is normal to fr D
QuxeeuirD) OU.
Let y(t) = (x + tn(x)). Then y(0) = 0
and
Let us defer the proof until later in the section. The last assumption
means that w is the restriction to cl D of a form of class C”’ on some open
set Do containing cl D. The somewhat restrictive assumption (2) about fr D
is made to simplify the proof. The theorem is still true if fr D is not a mani-
fold but instead consists of a finite number of pieces of class C‘” intersecting in
sets of dimension n — 2. For example, if D is the interior of an n-cube then
the pieces are the faces, which are cubes of dimension n — 1 and intersect in
(n — 2)-dimensional cubes. This more general form of the divergence theorem
will be precisely stated at the end of the section. For certain special kinds of
sets D there is an easy proof of the theorem (Problems 5, 6).
The case n = 2. Suppose that fr D = C, U--++ UC» where each C;, is the
trace of a simple closed curve Y¥;, and C,,...,Cm are disjoint. The orientation
is chosen by selecting the unit tangent vector v(z, y) so that (»(z, y), v(z, y)) is
a positively oriented orthonormal frame for H?. Intuitively speaking this
means that as the boundary is traversed, D is always on the left. Then
[i Pie: Br yap
If we write w = M dx + N dy, then (7-1la) becomes
Se M dx - y= aN
Ov aM G
Pa ik (x " dx A dy. (7-12)
4 a
Figure 7-10
Let ¢ be a force field acting in some open set Do C E?, For each x € Do,
¢(x) is the force covector acting at x. For notational simplicity, let us set
M = fr D throughout the discussion to follow. The number Ji £(x) - v(x) dV 2(x)
is called the outward flux across the boundary M. The divergence theorem
expresses the outward flux as a volume integral over D. If D has small diameter
and contains x9, then the outward flux is approximately V3(D) div ¢(Xo).
To make this statement more precise let us state the following.
Teco = ihn
diam D0
rite iE¢(x) - (x) dVn_a (2). (7-13)
As another physical interpretation, consider a fluid flowing in an open set
Do C E?. Let t denote time and x = (x,y,z). Let p(x, t) be the density and
v(x, t) the velocity at x and time ¢. Let ¢ = pv. Suppose that D is regular and
cl D C Do. The left-hand side of (7-11b) represents the rate at which mass
is flowing out of D. Therefore, if m(t) is the mass of the fluid in D at time t,
then from the divergence theorem
dm :
~ ar {idiv (pv) dV 3.
On the other hand,
mit) = i.p(x, t) dV3(x).
Differentiating under the integral sign (Section 5-11),
im _ f200p ay,
dm _
US The Divergence Theorem 267
For each to the functions —div (pv) and dp/dt have the same integral over
every regular D with cl DC Do. By the lemma, for every xo € Do these
functions have the same value at (Xo, fo). In other words,
Op
Arias —div (pv).
If the density p is constant, then the fluid is incompressible. Thus for incom-
pressible fluids div v = 0 at every time ¢.
If div ¢(x) = 0 for every x in its domain Do, then ¢ is called divergence
free (or solenoidal). The divergence theorem has the following corollary.
[ev
M
Vn = D
(dd:
af+ ¢ Laplf]dVn. (7-14)
In the same way
if fey Vn
M
= if [df- do +f Lapl 4] dVn.
D
2
iL,
Sf Vari e= iC|df| an. (7-16)
When n = 3 the right-hand side often has (except for a suitable multiplicative
constant) the physical interpretation of energy.
If f is harmonic and f(x) = 0 for every x € M, then from (7-16) the integral
of the nonnegative continuous function |df|? is 0. Hence df(x) = 0 for every x € cl D.
Given xo € D let x; be a point of M nearest xo. The line joining xo and x, lies in
cl D, and from the mean value theoremf is constant on it. Since f(x1) = 0, we must
have f(xo) = 0. Thus f(x) = 0 for every x € el D.
Suppose that f and g are both of class C’) on cl D and harmonic, and that f(x) =
g(x) for every x € M. Then $(x) = f(x) — g(x) = 0 for x € M and ¢ is harmonic.
Hence $(x) = 0, and f(x) = g(x), for every x E€ cl D. This shows that there is at
most one harmonic function of class C® on cl D with given values on the boundary
M. It is more difficult to show that there is in fact a harmonic function f with given
boundary values. This is called Dtrichlet’s problem. If the boundary data f|M are
merely continuous, then f is continuous on cl D and of class C®) and harmonic on D.
See [14], Chap. XI. If the boundary data are smooth enough, then f is of class C
and harmonic on cl D. For instance this is true if J is of class C@) and f is of class C®
on M.
Let us now turn to the proof of the divergence theorem. The proof will
proceed by first proving the theorem when D is either H” or a half-space and
w has compact support. The general case will then be reduced to these two
by introducing local coordinates on fr D and a partition of unity. As before,
we may let ¢ = *w and may prove either of the two equivalent formulas
(7-1la) and (7-11b). As in Chapter 5, ff dV, denotes the integral of f over
all of E”.
far,
OC;
a |{f ORG
Bac a ave i! ),
where xe = (z*,...,2°",a't!,...,2”"). Since ¢; has compact support,
the inner integral is 0 by the fundamental theorem of calculus. Therefore
fd¢;/dx' dV, = 0. Summing from 1 to n we get the lemma. jf
Lemma 2. Lei H be the half-space {x : 2" < 0}, and let ¢ be as in Lemma 1.
Then
iLdiv ¢ dV, = itGaz’, 0) dVn ai(2').
Proof. If 7% <n, then fy 0¢;/dx' dV, = 0 as in the proof of Lemma 1.
Tor 1 = n we have
Abn ays
0
ip
iLOxn dV rat Ae arn dx |avs1(x’.
By the fundamental theorem of calculus the inner integral is ¢,(x’, 0), since
¢, has compact support. §f
Proof. Let g = f~'. The first assertion (a) follows from the fact that a
regular transformation f is a homeomorphism. Let to © N, and xo = g(t).
Let U and © be as in the definition, p. 263. We may assume that U C Dy.
Let W(t) = @[g(t)] for tef(U). Since Dg(t) has maximum rank n and
d&(x) # 0, the chain rule implies that d¥(t) # 0. The open set f{(U) contains
a neighborhood Q of to, and
(irAy a i= ate O-(t) == 0},
Kp = Cease) 2,
Therefore A is on one side of its boundary in 2. This proves (b).
Since Jg(t) = 1/Jf(x) > 0, g preserves the positive orientation of EH”.
We must show that the orientation induced on S from the positive orientation
1s positive.
Let ko = grad W(ty) and np = grad (xo). They are exterior normals
to A and to D respectively. Let hy = L(ko), where L = Dg(to). From the
chain rule, ky = L‘(no) where L’ is the transpose of L. By formula (4-8)
ho + no = |L'(no)|* > 0.
Let (k,,...,Kn—1) be a positively oriented frame for the tangent space to
Tea tet ymaucelct hp tL (kewl ve le hens (hj, 22, by_y) is 4
270 Integration on Manifolds US)
frame for the tangent space 7T(Xo) to fr D at xo. Since (Ko, ki,..., Kn—1)
is a positively oriented frame and g preserves the orientation of E”,
(ho, hy,...,h,) is a positively oriented frame.
Now ho = cio +h, where c = (ho -No)/(¥o +o) and h € T(xo). From
this,
No A hy aN yer = cho /\ hy, n/N job ae
Since hy no > 0, ¢ > 0. Therefore the frame (no, hi, ... , h,—1) has positive
orientation, which implies that (h;,...,h,—1) orients S positively at Xo.I
If Jf(x) < 0 for every x € Dj, then f~' induces the negative orientation
(corresponding to the interior normal) on S.
Proof of divergence theorem. Let us show that each x9 € cl D has a neigh-
borhood Uo such that (7-lla) holds provided w has compact support con-
tained in cl Up. If xo € D, let Ug be a small enough neighborhood that
cel Up C D. Then fap+ w = 0, and by Lemma 1, fap+ dw = 0.
Let xo € fr D, and let H be as in Lemma 2. Let us find a neighborhood
D, of x9 and a regular transformation f with domain D, such that
But by Lemma 2, the right-hand sides of («) and (#*) are equal.
Since cl D is a compact set, a finite number of such neighborhoods
Uy,...,Um cover cl D. Let Wx(x) and ¢%(x) be defined as in the proof of
Proposition 29, for x € cl D. Since ¢,w has compact support contained in
el Ul,
ie oro = ibe(G0) pn hom eens (*)
7-5 The Divergence Theorem 271
e $x)io
ie & és)ica [r Q
which is precisely (7—-lla). §
PROBLEMS
Unless otherwise indicated, assume that D is a regular domain.
1. Let n = 2. Show that:
4, Letn = 4 and D = {x: (x1)?+ (a2)?+ (@3)? < (2*)?,0 < 2* < 1}.
Evaluate :
(a) fl .@? +24) de! A de’ A dx’. 0b) foal?detA da? A aa.
aD aD
5. Suppose that D = {(z,y):f@)<y <g@); y
a<z <b} = {(z,y):¢y) <2 < vy),¢ <
y < d\. Show directly from the fundamental
theorem of calculus and properties of line inte-
grals that
6. Prove the divergence theorem directly from the fundamental theorem of calculus
when D is:
(amt henmninn-cubernx 3 0F<c oe < al —sl er
(b) The standard n-simplex.
7. For each ¢ in some interval (—a, a) let T; be a regular flat transformation with
domain Do. Assume that To(x) = x for every x € Do and that T is of class C®?
as a function of (x,t) on Do X (—a,a). Let v(t) = V,[T:(D)] where cl DC Do.
Prove that v/(0) = fp div Wo dVn, where W; = OT;/dt. [Hint: Show that the
integrand is (0/0t)JT;(x) evaluated at t = 0.]
In Problems 8 and 9 let 7 = fr D.
8. Show that:
9. Let D be connected, f harmonic, and f,(x) = 0 for every x € Mf. Show that
f(x) is constant on D.
10. Let D = {x:a < |x| < 6}, where 0 <a < Bb.
(a) Show that if f(x) = p(|x|), then f,(x) = y/(|x|) when |x| = 6 and f,(x) =
—y’ (|x|) when |x| = a.
(b)) Let Yr) = —[(@ —2)82_i)u! r=2, where'n.> 2/and/6,-7 1s the (Gee 1)—
measure of the unit (n — 1)-sphere. Let
f be asin (a). Show that
f isharmonic.
(c) Let ¢ be harmonic on the n-ball B = {x:|x| < bd}. Show that $(0) =
(Bn—1b"—!)—! fir 3@dVn_1. [Hint: Apply the second Green’s formula with
D and f as above and let a > 0+]
7-6 Stokes’ Formula 273
Stokes’ formula
/ mye aly (Vala)
aA° Ag
The case r= 1, n= 3. Then »w= Pdr+Qdy+ Rdz is a 1-form
and dw is a 2-form. The 1-form *dw is called curl w, and the vector n(x) = *o(x)
is a unit normal to A. Since dw(x) - 0(x) = curl w(x) -n(x), formula (7-17)
becomes
skBoe iL,curl w(x) - n(x) dV o(x). (7-18)
ro)
The name Stokes’ formula was traditionally applied to (7-18), and not its
generalization (7-17).
: z
g
ae ae
8 UY]
wv
Figure 7-13
274 Integration on Manifolds 7-6
Example. Let 08+ consist of a single simple closed curve Y in H?. Then dA* consists
of a simple closed curve Y in EH.
The normal n(x) varies continuously on A. At a boundary point x of A, n(x)
can be visualized in the following way. Let x = g(s, t), where (s,t) E frB. Let
v be the exterior normal and v the positively oriented unit tangent vector to Y at
(s, ). The vector h = Dg(s, t)(v) is a tangent vector to Y at x. If ho = Dg(s, t)(v),
then (ho, h) is a frame for the tangent space to M at x and has the required orientation
o(x). Hence (n(x), ho, h) is a positively oriented frame for H?. (See Fig. 7-13.)
i
M?
dar.
Now let M be any orientable r-manifold of class C°’. Let us call a rela-
tively open set A C M a regular domain on M if:
(1) cl A is a compact subset of M;
(2) the boundary K of A relative to M is an (r — 1)-manifold of class C;
(3) A is on one side of K.
This theorem can be proved using the divergence theorem and a partition
of unity in much the same way as for Proposition 33. We shall not give the
details.
We have assumed that M is of classC'?, but Theorem 23 is still true for
manifolds of class C”’. Moreover, the relative boundary K may be piecewise
of class C‘” in the sense explained at the end of Section 7-5. For instance, if
M is an r-plane and A an r-simplex contained in M, then the boundary of A
relative to M is piecewise of class C””.
PROBLEMS
1. Let w = yedx + ady-+dz. Let Y be the unit circle in the zy-plane, oriented in
the counterclockwise direction. Calculate f, and [40 dw and verify that they
are equal, where the orientation o is chosen so that 0A®° = Y and:
(a) A is the disk 2? + y? < 1 in the zy-plane.
(eee etre, Lee.) ee ye LP
2. Let w = zexp (—y) dx + zdy+ ydz. Evaluate fac dw when A is:
(a) The ellipsoid x?/a? + y?/b? + z27/c? = 1 oriented by the exterior normal.
(b) The square with vertices 0, e: + e2, V2 e3,e1 + e2 + 1/2 e3, oriented so
that 073(x) > 0.
(ec) The paraboloid y = x? + z? oriented so that 03!(x) > 0.
3. Show that the orientation o’ for K does not depend on the particular choice of
coordinate systems for M used in its definition.
276 Integration on Manifolds Sf,
If f(t) = xo for every ¢ € [a, b], then one should think intuitively that
y;, shrinks to the point x9 as s > 0°. When A is an open subset of E? this is
possible roughly speaking provided 7, does not loop around any holes which
may be present in A. In Fig. 7-14, A has two holes and the curves 7, in the
figure are not null homotopic in A.
%y
= oH’ oH’
= Yon (% ash, au):
i)
n b t
0H
feta d fom are
Ag ti a Figure 7-15
H and 0H‘/dt being evaluated at (1, t). Since H(1, t) = g(#), the right-hand
side is just (g,w). Similarly, since H(O, t) = f(¢)
fe =aa— (f
..w)s
Example 2. Let n = 2 and let D be the plane with (0,0) removed. Let w =
(x dy — y dx)/(x? + y?). Formally, w = d®, where O(z, y) is the angle from the
positive x-axis to (a, y),0 < O(a, y) < 27. However, @ is defined only in the plane
with a slit removed even though w is defined and of class C“) in D. For each integer
m £0 let gm(t) = (cos mt)ey + (sin mt)ez, O< t < 297. Then (gn, w) = 2mr,
which shows that g,, and g; are not strictly homotopic in D when m # I. The trans-
formation g,, represents the unit circle traversed |m| times, counterclockwise if m > 0
and clockwise if m < 0.
“1 1
| T= SS (| 6, as)Ca YN (7-19)
0 Ue ew
Finally, any r-form y on [0,1] X B can be written 4 = ds A 6+ y', where
n' involves only the differentials dt’, ..., dt”:
Te = Gy CES Oe Ne
[J
ener = of
ate, 2 Weck
en MLL
[a
0
tafin=aQ
0
— 1). (7-20)
Now let w be an r-form of class C‘” on A. Let H be a
homotopy between transformations f and g, and let of, wh,
wiz denote the r-forms induced respectively by f, g, and H. =
Let y = wy. Thenyn!(1) = wh and 71(0) = w}. Therefore
1 1
{i fee + df ees = wy — woe (7-21) A *2
0 0
Poincaré’s lemma. Let D be a star-shaped open set and let 1 <r <n.
Then every closed r-form with domain D is exact.
Let us state without proof the following version of the theorem. Let
Z’(D) denote the set of all closed r-forms of class C on D. If w and ¢ are
closed, then w -++ ¢ is closed and cw is closed for any scalar c. Thus Z"(D) is
a vector space over #'. Similarly, let &"(D) denote the vector space consisting
of all exact r-forms of the type w = df where ¢ is of class C‘“” on D. Then
&’(D) C Z"(D). According to DeRham’s theorem, the quotient vector space
5"(D) = Z"(D)/8"(D) is isomorphic to the r-dimensional cohomology group
of D with real coefficients. (The homology and cohomology groups of a space
are defined in algebraic topology. They contain a great deal of topological
information about the space.) In particular, every closed r-form is exact if
and only if 3C"(D). = 0.
PROBLEMS
1. Let D be the solid torus obtained by rotating the circular disk (y — a)?-+ 22 < b?,
0 < 6b < a, about the z-axis. Let Y be the circular path traversed by the center
of the disk. Show that f, (2 dy — y dz)/(x?-+ y?) # 0. Hence by Corollary 1,
Y is not null homotopic in D.
2. Let S be the sphere 2? + y? + z? = a?, oriented by the unit exterior normal.
Let
w = p (xdy A dze+ydz A de +2zdzA dy), p? = 2? + y?+2?,
(a) Show that w(zo, yo) = 1 if (xo, yo) € D. [Hint: Apply Green’s theorem to
Let us begin with a list of axioms which describes the real number system E!.
Axiom I. (a) Any two real numbers have a sum x + y and a product xy,
which are also real numbers. Moreover,
Commutative law ce yy a oe, Tea.
Associative law ae) S> By Ss REM) =e & L(y) == (xy)Z,
Distributive law x(y + 2) = cy + xz
for every x,y, and z.
(b) There are two (distinct) real numbers 0 and 1 which are identity
elements respectively under addition and multiplication:
= I) = ae, CL = 92
for every «x.
(c) Every real number x has an inverse —x with respect to addition, and
_ if « ¥ O, an inverse x‘ with respect to multiplication:
oo (=x) = 0, opens’
Axiom II. There is a relation < between real numbers such that:
(a) For every pair of numbers x and y, exactly one of the following alter-
natives holds: z < y,7 = y, y < @.
(b) w < wanda < yimply w < y (transitive law).
(c) « < y implies x + 2 < y + 2 for every z.
(d) « < y implies zz < yz whenever 0 < z.
From Axioms I and II follow all of the ordinary laws of arithmetic. In
algebra any set with two operations (usually called “addition” and “multi-
plication”) having the properties listed in Axiom I is called a field. A field is
called ordered if there is in it a relation < satisfying Axiom II.
283
284 Appendix A-1
The real numbers form an ordered field. However, this is by no means the
only ordered field. For example, the rational numbers also form an ordered
field. We recall that x rational means that x = p/q where p and q are integers
and gq ~ 0. Yet another axiom is needed td characterize the real number
system. This axiom can be introduced in several ways. Perhaps the simplest
of these is in terms of least upper bounds.
Let S be a nonempty set of real numbers. If there is a number c such that
az < c¢ for every x € S, then c is called an upper bound for S. If c¢ is an upper
bound for S and b > c, then 6 is also an upper bound for S.
Axiom Illa. Any set S of real numbers which has an upper bound has a
least upper bound.
The least upper bound for S will be denoted by sup S. If S has no upper
bound, then we set sup S = +o.
A number d is a lower bound for S if d < x for every x ES. If S hasa
lower bound, then (Problem 2) S has a greatest lower bound. It is denoted
by inf S. If S has no lower bound, then we set inf S = —o.
Example 1. Let S = {1, 2,3,...}, the set of positive integers. Then sup S = +
anGeiniy Sele
[a,©)
= {e:2> a}, (4,0)
= {aia > a}
are called respectively closed and open, and have a as greatest lower bound. The
corresponding intervals (—, b], (—°, b) have 6 as least upper bound.
Let S be a set which has an upper bound. Example 2 shows that the
number sup S need not belong to S. If sup S does happen to be an element of S,
then it is the largest element of S and we write “max S” instead of “sup S.”
Similarly, if S is bounded below and inf S is an element of S, then we write for it
nun 5.”
Example 3. Let S = {x:2? < 2 and z is a rational number}. Then 2 = sup S
and —V/2 = inf S. Since V2 is not a rational number, this example shows that the
least upper bound axiom would no longer hold if we replaced the real number system
by the rational number system.
Example 4. Let S = {sinz:a € [—z,7a]}. Then —1 = minS,1 = maxS.
For every € > 0, x > 0 there is a positive integer m such that x < me.
This is called the archimedean property of the real number system. To prove it,
A-2 Axioms for a Vector Space 285
suppose to the contrary that for some pair €, x of positive numbers, me < x
forevery m = 1, 2,... Thenzisan upper bound for the set S= {e, 2e, 3e,.. .}.
Let c= supS. Then (m-+ l)e < ¢ and therefore me < c — ¢, for each
m = 1,2,... Hence c — é¢ is an upper bound for S smaller than sup S, a
contradiction. This proves the archimedean property.
We shall not prove that there actually is a system satisfying Axioms I,
II, and IIa. There are two well-known methods of constructing the real
number system, starting from the rational numbers. One of them is the method
of Dedekind cuts and the other is Cantor’s method of Cauchy sequences.
Axioms I, II, and IIIa characterize the real numbers; in other words,
any two systems satisfying these three axioms are essentially the same. To
put this more precisely in algebraic language, any two ordered fields satisfying
Axiom IIIa are isomorphic.
For proofs of these facts, refer to [2], Chap. III.
PROBLEMS
1. Find the least upper bound and greatest lower bound of each of the following sets:
(a) {x:22 — 382+ 2 < O}.
(b) {2:a3.-+ 2? — 27 < 2}.
(c) {sinz-+ cos 2:2 € [0, z]}.
(d) {wexpax:a2 < 0}. [Note: exp denotes the exponential function, expx = e?,
where e¢ is the base for natural logarithms.]
(e) ke z, 3, Ts O50 ap
It is easy to show that EH” satisfies these five axioms. However, a multi-
tude of other important vector spaces besides #” occur in mathematics.
A subset B of a vector space VU is called a linearly dependent set if there
exist distinct elements u1,...%m © B and real numbers Cie Ca notes aU)
such that
Cu, ee Ws
for every u € 0. The function L + M has properties (a) and (b), and thus
is linear. If c is a real number, then cL is the linear function given by (cL)(u) =
cL(u) for every u € V.
Let £(0, ‘W) denote the set of all linear functions with domain U and
values in W, together with these operations of sum of functions and multi-
plication of functions by scalars. Then £(, ‘W) satisfies Axioms (1)-(5) for
a vector space. The zero element of £(U, ‘W) is the function whose value at
every u € VU is the zero element of W.
The dual space of U. Let us now suppose that WwW = H! and set
U* = £(0, H'). The vector space U* is called the dual space of U. Let us
show that if U has positive, finite dimension n, then 0* also has dimension n.
Let B = {uy,..., Un} be a basis for U. Let L',..., E” be the real-valued
functions such that for each 7 = 1,...,n and u = clu; +-+-+ cup,
These functions L' are linear, and therefore belong to 0*. They are specified
by their values at the basis elements:
hy SOG? 358745
be.
PROBLEMS
1. Let U be the set of all polynomials p(x) = aox™ + aiz™—! + +++ + Gm—1t + Om
of degree <™m, with the usual notions of addition and scalar multiplication. Here
x* denotes the kth power of x. Show that U is a vector space of dimension m + 1,
and find a basis for U.
2. Show that if O and W are vector spaces of finite positive dimensions n and r, then
£(U, W) has dimension nr.
288 Appendix A-3
be OSS hp eb Mg
ly
— xo] < r+ lx — xo] = 6.
Figure A-1
Hence y € U. This shows that V C U. Similarly, every point x such that |x — xo| > 6
is exterior to U. If |x — xo| = 6, x is a frontier point.
Example 3. Let A = E”. Then int EH” = cl HZ” = HE” and fr E” is the empty set.
It is always true that int A C A. If these two sets are the same, then A is
called an open set.
A-3 Basic Topological Notions in E” 289
Proof. Let x € A UB. By the definition of the union of two sets, either
x € Aorx € B. Ifx € A, then there is a neighborhood U ofx such that U C A.
Since AC AUB,UCAUB. Similarly, if x € B there is a neighborhood of
x contained in A U B. This proves that A U B is open.
If x € A 1B, then x has neighborhoods U,, U2 such that U; C A and
U,CcB. Let U3 = Uy U2. Then U3 is a neighborhood of xand U3; Cc AN B.
Therefore A M B is open. §
These sets are, respectively, the wnzon and intersection of the indexed collection.
If J is a finite set, then the indexed collection is called finite. If g = {1,2,...},
then the indexed collection is an infinite sequence of sets and is written
Anas ee orl Ay) =—="1,12;).-. In that) case the union is “written
A; U Ag U-:: or U-_, Am, with similar notations for the intersection.
Proposition A-Ib. The union of any indexed collection of open sets is open.
The proof is the same as for the first part of Proposition Ala.
(2 a U 4s,
pes
(U dali Cre
Lesa pes
() A= {pES8:peEA forevery
A € Uf}.
AEY
If each set of the collection is indexed by itself (taking J = YA, A4 = A), then
this definition of union and intersection agrees with the one for indexed collec-
tions. Propositions A-lb and A-2 remain true for unindexed collections.
PROBLEMS
1. Find int A, fr A, cl A if A is:
(amex OF<a|xi—-xo| <0 0820:
(b) {x:|x — xo] = 6}, 6 > 0.
(CRG) 0s ye<i a ole yl)
(Dimi rcOs.Oayocit 0) 0 pute <m eO e<aO) <0 Dan
(e) {(a, y):2 or y is irrational}.
(f) Any finite set.
(g) we 3, 3) Om te n=l.
2. In Problem 1 which sets are open? Which are closed?
3. Let A be any set. Show that int A is open, and that both fr A and cl A are closed.
4. Show that:
() aie 41 se (ALS), (b) el A =) cl (lA):
(ce) fr A = el A fel (CA9). (d) int-A = (el (A%)¢.
5. Show by giving examples that the following are in general false:
(a) int (cl A) = int A. (is) sue Gie AD) = iiie Al.
6. Let A be open and B closed. Show that A — B is open, and that B — A is closed.
A-4 Sequences in E” 291
A-4 SEQUENCES IN E”
Definition. Suppose that for every € > 0 there exists a positive integer N
such that |x, — xo| < ¢€ for every m > N. Then Xp is the limit of
the sequence [x].
_ The proof of this is left to the reader (Problem 6). Only superficial changes
are needed in the proof given for real-valued sequences in any careful elementary
caleulus text. Moreover, it is similar to the proof of Proposition A—3b in the
next section.
: : é ; 0 t
Proposition A-4a. Xo = lim xX, if and only af op thie oes) Sere ala)
mo mon
eles 7
i= 1,...,n. Then given € > 0 there exists for each 7 an N; such that
|x‘, — | < e/nforeverym > N;. Let N = max {Nj,..., Nn}. Ifm 2 N,
then
n
[Xm
— Xo] < aon aewerol <n = €.
i=1
Let us next prove three theorems which depend on the least upper bound
axiom. It will be seen later that when n = 1 each of these theorems describes
a property of the real number system which, taken together with the archi-
medean property, is equivalent to the least upper bound axiom.
A sequence [x] of real numbers is called monotone if either x; < v2 <
tga < ++: or tT] > Xy > t3 >-:- In the first instance the sequence is non-
decreasing, in the second nonincreasing. A sequence [xX,,] is bounded if there is
a number C' such that |x,,.| < C for every m = 1,2,...
Theorem A-1. very bounded monotone sequence of real numbers has a limit.
Proof. Let [xm] be nondecreasing and bounded. Let ro = sup {x1, 2...}.
Given e > 0 there exists an N such that x9 — ry < €. Otherwise 7m < Xo — €
for every m = 1,2,..., and x) — e€ would be a smaller upper bound than the
least upper bound zg. Since the sequence is nondecreasing, ty < %m < %o
for every m > N. Hence |tm — X%o| = Xo — tm < € for every m > N.
This shows that 7, — to as m— o.
If the sequence is nonincreasing and bounded, let x9 = inf {x1, xo,...}.
In the same way tm — 2% asm — o.§
Example 1. Let 0 < a < 1. The sequence a, a”, a®,... of its powers is decreasing,
and is bounded below by 0. The limit of the sequence is b = inf fa, a”, a?,...}.
eimce 6 = a"™'!Va-'b < a™ for m= 1,2,;... Therefore a,b —< O) Howeyer
a—'b > bsince0 < a < 1; and hence a~!b = b. This implies that 6 = 0.
Definition. If for every € > 0 there exists a positive integer N such that
|x: — Xm| < € for every 1, m > N, then [x] is a Cauchy sequence.
Proof. Let [Xm] be convergent, and xo be its limit. Then given e > 0
there exists N such that |x, — X9| < €/2 for every m > N. Now
His =—=\64
SS WT ee eel plas ae1
Theorem A-3. (Cantor). Let [Am] be a sequence of closed sets such that
Aya Ajo): ond. 0 == lin, diam Ayre) hen Ane \eso (lice nCOlle
tains a single point.
Proof. For each m = 1, 2,... let Xm be a‘point of A». Let us show that
[x] is a Cauchy sequence. Given e > 0, there exists N such that diam Ay < e.
If 1, m > N, then x;,xm € Aw since Ay C Any, Am C Ay. Therefore
By Theorem A-2 the sequence [x,,] has a limit xo. For each / = 1,2,...,
Xm © A; for every m > Il since Am C A;. Since A; is closed, x9 € Az (Prob-
lem 5). Since this is true for each 1, xX») € Ay N Ag N:::
Ifx € Ay mM Ag n--:, thenx € A, and
*Note. Let n = 1. Suppose that we took instead of the least upper bound
property of H’ (Axiom IIIa, Section A-1) the following.
Ty
WO
ee ae
ee
a ¢
l ue |
I,
Figure A-2
then similar reasoning shows that these sequences must converge to a common
limit x9 and x9 = sup S. Again, this is equivalent to Axiom IIIa.
PROBLEMS
In Problems 1 and 2 you may use the results of Problems 9 and 10.
1. Find the limit if it exists.
(aan ll= 2" — 25%)/ 8" 35"):
(b) 8g 3 I sin (mm/2).
@) Bp = SM Gas.
(da) tm = (m+ 1)/(m — 1))™. [Hint: (1+ 1/m)™ >e as m > © |]
(Cleo nem n(n Salm? — 1) Jr.
2. Find the limit if it exists, using Proposition A—4a.
(8). Gray Ym) = (A m)/ — 2m), 1/0 + m)).
(DMG Yr) Sole on List am)e
(c) (Xm, Ym) = (1 — 2-”, (m? + 3”)/m!).
3. Show that a sequence [x,] has at most one limit xo. [Hint: If yo were another
limit, let € = |xo — yo|/2.]
4. Show that [x] is a Cauchy sequence if and only if [z;,] is a Cauchy sequence
iO? CHONG, = Ih saanite
5. Show that if x, © A for every m > 1 and xo = limm_,~ Xm, then xo € cl A.
6. Prove Proposition A-3a.
7. (Comparison tests.) Show that:
(a) If 0 < am < ym for every m > 1 and yn > 0 as m— ©, then tm — 0
asm —> ©.
296 Appendix A-5
(b) If [zm], [ym] are nondecreasing sequences such that %m < Ym for each
m = 1,2,... Ym — and
yasm — ©,then [zm] has a limit « < y.
(c) If 0< 2m < ym for every m = 1,2,... and ¢ = yi + y2+--°--, then the
series x1 + x2 +--+ converges with sum s < f.
8. An infinite series x1 + x2 + °+-+ converges absolutely if the series of nonnegative
numbers |x;| + |x2|-+--- converges. Prove that any absolutely convergent
infinite series is convergent. [Hint: Show that the sequence [s,,] of partial sums
is Cauchy.]
9. Show that if a > 0, then
@) ime ew (b) lim a”/m! = 0.
mo moO
[Hints: For part (a) reduce to the case 0 < a < 1. By Example 1, if 6 < 1
then a < 6” for only finitely many m. For part (b), compare with the sequence
[c/m] for suitable c and suitable / in Problem 7(a).]
1O, Ian ay SS Nie a, Of) — Wht es Diy, BANC! SONS ae WE, == Orie = OW) Ih, A, ao o
Show that zo/yo = limm_.0%m/Ym- (Hint: By (c) of Proposition A—3a it suffices
to show that yo! = limm—o 2m
Figure A-3
If we let € and 6 denote respectively the radii of V and U, then the defi-
nition may be rephrased as follows: f(x) — yo as x — Xo if for every € > 0
there exists 6 > 0 such that |f(x) — yo| < € whenever 0 < |x — xo| < 6.
The number 6 depends of course on € and may also depend on xo. Given €
and Xo, there is a largest possible 6. However, there is ordinarily no reason
to try to calculate it.
Let us first show that limits behave properly with respect to sums and
products. Let f and g have the same domain and values in the same euclidean E”.
lf(x) - g(®) — yo°Zol < |f(%)| |g(x) — Zol + [zo] If) — yol- (*)
\
Given € > 0, let V,, V2 be the neighborhoods of radius €/2C' of yo, Zo respec-
tively,.and let Vi = Von Vi. U-esx 2C,; then Vj = Vy. By hypothesis
there are punctured neighborhoods U,, Us of x9 such that f(U;) C V2,
TUG Vee let U = Ur Us. Forvevery xe Upii(x) eG.) go and) hence
[f(x)| < C. From (*),
€ €
iQ ee tye" Za < Coat Cag = €
Proposition A-4b. yo = lim f(x) 7f and only if yj = lim f*(x) for each
ee eae habe ate”
Proof. Let V be any neighborhood of yo. There exists 6 > 0 such that
f(x) € V whenever 0 < |x — xo| < 6. If0 < |t| < 6/|v|, then
Example 1. Let f(x,y) = 2?/(#? + y?), (z, y) # (0,0), and let xo = (0,0). Taking
v =e; = (1,0), f(,0) = 1 for every t¥ 0. Hence f(t, 0) > 1 ast > 0. Similarly,
taking v = eg = (0,1), f(0,t) = 0 for every t ¥ 0 and f(0, t) ~ 0 ast > 0. Since
these limits are different, f has no limit at (0, 0).
Example 2. Let f(x,y) = (y? — «)?/(y4+ 27), (2,y) ¥ (0,0), and again let
xo = (0,0). Consider any v = (h,k) # (0,0). Then
fie
GC =i
It will be shown later (Proposition A—7) that the composite of two contin-
uous transformations is continuous. In Section 4—4 it is shown that any differ-
entiable transformation is continuous.
Limits at oo. Let us call a set of the form {x: |x| > 6} a punctured neigh-
borhood of «. The definition of “limit at «” then reads: yo = limjz|_,. f(x)
if for every neighborhood V of yo there exists a punctured neighborhood U
of « such that f{(U) C V.
When f is real valued we say that lim,_,x, f(x) = + if for every C > 0
there is a punctured neighborhood U of xo such that f(x) > C whenever
x € U. The definition of “lim,_x, f(x) = —” is similar.
300 Appendix A-6
PROBLEMS
th Find the limit at xo if it exists.
(a) f@pye— ty (r> 47) xor—" er eo:
(b) I(x; y) = cy) ce ae y”), XO (0, 0). ie
(ec) f(z) = (1 — cosz)/x?, xo = 0. [Hint: limz_o (sin z)/4 = 1.]
(d) f(z) = |x — 2le; + |x + 2leo, xo = 3.
(e) f(x,y) = yer + (ry)?/[(y)? + (& — y)7le2, xo = (0, 0).
At which points is each of these functions continuous?
. Prove (2) of Proposition A-3b.
. Show that if yo = lims_,x, f(x), then |yo| = limz_x, |f(x)|. By an example, show
that the converse is false.
. Let yo = lims4x, f(x), 20 = lims—x, g(x). Show that if zo ¥ 0 then
$f) _ yo.
x>x9 g(X) 20
. Show that limy_,x, f(x) = + if and only if lims_x, [f@)]—* = 0 and f(x) >)0
for every x in some punctured neighborhood of xo.
. Find the limit if it exists.
A 4
gS. oi zy”
a lim : (b) Li oes
(a) (e,4)=1(0,0) 22-1 Y* (x,y) (0,0) & ae y
Figure A-4
1S)
In any topological space S the basic notions of interior, frontier, and closure
are defined just as in Section A-3 for the topological space #”. For instance, p
is interior to a set A C S if some neighborhood of p is contained in A. An open
subset of the topological space S is a subset A each point of which is interior to A.
If S — A is open, then A isa closed subset of S. Axiom (4) guarantees that any
neighborhood is an open set. Propositions A-la, A-1b, and A-2 are still true,
and the proofs are almost the same as before.
It often happens that two different collections of neighborhoods Up, UZ
lead to the same collection of open subsets of S. In E” we need not have started
with spherical neighborhoods. For instance, the neighborhoods obtained from
any noneuclidean norm on #” lead to the same open sets as in Section A-3.
See Section 1-6.
The open sets, and not the particular kinds of neighborhoods from which
they were obtained, determine all of the topological properties of S. Thus we
say that the collections Up, UW, define the same topology on S if they have the
same collection of open sets.
To give some idea of the breadth of the notion of topological space, let us
give a few more examples.
Example 3. Let S be any set, and let every p € S have exactly one neighborhood,
namely S itself. The only open sets are S and the empty set.
302 Appendix A-6
Example 4. Let S be any set, and let the sole neighborhood of p be the set {p} with
the one element p. Then every subset of S is open.
Ficure A-5
In this book the following cases will be of interest: (a) S is an open subset
of HE” and T = HE”. In that case the definition of continuity agrees with the
one in Section A-5. (b) SC H” and T CE”. The sets S and T are given
the relative topology, defined below. (c) S is an open subset of #”, and T is
some other finite-dimensional vector space. Specifically, for 7’ we shall take
either the dual space (#”)*, the space Hy of multivectors of degree r, or its
dual (H*)*. Each of these spaces has a euclidean norm. Just as for #”, neigh-
borhoods of a point p in each of them are of the form {q:|q — p| < 6,6 > O}.
Case (c) could be reduced to (a), since there is a norm preserving isomorphism
between each of these vector spaces and euclidean EH” of the appropriate
dimension .
g(W)
W (feg)(W)
(U 7
g ie f aS I
aa
te are aa
fog
Figure A-6
Example 6. Let D C E” be open. Given xo and v, let g(t) = xo + tv for every scalar t.
Then g is continuous from H! into H”. By Proposition A-6, A = {t:x0 + tw & D}
is open. Let f be continuous on D and let o(t) = f(xo + tv) fort EA. Then ¢ =
f° (g|A), and by Proposition A-7 ¢ is continuous on A. This result is similar to
Proposition A-5.
Given x} € FE’, let g be the function from £"~* into H” such that g(x”) = (xo, x”
for every x’ € E"-*, Such a function g is called an injection. Since |g(x’”) — g(y”’)| =
jx” — y”|, gis continuous. Let D C E” be open, and let D(xo) = {x””: (Ko, x”) € D}.
Since D(xo) = g7!(D), by Proposition A-6 D(x6) is an open subset of H”~*. Let fbe
continuous on D. The function f(x, ) whose value at each x’ © D(x9) is f(x, x”)
is the composite of f and g| D(xg). By Proposition A-7, f(xo, ) is continuous. Similarly,
given x¥, the set {x’: (x’, x) € D} is open and the function f( , x0) is continuous.
Taking complements we find that the relatively closed sets are those of the
form S Q E where E 1s a closed subset of So.
If A CS and A is an open subset of So, then A is relatively open (take
D = A in Proposition A-8). On the other hand, there are generally many
sets which are relatively open but not open.
Example 8. Let So = H', S = [a,b]. Ifa < x < b, then the interval [a, x) is open
relative to S but is not an open subset of #'. A function f is said to have right-hand
limit yo at a if for every € > 0 there exists 6 > 0 such that |f(z) — yo| < € when-
ever a <x <a-+ 6. The idea of left-hand limit at the other endpoint 6 is defined
similarly. A function f is continuous on [a, b] if and only if f is continuous on the
open interval (a, 6) and
PROBLEMS
1. Show that:
(a) In Example 3 any real-valued function continuous on S is constant.
(b) In Example 4 every function with domain S is continuous.
2. Let S be an open subset of So. Show that the relatively open sets are just those
open subsets of So contained in S.
3. Consider the following nonstandard topology on the plane H?: In this topology
the “d-neighborhood” of (zo, yo) is the set {(z, y):%9 —6 <x <20+6,y = yot.
(a) Verify that Axioms (1)—(4) are satisfied.
(b) Show that if D C E? is open in the usual sense, then D is open in this topology,
but not conversely.
(c) Let f(x, y) = g(x)h(y), where g and h have domain EH! and g is continuous
in the usual topology of #!. Show that f is continuous in this topology.
4, Let So be a topological space, and let f be continuous on So. Let S C So have the
relative topology. Show that f|S is continuous on S.
5. A metric space is a nonempty set S together with a real-valued function d with
domain the cartesian product S < S, such that:
(i) d(p,q) => 0 for every p,q € S, d(p, q) = Oif and only if p = gq;
(ii) d(p,q) = dq, p) for every p,q € 8;
(iii) d(p, g) < d(p,r) + d(r, q) for every p, g,r € S.
Let the collection U, consist of all sets {¢: d(p, q) < 6}, where 6 > 0.
(a) Verify Axioms (1)—(4) for a topological space.
(b) Show that {q:d(p, q) < 6} is open and that {q¢:d(p, q) < 4} is closed.
6. A normed vector space is a vector space U together with a real-valued function || ||
with domain U such that: (a) ||u|| > 0 for every u€ VU, u ¥ 0, (b) |leul] = |e} |lull
for every c and u € UV, and (ce) |ju-+ o|| < |lul| + |lvl| for every uo EV. Let
d(u,v) = ||u — o||. Verify the axioms (i), (ii), and (iii) for a metric space in
Problem 5. [Note: If 0 = E”, then it was shown in Section 1-6 that every norm
on E” Jeads to the same topology. In Example 5 above & is a normed vector space.
The norm is ||f|| = sup{|f(x)|: 2 € [0, 1]}. The L?-spaces (Section 5-12) furnish
other examples of normed vector spaces.]
306 Appendix A-7
Example 1. Let S = (0, 1]U[2, 3]. Let A = [0, 1], B = [2,3]. Then A = SNK (—1, 8),
which shows that A is open relative to S. Similarly, B is open relative to S. Since
S = AU Band AN Bis empty, S is disconnected.
© Zz y
[x, y]
Figure A-7
On the other hand, any open connected subset of EH” is pathwise connected.
In fact, any two points of D can be connected by a polygonal path (Problem 9).
PROBLEMS
1. Show from the definition that the following are disconnected subsets of the plane E?:
(a) The hyperbola x2 — y? = 1.
(b) Any finite subset of #? with at least two elements.
(C) {@ yin? <y?}.
308 Appendix A-8
of points, then its points must accumulate somewhere. The truth of this is
expressed by the following.
Proof. Let S be bounded and closed, and let A be any infinite subset of S.
Since S is bounded, A is bounded. By the Bolzano-Weierstrass theorem A has
an accumulation point x9. If xo ¢ S, then xo is exterior to S since S is closed.
Thus x9 has a neighborhood U which does not intersect S. Since AM U is
not empty, this is impossible. Therefore xo € S, which shows that S is compact.
To prove the converse, suppose that S is unbounded. Then for each
ie ale eeetheres exists x, 5 -suchs that. |x,,| =m, ~“The;set) A —
{x1, X2,...} is infinite and has no accumulation point. Hence S is not compact.
If S is not closed, then there exists a point Xo € frS — S. Form =1,2,...
there exists x, € S such that |x, — Xo| < 1/m. The set A = {x1, Xo,...}
is infinite and has the single accumulation point Xo. But x9 € S. Hence S is
not compact. jj
The converse to the Heine-Borel theorem is true. To prove it, suppose that
S is not compact. Then either S is unbounded or S is not closed. If S is un-
bounded, let A,, be the neighborhood of 0 of radius m = 1,2,... Then
{A,, Ao,...} 1S an open covering of S which has no finite subcovering. If S
is not closed, let x9 € frS — S and A,, = {x:|x — xo| > 1/m}. Then
{A,, Ao,...} is an open covering of S with no finite subcovering. This proves
the converse.
The definition of compactness in terms of accumulation points was the
first historically. The characterization in terms of open coverings has less
intuitive appeal but is more useful for proving theorems. Moreover, it is the
appropriate notion of compactness in general topological spaces.
From the Heine-Borel theorem and its converse, if So = HB” this defini-
tion is equivalent to the previous one.
A-8 Compact Spaces 311
Example 2. Let 4 C EH” be compact. Using the notation of Example 7, p. 303, the
set A(x) is closed since its complement is open. Since A is bounded, A (x4) is bounded.
Therefore A(x() is compact. Let p(x) = x’. The transformation p is called a pro-
jection of E” onto H*. Since p is continuous, the set p(A) = {x’: (x’,x”) € A for
some x’’} is compact.
Proof. Any compact subset of HE! has a least and a greatest element
(Problem 2). §
Mean value theorem. Let f be real valued and continuous on a closed interval
[a, b], and let the derivative f’(x) exist for every x € (a,b). Then there exists
c € (a, b) such that
| f(b) — f(a) =f (ec) — a).
Proof. Let m = [f(b) — f(a)]/(b — a) and let F(x) = f(b) — f(x)
m(b — x). Then F is continuous on [a, b] and F’(x) = —f’(x) + m for x € (a, b).
Since [a, b] is compact, F has a maximum and a minimum value on [a, b]. If
the maximum value is positive, then since F(a) = F(b) = 0, the maximum
must occur at some 2x, € (a,b). By elementary calculus F’(x;) = 0 and we
may take c= x,. Similarly, if the minimum value is negative we may take
¢ = 22, where F(x2) is the minimum value. If neither of these possibilities
occurs, then F(x) = 0 on {a, 6] and c is arbitrary. J
derivative f\?(x) exist for every x & (a,b). Then there exists c € (a,b) such
that
fe) — f@ = fla — a) \
ee ra lee 2) orn eo)
f"(@) et CE ae a— PEL IR.
where
@r.
Rae Ness
Proof. Let
where the number K is so chosen that G(a) = 0. Then G(b) = 0 and, using
the product rule,
Repeating the reasoning in the proof of the mean value theorem, there exists
c € (a, b) such that G’(c) = 0. Then f@(c) = K.]
PROBLEMS
1. A point x is an isolated point of A if there is a neighborhood U of x such that
AM U = {x}. Show that every point of el A is either an isolated point of A or
an accumulation point of A.
to Let S be a compact subset of H'. Show that inf S€ S and sup S € S.
A-9 Review of Riemann Integration 313
—0 =
314 Appendix A-9
where
by
OT eee Eb Hit hey SSF,
and
w= max {ty nae to, ig = ty, 50 a 9 Ute cer ineare
More generally, the Riemann integral exists for any bounded function
with a finite number of discontinuities. It agrees with the integral in Lebesgue’s
sense, which is defined in Chapter 5 for a much wider class of functions.
Since f is continuous, given € > 0 there exists 6 > O such that f(t) —é <
f(s) < f(t) + € whenever |s — t| < 6. Thenifh < 6,
In the theorem, F’(a) means the right-hand derivative and F’(b) means
the left-hand derivative.
The fundamental theorem says that F is an antiderivative of f. If @ is
any antiderivative of f, then G’(t) — F’(t) = 0 for every t¢ € [a, b], and by
the mean value theorem G(t) — F(t) is constant on [a, b]. Thus G(t) — F(t) =
G(a) — F(a) = G(a), and upon setting t = b we obtain
b
ely = tela) = | f(s) ds.
A-10 Monotone Functions 315
PROBLEMS
1. Let us say that f(z) — +9 asa — +0 if for every C > 0 there exists 6 such that
f(x) > C for every x > 6. Show that if f is nondecreasing and unbounded on
[a, ©), then f(x) > +0 asx > -+o.
2. (a) Give a precise definition, similar to that in Problem 1, for “zm, —>-—+2 as
m > co ae
(b) Let f have domain [a, ©). Show that f(z) — yo as x >-+0 if and only if
f(%m) — yo for every nondecreasing sequence [rm] such that am => a for
m = 1,2,...andt%m—7>-+%o asm— o.
Historical Notes
Section 1-1
1. 4e; — 2e2+ €34 3e4, —2e1 — e3-+ 4, V30, V6, V6, V12, 6
7. V4 = +(+/2/10)(4e1 + 3e2 — 3e3 + 4e4)
Section 1-3
1. {x:a2!1+
22+ 473 = 1}
Pe Ge Xoo eon oro te = Ob) t= 4s
Section 1-4
BO PP ad
9. (b) The barycenter is at the intersection of the line segments which join the
vertices with the barycenters of the opposite (r — 1)—dimensional faces.
Section 1-5
1. (b) n-cubes
2. (2) |\(@a,y)|| = [e*-+ zy+ 4y7]"? (b) 2
Section 2-1
Section 2-2
Section 2-3
Section 2-4
Section 2-5
pap Pa ayes
i= k=1
8.8)
65 @) 2,0 (b) sn 3, —sing
Section 2-6
ES i Ne be OP ite
3. (a) f(z, y) = 422y +c (b) Not exact (c) Not exact
(d) f(x,y) = x/y — y/x + (2, y), where ¢ is constant on each of the four quad-
rants into which the coordinate axes divide EH?.
Section 3-1
ly = 2@ — v2)
2 l= eG) se le = Be)
3. /2(—e1 + eo)
Answers to Problems 325
Section 3-2
Section 3-3
Section 4-1
i (a) Lt; © )
Section 4-2
e1 — lleg + 2e3.
3. (a) The diagonal elements are c!,..., c”; all other elements are 0.
(b) (L—1)*(x) = (c*)—!2z' provided c* ¥ 0 for every 7
5. (a) Reflection in the line s = ¢
(b) They are rotations through angle 37/2, 1/2 respectively.
6. L is not a rotation.
8. (b) If g(t) = L(t) + xo, then L must be nonsingular.
326 Answers to Problems
Section 4-3
Section 4-4
1. Fig = 2f12 + zyfe2 + fe, the partial derivatives of f being evaluated at (x, ry)
2. Fi = fit f3gi, Fe = fe+ fsge,
Fix = fir + 2f1391 + f33(g1)? + f3g11,
Fi2 = fie + fisge + f329g1 + f33gig2 + fsgi2,
Foo = fo2 + 2fe3g2 + f33(g2)? + fsg22
3. (a) 162 (b) —¢’(3)¢'(3)
Ay A See
Section 4-5
Section 4-7
1. {(z, y): F(x, y) = c} is an ellipse if loge > 2, is the one point set {(0, 0)} if
log ¢ = 2, and is empty if log c < 2. Any ellipse is a 1-manifold.
2. (a) The cone is not a 2-manifold.
(b) 2@ — 2) =" + 1) + 4¢@—1) =
7. No
Section 4-8
ih,
2. |clife < 4, Vc— fife> 3
3. V14/3
9
Section 5-1
Section 5-2
5. 239/240.
Section 5-3
Section 5-5
4, 8
51
- 75
6. (b) (e — 1)? (Ce 2a oT
Section 5-7
fem? 27
Section 5-8
3 1 : ae :
il. i io) Ghd — i;fig()\(2 — 2t) dt, provided either integral exists.
2 0
2, 2 log 2
3. 48
Section 5-9
ll, 2a Ms Tis te By a
m [2 co in 1 (rcos6@,
r sin @) ¥
4, I do i 8 o rar flr cos 0, r sin 8, 2] dz
0
5. 2
Ge = ihe ak Oper i
10. (a) an
AV) @)/T Ce (b) P(H)1(4)/T@)
a+1 fae as
(c) Soleszs) (a) ey T(e
+ 1)
fe) he 1) /Pa@aek=- 1)
Gear. a‘ >}Ure i
Section 5-11
=) 1+ exp (rt)
1. (a) 2tan (1/2) (b) Le ae
Section 6-1
es hy SA UN
2. —1
Section 6-2
1. el A e234 = e3 A e124 = 1234 92 A e134 = ef A e123 — 91234
Section 6-3
Li Ginee) ar - Gra
3. Negative 4. Yes 5. No 6. 3/2 We NiH
Answers to Problems 329
Section 6-4
Section 6-5
1 - SpA. 5 é 5
(b) 7 a (—1)**17"*B"* where \y = Cs Sa ee oa)
3. €1
X €2 = €3, e€1
X €3 = —e2, C2
X €3 = e1
Section 7-1
1. (a) 4+/3; the triangle with vertices 2e3, e: + e2, e1 — 3e2 + 4e3
(b) p sina; a solid cone from which points (2, 0,z),0 < x < z, are deleted
(c) [1 + s? + #?]!/?; the hyperboloid x = yz
2. (a) Ty(t) = {k:k!s+ k7t-+ k3u = O},
Tu(x) = fh:h!x/a? + h2y/b? + h3z/c? = 0}
(b) Jg(t) = [a2b2u2 + a2c7¢2 + b2¢252]1/2
iB (eat |
- rane 2 ;
pe=21s=
— = (sin¢ cos d)e12 + (sin ¢sin A)e31 + (sin ¢ cos A)e23
dp
330 Answers to Problems
a) o(x) = [(— sin s)e; + (cos s)e2] A [(— sin t)e3 + (cos t)ea]
Section 7-5
oars1
3. w is not of class C@ on el D.
4. (a) 5 (b) -3
Section 7-6
Section A-1
Section A-2
Section A-3
Section A-4
ABCDE698765
\
a\\
eter)
steer ie
i4