Muldowney PDF
Muldowney PDF
i i
2010/1/10
page 1
i i
i i
i i
muldown
i i
2010/1/10
page 2
i i
i i
i i
muldown
i i
2010/1/10
page i
i i
Contents
Preface iii
3 Riemann Integration 57
3.1 Content and the Riemann Integral . . . . . . . . . . . . . . . . . 57
i i
i i
muldown
i i
2010/1/10
page ii
i i
ii Contents
3.1.1 Partition of I . . . . . . . . . . . . . . . . . . . . . . 59
3.1.2 Riemann Sums . . . . . . . . . . . . . . . . . . . . . 59
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
3.2 Cauchy Criteria and Properties of Integrals . . . . . . . . . . . . 61
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
3.3 Evaluation of Integrals . . . . . . . . . . . . . . . . . . . . . . . 71
3.3.1 Real valued functions of a real variable . . . . . . . 71
3.3.2 Real valued functions on R2 . . . . . . . . . . . . . . 73
3.3.3 Real valued functions on Rn . . . . . . . . . . . . . 76
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
i i
i i
muldown
i i
2010/1/10
page iii
i i
Preface
These notes are not intended as a textbook. It is hoped however that they
will minimize the amount of notetaking activity which occupies so much of a stu-
dents class time in most courses in mathmatics. Since the material is presented in
the sterile definition, theorem, proof form without much background colour or
discussion most students will find it profitable to use the notes in conjunction with
a textbook recommended by the instructor.
Probably the most important aspect of the notes is the set of exercises. You
should develop the practice of attempting several of these problems every week.
Many of the problems are quite difficult so please consult your instructor if you
are not blessed with success initially. Do not acquire the habit of abandoning a
problem if it does not yield to your first attempt; a defeatist attitude is your greatest
adversary. Solution of a problem, even with some assistance from the teacher when
necessary, is a fine boost to your morale. You will find that a strong effort expended
on the earlier part of the courses will be rewarded by growing self-confidence and
easier success later.
iii
i i
i i
muldown
i i
2010/1/10
page iv
i i
iv Preface
NOTATION: Except when specified otherwise, upper case (capital) letters will
denote sets and lower case (small) letters will denote elements of sets.
i i
i i
muldown
i i
2010/1/10
page 1
i i
Chapter 1
We begin by defining our basic tool, the real numbers R The real numbers can
be constructed from more primitive notions such as the natural numbers N :=
{1, 2, 3, . . .}, or even from the fundamental axioms of set theory. Here we shall be
content with a precise description of R.
In the next several subsections we will explain the three underlined words.
1.1.1 Fields
A field is a set F together with two binary operations + and (addition, multipli-
cation) which satisfy the following axioms: For all a, b, c, . . . in F
F1 a + b F and a b F (closure)
F2 a + b = b + a and a b = b a (commutativity)
F3 a + (b + c) = (a + b) + c and a (b c) = (a b) c (associativity)
F4 (a + b) c = (a c) + (b c) (distributivity)
F5 There exists unique elements 0 and 1 in F , 0 6= 1, such that
a+0=a and a 1 = a a F.
i i
i i
muldown
i i
2010/1/10
page 2
i i
2 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
a
4. It is customary to write a b for a + (b) and b for ab1 .
Example 1.2 i The simplest (and least interesting) field is the set {, e} with
+ e e
the operations e
e e e e
m
ii The set Q of rational numbers, i.e., numbers of the form n (n 6= 0) with the
usual addition and multiplication is a field.
iii The sets R and C of real and complex numbers respectively with the usual
addition and multiplication are fields.
iv The set Q(t) of rational functions with rational coefficients (i.e. functions of
the form p(t)
q(t) where p(t) and q(t) are polynomials with rational coefficients)
is a field.
v The set N = {1, 2, 3, 4, . . .} of natural numbers and the set Z of integers are
NOT fields.
O1 a, b P = a + b P and ab P .
O2 0
/P
i i
i i
muldown
i i
2010/1/10
page 3
i i
Remark: Every ordered field contains Q as a subfield (we do not prove this). Thus,
Q may be characterized as an ordered field containing no ordered proper subfield,
i.e., Q is the smallest ordered field. (Two ordered fields are considered the same if
they are isomorphic and the isomorphism preserves the order.) A discussion of this
point may be found in
C. Goffman, Real Functions, Proposition 1 and 2 in Chapter 3.
E. Hewitt and K. Stromberg, Real and Abstract Analysis, Theorem 5.9.
We define a relation > on an ordered field as follows: If a, b F , write a > b
(equivalently, b < a) if a b P . Then the axioms O1, O2, and O3 have the
following consequences:
Exercises
1.1. Establish the following properties of a field:
(a) a 0 = 0
(b) a (1) = (a)
(c) (a)(b) = ab
(d) (ab1 )(cd1 ) = ac(bd)1
(f ) If ab = 0, then a = 0 or b = 0.
1.2. Observe that for P being the set of positive elements
(a) 1 P
(b) a 6= 0 = a2 P
(c) If n N, then n P .
i i
i i
muldown
i i
2010/1/10
page 4
i i
4 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
1.1.4 Properties of R
The first property is for an ordered field F to be Archimedean:
i i
i i
muldown
i i
2010/1/10
page 5
i i
1
Corollary 1.7. If a > 0, there exists n N such that 0 < n < a.
1
Proof. There exists n > a1 > 0. (Why?). Thus, a > n > 0.
Corollary 1.9. If a, b R, a < b, then there is a rational r such that a < r < b.
Proof. There exists an n N such that n(b a) > 1 (why?). Let m be the least
integer such that m > na. Hence, m 1 na and so
m
na < m na + 1 < na + n(b a) = nb = a < < b.
n
Exercises
1.3. Guess the supremum and infimum of the following sets (when they exist):
(0, 1) = {x : 0 < x < 1} [0, 1] = {x : 0 x 1}
{ n1 : n = 1, 2, 3, 4, . . .} N = {1, 2, 3, 4, . . .}
1.4. If a > 0, there exists n N such that 0 < 21n < a. Hint: Show 2n > n,
n N.
1.5. Q is not Archimedean.
1.6. Let F be an Archimedean ordered field containing an irrational element .
Show that if a, b F , a < b, then there is an irrational element such that
a < < b.
1.7. Show that R contains an irrational element. Hint: Show first that no rational
p satisfies p2 = 2. Then show that p = sup{x > 0 : x2 < 2} must satisfy
p2 = 2.
Proof. First note that an < bm for all n, m. Thus, each bm is an upper bound for
the set {an : n N}. Therefore, a := sup{an : n N} bm for all m. It follows
that an a bn for all n. Thus, a In for all n.
i i
i i
muldown
i i
2010/1/10
page 6
i i
6 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
(i) |x| = 0 x = 0.
(ii) | x| = |x|.
which is the desired right hand inequality. This implies the left hand inequality
since
Exercises
1.9. Let F be an ordered field, with the property that if {In } is a nested sequence
of closed intervals in F , then n=1 In 6= . Show that F is complete. (Remark:
This exercise and Theorem 1.10 shows that the supremum (completeness)
property and the nested interval property are equivalent.)
1.10. Let In = (0, n1 ). Show that n=1 In = .
1.11. Let Kn = [n, ) := {x : x n}. Show that n=1 Kn = .
i i
i i
muldown
i i
2010/1/10
page 7
i i
1.12. If a set S of real numbers contains one of its upper bounds a, then a = sup S.
Such a supremeum is called a maximum.
1.13. Show that S R cannot have two suprema.
1.14. Show that an ordered field F is complete if and only if every non-empty
subset of F which has a lower bound has an infimum.
1.15. Show that Q is not complete.
1.16. If s R is bounded and S0 S, show
Rn = R R . . . R = {(x1 , . . . , xn ) : xi R, i = 1, . . . , n}.
| {z }
ntimes
x = (x1 , . . . , xn ).
(i) x + y = y + x
(ii) (x + y) + z = x + (y + z)
(iii) x + O = O + x = x
i i
i i
muldown
i i
2010/1/10
page 8
i i
8 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
p+q kp
q
p
0
0 p
Figure 1.1. The sum of vectors p, q and the scalar kp, k > 1.
(iv) x + (1)x = O
(v) 1x = x, and 0x = O
(vii) (x + y) = y + x
Definition 1.15. The inner product or dot product of two vectors x and y is the
quantity
n
X
x y := xi yi = x1 y1 + . . . + xn yn .
i=1
Proposition 1.16.
(ii) x y = y x.
(iii) x (y + z) = x y + x z.
(iv) (x y = (x y) = x (y).
There is a famous inequality that links inner products and the norms of vectors
i i
i i
muldown
i i
2010/1/10
page 9
i i
Proof. We have
|x + y|2 = (x + y) (x + y) = x x + 2x y + y y
= |x|2 + 2x y + |y|2
|x|2 + 2|x| |y| + |y|2 , (CBS inequality)
2
= |x| + |y|
= |x + y| |x| + |y|.
The left-hand inequality follows from this just as in the scalar triangle inequality.
i i
i i
muldown
i i
2010/1/10
page 10
i i
10 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
I = I1 . . . In ,
I = {(x1 , x2 , . . . , xn ) : ai xi bi }.
k=1 Ik 6= .
i i
i i
muldown
i i
2010/1/10
page 11
i i
1.2.1 Functions
(x, y1 ), (x, y2 ) f = y1 = y2 .
11
Note that if f : 7 B, then {(y, x) : (x, y) f } is also a one-to-one function,
11
which is denoted by f 1 : B 7 A, and is called the inverse function of f .
If f : A 7 B and g : B 7 C are two functions, then the composition of g
with f is the function
Exercises
1.19. Prove Corollary 1.18.
1.20. Show |x + y|2 + |x y|2 = 2(|x|2 + |y|2 ) for all x, y Rn . (parallelogram
identity)
1.21. If x = (x1 , . . . , xn ), show
|xi | |x n sup{|x1 |, . . . , |xn |}, for i = 1, . . . , n.
i i
i i
muldown
i i
2010/1/10
page 12
i i
12 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
1.22. Show that |x + y|2 = |x|2 + |y|2 x y = 0. In this case, x and y are said
to be orthogonal. This is sometimes denoted by x y.
1.23. Is it true that
1.24. Two sets A and B have the same cardinality if there is a one-to-one function
: A 7 B such that (A) = B and 1 (B) = A. Show that the following
sets have the same cardinality:
(a) N = {1, 2, 3, . . .} and 2N = {2, 4, 6, 8, . . .}.
(b) [0, 1] and [0, 2].
(c) (0, 1) and (0, ) = {x : x > 0}.
(d) [0, 1] and [0, 1).
1.25. A set A is said to be finite if it has the same cardinality as some initial segment
{1, . . . , n} of the natural numbers, and is said to be infinite otherwise. (Thus,
finite means that the elements can be labeled a1 , . . . , an .) Show that a finite
set of real numbers contains its inf and its sup. Hint: Induction.
1.26. A set A is countable if it has the same cardinality as N, the set of natural
numbers, or if it is finite. Otherwise, the set is said to be uncountable. Show
that a countable set need not contain its sup or its inf. (Countability means
all elements of the set can be labeled by the natural numbers {a1 , a2 , a3 , . . .}.)
1.27. Show that the union of a countable collection of countable sets is countable.
Hint:
S1 : a1,1 a1,2 a1,3 . . .
S2 : a2,1 a2,2 a2,3 ...
S3 : a3,1 a3,2 a3,3 , ...
S4 : a4,1 a4,2 a4,3 , ...
.. .. .. .. .. .. .. ..
. : . . . . . . .
The elements may be counted by the scheme indicated . Deduce that Q is
countable.
1.28. [0, 1] is uncountable. Complete this sketch of proof: Suppose [0, 1] is count-
able and that
[0, 1] = {a1 , a2 , . . .}.
At least one of the intervals [0, 13 ], [ 13 , 23 ], [ 23 , 1] does not contain a1 ; call this
interval I1 . Subdivide I1 into three closed intervals, then a2 in not in one of
those three; call it I2 . Continuing in this manner, we obtain a nested sequence
of closed intervals, In , with the property that an 6 In . Therefore, none of
the an are in k=1 Ik . But, by the nested interval theorem, the intersection
is non-empty so there must be an x [0, 1] with x 6= an for any n. This
contradicts our assumption.
i i
i i
muldown
i i
2010/1/10
page 13
i i
1.3. Topology 13
1.2.2 Convexity
Exercises
1.29. Prove that {x : |x| = 1} is not convex.
1.30. Prove that {(x, y) R2 : y > 0} is convex.
1.31. Let C be any collection of convex sets. Show that AC A is convex. Is AC A
necessarily convex?
1.32. The convex hull H(A) of a set A is the intersection of all convex sets con-
taining A as a subset. Prove that H(A) is convex. What is H(A) if A is a
set consisting of two points only?
1.33. A subset C of Rn is a cone if {tx : x C} C for all t 0.
(i) Prove that a cone C is a convex set if and only if
{x + y : x C, y C} C.
(ii) Draw pictures of convex and non-convex cones in R2 .
1.3 Topology
Definition 1.29. If > 0 and x0 Rn , then the open ball of center x0 and radius
is the set
B(x0 , ) := {x : |x x0 | < }.
A neighborhood of x0 is any set U which contains an open ball with center x0 as a
subset.
A set A is open in Rn if it is a neighborhood of each of its points.
i i
i i
muldown
i i
2010/1/10
page 14
i i
14 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
1. Rn is open.
2. is open.
3. (0, 1) is open in R.
6. B(x0 , ) is open in Rn
Proof. (1),(2) and (5) are left to the reader. For (3) note that if x0 (0, 1), then
B(x0 , ) = (x0 , x0 + ) (0, 1) when = min(x0 , 1 x0 ).
For (4), note that B(0, ) = (, ) is not in [0, 1) for any > 0.
Finally, to see (6), let x1 B(x0 , ). We will show that B(x1 , 1 ) B(x0 , )
when 1 = |x1 x0 | > 0, so that B(x0 , ) is a neighborhood of each of its points,
and hence is open. Let x B(x1 , 1 ), then
|x x0 | = |x x1 + x1 x0 |
|x x1 | + |x1 x0 | (triangle inequality)
< 1 + |x1 x0 | (p B(x1 , 1 ))
= |x1 x0 | + |x1 x0 | definition of 1
= .
Proof. For (a), note that both and Rn are neighborhoods of their points (for ,
it is true because there are no points).
For (b), let x0 A B. Then
i i
i i
muldown
i i
2010/1/10
page 15
i i
1.3. Topology 15
For (c) let C be a collection of open sets, and select any x AC A. Then
x A for some A C
= B(x, ) A for some > 0 (A is open)
= B(x, ) AC A
= AC A is open
Ac := Rn \A := {x Rn : x 6 A}
is an open set.
(b) V \C is open.
C\V := {x : x C and x 6 V } = C V c .
i i
i i
muldown
i i
2010/1/10
page 16
i i
16 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
Notice that there are sets which are neither open nor closed (for example,
[0, 1) in R). The sets Rn and are both open and closed. We will see that they are
the only sets in Rn with this property.
i i
i i
muldown
i i
2010/1/10
page 17
i i
1.3. Topology 17
We must show that x0 is a cluster point of K. If this were not true, then there
would be a radius > 0 such that the ball B(x0 , ) does not intersect K:
B(x0 , ) K = .
K UG U.
A set K is Rn is compact if every open cover of K has a finite subcover; that is, for
any open cover G, there is a finite collection of sets U1 , . . . , Um from G such that
K m
j=1 Uj .
i i
i i
muldown
i i
2010/1/10
page 18
i i
18 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
3. (0, 1) is not compact. (Find the infinite open cover that does not have a finite
subcover.)
1 1 1
B(x0 , ) := {x : |x x0 | < } {x : |x x0 | } = Gck0 K c .
k0 k0 k0
Hence, K c is open and K is closed.
To show that K is bounded, consider the open cover consisting of all balls
B(O, k), k N. This collection covers all of Rn , and so is an open cover of K.
Therefore, there is a finite subcover {B(O, ki : i = 1, . . . , m} of K. Hence, K
B(O, k0 ) where k0 := max{k1 , . . . , km }. Thus, K is bounded.
= Let K be closed and bounded. The proof will be by contradiction. If K
is not compact, then there exists an open cover G = {G } of K such that K is not
contained in any union of finitely many G s. Since K is bounded, it is contained in
some closed interval I1 in Rn . Bisect the sides of I1 to obtain 2n subintervals (as in
the proof of the Bolzano-Weierstrass Theorem). At least one of these subintervals
intersects K in such a way that this intersection cannot be covered by finitely many
G s. Select such a subinterval as I2 . Proceeding inductively in this way, we obtain
a nested sequence of closed intervals Ik , k = 1, 2, . . ., such that
i i
i i
muldown
i i
2010/1/10
page 19
i i
1.3. Topology 19
Ik B(x0 , ) G0 ,
(i) A D 6= , B D 6= .
(ii) (A D) (B D) =
(iii) (A D) (B D) = D
3. [0, 1] is connected.
Proof. We will prove that [0, 1] is connected. Suppose it were disconnected and
(A, B) provides the disconnection:
i i
i i
muldown
i i
2010/1/10
page 20
i i
20 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
B
x
A
(i) A 6= , B 6= .
(ii) A B = .
(iii) A B = Rn .
Since A, B are open in Rn , both A1 and B1 are open in R (this is not obvious so
think how to show it). Now the statements (i), (ii) and (iii) above imply
i i
i i
muldown
i i
2010/1/10
page 21
i i
1.3. Topology 21
Corollary 1.46. The only sets in Rn which are both open and closed are Rn and
.
Proof. If the nonempty set A 6= Rn is both open and closed, then so is Ac . Then,
(A, Ac ) would provide a disconnection of Rn .
Exercises
1.33. Property (b) of Proposition 1.31 implies that the intersection of any finite
collection of open sets is open. Show that it is not true that this holds for an
infinite collection of open sets.
1.34. Prove items (a), (b), and (c) of Proposition 1.33.
1.35. Prove that the two definitions of a bounded set are equivalent.
1.36. Prove that the intersection of any finite collection of open sets is open. Hint:
Use Property (b) of open sets and induction.
1.37. Prove that {x : |x| 1} is closed in Rn .
1.38. Prove that a subset U of Rn is open if and only if it is the union of a collection
of open balls.
1.39. If A is a subset of Rn , then A, the closure of A, is the intersection of all closed
sets which contain A as a subset. Show that
(a) A is closed.
(b) A A.
(c) A = A.
i i
i i
muldown
i i
2010/1/10
page 22
i i
22 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
(d) A B = A B.
(e) = .
(f ) Observe that A is the smallest closed set containing A.
(g) Prove that B(O, 1) = {x : |x| 1}.
(h) If A and B are subsets of R, then is A B = A B?
1.40. If A is a subset of Rn , then A , the interior of A, is the union of all open sets
contained in A. Show that
(a) A is open.
(b) A A.
(c) A = A .
(d) (A B) = A B .
(e) Rn = Rn .
(f ) Observe that A is the largest open set contained in A.
(g) Prove that B(O, 1) = B(O, 1).
(h) Is there a subset A R such that A = and A = R?
1.41. Let A Rn . The derived set A of A is the set of all cluster points of A
(a) Prove that A is closed.
(b) Prove that A = A A .
(c) Theorem 1.39 says that A is closed if and only if A A. A set A for
which A = A is called perfect. Give examples of perfect and non-perfect
sets.
1.42. If A Rn , then A, the boundary of A, is the set of all points x such that
each neighborhood of x contains a point of A and a point of Ac .
(a) Show A is closed if and only if A A.
(b) Show that (A) = A; hence A is closed.
(c) Show that A = A\A .
1.43. For each of the following sets give its closure, interior, derived set, and bound-
ary.
(a) {x Rn : 0 < |x| < 1}
(b) { n1 R : n N}
1
(c) {( n1 , m ) R2 : n, m N}
(d) {x Rn : |x| < 1}.
1.44. Without using the Heine-Borel Theorem show that {(x, y) : x2 + y 2 < 1} is
not compact on R2 .
1.45. Let A and B be open in R. Prove that A B is open in R2 .
1.46. Show that a finite subset of Rn is closed.
i i
i i
muldown
i i
2010/1/10
page 23
i i
1.3. Topology 23
1.47. Show that a countable subset of R is not open. Show that it may or may not
be closed.
1.48. Let S be an uncountable subset of R. Show that S has a cluster point.
Hint: Show that at least one of the intervals [n, n + 1], n Z, must contain
uncountably many points of S.
1.49. Show that a closed interval is closed.
1.50. Let Q2 denote the set of points in R2 with rational coordinates. What is the
interior of Q2 ? The boundary of Q2 ? Show that Q2 is not connected.
1.51. Show that Qc is not connected. Show that (Q2 )c is connected, in fact, polyg-
onally connected.
1.52. Show that an open connected set in Rn is also polygonally connected.
i i
i i
muldown
i i
2010/1/10
page 24
i i
24 Chapter 1. The Real Number system & Finite Dimensional Cartesian Space
i i
i i
muldown
i i
2010/1/10
page 25
i i
Chapter 2
2.1 Sequences
Definition 2.1. A sequence in Rn is a function from p : N 7 Rn . Sequences are
usually denoted by {pk } where pk := p(k).
1. pk := k1 , {pk } is a sequence in R.
|pk p| < .
25
i i
i i
muldown
i i
2010/1/10
page 26
i i
If k = max{N1 , N2 }, then
1. xk = 1, k = 1, 2, 3, . . ., then limk xk = 1.
1
2. limk k = 0 (from the Archimedean Property, Theorem 1.6).
(b) {pk } is ultimately constant and equal to p. (That is, there is an N such that
k N = pk = p}.)
Definition 2.8. If {kj } is a sequence of natural numbers such that k1 < k2 < k3 <
. . ., then {pkj } is called a subsequence of {pk }.
Proof. There are two cases to consider. Either {pk : k N} is a finite set or it
is an infinite set. If it is a finite set, then there is at least one value p such that
pk = p for infinitely many k these terms in the sequence form a subsequence
of {pk }. In the other case, {pk : k N} is a bounded infinite set and so by the
Bolzano-Weierstrass Theorem 1.38, it has a cluster point p. For the ball B(p, 1),
there is a k1 such that pk1 B(p, 1). Suppose kj is such that pkj B(p, 1j ), then
i i
i i
muldown
i i
2010/1/10
page 27
i i
2.1. Sequences 27
1
there must exist kj+1 > kj such that pkj+1 B(p, j+1 ). So, by induction, there
exists a subsequence {pkj } of {pk } such that |pkj p| < 1j , j = 1, 2, 3, . . .. The
Archimedian Property implies limj pkj = p.
Proof. = Suppose limk pk = p, that is, for each > 0 there exists N such
that j N = |pj p| < . If {pkj } is a subsequence of {pk }, then
j N = kj j N = |pkj p| < .
If {pk } were convergent, these two limits would be the same number by the previous
theorem.
i i
i i
muldown
i i
2010/1/10
page 28
i i
then
(i) limk (pk + qk ) = p + q,
(ii) limk (pk qk ) = p q.
Further, if {xk } is a sequence in R such that limk xk = x, then
(iii) limk xk pk = xp, and
1
(iv) limk xk pk = x1 p, if xk , x 6= 0.
Proof. Exercise 2
The following result shows that it is sufficient to consider only sequences in R
when considering convergence or divergence.
i i
i i
muldown
i i
2010/1/10
page 29
i i
2.1. Sequences 29
Exercises
2.1. Show that the two definitions of the limit of a sequence are equivalent.
2.2. Prove Theorem 2.13
2.3. Finish the proof of = in Theorem 2.14.
2.4. Let xk zk yk . Prove that if {xk } and {yk } are convergent with limit c,
then {zk } is convergent with limit c.
2.5. Discuss the convergence or divergence of the sequences whose kth terms are
given by:
k
k
(a) k+1 (b) (1)
k+1
k
(c) 3k2k
2 +1
2k2 +3
(d) 3k2 +1 (e) ( k1 , k) (f) ((1)k , k1 )
i i
i i
muldown
i i
2010/1/10
page 30
i i
1 1
= < x k < 1, if k N
1 1 +
= x k 1 < < , if k N.
1+
Do the case |x| > 1.
1 1
2.15. Show limk k k = 1. HINT: Let xk = k k 1 > 0. Show k = (1 + xk )k >
k(k+1) 2
2 xk , and hence, limk xk = 0.
1
2.16. (Root Test) Let {xk } be such that lim |xk | k = r. Show
k
(a) If 0 r < 1, then limk xk = 0.
(b) If r > 1, then {xk } is divergent.
(c) If r = 1, then {xk } may be either convergent or divergent.
1
2.17. If a and b are nonnegative real numbers, show that lim (ak +bk ) k = max{a, b}.
k
k N = x < xN xk x,
that is, k N = |xk x| < . Thus, the increasing sequence converges to its
supremum.
i i
i i
muldown
i i
2010/1/10
page 31
i i
2.1. Sequences 31
i i
i i
muldown
i i
2010/1/10
page 32
i i
1 A 1 A
xk+1 = (xk + ) = L = (L + ) = L2 = A = L = A.
2 xk 2 L
The results on monotone sequences are interesting in that, unlike the proceed-
ing examples, it is not necessary first to guess the limit of a sequence in order to
show that it converges. Fortunately, we are able to do this in general.
Definition 2.19. A sequence {pk } in Rn is a Cauchy sequence if, for each > 0,
there exists a natural number N = N () such that if k, m N , then
|pk pm | < .
Proof. = Suppose limk pk = p. Thus, for each > 0, there exists N such
that k N = |pk p| < /2. Hence, if k, m N , we have
|pk pm | = |pk p + p pm | |pk p| + |p pm | < + = .
2 2
Thus, {pk } is a Cauchy sequence.
= We first show that {pk } is bounded and so by the Bolzano-Weierstrass
Theorem (Theorem 1.38) or Theorem 2.9 it has a convergent subsequence. Indeed,
choose N so that k, m N = |pk pm | < 1. Then,
i i
i i
muldown
i i
2010/1/10
page 33
i i
2.1. Sequences 33
1 1 1 1 1 1
|x2k xk | = + + ...+ + ...+ = .
k+1 k+2 2k |2k {z 2k} 2
k times
1
Therefore, |x2k xk | 2 for all k and {xk } cannot be a Cauchy sequence.
Exercises
2.17. Show that the following sequences are divergent by proving directly they are
not Cauchy sequences.
(a) {k} (b) {(1)k (1 k1 )}
2.18. Show directly that the following are Cauchy sequences and hence are conver-
gent.
(a) { k+1k } (b) {1 + 1!1 1
+ 2! + . . . + k!1
}.
2.19. Determine whether each of the following sequences is convergent or divergent.
In the case
n of convergence,
o find then limit. o
4k2 +k+4 k3 +k2 +1
(a) (1+k)(2+ k 3 (b) (k+1) 4
n 2 2
o
(k +1)
(c) (k+1)(k+2)(k+3) (d) {k 2 + k}
3 sin k +3k
(e) {1 k } (f) 3
k
n
sin k
+2k2 k2 +1 k
o
(g) 3
k (h) k3 +1 cos2 4
k k
(i) 3 [3] ,
where [x] denotes the greatest integer not exceeding x.
2.20. For what values of x are the following convergent, divergent? Wherever you
can, give
n the limit. o
xk
(a) (k+2)(k+1) (b) (k + 1)(k + 2)xk
n o n k k1 o
xk +k kx +x +1
(c) xk+1 +(k+1) (d) kxk1 +1
n ko k
x
(e) k! (f) k!x
2.21. If a1 = 2, ak+1 = 6 + ak , k 1, show that {ak } is increasing and
limk ak = 3.
i i
i i
muldown
i i
2010/1/10
page 34
i i
Hint: If r > 0 and > 0, then there exist positive numbers A, B and a natural
number N such that
k 1 k
lim 1 = e, where e = lim (1 + ) .
k (k!) k k k
1
2.28. Let sk = (1 + k1 )k , tk = 1 + 1! 1
+ 2! 1
+ . . . + k! . We have seen that both
sequences are convergent. Show that they have the same limit. Hint: First
use the Binomial Theorem to show
1 1 1 1 1 k1
sk = 1 + + (1 ) + . . . + (1 ) (1 ) tk .
1! 2! k k! k k
Next, with m fixed, and k m, show that
1 1 1 1 1 m1
sk 1 + + (1 ) + . . . + (1 ) (1 )
1! 2! k m! k k
and deduce that limk sk tm for each m.
2.29. Show that every sequence of real numbers has a monotone subsequence. (Be
careful.)
2.30. Let f be an ordered field. Show that the following statements are equivalent.
(a) F is complete (sets bounded above have a supremum in F ).
(b) F has the nested interval property.
i i
i i
muldown
i i
2010/1/10
page 35
i i
2.2. Continuity 35
2.2 Continuity
Let f : Rn 7 Rm , and let D Rn be the domain of f .
Example 2.24 Let f (x) = x2 , 1 x 1, then D = [1, 1], f (D) = [0, 1], and
|f (x) f (x0 )| = |x2 x20 | = |(x x0 )(x + x0 )| |x x0 |(|x| + |x0 |)
= |f (x) f (x0 )| 2|x x0 | for x, x0 [1, 1]
If > 0, take = /2, then |x x0 | < and x, x0 [1, 1] implies
|f (x) f (x0 )| < ,
so f is continuous on [1, 1].
i i
i i
muldown
i i
2010/1/10
page 36
i i
U f
0110..p
00
11
V
.
00p0
11 f(p0 )
.
f(p)
f(D)
D
n m
R R
Proof. = f continuous at p0 implies that for every > 0, there is a > 0 such
that for p D and |p p0 | < we have |f (p) f (p0 )| < . Now, {pk } D and
limk pk = p0 implies the existence of N such that k N = |pk p0 | < .
Hence, for this N we have k N = |f (pk ) f (p0 )| < ; that is limk f (pk ) =
f (p0 ).
= Suppose limk f (pk ) = f (p0 ) for every sequence {pk } in D with
limk pk = p0 . Assume f is not continuous at p0 . Then, negating the definition
of continuity, there exists an 0 > 0 such that each neighborhood U of p0 contains a
point pU for which |f (pU ) f (p0 )| 0 . Consider U = B(p0 , k1 ) and set pU = pk ,
for each k = 1, 2, . . .. Then limk pk = p0 , and |f (pk ) f (p0 )| 0 , so f (pk )
does not converge to f (p0 ), contrary to our hypothesis. Hence, the assumption that
f is not continuous at p0 is false.
Corollary 2.26.
Proof. This follows immediately from the corresponding theorem for sequences
Theorem 2.13.
i i
i i
muldown
i i
2010/1/10
page 37
i i
2.2. Continuity 37
by Theorem 2.14.
i i
i i
muldown
i i
2010/1/10
page 38
i i
||q| |q0 || |q q0 |.
Example 2.31 Some functions and inverse images of part of their range:
1 1 1 1 1 1 1
f ([0, ]) = [0, ], f ([ , 2]) = [ , 1], f 1 ([ , 3]) = [1. ] [ , 1].
2 4 2 4 4 2 2
i i
i i
muldown
i i
2010/1/10
page 39
i i
f c= f(S)
S
0
R2
R
Figure 2.2. Figure for Example 2.31 (3).
S2 S1 f 1=f(S)
1
0
S 1 = f(S )
S 2 2
1
R2
R
Figure 2.3. Figure for Example 2.31 (4).
2
(3) f (x, y) = x4 + y 2 , (x, y) R2 . Then f (R2 ) = {x R : x 0}. For any fixed
c > 0, the set S := {(x, y) : f (x, y) = c} is an ellipse in R2 , and f 1 ({c}) = S.
(4) f (x, y) = xy for (x, y) R2 . Then f (R2 ) = R. The inverse images of 1 and 1
are the hyperbolas S1 := {(x, y) : xy = 1} = f 1 (1) and S2 := {(x, y) : xy =
1} = f 1 (1), while
f 1 (V ) = U D.
i i
i i
muldown
i i
2010/1/10
page 40
i i
which implies
B(p0 , (p0 )) D f 1 (V ), p0 f 1 (V ). (2.1)
Hence, if we set
U := p0 f 1 (V ) B(p0 , (p0 )),
then U is open and (2.1) implies
U D f 1 (V ). (2.2)
f 1 (V ) U D. (2.3)
U D = f 1 (V ).
Proof. Suppose f (D) is not connected. Then there exist open sets V1 and V2 in
Rm such that
(i) V1 f (D) 6= and V2 f (D) 6= ;
(ii) (V1 f (D)) (V2 f (D)) = ; and
(iii) (V1 f (D)) (V2 f (D)) = f (D).
By the Global Continuity Theorem (Theorem 2.32), there exist open sets U1
and U2 in Rn such that
U1 D = f 1 (V1 ), U2 D = f 1 (V2 ).
Then
(i) U1 D 6= , U2 D 6= , from (i),
(ii) (U1 D) (U2 D) = , from (ii),
i i
i i
muldown
i i
2010/1/10
page 41
i i
Example 2.35 At any time there are two antipodal points on the equator at the
same temperature. Indeed, let T (x) be temperature. Then T (x + 2) = T (x).
We suppose T is continuous and consider f (x) = T (x + ) T (x). Then f (0) =
T () T (0), while f () = T (2) T () = T (0) T () = f (0). If f (0) = 0,
then T () = T (0); otherwise, f (0) and f () are of opposite sign and k = 0 is a
value between them. In the latter case, by Corollary 2.34, there is a point x0 , with
f (x0 ) = 0 = T (x0 + ) T (x0 ); i.e. T (x0 + ) = T (x0 ).
Proof. Let G be an open covering of f (D). For each V G there is an open set
U Rn such that
U D = f 1 (V ) by Theorem 2.32. (2.4)
Since {V : V G} covers f (D), the collection {U : U D = f 1 (V ), V G} covers
D. Since D is a compact set, there must be a finite subcover {U1 , . . . , Uk }. From
(2.4), the V s corresponding to the Ui satisfy f (Ui ) Vi so {V1 , . . . , Vk } covers
f (D). Thus, the cover G contains a finite subcover of f (D). Since we began with
an arbitrary cover of f (D), so f (D) must be compact.
Proof. f (D) is compact and therefore closed and bounded by the Heine-Borel
Theorem.
i i
i i
muldown
i i
2010/1/10
page 42
i i
M f (D), that is, that M = f (p0 ) for some p0 D. Suppose M 6 f (D), which
implies M > f (p) for all p D. Consider the function
1
g(p) = > 0, for p D.
M f (p)
Then g is continuous on D. (Why?) Since D is compact, g is bounded by Corollary
2.37. Hence, there exists a finite number A such that
0 < g(p) A, p D
1
= 0 < A, p D
M f (p)
1
= M f (p), p D
A
1
= f (p) M < M, p D.
A
The last inequality contradicts the fact that M is the least upper bound of f (D).
Hence, we must have M f (D).
Exercises
2.32. Prove that the two definitions of continuity are equivalent.
2.33. Let f (x) = x, x 0. Show that f is continuous on [0, ).
2.34. Show that every real polynomial of odd degree has at least one real root.
2.35. Show that x4 + 7x3 9 has at least two real roots.
2.36. Suppose f and g are continuous real valued functions on [0, 1] such that
f (0) < g(0) and f (1) > g(1). Prove that there exists x (0, 1) such that
f (x) = g(x). Hint: Draw a picture.
2.37. Give an alternative proof of Theorem 2.36 by showing directly that f (D) is
closed and bounded if f is continuous on D and D is closed and bounded.
Hint: If f (D) is not bounded, then there exists pk D such that |f (pk )| > k,
k = 1, 2, . . .. If f (D) is not closed, there is at least one cluster point q of
f (D) with q 6 f (D). The latter means there are points pk D so that
limk f (pk ) = limk qk = q 6 f (D). Show that each case leads to a
contradiction of the hypothesis.
2.38. Show that f (x) = x1 is not uniformly continuous on (0, 1].
i i
i i
muldown
i i
2010/1/10
page 43
i i
Here the depends only on (and possibly D), but not on the choice of points.
i i
i i
muldown
i i
2010/1/10
page 44
i i
Exercises
2.38. If f (x) = x1 , x 6= 0, then f is continuous on its domain.
2.39. Show that a polynomial
f (x) = an xn + an1 xn1 + . . . + a1 x + a0 , (ai constants)
is continuous on R. Hint: Show that f (x) = constant and g(x) = x are
continuous on R, and deduce the general result from Corollary 2.26.
i i
i i
muldown
i i
2010/1/10
page 45
i i
is continuous on R.
2.44. Is it possible for f and g to be discontinuous and yet g f be continuous?
2.45. Let f be continuous on D.
(a) Is it true that D open implies f (D) open? (This one is easy.)
1
(b) Is it true that D closed implies f (D) closed? Hint: Consider f (x) = 1+x2
on R.
1
2.46. If f (x) = 1+x 2 , then f is uniformly continuous on R.
i i
i i
muldown
i i
2010/1/10
page 46
i i
2.5 Limits
The notion of a limit which was introduced for sequence can, as you recall from first
year Calculus, be extended to any function.
Write
lim f (p) = L.
pp0
If > 0 is given, then 0 < |(x, y)| < 2 implies |f (x, y)| < .
(2) The function defined on R2 by
0, if y 6= x2 ;
f (x, y) =
1, if y = x2 ;
does not have a limit at O = (0, 0) since each neighborhood of (0, 0) contains
points (x, y) 6= (0, 0) at which f (x, y) = 0 and at which f (x, y) = 1.
i i
i i
muldown
i i
2010/1/10
page 47
i i
2.5. Limits 47
Just as in the case of sequences there is a Cauchy criterion for the existence
of limpp0 f (p).
Theorem 2.44 (Cauchy Criterion). The limit limpp0 f (p) exists if and only
if for each > 0, there is a > 0 such that
p, q D B(p0 , )\{p0 } = |f (p) f (q)| < .
(by the Cauchy Criterion and the choice of N1 ). Thus, limpp0 f (p) = L.
Theorem 2.45. The limpp0 f (p) exists and equals L if and only if limk f (pk )
exists and equals L for each sequence {pk } in D\{p0 } such that limk pk = p0 .
i i
i i
muldown
i i
2010/1/10
page 48
i i
i i
i i
muldown
i i
2010/1/10
page 49
i i
is 0. Notice that in this case, we have the limit along any line through the
origin is 0 at O, i.e.
lim f (tx0 , ty0 ) = 0 if (x0 , y0 ) 6= O.
t0
The moral of this is that when the limit properties of functions of more than
one variable are being considered, it is not enough to look at their behaviour on
straight lines.
Definition 2.52. The function f has an interior relative maximum (respectively, minimum)
at c if there is a neighborhood U of c such that
f (x) f (c) xU (respectively, f (x) f (c) x U ).
i i
i i
muldown
i i
2010/1/10
page 50
i i
f (x) f (c)
c < x < c + = 0
xc
f (x) f (c)
= lim 0.
xc xc
Similarly,
f (x) f (c)
c < x < c = 0
xc
f (x) f (c)
= lim 0.
xc xc
The two inequalities together imply f (c) = 0.
Proof. First, if f (x) = f (a) = f (b) for all x (a, b), then f (x) = 0 for all
x (a, b).
If there is an x1 (a, b) with f (x1 ) > f (a) = f (b), then f being continuous
on the compact set [a, b] achieves its maximum value m = sup{f (x) : x [a, b]} at
some point c (a, b). Thus, f has an interior relative maximum at c and f (c) = 0.
If there is an x1 (a, b) with f (x1 ) < f (a) = f (b), then f has an interior
relative minimum at some c (a, b) and f (c) = 0.
i i
i i
muldown
i i
2010/1/10
page 51
i i
Proof. Consider the function (x) = [f (x) f (a)](b a) [f (b) f (a)](x a).
The function is continuous on [a, b], exists on (a, b) and (a) = (b) = 0. By
Rolles Theorem, there is a c (a, b) such that
then
f (x) f (x)
lim = L = lim = L.
xb g (x) xb g(x)
Note: limxb f (x) = means that, for each real number N , there is a > 0
such that x (b , b) = f (x) > N .
Proof. Case (i): Suppose limxb f (x) = limxb g(x) = 0 and limxb fg (x)
(x)
=
L. If > 0 is given, there is a > 0 such that
f (x)
x (b , b) = L < . (2.8)
g (x)
Therefore,
f (x) f (c)
L = L < , for x (b , b).
g(x) g (c)
i i
i i
muldown
i i
2010/1/10
page 52
i i
f (x)
Thus, limxb g(x) = L.
Case (ii): Suppose limxb f (x) = limxb g(x) = and limxb fg (x)(x)
= L.
As before, (2.8) holds if x (b, b) for some > 0. However, we cannot define f (b)
and g(b) in this case so that f is continuous at b. Let x0 = b and x (b , b).
Then
f (x) f (x0 ) f (c)
= , for some c (b , b).
g(x) g(x0 ) g (c)
Thus, by (2.8), for x0 = b and x (b , b),
f (x) f (x0 ) f (x)
L = h(x) L < (2.9)
g(x) g(x0 ) g(x)
f (x0 )
1 f (x)
where h(x) = g(x0 )
. (2.10)
1 g(x)
Therefore,
f (x)
lim = L.
xb g(x)
Remark: Define limx = L if, for each > 0, there exists N such that x N
implies |f (x) L| < . LHospitals Rule is also valid if limxb is replaced by
limx . Minor changes are required for the proof however.
Exercises
x2 a2
2.54. If f (x) = for x 6= a, show that lim f (x) = 2a.
xa xa
2.55. Prove Theorem 2.45.
i i
i i
muldown
i i
2010/1/10
page 53
i i
i i
i i
muldown
i i
2010/1/10
page 54
i i
2.66. (Darboux Property of the Derivative) Let f (x) exist for each x [a, b] and
f (a) = , f (b) = . Suppose < < . Show that there is a c
(a, b) such that f (c) = . Hint: Show that g(x) = f (x) x must achieve
its minimum in (a, b). This exercise shows that derivatives, like continuous
functions, have the Intermediate Value Property. However, a derivative need
not be continuous on its domain. See Exercise 70 (i) and (ii).
2.67. Let
0, if 1 x 0;
f (x) =
1, if 0 < x 1.
Is there a function g such that g (x) = f (x) on 1 x 1.
2.68. The function defined by
x2 , x Q;
g(x) =
0, x 6 Q;
( ) ( )n1 (n1)
f () = f () + f () + . . . + f () + Rn (f ; , ),
1! (n 1)!
where
( )m ( )nm (n)
Rn (f ; , ) = f ()
m(n 1)!
for some point (, ). This is called Schlomilchs form of the re-
mainder. Special cases are given by taking
( )n (n)
m = n; Rn (f ; , ) = f () (Lagrange form)
n!
( )( )n1 (n)
m = 1; Rn (f ; , ) = f () (Cauchy form)
(n 1)!
The Lagrange form is the easiest to remember and is adequate for most
purposes. Hint: Let the constant C be defined by
( ) ( )n1 (n1)
f ()f () f (). . . f ()()m C = 0,
1! (n 1)!
( x) ( x)n1 (n1)
(x) = f ()f (x) f (x). . . f (x)(x)m C.
1! (n 1)!
i i
i i
muldown
i i
2010/1/10
page 55
i i
(b) (i) If f (x) = ex , show that lim Rn (f ; 0, x) = 0 for all x. That is, show
n
that x x2 xn1
lim 1 + + + ...+ = ex , x.
n 1! 2! (n 1)!
(b) (ii) If f (x) = sin(x), then limn Rn (f ; 0, x) = 0 for all x.
1
(b) (iii) If f (x) = 1x , x 6= 1, then limn Rn (f ; 0, x) = 0 for 1 < x < 1.
(c) (i) How large must I take n to approximate e to four decimal places by
the expression
1 1 1
1 + + + ...+ .
1! 2! (n 1)!
(c) (ii) Approximate 3 e to five decimal places.
(c) (iii) Prove that the closest integer to n!
e is divisible by (n 1). Hint:
Use Taylors formula for e1 .
(c) (iv) Compute 97 to four decimal places.
(d) Suppose f (x) exists and is non-negative (non-positive) in a neighbor-
hood of c and that f (c) = 0. Show that f has a relative minimum
(maximum) at c.
(e) Suppose f (x) exists in a neighborhood of c and is continuous at c. Show
that f has a relative minimum (maximum) at c if
2.70. Let f0 (x) = sin( x1 ), f1 (x) = x sin( x1 ), f2 (x) = x2 sin( x1 ), f3 (x) = x3 sin( x1 ),
for x 6= 0 and fi (0) = 0 for i = 0, 1, 2, 3.
(i) fi are differentiable at any point x 6= 0 (Chain Rule).
(ii) f0 is discontinuous at x = 0.
(iii) f1 is continuous at x = 0, but is not differentiable at x = 0.
(iv) f2 is differentiable at x = 0 but f2 is discontinuous at x = 0.
(v) f3 is differentiable at x = 0 and f3 is continuous at x = 0.
2.71. Let pn be defined by
dn 2
pn (x) = (x 1)n , n = 1, 2, . . . .
dxn
(i) Show that pn is a polynomial of degree n.
(ii) The equation pn (x) = 0 has exactly n roots in (1, 1).
2.72. Doesnt time fly when youre having fun?
i i
i i
muldown
i i
2010/1/10
page 56
i i
i i
i i
muldown
i i
2010/1/10
page 57
i i
Chapter 3
Riemann Integration
The definition of the Riemann integral is essentially the same in higher dimensions
as in one dimension. You may have learned Riemann integration of functions of one
variable by the lower and upper Riemann sums approach; the treatment adopted
here is slightly different but equivalent to that approach (see Exercise 17).
If p, q I, then |p q| (I).
57
i i
i i
muldown
i i
2010/1/10
page 58
i i
(0,0) (1,0)
1/k
() (b)
(5) If f is a continuous real valued function on [0, 1], then {(x, f (x)) : 0 x 1},
the graph of f , has content zero in R2 .
Proof. Let > 0. Since f is uniformly continuous on [0, 1] (Theorem 2.41),
there exists > 0 such that
i i
i i
muldown
i i
2010/1/10
page 59
i i
Figure 3.2. A covering of a graph of a continuous function on [0, 1] by
rectangles.
(6) The union of a finite collection of sets with zero content has zero content.
(7) A circle has zero content in R2 . Indeed, the circle of radius r can be written
as
the union of the two half circles which are the graphs of f (x) = r2 x2 ,
and thus has zero content by (5) and (6).
3.1.1 Partition of I
Let I be the rectangle
i i
i i
muldown
i i
2010/1/10
page 60
i i
I
Ii
|S(P, f ) | < ,
|1 2 | |S(P1 P2 , f ) 1 )| + |S(P1 P2 , f ) 2 | /2 + /2 = .
Exercises
3.1. If f is integrable on I, then f is bounded on I.
i i
i i
muldown
i i
2010/1/10
page 61
i i
1
|S0 (Q , f ) S0 (Qm , f )| < , , m k.
k
Thus, {S0 (Qk , f )} is a Cauchy sequence in Rm so
i i
i i
muldown
i i
2010/1/10
page 62
i i
The next corollary gives an equivalent but more readily applicable version of
the Cauchy Criterion.
R
Corollary 3.7 (Cauchy Criterion). I f exists if and only if for each > 0,
there exists a partition P such that if S1 (P , f ) and S2 (P , f ) are any two Riemann
sums corresponding to P , then
|S1 (P , f ) S2 (P , f )| < .
Proof. It is evident from Theorem 3.5 that the condition is necessary. To see that
it is sufficient, let P satisfy the requirement of Corollary 3.7, and let P and Q be
refinements of P , generating subintervals {Ak } and {Bj } respectively. Then
P P
|S(P, f ) S(Q, f )| = k f (pk )(Ak ) j f (qj )(Bj )
P hP P i
= i Ak Ii f (pk )(Ak ) Bj Ii f (qj )(Bj )
(3.5)
where Ii are the subintervals generated by P . Now there exists points xi , xi in Ii
such that
X X
f (pk )(Ak ) f (qj )(Bj ) f (xi ) f (xi ) (Ii ). (3.6)
Ak Ii Bj Ii
f (xi ) = max{f (pk ), f (qj )}, and f (xi ) = min{f (pk ), f (qj )},
k,j k,j
i i
i i
muldown
i i
2010/1/10
page 63
i i
where the max and min are taken over the k and j for which Ak Ii and Bj Ii ,
two finite sets. Then
X X X X
f (x ) (Ai ) f (xi ) (Bj ) f (pk )(Ak ) f (qj )(Bj )
Ak Ii Bj Ii Ak Ii Bj Ii
X X
f (xi ) (Ai ) f (x ) (Bj )
Ak Ii Bj Ii
so (3.6) follows.
From (3.5) and (3.6), we find
X
|S(P, f ) S(Q, f )| [f (xi ) f (xi )] (Ii ) = |S1 (P , f ) S2 (P , f )| < .
i
R
Theorem 3.8. If f is a continuous real-valued function on I, then I
f exists.
(Ii ) < .
Hence, if p, q Ii , then
and consequently, when S1 (P , f ) and S2 (P , f ) are any two Riemann sums corre-
sponding to P , we have
X
|S1 (P , f ) S2 (P , f )| = f (pi ) f (qi ) (Ii )
i
X X
|f (pi ) f (qi )|)(Ii ) < (Ii ) = (I) = .
i
(I) i
(I)
R
Thus, I f exists by the Cauchy Criterion (Corollary 3.7).
i i
i i
muldown
i i
2010/1/10
page 64
i i
(i) f is bounded on I,
(ii) the set of points of discontinuity of f has content zero.
R
Then I f exists.
Proof. Suppose
|f (p)| N, p I (3.7)
and K is the set of points of discontinuity of f in I. Let > 0. Since K has content
zero, there exists a partition Po of I such that if Ii are the subintervals generated
by Po , then X
(Ii ) < . (3.8)
4N
i,KIi 6=
Let
K
L
1
L
2
L1 = i,KIi 6= Ii , L2 = i,KIi = Ii .
Then f is continuous on L2 , which is a compact subset, and so f is uniformly
continuous on L2 . There exists > 0 such that p, q L2 and
|p q| < = |f (p) f (q)| < .
2(I)
i i
i i
muldown
i i
2010/1/10
page 65
i i
X k 2 X k 2
i1 j1 1 i j 1
2
S(Pk , f ) 2
i,j=1
k k k i,j=1
k k k
i i
i i
muldown
i i
2010/1/10
page 66
i i
Proof. The function f with its domain extended to the interval I D as in the
previous definition, is continuous at eachR point of I except possibly
R at points in
D. But (D) = 0, so R by Theorem 3.9, I f exists. Therefore, D f exists and, by
definition, is equal to I f .
R
Definition 3.13. A bounded set D Rn has content if D 1 exists and
Z
(D) := 1.
D
Rand let I be any closed interval containing D as a subset. Then D has content if
exists and then
I D Z
(D) := D .
I
Proof. Exercise 4.
R R
(c) If D f exists, then D |f | exists, and
Z Z
f |f |.
D D
i i
i i
muldown
i i
2010/1/10
page 67
i i
i i
i i
muldown
i i
2010/1/10
page 68
i i
By Corollary 2.38, D compact and f continuous on D implies that there exist points
p1 , p2 D with f (p1 ) = m and f (p2 ) = M . Therefore, by Corollary 2.34 (the
Intermediate Value Theorem), there exists p0 D such that
R Z
D
f
f (p0 ) = = f (p0 )(D) = f.
(D) D
Discussion: A subset of Rn has Jordan measure zero if, for each > 0, there is a
finite collection of intervals {Ik } such that
X
K k Ik and (Ik ) .
k
K has Lebesgue measure zero if, for each > 0, there is a countable collection
of intervals {Ik } satisfying the displayed relation. This simple extension of the
concept of measure zero has profound consequences. We know that the set of
rationals in [0, 1] does not have Jordan content. However, if this set is enumerated
as {rk : k = 1, 2, 3, . . .}, then rk Ik with (Ik ) = 2k yields
X
X 1
{rk : k = 1, 2, 3, . . .} k Ik and (Ik ) = = .
2k
k k=1
i i
i i
muldown
i i
2010/1/10
page 69
i i
Thus the set of rationals in [0, 1] has Lebesgue measure zero. In fact, the same
argument shows that Q, or any countable set of real numbers has Lebesgue measure
zero. Countable sets are not the only sets of Lebesgue measure zero however; in
fact, they can be quite complicated. The remarkable Cantor set which you will
study in Real Analysis has Lebesgue measure zero but nonetheless has the same
cardinality as [0, 1], i.e. it has zero length but has the same number of points
as [0, 1]. A simple discussion of the Cantor set can be found in Bartles book.
An interesting theorem of Lebesgue
R states that, for a bounded real-valued
function f , the Riemann integral I f exists if and only if the set of points in I at
which f is discontinuous has Lebesgue measure zero. For example, for the functions
in Exercises 15 and 16, the one in the first exercise is discontinuous at each point in
[0, 1], while the function in the second exercise is only discontinuous at the rational
points in [0, 1]
Exercises
3.3. Suppose f and g are bounded on I and f (p) = g(p) for all p I\K, where
K has content zero. Prove that
Z Z Z
f exists = g exists and equals f.
I I I
i i
i i
muldown
i i
2010/1/10
page 70
i i
R1
then 0 f = 0.
3.17. If P is a partition of the interval I and f : I 7 R, f bounded, define
S(P, f ) and S(P, f ), the lower and upper Riemann sums corresponding to
the partition P to be inf{S(P, f )} and sup{S(P, f )} repectively, the inf and
sup being taken over all Riemann sums corresponding to the partition P and
the function f . Let
Z Z
f = sup{S(P, f )} and f = inf P {S(P, f )},
I P I
the sup and inf here being taken over all partitions P of I.
(i) Show that Q P implies
R R
(ii) Show that f If.
I
R R
(iii) (Definition). f is integrable on I if f= I
f , and then we define
I
Z Z Z
f= f= f.
I I I
(iv) Show that f is integrable in this sense if and only if there is exactly one
number such that
S(P, f ) S(P, f )
R
for every partition P of I, in which case I f = .
R
(v) (Cauchy Criterion). Show that I f exists in this sense if and only if, for
each > 0, there exists a partition P of I such that
i i
i i
muldown
i i
2010/1/10
page 71
i i
F (x) = f (x), a x b.
Rx
Proof. Consider the function F (x) = a f , a x b which exists by Exercise 14,
we have from Theorem 3.15 (d) and (f) that
Z x+h
F (x + h) F (x) = f = f (ch )h,
x
F (x + h) F (x)
lim = lim f (ch ) = f (x).
h0 h h0
Rx
Proof. By the proceding Proposition, there is a constant C so that a f = F (x)
Ra
C. When x = a, this yields a f = 0 = F (a) C, while using this when x = b we
Rb
find a f = F (b) C = F (b) F (a).
i i
i i
muldown
i i
2010/1/10
page 72
i i
Then
Z (b) Z b
f= (f ) , or
(a) a
Z (b) Z b
f (x) dx = f ((u)) (u) du.
(a) a
d
F ((u)) = f ((u)) (u),
du
so that F is an antiderivative of (f ) . Therefore,
Z (b) Z b
f = F ((b)) F ((a)) = (f ) .
(a) a
You will have noticed that we used the Chain Rule here even though it was
not yet proven in these Notes. A proof will be given in the next chapter in a more
general context.
R /2
Example 3.20 To evaluate 0 sin2 (u) cos(u) du, we take f (x) = x2 , (u) =
sin(u), and (u) = cos(u). Then
Z /2 Z (/2) Z 1
2 2 1
sin (u) cos(u) du = x dx = x2 dx = .
0 (0) 0 3
Example 3.22
Z 1 1 Z 1 1
xex dx = xex ex dx = e (ex ) = e (e 1) = 1.
0 0 0 0
i i
i i
muldown
i i
2010/1/10
page 73
i i
R
Proof. The definition of f states that if > 0 is given, there is a partition P of
I
I such that Z
P P = S(P, f ) f < ,
I
P = {x0 , x1 , . . . , xm } {y0 , y1 , . . . , yk } P
m k Z
X X
= f (xi , yj )(xi xi1 )(yj yj1 ) f < ,
i=1 j=1 I
Condition (ii) implies that for any fixed set of numbers xi , i = 1, . . . , m, the parti-
tion {yj } of [c, d] may be chosen to be so fine that
X Z d
k
f (xi , yj )(yj yj1 ) f (xi , y) dy <
, i = 1, . . . , m,
b a
j=1 c
Rd
since each of the integrals c f (xi , y) dy exists. Therefore, for the xi we have
X
k
f (xi , yj )(yj yj1 ) F (xi ) < , i = 1, . . . , m. (3.11)
j=1 ba
i i
i i
muldown
i i
2010/1/10
page 74
i i
Rb R
Thus, from the definition of the integral, a F exists and equals I f.
Corollary 3.24 (Interchanging the order of integration). Let I = [a, b][c, d].
If
R
(i) I f exists
Rd
(ii) c
f (x, y) dy exists for each x [a, b],
Rb
(iii) a f (x, y) dx exists for each y [c, d],
(i) f is bounded on I,
Then "Z #
Z Z b d
f= f (x, y) dy dx.
I a c
R
Proof. Conditions (i) and (ii) imply that the integral I f exists by Theorem 3.9.
Condition (iii) says that the intersection of K with each vertical line (considered as
Rd
a set in R) has content zero so that c f (x, y) dy exists for each x [a, b] again by
Theorem 3.9. Therefore, all the conditions of Fubinis Theorem are satisfied.
i i
i i
muldown
i i
2010/1/10
page 75
i i
Proof. Use Corollary 3.25. The graphs of and have zero content in R2 . Each
vertical line intersects each graph once, and these two points have 1 dimensional
content zero. See Figure 3.5
D
a b
Figure 3.5. Area between two curves.
i i
i i
muldown
i i
2010/1/10
page 76
i i
y=x
2
y=x
The examples just given have iterated integrals which are all easily evalu-
ated. However, one sometimes encounters iterated integrals where the antideriva-
tives cannot be found in terms of elementary functions. A simplification is some-
times achieved by using Fubinis Theorem to reverse the order of the integration.
R 1 hR 1 i R
Example 3.28 (1) Let 0 y ey/x dx dy = D f . By Fubinis Theorem, this
integral may be rewritten as
Z 1 Z x Z 1 y=x Z 1
y/x y/x
e dy dx = xe dx = x(e 1) dx = (e 1)/2.
0 0 0 y=0 0
where
1, if 0 y 1
(y) =
y, if 1 y 2.
Hence, the last integral may be rewritten as
Z 1 Z 2 Z 2 Z 2
f (x, y) dx dy + f (x, y) dx dy.
0 1 1 y
i i
i i
muldown
i i
2010/1/10
page 77
i i
y=x
y=x
x=1 D x=2
x=1
D
y=0 y=0
Notes:
(1) The proof of Theorem 3.29 is exactly the same as that given before when n = 2.
(2) The symbols dp, dq above are simply used as devices to indicate the spaces
on which we are integrating.
(3) For a more general formulation of Fubinis Theorem where the condition (ii) is
dropped see Calculus on Manifolds by M. Spivak (p. 58). However, the above
statement of the theorem is sufficient for our needs.
(4) We will prove a change of variables formula for integrals in higher dimensions
in a later chapter.
Exercises
3.18. Let f be a real-valued function on [a, b] such that f (x) exists for each x [a, b]
Rb
and a f exists. Prove that
Z b
f (b) f (a) = f .
a
i i
i i
muldown
i i
2010/1/10
page 78
i i
D := {(x, y) : a x b, 0 y f (x)}.
R Rb
That is, show D 1 = a f .
3.20. Let D := {(x, y) : 1 x 3, x2 y x2 + 1}. Show that
Z "Z 2 #
3 x +1
(D) = dy dx = 2.
1 x2
y
y=1
y=x 2
y=(x3)
4 x
0
Figure 3.8. Figure for Exercise 22.
R
3.23. Find D f where f (x, y) = sin( xa + yb ) and D = {(x, y) : 0 x a/2, 0
y b/2}.
3.24. Let D be the region bounded
R by the curves x2 y 2 = 1, x2 + y 2 = 4, which
contains (0, 0). Find D f where f (x, y) = x2 .
R
3.25. Let f (x, y) = x. Prove that D f = 77/4, where D is the region in R2
illustrated in Figure 3.9.
3.26. Let f (x, y) = g(x) with (x, y) [a, b] [c, d] = I.
Rb R R
(i) Prove that if a exists, then I f exists. Deduce from this that D f exists
where D is any subset of I which has content.
R1
(ii) Suppose g is defined on [0, 1] and 0 g exists. Prove that
Z 1 Z 1 Z 1
g(t) dt dx = tg(t) dt.
0 x 0
i i
i i
muldown
i i
2010/1/10
page 79
i i
y y=4
1 y=(2x+4)/3
0
2
y=x 4x + 5
x=0 000
111
000
111
00
11
00
11
y=x/2
x=4
x
0
R R
(i) If I
f exists, then I
f 2 exists. [Hint:
Can you replace continuous by a less restrictive condition which still im-
plies the result? Hint: [What was Darbouxs first name?]
3.29. (Cavalieris Principle) Let A and B be subsets of R2 with content. If x R,
define
Ax := {y : (x, y) A}, Bx := {y : (x, y) B}.
(sections of A and B). Suppose that, for each x, Ax and Bx have content
in R and 1 (Ax ) = 1 (Bx ). Prove that 2 (A) = 2 (B). [Hint: Spell Fubini.]
R1
3.30. Let f be a real-valued function on [0, 1] such that 0 f exists. Define ak by
k
1X j
ak := f ( ), k = 1, 2, 3, . . . .
k j=1 k
i i
i i
muldown
i i
2010/1/10
page 80
i i
f (x, y) dx dy = f (x, y) dy dx+ f (x, y) dy dx.
0 y 0 0 1 0
R
3.35. Show that D 1 = 61 where D := {(x, y, z) : x 0, y 0, z 0, 0
x + y + z 1}.
3.36. Let D be a subset of R2 with content and f be a positive continuous function
on D. Use Fubinis Theorem to show that if
then Z Z
3 (K) = 1= f.
K D
Deduce that m2 (D) 3 (K) M 2 (D) where m, M are lower and upper
bounds for f on D.
3.37. Evaluate Z 2Z 2 Z Z
2 sin(y)
ex dx dy and dy dx.
0 y 0 x y
3.38. Show "Z 2 #
Z 1 1y
y
sin dx dy = 1.
0 1y 2 1 x2
Explain carefully why each integral you consider exists.
3.39. See also exercises pp. 122-124 in Buck.
i i
i i
muldown
i i
2010/1/10
page 81
i i
Chapter 4
Differentiation 0f
Functions of Several
Variables
4.1 Preliminaries
4.1.1 Linear Functions
Definition 4.1. A function L : Rn 7 Rm is linear, if for all p, q Rn and R,
there holds
(i) L(p + q) = L(p) + L(q), (L is additive)
(ii) L(p) = L(p), (L is homogeneous).
If v0 Rm and L is linear, then the function M :
Rn 7 Rm defined by M (p) = v0 + L(p). is an affine function.
Linear algebra plays a prominent background role in this chapter as the next
theorem suggests.
81
i i
i i
muldown
i i
2010/1/10
page 82
i i
If ej is the vector with 1 is the jth coordinate and has zeros elsewhere, j = 1, . . . , n,
then
p = (x1 , . . . , xn ) = x1 e1 + . . . + xn en =
L(p) = L(x1 e1 + . . . + xn en ) = x1 L(e1 ) + . . . + xn L(en ).
z
3 3
L(R ) v +L(R )
0
u vo v o+u
(0,0,0) y
(0,1,0)
x
Figure 4.1. An affine plane in R3 .
Example 4.3
1 1 0
rank 1 1 0 = 2.
0 1 1
i i
i i
muldown
i i
2010/1/10
page 83
i i
4.1. Preliminaries 83
Thus, for the linear map L : R3 7 R3 corresponding to this matrix, the range
L(R3 ) has dimension 2. In fact, it consists of all vectors of the form (s, s, t), that is
the plane x = y. An example of an affine space in R3 is (0, 1, 0) + L(R3 ), which is
the parallel plane through the point (0, 1, 0), i.e., the plane y = x + 1.
|L(p)| M |p|, p Rn .
Proof. Using the matrix representation of a linear map, we have by the Cauchy-
Schwartz inequality
v v
n uX u n
X u n 2 uX
|yi | = |Li (p)| = | Ci,j xj | t Ci,j t x2j
j=1 j=1 j=1
v v v
um uXm X
n uX
uX u u n 2
= |L(p)| = t yi2 t 2 t
Ci,j xj
i=1 i=1 j=1 j=1
v
uX
um X
n
or |L(p)| M |p| where M := t 2 .
Ci,j
i=1 j=1
The next theorem says that for one-to-one linear functions, the inequality in
Theorem 4.4 can be reversed!
i i
i i
muldown
i i
2010/1/10
page 84
i i
Exercises
4.1. Let L : R2 7 R3 be linear with L(e1 ) = (2, 1, 0), L(e2 ) = (1, 0, 1), where
e1 = (1, 0), e2 = (0, 1). Find L(2, 0), L(1, 1), L(1, 3). Draw pictures.
4.2. Show that L(R2 ) 6= R3 for the function in the last exercise.
4.3. Show that if L : R2 7 R3 is linear, then L(R2 ) 6= R3 .
4.4. Let L : R3 7 R2 be linear. Show that there are non-zero vectors p R3 such
that L(p) = O.
a b
4.5. If L : R2 7 R2 has a matrix , show that
c d
(i) L(R2 ) is a point if and only if a = b = c = d = 0.
(ii) L(R2 ) is a line if and only if = ad bc = 0 and a2 + b2 + d2 + c2 > 0.
(iii) L(R2 ) = R2 if and only if 6= 0.
4.6. Show that in case (iii) of the last exercise, that L is one-to-one (and only in
that case), and that the inverse function L1 is linear with matrix
d b
.
c a
4.7. Show that the sum and composition of two linear functions are linear. What
are the matrix representations of these?
4.8. If L : Rn 7 Rm is linear and one-to-one, then L1 is linear on its domain.
4.9. Let f : Rn 7 Rm be such that
(i) f (p + q) = f (p) + f (q), for all p, q Rn (additive)
(ii) f is continuous at O.
Prove that f is a linear function.
4.10. Let f : R 7 Rm be such that
i i
i i
muldown
i i
2010/1/10
page 85
i i
4.1. Preliminaries 85
Notice that f must be linear since f (x) = f (1)x. Show that homogeneity does
not imply linearity for functions of more than one variable. HINT: Consider
the function ( 3 3
x +y
2 2, if (x, y) 6= O
f (x, y) = x +y
0, otherwise.
c+tu
p+t(qp)
u
qp q
c p
0 0
Notice that this representation of a line is not unique; you may replace u by
any multiple u, 6= 0, and still get the same line.
If f : R 7 Rn , then we say that the set {f (t) : t R} is a curve in Rn
(a continuous curve if f is continuous). The line through f (0 ) and f ( ) when
f ( ) 6= f (0 ), is {f (0 ) + t(f (0 ) f ( )) : t R}. or equivalently,
f ( ) f (0 )
{f (0 ) + t : t R}.
0
f ( ) f (0 )
Thus, it is the line through f (0 ) in the direction . We therefore define
0
f( )
f( )
0 0
n
R
i i
i i
muldown
i i
2010/1/10
page 86
i i
f ( ) f (0 )
f (0 ) = lim exists.
0 0
Evidently,
f1 ( ) f1 (0 ) fn ( ) fn (0 )
f (0 ) = lim , . . . , lim = (f1 (0 ), . . . , fn (0 )).
0 0 0 0
f (c + tu) f (c)
lim , (t R)
t0 t
exists, then this limit is called the directional derivative of f at c in the
direction of u, and is denoted fu (c).
(ii) If ei is the vector in Rn with 1 in the i-th coordinate and zeros elsewhere, then
fei (c) is usually denoted by
f
(c)
xi
and is called the partial derivative of f at c with respect to the variable xi ,
i = 1, . . . , n.
When we compute
f (c + tu) f (c)
lim
t0 t
we are in fact restricting our attention to the behaviour of the function f at c with
respect to a straight line through c in the direction of u. This straight line in Rn
(at least the portion of it that lies in D) is mapped by f into a curve in f (D) Rm .
i i
i i
muldown
i i
2010/1/10
page 87
i i
f(c+tu)
f f(D)
D f(c)+tf(u)
c+tu
f(c)
c
n m
R R
The vector fu (c) is the direction of the tangent to this curve in Rm . (See Figure
4.4.)
It is also instructive to consider the graph G(D) Rn+m of f , where G :
R 7 Rn+m is defined by G(p) = (p, f (p)), p D. Computing
n
(c,f(c))=G(c)
(c+tu,f(c)+tf (c))=G(c)+t G (c)
G(D) u u
(c+tu,f(c+tu))=G(c+tu)
c
n D c+tu
R
i i
i i
muldown
i i
2010/1/10
page 88
i i
y z
2 G
f(R ) z=x 2+ y 2
f
(1,0) x y
0 (0,0) (0,0,0)
3
R
R
2
R x
(2) For the function f (x1 , x2 ) = (x1 , x2 , x21 + x22 ) and points c = (c1 , c2 ). u =
(u1 , u2 ), we have
f (c + tu) f (c)
t
1
= (c1 + tu1 , c2 + tu2 , (c1 + tu1 )2 + (c2 + tu2 )2 ) (c1 , c2 , c21 + c22 )
t
1
= (tu1 , tu2 , 2tu1 c1 + 2tu2 c2 + t2 u21 + t2 u22 )
t
= (u1 , u2 , 2u1 c1 + 2u2 c2 + tu21 + tu22 ).
Therefore,
1
fu (c) = lim (f (c + tu) f (c)) = (u1 , u2 , 2u1 c1 + 2u2 c2 ).
t0 t
Notice that
u1 1 0
u
u2 = 0 1 1
u2
2u1 c1 + 2u2 c2 2c1 2c2
i i
i i
muldown
i i
2010/1/10
page 89
i i
(5) This example shows that the directional derivative does not have to be linear
in u. Define f : R2 7 R by
( 2
x y
2, when x3 =6 y2;
f (x, y) = x y
3
0, when x3 = y 2 .
The preceding examples show that when f is a nice function at c, then fu (c)
is linear in u, but in general it does not need to be linear even if it exists for all u.
Observe however, that all the fu (c) in the above examples are homogeneous in u,
that is, fu (c) = fu (c) for all R. When they fail to be linear, it is the additive
property that fails. The homogeneity in u is always true when fu (c) exists.
Exercises
4.11. Prove that if fu (c) exists, then fu (c) = fu (c).
4.12. Prove that the expressions given in the directional derivatives of (4) and (5)
in Example 4.10 are correct.
4.13. Check that the following partial derivatives are correct:
(a) The function f (x, y) = x2 + y 3 has partial derivatives
f f
(x, y) = 2x, (x, y) = 3y 2 ,
x y
i i
i i
muldown
i i
2010/1/10
page 90
i i
Motivation: At first glance it seems that the directional derivative determines the
properties of functions in the same way that the derivative in one variable does.
Unfortunately, that is not quite the case. For example, in Example 4.10 (4), we
have shown that a function may be discontinuous at a point where all directional
derivatives exist. Therefore, a more rigorous approach must be used to extend the
role of differentiation. By reforulating the definition, we can realize the expected
properties, and hence the usefulness, of the derivative. To this end, we reformulate
the definition of the derivative of a function in one variable to show case the linearity
as found in Example 4.10 (3).
i i
i i
muldown
i i
2010/1/10
page 91
i i
(x,f(x))
f(x)
(c,f(c)) (x,f(c)+L(xc))
f(c)
c x
Proof. This follows because the limit exists if and only if the limit of each compo-
nent exists (this uses the equivalence of d(p, c) and d (p, c)):
|f (p) f (c) L(p c)|
lim =0
pc |p c|
|fi (p) fi (c) Li (p c)|
lim = 0, i = 1, . . . , m
pc |p c|
The matrix for the differential must have the form provided by the next the-
orem.
i i
i i
muldown
i i
2010/1/10
page 92
i i
This
h matrix
i is the Jacobian Matrix of f at c, and can be denoted also as
f
x (c) , or simply f (c).
Here we used the homogeneous property of linearity, Df (c)(tu) = tDf (c)(u) for
the second step.
(ii) recall from Theorem 4.2 that the matrix of L is [Ci,j ] = [Li (ej )]. Here
Notes: In the case when n = m, the determinant det f (c) is called the Jacobian
of f at c and is often denoted by
(f1 , . . . , fn )
Jf (c) or (c).
(x1 , . . . , xn )
n
Proof. fu (c) is unique (from the uniqueness of limits) for each u R h . Alter-
i
fi fi
natively, the partials x j
(c) are unique and hence the matrix f (c) = xj (c) is
unique.
i i
i i
muldown
i i
2010/1/10
page 93
i i
Proof. Let L = Df (c). From the definition of Df (c), given > 0, there is a
() > 0 such that
i.e. |f (p) f (c) L(p c)| < |p c|. In particular, with = 1, we have
p = p0 , p1 = (1 , x2 , . . . , xn ), p2 = (1 , 2 , x3 , . . . , xn ), . . . , c = pn .
p = ( ,2)
2 1
i i
i i
muldown
i i
2010/1/10
page 94
i i
Now on each line segment between pk1 and pk , k = 1, . . . , n, we are really consid-
ering a real-valued differentiable function of a real variable so we may use the Mean
Value Theorem.
f
f (pk1 ) f (pk ) = (p )(xk k ) (4.4)
xk k
where pk is on the line segment from pk1 to pk , k = 1, . . . , n. Clearly, pk {p :
|p c| < } by (4.2), since this set is convex. From (4.3) and (4.2), we find
n
X f
f (p) f (c) = (p )(xk k ).
xk k
k=1
(Cauchy-Schwartz)
n|p c|
|f (p)f (c)L(pc)|
= |pc| < n , when |p c| <
Please note that the last Theorem gives only a sufficient condition for the
function to be differentiable at a point.
i i
i i
muldown
i i
2010/1/10
page 95
i i
In other words, Df (0, 0)(u1 , u2 ) = (u1 , u2 , 0) for each (u1 , u2 ) R2 . Check for
yourself that
Df (0, 1)(u1 , u2 ) = (u1 , u2 , 2u2 ),
and
Df (1, 1)(u1 , u2 ) = (u1 , u2 , 2u1 + 2u2 ).
i i
i i
muldown
i i
2010/1/10
page 96
i i
x2 y3
f
2
f(R )
x1
y2
2
f(0,0)+L(R )
2 3
R R
y1
t z
f
f(R)
y
0 f(0)+L(R)
x 3
R R
z
f
y
3
R
x R
Exercises
4.16. In Example 4.22, sketch the range of f (R2 ) (it is a curve in R2 ). Find the
tangent at one or two points of the range.
4.17. Let f : R2 7 R2 be defined as f (x, y) = (x2 + y, y 2 ). Check that
2x0 1
f (x0 , y0 ) = .
0 2y0
i i
i i
muldown
i i
2010/1/10
page 97
i i
Deduce that the largest value of fu (c) under the restriction |u| = 1 is |f (c)|
and this value is attained when u = f (c)/|f (c)|. This means that the
direction of maximum rate of increase of f at c is the direction of the gradient
vector. [Hint: What was Cauchy-Schwartz first name?]
i i
i i
muldown
i i
2010/1/10
page 98
i i
4.24. Sketch the surface {(x, y, xy) : (x, y) R2 } in R3 (i.e. z = xy) and show that
the tangent to this surface at the point (1, 1, 1) is (1, 1, 1) + {(u, v, u + v) :
(u, v) R2 } (i.e. z + 1 = x + y).
4.25. If L : Rn 7 Rm is linear, then the differential L exists and equals L at each
point in Rn .
i i
i i
muldown
i i
2010/1/10
page 99
i i
The last inequality was the result of several applications of the Cauchy-Schwartz
inequality. As p c in the last expression, the first term goes to zero since
is continuous and D(c) exists, the second term goes to zero since D(c) exists,
while the third term goes to zero by the continuity of at c and the fact that
|L (p c)| M |p c| for some constant M since L is linear.
(i) is differentiable at c Rn
that is,
f1 f1 1 1 1 1
x1 ... xn y1 ... ym x1 ... xn
. .. .. . .. .. .. .. ..
.. . . = .. . . . . . .
f f m m
x1 ... xn y1 ... ym x1 ... xn
X i m
fi k
(c) = (b) (c), i = 1, . . . , , j = 1, . . . , m.
xj yk xj
k=1
i i
i i
muldown
i i
2010/1/10
page 100
i i
Less precisely,
f u v f u v
= + , = + .
x u x v x y u y v y
As a particular example, if h(r, ) = g(u, v) where u = r cos() and v = r sin(),
then
h g g h g g
= cos() + sin(), = (r sin()) + r cos().
r u v u v
i i
i i
muldown
i i
2010/1/10
page 101
i i
The Chain Rule in higher dimensions is more interesting than in one dimen-
sion; for example, it includes the rules for differentiating sums and products as
special cases. Thus, Theorem 4.23 is a corollary to Theorem 4.24. This can be seen
by considering the functions
F : Rn 7 R2m , F (p) = ((p), (p))
G : R2m 7 Rm , G(q1 , q2 ) = q1 + q2 , q1 , q2 Rm
H : R2m 7 R, H(q1 , q2 ) = q1 q2 .
i i
i i
muldown
i i
2010/1/10
page 102
i i
Then G F = + and H F = .
The Mean Value Theorem does not hold for functions f : Rn 7 Rm if m > 1.
Can you tell why? If you cannot, try to carry out a proof with m > 1. See Exercise
30 for what can be said.
Exercises
4.26. Let f : R 7 R be differentiable. If F : R2 7 R is defined by
(a) F (x, y) = f (xy), then x F F
x = y y ,
i i
i i
muldown
i i
2010/1/10
page 103
i i
4.27. Define f : R2 7 R by
(x2 + y 2 ) sin 2 1 2 , if (x, y) 6= (0, 0)
x +y
f (x, y) =
0, if (x, y) = (0, 0).
x5 + y 5 + z 5
x3 + y 3 + 3x2 y, , x2 y 7 + 2z 4 x5 ,
(x + y + z)5
are homogeneous of degree 3, 0, 9 respectively. Prove that if f is differentiable
and homogeneous of degree m, then
f f
x f = x1 + . . . + xn = mf.
x1 xn
[Hint: Differentiate the formula defining the concept of homogeneity with
respect to t and set t = 1. Was Euler an Edmonton hockey player???]
i i
i i
muldown
i i
2010/1/10
page 104
i i
[Hint: Let
Z d
f
(x) = (x, y) dy
c x
then
R x is continuous by part (i). Use the Fubini Theorem to show that
a
= F (x) F (a). Hence, F is an antiderivative of so F exists and
equals .]
(iii) Suppose (x) and (x) have continuous derivatives on [a, b], and f and
f
x are continuous on I. Show
Z (x)
d
f (x, t) dt = f (x, (x)) (x) f (x, (x)) (x)
dx (x)
Z (x)
f
+ (x, t) dt.
(x) x
i i
i i
muldown
i i
2010/1/10
page 105
i i
Rb dx
(v) Evaluate 0 x2 +a2
, a > 0, and from your result deduce that
Z b
dx 1 b b
2 2 2
= 3 arctan + 2 2 , a > 0.
0 (x + a ) 2a a 2a (b + a2 )
and, in particular,
2f f
2 = .
xi xi xi
These are the partial derivatives of second order. Partial derivatives of third and
higher order are similarly defined.
f f
= 3x2 + 2y, = 6y + 2x
x y
2f 2f 2f 2f
2
= 6x, =2= , = 6.
x xy yx y 2
It is usually the case, (but not always, see Exercise 13, page 349 in Buck),
that the successive partial derivatives may be taken in any order we please, e.g., in
the above example we have seen
2f 2f
= .
xy yx
The following two theorems give sufficient conditions for this. There is no loss
of generality in the fact that these theorems are proved in R2 only; we are only
concerned with the behaviour of f with respect to two of the variables in any case.
i i
i i
muldown
i i
2010/1/10
page 106
i i
2f 2f
,
xy yx
exist and are continuous on an open set U R2 , then they are equal at each point
of U .
2f 2f
(x0 , y0 ) 6= (x0 , y0 )
yx xy
for some (x0 , y0 ) U . We may even say that one is larger than the other, say,
2f 2f
(x, y) (x, y) > 0,
xy yx
in a small interval I containing (x0 , y0 ) by continuity of the mixed partials. But
then Z 2
f 2f
J1 J2 = > 0,
I xy yx
by Theorem 3.15(a); a contradiction. Hence, we must have equality throughout U .
The next Theorem is more general than the one just proved, but is also more
difficult to prove.
i i
i i
muldown
i i
2010/1/10
page 107
i i
2 f
(ii) xy is continuous at (x0 , y0 ),
2 f 2f
then yx (x0 , y0 ) exists and equals xy (x0 , y0 ).
as follows
f f
(x0 , y0 + k) (x0 , y0 ) = [f (x, y0 + k) f (x, y0 )]x=x0
x x x
[f (x0 + h, y0 + k) f (x0 + h, y0 )] [f (x0 , y0 + k) f (x0 , y0 )]
= + (h),
h
with (h) 0 as h 0 by (a)
g(y0 + k) g(y0 ) dg k
= + (h) = (y0 + h,k k) + (h)
h dy h
h i
f f
y (x0 + h, y0 + h,k k) y (x0 , y0 + h,k k)
= k + (h), where |h,k | < 1 by (b)
h
i i
i i
muldown
i i
2010/1/10
page 108
i i
2f
=k (x0 + h,k h, y0 + h,k k) + (h), where |h,k | < 1 by (c).
xy
If h = h(k) is chosen to be small enough that |h| < |k| and |(h)| < |k|2 , then, after
dividing by k, we obtain (4.6) with x = x (k) = x0 +h,k h, y = y (k) = y0 +h,k k,
and (h)/k as the term that goes to zero with k. Since
using (4.7) provided |k| < /2, for then |(h)/k| < |k| < /2.
Notation: Let D Rn . By C(D) we will denote the set of all continuous functions
on D. By C k (D), we mean all functions defined on D having all k-th order partial
derivatives continuous on D. The range of the functions will be clear from the
context in which the notation is used.
We first give Taylors Theorem in an expanded form emphasizing two-variables.
The Theorem can be stated cleanly in a form similar to the univariate form even in
higher dimensions which we do in a second pass.
i i
i i
muldown
i i
2010/1/10
page 109
i i
i i
i i
muldown
i i
2010/1/10
page 110
i i
Below we phrase Taylors Theorem using the gradient notation, which is sim-
pler in form but richer in meaning. When properly viewed it allows the concept to
expand to several variables without unnecessary clutter. (Unfortunately, our minds
sometimes demands a struggle with the clutter before we can fully conceptualize.
Dont be afraid to get dirty with this.) Before stating Taylors Theorem, we do
some house-keeping on notation and look at iterations of the particular operator
u . Look at how the gradient operator is used to define, for a fixed direction u,
the directional derivative as an operator on differentiable functions:
n
X f
Given u in Rn : u = u : f 7 uj .
j=1
xj
Pn f
If the resulting function j=1 uj x j
: Rn R is differentiable, the operator may
be applied once again. Quite literally, this gives
n
X n n n n
X f X X 2f X 2f
u uj = u uj = u uj . (4.10)
x j=1 xj j=1
x xj x xj
=1 =1 ,j=1
Since we can only apply this operator again if all the partial derivatives of each
kf
(4.11)
xj1 xjk
i i
i i
muldown
i i
2010/1/10
page 111
i i
Proof. We will reduce the theorem to the univariate Taylor theorem. In fact, one
k
can see directly the analogy when one views (b a) f as the k-th directional
derivative in the direction b a.
Let (t) := a + t(b a), and define
F (t) := f ((t)), 0 t 1.
Then by the Chain Rule for several variables, we have
b 1 a1
f f ..
F (t) = Df ((t))(b a) = ((t)), . . . , ((t)) .
x1 xn
b n an
f f
= (b1 a1 ) ((t)) + . . . + (bn an ) ((t)) = (b a) f ((t)).
x1 xn
Applying this formula inductively to the result, we find
k1 k
F (k) (t) = (b a) (b a) f ((t)) = (b a) f ((t)). (4.12)
where
F (N ) (t0 )
RN = , for some t0 , 0 < t0 < 1.
N!
Substituting (4.12) into this formula and noting (0) = a and (1) = b, we obtain
the Theorem with c = a + t0 (b a).
i i
i i
muldown
i i
2010/1/10
page 112
i i
Points that satisfy (4.13) are called stationary or critical points. These are
the potential points for relative max and mins. At a stationary point a, Taylors
theorem of order 2 reads
2
f (b) = f (a) + (b a) f (c)/2.
Thus, we find
2
(b a) f (c) > 0 = f (b) > f (a), and
2
(b a) f (c) < 0 = f (b) < f (a).
2
Hence the behavior of the operator (b a) f is important for determining
whether the critical point is a max or min.
We will view this a little differently, using matrix notation. Let u Rn , and
consider (u )2 f written in some imaginative ways:
(u )2 f = (u ) (u )f = uT T f u
x1 u1
. f f ..
= [u1 , . . . , un ] .. [ x 1
, . . . , xn ] .
un
x
n2 f 2
x1 x1 . . . x1 x f
n
2
f 2f u
x2 x1 . . . x2 xn .1
= [u1 , . . . , un ] .
.. .. .. . (4.14)
. . . un
2 f 2
xn x1 . . . xn x
f
n
Pn 2
f
u
=1 x1 x
..
= [u1 , . . . , un ]
.
Pn 2 f
=1 xn x u
Pn Pn 2
= j=1 =1 uj u xj x
f
.
i i
i i
muldown
i i
2010/1/10
page 113
i i
(u )2 f (c) = uT Ac u.
then f (a) is neither a max nor a min, and is called a saddle point because
f (b) > f (a) while f (b ) < f (a).
5. Since the second order derivatives are continuous, the function in (4.14) is
continuous in u = b c and therefore, (1)-(4) will be true for Ac replaced by
Aa if b is sufficiently close to a.
i i
i i
muldown
i i
2010/1/10
page 114
i i
Some of the importance of positive definite matrices can be seen from the
following remark.
Remark: A symmetric real matrix is positive definite if and only if its eigenvalues
are positive. (Result from linear algebra which we will take as known.)
f
(a) = 0, j = 1, . . . , n,
xj
and
Then f has
Before going further, we return to R2 and look at these ideas more concretely.
A 2 2 symmetric matrix being positive (negative) semidefinite means
a b x
Q(x, y) = [ x y ] = ax2 + 2bxy + cy 2 0 ( 0), (x, y) R2 .
b c y
i i
i i
muldown
i i
2010/1/10
page 115
i i
It is said to be positive (negative) definite if Q(x, y) > 0 (< 0) for all (x, y) 6= O.
For two by two matrices this expression reminds one of the quadratic formula, and
we have the following easily checked criterion.
For example, if ac b2 < 0 and a > 0, then Q(x, 0) > 0 for any x, but the choice
x0 = by/a gives Q(x0 , y) < 0 for any y > 0.
Before restating Theorem 4.38 for two variable, we introduce another com-
monly used notation for partial derivatives
f f f
fx := x fxy := yx fxx := xx
f f f
fy := y fyx := xy fyy := yy
fx (x0 , y0 ) = fy (x0 , y0 ) = 0,
and let
f (x, y) fxy (x, y)
A(x, y) = xx .
fyx (x, y) fyy (xy)
Then at the point (x0 , y0 ), the function f has
i i
i i
muldown
i i
2010/1/10
page 116
i i
11
00
00(c,f(c))
11 f(c)
G 01c f
f(U)
G(U)
U
n+1 n
R R
G(p)=(p,f(p)) R
G 01c f
11
00 f(U)
00 G(U)
11 U
(c,f(c)) n f(c)
n+1 R
R
G(p)=(p,f(p)) R
(c,f(c)) G
1
0 f
1 f(c)
0
G(U) 0c
1
U
n+1 n
R R
R
i i
i i
muldown
i i
2010/1/10
page 117
i i
Example 4.42 For the function f (x, y) = x2 + 4xy + y 2 , from the equations fx =
2x + 4y = 0 and fy = 4x + 2y = 0, we find that (x, y) = (0, 0) is the only stationary
point, but in this case
fxx (x, y) fxy (x, y) 2 4
= , satisfies 2 2 42 = 12 < 0
fyx (x, y) fyy (x, y) 4 2
so the matrix is indefinite. Hence, f has a saddle point at (0, 0). Alternately one
can see this from the observation that f (x, y) > 0 on the coordinate axes except at
the origin, but is negative on the line y = x when x > 0.
Example 4.43 For the function f (x, y) = 3x2 y 2 + x3 , from the equations
we see that the stationary points are (0, 0) and (2, 0). We observe
6 0
for (0, 0) is indefinite = saddle point
0 2
6 0
for (2, 0) is negative definite = relative maximum.
0 2
fx = 4x3 = 0, and fy = 4y 3 = 0,
Thus, we have no information from the theorem. However, 12x2 0, 12y 2 0 and
144x2 y 2 0 so that the matrix is positive semidefinite for all (x, y) and positive
definite if x 6= 0 and y 6= 0. Thus, the remainder in Taylors theorem R2 (x, y) 0
for all (x, y) and f has a minimum at (0, 0). Of course, one can easily observe that
f (x, y) > 0 when (x, y) 6= (0, 0) without resorting to second derivatives.
i i
i i
muldown
i i
2010/1/10
page 118
i i
fx = 4x3 = 0, and fy = 4y 3 = 0,
The last two examples illustrates the fact that anything can happen in the case
that fxx fyy (fxy )2 = 0 at a stationary point. Our next example looks at a function
on R3 . However, we need some more applicable criteria for the definitiveness of a
matrix.
The following criteria may be found, say, in the book by Gantmacher, Matrix
Theory.
(a) A is positive semidefinite if and only if the determinants of all the k k sub-
matrices of A symmetric about the main diagonal are non-negative ( 0).
(b) A is negative semidefinite if and only if the determinants of all the k k sub-
matrices of A symmetric about the main diagonal have sign (1)k or are zero.
(c) A is positive definite if and only if the determinants
a1,1 . . . a1,k
.. > 0, k = 1, 2, . . . , n.
det ... ... .
ak,1 . . . ak,k
(e) A is indefinite if and only if it satisfies none of (a), (b), (c), or (d).
The determinants in parts (c) and (d) are sometimes called the principal mi-
nors.
i i
i i
muldown
i i
2010/1/10
page 119
i i
Example 4.47 For the function f (x, y, z) = x2 + y 2 + z 2 + 2xyz, solving the equa-
tions
(0, 0, 0), (1, 1, 1), , (1, 1, 1), (1, 1, 1), (1, 1, 1).
At (0, 0, 0), A(0, 0, 0) = 2I33 which is positive definite; hence f has a relative
minimum at (0, 0, 0). At the other stationary points, |x| = |y| = |z| = 1 and
xyz = 1, so the principal minors of A satisfy
2 2z
det[2] > 0 det = 4 4z 2 = 0
2z 2
2 2z 2y
det 2z 2 2x = 8(1 x2 y 2 z 2 + 2xyz) < 0.
2y 2x 2
Thus, the matrix is indefinite at all the other critical points so that these are all
saddle points.
In the following exercises, assume all the differentiability you need unless oth-
erwise specified.
Exercises
4.32. Find Vx , Vy , Vz , Vu , Vxy , Vxyz , Vxyzu for the following functions V
x2 y 2 u x y z u xy
(i) , (ii) + + + + .
a2 z 2 y z u x zu
Solution for (i):
2xy 2 u 2x2 yu 2x2 y 2 uz x2 y 2
Vx = a2 z 2 Vy = a2 z 2 Vz = (a2 z 2 )2 Vu = a2 z 2
4xyu 8xyzu 8xyz
Vxy = a2 z 2 Vxyz = (a2 z 2 )2 Vxyzu = (a2 z 2 )2
i i
i i
muldown
i i
2010/1/10
page 120
i i
(u, v, w)
= 0.
(x, y, z)
Vx = P, Vy = Q, , Vz = R, for some 6= 0,
show that
P (Qz Ry ) + Q(Rx Pz ) + R(Py Qx ) = 0.
Q
4.38. Let P, Q : R2 7 R, with P, Q, P
y , x continuous.
i i
i i
muldown
i i
2010/1/10
page 121
i i
xu xv
(a) [ Vu Vv ] = [ Vx Vy ]
yu yv
Vuu Vuv xu yu Vxx Vxy xu xv
(b) =
Vvu Vvv xv yv Vyx Vyy yu yv
x xuv y yuv
+ Vx uu + Vy uu .
xvu xvv yvu yvv
(ii) If U = U (x, y), V = V (x, y), x = x(u, v), y = y(u, v), show that
(U, V ) 1 (U, V )
= .
(x, y) r (r, )
2 V + V + V = r2 Vrr + rVr + V .
Vuu + Vvv
= x2u + x2v = yu2 + yv2 .
Vxx + Vyy
i i
i i
muldown
i i
2010/1/10
page 122
i i
ztt c2 zxx = 0.
Show that
Z x+ct
1 1
z(x, t) = {f (x + ct) + f (x ct)} + g
2 2c xct
[Hint: If you are observant you dont have to do all that differentiation.]
4.51. Discuss the nature of the stationary points of the functions
(a) 34x2 24xy + 41y 2 . [Solution: (0, 0) is a minimum.]
(b) 3x2 + 4xy 4y 2 . [Solution: (0, 0) is a saddle.]
(c) x2 y 4x2 y 2 . [Solution: (0, 0) is a max; (2 2, 4) are saddles.]
(d) x2 y + 2x2 2xy + 3y 2 4x + 7y. [Solution: (1, 1) is a min; (1 6, 2)
are saddles.]
(e) x2 yz 2xyz + x2 z + x2 + 2 2
y + z + yz 2xz 2x + 2y + z. [Solution:
(1, 1, 0) is a min; (1 2, 2, 1) and (1 2, 0, 1) are saddles.]
i i
i i
muldown
i i
2010/1/10
page 123
i i
which are easily solved for A and B. The line y = Ax + B is the line which
best fits the given set of points in the sense of least squares.
4.55. Let {k } be a sequence of real-valued continuous functions on [a, b] such that
Z b
1, if k = m;
k m =
a 0, if k 6= m,
Let f be a real-valued continuous function on [a, b]. Prove that the choice of
constants 1 , . . . , which minimizes the quantity
Z b
!2
X
f k k ,
a k=1
for a given is
Z b
k = f k , k = 1, . . . , .
a
This problem arises in the theory of Fourier Series.
4.56. The capacity of a condenser formed by two concentric spherical conductors
of radii a and b, 0 < a < b, is
ab
C(a, b) = .
ba
i i
i i
muldown
i i
2010/1/10
page 124
i i
Suppose that in measuring the radii of a and b of the spheres the measure-
ments are subject to errors of amount a and b respectively. Given that
products of the errors are negligible relative to the other quantities involved
derive the following approximation on the error in C:
C a b b a
.
C a ba b ba
4.57. The breaking weight W of a cantilever beam is given by the formula W =
kbd2 where b is breadth, is length, d is depth, and k is a constant depending
on the material of the beam. If the breadth is increased by 2% and the depth
by 5%, show that the length should be increased by about 12% if the breaking
weight is to remain unchanged.
4.58. In a triangle ABC, the area is calculated from the elements a, B, C, the
measurements being subject to errors a, B, C. Show that the error S in
the area is approximately given by
S a cB bC
2 + + .
S a a sin(B) a sin C
i i
i i
muldown
i i
2010/1/10
page 125
i i
Example 4.51 This example shows that in the preceding theorem, f need not be
globally one-to-one. Take f (x, y) = (ex cos(y), ex sin(y)). Then
x
e cos(y) ex sin(y)
Jf (x, y) = x = e2x 6= 0, for all (x, y) R2 .
e sin(y) ex cos(y)
i i
i i
muldown
i i
2010/1/10
page 126
i i
y
v=u tan(k)
2 f
y=k u
x=c x
u 2 + v 2= e 2
Proof. In order to prove f (D) is open, we must show that for any q0 f (D),
there is a 0 = (q0 ) > 0 such that B(q0 , 0 ) f (D). Let q0 = f (p0 ) for some
p0 D. Since D is open, we may choose 0 > 0 so that both
i i
i i
muldown
i i
2010/1/10
page 127
i i
C
f
00
11
p0
1
0
q0 f(C)
Figure 4.14. The point q0 is away from the image of the boundary.
But the last line can be viewed as a homogeneous system of linear equations with
coefficient matrix having determinant Jf (p ) 6= 0. Consequently, we must have
n o
fi (p ) = qi , i = 1, . . . , n = f (p ) = q = q f (D).
Remark: The condition that Jf (p) 6= 0 cannot be dropped even at a single point
in D. Indeed, take D = R and f (x) = x2 . Then f (R) = [0, ) is not open even
though f (x) 6= 0 except at the single point x = 0.
Proof. Given c D, we must prove that f (c) belongs to the interior of f (D). By
relabeling the x s if necessary, we may assume that
(f1 , . . . , fm )
(c) 6= 0.
(x1 , . . . , xm )
Since f C 1 (D), there is an open neighborhood U of c such that
(f1 , . . . , fm )
(p) 6= 0, p U.
(x1 , . . . , xm )
i i
i i
muldown
i i
2010/1/10
page 128
i i
by
f(x1 , . . . , xm ) = f (x1 , . . . , xm , cm+1 , . . . , cm ).
Now the image of the sequence, {f 1 (qk )} is a sequence in the compact set S, hence
there is a subsequence {f 1 (qkj )} such that
i i
i i
muldown
i i
2010/1/10
page 129
i i
p U = |f (p) f (p0 )| m |p p0 | .
But then
1
Since Jf (p0 ) 6= 0, the inverse matrix Df (p0 ) exists and from (4.22),
1 1
|p p0 | Df (p0 ) g(p) = Df (p0 ) f (p) f (p0 ) Df (p0 )(p p0 )
1
= Df (p0 ) f (p) f (p0 ) (p p0 ).
(4.24)
i i
i i
muldown
i i
2010/1/10
page 130
i i
Example 4.58 We have seen that f (x, y) = (ex cos(y), ex sin(y)) is one-to-one on
the strip 0 y < 2. Recall
x
e cos(y) ex sin(y)
det f (x, y) = det x = e2x 6= 0,
e sin(y) ex cos(y)
x
1 e cos(y) ex sin(y)
= [f (x, y)] = e2x .
ex sin(y) ex cos(y)
To find f 1 , solve (u, v) = (ex cos(y), ex sin(y)) for x and y:
v
u2 + v 2 = e2x , tan(y) =
u
1 v
= x= log(u2 + v 2 ), y = arctan
2
u
1 v
= f 1 (u, v) = log(u2 + v 2 ), arctan .
2 u
and
u v
1 u2 +v 2 u2 +v 2 1 u v
f (u, v) = = 2
v u u + v2 v u
u2 +v 2 u2 +v 2
ex cos(y) ex sin(y)
= e2x .
ex sin(y) ex cos(y)
i i
i i
muldown
i i
2010/1/10
page 131
i i
1
Thus, we see that Df 1 (u, v) = Df (x, y) .
Exercises
4.60. In Exercise 29 a sufficient condition was given for f : Rn 7 Rn to be globally
one-to-one. Verify that this condition is not satisfied by Example 4.51. (If
you did not do Exercise 29, a proof is given in Lemma 4.49.)
4.61. Show that if f : R 7 R and f (x) 6= 0 for each x R, then f is one-to-one
globally on R.
4.62. Let f : R 7 R. Show that if f is continuous and one-to-one on a connected
subset S of R, then f 1 is continuous on f (S).
4.63. If y = y(x), that is yi = yi (x1 , . . . , xn ), for i = 1, . . . , n, be C 1 . Show that
4.64. Let f (x, y) = (x2 , y/x) when x > 0. Find f (x, y). Show that f is one-to-one
1
on its domain by finding f 1 . Check that f = f 1 .
4.65. Let !
x y
f (x, y) = p ,p .
x2 + y 2 x2 + y 2
Show that Jf (x, y) = 0 for all (x, y) R2 \O and f is not locally one-to-one
anywhere on its domain. Show that the range of f is the circle u2 + v 2 = 1
and thus contains no open subset.
fi (x1 , . . . , xm , y1 , . . . , yn ) = 0, i = 1, 2, . . . , n (4.26)
yi = i (x1 , . . . , xm ), i = 1, 2, . . . , n? (4.27)
Of course, the way the question was originally stated, part of the problem is to
find which are the independent variables and which are the dependent variables. The
equations will not be written for you with the independent variable so conveniently
i i
i i
muldown
i i
2010/1/10
page 132
i i
labelled. However, since we can always rearrange variables (once we identify them),
for the sake of the statement of the Theorem, we assume the given order.
If the equations (4.26) are linear in the variables yi , then we know we have a
solution if the coefficient matrix is nonsingular. But observe, the coefficient matrix
of linear functions is nothing more than the matrix of partial derivatives for those
functions with respect to the linear variables. This suggests that the answer lies in
whether or not the Jacobian of functions with respect to the dependent variables
in nonzero.
and
(f1 , . . . , fn )
i yi (y1 , . . . , yi1 , xj , yi+1 , . . . , yn ) i = 1, . . . , n,
= = , (4.31)
xj xj (f1 , . . . , fn ) j = 1, . . . , m.
(y1 , . . . , yn )
i i
i i
muldown
i i
2010/1/10
page 133
i i
i i
i i
muldown
i i
2010/1/10
page 134
i i
i i
i i
muldown
i i
2010/1/10
page 135
i i
You will see from the simple exercises for this section that when the basic
condition on the Jacobian in (4.28) is not satisfied there may be no solution, or no
C 1 solution, or indeed infinitely many solutions.
Proof of Theorem 4.59. Consider the function F : Rn+m 7 Rn+m defined by
(p 0,0)
and thus,
O = f (p, (p, O))
for all p in a neighborhood U of p0 . Therefore,
i i
i i
muldown
i i
2010/1/10
page 136
i i
which gives (4.30). Viewing that as a linear system of equations for unknowns
k /xj , k = 1, . . . , n, Cramers Rule gives (4.31)
Remark: The function is the unique solution to f (p, (p)) = O with (p0 ) = q0
since if to the contrary we had two solutions
f (p, 1 (p)) = f (p, 2 (p)) = O and 1 (p) 6= 2 (p) for some p U , then
F (p, 1 (p)) = F (p, 2 (p)) = (p, O),
contradicting the fact that F is one-to-one on W . This means that the graph of ,
the set {p, (p)) : p U } is the whole set F 1 (O) W .
i i
i i
muldown
i i
2010/1/10
page 137
i i
Exercises
4.66. Prove Corollary 4.64. That is, work through the proof of Theorem 4.59 in
this special case.
4.67. The equation y 2 x2 = 0 has two C 1 solutions, y = x in a neighborhood of
x = 0; this shows uniqueness may not hold. What condition of the Implicit
Function Theorem does not hold? Check that there are four solutions of class
C and infinitely many real-valued solutions.
4.68. Show that the equations
x2 yu = 0, u = u0 , v = v0 , xy + uv = 0,
i i
i i
muldown
i i
2010/1/10
page 138
i i
(1 , . . . , n ) (u1 , . . . , un ) (1 , . . . , n )
= (1)n .
(u1 , . . . , un ) (x1 , . . . , xn ) (x1 , . . . , xn )
(u, v) uv(u + v) x
= .
(x, y) (u v)[(u + v) + y(1 x)]
4.72. If u1 = x1 + x2 + x3 + x4 , u1 u2 = x2 + x3 + x4 , u1 u2 u3 = x3 + x4 , and
u1 u2 u3 u4 = x4 , show that
(x1 , x2 , x3 , x4 )
= u31 u22 u3 .
(u1 , u2 , u3 , u4 )
V 2x(x2 y 2 )
3 = + 8y(x2 + y 2 ).
y (x2 + y 2 )2
F (F (x, y), y) = 0
i i
i i
muldown
i i
2010/1/10
page 139
i i
4.5.1 Dimension
We have a notion of dimension for linear objects, specifically for vectors spaces,
namely, the number of vectors in a basis. If L : Rk 7 Rn is linear, L(x) = Ax, then
the dimension of L(Rk ) Rn is the rank of A. In particular, if rank A = k, then
L(Rk ) Rn is k-dimensional, the same dimension as Rk .
i i
i i
muldown
i i
2010/1/10
page 140
i i
1
0 0
1
1
0 V 0
1 V2 0
1
0
1
0 1
1 0
1 0
1
0
1
0
1 0
1 0
1
0
1 0
1 0
1
1111111
0000000
0
1 0
1
111111
000000
0
1 0
1
11111111
00000000
0
1 0
1 0
1
0
0
1 0
1 1
0
1
0
1
0
1 0
1 0 S
1
0
1 0
1
0
1 0
1
0S
1 0
1
1 S
2
Figure 4.16. The unit circle in Example 4.69 as the union of two 1-
segmants S = f (, ) f (0, 2) .
The set S := {(x, y) : x2 + y 2 = 1} is a 1-dimensional manifold in R2 . To see this
consider the map f (t) = (cos(t), sin(t)), t R. Then
sin(t)
rank f (t) = rank = 1.
cos(t)
Thus, f is locally one-to-one and is in fact one-to-one on any open interval of length
2. Then
S = f (, ) f (0, 2) .
11
00
00(x,y,z)
11
y
x
One can check that f C 1 (2, 2), that f is one-to-one, and that rank f = 1
through out the interval. In particular, take special care to check that there is no
trouble at (0, 0).
Example 4.72 The n-dimensional manifolds in Rn are precisely the open sets;
that is, S is an n-dimensional manifold in Rn if and only if S is open. This follows
from the definition and Theorem 4.53.
i i
i i
muldown
i i
2010/1/10
page 141
i i
of the segment (the variable of f being called the parameter). A local parameter-
ization of a manifold is essentially a local coordinate system for the manifold; for
example, in Example 4.70,
z
/2 = 0
f
= /2
= 0
y
0
= /2
x
i i
i i
muldown
i i
2010/1/10
page 142
i i
fi (x1 , . . . , xn ) = 0, i = 1, . . . , k
h i
fi
represents an (n k)-dimensional manifold in Rn if the matrix xj has rank k.
fi (x1 , . . . , xk , xk+1 , . . . , xn ) = 0, i = 1, . . . , k,
Remark: The dimension of the range of f (that is, the number of equations fi = 0)
is immaterial here. If rank f (p) = k on some open set D f 1 (O), then f 1 (O)
is an (n k)-dimensional manifold no matter how many equations there are. You
are no doubt familiar with the corresponding statement for linear functions: If
L : Rn 7 Rm is linear, L(x) = Ax, for a matrix A with rank A = k, then L1 (O)
is a vector space of dimension n k. The number n k is called the nullity of the
matrix A.
i i
i i
muldown
i i
2010/1/10
page 143
i i
Proof. This time take, F (x, y, z) = f (x, y) z. Then the graph is F 1 (0),
F (x, y, z) = [ fx fy 1 ] , and rank F = 1.
Example 4.77 Let f (x, y, z) = x2 + y 2 z 2 . Then f (x, y, z) = [ 2x 2y 2z ].
Thus, f (x, y, z) = 0 is a 2-dimensional manifold (smooth surface) in R3 if the origin
is omitted.
f=0
0 y
000
11 f=0
i i
i i
muldown
i i
2010/1/10
page 144
i i
Question: Let
a(x) b(x) c(x)
A(x) = .
d(x) e(x) f (x)
Does rank A(x0 ) = 2 imply that rank A(x) = 2 near x0 if the entries in A are
continuous? Does rank A(x0 ) = 1 imply rank A(x) = 1 near x0 ?
This theorem is analogous to the statement that the dimension of the range
of any linear function L(x) = Ax is the rank of A. The proof of this theorem is
notationally quite complicated so we will consider a few examples and special cases
first. Actually, all the essential ideas of the proof are contained in the special cases
so you may skip the proof if you wish.
Then
1 1 0
rank f (x, y, z) = rank 0 1 1 = 2.
2(x y) 2(z x) 2(y z)
Here f (R3 ) is the 2-dimensional segment {(u, v, u2 v 2 ) : (u, v) R2 } (a hyperboloid
w = u2 v 2 ) in R3 . We used (u, v) = (x y, y z) to parameterize the surface.
i i
i i
muldown
i i
2010/1/10
page 145
i i
(f1 , f2 )
(x0 , y0 , z0 ) 6= 0 = u = f1 (x, y, z) and v = f2 (x, y, z),
(x, y)
hence
w = f3 (1 (u, v, z), 2 (u, v, z), z).
We observe that this last function is actually independent of z:
w f3 1 f3 2 f3
= + +
z x z y z z
i i
i i
muldown
i i
2010/1/10
page 146
i i
Therefore,
w = (u, v) and f (U0 ) = {(u, v, (u, v)) : (u, v) (f1 , f2 )(U0 )},
for some open neighborhood U0 of (x0 , y0 , z0 ).
i i
i i
muldown
i i
2010/1/10
page 147
i i
Substituting these into the Chain Rule applied to equations from (4.37), using the
effect of interchanging rows in determinants, and using the expansion of a determi-
nant about row i, we obtain
k (f1 ,...,fk )
ui X f
=
i (x
1
,...,x ,x
1 k+1 +1 ,x ,...,x k )
+ fi
xk+1 x (f1 ,...,fk ) xk+1
=1 (x1 ,...,xk )
k+1 (f1 ,...,fk )
X f i (x ,...,x ,x ,...,x ,x )
= (1)k+1 1 1 +1 k k+1
x (f1 ,...,fk )
=1 (x1 ,...,xk )
[ I(nk)(nk) ]
i i
i i
muldown
i i
2010/1/10
page 148
i i
Xk
f gj
(p0 ) + j (p0 ) = 0, i = 1, . . . , n.
xi j=1
xi
Example 4.86 Find the distance d from the point (0, 0) to the line x + y = 1. The
problem is to minimize x2 + y 2 subject to the constrain x + y 1 = 0. We note that
a minimum exists since x2 + y 2 goes to infinity as |(x, y)| and that x2 + y 2 is
non-negative. At the minimum value, the Lagrange Multiplier Rule says that there
is a such that
2x + = 0, 2y + = 0, with x + y 1 = 0.
But,
fxx fxy 4 0
= is indefinite,
fyx fyy 0 6
so f has a saddle point at ( 21 , 0). Therefore, the maximum and minimum occur in
the set
{(x, y) : x2 + y 2 1 = 0}.
The Lagrange Multiplier Rule then gives the existence of a such that at the
extrema
4x 2 + 2x = 0
6y + 2y = 0
x2 + y 2 1 = 0.
i i
i i
muldown
i i
2010/1/10
page 149
i i
i i
i i
muldown
i i
2010/1/10
page 150
i i
k
R
F(U)
U F
p
0
(f(p ),0)
0 R
S: g(p)=0
n
R
gj (p0 ) = 0, j = 1, . . . , k,
serve to determine the (n + k) numbers which are the coordinates of p0 and the
1 , . . . , k .
Proof of 2 of Theorem 4.85. This proof works in general but we prove just the
case of one constraint. Suppose f, g : Rn 7 R are C 1 functions and f (p), p =
(x1 , . . . , xn ), has an extremum at c = (1 , . . . , n ), with respect to the constraint
g(p) = 0. (4.39)
If rank g (c) = 1, then we may assume that (g/x1 )(c) 6= 0. Then, equation (4.39)
may be solved for x1 in a neighborhood of (2 , . . . , n ) in the form
Exercises
4.77. Let f (x, y) = (x + y, 2x + ay) = (u, v).
i i
i i
muldown
i i
2010/1/10
page 151
i i
K := {(x, y) : 0 x 1, 0 y 1}
yi = i (x1 , . . . , xn ), i = 1, . . . , k
i i
i i
muldown
i i
2010/1/10
page 152
i i
=0
=0
2 m2 n2
+ + = 0.
1 aV 1 bV 1 cV
4.90. Show that the following inequalities are true.
1
(a) (x1 x2 xn )2 nn if x21 + x22 + . . . + x2n = 1.
(b) (Arithmetic-geometric mean inequality)
1
(a1 a2 an )1/n (a1 + a2 + . . . + an ), ai 0, i = 1, . . . , n.
n
show that when the distance between point (, ) and (, ) lying on the
respective curves has an extremum, then
fx (, ) gx (, )
= = .
fy (, ) gy (, )
Use this to find the shortest distance between the ellipse x2 +2xy+5y 2 16y =
0 and the line x + y 8 = 0.
i i
i i
muldown
i i
2010/1/10
page 153
i i
4.92. Let f be a real valued function of class C 1 on R3 . Prove that there are at least
two points on the sphere x2 + y 2 + z 2 = R2 , R > 0, at which the equations
f f
y x =0
x y
f f
z y =0
y z
f f
x z =0
z x
are satisfied.
1 1
4.93. (The Holder and Minkowski inequalities.) Let p > 1, q > 1 and p + q = 1.
q
xp y
(a) Show that the minimum of f (x, y) = p + q subject to the constraints
x > 0, y > 0, and xy = 1 is 1.
ap bq
(b) Show that a 0, b 0 implies ab p + q .
(c) Show that ak 0, bk 0, k = 1, . . . , n implies
n n
!1/p n
!1/q
X X X
ak b k apk bqk (Holders Inequality).
k=1 k=1 k=1
Pn 1/p Pn 1/q
[Hint: Let A = ( k=1 apk ) and B = ( k=1 bqk ) and consider a =
ak /A, b = bk /B.]
(d) If ak , bk , k = 1, . . . , n are real numbers and p 1, then
n
!1/p n
!1/p n
!1/p
X X X
p p p
|ak + bk | |ak | + |bk | .
k=1 k=1 k=1
i i
i i
muldown
i i
2010/1/10
page 154
i i
i i
i i
muldown
i i
2010/1/10
page 155
i i
Chapter 5
Further Topics in
Integration
We shall not prove this statement. You should prove it yourself as an exercise; it
is obviously true for n = 1, use induction on n. But first prove it for n = 2 to see
how the general proof should go.
155
i i
i i
muldown
i i
2010/1/10
page 156
i i
Q
Recall the concepts of an interval in Rn , I = nj=1 [aj , bj ]; the Jordan content
Q n
of an interval in Rn , (I) = j=1 (bj aj ); and the diameter of an interval I in Rn ,
P 1/2
n 2
(I) = j=1 (bj aj ) . A special case of an interval is an n-cube
Our immediate goal is to look at sets with content and to investigate what
happens to the content after mapping by nice functions. We begin with investi-
gating the content of intervals under linear functions, and get progressively more
sophisticated. This is background for change of variables in integration.
((I)) = |J |(I).
Proof. The matrices Aj are the Jacobian matrices for the linear functions Lj ,
j = 1, 2, 3, respectively. It is easy to determine what these matrices do to intervals
I. Only the linear function L1 changes the content by changing the length of one
side.
Indeed, both L1 (I) and L3 (I) are again intervals. For L1 (I), the k-th interval
becomes either [aak , abk ] (a > 0), or [abk , aak ] (a < 0), and the content is (L1 (I)) =
|a|(I). For L3 (I), the intervals of the k-th and j-th coordinates are swapped, and
thus, the overall content, namely the product of the lengths of the intervals, remains
unchanged.
Since L2 (I) is no longer an interval, we need to use the theory developed in
Chapter 3. In passing from I to L2 (I) only the description of the k-th coordinate
changes
n
L2 (I) = p = (x1 , . . . , xn ) : am xm bm , m = 1, . . . , k 1, k + 1, . . . , n,
o
ak + xj xk bk + xj .
i i
i i
muldown
i i
2010/1/10
page 157
i i
Thus, the Lemma holds for the elementary linear functions. Since any linear
function is a composition of finitely many elementary linear functions, the differen-
tial is the product of the differentials, and hence the Jacobian is the product of the
Jacobians, the general case follows.
The next three lemmas and Theorem 5.7 that follows are technical in nature.
In view of the last lemma, you will find it easy to accept Theorem 5.7, so on your
first reading, you may proceed directly to the important change of variables theorem,
Theorem 5.8; however, read the statement of Theorem 5.7 first.
|D(p)(u)| M |u|, p m
j=1 Ij , u Rn .
Consequently,
n
X n
(D) nj=1 Kj and ((D) (Kj ) 2 nM .
j=1
i i
i i
muldown
i i
2010/1/10
page 158
i i
If D is a compact subset of G and D has content, then (D) is a compact set with
content.
Proof. We know that C(D) and D compact implies (D) is compact (Theorem
2.36). In particular, D D implies that the boundary of (D) is contained in
(D). By Theorem 4.50 and Theorem 4.53, is locally one-to-one on G and maps
open sets onto open sets. Therefore,
both (G) and (G)\(D) are open sets.
Now, if (p) = q is on the boundary of (D), then each neighborhood V of q
contains a point q1 (G)\(D), which implies that each neighborhood U of p
contains a point in G\D. Therefore, p is in the boundary of D and we have shown
((D)) ((D)). (5.1)
The set D has content if and only if the content of its boundary is zero (The-
orem 3.14). Therefore
(D) = 0 = (((D))) = 0, Lemma 5.3
= (((D))) = 0, by (5.1)
= (D) has content. (Theorem 3.14)
Proof. From the proof of Lemma 5.4, (5.1), we know that ((K)) ((K)). If
p = (x1 , . . . , xn ) (K), then r |p| n r, and consequently,
|(p) p| < |p| n r
= r(1 n) |xj | n r |j (p)| |xj | + n r r(1 + n),
|{z}
some j
where the left most inequality is true for at least
one j. Therefore, the boundary
of (K) lies outside an
n-cube of side 2(1 n)r with center O and inside an
n-cube of side 2(1 + n)r with center O. Since (K) = (2r)n , this implies
n ((K)) n
1 n < < 1+ n .
(K)
i i
i i
muldown
i i
2010/1/10
page 159
i i
(K) K
Figure 5.1. The cube K is mapped toa region (K) whose boundary is
captured between two cubes of side lengths 1 n and 1 + n.
Lemma 5.6. Suppose that L : Rn 7 Rn is a linear map of full rank, and that
: Gopen Rn 7 Rn is in C 1 (G) and J (p) 6= 0 for all p G. Then for any
interval I G,
(L((I))) = |JL |((I)).
Proof. By Lemma 5.4, (I) has content. Therefore, for any interval K with (I)
K, Z
((I)) = (I) .
K
By the Cauchy Criterion for integrals, Corollary 3.7, and by the proper choice
of Riemann sums, given any > 0, there are finitely many intervals with non-
overlapping interior, I1 , . . . , Im , Im+1 , . . . , IN such that
m N
j=1 Ij (I) j=1 Ij
and
m
X N
X
((I)) (Ij ) ((I)) (Ij ) ((I)) + .
j=1 j=1
Then
L m N
j=1 Ij L ((I)) L j=1 Ij
i i
i i
muldown
i i
2010/1/10
page 160
i i
1
Proof. Since J (p) 6= 0, (D(p)) =: Lp exists and is linear. Moreover, det[Lp ] =
1/J (p). Since D is compact and C 1 (D), the partial derivatives of are
uniformly continuous on D and we can deduce the following:
(a) There is an M > 0 such that
|Lp (u)| M |u|, p D and u Rn . (Why?)
i i
i i
muldown
i i
2010/1/10
page 161
i i
Proof. The set (D) has content by Lemma 5.4. Without loss of generality, we
may assume both f 0 and J 0 (WHY?). By Theorem 3.12 both of the
intergrals in the theorem exist. Therefore, it only remains to show that they are
equal.
For any , 1 > > 0, we may choose a partition on D consisting of n-cubes
Kj , j = 1, . . . , m, which are sufficiently small so that
R Pm
(a) D (f )J j=1 (f )(qj )J (pj )(Kj ) < , for any qj Kj and pj
being the center of Kj ;
((Kj ))
(b) J (pj )(1 )n J (pj )(1 + )n ,
(Kj )
(item (a) by the definition of the integral and the continuity of J (p), and item (b)
by Theorem 5.7).
D (D)
i i
i i
muldown
i i
2010/1/10
page 162
i i
m
X
= f (pj )((Kj )), pj (Kj ) (Theorem 3.15(e))
j=1
Xm
= f ((qj ))((Kj )), where qj Kj .
j=1
Remarks: Strictly speaking, the use of the Mean Value Theorem for Integrals,
Theorem 3.15(e), is not applicable to those Kj which intersect the boundary of D.
But this presents no problem (why ?).
You will notice that Theorem 5.8 is much more restrictive than the corre-
sponding result in R1 , Corollary 3.19, which states that
Z (b) Z b
f= (f )
(a) a
without the restrictions that 6= 0 and being one-to-one. These restrictions are
necessary in Rn because of considerations derived from the notion of orientation
which will be discussed later.
i i
i i
muldown
i i
2010/1/10
page 163
i i
y v
R
r
x
u
r R
Figure 5.3. (a) The regions D and (D) for Example 5.9
9
8
8
6
7
4
6
2
5
0
4
2
3
4
2
6
1
8
0
5 0 5 0 2 4 6
Figure 5.4. The region D in the (, r)-plane and its image, the inside of
a cardioid in the (x, y)-plane for Example 5.10.
(x, y) cos() r sin()
(x, y) = (r, ) = (r cos(), r sin()), J = = = r.
(r, ) sin() r cos()
i i
i i
muldown
i i
2010/1/10
page 164
i i
Therefore,
Z Z Z 2 Z a+b cos()
(x, y)
dx dy = dr d = r dr d
(D) D (r, ) 0 0
Z Z
1 2 1 2 2
= (a + b cos())2 d = (a + 2ab cos() + b2 cos2 ()) d
2 0 2 0
Z
1 2 1 1
= a2 + b2 (1 + cos(2)) d = a2 + b2 .
2 0 2 2
Here again J = 0 on the -axis (r R= 0) andR is not one-to-one on the lines
= 0, = 2, and r = 0. However, (D) and D both exist and Theorem 5.8 is
applicable to regions which difffer from (D) and D respectively by regions which
have arbitrarily small content.
Example 5.11 The function (x, y) = (u, v) = (u2/3 v 1/3 , u1/3 v 2/3 ) maps the tri-
angle D := {(u, v) : 0 u, 0 v, u + v 1} onto the area bounded by the loop
of the curve x3 + y 3 = xy. Observe that J = 31 and that maps the u and v axes
1 1.5
1
0.5
0.5
0
0
0.5 0.5
0.5 0 0.5 1 0.5 0 0.5 1 1.5
Figure 5.5. The triangle D in the (u, v) plane is transformed to the inside
of the loop of the curve x3 + y 3 = xy in Example 5.11
For example, if f (x, y) = xy, then (f )(u, v) = (u2/3 v 1/3 )(u1/3 v 2/3 ) = uv; hence
Z Z Z Z
1 1 1v 1 1 1
f= uv du dv = (1 v)2 dv = .
(D) 3 0 0 6 0 72
Why is this valid? is not C 1 at (0, 0) and is not one-to-one on the u and v axes.
The first three examples illustrated how the change of variable formula may
be used to simplify the region of integration. It may also be used to simplify the
integrand, which was its basic role in one-dimension.
i i
i i
muldown
i i
2010/1/10
page 165
i i
y v
x+y=1 D
u=v u=v
D*
x
0 u
0
Figure 5.6. The tip down triangle, D, in the (u, v)-plane is transformed
to the right triangle D in Example 5.12.
D = 1 D , D = (D),
u+v vu
(x, y) = , = (u, v),
2 2
1 1
(x, y) 2 1
J = = 1 12 = .
(u, v) 2 2 2
Alternatively,
(u, v) 1 1 1
J1 = = = 2 = J = ,
(x, y) 1 1 2
so in fact it was not necessary to find . Then
Z Z u 1 Z 1 Z v u
xy 1
exp dx dy = exp du, dv = exp du, dv
D x+y D v 2 2 0 v v
Z Z 1
1 1 u/v u=v 1 1
= ve dv = v e dv
2 0 u=v 2 0 e
1 1 1
= e = sinh(1).
4 e 2
See also the examples in Buck pp 306-311 and the exercises on p311-313. The
particular exercises 10-12 indicate a proof of Theorem 5.8 based on the Implicit
Function Theorem when n = 2.
i i
i i
muldown
i i
2010/1/10
page 166
i i
Exercises
x2 y2
5.1. For f1 (x, y) = a2 + b2 and f2 (x, y) = x2 + y 2 , find
Z Z
x2 y2
f1 and f2 where D = (x, y) : 2 + 2 1 .
D D a b
1
f (x, y) =
(1 + x2 + y 2 )2
i i
i i
muldown
i i
2010/1/10
page 167
i i
(x2 + y 2 )2 (x2 y 2 ) = 0;
and (ii) D2 is the triangle with vertices (0, 0), (2, 0), (1, 3). [Solution: 4 12
and 23 arctan(1/2).]
5.7. Show that
Z
a 2 b 2 c2 x2 y2 z2
|xyz| dx dy dz = where K := (x, y, x) : 2 + 2 + 2 1 .
K 6 a b c
(ii) Show R
H = 14 ab(Aa2 + Bb2 + 4(x0 , y0 )) where
n 2 2
o
H := (x, y, ) : (xx
a2
0)
+ (yy
b2
0)
1 .
(iii) Show
R 57
S
(2x2 + y 2 + 3x 2y + 4) dx dy = 2 where
2 2
S is inside x + 4y 2x + 8y + 1 = 0.
x2 y2 z
2
+ 2 =2 ,
a b c
show that the volume of the solid bounded by these two surfaces is
ab 2 2
(a + b2 m2 + 2pcn)2 .
4c3 n4
5.9. Find
Z
log(x2 + y 2 ) dx dy where D := {(x, y) : b2 (x2 + y 2 ) a2 }
D
i i
i i
muldown
i i
2010/1/10
page 168
i i
1 um+n
2 ) v12 mm+n , if m 6= n and
|m((v1 )(v1 ) (v2 )(v2 )) log(u1 /u2 )|, if m = n.
What conditions must and satisfy if this statement is correct? Prove the
statement under your conditions.
R
5.14. Find D xyz dx dy dz where D is the region in the first octant bounded by
the cylinder x2 + y 2 = 16 and the plane z = 3, that is,
D = (x, y, z) : 0 x, 0 y, 0 z 3, x2 + y 2 16 .
5.15. Let G be the region in the first quadrant which R is bounded by the curves
xy = 1, xy = 3, x2 y 2 = 1, x2 y 2 = 4. Find G F where F (x, y) = xy.
5.16. Show Z
1
= pq(ap2 + bq 2 ) + 2pq(x0 , y0 )
D 3
where D is the region bounded by the four straight lines
x x0 y y0
= 1 (p.q > 0)
p q
i i
i i
muldown
i i
2010/1/10
page 169
i i
Z 1 Z 1 Z 1 Z 1
X n+7 dX Y 5 (1 Y ) dY Z 3 (1 Z) dZ U (1 U ) dU =
0 0 0 0
1
= .
(n + 8)7!
R1 R1R1
5.18. Which is larger 0
xx dx or 0 0
(xy)xy dx dy?
0 a 1
Figure 5.7. The function on the right has graph that are equilateral tri-
angles with base on [1/(k + 1), 1/k], k = 1, . . ., alternating up and down. Since
the height is one, the length of the graph of each triangle is greater than 2. Hence,
the total length cannot be finite. The graph on the left is piecewise smooth and
intuitively does have lenght.
segments piecewise.
i i
i i
muldown
i i
2010/1/10
page 170
i i
(iii) Two smooth curves and on U and U respectively are called parametrically
equivalent if there is a function f : U 7 U , f C 1 (U ), such that f (t) > 0
and (t) = (f (t)) for all t U .
(iv) (U ) is the trace of the curve in Rn .
(a) is continuous on U
(b) Each compact subinterval of U is the union of a finite number of intervals
I such that : I 7 Rn is a smooth curve and | | is Riemann integrable
on I.
We shall refer informally to the curve and its trace as the curve. We will
see that this is unambiguous in the integration theory for parametrically equivalent
curves. It is perhaps even more precise to consider curves as equivalence classes
of functions : R 7 Rn , the equivalence relation being parametric equivalence.
However, you might consider this idea of a curve as too eccentric on your first
encounter.
i i
i i
muldown
i i
2010/1/10
page 171
i i
p + (a,b,c)
o
po
b
a
Figure 5.8. The line through p0 = (x0 , y0 , z0 ) in the direction (a, b, c).
0 = (a, b, c) (, , ) = a + b + c.
:(a,b,c)
(x,y,z)
p
o
i i
i i
muldown
i i
2010/1/10
page 172
i i
(t,t 2 )
x
Figure 5.10. A parametrization og the parabola in Example 5.15.
(cos(t),sin(t),t)
We know that if C 1 , then (t0 ) + D(t0 )(t t0 ) is the best affine approx-
imation to (t) near t0 . This motivates the following definition.
i i
i i
muldown
i i
2010/1/10
page 173
i i
Remark: Two parametrically equivalent curves have the same direction. That is,
if (t) = (f (t)) with f > 0, then
(t) (f (t))f (t) (f (t))
=
= .
| (t)| | (f (t))f (t)| | (f (t))|
i i
i i
muldown
i i
2010/1/10
page 174
i i
(ii) The curve is piecewise smooth on U if for any [a, b] there are nonoverlapping
intervals of [a, b], [ai , bi ], such that is smooth on (ai , bi ). In this case.
X
(([a, b])) = (([ai , bi ])).
i
NOTE: (([0, 4])) = 2(([0, 2])) even though these 2 curves have the same
trace.
Motivation and Remarks: By way of motivation for the definition of the length
of a curve, we offer the following:
(a) The length of the line segment p(t) = p0 + q0 t for t [t1 , t2 ] is
Z t2
|p(t2 ) p( t1 )| = |q0 (t2 t1 )| = |p (t)| dt.
t1
(b) Now consider the segment [a, b] and a partition (tk )m k=0 of [a, b]. Suppose
k [tk1 , tk ], k = 1, . . . , m. One expects the sum of the lengths of the
line segment p(t) = (k ) + (k )(t k ), t [tk1 , tk ], k = 1, . . . , m, to
approximate the length of [a, b] as closely as one pleases provided only that
the partition is fine enough. This sum is
m
X
| (k )|(tk tk1 ),
k=1
Rb
which is a Riemann sum for a
| (t)| dt.
i i
i i
muldown
i i
2010/1/10
page 175
i i
It is not difficult to see by the Mean Value Theorem, that this is equivalent to the
Figure 5.13. The two means to approximate length, length of tangent lines
(top), and lengths of secant lines (bottom).
Rb
definition ([a, b]) = a | (t)| dt if is C 1 . However, this alternative definition is
more general in that it pertains to any curve for which the supremum exists as a
real number; in particular, need not be C 1 .
(ii) If C 1 (U ) and rank (u, v) = 2 for all (u, v) U , then is a smooth surface
on U in Rn :
x1 x1
u v X n 2
.. and is smooth (xi , xj )
= ... . > 0.
x x i,j=1
(u, v)
n n
u v
(iii) Two smooth surfaces and on U and U respectively are called parametrically
equivalent if there is a function f : U 7 U with f C 1 (U ), Jf (p) > 0 for
all p U , f is one-to-one from U onto U and (p) = (f (p)) for all
p U .
(iv) (U ) is called the trace of the surface in Rn . Again, we will not worry
excessively about distinguishing between a surface and its trace.
i i
i i
muldown
i i
2010/1/10
page 176
i i
(a) is continuous on U
(b) each compact Jordan measurable (has content) subset of U is the union of
finitely many Jordan measurable subsets D such that D = D , : D 7
Rn is a smooth surface, and the quantity
X n 2 1/2
(xi , xj )
i,j=1
(u, v)
is Riemann integrable on D.
describes a smooth plane since rank = 2 which may also be described as the
solution set of
x
x + y 2z = 0, or [ 1 1 2 ] y = 0.
z
From the latter equation, we see that the direction numbers of the normal vector
are (1, 2, 2). Notice also that these direction numbers are in fact given by
(y, z) (z, x) (x, y)
, , .
(u, v) (u, v) (u, v)
Example 5.24 In Example 5.22 we had a particular plane through the origin in
R3 . In this example, we take a longer look at the general form for a plane. Let
i i
i i
muldown
i i
2010/1/10
page 177
i i
(x0 , y0 , z0 ), (a1 , a2 , a3 ) and (b1 , b2 , b3 ) be given triples of real numbers, and consider
the parametrization
(
x = x0 + a1 u + b1 v
: y = y0 + a2 u + b2 v, (u, v) R2 , or
z = z + a u + b3 v
0 3
x x0 a1 b 1
y = y 0 + a2 b 2 u .
v
z z0 a3 b 3
i i
i i
muldown
i i
2010/1/10
page 178
i i
The best affine approximation to any surface (u, v) for (u, v) near (u0 , v0 ) is
(u0 , v0 ) + D(u0 , v0 )(u u0 , v v0 ); hence the following definition.
(ii) the normal line to the surface at p0 in R3 has the direction numbers
(u, v) (z, x) (x, y)
n= , , .
(u, v) p0 (u, v) p0 (u, v) p0
Question: In R4 , what corresponds to the normal line discussed above for the
surface in R3 ? If you cannot figure it out, ask.
Exercises
5.19. Show that two parametrically equivalent surfaces in R3 have the same unit
normal n1 .
5.20. Let : R2 7 Rn , (xi = xi (u, v), i = 1, . . . , n), be a smooth surface. Show
that the vectors
x1 xn x1 xn
u= ,..., , v= ,..., ,
u u p0 v v p0
are tangent vectors to certain smooth curves in . Check that the condition
rank (p0 ) = 2 simply requires that u and v be linearly independent. Check
that in the case n = 3, n = u v.
i i
i i
muldown
i i
2010/1/10
page 179
i i
Definition 5.26.
(i) If D is a compact Jordan measurable (has content) subset of U R2 and is
a smooth surface on U in R3 ,
x = x(u, v)
: y = y(u, v), (u, v) U
z = z(u, v)
then
s 2 2 2
Z Z
(y, z) (z, x) (x, y)
A((D)) := |n|du dv = + + du dv.
D D (u, v) (u, v) (u, v)
A((D)) is called the area of (D). Symbolically,
dA = |n|du dv.
Remark: Two parametrically equivalent surfaces have the same area (more on this
later).
Example 5.27 Here we compute the surface area for a surface given as the graph
of a function of two variable, namely, z = f (x, y), (x, y) D:
(x = u
: y=v that is z = f (x, y), f C 1 (D).
z = f (u, v)
Then
1 0
= 0 1 , and rank = 2,
fu fv
so is smooth. The area formula reads
Z p Z q
A((D)) = fu2 + fv2 + 1 du dv = fx2 + fy2 + 1 dx dy.
D D
i i
i i
muldown
i i
2010/1/10
page 180
i i
Example 5.28 The surface area of a sphere of radius a is easily computed when
the sphere is described as the surface
x = a sin() cos()
: y = a sin() sin() on D : {(, ) : 0 2, 0 .}.
z = a cos()
Then x2 + y 2 + z 2 = a2 can easily be checked, and
a sin() sin() a cos() cos()
= a sin() cos() a cos() sin() .
0 a sin()
It follows that
|n|2 = a4 sin4 () cos2 () + a4 sin4 () sin2 ()
2
+a4 sin()(cos() sin2 () + cos() sin() cos2 () , and
Z
A((D)) = |n|
D
Z Z 2 q
= a4 sin2 () cos2 () + a4 sin4 () d d
0 0
Z Z 2
= a2 sin() d d
0 0
= 2a2 cos() = 4a2 .
0
x
Figure 5.14. The sphere in Example 5.20.
i i
i i
muldown
i i
2010/1/10
page 181
i i
and
As motivation for the definition of A() we offer the following two ideas.
(a) The area of a plane segment (see the figure)
Ayz = A cos(1 ) = A, Azx = a cos(2 ) = Am, Axy = a cos(3 ) = An
= A2 = A2 (2 + m2 + n2 ) = A2yz + A2zx + A2xy .
(5.2)
Thus, A2 is the sum of the squares of the areas of the projectons onto the
coordinate planes (yeah Pythagoras!).
Consider the affine function : R2 7 R3 and the projections yz , zx , xy
onto the coordinate planes in R3 :
x x0 a1 b1
u
: y = y 0 + a2 b2
v
z z0 a3 b3
i i
i i
muldown
i i
2010/1/10
page 182
i i
z
3
n =(l,m,n)
A
y
A
x xy
Figure 5.16. The plane segment area and its projection onto the xy-plane.
y y a b u
yz : = 0 + 2 2
z z0 a3 b 3 v
z z a b u
zx : = 0 + 3 3
x x0 a1 b 1 v
x x0 a b u
xy : = + 1 1
y y0 a2 b 2 v
Now that the notion of content or measure has been introduced for curves and
surfaces it is a simple matter to extend the idea of integration to such objects. For
example in R3 : If : R 7 R3 is a smooth curve and I is a closed interval in R, then
i i
i i
muldown
i i
2010/1/10
page 183
i i
.
.
.
i i
i i