Math 521 - Spring 2010
May 12, 2010
Contents
1 Pep Talk
2 Notation
3 Logic
4 Sets
7 Mathematical Induction
8 Infinite Sets
10 Distance
11 Limits
12 Convergence of Sequences
13 Continuity
15 Connected Sets
16 Compact Sets
17 Derivatives
18 The Integral
19 Taylor's Formula
20 Series
21 Uniform Convergence
22 Power Series - I
23 Analytic Functions
24 Power Series - II
26 Fourier Series - I
27 Fourier Series - II
B Additional Problems
1 Pep Talk
What does not kill me, makes me stronger.
Friedrich Nietzsche (1844 - 1900)
in The Twilight of the Idols (1889)
for many students it is their first college math course, and although they did
well in high school, they are unprepared for what is expected in college. The
latter is hard for many reasons. For many students it is the first course where
they are expected to understand and write proofs. (See 3.4.) There are a lot
of hard words like "if", "for all", and "there exists" (see Chapter 3), and the
order in which they appear is crucial (see 13.3). Inequalities (see 9.2) play a
crucial role. The language of set theory (see Chapter 4) is used heavily and
some students have a hard time visualizing what a set is. On top of all this
there is some strange lingo like "connected", "compact", "open cover", etc. The first
time you think you understand and discover that I disagree you may well
decide that it is easier to get hit by a truck.
The subject is hard. Calculus was invented by Leibniz and Newton in
the seventeenth century, but the ideas presented in these notes were not fully
developed until the early twentieth century. Some of the best mathematical
minds in human history contributed to the subject. But you have one
advantage over these luminaries: a teacher who understands the subject and
wants to help you understand.
Given the task before you, why should you make the effort? The above
quote of Nietzsche may inspire you. The ability to reason carefully will
serve you well in any future endeavor. You may be driven (like I am) to
understand the patterns in mathematical reasoning. These patterns help us
see the similarities between apparently different problems and enable us to
use our understanding of one problem to solve the others. Finally, you may
actually need to use the material someday. Many mathematical problems
have no explicit solution: we must prove that there is a solution and use
that proof both to guide us to an approximate solution and to estimate the
accuracy of that solution. If we get to the end of these notes in this course
you will get a glimpse of how this works when we solve the heat equation in
problem 25.2.
Here are some tips:
1. Read. It is a bad idea to try to learn just from lectures. A really good
lecturer can make the material look easy, but just listening is rarely
enough to lead to true understanding.
reading to recall definitions and previous theorems. If you are reading
these notes on a computer and have a sufficiently recent version of
Adobe Acrobat Reader or some other pdf viewer, you will notice that
this document contains embedded hypertext links. Some viewers even
allow text searches and links to URLs. (If you are online try clicking
on the link to the official description below.) This makes the task of
jumping around the text much easier.
4. Ask questions. If you can formulate a question when you are confused,
you may discover that you become unconfused and don't need to ask
the question. But don't hesitate to ask me questions in class or office
hours.
5. Be concise when you write. Excess verbiage is hard to follow and may
conceal confusion.
6. Don't fall behind. If you postpone understanding till the night before
the exam, you may find that you are still lost. For each lecture there
will be a short prequiz on Moodle (see below) designed to get you to
read the material before I lecture on it. In addition there will be a
short post quiz almost every day (I hope).
(Rudin) Walter Rudin, Principles of Mathematical Analysis, McGraw Hill.
There are three sections of 521 this semester and each has chosen one of the
first three. The instructor I am replacing chose Buck. I used this book the last
time I taught 521. The terminology, content, and even the title are somewhat
dated. (The math department will soon change the name of 521 from Advanced
Calculus to Real Analysis I.) Lang is good but somewhat ponderous.
It contains more material than can be comfortably treated in a two semester
course. My favorite of the first three is Morgan. It is student friendly
and contains almost exactly the material I hope to cover. (See the official
description of 521 at http://www.math.wisc.edu/521-advanced-calculus.)
It is also the least expensive. The Rudin text is the best for someone who
already thinks like a mathematician, but is difficult for the apprentice.
I have arranged these notes so you can use them with any of these texts
(there are remarks and footnotes which explain slight differences in
terminology), but the notes are designed with the Morgan text in mind. Possibly
you won't even need to buy a text: all three texts are on reserve in the Math
Library. The prequizzes will only require you to read the course notes.
2 Notation
The following standard notations are used:
3 Logic
3.1. In these notes we shall often use the abbreviations
$\Longrightarrow$ for "implies",
$\iff$ for "if and only if",
$\exists$ for "there exists",
$\forall$ for "for all".
These abbreviations help clarify the logical structure of the definitions and
theorems. They usually aren't used in textbooks but are commonly used in
lectures.
P | Q | P and Q | P or Q | $P \Longrightarrow Q$ | $P \iff Q$ | not P
T | T |    T    |   T    |           T           |      T     |   F
T | F |    F    |   T    |           F           |      F     |   F
F | T |    F    |   T    |           T           |      F     |   T
F | F |    F    |   F    |           T           |      T     |   T
It is particularly important to notice that the fifth column asserts that "false
implies anything" is true. Note also that the use of the word "or" is inclusive,
as in "either x > 0 or else x < 5" (true), not exclusive as in "soup or salad?"
(not both).
$P \Longrightarrow Q$.
P implies Q.
If P, then Q.
Q, if P.
P only if Q.
Do not confuse the statement "if Q, then P" with its converse "if P, then Q":
it can happen that one is true and the other is not, as in $x = 2 \Longrightarrow x^2 = 4$.
The statement "if not Q, then not P" is called the contrapositive of the
statement "if P, then Q". A statement and its contrapositive are equivalent.
This provides the justification for proof by contradiction.
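The claims about the contrapositive and the converse can be checked mechanically by running through all four rows of a truth table. The following is a minimal sketch; the helper name `implies` is an illustrative choice, not notation from these notes.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    """Material implication: false only when p is true and q is false."""
    return (not p) or q

# Every assignment of truth values to (P, Q).
rows = list(product([True, False], repeat=2))

# A statement and its contrapositive agree on every row.
contrapositive_equivalent = all(
    implies(p, q) == implies(not q, not p) for p, q in rows
)

# A statement and its converse do NOT agree on every row.
converse_equivalent = all(implies(p, q) == implies(q, p) for p, q in rows)
```

Here `contrapositive_equivalent` comes out true and `converse_equivalent` comes out false, which matches the warning above about confusing a statement with its converse.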
3.4. One of the principal aims of this course is to teach you how to read and
write proofs. A proof is an argument intended to convince the reader that a
general principle is true in all situations. The amount of detail that an author
supplies in a proof should depend on the audience. Too little detail leaves
the reader in doubt; too much detail may leave the reader unable to see the
forest for the trees. As a general principle, the author of a proof should be
able to supply the reader with additional detail on demand. When a student
writes a proof for a teacher, the aim is usually not to convince the teacher of
the truth of some general principle (the teacher already knows that), but to
convince the teacher that the student understands the proof and can write
it clearly.
The theorems below show the proper format for writing a proof. In
each of them you are supposed to imagine that the theorem to be proved
has the indicated form. Notice how the key words choose, assume, let, and
therefore are used in the proof. In these sample formats, the phrase Blah
Blah Blah indicates a sequence of steps, each one justified by earlier steps.
Theorem If P, then Q.
Proof. Assume P. Blah Blah Blah. Therefore Q.
4 Sets
4.1. A set A divides the mathematical universe into two parts: those objects
x that belong to A and those that don't. The notation $x \in A$ means x belongs
to A. The notation $x \notin A$ means that x does not belong to A. The objects
that belong to A are sometimes called the elements of A but we will often call
them points or numbers. Other words roughly synonymous with the word set
are class, collection, and aggregate. These longer words are generally used to
avoid using the word set twice in one sentence. The situation typically arises
when an author wants to talk about sets whose elements are themselves sets.
One might write the collection of all finite sets of integers, rather than the
set of all finite sets of integers.
4.2. For two sets A and B, the notation $A \subset B$ means that A is a subset
of B, i.e. for all x we have $x \in A \Longrightarrow x \in B$. By definition, two sets are
equal if each is a subset of the other:
$A = B \iff A \subset B$ and $B \subset A$.
The notation $\{x : P(x)\}$ denotes the set of all x for which the property $P(x)$
is true. The notation $\{x \in A : P(x)\}$ denotes the set of all $x \in A$ for which
the property $P(x)$ is true. Finite sets may be defined by enumerating their
elements as in
$x \in \{a_1, a_2, \ldots, a_n\} \iff x = a_1$ or $x = a_2$ or $\cdots$ or $x = a_n$
are called respectively the union and intersection of A and B. The empty
set is denoted $\emptyset$: for all x it is true that
$x \notin \emptyset$.
Two sets are disjoint iff they have no elements in common, i.e. iff $A \cap B = \emptyset$.
The set
$X \setminus A := \{x \in X : x \notin A\}$
is called the complement of A in X. The set
$A \times B := \{(x, y) : x \in A \text{ and } y \in B\}$
of all ordered pairs $(x, y)$ with $x \in A$ and $y \in B$ is called the Cartesian
product of A and B. The term direct product is a synonym. We also use
the notation
$A^n := \underbrace{A \times A \times \cdots \times A}_{n}$.
In particular,
$\mathbb{R}^n := \{(x_1, x_2, \ldots, x_n) : x_i \in \mathbb{R}\}$
denotes the vector space of all n-tuples of real numbers, so $\mathbb{R}^1 = \mathbb{R}$,
$\mathbb{R}^2 = \mathbb{R} \times \mathbb{R}$, $\mathbb{R}^3 = \mathbb{R} \times \mathbb{R} \times \mathbb{R}$, etc.
Remark 4.4. Do not confuse an n-tuple (finite sequence of length n) with
a finite set. For the former order is important: $\{3, 7\} = \{7, 3\}$ but $(3, 7) \ne (7, 3)$;
for the latter repetitions don't matter: $\{2, 2, 3\} = \{2, 3\}$ but $(2, 2, 3) \ne (2, 3)$.
4.5. An indexed family of sets is a function which assigns a set $A_i$ to each
element i of a set I. The set I is called the index set of the family and
the family is usually denoted $(A_i)_{i \in I}$. The union and intersection of the
indexed family are defined by
$x \in \bigcup_{i \in I} A_i \iff \exists i \in I$ such that $x \in A_i$,
$x \in \bigcap_{i \in I} A_i \iff \forall i \in I$ we have $x \in A_i$.
Remark 4.6. Set theory is simply a way of formalizing logic. Simple set
theoretic identities may be proved by truth tables. For example, consider
the following distributive law
$(A \cup B) \cap C = (A \cap C) \cup (B \cap C)$.
It is usually not necessary to show so much detail in your written work, but
it will be hard for you to decide just how much detail is appropriate. A
good rule of thumb is that you should be prepared to supply more detail if
challenged.
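As an illustration of proving a set identity by truth table, the sketch below checks a distributive law of the form $(A \cup B) \cap C = (A \cap C) \cup (B \cap C)$. Each Boolean stands for a membership statement ($x \in A$, $x \in B$, $x \in C$); the function name and the sample sets are hypothetical choices made for this example.

```python
from itertools import product

def distributive_law_holds() -> bool:
    """Check the identity on all 8 rows of the truth table.

    a, b, c stand for the statements 'x in A', 'x in B', 'x in C'.
    """
    for a, b, c in product([True, False], repeat=3):
        left = (a or b) and c             # x in (A u B) n C
        right = (a and c) or (b and c)    # x in (A n C) u (B n C)
        if left != right:
            return False
    return True

# The same identity on one concrete (hypothetical) triple of finite sets.
A, B, C = {1, 2}, {2, 3}, {2, 3, 4}
sample_left = (A | B) & C
sample_right = (A & C) | (B & C)
```

The truth table covers every possible element at once, whereas the concrete sets only spot-check a single instance.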
4.7. From logic we know that
so for any set X and any indexed family $(A_i)_{i \in I}$ we have
$X \setminus \bigcup_{i \in I} A_i = \bigcap_{i \in I} (X \setminus A_i)$, $X \setminus \bigcap_{i \in I} A_i = \bigcup_{i \in I} (X \setminus A_i)$.
In particular, for $I = \{1, 2\}$ we have
$X \cap (A_1 \cup A_2) = (X \cap A_1) \cup (X \cap A_2)$,
$X \cup (A_1 \cap A_2) = (X \cup A_1) \cap (X \cup A_2)$.
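Identities of this kind are easy to spot-check on small finite sets. The sketch below uses a hypothetical ambient set `X` and a hypothetical family indexed by $I = \{1, 2, 3\}$ to test the two complement (De Morgan) identities for an indexed family, together with the two-set distributive laws. A finite check is only a sanity test, not a proof.

```python
from functools import reduce

X = set(range(10))  # ambient set (hypothetical example)
family = {1: {0, 1, 2}, 2: {2, 3, 4}, 3: {2, 4, 6, 8}}  # index set I = {1, 2, 3}

def union(sets):
    """Union of an iterable of sets."""
    return reduce(set.union, sets, set())

def intersection(sets):
    """Intersection of a nonempty iterable of sets."""
    return reduce(set.intersection, sets)

# Complement of a union is the intersection of the complements, and vice versa.
de_morgan_1 = X - union(family.values()) == intersection(X - A for A in family.values())
de_morgan_2 = X - intersection(family.values()) == union(X - A for A in family.values())

# Two-set distributive laws.
A1, A2 = family[1], family[2]
distributive_1 = X & (A1 | A2) == (X & A1) | (X & A2)
distributive_2 = X | (A1 & A2) == (X | A1) & (X | A2)
```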
of all pairs (x, y) such that y = f (x) is called the graph of the function f .
Two functions are equal iff they have the same graph.
5.2. Let X and Y be sets. We say that f is a map from X to Y and write
$f : X \to Y$ when f is a function which assigns a point $y = f(x) \in Y$ to each
point $x \in X$. Two maps $f : X \to Y$ and $f' : X' \to Y'$ are said to be equal
when $X = X'$, $Y = Y'$, and $f(x) = f'(x)$ for all $x \in X$. Thus if f and $f'$
are equal maps, then graph(f) = graph($f'$), but not conversely (because $Y = Y'$
is part of the definition of equality for maps). However most authors would
say that two functions are equal iff they have the same graph.
Remark 5.3. Some authors use the notation $x \mapsto f(x)$ to define a map.
This allows them to avoid introducing a name for the map. Thus instead of
writing
Consider the map $f : \mathbb{R} \to \mathbb{R}$ defined by $f(x) = x^5 + x$.
they may write
Consider the map $\mathbb{R} \to \mathbb{R} : x \mapsto x^5 + x$.
5.4. When $A \subset X$, $B \subset Y$, and $f : X \to Y$, the sets
Remark 5.5. The sets X and Y are sometimes called the source and target
of a map $f : X \to Y$. The image $f(X)$ of the source is what is called the
range of the function f in calculus. Thus the domain of a map is the same
as its source while the range is a subset of its target.
(4) If $f : X \to Y$ and $B_1, B_2 \subset Y$, then $f^{-1}(B_1) \setminus f^{-1}(B_2) \subset f^{-1}(B_1 \setminus B_2)$?
for $x \in X$. For any set X the identity map of X is the map $\mathrm{id}_X : X \to X$
defined by
$\mathrm{id}_X(x) = x$
for $x \in X$. Clearly
$\mathrm{id}_Y \circ f = f$ and $f \circ \mathrm{id}_X = f$
for $f : X \to Y$.
Definition 6.3. A map $f : X \to Y$ is said to be injective iff
$\forall x_1, x_2 \in X\ [f(x_1) = f(x_2) \Longrightarrow x_1 = x_2]$
$\forall y \in Y\ \exists x \in X\ y = f(x)$.
defined by $f_i(x) = x^2$. Then $f_1$ is neither injective nor surjective, $f_2$ is injective
but not surjective, $f_3$ is surjective but not injective, and $f_4$ is bijective. Any
map $g_2 : \mathbb{R} \to [0, \infty)$ such that $g_2(y) = \sqrt{y}$ for $y \ge 0$ is a left inverse to $f_2$,
and any map $g_3 : [0, \infty) \to \mathbb{R}$ such that $g_3(y) = \pm\sqrt{y}$ (the sign can depend on
y) is a right inverse to $f_3$. The inverse map to $f_4$ is $f_4^{-1}(y) = \sqrt{y}$.
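The difference between a left inverse and a right inverse can be seen numerically. The sketch below assumes (the definitions are not shown in this excerpt) that the injective squaring map has source $[0, \infty)$ and the surjective one has target $[0, \infty)$. The function names and the value chosen for negative inputs are illustrative choices.

```python
import math

def square(x: float) -> float:
    """The rule x -> x^2 shared by all the maps in the example."""
    return x * x

def g2(y: float) -> float:
    """A left inverse of squaring on [0, inf): only its values for y >= 0 matter."""
    return math.sqrt(y) if y >= 0 else 0.0  # value for y < 0 is arbitrary

def g3(y: float) -> float:
    """A right inverse of squaring onto [0, inf): here the sign is always minus."""
    return -math.sqrt(y)

# Left inverse: g2(square(x)) == x for x in [0, inf).
left_ok = all(math.isclose(g2(square(x)), x) for x in [0.0, 1.0, 2.5, 3.0])

# Right inverse: square(g3(y)) == y for y in [0, inf).
right_ok = all(math.isclose(square(g3(y)), y) for y in [0.0, 1.0, 6.25, 9.0])

# g3 is not a left inverse: g3(square(2)) is -2, not 2.
g3_is_left_inverse_at_2 = math.isclose(g3(square(2.0)), 2.0)
```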
Problem 6.8. Which (if any) of the false formulas in problem 5.7 become
true if we assume that the map f is injective? surjective? (Proof or
counterexample.)
7 Mathematical Induction
7.1. The principle of mathematical induction is the following axiom.
Let $S \subset \mathbb{N}$ be a set of nonnegative integers. Assume that $0 \in S$
and that $n \in S \Longrightarrow n + 1 \in S$. Then $S = \mathbb{N}$.
It is usually taught in College Algebra as a means of proving identities like
$\sum_{k=0}^{n} k = \frac{n(n+1)}{2}$ and $\sum_{k=0}^{n} k^2 = \frac{n(n+1)(2n+1)}{6}$.
For example to prove the first identity let S denote the set of all $n \in \mathbb{N}$ for
which the statement is true. Then $0 \in S$ and if $n \in S$ then
$\sum_{k=0}^{n+1} k = \left( \sum_{k=0}^{n} k \right) + (n+1) = \frac{n(n+1)}{2} + (n+1) = \frac{(n+1)(n+2)}{2}$
$F_0 = 0$, $F_1 = 1$, $F_n = F_{n-1} + F_{n-2}$.
Thus $F_2 = 1$, $F_3 = 2$, $F_4 = 3$, $F_5 = 5$, $F_6 = 8$, $F_{10} = 55$, etc.
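An inductive definition translates directly into a loop. As a minimal sketch, the code below iterates the recursion $F_0 = 0$, $F_1 = 1$, $F_n = F_{n-1} + F_{n-2}$ and also spot-checks the formula for $\sum_{k=0}^{n} k$. The function names are illustrative, and checking finitely many cases is of course not a substitute for the induction proof.

```python
def fib(n: int) -> int:
    """Return F_n, where F_0 = 0, F_1 = 1 and F_n = F_(n-1) + F_(n-2)."""
    a, b = 0, 1  # (F_0, F_1)
    for _ in range(n):
        a, b = b, a + b  # step from (F_k, F_(k+1)) to (F_(k+1), F_(k+2))
    return a

def sum_formula_holds(n_max: int) -> bool:
    """Check that 0 + 1 + ... + n equals n(n+1)/2 for every n <= n_max."""
    total = 0
    for n in range(n_max + 1):
        total += n
        if total != n * (n + 1) // 2:
            return False
    return True

first_terms = [fib(n) for n in range(11)]
```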
Problem 7.3. Prove the sum of squares formula in 7.1.
8 Infinite Sets
Definition 8.1. A set S is said to be
Problem 8.3. Suppose that S and T are finite sets of cardinality s and t
respectively. What is the cardinality of $S \times T$? Of the set $T^S$ of maps from
S to T? Of the set $2^S$ of subsets of S? Of $S \cup T$ (assuming that $S \cap T = \emptyset$)?
Proof. It is enough to prove that $\mathbb{N} \times \mathbb{N}$ is countable. Consider the following
enumeration of $\mathbb{N} \times \mathbb{N}$:
f(0) = (0, 0)    f(2) = (0, 1)    f(5) = (0, 2)    f(9) = (0, 3)    f(14) = (0, 4)   ...
f(1) = (1, 0)    f(4) = (1, 1)    f(8) = (1, 2)    f(13) = (1, 3)   ...
f(3) = (2, 0)    f(7) = (2, 1)    f(12) = (2, 2)   ...
f(6) = (3, 0)    f(11) = (3, 1)   ...
f(10) = (4, 0)   ...
...
Problem 8.6. For f as in Proposition 8.5, find $f(1307)$ and $f^{-1}(58, 19)$.
Hint: The formula for the sum of the first n natural numbers from 7.1 will
be helpful.
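One way to experiment with an enumeration of this kind is to code it and compare against the tabulated values. The sketch below assumes the diagonal pattern visible in the table: pairs $(x, y)$ with the same sum $s = x + y$ are listed consecutively, starting at index $s(s+1)/2$ with $y = 0$. The function names are illustrative, and the closed form is read off from the table rather than quoted from the notes.

```python
import math

def pair_index(x: int, y: int) -> int:
    """Position of (x, y) in the diagonal enumeration (assumed pattern)."""
    s = x + y                      # which diagonal the pair lies on
    return s * (s + 1) // 2 + y    # start of that diagonal, plus offset y

def pair_at(k: int) -> tuple[int, int]:
    """Inverse of pair_index: the pair listed at position k."""
    # Largest s with s(s+1)/2 <= k, found with an exact integer square root.
    s = (math.isqrt(8 * k + 1) - 1) // 2
    y = k - s * (s + 1) // 2
    return (s - y, y)

# A few entries copied from the table in the proof.
table = {0: (0, 0), 1: (1, 0), 2: (0, 1), 3: (2, 0), 4: (1, 1),
         5: (0, 2), 9: (0, 3), 13: (1, 3), 14: (0, 4)}
```

The two functions being mutually inverse on a range of inputs is a finite check consistent with the enumeration being a bijection.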
9 Axioms for the Real Numbers
We state here the axioms for the real number system R. We shall accept
these axioms without proof but it can be proved (from more general axioms)
that there is an essentially unique structure satisfying them.
9.1. Algebraic Axioms. The set R of real numbers is equipped with two
operations
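$\mathbb{R} \times \mathbb{R} \to \mathbb{R} : (a, b) \mapsto a + b$, $\mathbb{R} \times \mathbb{R} \to \mathbb{R} : (a, b) \mapsto a \cdot b$
The standard notations from high school algebra are used: in particular,
$ab := a \cdot b$, $a - b := a + (-b)$, and $a/b = a \cdot b^{-1}$.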
9.2. Order Axioms. The set $\mathbb{R}$ has an order relation denoted $a < b$
satisfying the following laws for all $a, b, c \in \mathbb{R}$:
The other order notations are defined as usual, i.e. $a < b \iff b > a$ and
$a \le b \iff b \ge a \iff$ either $a < b$ or $a = b$.
Remark 9.3. All the rules of algebra used in College Algebra (Math 112)
follow from the Algebraic Axioms 9.1 and Order Axioms 9.2. For example,
$(a + b)^2 = a^2 + 2ab + b^2$, $a^2 \ge 0$, etc.
Problem 9.4. Does it follow from the axioms in 9.1 and 9.2 that $a^{-1} > 0$ if
$a > 0$? Explain.
Definition 9.5. A set S of real numbers is said to be bounded above iff
there is a number $b \in \mathbb{R}$ such that $x \le b$ for all $x \in S$; the number b is then
called an upper bound for S. A number $b \in \mathbb{R}$ is called a least upper
bound for S iff it is an upper bound for S and $b \le b'$ for every other upper
bound $b'$ for S. Similarly the set S is said to be bounded below iff there
is a number $a \in \mathbb{R}$ such that $a \le x$ for all $x \in S$; the number a is then
called a lower bound for S. A number $a \in \mathbb{R}$ is called a greatest lower
bound iff it is a lower bound for S and $a' \le a$ for every other lower bound
$a'$ for S. The words infimum and greatest lower bound are synonymous as
are the words supremum and least upper bound. The least upper bound of
the set S will be denoted sup(S) and the greatest lower bound of the set S
will be denoted inf(S). We write $\sup(S) = \infty$ when S is not bounded above
and $\inf(S) = -\infty$ when S is not bounded below.
9.6. Completeness Axiom. Every set S of real numbers which is bounded
above has a least upper bound, i.e.
if $x \le b$ for all $x \in S$, then $x \le \sup(S) \le b$ for all $x \in S$.
Because multiplication by $-1$ reverses the order it is the same to say that
every set which is bounded below has a greatest lower bound. Thus
if $a \le x$ for all $x \in S$, then $a \le \inf(S) \le x$ for all $x \in S$.
Remark 9.7. For Morgan the completeness axiom is a theorem (see page 44
of his book) but most authors of undergraduate texts don't prove it. Of
course it can't be proved until we give a precise construction (definition) of
the real numbers. Morgan takes the view that a real number is a number
with a decimal expansion, but avoids certain subtle issues connected with
this view. Actually making his definition precise would involve defining how
to do the arithmetic operations with decimal expansions and would be a bit
tedious. To see why, imagine that $a + b = c$ and that
$a = \sum_{n=1}^{\infty} a_n 10^{-n}$, $b = \sum_{n=1}^{\infty} b_n 10^{-n}$, $c = \sum_{n=1}^{\infty} c_n 10^{-n}$,
where the coefficients $a_n$, $b_n$, $c_n$ are integers between 0 and 9, and try to
express $c_n$ in terms of the $a_n$'s and the $b_n$'s. In an appendix to his book, Buck
sketches a construction of the real numbers by something called Dedekind
cuts but leaves out many details. Rudin explains cuts in an appendix to
his chapter 1. (The book Foundations of Analysis by Edmund Landau gives
all the details.) The completeness axiom is an easy consequence of this
construction. One can also define the real numbers using equivalence classes
of Cauchy sequences (see 12.15) and again the completeness axiom is a
consequence; if we get to the topic of completion of metric spaces, I'll explain
it then.
The crucial point is the uniqueness up to isomorphism which we will
prove in Appendix A. It means that if we start from the axioms, it doesn't
matter what definition of the real numbers we use. We will not use the
completeness axiom until the proof of Theorem 12.7 and the exposition is
arranged in such a way that it will be clear that the reasoning is not circular.
The following exercises will help you understand some of the issues.
Problem 9.8. Prove the following Archimedean Property of the real
numbers: There is neither an infinite real number nor an infinitesimal real
number. More precisely,
(1) There is no real number which is larger than every integer.
(2) For every positive real number $\epsilon > 0$ there is a positive integer n such
that $1/n < \epsilon$.
Hint: If $\omega > n$ for every integer n, what about $\omega - 1$? The proof will use the
completeness axiom.
Problem 9.9. Let $\mathcal{R}$ denote the set of real valued rational functions, i.e.
$f \in \mathcal{R}$ iff $f(x) = p(x)/q(x)$ where $p(x)$ and $q(x)$ are polynomials with real
coefficients (and $q(x)$ is not the zero polynomial). For $f, g \in \mathcal{R}$ define an
order relation by the condition that $f > g$ iff there exists an M such that
$f(x) > g(x)$ for all $x > M$. Then the set $\mathcal{R}$ satisfies the algebraic axioms
and order axioms given above. View $\mathbb{R}$ (and hence $\mathbb{Z}$) as a subset of $\mathcal{R}$
by identifying the real number c with the constant function whose value is
always c. Exhibit (in the lingo of Problem 9.8) an infinite element $f \in \mathcal{R}$
and an infinitesimal element $g \in \mathcal{R}$. Hint: What is $\lim_{x \to \infty} \frac{p(x)}{q(x)}$?
Problem 9.10. Prove that there is a rational number in every nonempty
open interval $(a, b) \subset \mathbb{R}$.
Solution: By the Archimedean property from Problem 9.8 there are integers
m and n with $m < a < b < n$. Again by the Archimedean property there
is a positive integer k with $0 < 1/k < b - a$. Define $q_j = m + j/k$ for
$j = 0, 1, \ldots, k(n - m)$. Then $q_j = m$ when $j = 0$, $q_j = n$ when $j = k(n - m)$,
and $q_j - q_{j-1} = 1/k < b - a$. If j is the largest integer such that $q_{j-1} \le a$
then $a < q_j < b$ as $q_j - q_{j-1} = 1/k < b - a$.
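The idea of this solution (choose a step $1/k$ smaller than $b - a$, then take the first multiple of $1/k$ beyond $a$) can be turned into a short computation. The sketch below is a simplified variant, not a transcription: it indexes multiples of $1/k$ from 0 rather than from the integer m, and it takes exact `Fraction` endpoints as stand-ins for real numbers. The name `rational_between` is an illustrative choice.

```python
import math
from fractions import Fraction

def rational_between(a: Fraction, b: Fraction) -> Fraction:
    """Return a rational q with a < q < b, for exact endpoints a < b."""
    if not a < b:
        raise ValueError("need a < b")
    k = math.floor(1 / (b - a)) + 1   # k > 1/(b - a), so 1/k < b - a
    j = math.floor(a * k) + 1         # smallest integer j with j/k > a
    # j - 1 <= a*k gives j/k <= a + 1/k < b, so a < j/k < b.
    return Fraction(j, k)

q1 = rational_between(Fraction(1, 3), Fraction(1, 2))
q2 = rational_between(Fraction(-7, 5), Fraction(-4, 3))
```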
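Problem 9.11. Let $\mathbb{Q}(\sqrt{2})$ denote the set of all numbers of form $x = a + b\sqrt{2}$
where a and b are rational. Show that $\mathbb{Q}(\sqrt{2})$ is closed under the algebraic
operations, i.e. if $x, y \in \mathbb{Q}(\sqrt{2})$, then $x \pm y \in \mathbb{Q}(\sqrt{2})$, $xy \in \mathbb{Q}(\sqrt{2})$, and (if
$y \ne 0$) $x/y \in \mathbb{Q}(\sqrt{2})$. Show further that $\sqrt{3} \notin \mathbb{Q}(\sqrt{2})$. Does $\mathbb{Q}(\sqrt{2})$ satisfy
the completeness axiom?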
Problem 9.12. Assume that
$x = \sum_{k=1}^{\infty} x_k 10^{-k}$
by the first part and the triangle inequality. Since $2 \cdot 10^{-8} < 10^{-7}$ we have that
n = 8 works here as well. Note: The point of this problem is to convince you
that it is awkward to work with decimal expansions if you want to be super
careful. Also the wording of the problem ("How large must n be to ensure
that") suggests that I want the smallest possible n that works. Usually I
don't care about that. I could have replaced the phrase "How large must n
be to ensure that" by "Find N so that for $n \ge N$ we have". In that case the
answer n = 13472 would be correct (but overkill).
10 Distance
10.1. The distance $d(p, q)$ between two points $p = (x_1, x_2, \ldots, x_n)$ and
$q = (y_1, y_2, \ldots, y_n)$ in $\mathbb{R}^n$ is defined by
$d(p, q) = \sqrt{(x_1 - y_1)^2 + (x_2 - y_2)^2 + \cdots + (x_n - y_n)^2}$.
The distance $d(v, 0)$ from a vector $v \in \mathbb{R}^n$ to the origin is called the norm
of v, denoted by $|v|$, so
$d(p, q) = |p - q|$.
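For readers who like to compute, the Euclidean norm and distance can be sketched in a few lines. The function names `norm` and `dist` and the sample points are illustrative. The triangle inequality is only spot-checked here on one triple of points, with a small tolerance for floating-point rounding.

```python
import math

def norm(v) -> float:
    """Euclidean norm |v|: square root of the sum of squared coordinates."""
    return math.sqrt(sum(x * x for x in v))

def dist(p, q) -> float:
    """Euclidean distance d(p, q) = |p - q| between points of equal length."""
    return norm([x - y for x, y in zip(p, q)])

p, q, r = (0.0, 0.0), (3.0, 4.0), (6.0, 0.0)
d_pq = dist(p, q)  # distance from the origin to (3, 4)
triangle_ok = dist(p, r) <= dist(p, q) + dist(q, r) + 1e-12
symmetric = dist(p, q) == dist(q, p)
```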
The norm satisfies the following laws for $v, w \in \mathbb{R}^n$:
(symmetry) $|-v| = |v|$,
The zero norm law holds because a sum of squares vanishes only if each
summand vanishes and the triangle inequality is proved in problem 10.3
below. The laws for the norm imply that the distance function satisfies the
following:
(symmetry) $d(p, q) = d(q, p)$,
$\langle u, v \rangle = u_1 v_1 + u_2 v_2 + \cdots + u_n v_n$
Problem 10.3. In this problem you will prove the triangle inequality without
using trigonometry in two steps.
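Remark 10.5. When n = 1 the open ball is an open interval:
$B(a, \epsilon) = \{x \in \mathbb{R} : |x - a| < \epsilon\}$
$= \{x \in \mathbb{R} : a - \epsilon < x < a + \epsilon\}$
$= (a - \epsilon, a + \epsilon)$
for $a \in \mathbb{R}$.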
11 Limits
The intuitive idea of the notation
$\lim_{p \to p_0} F(p) = L$
is that $F(p)$ is very close to L when p is very close to $p_0$. Some authors write
$F(p) \to L$ as $p \to p_0$; others write $F(p) \approx L$ when $p \approx p_0$. In this chapter
we give a more precise definition. The following lingo is helpful.
Proof. "If" is easy: an infinite set is nonempty and at most one of the points
in an infinite set is p. For "only if" assume p is an accumulation point of
the set S and choose a neighborhood U of p. By definition there is a point
$p_0 \in U \cap S \setminus \{p\}$. Let $\epsilon_0 = |p - p_0|$. For $n > 0$ define $\epsilon_n > 0$ and $p_n \in S$
inductively (see 7.2) by $\epsilon_n = \min(|p_n - p|, 1/n)$ and $p_{n+1} \in B(p, \epsilon_n) \cap S$. The
map $n \mapsto p_n$ is injective as $|p_n - p| < |p_m - p|$ for $n > m$. Choose $\epsilon > 0$
so that $B(p, \epsilon) \subset U$ (by the definition of neighborhood). Then $\epsilon_n < \epsilon$ for
$1/n < \epsilon$ so $p_n \in B(p, \epsilon) \cap S \subset U \cap S$. Hence U contains the infinite set
$\{p_n : n > 1/\epsilon\}$.
Footnote 7. Buck uses the term cluster point and some authors use the term limit point.
11.3. Let $p_0$ be an accumulation point of a set S and F be a function defined
on S (but possibly not at $p_0$). The notation
$\lim_{p \to p_0} F(p) = L$
consist of the set of real numbers together with two additional points which
we think of as located at infinity. The set $\overline{\mathbb{R}}$ is sometimes called the set of
extended real numbers. Extend the usual order relation on $\mathbb{R}$ to $\overline{\mathbb{R}}$ in the
obvious way. For $a \in \overline{\mathbb{R}}$, a set $U \subset \overline{\mathbb{R}}$ is called a neighborhood of a iff
or else $a = \infty$ and U contains an interval $(M, \infty]$ for some $M > 0$,
$\lim_{x \to a} F(x) = L$
12 Convergence of Sequences
12.1. A sequence is a function defined on a subset of the integers. (Usually
this subset is the set $\mathbb{Z}^+ := \{n \in \mathbb{Z} : n > 0\}$ of positive integers or the set
$\mathbb{N} := \{n \in \mathbb{Z} : n \ge 0\}$ of nonnegative integers.) It is customary to denote
the value of a sequence at an integer n with a subscript rather than with
parentheses and to denote a sequence with a notation like $(p_n)_n$ or $(p_n)_{n \in \mathbb{Z}^+}$.
Remark 12.3. Using the lingo introduced in 11.4 this may be stated as
$\lim_{n \to \infty} p_n = p$ iff for every $\epsilon > 0$ there exists $N = N(\epsilon) > 0$ such that
$n \ge N \Longrightarrow |p_n - p| < \epsilon$.
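To make the dependence $N = N(\epsilon)$ concrete, here is a minimal sketch for the hypothetical example sequence $p_n = 1/n$ with limit $p = 0$ (this sequence is chosen for illustration and is not taken from the notes). Any integer $N > 1/\epsilon$ works, since $n \ge N$ then gives $1/n \le 1/N < \epsilon$.

```python
import math

def N_for(eps: float) -> int:
    """An N with |1/n - 0| < eps for all n >= N (example sequence p_n = 1/n)."""
    return math.floor(1 / eps) + 1  # strictly larger than 1/eps

def check(eps: float, how_many: int = 1000) -> bool:
    """Spot-check the defining inequality for the first few n >= N(eps)."""
    N = N_for(eps)
    return all(abs(1 / n - 0) < eps for n in range(N, N + how_many))

results = {eps: (N_for(eps), check(eps)) for eps in (0.5, 0.1, 0.003)}
```

The value returned need not be the smallest N that works; the definition only asks that some N exist for each $\epsilon$.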
Theorem 12.4. Assume that $(a_n)_n$ and $(b_n)_n$ are convergent sequences of
real numbers:
$\lim_{n \to \infty} a_n = a$, $\lim_{n \to \infty} b_n = b$.
Then
$\lim_{n \to \infty} (a_n + b_n) = a + b$, $\lim_{n \to \infty} a_n b_n = ab$.
Let $N = \max(N_0, N_1, N_2)$. Then for $n > N$ we have
(IV) That $\lim_{n \to \infty} a_n/b_n = a/b$ follows immediately from (II) and (III) by
substituting $1/b_n$ for $b_n$ and $1/b$ for b.
Theorem 12.7. A bounded weakly monotonic sequence is convergent. In
fact
$\lim_{n \to \infty} a_n = \sup_n a_n$
for a nondecreasing sequence and
$\lim_{n \to \infty} a_n = \inf_n a_n$
for a nonincreasing sequence.
Proof. (Compare Buck page 47 or Lang page 35 or Morgan page 38.) In this
proof we use the completeness axiom 9.6 for the first time.
Assume that the sequence $(a_n)_n$ is nondecreasing and let $a = \sup\{a_n : n \in \mathbb{N}\}$.
Then $a_n \le a$ for all n as a is an upper bound for the set $\{a_n : n \in \mathbb{N}\}$.
Choose $\epsilon > 0$. Then $a - \epsilon < a$ so $a - \epsilon$ is not an upper bound for the set
$\{a_n : n \in \mathbb{N}\}$. Hence there is an N with $a - \epsilon < a_N$. For $n > N$ we have
$a_N \le a_n$ as the sequence $(a_n)_n$ is nondecreasing so
$a - \epsilon < a_N \le a_n \le a < a + \epsilon$
12.8. We introduce some handy notation. For any sequence $(a_k)_k$ of real
numbers we have $\{a_k : k \ge n\} \subset \{a_k : k \ge m\}$ for $m < n$. If the sequence
$(a_k)_k$ is bounded above, then the sequence $s_n := \sup\{a_k : k \ge n\}$ is
nonincreasing. The limit of the latter sequence is denoted
Remark 12.10. If the sequence $(p_n)_n$ converges to p, then every subsequence
$(p_{n_k})_k$ also converges to p. This follows immediately from the definition of
convergence: $n_k \ge k$ so if $N = N(\epsilon)$ satisfies $|p_n - p| < \epsilon$ for $n > N(\epsilon)$ then
in particular we have $|p_{n_k} - p| < \epsilon$ for $k > N(\epsilon)$.
Since $A_n \supset A_{n+1}$ we have that $b_n \le b_{n+1}$ so the sequence $(b_n)_n$ converges to its
supremum $b := \sup\{b_n\} := \liminf_{n \to \infty} a_n$. We will show that a subsequence of
the sequence $(a_n)_n$ also converges to b. For $n \in \mathbb{N}$ we have that $b_n < b_n + n^{-1}$
and $b_n$ is the greatest lower bound for $A_n$ so $b_n + n^{-1}$ is not a lower bound
for $A_n$ so there is a $c_n \in A_n$ with $b_n \le c_n < b_n + n^{-1}$. As $b_n$ converges to b
by Theorem 12.7 we have that for every $\epsilon > 0$ there is an $N = N(\epsilon)$ such
that $|b_n - b| < \epsilon/2$ for $n > N(\epsilon)$. Hence for $n > \max(2/\epsilon, N(\epsilon))$ we have that
$|c_n - b| \le |c_n - b_n| + |b_n - b| < n^{-1} + \epsilon/2 < \epsilon$ which shows that $(c_n)_n$ converges
to b.
Now $c_n \in A_n$ so $c_n = a_j$ for some $j = j(n) \ge n + 1$, but we aren't quite
done because the definition of subsequence requires that the subscripts $j(n)$
increase and there is no reason for that to be true. However we can extract a
further subsequence by induction. Namely if $n_1 < n_2 < \cdots < n_k$ have been
defined, define $n_{k+1}$ by $n_{k+1} = j(n_k)$. Then $n_{k+1} = j(n_k) \ge n_k + 1 > n_k$ as
required. (The further subsequence still converges by Remark 12.10.)
Now we prove the theorem for a sequence of points in $\mathbb{R}^m$ by induction on
m. Assume the theorem holds for $\mathbb{R}^m$ and choose a bounded sequence $(p_n)_n$
of points in $\mathbb{R}^{m+1}$. Then $p_n = (q_n, a_n)$ where $q_n \in \mathbb{R}^m$ and $a_n \in \mathbb{R}$. That
the sequence $(p_n)_n$ is bounded means that there is an M such that $|p_n| \le M$
for all n. As $|p_n|^2 = |q_n|^2 + a_n^2$ it follows that $|q_n| \le M$ and $|a_n| \le M$ for
all n, i.e. the sequences $(q_n)_n$ and $(a_n)_n$ are also bounded. By the inductive
hypothesis the sequence $(q_n)_n$ has a subsequence $(q_{n_k})_k$ converging to q. By
replacing the sequence $(p_n)_n$ by the sequence $(p_{n_k})_k$ we may assume that the
sequence $(q_n)_n$ converges. (Remark 12.10.) Now by the case m = 1 (already
proved) the sequence $(a_n)_n$ contains a convergent subsequence $(a_{n_k})_k$. Hence
as required.
Corollary 12.12. For a subset $S \subset \mathbb{R}^m$ the following conditions are equivalent.
(1) For every sequence $(p_n)_n$ of points of S there is a subsequence $(p_{n_k})_k$
which converges to a point $p \in S$.
(2) The set S is closed and bounded.
Proof. Assume (1). Then certainly every convergent sequence $p_n \in S$ has its limit
in S so S is closed by Theorem 14.11. Also S is bounded as otherwise for
every n there would be a point $p_n \in S$ with $|p_n| > n$ and this sequence cannot
have a convergent subsequence. Conversely assume (2) and let $(p_n)_n$ be a
sequence of points in S. As S is bounded there is a convergent subsequence
(by Theorem 12.11) and as S is closed the limit of this subsequence is a point
of S (by Theorem 14.11).
Corollary 12.13. Every bounded infinite subset of $\mathbb{R}^m$ has an accumulation
point.
Proof. By Theorem 12.11 and Proposition 11.2.
Remark 12.14. The image (see footnote 8) of the sequence $(p_n)_{n \in \mathbb{N}}$ (when viewed as a map
$n \mapsto p_n$) is the set
$S = \{p_n : n \in \mathbb{N}\}$.
The set S can be finite. For example for the sequence $p_n = (-1)^n$, the
set S is the two element set $S = \{-1, 1\}$. If the image of a sequence is
finite then there must be at least one constant subsequence and a constant
subsequence is trivially convergent. By definition only an infinite set can
have an accumulation point.
Footnote 8. Buck calls the set S the trace of the sequence, but that terminology is uncommon.
Definition 12.15. A sequence $\{p_n\}$ is called Cauchy iff
$\lim_{m, n \to \infty} |p_n - p_m| = 0$
i.e. iff for every $\epsilon > 0$ there exists $N > 0$ such that $|p_n - p_m| < \epsilon$ for
$n, m \ge N$.
Theorem 12.16 (Cauchy Convergence Criterion). A sequence in $\mathbb{R}^n$
converges if and only if it is a Cauchy sequence.
Proof. (Buck Theorem 23 and its corollary on pages 62-63.)
13 Continuity
Throughout this chapter $f : X \to Y$ where $X \subset \mathbb{R}^n$ and $Y \subset \mathbb{R}^m$.
Definition 13.1. The map f is said to be continuous at a point $p \in X$ iff
for every $\epsilon > 0$ there exists $\delta > 0$ such that $f(B_X(p, \delta)) \subset B(f(p), \epsilon)$.
Theorem 13.2. The map f is continuous at $p \in X$ if and only if for every
sequence $\{p_n\}$ of points in X we have
$\lim_{n \to \infty} p_n = p$. (2)
By (2) there is an N such that $|p_n - p| < \delta$ for $n > N$. Hence by (3)
$|f(p_n) - f(p)| < \epsilon$ for $n > N$. This proves
as required.
We prove "if". Assume that f is not continuous at $p \in X$. Then there is
an $\epsilon > 0$ such that for every $\delta > 0$ there is a $q \in X$ such that
$|q - p| < \delta$ but $|f(q) - f(p)| \ge \epsilon$.
In particular, for each $n \in \mathbb{Z}^+$ there is a $q_n$ such that
$|q_n - p| < \frac{1}{n}$ but $|f(q_n) - f(p)| \ge \epsilon$.
But then (2) holds but (4) fails. This proves that (1) is false as required.
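Definition 13.3. The map f is said to be continuous iff it is continuous
at every point of X, i.e. iff
$\forall p \in X\ \forall \epsilon > 0\ \exists \delta > 0$ such that $f(B(p, \delta)) \subset B(f(p), \epsilon)$.
The map f is said to be uniformly continuous iff
$\forall \epsilon > 0\ \exists \delta > 0$ such that $\forall p \in X$ we have $f(B(p, \delta)) \subset B(f(p), \epsilon)$.
(For continuity $\delta = \delta(p, \epsilon)$; for uniform continuity $\delta = \delta(\epsilon)$.)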
Proposition 13.4. If $f : X \to Y$ and $g : Y \to Z$ are both continuous, then
so is the composition $g \circ f : X \to Z$.
Proof. Choose $p_0 \in X$ and $\epsilon > 0$. As g is continuous there exists $\eta > 0$
such that $|g(q) - g(f(p_0))| < \epsilon$ whenever $q \in Y$ and $|q - f(p_0)| < \eta$. As f is
continuous, there exists $\delta > 0$ such that $|f(p) - f(p_0)| < \eta$ whenever $p \in X$
and $|p - p_0| < \delta$. For $p \in X$ we have $q = f(p) \in Y$ so
$|p - p_0| < \delta \Longrightarrow |f(p) - f(p_0)| < \eta \Longrightarrow |(g \circ f)(p) - (g \circ f)(p_0)| < \epsilon$
as required.
13.5. The map f is said to be Lipschitz iff there is a constant M such that
$|f(p) - f(q)| \le M |p - q|$
for all $p, q \in X$. A Lipschitz function is uniformly continuous. (Proof:
$\delta = \epsilon/M$.)
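The one-line proof can be spelled out as follows. This is a sketch assuming $M > 0$ (if $M = 0$ the map is constant and any $\delta$ works); it uses only the Lipschitz inequality and the choice of $\delta$ suggested in the proof.

```latex
\[
  |p - q| < \delta = \frac{\epsilon}{M}
  \;\Longrightarrow\;
  |f(p) - f(q)| \le M\,|p - q| < M\delta = \epsilon .
\]
```

Since this $\delta$ depends only on $\epsilon$ and M, and not on the points p and q, the continuity is uniform.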
Problem 13.6. Let $f(x) = x^p$. Show that f is Lipschitz on every closed
interval $[a, b] \subset (0, \infty)$. For which values of p is f uniformly continuous
on $(0, \infty)$? Hint: Use the Mean Value Theorem from calculus. (See
Theorem 17.4 below.) Theorem 16.4 below may also help.
14 Open Sets and Closed Sets
In all the following definitions the term set means subset of $\mathbb{R}^m$.
Definition 14.1. A set U is open iff for every $p \in U$ there exists an $\epsilon > 0$
such that $B(p, \epsilon) \subset U$.
Problem 14.2. Prove that the ball $B(p, \epsilon)$ is open. Hint: You must choose
an arbitrary $q \in B(p, \epsilon)$ and then find a $\delta > 0$ so that $B(q, \delta) \subset B(p, \epsilon)$.
Use the triangle inequality.
Theorem 14.3. The collection of all open sets in $\mathbb{R}^m$ satisfies the following
conditions:
(i) The set $\mathbb{R}^m$ and the empty set $\emptyset$ are both open.
(ii) The intersection of a finite collection of open sets is open.
(iii) The union of an arbitrary collection of open sets is open.
Proof. The set $\mathbb{R}^m$ is open because $B(p, 1) \subset \mathbb{R}^m$ for $p \in \mathbb{R}^m$. The empty set
is open because every $p \in \emptyset$ satisfies the required condition (or any other
condition, since "false implies anything" is true; see 3.3). To prove (ii)
assume U is open. Then for every point $p \in U$ there is an $\epsilon = \epsilon_p$ such that
$B(p, \epsilon_p) \subset U$ (see footnote 9). It follows that
$U = \bigcup_{p \in U} B(p, \epsilon_p)$, ($*$)
as required.
Footnote 9. This is actually an example of an application of the Axiom of Choice. See Remark 6.5.
Definition 14.4. A set $W \subset X$ is called relatively open in X iff for every
$p \in W$ there exists an $\epsilon > 0$ such that $B_X(p, \epsilon) \subset W$. (See 10.4.)
Proof. Equation ($*$) in the last proof and the distributive law from 4.7.
Proof. Exercise.
Corollary 14.10. The collection of all closed sets in $\mathbb{R}^n$ satisfies the following
conditions:
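Proof. (See Buck Theorem 5 page 40.) To prove "only if" assume that S is
closed, that $\lim_{n \to \infty} p_n = p$, and that each $p_n \in S$. If $p \notin S$ then $p \in \mathbb{R}^m \setminus S$.
As this set is open there is an $\epsilon > 0$ such that $B(p, \epsilon) \subset \mathbb{R}^m \setminus S$. As the
sequence converges to p there is an N such that $p_n \in B(p, \epsilon)$ for $n > N$,
contradicting the hypothesis that $p_n \in S$. To prove "if" assume that S is not
closed. Then $\mathbb{R}^m \setminus S$ is not open so there is a point $p \in \mathbb{R}^m \setminus S$ such that
$B(p, \epsilon) \not\subset \mathbb{R}^m \setminus S$ for every $\epsilon > 0$. In particular for $\epsilon = 1/n$ there is a point
$p_n \in B(p, 1/n)$ (i.e. $|p_n - p| < 1/n$) such that $p_n \notin \mathbb{R}^m \setminus S$, i.e. $p_n \in S$. Thus
$\lim_{n \to \infty} p_n = p$ and $p \notin S$ as desired.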
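14.12. Let $S \subset \mathbb{R}^n$. For any point $p \in \mathbb{R}^n$ exactly one of the following alternatives holds:
(i) $B(p, \delta) \subset S$ for some $\delta > 0$.
The notations
\[ S^\circ := \operatorname{int}(S), \qquad \partial S := \operatorname{bdry}(S) \]
are commonly used.
Example 14.13. For the half open interval $S = [a, b) \subset \mathbb{R}$ we have
\[ \operatorname{cl}(S) := \bar{S} := S \cup \operatorname{bdry}(S). \]
Proposition 14.15. The interior $\operatorname{int}(S)$ of $S$ is the largest open set contained in $S$ and the closure $\bar{S}$ of $S$ is the smallest closed set containing $S$.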
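Proof. Exercise. Hint: You must show (1a) $\operatorname{int}(S)$ is open, (1b) $\bar{S}$ is closed, (2a) $\operatorname{int}(S) \subset S$, (2b) $S \subset \bar{S}$, (3a) if $U$ is open and $U \subset S$ then $U \subset \operatorname{int}(S)$, and (3b) if $T$ is closed and $S \subset T$ then $\bar{S} \subset T$.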
Problem 14.16. Prove that if $U \subset \mathbb{R}^m$ is an open set and $p \in U$, then there is a point $q$ with rational coordinates and a positive rational number $\rho$ such that $p \in B(q, \rho) \subset U$.
Solution. As $U$ is open there is an $r > 0$ such that $B(p, r) \subset U$. Let $p_i$ denote the $i$th coordinate of $p$ so that $p = (p_1, p_2, \ldots, p_m)$. By Problem 9.10 there are rational numbers $q_1, q_2, \ldots, q_m$ such that
\[ p_i - \frac{r}{2m} < q_i < p_i + \frac{r}{2m}. \]
Let $q = (q_1, q_2, \ldots, q_m)$. By the Triangle Inequality $|p - q| < r/2$. By Problem 9.10 there is a rational number $\rho$ with $|p - q| < \rho < r/2$. Then $p \in B(q, \rho)$. Choose $x \in B(q, \rho)$. Then $|p - x| \le |p - q| + |q - x| < r$ so $x \in B(p, r) \subset U$. This shows that $B(q, \rho) \subset U$ as required.
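The construction in this solution is explicit enough to carry out with exact rational arithmetic. The sketch below is an illustration under stated assumptions rather than the solution itself: the function name `rational_ball` is hypothetical, the point $p$ and the radius $r$ are passed as `Fraction` values, and instead of invoking Problem 9.10 it makes one concrete choice (rounding to a common denominator $D$ and taking the radius $3r/8$).

```python
from fractions import Fraction
import math


def rational_ball(p, r):
    """Return (q, rho) with p in B(q, rho) and B(q, rho) inside B(p, r).

    p is a list of Fractions, r a positive Fraction.
    """
    m = len(p)
    # Integer D with 1/D < r/(2m), so rounding to multiples of 1/D
    # moves each coordinate by at most 1/(2D) < r/(4m).
    D = math.floor(2 * m / r) + 1
    q = [Fraction(round(pi * D), D) for pi in p]
    # |p - q| <= sum_i |p_i - q_i| < r/4 < 3r/8 < r/2, so rho = 3r/8
    # satisfies |p - q| < rho < r/2 as in the solution.
    rho = Fraction(3, 8) * r
    return q, rho
```

For any $x \in B(q, \rho)$ the triangle inequality then gives $|p - x| < r/4 + 3r/8 < r$, which is the containment the solution asks for.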
15 Connected Sets
Definition 15.1. A set $S$ is disconnected iff there are disjoint open sets $U$ and $V$ such that $S \subset U \cup V$ and both $S \cap U$ and $S \cap V$ are nonempty. A set is connected iff it is not disconnected.
Theorem 15.2. A subset $S \subset \mathbb{R}$ of the real line is connected if and only if $S$ is an interval, i.e. $[a, b] \subset S$ whenever $a, b \in S$.
Proof. We prove "only if". Assume $S$ is not an interval, i.e. that there exist $a, b \in S$ with $[a, b] \not\subset S$. Then there is a $c \in [a, b]$ with $c \notin S$. Let $U = (-\infty, c)$ and $V = (c, \infty)$. The point $c$ lies in the open interval $(a, b)$ as $a, b \in S$ so $a \in U$ and $b \in V$. Hence both $S \cap U$ and $S \cap V$ are nonempty and clearly $S \subset U \cup V$ (as $c \notin S$). Hence the open sets $U$ and $V$ separate $S$ so $S$ is disconnected as required.
We prove "if". Assume that $S$ is disconnected, i.e. that there exist open sets $U, V \subset \mathbb{R}$ with $S \subset U \cup V$, $S \cap U \ne \emptyset$, $S \cap V \ne \emptyset$, and $U \cap V = \emptyset$. We must show that $S$ is not an interval. Choose $a \in S \cap U$ and $b \in S \cap V$. Then $a \ne b$ as $U \cap V = \emptyset$. Assume without loss of generality that $a < b$. (The case $b < a$ is the same.)
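The set $[a, b] \cap U$ is nonempty (it contains $a$) and bounded above ($b$ is an upper bound). Let $c = \sup([a, b] \cap U)$. Since $a \in U$ there is an $\varepsilon > 0$ with $(a - \varepsilon, a + \varepsilon) \subset U$. Making $\varepsilon$ smaller we also have $a + \varepsilon < b$. Therefore $[a, a + \varepsilon) \subset [a, b] \cap U$ so $a + \varepsilon = \sup [a, a + \varepsilon) \le \sup([a, b] \cap U) = c$. Since $b \in V$ there is an(other) $\varepsilon > 0$ with $(b - \varepsilon, b + \varepsilon) \subset V$. Making $\varepsilon$ smaller we also have $a < b - \varepsilon$. Therefore $(b - \varepsilon, b] \subset [a, b] \cap V$ so $(b - \varepsilon, b] \cap U = \emptyset$ so $b - \varepsilon$ is an upper bound for $[a, b] \cap U$, so $c \le b - \varepsilon$. We have proved that $a < c < b$. If $c \in U$ there is an $\varepsilon > 0$ with $a < c - \varepsilon < c < c + \varepsilon < b$ and $(c - \varepsilon, c + \varepsilon) \subset U$, contradicting the fact that $c$ is an upper bound of $[a, b] \cap U$. If $c \in V$ there is an $\varepsilon > 0$ with $a < c - \varepsilon < c < c + \varepsilon < b$ and $(c - \varepsilon, c + \varepsilon) \subset V$ so $c - \varepsilon$ is an upper bound for $[a, b] \cap U$, contradicting the fact that $c$ is the least upper bound of $[a, b] \cap U$. Hence $c \notin U \cup V$ so (as $S \subset U \cup V$) $c \notin S$. Thus $a < c < b$, $a \in S$, $b \in S$, $c \notin S$, so $S$ is not an interval.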
Problem 15.7. Prove Theorem 15.6. (This theorem is proved in Buck Theorem 18 page 96 and Theorem 25 page 114, but Buck assumes that the intervals are closed and bounded. This assumption can be removed.)
Theorem 15.8. Let $I \subset \mathbb{R}$ be an interval and $f : I \to \mathbb{R}$ be continuous. Then the set
\[ \operatorname{graph}(f) := \{(x, y) \in I \times \mathbb{R} : y = f(x)\} \]
is connected.
Proof. Define $F : I \to \mathbb{R}^2$ by $F(x) = (x, f(x))$ so that $F(I) = \operatorname{graph}(f)$. Clearly $f$ is continuous if and only if $F$ is continuous. We will assume that $I$ is an open interval; the case where $I$ contains one of its endpoints is similar. Assume that $F(I)$ is not connected. Then there are open sets $U, V \subset \mathbb{R}^2$ with $F(I) \subset U \cup V$, $U \cap V = \emptyset$, $F(I) \cap U \ne \emptyset$, $F(I) \cap V \ne \emptyset$. Then $F^{-1}(U), F^{-1}(V) \subset \mathbb{R}$ are open and nonempty, $I \subset F^{-1}(U) \cup F^{-1}(V)$, and $F^{-1}(U) \cap F^{-1}(V) = F^{-1}(U \cap V) = \emptyset$. This contradicts the fact that $I$ is an interval and therefore connected.
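Example 15.9. The converse is false. Consider the function $f : \mathbb{R} \to \mathbb{R}$ defined by
\[ f(x) = \begin{cases} \sin(1/x) & \text{if } x > 0, \\ 0 & \text{if } x \le 0. \end{cases} \]
This function is not continuous, as follows. Let $x_n = (2n\pi + \pi/2)^{-1}$. Then $f(x_n) = 1$, $\lim_{n\to\infty} x_n = 0$, but $\lim_{n\to\infty} f(x_n) = 1 \ne 0 = f(0)$. However, the graph of $f$ is connected. To see this suppose $U$ and $V$ are open subsets of $\mathbb{R}^2$ and $\operatorname{graph}(f) \subset U \cup V$ with $U \cap V = \emptyset$. Suppose that $(0, 0) \in U$. Then $(x, f(x)) \in U$ for $x \le 0$ as $f$ is continuous on $(-\infty, 0]$. Since $U$ is open it contains a ball about $(0, 0)$, and this ball contains the graph points $(1/(n\pi), 0)$ for large $n$; hence $(x, f(x)) \in U$ for $x > 0$ as $f$ is continuous on $(0, \infty)$. But then $\operatorname{graph}(f) \subset U$ so $\operatorname{graph}(f) \cap V = \emptyset$.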
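The discontinuity at $0$ is easy to see numerically. The following sketch (with hypothetical helper names `f` and `x_n`, and subject to ordinary floating point error) evaluates $f$ along the sequence $x_n = (2n\pi + \pi/2)^{-1}$, which tends to $0$ while $f(x_n)$ stays at $1$.

```python
import math


def f(x):
    """The function of Example 15.9: sin(1/x) for x > 0, and 0 for x <= 0."""
    return math.sin(1.0 / x) if x > 0 else 0.0


def x_n(n):
    """The n-th term of the sequence x_n = 1 / (2*n*pi + pi/2)."""
    return 1.0 / (2 * n * math.pi + math.pi / 2)
```

Evaluating `f(x_n(n))` for increasing `n` gives values within rounding error of 1, although `x_n(n)` approaches 0 and `f(0.0)` is 0.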
16 Compact Sets
Definition 16.1. An open cover of a set $S$ is a collection $(U_\alpha)_{\alpha \in A}$ of open sets such that $S \subset \bigcup_{\alpha \in A} U_\alpha$. The subset $S$ is compact iff every open cover $(U_\alpha)_{\alpha \in A}$ of $S$ has a finite subcover, i.e. there are indices $\alpha_1, \alpha_2, \ldots, \alpha_n \in A$ such that $S \subset U_{\alpha_1} \cup U_{\alpha_2} \cup \cdots \cup U_{\alpha_n}$.
Theorem 16.2 (Heine-Borel). The following are equivalent conditions on a set $S \subset \mathbb{R}^m$:
(1) For every sequence $(p_n)_n$ of points of $S$ there is a subsequence $(p_{n_k})_k$ which converges to a point $p \in S$.
(2) The set $S$ is closed and bounded.
(3) The set $S$ is compact.
Proof. (This is Theorem 9.2 on page 41 of Morgan. See also Buck Theorem 25 page 65.) The equivalence of (1) and (2) is Corollary 12.12.
We prove (3) $\Rightarrow$ (2). Assume that (2) is false. Then either $S$ is not closed or $S$ is not bounded. In the former case by Theorem 14.11 there is a convergent sequence of points $p_n \in S$ whose limit $p := \lim_{n\to\infty} p_n$ is not in $S$. This implies $S \subset \mathbb{R}^m \setminus \{p\} = \bigcup_{k=1}^\infty U_k$ where $U_k := \{q \in \mathbb{R}^m : |q - p| > 1/k\}$. The sets $U_k$ are open but we cannot have $S \subset U_1 \cup U_2 \cup \cdots \cup U_N$, as $U_1 \subset U_2 \subset \cdots \subset U_N$ and $p_n \notin U_N$ for all sufficiently large $n$. So we have an open cover of $S$ with no finite subcover, and $S$ is not compact. In the latter case the open balls $V_n := B(0, n) := \{q \in \mathbb{R}^m : |q| < n\}$ cover $S$ as $\mathbb{R}^m = \bigcup_n V_n$, but no finite subcollection covers $S$, so again $S$ is not compact.
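We prove (1) $\Rightarrow$ (3). Choose an open cover $(U_\alpha)_{\alpha \in A}$ of $S$. We first construct a countable subcover. Consider the set
\[ I := \{(q, \rho) \in \mathbb{Q}^m \times \mathbb{Q} : \rho > 0 \text{ and } B(q, \rho) \subset U_\alpha \text{ for some } \alpha \in A\} \]
of open balls with rational center and rational radius which are contained in some element of the open cover. For every point $p \in S$ there is an $\alpha \in A$ with $p \in U_\alpha$ (because the sets $U_\alpha$ cover $S$), and so by Problem 14.16 there is a point $q \in \mathbb{Q}^m$ and a positive rational number $\rho$ so that $p \in B(q, \rho) \subset U_\alpha$. In other words there is a point $(q, \rho) \in I$ with $p \in B(q, \rho)$. We have proved that
\[ S \subset \bigcup_{(q, \rho) \in I} B(q, \rho). \]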
Theorem 16.4. Assume that X is compact and f is continuous. Then f is
uniformly continuous.
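Define
\[ \delta := \tfrac{1}{2} \min\{\delta(p_1), \delta(p_2), \ldots, \delta(p_n)\}. \]
Choose $p, q \in X$. Assume $|q - p| < \delta$. By (5) we have that $p \in U_{p_k}$ for some $k$. Hence
\[ |p - p_k| < \delta(p_k)/2. \qquad (6) \]
But $\delta \le \delta(p_k)/2$ by its definition so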
Proof. See Buck Theorem 10 on page 90 or
Theorem 16.8. If $f : X \to \mathbb{R}$ is continuous and $X$ is compact, then $f$ assumes its maximum on $X$, i.e. there exists $p \in X$ such that $f(q) \le f(p)$ for all $q \in X$. Similarly for the minimum.
Proof. See Buck Theorem 11 page 91 or
Theorem 16.9. Let $f : X \to Y$ be bijective and continuous, and assume $X$ (and hence by Theorem 16.6 also $Y$) is compact. Then $f^{-1} : Y \to X$ is continuous.
Proof. Choose a convergent sequence $(q_n)_n$ in $Y$ and let
\[ q := \lim_{n\to\infty} q_n \]
the theorem will then follow by Theorem 13.2. By Bolzano-Weierstrass and Heine-Borel there is a convergent subsequence $(f^{-1}(q_{n_k}))_k$. Let
\[ p := \lim_{k\to\infty} f^{-1}(q_{n_k}) \]
denote the limit. Then $p' \notin U$ (else we would have $f^{-1}(q_{m_j}) \in U$ for sufficiently large $j$) so $p' \ne p$. But as before
\[ f(p') = f\Bigl(\lim_{j\to\infty} f^{-1}(q_{m_j})\Bigr) = \lim_{j\to\infty} f(f^{-1}(q_{m_j})) = \lim_{j\to\infty} q_{m_j} = q. \]
But now $f(p) = q = f(p')$ which contradicts the fact that $f$ is injective.
Theorem 16.10. Let $S \subset \mathbb{R}^n$ and let $f : S \to \mathbb{R}^m$ be uniformly continuous. Then the function $f$ can be continuously extended to the closure $\bar{S}$ of $S$, i.e. there is a continuous function $F : \bar{S} \to \mathbb{R}^m$ such that $F(p) = f(p)$ for $p \in S$.
17 Derivatives
Definition 17.1. The function $f : I \to \mathbb{R}$ is said to be differentiable at the point $x_0 \in I$ iff the limit
\[ f'(x_0) := \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0} \]
exists; we say that $f$ is differentiable on a set iff it is differentiable at each point $x_0$ in the set. The function $f'$ is called the derivative of $f$.
\[ f'(c) = \frac{f(b) - f(a)}{b - a}. \]
Corollary 17.5. Assume that $f$ is differentiable on $I$. Then the derivative $f'$ vanishes identically on $I$ if and only if $f$ is constant on $I$.
18 The Integral
18.1. A partition of the closed interval $[a, b]$ is an increasing finite sequence $P = (x_k)_{0 \le k \le n}$ with $x_0 = a$ and $x_n = b$. For any bounded function $f$ defined on $[a, b]$ and any partition $P = (x_k)_{0 \le k \le n}$ of $[a, b]$ define the upper sum $\overline{S}(f, P)$ and the lower sum $\underline{S}(f, P)$ by
\[ \overline{S}(f, P) := \sum_{k=1}^{n} \overline{y}_k (x_k - x_{k-1}), \qquad \underline{S}(f, P) := \sum_{k=1}^{n} \underline{y}_k (x_k - x_{k-1}), \]
\[ \underline{y}\,(b - a) \le \underline{S}(f, P) \le \overline{S}(f, P) \le \overline{y}\,(b - a) \]
where $\underline{y}$ is the infimum of $f(x)$ for $x \in [a, b]$ and $\overline{y}$ is the supremum. The middle inequality follows from the inequality $\underline{y}_k \le \overline{y}_k$, which in turn is an immediate consequence of the fact that the infimum of a bounded nonempty set is less than or equal to the supremum of that set. For the inequality on the left note that
\[ \underline{y}\,(x_k - x_{k-1}) \le \underline{y}_k (x_k - x_{k-1}) \]
since the infimum on the left is over a larger set. From the collapsing sum
Since $x_{k_{j-1}} < x_{k_{j-1}+1} < x_{k_{j-1}+2} < \cdots < x_{k_j}$ the partition $P$ determines a partition $P_j$ of the interval $[x_{k_{j-1}}, x_{k_j}]$. Applying Step 1 to this partition gives
To see this choose $\varepsilon > 0$. By Theorem 16.4 there is a $\delta > 0$ such that $|f(x) - f(x')| < \varepsilon/(b - a)$ whenever $|x - x'| < \delta$. If the partition $P$ has mesh less than $\delta$ it follows that $\overline{y}_k \le \underline{y}_k + \varepsilon/(b - a)$ for $k = 1, 2, \ldots, n$. The desired inequality follows by multiplying by $x_k - x_{k-1}$ and summing on $k$. (Use the collapsing sum from Step 1.)
Step 4. Given any two partitions of the interval [a, b] there is a third partition
which refines both of them. To construct this third partition we simply take
the union of the underlying sets of the two partitions and list the elements
of this union in increasing order.
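The common refinement of Step 4 and the behaviour of upper and lower sums under refinement can be tried on a small example. The sketch below is illustrative only: the names `refine` and `lower_upper_sums` are not from the notes, and the sums are computed only for an increasing function, where the infimum and supremum on each subinterval are attained at its endpoints.

```python
from fractions import Fraction


def refine(P, Q):
    """Common refinement: the points of both partitions in increasing order."""
    return sorted(set(P) | set(Q))


def lower_upper_sums(f, P):
    """Lower and upper sums over partition P for an INCREASING function f.

    On [x_{k-1}, x_k] an increasing f has infimum f(x_{k-1}) and
    supremum f(x_k), so no search for extreme values is needed.
    """
    lower = sum(f(P[k - 1]) * (P[k] - P[k - 1]) for k in range(1, len(P)))
    upper = sum(f(P[k]) * (P[k] - P[k - 1]) for k in range(1, len(P)))
    return lower, upper
```

With $f(x) = x^2$ on $[0, 1]$ and the partitions $(0, 1/2, 1)$ and $(0, 1/3, 1)$, the refinement is $(0, 1/3, 1/2, 1)$; passing to it raises the lower sum and lowers the upper sum.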
Step 6. $\underline{S}(f, P') \le \overline{S}(f, P'')$ for any two partitions $P'$ and $P''$ of the interval $[a, b]$. This is because by Step 4 the partitions $P'$ and $P''$ have a common refinement $P$, so by Step 2 we have
Step 7. $\sup_P \underline{S}(f, P) \le \inf_P \overline{S}(f, P)$. To see this take the supremum over $P'$ in Step 6 to get $\sup_P \underline{S}(f, P) \le \overline{S}(f, P'')$ for every $P''$. Then take the infimum over $P''$.
Step 8. $\inf_P \overline{S}(f, P) \le \sup_P \underline{S}(f, P)$. To see this choose $\varepsilon > 0$ and let $\delta > 0$ be given as in Step 3. There certainly are partitions with mesh less than $\delta$, for example the partition $P_n = (x_k)_k$ with $x_k = a + k(b - a)/n$ where $n$ is so large that $(b - a)/n < \delta$. From Step 3 we conclude that
\[ \underline{S}(f, P) \le S \le \overline{S}(f, P) \]
for any Riemann sum $S$ for the partition $P$. It follows from the proof of Theorem 18.2 that the definite integral is the limit of the Riemann sums $S$ as the mesh of the partition $P$ tends to zero, i.e. for every $\varepsilon > 0$ there exists a $\delta > 0$ such that $\bigl| S - \int_a^b f \bigr| < \varepsilon$ whenever the mesh of $P$ is less than $\delta$. In Math 221 the definite integral is usually defined as the limit of Riemann sums in this sense.
Theorem 18.4. The definite integral of a continuous function satisfies the following properties.
(1) (Normalization). $\int_a^b 1 = b - a$.
(2) (Linearity). $\int_a^b (f + g) = \int_a^b f + \int_a^b g$.
(3) (Linearity). $\int_a^b cf = c \int_a^b f$ for $c \in \mathbb{R}$.
(4) (Additivity). $\int_a^b f + \int_b^c f = \int_a^c f$.
(5) (Order). If $f(x) \le g(x)$ for all $x \in [a, b]$, then $\int_a^b f \le \int_a^b g$.
(6) (Triangle Inequality). $\bigl| \int_a^b f \bigr| \le \int_a^b |f|$.
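(2) $\sum_{k=1}^{n} (f(c_k) + g(c_k)) \Delta_k = \sum_{k=1}^{n} f(c_k) \Delta_k + \sum_{k=1}^{n} g(c_k) \Delta_k$
(3) $\sum_{k=1}^{n} c f(c_k) \Delta_k = c \sum_{k=1}^{n} f(c_k) \Delta_k$
(4) $\sum_{k=1}^{n} f(c_k) \Delta_k + \sum_{k=n+1}^{m} f(c_k) \Delta_k = \sum_{k=1}^{m} f(c_k) \Delta_k$
(5) $\sum_{k=1}^{n} f(c_k) \Delta_k \le \sum_{k=1}^{n} g(c_k) \Delta_k$ if $f(c_k) \le g(c_k)$
(6) $\bigl| \sum_{k=1}^{n} f(c_k) \Delta_k \bigr| \le \sum_{k=1}^{n} |f(c_k)| \Delta_k$
In these formulas $(x_k)_{k=0}^{n}$ is a partition of $[a, b] = [x_0, x_n]$ (in (4) it is extended to a partition $(x_k)_{k=0}^{m}$ of $[a, c]$), $c_k$ is in the $k$th interval (i.e. $x_{k-1} \le c_k \le x_k$), and we have used the abbreviation
\[ \Delta_k := x_k - x_{k-1}. \]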
Footnote 10. It is customary to define $\int_a^b f = -\int_b^a f$ if $b < a$. With this definition the additivity formula holds without the restriction that $a \le b \le c$.
These formulas and Remark 18.3 imply the theorem by taking the limit as
the mesh of the partition goes to zero.
Theorem 18.5 (Fundamental Theorem of Calculus). Assume that $I$ is an open interval, that $f : I \to \mathbb{R}$ is continuous, that $a \in I$, and that $F(x)$ is defined by
\[ F(x) := \int_a^x f. \]
Then
(I) $F$ is differentiable on $I$ and its derivative is $f$, and hence
(II) $\int_a^b f = F(b) - F(a)$ for $a, b \in I$ with $b \ge a$.
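Proof. We must show that for every $\varepsilon > 0$ there is a $\delta > 0$ such that
\[ -\varepsilon < \frac{F(x + h) - F(x)}{h} - f(x) < \varepsilon \]
whenever $0 < |h| < \delta$. We will only use the properties in Theorem 18.4 to prove this. Assume $h > 0$ (the case $h < 0$ is similar). Then
\[ F(x + h) - F(x) = \int_a^{x+h} f - \int_a^x f = \int_x^{x+h} f \]
by (4). Since $f$ is continuous at $x$ there is a $\delta > 0$ such that $|f(t) - f(x)| < \varepsilon/2$ whenever $|t - x| < \delta$. For $0 < h < \delta$ this gives
\[ \Bigl( f(x) - \frac{\varepsilon}{2} \Bigr) h \le \int_x^{x+h} f \le \Bigl( f(x) + \frac{\varepsilon}{2} \Bigr) h \]
by (1), (3), and (5). Dividing by $h$ and subtracting $f(x)$ gives
\[ -\varepsilon < \frac{1}{h} \int_x^{x+h} f - f(x) < \varepsilon \]
whenever $0 < h < \delta$, i.e.
\[ -\varepsilon < \frac{F(x + h) - F(x)}{h} - f(x) < \varepsilon \]
as required.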
Remark 18.6. Henceforth we use the more traditional notation $\int_a^b f(x)\,dx$ (rather than the notation $\int_a^b f$) for the integral. The reader is reminded that the variable $x$ in this expression is a dummy variable, i.e.
\[ \int_a^b f(x)\,dx = \int_a^b f(t)\,dt. \]
19 Taylor's Formula
Theorem 19.1 (Taylor's Formula, Lagrange Form). Let $I$ be an interval, $a \in I$, and $f : I \to \mathbb{R}^m$ be of class $C^{n+1}$. Then
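Corollary 19.2. If $|f^{(n+1)}(t)| \le M$ for $t \in I$ then
\[ |R_n(a, x)| \le \frac{M |x - a|^{n+1}}{(n + 1)!} \]
for $x \in I$.
Proof. Assume that $x > a$. (The case $x < a$ is similar.)
\[ |R_n(a, x)| = \left| \int_a^x \frac{f^{(n+1)}(t)}{n!} (x - t)^n \, dt \right| \le \int_a^x \left| \frac{f^{(n+1)}(t)}{n!} \right| (x - t)^n \, dt \le M \int_a^x \frac{(x - t)^n}{n!} \, dt = M \frac{(x - a)^{n+1}}{(n + 1)!}. \]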
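The estimate of Corollary 19.2 can be tested on a concrete function. The sketch below is an illustration with assumptions stated up front: it takes $f(x) = e^x$ and $a = 0$, uses $M = e^{|x|}$ as a bound for $|f^{(n+1)}(t)|$ for $t$ between $0$ and $x$, and the function names are hypothetical.

```python
import math


def taylor_poly_exp(x, n):
    """Degree-n Taylor polynomial of exp centered at a = 0."""
    return sum(x ** k / math.factorial(k) for k in range(n + 1))


def remainder_bound(x, n):
    """Bound M |x|^(n+1) / (n+1)! from Corollary 19.2 with M = e^|x|."""
    M = math.exp(abs(x))  # |f^(n+1)(t)| = e^t <= e^|x| for t between 0 and x
    return M * abs(x) ** (n + 1) / math.factorial(n + 1)
```

For instance, with $x = 1$ and $n = 5$ the actual error $|e - P_5(1)|$ is about $0.0016$, below the bound $e/720 \approx 0.0038$.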
Remark 19.3. In Math 222 it is shown that there is a number $c$ between $a$ and $x$ such that
\[ R_n(a, x) = \frac{f^{(n+1)}(c)}{(n + 1)!} (x - a)^{n+1}. \]
This form of the remainder has the advantage that it is easy to remember: the remainder is the next term in the series with $f^{(n+1)}(a)$ replaced by $f^{(n+1)}(c)$. However, this version of the theorem only holds when $f$ is real valued, i.e. when $m = 1$. For $m > 1$ there will be a different value of $c$ for each component of $f$.
20 Series
20.1. A sequence determines a series and a series determines a sequence. More precisely, a sequence $(a_k)_k$ determines a series whose partial sums are
\[ S_n = \sum_{k=1}^{n} a_k := a_1 + a_2 + \cdots + a_n, \]
and the terms of the series may be recovered from the sequence of partial sums via the formula
\[ a_n = S_n - S_{n-1}, \qquad a_1 = S_1. \]
Convergence of the series is synonymous with convergence of the sequence of partial sums:
\[ \sum_{k=1}^{\infty} a_k := \lim_{n\to\infty} \sum_{k=1}^{n} a_k. \]
Since the limit of a difference is the difference of the limits and the limit of a constant is the constant we have the useful formula
\[ \sum_{k=1}^{\infty} a_k - \sum_{k=1}^{n} a_k = \sum_{k=n+1}^{\infty} a_k. \]
Here the notation on the right has the obvious definition, namely
\[ \sum_{k=n+1}^{\infty} a_k := \lim_{m\to\infty} \sum_{k=n+1}^{m} a_k. \]
means that for every $M > 0$ there exists an integer $N > 0$ such that $\sum_{k=1}^{n} a_k > M$ for $n > N$. If $a_k \ge 0$ for all $k$ then the sequence of partial sums is monotonic increasing so by Theorem 12.7 either the limit $\sum_{k=1}^{\infty} a_k$ exists (i.e. the sequence of partial sums is bounded) or $\sum_{k=1}^{\infty} a_k = \infty$ (i.e. the sequence of partial sums is unbounded).
Example 20.3. The $n$th partial sum
\[ \sum_{k=0}^{n} x^k = 1 + x + x^2 + \cdots + x^n \]
Hence if $|x| < 1$ we have the formula
\[ \sum_{k=0}^{\infty} x^k = \frac{1}{1 - x} \]
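As a quick numerical illustration of the geometric series formula, the sketch below (the function name is hypothetical) compares partial sums with the standard closed form $(1 - x^{n+1})/(1 - x)$ and with the limit $1/(1 - x)$ for $|x| < 1$.

```python
def geometric_partial_sum(x, n):
    """Partial sum 1 + x + x**2 + ... + x**n of the geometric series."""
    return sum(x ** k for k in range(n + 1))
```

For $x = 1/2$ the partial sums approach $2$, and the gap to the limit after $n$ terms is $x^{n+1}/(1 - x)$.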
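Proof. (See Buck Theorem 2 page 230.) Let $S_n = \sum_{k=0}^{n} a_k$ be the $n$th partial sum so $a_n = S_n - S_{n-1}$. If the series converges then $\lim_{n\to\infty} S_n = \lim_{n\to\infty} S_{n-1}$ so $\lim_{n\to\infty} a_n = \lim_{n\to\infty} S_n - \lim_{n\to\infty} S_{n-1} = 0$.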
Theorem 20.6 (Cauchy Convergence Criterion for Series). A series $\sum_k a_k$ converges if and only if
\[ \lim_{m,n\to\infty} \sum_{k=m+1}^{n} a_k = 0, \]
i.e. for every $\varepsilon > 0$ there exists $N = N(\varepsilon)$ such that $\bigl| \sum_{k=m+1}^{n} a_k \bigr| < \varepsilon$ for $n > m > N$.
Definition 20.7. The series $\sum_k a_k$ is said to converge absolutely iff
\[ \sum_{k=1}^{\infty} |a_k| < \infty. \]
A series which converges but does not converge absolutely is said to converge conditionally.
Theorem 20.8. If a series converges absolutely, then it converges.
Proof. This is an immediate consequence of the inequality
\[ \left| \sum_{k=m+1}^{n} a_k \right| \le \sum_{k=m+1}^{n} |a_k| \]
and the sequence $(a_{2m+1})_m$ is bounded (as it converges to zero). Hence the sequence $(S_{2m+1})_m$ converges by Theorem 12.7. But the sequence $(S_{2m})_m$ converges to the same limit as $S_{2m+1} = S_{2m} + a_{2m+1}$ and we have assumed that $\lim_{n\to\infty} a_n = 0$. Hence the sequence $(S_n)_n$ converges as claimed.
Example 20.11. (Alternating harmonic series) We will see later (see Remark 24.3) that
\[ \ln 2 = -\sum_{k=1}^{\infty} \frac{(-1)^k}{k}. \]
The convergence is conditional: by Example 20.5 the series does not converge absolutely.
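The two claims about the alternating harmonic series can be observed numerically. In the sketch below the function names are hypothetical; the sign convention is $\sum_{k \ge 1} (-1)^k / k$, whose value is $-\ln 2$, and the comparison with `math.log(2)` is a floating point check, not a proof.

```python
import math


def alternating_partial_sum(n):
    """Partial sum of (-1)**k / k for k = 1, ..., n."""
    return sum((-1) ** k / k for k in range(1, n + 1))


def harmonic_partial_sum(n):
    """Partial sum of 1/k for k = 1, ..., n (the series of absolute values)."""
    return sum(1.0 / k for k in range(1, n + 1))
```

The alternating sums settle near $-\ln 2 \approx -0.6931$, while the sums of absolute values keep growing (slowly, roughly like $\ln n$), which is what conditional convergence means here.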
Theorem 20.12 (Comparison Test). If $0 \le a_k \le b_k$ for sufficiently large $k$ and the series $\sum_k b_k$ converges, then the series $\sum_k a_k$ does.
Proof. (Buck Theorem 5 page 231.) The partial sums satisfy the inequality
\[ 0 \le \sum_{k=m+1}^{n} a_k \le \sum_{k=m+1}^{n} b_k. \]
Because the series $\sum_k b_k$ is assumed to converge we have $\lim_{m,n\to\infty} \sum_{k=m+1}^{n} b_k = 0$. Hence $\lim_{m,n\to\infty} \sum_{k=m+1}^{n} a_k = 0$ so the series $\sum_k a_k$ converges by Theorem 12.16.
Theorem 20.13 (Integral Test). Assume that $a_k = f(k)$ where $f : [1, \infty) \to [0, \infty)$ is monotonic decreasing. Then the improper integral $\int_1^\infty f(x)\,dx$ converges if and only if the sum $\sum_{k=1}^{\infty} a_k$ does.
This inequality shows that the infinite sum is finite if and only if the integral is finite.
Theorem 20.14 (Root Test). Let
\[ R = \limsup_{k\to\infty} |a_k|^{1/k}. \]
Then the series $\sum_k a_k$ converges absolutely if $R < 1$ and diverges (does not converge) if $R > 1$.
Proof. (Buck Theorem 9 page 232.) Let $S_n = \{|a_k|^{1/k} : k = n, n + 1, \ldots\}$ and $s_n = \sup S_n$, so $S_{n+1} \subset S_n$ and hence $s_{n+1} \le s_n$. (We are using the convention that $\sup S = \infty$ if $S$ is not bounded above.) Either $s_n = \infty$ for all $n$ (this falls under the case $R > 1$) or else $(s_n)_n$ is a monotonic decreasing (i.e. non-increasing) sequence converging to $R$. If $R < 1$ then $R < r < 1$ where $r = (1 + R)/2$, so there is an $N$ such that $s_n < r$ for $n > N$ and hence $|a_k|^{1/k} \le s_n < r$ for $k \ge n > N$. From this we deduce that $|a_k| \le r^k$ and hence
\[ \sum_{k=N+1}^{\infty} |a_k| \le \sum_{k=N+1}^{\infty} r^k \]
so the series converges by the Comparison Test (Theorem 20.12) and the fact that the geometric series converges for $r < 1$. Conversely if $R > 1$ then $s_n \ge R > 1$ for all $n$, so for every $n$ there exists a $k \ge n$ with $|a_k| = \bigl(|a_k|^{1/k}\bigr)^k \ge 1$. This means that it is not the case that $\lim_{n\to\infty} a_n = 0$, so the series diverges by the $n$th term test (Theorem 20.4).
Remark 20.15. Theorem 20.14 gives no information when $R = 1$. If $a_k = 1/k$, then $R = 1$ but $\sum_k a_k = \infty$. If $a_k = 1/k^2$, then $R = 1$ and $\sum_k a_k < \infty$.
Remark 20.16. The proofs of the convergence tests tell us how to estimate the error, i.e. the difference between a partial sum and the infinite sum. For example, by the Integral Test, the series $\sum_k 1/k^2$ converges and
\[ \sum_{k=1}^{\infty} \frac{1}{k^2} - \sum_{k=1}^{n} \frac{1}{k^2} = \sum_{k=n+1}^{\infty} \frac{1}{k^2} \le \int_n^{\infty} \frac{dx}{x^2} = \frac{1}{n}. \]
Similarly if $R < 1$ in the Root Test and $R < r < 1$ there is an $N$ such that $|a_k| < r^k$ for $k > N$ and hence
\[ \left| \sum_{k=1}^{\infty} a_k - \sum_{k=1}^{n} a_k \right| \le \sum_{k=n+1}^{\infty} |a_k| \le \sum_{k=n+1}^{\infty} r^k = \frac{r^{n+1}}{1 - r} \le \frac{r^N}{1 - r} \]
for $n > N$.
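The first error estimate can be checked directly. The sketch below relies on one fact not stated in these notes, namely the known value $\sum_{k \ge 1} 1/k^2 = \pi^2/6$; the function name is hypothetical, and the comparison is a floating point illustration rather than a proof.

```python
import math


def tail_error(n):
    """Difference between pi**2 / 6 and the n-th partial sum of sum 1/k**2."""
    return math.pi ** 2 / 6 - sum(1.0 / k ** 2 for k in range(1, n + 1))
```

For $n = 10$ the error is about $0.095$, just under the bound $1/n = 0.1$ given by the Integral Test.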
Problem 20.17. Show that for $p > 1$ the series $\sum_{k=1}^{\infty} k^{-p}$ converges and that the estimate
\[ \sum_{k=1}^{\infty} k^{-p} - \sum_{k=1}^{n} k^{-p} \le \frac{n^{1-p}}{p - 1} \]
holds for the difference (error) between the $n$th partial sum and the limit.
Definition 20.18. A series $\sum_{k=1}^{\infty} b_k$ is said to be a rearrangement of the series $\sum_{k=1}^{\infty} a_k$ iff there is a permutation (see footnote 11) $\sigma : \mathbb{Z}^+ \to \mathbb{Z}^+$ such that $b_k = a_{\sigma(k)}$.
Proof. Part (1) is Theorem 13 page 239 of Buck. Part (2) is proved on pages 238-9 of Buck in the special case where $a_n = (-1)^n/n$ and $L = 10$; the general argument is much the same but uses Theorem 20.4.
21 Uniform Convergence
Definition 21.1. A sequence $(f_n)_n$ of functions with common domain $X$ is said to converge pointwise to the function $f : X \to \mathbb{R}$ iff
i.e. iff
\[ \forall \varepsilon > 0 \;\; \exists N \;\; \forall p \in X \;\; \forall n : \quad n > N \implies |f_n(p) - f(p)| < \varepsilon. \]
For a sequence $(u_k : X \to \mathbb{R}^m)_k$ of functions the series $\sum_k u_k$ of functions is said to converge pointwise or uniformly iff the sequence $f_n = \sum_{k=0}^{n} u_k$ of partial sums does.
Footnote 11. A permutation is a bijective map from a set to itself. This terminology is most often used for finite sets, but here it is used for an infinite set.
Example 21.2. Define $f_n : [0, 1] \to \mathbb{R}$ by $f_n(x) = x^n$. Then the sequence $(f_n)_n$ converges pointwise but not uniformly to the function
\[ f(x) = \begin{cases} 0 & \text{for } 0 \le x < 1, \\ 1 & \text{for } x = 1. \end{cases} \]
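One way to see that the convergence of $f_n(x) = x^n$ on $[0, 1]$ is not uniform is to exhibit, for each $n$, a point where $f_n$ is still far from the limit. The sketch below (with a hypothetical function name) uses the point $x = (1/2)^{1/n} \in [0, 1)$, where $f_n(x) = 1/2$ while the limit function is $0$.

```python
def half_level_point(n):
    """Point x in [0, 1) with x**n = 1/2, so |f_n(x) - f(x)| = 1/2 there."""
    return 0.5 ** (1.0 / n)
```

These points move toward $1$ as $n$ grows, so $\sup_x |f_n(x) - f(x)| \ge 1/2$ for every $n$, even though $x^n \to 0$ for each fixed $x < 1$.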
\[ |f(x) - f(x_0)| \le |f(x) - f_n(x)| + |f_n(x) - f_n(x_0)| + |f_n(x_0) - f(x_0)| < \varepsilon \]
whenever $|x - x_0| < \delta$.
Theorem 21.4 (Weierstrass Comparison Test). Assume that the functions $u_k : X \to \mathbb{R}^m$ satisfy $|u_k(p)| \le M_k$ where $\sum_k M_k < \infty$. Then the series $\sum_k u_k$ converges uniformly.
as follows. Since the series on the right converges we have that for every $\varepsilon > 0$ there is an $N = N(\varepsilon)$ such that the right hand side is $< \varepsilon$ if $n > N$ and hence the left hand side is $< \varepsilon$ for all $p$.
Remark 21.5. The proof doesn't require that the inequality $|u_k(p)| \le M_k$ hold for all $k$ but only for all sufficiently large $k$. This inequality is often expressed by saying that the series $\sum_k u_k$ is dominated by the series $\sum_k M_k$.
Theorem 21.6. Assume that the sequence $(f_n : [a, b] \to \mathbb{R})_n$ converges uniformly to a function $f$ and that each $f_n$ is continuous. Then the limit of the integrals is the integral of the limit, i.e.
\[ \lim_{n\to\infty} \int_a^b f_n(x)\,dx = \int_a^b f(x)\,dx. \]
Proof. Choose $\varepsilon > 0$. Then there is an $N$ such that $|f(x) - f_n(x)| < \varepsilon/(b - a)$ for all $x \in [a, b]$ and all $n > N$. For $n > N$ we have, by the various properties listed in Theorem 18.4, that
\[ \left| \int_a^b f(x)\,dx - \int_a^b f_n(x)\,dx \right| = \left| \int_a^b (f(x) - f_n(x))\,dx \right| \le \int_a^b |f(x) - f_n(x)|\,dx \le (b - a) \frac{\varepsilon}{b - a} = \varepsilon \]
as required.
Corollary 21.7. Let $(f_n)_n$ be a sequence of functions defined on an open interval $I$. Assume that
(i) Each $f_n$ is differentiable;
for $x \in I$.
Proof. Define $g(x) := \lim_{n\to\infty} f_n'(x)$. By Theorem 21.3 $g$ is continuous. By the Fundamental Theorem of Calculus part (II) we have
\[ f_n(x) - f_n(a) = \int_a^x f_n'(t)\,dt \]
22 Power Series - I
22.1. A series of the form $\sum_{k=0}^{\infty} c_k (x - a)^k$ is called a power series centered at $a$. The radius of convergence of the power series is the number $R$ defined by
\[ \frac{1}{R} := \limsup_{k\to\infty} |c_k|^{1/k}. \]
(If the lim sup is infinite, then $R := 0$ and if the lim sup is zero, then $R := \infty$.)
Problem 22.2. (A formula for the radius of convergence). Assume that the coefficients $c_k$ are nonzero. Show that
\[ \limsup_{k\to\infty} |c_k|^{1/k} = \lim_{k\to\infty} \frac{|c_{k+1}|}{|c_k|} \]
Example 22.4. If $a = 0$ and $c_k = 1$, then $R = 1$ and
\[ \frac{1}{1 - x} = \sum_{k=0}^{\infty} x^k \]
where the convergence is uniform on each interval $[-r, r]$ with $0 \le r < 1$. The convergence is not uniform on the interval $(r, 1)$ or on the interval $(-1, -r)$ and the series does not converge at $x = \pm 1$.
Example 22.5. If $a = 0$ and $c_k = 1/k!$, then $R = \infty$ and
\[ e^x = \sum_{k=0}^{\infty} \frac{x^k}{k!} \]
where the convergence is uniform on each interval $[-r, r]$ with $r < \infty$.
Remark 22.6. A power series never converges uniformly on an unbounded interval unless it is a polynomial. To see this we argue by contradiction. Let
\[ P_n(x) = \sum_{k=0}^{n} c_k (x - a)^k \]
denote the $n$th partial sum of the power series $\sum_k c_k (x - a)^k$ and assume that $(P_n)_n$ converges uniformly to a function $f$ on some interval $I$. Let $\varepsilon = 1$. Then there exists $N$ such that $|P_n(x) - f(x)| < 1$ whenever $n > N$ and $x \in I$. If the power series $\sum_k c_k (x - a)^k$ has infinitely many nonzero terms, there exist $m > n > N$ with $c_m \ne 0$ and $c_n \ne 0$ and hence $P_n - P_m$ is a nonconstant polynomial. But then $|P_n(x) - P_m(x)| \le |P_n(x) - f(x)| + |f(x) - P_m(x)| \le 2$ for all $x \in I$. Hence the interval $I$ must be bounded, as a nonconstant polynomial becomes infinite as $x \to \pm\infty$.
Theorem 22.7. Let $\sum_{k=0}^{\infty} c_k (x - a)^k$ be a power series and $R$ be its radius of convergence. Denote the sum by
\[ f(x) := \sum_{k=0}^{\infty} c_k (x - a)^k \]
Corollary 22.8 (Taylor Series). Continue the hypotheses of Theorem 22.7. Then $f$ is infinitely differentiable (see footnote 12) on the interval $(a - R, a + R)$, the $n$th derivative $f^{(n)}$ of $f$ is
\[ f^{(n)}(x) = \sum_{k=n}^{\infty} k(k - 1) \cdots (k - n + 1)\, c_k (x - a)^{k-n} = \sum_{k=n}^{\infty} \frac{k!\, c_k}{(k - n)!} (x - a)^{k-n}, \]
Corollary 22.9. The $n$th derivative of the sum of the geometric series is
\[ \sum_{k=n}^{\infty} \frac{k!}{(k - n)!} x^{k-n} = \frac{n!}{(1 - x)^{n+1}} \]
23 Analytic Functions
Definition 23.1. A function $f$ is called analytic iff for every $a$ in its domain there is a power series with
\[ f(x) = \sum_{k=0}^{\infty} c_k (x - a)^k \]
for all $x$ in some interval about $a$. From Corollary 22.8 it follows that an analytic function is infinitely differentiable and that it equals its Taylor series at each point $a$ in its domain, i.e. $c_k = f^{(k)}(a)/k!$ and hence
\[ f(x) = \sum_{k=0}^{\infty} \frac{f^{(k)}(a)}{k!} (x - a)^k. \]
is infinitely differentiable but not analytic. (All its derivatives vanish at zero so it cannot equal its Taylor series.) Hint: Show inductively that the $k$th derivative of $f$ has form
\[ f^{(k)}(x) = \frac{P(x)}{Q(x)} e^{-1/x} \]
for $x > 0$ where $P(x)$ and $Q(x)$ are polynomials. Then use the definition of the derivative to show that $f^{(k+1)}(0) = 0$. Then
Problem 23.3. Give two proofs that the function $f(x) = x^{-1}$ defined for $x > 0$ is analytic. First: Write $f(x) = P_n(x, a) + R_n(x, a)$ where
\[ P_n(x, a) := \sum_{k=0}^{n} \frac{f^{(k)}(a)(x - a)^k}{k!} \]
and show that $\lim_{n\to\infty} R_n(x, a) = 0$ for $x$ sufficiently near $a$. Second: Use the identity
\[ \frac{1}{x} = \frac{1}{a(1 + y)} \]
where $y = (x - a)/a$ and the formula for the sum of a geometric series.
Proof. This is normally proved in a course on complex variables (Math 623 at UW). Here is a proof that doesn't use complex numbers. Choose $b \in I$ and write Taylor's Formula (Theorem 19.1) centered at $b$:
\[ f(x) = P_n(b, x) + R_n(b, x), \]
\[ P_n(b, x) := \sum_{k=0}^{n} \frac{f^{(k)}(b)}{k!} (x - b)^k, \qquad R_n(b, x) := \int_b^x \frac{f^{(n+1)}(t)}{n!} (x - t)^n \, dt. \]
We must show that $\lim_{n\to\infty} R_n(b, x) = 0$ for $x$ in an open interval about $b$. Let $R$ be the radius of convergence of $(*)$ so that
\[ R^{-1} = \limsup_{k\to\infty} |c_k|^{1/k}, \qquad c_k := \frac{f^{(k)}(a)}{k!}. \]
By Theorem 22.3 the series $(*)$ diverges for $|x - a| > R$ so we may as well assume that $I = (a - R, a + R)$. Since $b \in I$ we have $|b - a| < R$. Let $r = (R + |b - a|)/2$ and $\rho = (R + r)/2$ so that $|b - a| < r < \rho < R$. As in the proof of that theorem we have the inequality
\[ |c_k|\,|x - a|^k \le \Bigl( \frac{r}{\rho} \Bigr)^k \]
for sufficiently large $k$ and $x \in (a - r, a + r)$. Since this last inequality holds for sufficiently large $k$ there is an $M$ such that
\[ |c_k|\,|x - a|^k \le M \Bigl( \frac{r}{\rho} \Bigr)^k \]
for all $k$ and all $x \in (a - r, a + r)$. Take the absolute value of the series for the $n$th derivative:
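\[ |f^{(n)}(x)| = \left| \sum_{k=n}^{\infty} \frac{k!\, c_k}{(k - n)!} (x - a)^{k-n} \right| \le \sum_{k=n}^{\infty} \left| \frac{k!\, c_k}{(k - n)!} (x - a)^{k-n} \right| \le \frac{M}{\rho^n} \sum_{k=n}^{\infty} \frac{k!}{(k - n)!} \Bigl( \frac{r}{\rho} \Bigr)^{k-n} = \frac{M\, n!}{\rho^n} \Bigl( 1 - \frac{r}{\rho} \Bigr)^{-n-1} \]
by Corollary 22.9. (Letting $|x - a| \to r$ in the previous inequality gives $|c_k| \le M \rho^{-k}$, so $|c_k|\,|x - a|^{k-n} \le M \rho^{-n} (r/\rho)^{k-n}$.) Now choose $\delta > 0$ so small that $(b - \delta, b + \delta) \subset (a - r, a + r)$ and
\[ \lambda := \frac{\delta}{\rho} \Bigl( 1 - \frac{r}{\rho} \Bigr)^{-1} < 1. \]
Then for $x \in (b - \delta, b + \delta)$ and $t$ between $b$ and $x$ we have $|x - t| < \delta$ so
\[ |R_n(b, x)| = \left| \int_b^x \frac{f^{(n+1)}(t)}{n!} (x - t)^n \, dt \right| \le |b - x| \, \frac{\delta^n}{n!} \, \frac{M (n + 1)!}{\rho^{n+1}} \Bigl( 1 - \frac{r}{\rho} \Bigr)^{-n-2} \le M (n + 1) \Bigl( 1 - \frac{r}{\rho} \Bigr)^{-1} \lambda^{n+1}. \]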
24 Power Series - II
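Theorem 24.1. Assume that the power series
\[ f(x) = \sum_{k=0}^{\infty} c_k (x - a)^k \]
Define $B_n := \sum_{k=n}^{\infty} c_k$. Then $c_k = B_k - B_{k+1}$. Since the series $f(1)$ converges, it follows that $\lim_{n\to\infty} B_n = 0$. Choose $\varepsilon > 0$. There exists $N$ such that $|B_n| < \varepsilon/2$ for $n > N$. For $x \in [0, 1)$ we have
\[ \sum_{k=n}^{\infty} c_k x^k = \sum_{k=n}^{\infty} (B_k - B_{k+1}) x^k = B_n x^n + \sum_{k=n}^{\infty} B_{k+1} (x^{k+1} - x^k) = B_n x^n + (x - 1) \sum_{k=n}^{\infty} B_{k+1} x^k \]
so for $n > N$
\[ \left| \sum_{k=n}^{\infty} c_k x^k \right| \le |B_n|\, x^n + (1 - x) \sum_{k=n}^{\infty} |B_{k+1}|\, x^k \le \frac{\varepsilon x^n}{2} + \frac{\varepsilon (1 - x) x^n}{2} \sum_{j=0}^{\infty} x^j = \frac{\varepsilon x^n}{2} + \frac{\varepsilon (1 - x) x^n}{2 (1 - x)} = \varepsilon x^n \le \varepsilon. \]
When $x = 1$ this inequality holds because $|B_n| < \varepsilon/2 < \varepsilon$.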
Corollary 24.2. Suppose the power series
\[ f(x) = \sum_{k=0}^{\infty} c_k (x - a)^k \]
converges pointwise on the interval $[a, a + R]$. Then the function $f$ is continuous on the interval $[a, a + R]$. Similarly for the closed interval $[a - R, a]$.
Proof. By Theorem 24.1 the series converges uniformly so the function $f$ is continuous by Theorem 21.3.
Remark 24.3. The formula
\[ \frac{1}{1 - t} = \sum_{k=0}^{\infty} t^k \]
which holds for $-1 < x < 1$. Let $f(x)$ denote the right hand side. When $x = -1$ the right hand side is the Alternating Harmonic Series from Example 20.11, and it converges by Theorem 20.10. Hence $f$ is continuous on $[-1, 0)$ by Theorem 24.1. But $\ln(1 - x)$ is also continuous on $[-1, 0)$. Since $f(x) = \ln(1 - x)$ on $(-1, 1)$ this must remain true on $[-1, 1)$ so taking $x = -1$ gives
\[ \ln(2) = -\sum_{k=1}^{\infty} \frac{(-1)^k}{k}. \]
in contrast to
\[ \lim_{n\to\infty} \lim_{x\to 1} x^n = 1 \ne 0 = \lim_{x\to 1} \lim_{n\to\infty} x^n. \]
\[ |b_n| \le C n^{-p} \]
for $n > N$.
converges uniformly.
(3) Show that, if $p > 2$, then $f$ is differentiable and that
\[ f'(x) := \sum_{n=1}^{\infty} n b_n \cos nx. \]
Hint: See Problem 20.17 above. You may use any of the theorems stated
above but state which theorems you are using and verify that the hypotheses
of the theorems are satisfied.
Problem 25.2. Continue the notation of Problem 25.1. Show that the series
\[ u(t, x) := \sum_{n=1}^{\infty} e^{-n^2 t}\, b_n \sin nx \qquad (25.2\text{-}1) \]
converges uniformly on $[0, \infty) \times [0, \pi]$ if $p > 1$ and that the limit satisfies the partial differential equation (see footnote 13)
\[ \frac{\partial u}{\partial t} = \frac{\partial^2 u}{\partial x^2} \qquad (25.2\text{-}2) \]
on the open set $(0, \infty) \times (0, \pi)$. Show also that $u$ is continuous on the closed set $[0, \infty) \times [0, \pi]$, that it satisfies the initial condition
(You may use any of the theorems stated above but state which theorems you are using and verify that the hypotheses of the theorems are satisfied. Hint: $e^{-n^2 t}$ is very small if $t > 0$ and $n$ large.) It looks like this exercise proves that the solution of the partial differential equation (25.2-2) subject to the initial condition (25.2-3) and the boundary condition (25.2-4) is given by (25.2-1) where the coefficients are defined by (25.1-2). Is there anything missing for a rigorous proof?
Footnote 13. This PDE is called the Heat Equation.
26 Fourier Series - I
27 Fourier Series - II
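Theorem 27.1. Assume that the function $f : \mathbb{R} \to \mathbb{C}$ is $2\pi$ periodic and Lipschitz. Then the Fourier series for $f$ converges uniformly to $f$, i.e.
\[ f(x) = \lim_{n\to\infty} \sum_{k=-n}^{n} c_k e^{ikx}, \qquad c_k := \frac{1}{2\pi} \int_{-\pi}^{\pi} f(x) e^{-ikx} \, dx \]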
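\[ R \times R \to R : (x, y) \mapsto x + y, \qquad R \times R \to R : (x, y) \mapsto x \cdot y, \]
\[ R \to R : a \mapsto -a, \qquad R \setminus \{0\} \to R \setminus \{0\} : a \mapsto a^{-1}, \]
              Addition                         Multiplication
commutative   $a + b = b + a$                  $a \cdot b = b \cdot a$
associative   $(a + b) + c = a + (b + c)$      $(a \cdot b) \cdot c = a \cdot (b \cdot c)$
identity      $a + 0 = a$                      $a \cdot 1 = a$
inverse       $a + (-a) = 0$                   $a \cdot a^{-1} = 1$
\[ (a + b) \cdot c = (a \cdot c) + (b \cdot c). \]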
The operations of subtraction and division are then defined by
\[ a - b := a + (-b), \qquad \frac{a}{b} := a \cdot b^{-1} = a \cdot \frac{1}{b}. \]
The standard abbreviations $ab := a \cdot b$, $a - b := a + (-b)$, and $a/b = a \cdot b^{-1}$ are used. The above axioms are those which appear in 9.1.
A.2. The rational numbers $\mathbb{Q}$, the real numbers $\mathbb{R}$, and the complex numbers $\mathbb{C}$ are the most important examples of fields, but there are many others, e.g. the field $R$ of rational functions from problem 9.9 and the field $\mathbb{Q}(\sqrt{2})$ from problem 9.11. In Math 441 (or 541) you will even meet finite fields. (The simplest example of a finite field is the set $\{0, 1, 2, \ldots, p - 1\}$ where $p$ is a prime, and addition and multiplication are done modulo $p$.)
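The role of the prime $p$ in the finite field example can be seen by testing which residues have multiplicative inverses. The sketch below is an illustration with hypothetical function names; it relies on Python's built-in three-argument `pow` with exponent `-1` (available from Python 3.8), which raises `ValueError` when no inverse exists.

```python
def inverse_mod(a, p):
    """Multiplicative inverse of a modulo p (raises ValueError if none exists)."""
    return pow(a, -1, p)


def every_nonzero_residue_invertible(n):
    """True when each of 1, ..., n-1 has a multiplicative inverse modulo n."""
    for a in range(1, n):
        try:
            pow(a, -1, n)
        except ValueError:
            return False
    return True
```

Modulo 7 every nonzero residue is invertible, while modulo 6 the residue 2 is not, so the inverse axiom fails for arithmetic modulo 6.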
A.3. The axioms in the column headed Addition above are the axioms for an abelian group in additive notation, and the axioms in the column headed Multiplication above are the axioms for an abelian group in multiplicative notation. If you replace $0$ by $1$, $a + b$ by $a \cdot b$, and $-a$ by $a^{-1}$ in the former column you get the axioms in the latter column. Because both addition and multiplication satisfy the axioms for an abelian group there are further analogies.
      Addition                              Multiplication
(0)   $a + b = 0 \implies b = -a$           (0)   $a \cdot b = 1 \implies b = a^{-1}$
Line (0) explains the phrase "necessarily unique" used in 9.1. The last line explains why we invert and multiply to divide fractions.
A.4. Using the distributive law one can also prove the following familiar identities.
It follows from these laws that if a product is zero, then one of its factors must be zero. (Proof: if $ab = 0$ and $a \ne 0$ then $b = 1 \cdot b = (a^{-1} a) b = a^{-1} (ab) = a^{-1} \cdot 0 = 0$.)
Definition A.5. A homomorphism from a field $R$ to a field $R'$ is a map $\phi : R \to R'$ such that
\[ \phi(a + b) = \phi(a) + \phi(b), \qquad \phi(a \cdot b) = \phi(a) \cdot \phi(b) \]
for $a, b \in R$. (In each equation the operation on the left is the one for $R$ and the operation on the right is the one for $R'$.) It follows easily that $\phi(0) = 0$, $\phi(1) = 1$, $\phi(-a) = -\phi(a)$, and $\phi(a^{-1}) = \phi(a)^{-1}$. An isomorphism is a bijective homomorphism.
Remark A.6. The inverse of an isomorphism is an isomorphism. For example, if $a', b', c' := a' + b' \in R'$ and $a = \phi^{-1}(a')$, $b = \phi^{-1}(b')$, $c = \phi^{-1}(c')$, then $\phi(a + b) = a' + b'$ (as $\phi$ is a homomorphism) and $\phi(c) = c' = a' + b'$, so $\phi(c) = \phi(a + b)$ so $c = a + b$ (as $\phi$ is injective).
Proposition A.7. A homomorphism of fields is injective.
Proof. Assume that $\phi(a) = \phi(b)$; we must show that $a = b$. If $a \ne b$, then $c := a - b \ne 0$ so $1 = \phi(1) = \phi(c^{-1} c) = \phi(c^{-1}) \phi(c) = \phi(c^{-1}) \phi(a - b) = \phi(c^{-1}) (\phi(a) - \phi(b)) = \phi(c^{-1}) \cdot 0 = 0$, contradicting $1 \ne 0$.
Definition A.8. An ordered field is a field equipped with an order relation satisfying the conditions in 9.2. An order preserving homomorphism from an ordered field $R$ to an ordered field $R'$ is a homomorphism $\phi : R \to R'$ such that
\[ a < b \implies \phi(a) < \phi(b). \]
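Example A.9. The inclusions $\mathbb{Q} \subset \mathbb{Q}(\sqrt{2})$, $\mathbb{Q} \subset \mathbb{R}$, $\mathbb{Q}(\sqrt{2}) \subset \mathbb{R}$ are all order preserving homomorphisms. The map
\[ \mathbb{Q}(\sqrt{2}) \to \mathbb{Q}(\sqrt{2}) : a + b\sqrt{2} \mapsto a - b\sqrt{2} \]
is a field homomorphism but is not order preserving.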
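The conjugation map on $\mathbb{Q}(\sqrt{2})$ can be checked with exact arithmetic. The sketch below is an illustration only: it represents $a + b\sqrt{2}$ by the pair `(a, b)` of `Fraction` values, the helper names are hypothetical, and `value` uses floating point solely to read off signs.

```python
from fractions import Fraction
import math


def add(x, y):
    """(a + b*sqrt2) + (c + d*sqrt2) as a pair of rationals."""
    return (x[0] + y[0], x[1] + y[1])


def mul(x, y):
    """(a + b*sqrt2)(c + d*sqrt2) = (ac + 2bd) + (ad + bc)*sqrt2."""
    a, b = x
    c, d = y
    return (a * c + 2 * b * d, a * d + b * c)


def conj(x):
    """The map a + b*sqrt2 -> a - b*sqrt2."""
    return (x[0], -x[1])


def value(x):
    """Floating point value of a + b*sqrt2, used only to compare with 0."""
    return float(x[0]) + float(x[1]) * math.sqrt(2)
```

The map `conj` respects `add` and `mul`, but it sends $\sqrt{2} > 0$ to $-\sqrt{2} < 0$, so it cannot preserve order.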
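Lemma A.10. For any ordered field $R$, there is a unique order preserving homomorphism $\phi : \mathbb{Q} \to R$ from the set $\mathbb{Q}$ of rational numbers into $R$.
Proof. The homomorphism must send $0 \in \mathbb{Q}$ to $0 \in R$, $1 \in \mathbb{Q}$ to $1 \in R$, $n \in \mathbb{Z}^+$ to
\[ \phi(n) := \underbrace{\phi(1) + \phi(1) + \cdots + \phi(1)}_{n} \in R \]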
B Additional Problems
Problem B.1. Fix a positive number $a \in \mathbb{R}$. The purpose of this problem
is to define the exponential $a^x$ for $x \in \mathbb{R}$. Define $a^0 := 1$ and for $n$ a positive
integer define
$a^n := \underbrace{a \cdot a \cdots a}_{n}$, $a^{-n} := 1/a^n$.
Then
(1) Prove that for every nonzero integer $n$ there is a unique solution $b > 0$
to the equation $b^n = a$. Define $a^{1/n}$ to be this unique solution, i.e.
$a^{1/n} = b \iff b^n = a$.
(2) For a rational number $q$ define $a^q$ by $a^q = (a^m)^{1/n}$ where $q = m/n$. Prove
that this definition is independent of the choice of the integers $m$ and
$n$ such that $q = m/n$.
$\mathbb{R} \to (0, \infty) : x \mapsto a^x$
In your proof make clear which theorems from these notes you are appealing
to. Also make your proof self-contained so that a person who doesn't have
access to the statement of the problem can follow it. (You needn't provide
proofs for the theorems you use, but do provide references to them.) In your
proof of (3) you may use the inequality
$|a^p - a^q| \le M |p - q|$
You need not prove this inequality but use calculus to show where it comes
from. Hint: What is the definition of $\ln x$, $e^x$, and $a^x$ used in calculus?
Consider the Integral Test 20.13. The natural logarithm function $\ln x$ is
usually defined as an integral. How do you bound an integral by a sum? How
does the Mean Value Theorem from calculus (see 17.4) give inequalities like
this?
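Parts (1) and (2) of Problem B.1 can be explored numerically before attempting the proofs. The sketch below is not a proof and is not taken from these notes: the function names `nth_root` and `rational_power` are hypothetical, it handles only positive integers n, and it approximates the root by bisection, which mirrors the idea of trapping the solution of b^n = a between a lower and an upper bound.

```python
def nth_root(a: float, n: int, iterations: int = 200) -> float:
    """Approximate the unique b > 0 with b**n == a, for a > 0 and n >= 1."""
    if a <= 0 or n < 1:
        raise ValueError("this sketch assumes a > 0 and n >= 1")
    # lo**n < a <= hi**n holds at the start: 0 < a, and max(1, a)**n >= a.
    lo, hi = 0.0, max(1.0, a)
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if mid ** n < a:
            lo = mid  # the root lies in the upper half
        else:
            hi = mid  # the root lies in the lower half
    return (lo + hi) / 2


def rational_power(a: float, m: int, n: int) -> float:
    """Approximate a**(m/n) as (a**m)**(1/n), for a > 0, integer m, n >= 1."""
    return nth_root(float(a) ** m, n)


# Part (2) in miniature: 2/3 and 4/6 name the same rational number,
# so the two computations should agree up to rounding error.
difference = abs(rational_power(2.0, 2, 3) - rational_power(2.0, 4, 6))
```

Agreement of the two values up to rounding is only evidence for the independence claim in (2); the proof itself must appeal to the uniqueness statement in (1).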
Index
n-space, 9
nth Term Test, 52
n-tuple, 9
accumulation point, 24, 26
Algebraic Axioms, 18
Alternating harmonic series, 54
alternating series, 53
analytic, 61
Archimedean Property, 20
Axiom of Choice, 14
bijective, 14
Bolzano-Weierstrass, 30
boundary, 36
bounded, 24
bounded above, 19
bounded below, 19
cardinality, 16
Cartesian product, 9
Cauchy, 32
Cauchy Convergence Criterion, 32
Cauchy Convergence Criterion for Series, 52
closed, 35, 36
closure, 36
cluster point, 24
compact, 39
Comparison Test, 54
complement, 9
complete, 71
Completeness Axiom, 19
composition, 13
connected, 37
continuous, 32, 33
contrapositive, 7
converge, 26
converge absolutely, 53
converge conditionally, 53
converge pointwise, 56
converge uniformly, 56
convergent, 26
converges, 26
converse, 7
countable, 16
De Morgan's Laws, 10
decreasing, 28
Dedekind cuts, 20
definite integral, 44
denumerable, 16
derivative, 25, 43
differentiable, 43
direct product, 9
disconnected, 37
disjoint, 8
distance, 22
diverge, 26
domain, 11
dominated, 57
dot product, 23
dummy variable, 49
empty set, 8
equal, 8, 11
equality of sets, 11
extended real numbers, 25
exterior, 36
Fibonacci numbers, 15
field, 68
finite, 16
function, 11
Fundamental Theorem of Calculus, 48
geometric series, 51
graph, 11
greatest lower bound, 19
lower sum, 44
map, 11
mathematical induction, 15
Mean Value Theorem, 43
mesh, 44
monotonic, 28
relatively open, 35
Riemann sum, 46
right inverse, 13
Root Test, 54
Schwarz inequality, 23
sequence, 26
series, 50
source, 12
strictly, 28
subsequence, 29
subset, 8
supremum, 19
surjective, 14
target, 12
Taylor polynomial, 49
Taylor remainder, 49
Taylor Series, 61
Taylor's Formula - Lagrange Form, 49
terms, 50
trace, 31
truth tables, 10
uncountable, 16
uniformly continuous, 33
union, 8, 9
upper bound, 19
upper sum, 44
weakly monotonic, 28
Weierstrass Comparison Test, 57