Mathematical Analysis
Mathematical Analysis
1 Metric Spaces 1
1.1 Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Open Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.3 Closed Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Compact Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5 Connected Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.6 Continuous Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.7 Normed Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.8 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4 The Derivative 67
4.1 The Derivative of a Real Function . . . . . . . . . . . . . . . . . . . . 67
4.2 Mean Value Theorems . . . . . . . . . . . . . . . . . . . . . . . . . . 69
4.3 L’Hôpital’s Rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
4.4 Fréchet Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
i
ii CONTENTS
C Linear Algebra 89
Solutions 91
Bibliography 105
Preface
Gabriel Ribeiro
iii
CHAPTER
Metric Spaces
1
F RÉCHET introduced in his 1906’s work Sur Quelques Points Du Calcul
M AURICE
Fonctionnel the concept of a metric space, which is basically a set endowed with
a notion of distance between elements. As we’ll see, that notion is enough to study many
concepts in analysis such as sequences and continuity. Afterwards we’ll understand
normed spaces which, as the natural setting to series and derivatives, are metric spaces
whose set of points is a vector space.
• (Positivity) d(x, y) ≥ 0.
We call the pair (M, d) a metric space.
Notice that the first two axioms of a metric imply the other two. In fact, taking x = z in
the triangle inequality implies symmetry and taking y = z implies positivity. You should
convince yourself why this is true if it is not clear.
Whenever the context makes it clear, we’ll talk about the metric space M without
making reference to the metric. One should notice that every subset of a metric space is
a metric space in its own right, with the same metric restricted to the subset.
2 Chapter 1. Metric Spaces
Example 1.1 The prototypical example of metric spaces, from our standpoint, is the
euclidean space Rn . The metric in Rn is defined by
s
n
d(x, y) = ∑ (xi − yi )2 .
i=1
space is the discrete metric space. The discrete metric in any non-empty set is defined by
(
1 if x 6= y
d(x, y) = .
0 if x = y
Since we defined an open ball, we ought to check that this is always an open set.
Proof. Let a ∈ B(x, r). Then d(x, a) < r and consequently s = r − d(x, a) is a
positive number. We affirm that B(a, s) ⊂ B(x, r).
r
b s
x a
In fact, if b ∈ B(a, s), then d(a, b) < s and so d(x, b) ≤ d(x, a) + d(a, b) < d(x, a) +
s = r. This implies b ∈ B(x, r).
1.2 Open Sets 3
Theorem 1.2.2 Consider a family of open subsets of a metric space. Then the
following properties hold.
[
• Given a (not necessarily countable) family of indices Λ, Eλ is open.
λ ∈Λ
m
\
• For every positive integer m, Ei is open.
i=1
then x is an interior point of Eλ and, consequently, of E. This proves the first part of the
theorem.
Next, put F = m i=1 Ei . For any x ∈ F, there exist open balls B(x, ri ), such that B(x, ri ) ⊂
T
It should be noted that the intersection of a infinite number of open sets need not be
open. Can you think of an example?
Of all the open sets, the open ball is probably the most important since, in some
sense, it provides a "basis" for all the open sets as is shown in the next corollary.
Corollary 1.2.3 A subset E ⊂ M is open if, and only if, it is a union of open balls.
Proof. The fact that if E is a union of open balls, then E is open follows readily
from the preceding theorem. We shall then prove the converse.
If E is open, then for all x ∈ E we have that {x} ⊂ B(x, rx ) ⊂ E for some rx > 0. Taking
unions, we see that [ [
E= {x} ⊂ B(x, rx ) ⊂ E.
x∈E x∈E
It follows that [
E= B(x, rx ).
x∈E
We already know how open sets behave under unions and intersections. The last set
operation to be studied is the Cartesian product.
Corollary 1.2.5 Let Ei be open in (Mi , di ) for every i in {1, 2, . . . , n}. Then E1 ×
. . . × En is open as a subset of M1 × . . . × Mn endowed with the maximum metric.
Proof. Since we have a finite number of sets in the Cartesian product, we can
choose a smallest radius like we did before. The proof then follows similarly.
At first it seems like we proved a very particular case since the preceding corollary
is only valid for one metric in the product space. However, we will see soon that almost
every metric generates the same open sets, so that corollary is a pretty general one.
Definition 1.2.2 — Equivalence of Metrics. Let M be a set and d1 , d2 be metrics on
M. We say that d1 and d2 are equivalent (or d1 ∼ d2 ) if there exist positive constants
c,C ∈ R such that for all x, y ∈ M:
You should check that this is, in fact, a equivalence relation. The motivation for
defining this equivalence is the following theorem.
Theorem 1.2.6 Let (M, d1 ) and (M, d2 ) be metric spaces such that d1 ∼ d2 . A set
E ⊂ M is open in (M, d1 ) if, and only if it is open in (M, d2 ).
Proof. If E is open in (M, d1 ), then for all x ∈ E there is some r1 > 0 such that
{y ∈ M | d1 (x, y) < r1 } ⊂ E. Since d1 (x, y) ≤ C d2 (x, y), taking r2 = r1 /C then we have
that d2 (x, y) < r2 implies d1 (x, y) < r1 and hence E is open in (M, d2 ). The converse is
analogous.
To make Corollary 1.2.5 general we just need to prove that various metrics are
equivalent to the maximum metric. This will be done on the section of normed spaces.
Notice that in order for E to be closed it is not necessary that every point of E is a
limit point. A singleton is closed but it’s sole point is not a limit point.
1.3 Closed Sets 5
Theorem 1.3.1 A point x ∈ E is a limit point if and only if every open ball centered
at x contains infinitely many points from E.
Proof. Suppose x is a limit point of E and B(x, r) is a ball that contains finitely
many points of E. Since B(x, r) ∩ E is finite, we can list its elements as {p1 , p2 , . . . , pn }.
Let s be the minimum of all d(x, pi ) for i ∈ {1, 2, . . . , n}.
Then B(x, s) does not contain any points of E besides x, so that x is not a limit point of
E. Absurd!
The converse is trivial.
Based on the way we named some properties of subspaces, one can think that, in
some way, closed sets are the "opposite" of open sets. This is not true, since there exist
sets that are both open and closed. The empty set is a simple example. If that wasn’t
enough, there are sets that are neither closed nor open. The set (0, 1) considered as a
subset of the real plane is neither open nor closed. However, there is a relation between
these kinds of sets.
Theorem 1.3.2 A set E is open if, and only if its complement is closed.
Theorem 1.3.3 Consider a family of closed subsets of a metric space. Then the
following properties hold.
\
• Given a arbitrary family of indices Λ, Eλ is closed.
λ ∈Λ
m
[
• For every positive integer m, Ei is closed.
i=1
Proof. Taking complements of the equations in Theorem 1.2.2 and using De Mor-
gan’s relations the result is obvious.
Theorem 1.3.4 For any set E, its closure E is closed. Moreover, E is closed if, and
only if E = E.
c
Proof. Note that if x ∈ E , then x is neither a point of E not a limit point of E.
c
Hence there exists an open ball centered at x which does not intersect E so that E is
6 Chapter 1. Metric Spaces
Theorem 1.3.5 Let E be a nonempty set of real numbers which is bounded above.
Let y = sup E. Then y ∈ E. Hence y ∈ E if E is closed.
Corollary 1.3.6 Let E be a nonempty set of real numbers which is bounded below.
Let y = inf E. Then y ∈ E. Hence y ∈ E if E is closed.
E ⊂ Gλ1 ∪ . . . ∪ Gλn .
This property of a set is specially important because it allows the passage of some
local properties to global properties.
As was thought before, the property that every infinite subset has a limit point is
common to every set with this special "something". However, it is not the heart of the
matter.
1.4 Compact Sets 7
We’ll see now that, in fact compact sets are quite similar to closed bounded sets.
It should be noted that, in general, the converse is not true! The set of all integers is
closed and bounded when endowed with the discrete metric, however {{x} ⊂ Z | x ∈ Z}
is an open cover which has no finite subcover.
Theorem 1.4.4 Let Ω be a collection of closed subsets of a compact set E such that
the intersection of every finite subcollection of Ω is non-empty. Then the intersection
of all the elements of Ω is not empty.
Proof. Before we start, one should notice that the contrapositive of compactness is
"Given any collection Φ of open sets, if no finite subcollection of Φ covers E, then Φ
8 Chapter 1. Metric Spaces
It is a fact that the preceding corollary is valid for closed subsets of the real line.
Once we prove that the closed interval [a, b] is compact this becomes trivial. Since we
cannot do this as of yet, we’ll have to prove this result separately.
Proof. Suppose In = [an , bn ] and let E be the set of all an . Then E is not empty
and is bounded above by b1 . Let s be the supremum of E. Since s is the supremum of
E, it is clear that an ≤ s for all n. Since every bn is an upper bound of E, it follows that
s ∈ [an , bn ] = In for every n ∈ N. The result follows.
Although the following result is very important, we’ll call it a lemma since it will be
used to prove a much more general result.
Proof. Suppose there is an open cover {Gλ } of [a, b] which contains no finite
subcover. Let c = (a + b)/2 and consider the intervals [a, c] and [c, b]. It follows that at
least one of these intervals cannot be covered by a finite subcover of {Gλ }, otherwise
[a, b] itself would be covered. We then call this interval I1 , divide it in half and call
the piece that cannot be covered I2 . Continuing this process we get a sequence {In } of
subsets of [a, b] such that for all n ∈ N: In+1 ⊂ In , In is not covered by any finite subcover
of {Gλ } and diam In = 2−n (b − a).
I1 I3 I2
a x c b
1.4 Compact Sets 9
By the nested interval theorem, there is a number x ∈ [a, b] such that x ∈ In for all n.
Since {Gλ } covers [a, b], there is a set Gλ such that x ∈ Gλ . Since Gλ is open, there is a
neighborhood B(x, r) of x that is entirely contained in Gλ .
In Gλ
x
B(x, r)
If we take n large enough such that 2−n (b − a) < r (which is possible since R is
archimedean) then we have that In ⊂ B(x, r) ⊂ Gλ . This is absurd since no finite subset
of {Gλ } covers In . This establishes the proof.
In fact, a result much stronger than this is true: the arbitrary product of compact sets
is compact. That is called Tychonoff’s theorem and it is way out of our scope.
As a quick corollary we get our most powerful result yet: the Heine-Borel Theorem.
As we’ll see, unlike compactness, connectedness is a very intuitive notion for metric
spaces.
Definition 1.5.1 — Connected Set. Let E be a metric space. A separation of E is
a pair U, V of disjoint nonempty open subsets of E whose union is E. The set E is
said to be connected if there does not exist a separation of it.
In every metric space M, we have that M and the empty set are both open and
closed (independent of the metric chosen). Connected metric spaces have the remarkable
property that the only sets that are both open and closed are M and ∅.
Theorem 1.5.1 Let M be a connected metric space. Then the only sets that are both
open and closed are M and ∅.
Proof. Suppose A is a proper non-empty subset of M that is both open and closed.
Since A is closed, M \ A is open and since A is a proper subset of M, M \ A 6= ∅. Clearly
(M \ A) ∪ A = M and (M \ A) ∩ A = ∅, so M is disconnected. We conclude that the only
closed and open sets on a connected metric space are the empty set and itself.
We now prove that every interval of the real line and the real line itself are connected.
I used indices on the metrics to indicate the sets where they are defined.
Theorem 1.6.3 The image of a compact set under a continuous function is compact.
E ⊂ f −1 (Gλ1 ) ∪ . . . ∪ f −1 (Gλn ).
It’s now clear that the set {Gλ1 , . . . , Gλn } is a finite subcover of f (E).
for all x ∈ A.
Proof. Since A is compact and f is continuous, f (A) is compact, hence closed and
bounded. Theorem 2.3.9 then implies the result.
Last, but not least we have theorems that relates connected sets with continuous
functions.
Theorem 1.6.5 The image of a connected set under a continuous function is con-
nected.
Proof. Let E be a connected set in the domain of f . Suppose f (E) is not connected.
Then there are sets U,V that make a separation of f (E). Since f is continuous, f −1 (U)
and f −1 (V ) are open, hence so are E ∩ f −1 (U) and E ∩ f −1 (V ). A possible element
of f −1 (U) ∩ f −1 (V ) would its image would be a element of both U and V , which
is impossible. The sets f −1 (U) and f −1 (V ) are non-empty too since U and V are
non-empty. Lastly, its clear that E ⊂ f −1 (U) ∪ f −1 (V ) since if x ∈ E, then f (x) ∈ U
or f (x) ∈ V and that is the exact definition of x ∈ f −1 (U) ∪ f −1 (V ). This implies
E = (E ∩ f −1 (U)) ∪ (E ∩ f −1 (V )), hence E ∩ f −1 (U) and E ∩ f −1 (V ) constitute a
separation of E, which is absurd!
i) f is continuous;
ii) f is continuous at 0 ∈ E;
iii) The restriction f |N of f to N is bounded. That is, supx∈N k f (x)k < ∞.
Theorem 1.7.2 Let f : X → Y , where X, Y are normed vector spaces and X is finite-
dimensional, be a linear function. Then f is continuous.
Actually, we’ll prove a particular case of this result for now. We’ll assume that the
norm defined on X is the 1-norm k·k1 . (The metric without indices will be the one
defined on Y .)
Proof. Let e1 , . . . , en be a basis of X. Then, for x = x1 e1 + . . . + xn en ∈ X and
y = y1 e1 + . . . + yn en ∈ X we have:
n n
k f (x) − f (y)k = ∑ (xi − yi ) f (ei ) ≤ ∑ |xi − yi | k f (ei )k .
i=1 i=1
14 Chapter 1. Metric Spaces
Let ε > 0 be given and let M = max1≤i≤n k f (ei )k. If we define δ = ε/M, then for all x
and y with kx − yk1 < δ :
n n
k f (x) − f (y)k ≤ ∑ |xi − yi | k f (ei )k ≤ M ∑ |xi − yi | = M kx − yk1 < ε.
i=1 i=1
Hence, f is continuous.
Definition 1.7.2 Let X be a vector space and k · ka , k · kb be any norms. We say that
k · ka and k · kb are equivalent the metrics induced by them are equivalent. That is, if
there exist positive constants c, C such that for all x ∈ X:
We now prove that, in a finite dimensional vector space, the topology generated by
every norm is unique. For that we’ll need the following lemma.
Lemma 1.7.3 Let f : X → R, f (x) = kxk is continuous under the metric induced by
k·k1 on X if X is finite-dimensional.
|kxk − kyk| ≤ kx − yk
n
≤ ∑ |xi − yi | kei k
i=1
n
≤ M ∑ |xi − yi |
i=1
≤ M kx − yk1
< ε,
if kx − yk1 < δ .
Proof. Since norm equivalence is transitive, it is sufficient to show that every norm
is equivalent to some fixed norm. Let k · k1 = ∑ni=1 |xi | be that fixed norm.
If x = 0 the result is trivial, so let’s assume x 6= 0 and divide the inequality by kxk1 .
We only need to prove that
c ≤ kuk ≤ C
is true for all u ∈ X such that kuk1 = 1.
The unit sphere
S = {x ∈ X | kxk1 = 1}
1.8 Exercises 15
is closed and bounded, so it is compact. we have then that f (x) = kxk must attain it’s
bounds on S.
Let
c = min kuk and C = max kuk .
u∈S u∈S
Actually, it isn’t obvious that S is compact since Heine-Borel was only proved on
Rn . However, note that the function f : Rn → X, f (x1 , . . . , xn ) = x1 e1 + . . . + xn en is a
continuous (since it is linear) bijection. The set S0 = {x ∈ Rn | kxk1 = 1} is compact
(Heine-Borel) and S is the image of S0 by f . Since the continuous image of a compact set
is compact, the result follows.
Exercise 1.1 Use the equivalence of norms to prove Theorem 2.7.2 for any norm
defined on X.
1.8 Exercises
p If d is a metric, prove that so are ρ1 (x, y) = d(x, y)/(1 + d(x, y)) and
Exercise 1.2
ρ2 (x, y) = d(x, y).
d(x, B)
f (x) =
d(x, A) + d(x, B)
is continuous, has value 1 for all x ∈ A, has value 0 for all x ∈ B and satisfies
0 ≤ f (x) ≤ 1 for all x ∈ M.
As metric spaces behave much better than topological spaces, it is useful to know
when a topological space is also a metric space. A famous result, called Urysohn
Metrization Theorem states that every normal, second-countable topological space is
in fact a metric space. Urysohn’s lemma is used in the proof of this result.
Exercise 1.5 — Metric Spaces are Normal. Let A and B be non-empty disjoint
closed subsets of a metric space. Prove that there exist open sets E, F such that
A ⊂ E, B ⊂ F and E ∩ F = ∅.
Exercise 1.6 Check the proof of Theorem 2.3.9 again and then read the following
statement.
"Let E be a nonempty subset of R. If y = sup E is finite and δ > 0, then there exists
some x in E such that y − δ < x < y."
I affirm that it is wrong. Can you tell why?
Exercise 1.9 — Lebesgue Number. Let M be a compact metric space and let {Gλ }
be an open cover of M. Prove that there is a number δ > 0, called Lebesgue number
of M relative to {Gλ }, such that every subset of M with diameter less than δ is
contained in some element of {Gλ }.
Exercise 1.11 Prove that the finite union of compact sets and the arbitrary intersec-
tion of compact sets is compact.
The real projective space RPn is the set of all equivalence classes of this equivalence
relation.
Geometrically, two points in Rn+1 \ {0} are equivalent if they lie on the same line
through the origin, so RPn can be interpreted as the set of all lines through the origin
in Rn+1 . As each line crosses the sphere Sn = {x ∈ Rn+1 | kxk = 1} exactly twice,
this suggests that we define the following equivalence relation on Sn :
x ∼ y ⇐⇒ x = ±y.
Exercise 1.13 Let A and B be subsets of the real line and define the set
A + B = {a + b ∈ R | a ∈ A, b ∈ B}.
Exercise 1.15 — R \ Q is Uncountable. Prove that the set of all irrational numbers
R \ Q is uncountable.
It is curious that "almost" every complex number is transcendental and yet the only
transcendental numbers that most people recognize are π and e. Even these numbers
are hard to show that they are not algebraic!
Exercise 1.17 — Cantor Set. Let C0 be the closed interval [0, 1].
C0 :
0 1
C1 will be the set that results when the open middle third is removed. That is,
C1 = [0, 1/3] ∪ [2/3, 1].
C1 :
0 1/3 2/3 1
Now, construct C2 in a similar way, removing the open middle third of each compo-
nent of C1 . Then we have that C2 = [0, 1/9] ∪ [2/9, 1/3] ∪ [2/3, 7/9] ∪ [8/9, 1].
C2 :
0 1/9 2/9 1/3 2/3 7/9 8/9 1
Since C is an intersection of closed sets, C is closed and hence compact. Prove that
C does not contain intervals and does not contain isolated points. Lastly, show that
the Cantor set is uncountable.
Exercise 1.18 Let X ⊂ Rn . Show that there exists a countable set Y ⊂ Rn such that
Y = X.
Exercise 1.19 — Banach Fixed Point Theorem. Let X be a compact metric space
and φ : X → X be a continuous function such that d(φ (x), φ (y)) < d(x, y) for all
x 6= y. Then φ has a unique fixed point. That is, a point x0 such that φ (x0 ) = x0 .
p(t) = (1 − t)a + tb
Show that F is closed and non-empty. The infimum of this set is called the operator
norm k f kop of f . Prove that this is, in fact, a norm.
kx − yk ≥ r
for all y ∈ E.
CHAPTER
Sequences and Series
2
S WE ’ LL SEE , most of the important concepts in analysis are well described in
A terms of sequences. As before, whenever it is possible and enlightening, we’ll
study sequences in it’s more general setting possible. This will usually be metric or
normed spaces.
In the first chapter, we saw (quite informally) that a sequence is a function x : N → X,
where X is an arbitrary set. We denote the element x(n) as xn and write as (xn ) the
sequence itself.
If (xn ) is convergent, we way that (xn ) converges to x or that x is the limit of (xn )
and write this as x = lim xn or xn → x.
n→∞
The preceding definition can be restated in two very useful ways:
A priori, nothing prevents a sequence to converging to more than one point. The
following theorem shows that this is not the case.
22 Chapter 2. Sequences and Series
Proof. Suppose xn → a and xn → b. Then, for every ε > 0, there are integers n0
and m0 such that, if n > n0 we have that d(a, xn ) < ε/2 and if n > m0 we have that
d(b, xn ) < ε/2. Then, if n > max{n0 , m0 }, the triangular inequality implies
d(a, b) ≤ d(a, xn ) + d(b, xn ) < ε.
It follows that a = b.
Theorem 2.1.2 A point x ∈ M is a limit point of M if, and only if there is a sequence
(xn ) of points in M \ {x} such that xn → x.
We say that the set of all points xn is called the range of (xn ) and it is denoted {xn }.
When {xn } is bounded we’ll typically abuse language and say that the sequence itself
is bounded. You should notice the similarity between the range of a sequence and the
range of a function.
Theorem 2.1.3 If (xn ) converges, then {xn } is bounded. Moreover, the converse does
not hold.
Hence xn → x and yn → y.
Conversely, assume xn → x and yn → y. For every ε > 0 there are integers n1 and
n2 such that n > n1 implies d1 (xn , x) < ε and n > n2 implies d2 (yn , y) < ε. Taking
n0 = max{n1 , n2 } we have that n > n0 implies d(zn , z) < ε and hence zn → z.
Proof. Since all norms are equivalent on finite dimensional normed spaces, the
particular result of Theorem 3.1.4 becomes a general result in finite dimensional normed
spaces.
The two preceding results clearly generalize to products of any finite number of
spaces.
Besides understanding the topological aspects of sequences, we can also study some
algebraic properties.
Theorem 2.1.6 Let (xn ) and (yn ) be sequences defined on a normed vector space
X and let (pn ) be a sequence in X’s field. If xn → x, yn → y and pn → p, then the
following holds.
1 1
lim (xn + yn ) = x + y, lim (pn xn ) = px and lim = .
n→∞ n→∞ n→∞ pn p
Proof. For every ε > 0 there are integers n1 and n2 such that
kpn xn − pxk = kpn (xn − x) + (pn − p)xk ≤ |pn | kxn − xk + |pn − p| kxk .
Since xn → x and pn → p, for every ε > 0 there are integers n1 , n2 such that
n > n1 =⇒ kxn − xk < ε/2(max |pn |) and n > n2 =⇒ |pn − p| < ε/2 kxk .
Lastly, choosing n0 such that |pn − p| < |p|/2 if n > n0 , we see that∗
1
|pn | > |p| for all n > n0 .
2
Given ε > 0, there is an integer n1 such that n > n1 implies
1
|pn − p| < |p|2 ε.
2
Hence, for n > max{n0 , n1 },
1 1 pn − p 2
− = < 2 |pn − p| < ε.
pn p pn p |p|
Theorem 2.1.7 Let (pn ) be a sequence of real numbers such that pn ≥ 0 for all n > n0 ,
where n0 is a fixed integer. If (pn ) converges to p, then p ≥ 0.
Corollary 2.1.8 Let (pn ) and (qn ) be sequences of real numbers such that pn ≥ qn
for all n > n0 , where n0 is a fixed integer. If (pn ) converges to p and (qn ) converges
to q, then p ≥ q.
∗ Combining |pn − p| ≥ |p| − |pn | and 12 |p| > |pn − p| we get the desired result.
2.2 Subsequences and Sequential Compactness 25
Proof. I’ll suppose pn+1 ≥ pn since all the other cases follow readily or are
analogous to it. If (pn ) is bounded, then {pn } has a least-upper-bound. Let p = sup{pn }.
For all ε > 0 there is an integer n0 such that
p − ε < pn0 ≤ p.
Otherwise the leftmost inequality would imply that p − ε is an upper bound of {pn }
smaller than p. The rightmost inequality follows from the fact that p is an upper bound
of {pn }. Since (pn ) is monotonic, for all n > n0 the inequality pn ≥ pn0 holds. Hence,
for all n > n0
p − ε < pn ≤ p < p + ε.
It follows that (pn ) converges (to p). The converse was proved in Theorem 3.1.3.
Theorem 2.2.1 A sequence (xn ) converges to x if, and only if every subsequence
(xnk ) of (xn ) converges to x.
Proof. If there was a subsequence that didn’t converge to x, then there will be
a neighborhood of x that has infinite terms of the subsequence outside of it. So this
neighborhood has infinite terms of the sequence outside of it, and then the sequence does
not converge to x.
Since every sequence is a subsequence of itself, the other direction is trivial.
Some books say that a topological space is sequentially compact if every sequence
has a convergent subsequence. The following theorem shows that every compact metric
space is sequentially compact.
Theorem 2.2.2 Let (xn ) be a sequence in a compact metric space M. Then some
subsequence (xnk ) converges.
Proof. If {xn } is finite, there is a point x ∈ {xn } and a strictly increasing sequence
of positive integers (nk ) such that
Proof. Since {xn } is a closed and bounded subset of Rn , {xn } is compact. The
previous theorem implies the result.
Lemma 2.2.4 Let M be a sequentially compact metric space and let {G p } be an open
cover of M. Then, there exists a real number δ > 0 such that every neighborhood
with radius less than δ is contained in some element of {G p }.
Proof. Let Nn (x) be a neighborhood of x with radius 1/n. If the theorem is not true,
for every n ∈ N some neighborhood Nn (xn ) is not contained in any element of the cover.
Since M is sequentially compact, we can find a subsequence (xnk ) which converges to
x ∈ M.
Now, x is contained in some G p0 . Since each G p is open, there is an positive integer
m such that Nm (x) ⊂ G p0 . Since xnk → x, for all k > k0 d(x, xnk ) < 1/2m. Since nk ≥ k,
taking k > max{k0 , 2m} We have Nnk (xnk ) ⊂ Nm (x) ⊂ G p0 .
G p0
Nm (x)
m−1
xnk
x
Lemma 2.2.5 Let M be a sequentially compact metric space. For all ε > 0 there is a
finite open cover of M which consists only of neighborhoods with radii ε.
them the result is proved. Otherwise, note that d(xn , xm ) ≥ ε for every n 6= m. Hence
(xn ) has no convergent subsequence which is absurd!
Why does d(xn , xm ) ≥ ε imply (xn ) has no convergent subsequence? Try to figure this
out by yourself. In every case, this should be clear to you by the end of the next section.
Let (pn ) be a sequence of real numbers. If for all real numbers M there is an integer
n0 such that pn > M whenever n > n0 we say that pn → +∞. Analogously if pn < M
whenever n > n0 we say that pn → −∞.
We’re actually abusing a little of the notation here since the symbol → was used
before to denote converging sequences and the kinds of sequences we just defined clearly
diverge. This is in no way a change to the concept of convergence.
Definition 2.2.2 — Limit Superior/Inferior. Let (pn ) be a sequence of real numbers
and E ⊂ R be the set of all subsequential limits of (pn ). We define the limit superior
and the limit inferior of (pn ) to be:
Of course there are sequences (pn ) for which lim supn→∞ pn 6= lim infn→∞ pn . A sim-
ple example is the sequence pn = (−1)n . In this case we have that lim supn→∞ (−1)n = 1
and lim infn→∞ (−1)n = −1.
Another way to state Theorem 3.2.1 for sequences of real numbers is as follows:
(pn ) converges if, and only if
Exercise 2.1 Prove that the set E that was just defined is closed in R. That is, there
are subsequences (pnk ) and (pmk ) such that
We now prove an analogous result to Corollary 3.1.8 to the limit superior and inferior.
Theorem 2.2.7 Let (pn ) and (qn ) be sequences of real numbers such that pn ≥ qn for
all n > n0 where n0 is a fixed positive integer. Then
Proof. Assume the theorem is false. As you proved in Exercise 2.1 (if you didn’t it
may be helpful to check the solution in the end of the book), there is a subsequence (qnk )
such that
lim qnk = lim sup qn .
k→∞ n→∞
Hence,
lim sup pn < lim qnk .
n→∞ k→∞
Since lim supn→∞ pn is the biggest subsequential limit of (pn ), we have that:
Taking ε = 12 (limk→∞ qnk − limk→∞ pnk ) we see that there is an positive integer k0 for
which
pnk < qnk
whenever k > k0 . This contradiction estabilishes the theorem. The proof for the limit
inferior is analogous.
Theorem 2.2.8 Let (pn ) and (qn ) be sequences of real numbers. If 0 ≤ pn ≤ qn for
all n > n0 , where n0 is an positive integer, and qn → 0 then pn → 0.
Lemma 2.2.9 Let (xn ) be a sequence of real numbers. If a is a real number such that
a > lim supn→∞ xn , then there is an integer n0 such that xn < a whenever n > n0 .
Proof. If we had that xn ≥ a for infinitely many values of n, then the subsequence
(xnk ) of all xn such that xn ≥ a converges to a value x ≥ a > lim supn→∞ xn . This
contradicts the definition of the limit superior.
n k nk k
(1 + p)n > p ≥ kp .
k k
And then,
nq kk q−k
0< < n .
(1 + p)n pk
Since q − k < 0, the result follows from Example 3.1.
lim xn = 0
n→∞
Note that, although the limit point is explicitly involved in the definition of a conver-
gent sequence, it is not in the definition of a Cauchy sequence. As of now, convergent
sequences and Cauchy sequences are two completely different things, but it would be
wonderful if they shared properties. Fortunately, they do.
Proof. Let (xn ) be a sequence convergent to x. Then, for all ε > 0 there is an
integer n0 such that d(xn , x) < ε/2 whenever n > n0 . Hence,
whenever n, m > n0 .
As one can easily verify, the sequence (1/n) defined on (0, 1] is Cauchy but diverges.
However, there exist some important cases where the converse of the preceding theorem
holds.
Theorem 2.3.2 Let M be a compact metric space. Then every Cauchy sequence in
M converges.
Proof. Let (xn ) be a Cauchy sequence in M. For any ε > 0 let n0 be an positive
integer such that
ε
d(xn , xm ) < for all n, m > n0 .
2
Theorem 3.2.2 implies that there is a subsequence (xnk ) that converges to some x ∈ M.
That is, for every ε > 0 there is an positive integer k0 such that
ε
d(xnk , x) < whenever k > k0 .
2
Since nk ≥ k, d(xnk , x) < ε/2 whenever nk > k0 . Let j be an integer such that j > n0 ,
j > k0 and j = nk for some k. We have then that
is bounded, since its diameter is less than 1. It follows that the set
is bounded.
Proof. Let (xn ) be a Cauchy sequence in Rn . It follows from the previous lemma
that {xn } is a bounded closed subset of Rn . Heine-Borel implies compacticity and hence
this theorem follows from the previous one.
We noted before that the sequence (1/n) defined on (0, 1) is Cauchy but diverges.
Our intuition may tell that the problem here is that (0, 1) lacks the point to which (1/n)
"should" converge. The next theorem shows roughly that this is always the case.
Proof. Let x = limk→∞ xnk . We’ll prove that xn → x. In fact, let ε > 0 be an
arbitrary real number. Since (xnk ) converges to x, there exists a positive integer k0
such that d(xnk , x) < ε/2 whenever k > k0 . Similarly, since (xn ) is a Cauchy sequence,
there exists a positive integer m0 such that d(xn , xm ) < ε/2 whenever n, m > m0 . Let
n0 = {k0 , m0 }. Taking k big enough such that nk > n0 we have that
ε ε
d(xn , x) ≤ d(xn , xnk ) + d(xnk , x) < + = ε,
2 2
whenever n > n0 . The result follows.
Thus, Theorem 3.3.2 and Theorem 3.3.3 say that all compact metric spaces and all
Euclidean spaces are complete. It also implies that every closed subset E of a complete
metric space M is complete.† A example of a metric space which is not complete is the
space of all rational numbers, with d(x, y) = |x − y| as a metric.
Proof. For all n ∈ N pick a point xn ∈ Fn . The sequence (xn ) so defined has the
property that xn , xm ∈ Fn0 whenever n, m > n0 . Since diam Fn → 0, for all ε > 0 there is
a positive integer n0 such that diam Fn0 < ε and hence d(xn , xn ) < ε whenever n, m > n0 .
That is, (xn ) is a Cauchy sequence. Since M is complete, let x be the point to which (xn )
converges.
For any positive integer k we have that xn ∈ Fk whenever n ≥ k. That implies x ∈ Fk
for all k ∈ N. We conclude that F is not empty.
If F had more than one point, then it would follow that diam F > 0. For each n ∈ N,
F ⊂ Fn means that diam F ≤ diam Fn and hence diam Fn does not converge to 0.
Note the resemblance of Theorem 3.4.1 and Corollary 2.4.5. The former is about
complete spaces and the latter is about compact spaces. One may ask if the condition
diam Fn → 0 is really necessary. I affirm it is. Let Fn = [n, +∞) be closed subsets of
the real line (which is a complete metric space as we just proved). You may note that
T∞
n=1 Fn = ∅.
Theorem 2.4.2 — Baire’s Theorem. Let M be a complete metric space and {En } be
a collection of sets where each En is open and dense in M. Then the set
∞
\
E= En
n=1
is a dense subset of M.
Proof. Let N be any neighborhood. We’ll prove that E ∩ N is not empty. Let
N1 = N. Since E1 is open and dense, E1 ∩ N1 is open and not empty. Hence it contains a
neighborhood N2 which we can suppose so little that N2 ⊂ E1 ∩ N1 and diam N2 < 1/2.
Similarly, since E2 is open and dense there is a neighborhood N3 such that N3 ⊂ E2 ∩ N2
and diam N3 < 1/3. We then obtain a sequence of neighborhoods Nk such that Nk+1 ⊂ Nk ,
T∞
Nk+1 ⊂ Ek ∩ Nk and diam Nk → 0. Theorem 2.4.1 then implies k=1 Nk consists of a
† Every Cauchy sequence in E is a Cauchy sequence in M, hence it converges to some x ∈ M, and actually
x ∈ E since E is closed.
2.4 Complete Metric Spaces 33
We could have proved the following corollary before but, since we didn’t needed it
before and this proof is so simple, this is a better place for it.
Proof. If R was countable we could list its elements as {r1 , r2 , r3 , . . . }. Since each
set with only one element is closed, for each rn the set En = R \ {rn } is open and dense
in R. However note that
∞
\ ∞
\ ∞
[
En = R \ {rn } = R \ {rn } = R \ R = ∅.
n=1 n=1 n=1
In the first chapter we defined the real field as a ordered field with the least-upper-
bound property. However, what if such a specific field does not exists? It would be a
shame as I’ve spent months writing more than fifty pages of numerous results about the
real field that are now vacuously‡ true since there is no real field. Fortunately I now
possess the machinery to present an example of a ordered field with the least-upper-bound
property!
Definition 2.4.2 — Equivalent Sequences. Two sequences (xn ) and (yn ) defined
on a same metric space M are said to be equivalent if
lim d(xn , yn ) = 0.
n→∞
Theorem 2.4.4 Let Q be the set of all Cauchy sequences of rational numbers. The
set Q/ ∼ of all equivalence classes of such sequences is an ordered field with the
least-upper-bound property.
Denoting by x and y the arbitrary elements [(xn )] and [(yn )] of Q/ ∼ we’ll define the
sum of two elements of Q/ ∼ as x + y = x + y = [(xn + yn )] and the product as x · y =
xy = [(xn yn )]. The elements 0 = [(0)] and 1 = [(1)] are the additive and multiplicative
identities and the element −x = [(−xn )] is the additive inverse of x = [(xn )]. Last, but
not least, we say that x > y if x − y = [(xn − yn )] for some Cauchy sequences (xn ) and
(yn ) such that xn > yn for all n > n0 , where n0 is an positive integer.
‡ A logic statement is vacuously true if it asserts that all members of the empty set have a certain property.
It’s like a child saying to his or her parent: "I ate every vegetable in the plate!" when there was not a single
vegetable there to begin with.
34 Chapter 2. Sequences and Series
We can also embed Q into Q/ ∼ using the function x 7→ [(x)]. This function is
injective since [(x)] = [(y)] imply limn→∞ |x − y| = |x − y| = 0 and hence x = y.
Since the proof of this theorem is not really useful for the rest of the book but only
acts as a motivation for you to believe that this book is not entirely useless, you can work
it by yourself. (I’m not even sorry.)
and say that s is the sum of the series. It will be common to abuse language and
denote by ∑ xn the series itself. It is said that sn is a partial sum of the series.
It should be noticed that, while we use the term "sum" and the symbol ∑, a series is
no ordinary sum at all. A priori it is not even clear that series share properties with sums.
Usually it is too hard to calculate the limit of a series. Until we develop a sufficiently
robust machinery to deal with this task, we shall focus on convergence and divergence.
Example 2.5 — Geometric Series. Let x be any complex number such that |x| < 1.
Using induction it is simple to verify that
n
1 − xn+1
∑ xk = 1−x
k=0
and hence
∞
1
∑ xn = 1 − x .
n=0
The next few theorems are simple consequences of important facts about sequences.
Proof. This is merely a restatement of Theorem 2.3.1 applied to the sequence (sn )
of partial sums. The converse is basically the definition of a complete space.
The converse of this theorem does not hold even in complete spaces.
Example 2.6 — Harmonic Series. Take xn = 1/n. The corresponding series is said
to be the harmonic series. Clearly xn → 0. However, consider the sequence (yn ) such
that 2n
1
yn = ∑ .
k=n+1 k
Since yn+1 − yn = 1/(2k + 1) − 1/(2k + 2) > 0, the sequence (yn ) is strictly increasing
and hence yn > y1 = 1/2. It follows that the Cauchy criterion does not hold for xn = 1/n
and the harmonic series diverges.
Theorem 2.5.4 If ∑ xn and ∑ yn are converging series of real numbers such that
xn ≥ yn for all n ∈ N then
∞ ∞
∑ xn ≥ ∑ yn .
n=1 n=1
m m ∞
∑ xn ≤ ∑ kxn k ≤ ∑ kxn k .
n=1 n=1 n=1
for any sequence (xk ), any n ∈ N and any permutation σ . After all, we just rearranged
the elements of the sum. One would expect that series share this property. As we’ll see
soon, Bernhard Riemann showed that we couldn’t be more wrong!
Example 2.7 — Alternating Harmonic Series. For now, you’ll trust me that the
alternating harmonic series
∞
(−1)n+1
∑ n
n=1
converges to a value s 6= 0. Consider the following permutation in the set of all positive
integers:
4n/3
if n is divisible by 3
σ (n) = (2n + 1)/3 if n − 1 is divisible by 3 .
(4n − 2)/3 if n − 2 is divisible by 3
∞ ∞
1 ∞ (−1)n+1
1 1 1 s
∑ xσ (n) = ∑ − − = ∑ n = 2.
n=1 n=1 2n − 1 2(2n − 1) 4n 2 n=1
Hence, a rearrangement of the terms can modify the value to which a series converges.
Fortunately, rearranging series that converge absolutely does not change its value.
Proof. Let (s0n ) be the sequence of the partial sums of ∑ xσ (n) . Since ∑ xn is
absolutely convergent, given ε > 0 there is an integer n0 such that
m
∑ kxk k < ε
k=n0
If n > p, we have that {1, 2, . . . , n0 − 1} is a subset of {σ (1), σ (2), . . . , σ (n)}. Hence all
the xi for i = 1, 2, . . . , n0 − 1 are cancelled in sn − s0n . So,
m
sn − s0n ≤ ∑ kxk k < ε.
k=n0
The following theorem is Riemann’s way of saying: "Thou shall be careful when
dealing with such outlandish objects as series!"
−∞ ≤ α ≤ β ≤ +∞.
Then there exists a rearrangement ∑ xσ (n) with partial sums s0n such that
Proof. For all n ∈ N, define pn = (|xn | + xn )/2 and qn = (|xn | − xn )/2. If both ∑ pn
and ∑ qn were convergent, then so would be ∑ |xn | = ∑(pn + qn ). Since
n n n n
∑ xk = ∑ (pk − qk ) = ∑ pk − ∑ qk ,
k=1 k=1 k=1 k=1
P1 + · · · + Pm1 > β1 ,
P1 + · · · + Pm1 − Q1 − · · · − Qk1 < α1 .
The preceding proof and slightly generalization of the original result by Bernhard
Riemann appears to be due to W. Rudin[13].
Proof. Since xn ≥ 0 for all n ∈ N, (sn ) is increasing. Hence the Monotone Conver-
gence Theorem applies.
Theorem 2.7.2 — Comparison Test. Let ∑ xn and ∑ yn be series of real numbers such
that ∑ yn converges and |xn | ≤ yn for all n > n0 . Then ∑ xn is absolutely convergent,
and hence convergent (since R is Banach).
Proof. Since ∑ yn converges, for all ε > 0 there exists n1 ≥ n0 such that
m
∑ yk < ε,
k=n
2.7 Series of Non-Negative Real Numbers 39
Corollary 2.7.3 Let ∑ xn and ∑ yn be series of non-negative real numbers such that
xn ≥ yn for all n > n0 . If ∑ yn diverges, then so do ∑ xn .
Proof. If xn /yn → c, let ε = c/2. The limit then implies that for all n > n0 :
c 3c
yn < xn < yn .
2 2
Although the next example is wonderful, it assumes the knowledge of the fundamen-
tal theorem of arithmetic. So, don’t be afraid to skip it. The following proof is due to J.
A. Clarkson[3].
Example 2.8 — The Series of Primes Reciprocals. Let (pn ) be the sequence of all
the prime numbers. I affirm that the series of prime reciprocals,
∞
1
∑ pn
n=1
∞ k ∞
1 1 1 1
∑ −∑ = ∑ < .
n=1 pn n=1 pn n=k+1 pn 2
Let Q = p1 · · · pk , and consider the numbers 1 + nQ for n ∈ {1, . . . , r}. Since these num-
bers are not divisible by any of the p1 , . . . , pk , the prime factors of all these numbers are
members of a finite set {pk+1 , pk+2 , . . . , pm(r) }. If τ(n) is the number of (not necessarily
40 Chapter 2. Sequences and Series
The inequality
!t
r m(r)
1 1
∑ < ∑
n=1 1 + nQ n=k+1 pn
τ(n)=t
is justified by the fact that every term in the leftmost sum appears at least once in the
sum on the right. We conclude that ∑ 1/(1 + nQ) converges, since its sequence of partial
sums is bounded. However the limit comparison test says otherwise (when compared to
the harmonic series).
Leonhard Euler was the first one to prove this result and to notice that it implies
Euclid’s theorem on the infinitude of prime numbers.
converges.
Proof. Let
n
sn = ∑ xn , tk = ∑ 2i x2i .
i=1 i=0
For n < 2k ,
k
sn ≤ x1 + (x2 + x3 ) + · · · + (x2 + · · · + x2k+1 −1 )
≤ x1 + 2x2 + · · · + 2k x2k
= tk .
2.7 Series of Non-Negative Real Numbers 41
Example 2.9 The series ∑ 1/n p converges if, and only p > 1. In fact, if p ≤ 0 then
1/n p does not tend to 0 as n → ∞ and hence ∑ 1/n p diverges. For p > 0, the preceding
theorem says that ∑ 1/n p converges or diverges together with
∞ ∞
1
∑ 2k · 2kp = ∑ 2(1−p)k .
k=0 k=0
The following theorem shows that an even stronger result than Corollary 2.5.2 holds
for series of decreasing non-negative real numbers.
lim nxn = 0.
n→∞
Proof. Let sn be the partial sums of the series and define In = [s2n , s2n−1 ].
Note that s2n+2 − s2n = −x2n+2 + x2n+1 ≥ 0. Similarly, s2n+1 − s2n−1 ≤ 0 and hence
In+1 ⊂ In for all n ∈ N. Since xn → 0 it follows that diam In → 0 and then
∞
\
In = {s}.
n=1
Pick n0 such that diam In < ε for n > n0 . Then |sn − s| < ε and the result follows.
Exercise 2.3 Give another proof of the Leibniz Test using only the Cauchy criterion.
Note that while ∑ xn is a series in a normed vector space, ∑ kxn k is a series of real
numbers and hence the comparison test applies.
kxn+1 k
lim sup < 1,
n→∞ kxn k
p
Proof. Let k be a real number such that lim supn→∞ n kxn k < k < 1. Lemma 2.2.9
then implies that there is an integer n0 such that kxn+1 k < k kxn k whenever n > n0 .
Hence,
xn0 +p < k xn0 +p−1 < k2 xn0 +p−2 < · · · < k p−1 xn0 +1
and then
xn0 +1 n
kxn k < k
kn0 +1
whenever n > n0 . The result now follows from the comparison test.
Whenever the ratio test implies convergence, the root test does too but the converse
does not hold. However it is usually easier to apply the ratio test. It should be noted
that we only have two criteria for determining divergence. Namely Corollary 2.5.2 and
Theorem 2.7.6.
Definition 2.8.1 — Power Series. Given a sequence (an ) of complex numbers, the
series
∞
∑ an zn ,
n=0
defined for complex z, is said to be a power series. The numbers an are the coefficients
of the series.
The convergence of a power series depends on the value of z, so we’ll study which
values of z makes a given power series convergent or not. In general, there is a number
R ∈ R, called the radius of convergence such that ∑ an zn converges for |z| < R. This is
the soul of the following theorem.
1 p
= lim sup n |an |.
R n→∞
p
n
p
n |z|
lim sup |an zn | = |z| lim sup |an | = < 1.
n→∞ n→∞ R
In this proof we used the fact that C is complete, a result that we did not explicitly proved.
However, the only difference between C and R2 is the definition of an multiplication of
elements. As normed vector spaces they are exactly the same thing.
the ratio test implies convergence for every z ∈ C. (Except for z = 0, but in this case the
result is trivial.)
Hence,
1
0 < n!(e − sn ) < ,
n
for all n ∈ N. The number e is clearly a positive real number. If e was rational, then we
would have that e = m/n for some m, n ∈ N. Since ne is an integer, so is n!(e − sn ). That
is, we found an integer between 0 and 1. Absurd!
1 n n
1
xn = 1 + and yn = ∑ k!
n k=0
1 n n n
1 n−1 n
n 1 j 1
xn = 1 + =∑ k
= ∑ ∏ 1 − ≤ ∑ = yn .
n k=0 k n k=0 k! j=1 n k=0 k!
n m
1 n−1 1 m−1
j j
xn = ∑ ∏ 1 − ≥ ∑ ∏ 1 − .
k=0 k! j=1 n k=0 k! j=1 n
In this proof we assumed that the properties of Theorem 3.1.6 were also valid for the
limit inferior. However this is not true as suggested by Exercises 3.13 and 3.14. Show
that the proof of Theorem 3.8.5 is still valid provided that we switch the starred equality
sign by an ≥ sign.
We now define the trigonometric functions without any aid of geometrical interpreta-
tions.
Definition 2.8.3 — Trigonometric Functions. If z ∈ C, we define the trigonometric
functions sine and cosine as
∞ ∞
(−1)n 2n+1 (−1)n 2n
sin(z) = ∑ z , cos(z) = ∑ z .
n=0 (2n + 1)! n=0 (2n)!
sin(z)
tan(z) = .
cos(z)
In the present state we aren’t even able to prove that sin2 (z) + cos2 (z) = 1. However,
Euler’s Identity follows readily from the definition.
Theorem 2.8.6 — Euler’s Identity. For all t ∈ R, the following equation holds:
and call ∑∞
n=0 cn the Cauchy product of the two series.
This definition can be motivated by observing that when we multiply two polynomials
of the same degree and collect the terms which contain the same power of z we get the
sequence (cn ):
! !
n n 2n
∑ ak zk ∑ bk zk = ∑ ck zk ,
k=0 k=0 k=0
Proof.
2.10 Exercises
Exercise 2.4 Let (xn ) be an sequence of real numbers such that
lim (2xn+1 − xn ) = x.
n→∞
Prove that xn → x.
Exercise 2.5 Let (xn ) and (yn ) be sequences of real numbers. Assume that yn → 0
and that there is a real number k ∈ (0, 1) such that xn+1 ≤ kxn + yn for every n ∈ N.
Prove that xn → 0.
2.10 Exercises 47
√ xn + yn
xn+1 = xn yn , yn+1 = ,
2
for all n ∈ N. Prove that the two sequences are convergent and have the same limit.
Exercise 2.8 — Cesàro-Stolz Theorem. Let (xn ) and (yn ) be two sequences of real
numbers with (yn ) strictly positive, increasing and unbounded. If
xn+1 − xn
lim = L,
n→∞ yn+1 − xn
Exercise 2.9 — Cesàro Means. Let (xn ) be an convergent sequence of real numbers.
Prove that the sequence
x1 + x2 + · · · + xn
yn =
n
also converges to the same limit.
Exercise 2.10 Let (xn ) and (yn ) be Cauchy sequences in a metric space X. Show
that the sequence (d(xn , yn )) converges.
Exercise 2.11 Let (xn ) be a sequence of real numbers. Decide whether each of the
following propositions are true or false. In each case, prove it or show a counter-
example.
a) If every subsequence of (xn ) that is not itself converges, then (xn ) converges
as well.
b) If (xn ) contains a divergent subsequence, then (xn ) diverges.
48 Chapter 2. Sequences and Series
c) If (xn ) is bounded and diverges, then there exist two subsequences of (xn ) that
converge to different limits.
d) If (xn ) is monotone and contains a convergent subsequence, then (xn ) con-
verges.
where the rightmost equality is a common abuse of notation. Moreover, show that
this is also valid for the limit inferior.
Exercise 2.13 For any two real sequences (xn ) and (yn ), prove that
provided the sum on the right is not of the form ∞ − ∞. Moreover show that this is
also valid for the limit superior, provided that we switch the side of the inequality.
Exercise 2.14 For any two real sequences (xn ) and (yn ) such that xn and yn non-
negative, prove that
lim inf(xn yn ) ≥ lim inf xn lim inf yn ,
n→∞ n→∞ n→∞
provided the sum on the right is not of the form 0 · ∞. Moreover show that this is
also valid for the limit superior, provided that we switch the side of the inequality.
In particular this result implies that if limn→∞ xn+1 /xn = L, with 0 ≤ L ≤ ∞, then
√
limn→∞ n xn = L.
converge?
Exercise 2.17 Let S = {x1 , x2 , . . . } be the set of all integers that do not contain the
digit 9 in their decimal representation. Prove that
∞
1
∑ xn
n=1
converges.
Exercise 2.18 Find the radius of convergence of each of the following power series:
∞ ∞
2n n
a) ∑ n3 zn , c) ∑ 2z ,
n=0 n=1 n
∞ ∞
2n n3
b) ∑ n! zn , d) ∑ 3n zn .
n=0 n=0
Exercise 2.19 Let (xn ) be a sequence of non-negative real numbers such that ∑ xn
diverges. Consider the following series:
∞ ∞ ∞ ∞
xn xn xn xn
∑ 1 + xn , ∑ 1 + nxn , ∑ 1 + n2 xn , ∑ 1 + xn2 .
n=1 n=1 n=1 n=1
Exercise 2.20 Suppose that the coefficients of the power series ∑ an zn are integers,
infinitely many of which are distinct from zero. Prove that the radius of convergence
is at most 1.
Prove that the set of all x(a) is exactly the Cantor set described in Exercise 2.19.
Since there is an uncountable number of such sequences (Theorem 1.5.6), this gives
another proof of the fact that the Cantor set is uncountable.
50 Chapter 2. Sequences and Series
Exercise 2.22 Let (nk ) be a strictly increasing sequence of positive integers such
that
nk
lim = +∞.
k→∞ n1 n2 n3 · · · nk−1
Prove that the series ∑ 1/nk is convergent and its limit is an irrational number.
Exercise 2.23 — Summation by Parts. Let (xn ) and (yn ) be sequences in a normed
vector space X. Let sn = x1 + x2 + · · · + xn and set s0 = 0. Use the observation that
xn = sn − sn−1 to verify the formula
m m
∑ xi yi = sm ym+1 − sn−1 yn + ∑ si (yi − yi+1 ).
i=n i=n
Exercise 2.24 — Dirichlet’s Test. Let (xn ) and (yn ) be sequences of real numbers
such that ∑ xn has bounded partial sums, yn → 0 and (yn ) satisfies
y1 ≥ y2 ≥ y3 ≥ · · · ≥ 0.
converges. Use this result to give one more proof of the Leibniz test.
Exercise 2.25 — Abel’s Test. In the view of the preceding exercise, show that if
∑ xn converges, then we can drop the restriction yn → 0 while keeping the result.
Exercise 2.26 Let (xnm ) be a doubly indexed sequence of points in a normed space
X. Suppose that:
∞
• For each m ∈ N, the series ∑ xnm is convergent. Denote by ym its limit.
n=1
∞ ∞
• For each n ∈ N, the series ∑ ∑ xi j is convergent. Denote by tn its limit.
j=1 i=n
∞
Show that, for each n ∈ N, the series ∑ xnm is convergent. Denote by zn its limit.
m=1
∞ ∞
Moreover, prove that ∑ ym = ∑ zn if, and only if tn → 0.
m=1 n=1
2.10 Exercises 51
Exercise 2.27 Let (xnm ) be a doubly indexed sequence of real numbers such that
xnm ≥ 0 for all n, m ∈ N. Prove that
∞ ∞ ∞ ∞
∑∑ xnm = ∑ ∑ xnm ,
n=1 m=1 m=1 n=1
whenever one of the double sums converges. Moreover, if one diverges then so does
the other.
CHAPTER
Functional Limits
and Continuity
3
Since most of this book is about the things we can do to functions, its about time we
generalize the notion of a limit to a function. This will be the basis of our future studies.
As usual, we’ll stick to metric spaces whenever possible to clarify the concepts and
proofs.
lim f (x) = L
x→p
In this case we say that L is the limit of f (x) as x approaches p or, more concisely,
f (x) → L as x → p.
In this definition and in the entirety of this book I’ll be sloppy and use the same letter
d to denote two distinct metric functions defined on M and on N whenever the context
makes it clear. We should note that p need not be a point of M but only a point of M 0 .
Similarly, L need not be a point of f (M).
If f is a function from E ⊂ M to N, then the x which satisfies 0 < d(x, p) < δ has to be
an element of E. Otherwise f (x) would not be defined. The point p has also to be a limit
point of E.
54 Chapter 3. Functional Limits and Continuity
lim f (x) = L,
x→p
As suggested by the dangerous bend sign in the previous page, if f is a function from
E ⊂ M to N, then we have that V ∩ E has to be non-empty and f (x) ∈ U for all
x ∈ (V ∩ E) \ {p}.
It is remarkable how similar are the definitions of the limit of a sequence and that of
a function. The definition of (xn ) → p conveys the idea that xn can be made arbitrarily
close to p if n is big enough. Analogously, f (x) → L conveys the idea that f (x) can
be made arbitrarily close to L if x is close enough to p. In fact, the following theorem
shows that the two concepts are closely related.
This sequence clearly converges to p but the sequence ( f (xn )) does not converge to L,
contradicting our original assumption.
This theorem makes an simple way to show that a limit does not exist. If we
find sequences (xn ) and (yn ) in E \ {p} such that lim xn = lim yn = p and lim f (xn ) 6=
lim f (yn ) then we can conclude that lim f (x) does not exist.
Now, almost every important result about limits of functions follows from the
preceding theorem and the fact that a point p ∈ M is a limit point if, and only if there is
a sequence (pn ) of points in M \ {p} which converges to p (Theorem 3.1.2).
Corollary 3.1.2 — A function can converge to at most one point. Let M and N
be metric spaces and p be a limit point of M. If f (x) → L1 as x → p and f (x) → L2
as x → p, then L1 = L2 .
Corollary 3.1.3 Suppose X1 , X2 are normed vector spaces of finite dimension and M
is a metric space. If f = ( f1 , f2 ) is a function from M to X1 × X2 , then
lim f (x)g(x) = L1 L2 .
x→p
Moreover, if L1 6= 0, then
1 1
lim = .
x→p f (x) L1
lim f (x) = 0.
x→p
Example 3.1 Consider the function f : E → R such that f (x) = x/|x|. If E = (0, 1),
then
lim f (x) = 1.
x→0
This shows that it is important to consider the domain of a function when evaluating its
limit.
We’ll show that f (x) has no limit when x → 0. If f (x) converged to L, then we would
have that f (xn ) → L for every sequence of non-zero real numbers. However, if xn = 1/n:
lim f (xn ) = 1.
x→0
√
Hence L = 1. But taking xn = 2/n we see that
lim f (xn ) = 0.
x→0
We now define limits in the extended real line using the topological definition given
before.
Definition 3.2.2 — Limit of a Function. Let E be a subset of the real line and
f : E → R be a function. We say that
lim f (x) = L,
x→p
We can now prove that the results in the beginning of this section are valid. For all
ε > 0, we can take c = 1/ε so that 1/x ∈ (−ε, ε) whenever x ∈ (c, +∞). This implies
1
lim = 0.
x→∞ x
The other limit is analogous. We now define the limit superior and inferior for
functions on the real line.
Definition 3.2.3 — Limit Superior/Inferior of Functions. Let E be a subset of R, p
be a limit point of E and f : E → R be a function. Consider the set
Since Theorem 4.1.1 holds, the limit superior and inferior of functions satisfies most
of the properties of the limit superior and inferior of sequences. We’ll leave the reader
with the task of checking these properties.
whenever x ∈ V \ {p}.
iii) If X = R or X = C, f1 (x) = o(g1 (x)) and f2 (x) = o(g2 (x)), then f1 (x) f2 (x) =
o(g1 (x)g2 (x)).
iv) If f (x) = o(g1 (x)) and g1 (x) = o(g2 (x)), then f (x) = o(g2 (x)).
√
p such that | f1 (x)| < ε|g1 (x)| and a neighborhood
iii) There is a neighborhood V of√
U of p such that | f2 (x)| < ε|g2 (x)| whenever x ∈ V \ {p} and x ∈ U \ {p}
respectively. Hence,
| f1 (x) f2 (x)| = | f1 (x)|| f2 (x)| < ε|g1 (x)||g2 (x)| = ε|g1 (x)g2 (x)|
f (x)
lim = 1.
x→p g(x)
As before, we’ll usually omit the subscript when the context makes it clear.
Example 3.3 — Fundamental Trigonometric Limit. I affirm that sin(x) ∼ x (as x → 0)
holds. In fact, since
∞ ∞
sin(x) − x (−1)n 2n (−1)n 2n−1
=∑ x =x∑ x ,
x n=1 (2n + 1)! n=1 (2n + 1)!
we have that
∞
| sin(x) − x| (−1)n
≤ |x| ∑ |x|2n−1 .
|x| n=1 (2n + 1)!
As x → 0, |x| → 0 and ∑(−1)n |x|2n−1 /(2n + 1)! → 0.‡ Hence, for all ε > 0,
3.4 Continuity
As we saw in chapter 2, if M and N are metric spaces, a function f : M → N is said to
be continuous at p ∈ M if for all ε > 0 there is a real number δ > 0 such that
d( f (x), f (p)) < ε whenever d(x, p) < δ .
Obviously this definition is very similar to Definition 4.1.1. However it has a few
notable differences. Firstly, for a function to be continuous at p, p has to be an element
of M, while in the limit it can just be a limit point. If p is an isolated point of M, f is
surely continuous at p since there is a neighborhood N of p such that N = {p} and hence
d( f (x), f (p)) = 0 for all x ∈ N.
If p is a limit point of M, then f is continuous at p if, and only if
lim f (x) = f (p).
x→p
Proof. The only change from this proof to that of Theorem 4.1.1 is the lack of
requirement that xn 6= p for all n ∈ N.
Proof. If p is an isolated point of M, then the result is trivial. Otherwise the result
follows from Corollary 4.1.4.
Example 3.4 We’ll now consider rational functions of the form f : C → C such that
an xn + an−1 xn−1 + . . . + a1 x + a0
f (x) =
bm xm + bm−1 xm−1 + . . . + b1 x + b0
for some suitable complex constants an , . . . , a0 , bm , · · · , b0 and positive integers n, m.
Of course the identity function f (x) = x is continuous everywhere (just take δ = ε).
Theorem 4.4.2 implies that every polynomial function of the form
f (x) = an xn + an−1 xn−1 + . . . + a1 x + a0
3.4 Continuity 61
is continuous everywhere.
If bm pm + bm−1 pm−1 + . . . + b1 p + b0 6= 0 then
an xn + an−1 xn−1 + . . . + a1 x + a0
f (x) =
bm xm + bm−1 xm−1 + . . . + b1 x + b0
is also continuous at p.
{at ∈ R | t ≤ b, t ∈ Q}.
for any ε > 0. Hence f is continuous. This also follows readily from the fact that every
point of N is isolated.
62 Chapter 3. Functional Limits and Continuity
Note that in Example 4.6, we choose a δ that works for whatever p we picked before.
That was not possible in Example 4.7. This property possessed by the identity function
is called uniform continuity.
Definition 3.5.1 — Uniform Continuity. Let M, N be metric spaces. A function
f : M → N is said to be uniformly continuous if for every ε > 0, there is a δ > 0 such
that for all x, y ∈ M:
Proof. If f is not uniformly continuous, there exists a ε > 0 such that for any δ > 0
there exists two points x, y ∈ M such that d(x, y) < δ but d( f (x), f (y)) ≥ ε. Taking δ =
1/n we get two sequences (xn ), (yn ) such that d(xn , yn ) < 1/n but d( f (xn ), f (yn )) ≥ ε
for each n ∈ N.
Since M is compact, (xn ) has a convergent subsequence (xnk ). Similarly, (ynk ) has a
convergent subsequence (ynk j ). As (xnk ) converges, so does (xnk j ). Using the fact that
d(xnk j , ynk j ) < 1/nk j we see that (xnk j ) and (ynk j ) converge to the same value, namely
a ∈ M. But f is continuous, so it follows that
So,
d( f (xnk j ), f (ynk j )) ≤ d( f (xnk j ), f (a)) + d( f (ynk j ), f (a)) < ε.
Example 3.8 In exercise 4.15 you’ll show that if f is uniformly continuous and
lim(xn − yn ) = 0 then
lim f (xn ) − f (yn ) = 0.
n→∞
1
lim f (xn ) − f (yn ) = lim n2 + 2 + − n2 = 2.
n→∞ n→∞ n2
Hence f is not uniformly continuous in any domain that contains {xn } and {yn }. Since
f is continuous, this implies R is not compact.§
whenever |x − y| < ε/(max{|2a|, |2b|}). Notice that δ = ε/(max{|2a|, |2b|}) works for
whichever points x, y in E we happen to choose. Hence f is uniformly continuous in E.
§ As if we didn’t already knew that...
64 Chapter 3. Functional Limits and Continuity
3.6 Exercises
f (2t)
lim = 1.
t→∞ f (t)
Prove that
f (mt)
lim =1
t→∞ f (t)
for any m > 0.
lim [ f (x + h) − f (x − h)] = 0
h→0
Exercise 3.6 Prove that there is not a function g : R → R that is continuous at all
x ∈ Q and discontinuous at all x ∈ R \ Q.
Hint: for each n ∈ N, consider the set
An = {x ∈ R|∃δ > 0 such that |x−a| < δ and |x−b| < δ =⇒ |g(a)−g(b)| < 1/n}.
Prove
T∞
that An is open for each n and that g is continuous at x if, and only if x ∈
n=1 n . Suppose that Q can be written as a countable intersection of open sets and
A
show that this contradicts Baire’s theorem.
3.6 Exercises 65
Exercise 3.7 — Norms are Continuous. If X is a normed space, show that the
function f : X → R such that f (x) = kxk is continuous.
Prove that there exists t ∈ [0, 1] with f (t)2 + 3 f (t) = g(t)2 + 3g(t).
Exercise 3.10 Prove that a continuous function from R to R which maps open sets
to open sets must be monotonic.
Exercise 3.11 Let f be a continuous function from R to R such that | f (x) − f (y)| ≥
|x − y| for all x and y. Prove that the range of f is all of R.
Exercise 3.14 Prove that a function f : (a, b) → R is uniformly continuous if, and
only if there exists a continuous function f˜ : [a, b] → R such that f˜(x) = f (x) for all
x ∈ (a, b).
Exercise 3.15 Recall the definition of equivalent sequences given in Chapter 3 (Def-
inition 3.4.2). Let M, N be metric spaces and (xn ), (yn ) be two arbitrary equivalent
sequences in M. Prove that a function f : M → N is uniformly continuous if and only
if ( f (xn )) is equivalent to ( f (yn )).
66 Chapter 3. Functional Limits and Continuity
lim f (x)
x→∞
Exercise 3.21 Show that the requirement in the definition of uniform continuity can
be rephrased as follows, in terms of diameters of sets: To every ε > 0 there exists a
δ > 0 such that diam f (E) < ε for all E ⊂ X with diam E < δ .
CHAPTER
The Derivative
4
The motivation for the derivative comes from geometrical investigations. That perspec-
tive, together with countless applications, are the bulk of any calculus book. In analysis
our outlook will be focused on the properties and generalizations of such object.
Unlike the previous chapters, we present firstly the usual derivative of real functions
and then consider generalizations. This is due the fact that there is not a single possible
generalization and it is not obviously clear how one should generalize the derivative to
broader spaces than the real line.
f (x) − f (p)
lim
x→p x− p
exists. In this case its value is said to be the derivative of f at p and it is denoted
f 0 (p) or dd xf (p). We then associate with f a function f 0 , defined in the points where
f is differentiable, such that f 0 (x) is the derivative of f at x. This function is, with a
slightly abuse of language, also called the derivative of f .
Leibniz’s notation dd xf makes the derivative appear to be the quotient of two quantities.
Although this interpretation is possible (at least in the case of a real function), it is not
simple at all. The reader is safer thinking about dd xf as a single expression.
It may be possible that the derivative of f is also differentiable. In this case its
2
derivative is the second derivative of f and it is denoted as f 00 , f (2) or even dd x2f . In
dn f
general, we denote the n-th derivative of f as f (n) or d xn . If f has derivatives of all
orders, then it is said to be smooth.
68 Chapter 4. The Derivative
The converse does not hold, as it can be seen in the following example.
Example 4.1 Consider f : R → R such that f (x) = |x|. If p > 0, then
f (x) − f (p)
f 0 (p) = lim = 1.
x→p x− p
Similarly, if p < 0 we have that f 0 (p) = −1. However, f is not differentiable at 0 as
|x|
lim
x→0 x
does not exist. Hence f is continuous but not differentiable at 0.
As we’ll see in the seventh chapter, there are continuous functions on the real line
that are nowhere differentiable.∗
Theorem 4.1.2 Suppose f and g are differentiable at p. Then so are f + g, f g and
f /g (provided g(p) 6= 0 in the last case). Moreover,
and
f 0 (p)g(p) − f (p)g0 (p)
( f /g)0 (p) = ,
g(p)2
if g(p) 6= 0.
Proof. The proof of this theorem is based on two algebraic identities. Letting
x → p in
f (x)g(x) − f (p)g(p) g(x) − g(p) f (x) − f (p)
= f (x) · + g(p) ·
x− p x− p x− p
results in ( f g)0 (p) = f 0 (p)g(p) + f (p)g0 (p). Similarly,
f (x)/g(x) − f (p)/g(p) 1 f (x) − f (p) g(x) − g(p)
= g(p) · − f (p) ·
x− p g(x)g(p) x− p x− p
results in the corresponding identity for the derivative of the quotient. The derivative of
the sum is trivial.
Arguably, the following theorem is the most important result about derivatives of
real functions. It is usually denoted by the name of "chain rule" in calculus.
h(y) = y − f (p) .
0
g ( f (p)) if y = f (p)
Now, if f (x) 6= f (p) we have that
g( f (x)) − g( f (p)) f (x) − f (p)
= h( f (x)) · .
x− p x− p
If f (x) = f (p) while x 6= p, then both sides of the previous equation equal zero and
hence this equation holds for all x ∈ R \ {p}.
Our function h is continuous at f (p) since g is differentiable at f (p). Hence, letting
x → p we get the desired result.
∗ Consider the set R of all the sets X that are elements of itself. That is, R = {X | X ∈
/ X}. What would
happen if R ∈ R? And if R ∈/ R?
72 Chapter A. Logic and Set Theory
[x] = {p ∈ A | p ∼ x}.
Obviously, equivalence classes are non-empty, since equivalence relations are reflex-
ive.
Theorem A.1.1 Two equivalence classes [x] and [y] are either disjoint or equal.
Proof. Let [x] be the equivalence class determined by x, and let [y] be the equiva-
lence class determined by y. Suppose that [x] ∩ [y] is not empty. So, let z ∈ [x] ∩ [y].
A
[x] [y]
w z y
x
A.2 Countability
Definition A.2.1 For any set A, we say:
This suggests that we might characterize infinite sets as the ones which are equivalent
to some proper subset of itself. It is true and you should try to prove it for countable sets.
The proof that every infinite (not necessarily countable) set is equivalent to some
proper subset of itself is not as easy and will be omitted.
This theorem shows that, roughly speaking, countable sets represent the “smallest”
infinity. That is, no uncountable set can be a subset of a countable set.
74 Chapter A. Logic and Set Theory
The following lemma will be very useful to prove some important theorems.
∞
[
S= En .
n=1
Then S is countable.
a Formally, a sequence is a function from N to any set. It is usual to denote the element f (i) as x and to
i
denote the whole function as (xn ).
Theorem A.2.3 Let A and B be countable sets. Then the cartesian product A × B is
countable.
The result follows with the same argument we used in the end of Lemma 1.5.2.
Corollary A.2.4 Let A be a countable set, and let An be the set of all n-tuples of
elements of A. Then, An is countable.
Cartesian product isn’t associative. An element of (A × B) ×C is of the form ((a, b), c),
while an element of A × B × C is of the form (a, b, c). However these two sets are
clearly equivalent as there is an obvious bijection between them. In the preceding proof,
A × Ak−1 6= Ak but A × Ak−1 ∼ Ak .
Proof. There is a bijection from the set of all fractions b/a (while −1 1
−2 and 2 are
certainly the same number, we will consider them to be different fractions) to the set Z2
of ordered pairs (b, a). Of course there is an injection from Q to the set of all fractions,
so Q is at most countable. But since N ⊂ Q and N is infinite, Q is countable.
Theorem A.2.6 Let A be the set of all sequences whose elements are the digits 0 and
1. Then, A is uncountable.
Define a new sequence b such that b(n) = 1 − en (n). Notice that b is an element of
A and that b is not a element of E, since it is different from each one of its elements in at
least one place.
So, it’s been shown that every countable subset of A is a proper subset of A. It follows
that A is uncountable (for otherwise A would be a proper subset of itself).
The idea of the above proof was first used by Georg Cantor, and is called Cantor’s
diagonal argument.
a0 zn + a1 zn−1 + . . . + an−1 z + an = 0.
Proof. Let
The set Am is finite, since each equation has only a finite set of solutions and there are
only finitely many equations satisfying n + |a0 | + . . .S+ |an | = m for each m.
∞
The set of all algebraic numbers, i.e. the union m=2 Am , is then at most countable.
Since every rational number is algebraic and there are infinite rationals, the set of all
algebraic numbers is exactly countable.
Notice how fortunate the preceding result is: at the present stage we can’t even prove
that the sum of two algebraic numbers is algebraic, let alone that the set of all algebraic
numbers is a field (it is). Yet we can prove that it is countable. To most people, the only
two non-algebraic (i.e. transcendental) numbers they’ll find in their lives are π and e
even though almost all complex numbers are transcendental.
A.3 Exercises
Exercise A.1 Let A be a finite set. Prove that cardinality is a well defined function.
That is, if |A| = n and |A| = m, then n = m.
Exercise A.2 — Inclusion-Exclusion Principle. If A and B are finite sets, prove that
|A ∪ B| = |A| + |B| − |A ∩ B|.
Prove a similar formula for 3 sets.
Exercise A.3 — Dirichlet’s Box Principle. Let A and B be finite sets such that
|A| > |B|. Prove that there is no injection from A to B.
It is usual to write this principle in the following way:
A.3 Exercises 77
"If you have m objects to put in n < m boxes, then at least one box will have more
than one object."
Hint: It may be useful to prove the result of Exercise 9 for finite sets.
• A1 ∪ A2 = A • B1 ∪ B2 = B • f (A1 ) = B1
• A1 ∩ A2 = ∅ • B1 ∩ B2 = ∅ • g(B2 ) = B2 .
Exercise A.5 Prove that no order can be defined in the complex field that turns it
into an ordered field.
Hint: −1 is a square.
Exercise A.6 Prove that every ordered field contains a copy of the rational field.
(Actually it may contain a set that is isomorphic to the rational field. That is, a set
that is equivalent to Q and preserves all the ordered field operations.)
Exercise A.8 In the proof of Lemma 1.5.2 we ordered the elements enk in a sequence
e11 , e21 , e12 , e31 , e22 , e13 , . . . . That is, we implied that there is a bijection f : N → N2
such that f (1) = (1, 1), f (2) = (2, 1), f (3) = (1, 2), f (4) = (3, 1) and so on.
Find this function explicitly and prove that it is a bijection.
Exercise A.10 Let α be a irrational number. We denote by {x} the fractional part
of x. That is, the only real number that satisfies 0 ≤ {x} < 1 and x = m + {x} for
some integer m.
Prove that the set A = { {nα} | n ∈ N} is dense in [0, 1). That is, for all non-empty
subsets (a, b) ⊂ [0, 1) there is a real number x ∈ A such that a < x < b.
You may want to partition the interval [0, 1) in the following way
1 1 2 n−1
[0, 1) = 0, ∪ , ∪...∪ ,1
n n n n
p 1
α− < .
q qn0
Exercise A.14 Let X be an infinite set. Prove that X contains a countable subset.
This exercise is meant to be easy. However, your solution is surely wrong if we do
not assume the Axiom of Choice. Now would be a good time to delight yourself with
A.3 Exercises 79
a bit of math.
Take a look at the precise statements of the Axiom of Choice and the Well Ordering
Theorem. It should be almost clear that the Axiom of Choice is "obviously" right
and the Well Ordering Theorem is "obviously" wrong. Yet they are equivalent...
Exercise A.15 Let B be the set of all bijections from N to itself. Prove that B is
uncountable.
Hint: use a variation of Cantor’s diagonal argument here.
Exercise A.16 — Cantor’s Theorem. Let X be a set and P(X) be the set of its
subsets. We will prove that there is no bijection from X to P(X).
Suppose there is a bijection f : X → P(X) and consider the set A = {x ∈ X | x ∈/
f (x)}. Since A is a subset of X, there is an element a ∈ X such that A = f (a). What
can we conclude from here?
Exercise A.17 At first it seems like we could use Cantor’s diagonal argument to
prove that (0, 1) is uncountable in the following way.
Suppose (0, 1) is countable, then list the decimal expansions of all its elements. We
create a new number that is in (0, 1) but was not counted by switching the i-th digit
of the i-th number in our sequence from k to k + 1 if k 6= 9 and to 0 if k = 9.
This is not a valid proof. Why?
x2 = 2.
Since x is rational, we can write x as p/q, where p and q are integers not both even. We
get then that
p2 = 2q2 ,
which implies that p2 and hence p is even. Since p is even, we can write p as 2k, for
some integer k, and conclude that q is even too. This contradicts our assumption that p
and q are not both even.
Suppose now that B has a smallest element b and consider a = 2/b. Of course
a 6= b, as that would imply a rational solution to x2 = 2. Since a2 = 4/b2 < 4/2 = 2, we
conclude that a ∈ A and hence a < b. However (a − b)2 > 0 implies ( a+b 2
2 ) > 2, which
contradicts the minimality of b as (a + b)/2 ∈ B and (a + b)/2 < b.
Suppose A had a largest element a and consider the numbers b = 2/a and c =
(a + b)/2. Similarly to what we just did, it follows that a < c < b. Writing b and c in
terms of a, we get that ( a+c 2 2 2 2
2 ) − 2 = 9(a − 2)(a − 2/9)/16a < 0 for 2/9 < a < 2.
2
As the set of all real numbers is so important in analysis, now would be a good time
to construct such set. However, I find it’s hard for most students to appreciate such
construction and to notice why it is important at this point. My approach will be to show
some standard set-theoretic definitions and we’ll define the set of all real numbers as
the set that satisfies some desirable properties. As soon as we have the machinery of
Cauchy sequences I shall present one of the possible constructions. I also recommend
the reading of Rudin’s[13] first chapter for an algebraic approach to the real number set.
It is not hard to memorize the axioms for the frequently used algebraic structures
because they are essential to some important sets. For example, both R and Q are fields.
However, R and Q are not the only fields that exists. We need further characterization to
know what really is R.
Definition B.2.2 — Ordered Sets. An ordered set is a set S, together with a relation
> such that:
• (Trichotomy) For any x, y ∈ S, exactly one of x > y, x = y, or y > x holds.
The set of all rational numbers becomes an ordered set when we define x > y if
x − y = p/q where p, q ∈ N. Similarly, Z and N are ordered sets with the usual order
relation.
It is customary to define x ≥ y to mean x = y or x > y, without specifying which one
applies. The notation x ≤ y is defined in the obvious way.
It is not obvious at all that the order we defined earlier for the cardinality of sets is in
fact a order relation. Trichotomy holds if, and only if |A| ≥ |B| and |B| ≥ |A| implies
|A| = |B|, which is a result called Cantor-Schröder-Bernstein Theorem.
Notice that a supremum or infimum of E need not be in E. For example, the set
E = {x ∈ Q | x > 0} has infimum 0 and 0 is not an element of E. The set Z has no
infimum and no supremum since it is not bounded above or below.
Definition B.2.4 An ordered set S is said to possess the least-upper-bound property
if every nonempty subset E ⊂ S that is bounded above has a least upper bound in S.
That is, for all non-empty set E ⊂ S that is bounded above, sup E exists in S.
Exercise B.1 Prove the following statements using only the field axioms.
All letters are meant to denote elements from a field.
a) If x + y = x + z then y = z.
84 Chapter B. The Construction Of The Real Field
b) If x 6= 0 and xy = xz then y = z.
c) 0x = 0.
d) If x 6= 0 and y 6= 0 then xy 6= 0.
The following properties hold for ordered fields. Notice that, while the complex
number set is a field, it is not an ordered field.
Exercise B.2 Prove the following statements using only the ordered field axioms.a
(That means you can only use definitions 1.3.1, 1.3.2 and 1.3.5.)
All letters are meant to denote elements from an ordered field.
a) If x is positive then −x is negative.
b) If x is positive and y > z then xy > xz.
Well, we can always rename the elements of R to create another set that satisfies
definition 1.4.1. Technically we say that the real number set is the only ordered field
with the least-upper-bound property up to a isomorphism. And that is a theorem that I
will not prove.
One of the most important consequences of the least-upper-bound property in the
real line is the Archimedean property. It roughly says that the set {x ∈ R | x > 0} has no
smallest element.
Theorem B.3.1 — Archimedean Property. For all ε > 0 in R there exists a natural
n such that 1/n < ε.
Proof. Suppose the theorem is false. Then there exists x0 > 0 in R such that no
natural n makes 1/n < x0 true. That is, for all n ∈ N we have that nx0 ≤ 1. Let
S = {y ∈ R | y = nx0 , n ∈ N}.
shall call it k.
Of course we have that k − x0 < k since x0 is positive. But then k − x0 is not an upper
bound of S, so mx0 > k − x0 for some m ∈ N. That means k < (m + 1)x0 , which is absurd
since k is an upper bound of S.
For now, we’ll say that a set E ⊂ R is dense in R if, and only if for all a, b in R such
that b > a there exists c in E such that a < c < b. (We will generalize this definition to
metric spaces in the following chapter.)
Theorem B.3.2 The set of all rational numbers is dense in the set of all real numbers.
k
M = {m ∈ Q | m = , k ∈ N and m ≤ a}.
n0
Corollary B.3.3 The set of all irrational numbers is dense in the set of all real
numbers.
Proof. Substitute M in the proof of Theorem 1.4.2 by
√ k
M 0 = {m ∈ R \ Q | m =
2 , k ∈ N and m ≤ a}
n0
√
and c = max M + 1/n0 by c0 = max M 0 + 2/n0 .
−∞ < x < +∞
for all x ∈ R.
We also define the following operations, which are valid for all x ∈ R:
86 Chapter B. The Construction Of The Real Field
• x + ∞ = +∞ if x 6= −∞ • x/(+∞) = 0 if x 6= ±∞
• x − ∞ = −∞ if x 6= +∞ • x/(−∞) = 0 if x 6= ±∞
• x · (+∞) = +∞ if x > 0 • +∞/x = +∞ if x > 0 and x 6= ±∞
• x · (+∞) = −∞ if x < 0 • +∞/x = −∞ if x < 0 and x 6= ±∞
• x · (−∞) = −∞ if x > 0 • −∞/x = −∞ if x > 0 and x 6= ±∞
You should prove that this is, in fact, a field. For every complex number z = (a, b),
we define a to be the real part Re(z) and b to be the imaginary part Im(z) of z.
Exercise B.3 Prove that the complex field, with the operations defined above and
with (0, 0) and (1, 0) as the sum and product identities respectively, satisfies all the
−b
field axioms. You should define −(a, b) as (−a, −b) and (a, b)−1 as a2 +b a
2 , a2 +b2 .
Since (a, 0) + (b, 0) = (a + b, 0) and (a, 0) · (b, 0) = (ab, 0) we will write (a, 0) as
simply a from now on. Provided that we define i as (0, 1) we can write every complex
number as a + bi since a + bi = (a, b).
We shall then define the conjugate of the number a + bi as a − bi and√denote it by
a + bi. We define the absolute value of a complex number a + bi to be a2 + b2 and
denote it by |a + bi|.∗ While square roots are still undefined, this will soon be fixed.
∗ We had a few notations popping in different places here. They usually aren’t a coincidence. If A is a set,
|A| measures roughly how "big" a set is and if z is a complex number, |z| measures roughly how "big" it is.
We’ll see in the next chapter that E denotes the closure of a set E. If (a, b) is a subset of R, its closure is [a, b].
It is based on this fact that we denote the extended real line as R. However, I don’t know a good reason for the
conjugate of a complex number to be defined with the same symbol as the closure of a set. Neither do I know
B.3 The Real and Complex Fields 87
Exercise B.4 Prove the following properties about complex conjugation and absolute
value. All letters are meant to be complex numbers.
a) z + w = z + w e) |zw| = |z||w|
b) zw = z · w f) z + z = 2 Re(z)
x = (x1 , x2 , . . . , xn ),
Exercise B.5 Prove the following properties about norms and inner products (the
product of vectors we just defined). All letters are meant to be elements of a vector
space (Rn or Cn ).
a) x · y = y · x. d) (x + y) · z = x · z + y · z.
One of the most important properties of this vector space (or any vector space in
general) is the Cauchy-Schwarz inequality.
why we denote ordered pairs the same way as we denote open intervals of the real line.
88 Chapter B. The Construction Of The Real Field
|x · y| ≤ kxk kyk .
Proof. If x · y = 0, then the theorem holds trivially. Otherwise, let λ be the complex
number x · y/ kyk2 . Since the square of every real number is positive (or zero),
0 ≤ kx − λ yk2
= kxk2 − λ (x · y) − λ (y · x) + |λ |2 kyk2
|x · y|
= kxk2 − .
kyk2
kx + yk ≤ kxk + kyk .
Proof.
x + y = x + z =⇒ (−x) + (x + y) = (−x) + (x + z)
=⇒ ((−x) + x) + y = ((−x) + x) + z (By associativity)
=⇒ 0 + y = 0 + z
=⇒ y = z.
c) Distributivity implies
0x = (a + (−a)) · x = ax + (−ax) = 0.
d) Suppose xy = 0 then, since y 6= 0, there exists y−1 such that yy−1 = 1. From (c),
xy(y−1 ) = 0(y−1 ) = 0
=⇒ x(yy−1 ) = 0 (By associativity)
=⇒ x · 1 = x = 0.
Which is a contradiction.
Remark: You might ask why we can add or multiply a number in both sides of an
equation. In fact, this is basically the formal definition of equality. Given two objects x
and y, we say that x = y if and only if, P(x) = P(y) for any predicate P.
91
92 Chapter J. Solutions
Solution 2
If x do not belong to B,
Solution 8 Firstly, I’ll prove the result of Exercise 1.9 for finite sets. That is, if there is
an injection from A to B and an injection from B to A then there is a bijection from A to
B.
Let f : A → B and g : B → A be the aforementioned functions. Since f is an injection,
|A| ≤ |B|. Since g is an injection, |A| ≥ |B|. Hence |A| = |B| and there is an bijection
from A to B. (Note that this proof does not involves the fact that | · | is an order relation.
It just involves the fact that |A| is an integer if A is a finite set.)
To prove our exercise we just need to know that |A| > |B| means that there is an
injection from B to A but there is not a bijection. If there was an injection from A to B,
the result we just proved would imply in such bijection, which is absurd!
Solution 9 First lets see that, if the partition described exists, then there is a bijection
from A to B. In fact, since f (A1 ) = B1 and g(B2 ) = A2 , the restrictions of f and g to A1
and B2 respectively are bijections.
A B
f
A1 B1
g
A2 B2
is a bijection. (With a little of notational abuse, since the functions f and g above are the
restrictions of f and g.) We now show that there exist such partition.
Let ρ : P(A) → P(A) be such that ρ(X) = A \ g(B \ f (X)). Note that, if ρ has
a fixed point, that is, a set F ⊂ A such that ρ(F) = F, then we could take A1 = F,
A2 = A \ F, B1 = f (F) and B2 = B \ f (F). It is the existence of such fixed point that I
will prove.
Consider the set E = {X ⊂ A | X ⊂ ρ(X)}. Since ∅ ∈ E, E is not empty. Let
[
F= X.
X∈E
Note that
[
ρ(F) = A \ g B \ f X
[
= A \ g B \ f (X)
\
= A\g (B \ f (X)) (De Morgan)
\
= A\ g(B \ f (X)) (g is injective)
[
= A \ g(B \ f (X)) (De Morgan)
[
= ρ(X).
If such a number does not exist, we say that the characteristic of the field is 0.
I affirm that every ordered field has characteristic 0.
In fact, if the field had characteristic n, then we would have that
which is absurd! This automatically implies that every field contains a subfield isomor-
phic to N.
Since fields have additive inverses, if x is in the field, so is −x. This means that every
field contains a subfield isomorphic to Z. The existence of inverses and the closure by
multiplication then implies the result.
96 Chapter J. Solutions
has exactly p distinct elements and hence it is equal to Z/pZ. Since [1] ∈ Z/pZ, there
is an integer b ∈ {0, 1, . . . , p − 1} such that [b] · [a] = [1].
Solution 13 Let n be a positive integer. The fundamental theorem of arithmetic implies
that every positive integer is the product of an power of 2 and an odd integer. That is,
f (n) = (un , vn )
is clearly onto. If we had that f (n) = f (m), then it would follow un = um and vn = vm ,
hence n = m.
Solution 14 — From [14]. Let Ax be the set {p/q ∈ F | p/q < x}.
It is easy to check that
for all integers m and n. Note that the definition of f for rational numbers makes sense
because if m/n = k/l, then ml = nk, so m · l = k · n. Hence, m/n = k/l.
It is also easy to check that
for all rational numbers r1 and r2 , and that f (r1 ) ≺ f (r2 ) if r1 < r2 .
The set Ax is certainly not empty, and it is also bounded above, for if r0 is a rational
number with r0 > x, then f (r0 ) > f (r) for all f (r) ∈ Ax . Since F has the least-upper-
bound property, the set Ax has a least upper bound so that sup Ax is well defined. We now
shall show that the definition of f for irrational x is actually a general definition. In other
words, if x is a rational number, we want to show that sup Ax = f (x), where f (x) here
denotes m/n, for x = m/n. This is not automatic, but depends on the least-upper-bound
property of F; a slight digression is thus required.
Since F has the least-upper-bound property, F is archimedean. The consequences
of this fact for R have exact analogues in F: in particular, if a and b are elements of F
with a ≺ b, then there is a rational number r such that a ≺ f (r) ≺ b. Having made this
observation, we return to the proof that the two definitions of f (x) agree for rational x.
If y is a rational number with y < x, then we have already seen that f (y) ≺ f (x). Thus
every element of Ax is ≺ f (x). Consequently,
sup Ax f (x).
On the other hand, suppose that we had sup Ax ≺ f (x). Then there would be a rational
number r such that
sup Ax ≺ f (r) ≺ f (x).
But the condition f (r) ≺ f (x) means that r < x, which means that f (r) is in the set
Ax ; this clearly contradicts the condition sup Ax ≺ f (r). This shows that the original
assumption is false, so sup Ax = f (x). It follows that f (x) = sup Ax holds for all x ∈ R.
We’ll now prove that f is an isomorphism of fields.
To rule out the possibility of equality, notice that there are rational numbers r and
s with x < r < s < y. We know that f (r) ≺ f (s). It follows that
2. f is injective.
If x 6= y, then either x < y or y < x; in the first case f (x) ≺ f (y), and in the second
case f (y) ≺ f (x); in either case f (x) 6= f (y).
3. f is onto.
Let a be an element of F, and let B be the set of all rational numbers r with
f (r) ≺ a. The set B is not empty, and it is also bounded above, because there is
a rational number s with a ≺ f (s), so that f (r) ≺ f (s) for r in B, which implies
98 Chapter J. Solutions
that r < s. Let x be the least upper bound of B; we claim that f (x) = a. In order to
prove this it suffices to eliminate the alternatives f (x) ≺ a and a ≺ f (x).
In the first case there would be a rational number r with f (x) ≺ f (r) ≺ a. But this
means that x < r and that r is in B, which contradicts the fact that x = sup B. In
the second case there would be a rational number r with a ≺ f (r) ≺ f (x). This
implies that r < x. Since x = sup B, this means that r < s for some s in B. Hence
f (r) ≺ f (s) ≺ a, again a contradiction. Thus f (x) = a.
4. f (x + y) = f (x) ⊕ f (y).
Suppose that f (x + y) 6= f (x) ⊕ f (y). Then either f (x + y) ≺ f (x) ⊕ f (y) or
f (x) ⊕ f (y) ≺ f (x + y). In the first case there would be a rational number r such
that f (x + y) ≺ f (r) ≺ f (x) ⊕ f (y). But this would mean that x + y < r. Therefore
r could be written as the sum of two rational numbers r = r1 + r2 , where x < r1
and y < r2 . Then, using the facts checked about f for rational numbers, it would
follow that f (r) = f (r1 + r2 ) = f (r1 ) ⊕ f (r2 ) f (x) + f (y), a contradiction. The
other case is handled similarly.
5. f (xy) = f (x) ⊗ f (y).
The same reasoning of the item 4 proves this for positive real numbers. The
general case is then a simple consequence.
{{m(k − j)α} | m ∈ N}
in each of the n elements of the partition. The result follows since for every subset
(a, b) ⊂ [0, 1) there are integers m, n such that
m m+1
, ⊂ (a, b).
n n
Solution 16 The result is equivalent to
1
|qα − p| < .
n0
99
1
{qα} < .
n0
x−
y y
x+
x
• After defining the values of g(2), g(4), · · · , g(2k − 2), define g(2k) to be the small-
est element of P which is not an element of {g(2), g(4), · · · , g(2k − 2), fk (2k)}.
List now all the elements of A = N \ P = {a1 , a2 , · · · } and define g(2k − 1) as ak for all
k ∈ N.
We have then that g is an element of B which is not an element of E, since g(2k) 6= fk (2k)
for all k ∈ N. We conclude that every countable subset of B is proper. It follows that B is
uncountable. (Otherwise B would be a proper subset of B.)
100 Chapter J. Solutions
2- Elements of Topology
Solution 1
Solution 2
Solution 3
Solution 4
Solution 5
Solution 6
Solution 7
Solution 8
Solution 9
Solution 10
Solution 11
Solution 12
Solution 13
Solution 14
Solution 15
Solution 16
Solution 17
Solution 18
101
Solution 19
Solution 20
Solution 21 Let f (x) = d(x, φ (x)). Note that, by the triangular inequality
which is absurd since α is the least value f can take. We conclude that α = 0 and
φ (x0 ) = x0 .
Suppose now that there is some other fixed point x00 . Then d(φ (x0 ), φ (x00 )) =
d(x0 , x00 ), which implies x0 = x00 .
Solution 22 — From [13].
Solution 23
Solution 24 Let x1 ∈ X \ E. Since E is a closed set, the distance from x1 to the elements
of E possesses a positive minimum d. Since d/r > d, there exists p ∈ E for which
kx1 − pk ≤ d/r. Define now x = (x1 − p)/ kx1 − pk. Then,
kx1 − (p + y kx1 − pk)k r
kx − yk = ≥ d = r.
kx1 − pk d
Since p + y kx1 − pk is an element of E, the result follows.
Similarly,
Hence |sn − sm | < xm . Since xm → 0, the Cauchy criterion implies that (sn ) converges.
The proof for m odd is analogous.
Solution 4 — From [4].
Solution 6
Solution 7
Solution 8
Solution 9
Solution 12
Solution 13
Solution 14
Solution 15
Solution 23
Solution 24
Solution 25
Solution 27
103
105