Functional Analysis. Jie Xiao
Functional Analysis. Jie Xiao
1 Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Metrizable Topology and Connectedness . . . . . . . . . . . . . . . . . . . . 1
1.2 Completeness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Compactness, Density and Separability . . . . . . . . . . . . . . . . . . . . . 12
Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2 Continuous Maps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.1 Criteria for Continuity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.2 Continuous Maps on Compact or Connected Spaces . . . . . . . . . . 27
2.3 Sequences of Mappings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
2.4 Contractions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
2.5 Equivalence of Metric Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
Remark 1.2.
(i) Thanks to these three properties of which (ii) and (iii) mean respectively
the symmetry and the triangle inequality of the metric space, the function d
is usually called the distance function.
2 1 Metric Spaces
(ii) If d(·, ·) has all the properties (i)-(ii)-(iii) except that d(x, y) = 0 does
not necessarily enforce x = y, then we call d a pseudo-distance and (X, d) a
pseudo-metric space.
Example 1.3.
(i) For n ∈ N let Rn be the set of all finite number sequences x = (xk )nk=1
where the real number xk is called the kth coordinate of x. Define
n
X 1/2
dn (x, y) = |xk − yk |2 , x = (xk )nk=1 , y = (yk )nk=1 ∈ Rn .
k=1
Then
n
X
Q(t) = (|xk |t + |yk |)2 ≥ 0.
k=1
This infers that Q(t) cannot have two distinct zeros. Therefore, by the
quadratic formula, its discriminant B 2 − 4AC ≤ 0. In other words,
n
X n
2 X n
X
|xk ||yk | − |xk |2 |yk |2 ≤ 0.
k=1 k=1 k=1
(dn (x, y) + dn (y, z))2 = dn (x, y)2 + 2dn (x, y)dn (y, z) + dn (y, z)2
Xn
≥ ((xk − yk ) + (yk − zk ))2
k=1
= dn (x, z)2 .
Taking the square root of the first and last members, we further obtain
is finite. If
kf1 − f2 kpp,mg ,E
, p ∈ (0, 1)
dp,mg ,E (f1 , f2 ) =
kf1 − f2 kp,mg ,E , p ∈ [1, ∞],
Theorem 1.4.
(i) Let X be any nonempty set and define the discrete distance d(x, y), x, y ∈
X, by
1 , x 6= y
d(x, y) =
0 , x = y.
Then (X, d) is a metric space.
(ii) If (X, d) is a metric space and Y is a subset of X, then (Y, d) is a metric
space for which the domain of d is now restricted to Y × Y .
Proof. (i) It is clear that this d satisfies Definition 1.1 (i) and (ii). To verify
Definition 1.1 (iii), consider any points x, y, z ∈ X. If d(x, z) = 0, then Defini-
tion 1.1 (iii) is trivial. If d(x, z) > 0, then x 6= z and d(x, z) = 1, and hence y
cannot equal both x and z, so at least one of the distances d(x, y) and d(y, z)
must equal 1. Therefore, Definition 1.1 (iii) follows.
(ii) This truth is an immediate consequence of the definition of metric
space and the concept of subset.
The topology for a space of points provides a theory of cluster points and
convergence for sets of points in the space. In a metric space, this convergence
theory is based on the concept of distance between points. For more abstract
spaces it is based on a system of open sets, and there may be no distance
4 1 Metric Spaces
concept associated with the notion of limit. We here take the former approach
and use the metric distance function to define certain open sets, which are
generalizations of the open intervals in R.
In the rest of this section, we always assume that X is a metric space with
distance function d.
Example 1.10.
(i) Any open ball Br (x) in a given metric
√ √ space X is a bounded open set.
(ii) In R2 , let A = B2 (0) and x = ( 2, 2); then x ∈ A0 . Since
√ √
( 2 − /2, 2 − /2) ∈ B2 (0) ∩ B (x)
Then (0, 0) is a cluster point of F , and since (0, 0) is not in F we infer that F
is not closed. Also, the point (1, 0) is not an interior point of F , so F is not
open.
Remark 1.12. Perhaps the greatest value of Theorem 1.11 is that it gives us
an alternative method for proving that a set is open or closed. It is sometimes
easier to work with the complement of a set than with the set itself. For
instance, we know immediately that X \ {x} is open because {x} is closed.
Theorem 1.13.
(i) The intersection of a finite number of open sets is itself an open set .
(ii) The union of any collection of open sets is an open set.
Proof. (i) Suppose each of A1 , ..., An is open, and let x ∈ A = ∩nk=1 Ak . Then
x is in each of A1 , ..., An , so for each 1 ≤ k ≤ n there exists a positive
radius rk such that Brk (x) ⊆ Ak . Define r = min{r1 , ..., rn }. Then for each k,
Br (x) ⊆ Brk (x) ⊆ Ak ⊆ A. Hence x is in A◦ , and so A is open.
(ii) Suppose that for each j in an index set I, Aj is an open set. Let
x ∈ ∪j∈I Aj . Then x belongs to some Aj , say, Aj0 . Since it is open, there
exists an open ball Br (x) such that Br (x) ⊆ Aj0 ⊆ ∪j∈I Aj . Hence ∪j∈I Aj is
open.
Corollary 1.14.
(i) The union of a finite number of closed sets is a closed set.
(ii) The intersection of any collection of closed sets is itself a closed set .
6 1 Metric Spaces
Proof. It suffices to prove (i). Suppose that F1 , ..., Fn are closed sets. It is easy
to verify that X \ ∪nk=1 Fk = ∩nk=1 (X \ Fk ). By Theorem 1.13, each X \ F is
open. Thus, according to Theorem 1.13 (i) their intersection is open. Hence
∪nk=1 Fk is closed since its complement is open.
Remark 1.15.
(i) The union of an infinite collection of closed sets need not be closed. For
instance, let Fk be the singleton (1/k, 0) in R2 . It is clear that (0, 0) is a
cluster point of ∪∞k=1 Fk , but (0, 0) is not in this union. So the union is not
closed. Of course, it is not hard to give an example of an infinite collection of
open sets whose intersection is not open. For example, Ak = B1/k (0). Clearly,
∩∞k=1 Ak = {0} which is not open.
(ii) We should take particular notice of the fact that the collections of sets in
Theorem 1.13 and Corollary 1.14 may be arbitrarily large. In particular, the
collections may contain so many sets that is impossible to describe them as
an infinite sequence of sets. This is why we use the device of an index set I to
describe the collection. Had we written {An }∞ n=1 for the collection of sets, this
would have indicated a sequence of sets and therefore limited our discussion
to countable collections.
It is worth pointing out that the above S1 and S2 satisfy a property that is
stronger than being disjoint. Not only must S1 and S2 fail to contain any point
of the other set, they must also fail to contain any cluster point of the other
set. Note that this extra stipulation in Definition 1.16 is significant because
any set containing two or more points can be written as the union of two
nonempty disjoint subsets – for x ∈ S, let S1 = {x} and S2 = S \ {x}. But as
we see in the next theorem, when we are dealing with open sets, disjointness
is sufficient to imply that their union is disconnected.
Proof. Suppose that A and B exist as in the statement of Theorem 1.17 and
let S1 = A ∩ S and S2 = B ∩ S. Clearly, S1 and S2 are nonempty, and
S1 ∪ S2 = S. Consider an arbitrary point x ∈ S1 ; then x is in the open set A,
so for some r > 0, Br (x) ⊆ A. Thus Br (x) ∩ B = ∅ because A ∩ B = ∅, so
Br (x) ∩ S2 = ∅. Thus x ∈ / S 2 ; hence S1 ∩ S 2 = ∅. Similarly, S 1 ∩ S2 = ∅, and
we conclude that S is disconnected.
Now assume that S is disconnected, say, S = S1 ∪ S2 and S1 ∩ S 2 =
S 1 ∩ S2 = ∅. Then no point of S1 is in S20 . For x ∈ S1 , choose Br (x) such
that Br (x) ∩ S2 = ∅. Thus A = ∪x∈S1 Br/2 (x) is an open set that contains no
point of S2 . Similarly, we can take a ball Bs (y) for each y ∈ S2 that satisfies
Bs (y) ∩ S1 = ∅ and define B = ∪x∈S2 Bs/2 (x), which is open. It remains only
to show that A∩B = ∅. Suppose not, and let p ∈ A∩B. Then for some x ∈ S1
and y ∈ S2 , p ∈ Br/2 (x) ∩ Bs/2 (y). But this implies that
1.2 Completeness
To discuss the completeness of a metric space, we start with the definition of
the convergence of a point sequence in the metric space.
A point sequence is a function from N into a metric space (X, d); this is,
it is a sequence whose terms are points in X. Since our principal examples of
metric spaces are the Euclidean spaces and we use subscripts to designate the
coordinates of points in Rn , we use superscripts to index the terms of a point
sequence – (x(k) )∞
k=1 represents a point sequence in a metric space.
Definition 1.19. Given a metric space (X, d). The point sequence (x(k) )∞ k=1
in X is said to converge to the point x ∈ X, denoted limk→∞ x(k) = x,
provided for every > 0 there is an N ∈ N such that d(x(k) , x) < whenever
k > N.
8 1 Metric Spaces
Proof. Suppose x ∈ A0 and r1 = 1. Then there exists a point x(1) in Br1 (x) ∩
(A \ {x}). Let r2 = min{1/2, d(x(1) , x)}, and choose a point x(2) in (A \
{x}) ∩ Br2 (x). After x(1) , ..., x(k) have been defined, let rk+1 = min{1/(k +
1), d(x(k) , x)} and choose x(k+1) as a point of A\{x} in the open ball Brk+1 (x).
Since rk ≤ 1/k, it is clear that limk→∞ x(k) = x; and since rk ≤ d(x(k−1) , x),
it follows that no two x(k) ’s are the same.
To prove the converse, we simply observe that if A contains such a point
sequence that converges to x, then it is obvious from the definition that every
open ball about x contains infinitely many points of A.
Thus r is at least one unit more than the distance between x(N +1) and x(k)
for every k = 1, 2, ...; so Br (x(N +1) ) contains every point x(k) of the sequence.
(iii) Suppose now x ∈ X is the limit of (x(k) )∞ k=1 under d. Then (x
(k) ∞
)k=1
is a sequence in both X and Y , and it converges in X; so by (i) it is a Cauchy
sequence in X. The distance function d on Y ×Y is the same one as on X ×X,
so (x(k) )∞k=1 is also a Cauchy sequence in Y . But in Y there is no point to
which (x(k) )∞k=1 can converge, because x has been removed and the limit of a
sequence is unique.
Accordingly, we see from Theorem 1.24 that Example 1.23 cannot be gen-
eralized to all metric spaces. This idea is the general principle behind the
following example.
Example 1.25.
(i) Let X = B1 (0) be the open unit ball in R2 with the usual Euclidean dis-
tance. Consider a sequence in B1 (0) that converges to a point on its boundary,
say, x(k) = (1 − 1/k, 0), so that it has cluster point (1, 0). Then this sequence
is a Cauchy sequence in X, but it has no limit in X, so it does not converge
in X.
(ii) Again for n ∈ N let Qn be the subset of Rn consisting of√all points
for which all coordinates are rational numbers. The point x = ( 2, 0, ..., 0)
is in Rn \ Qn√ , but if (rk )∞
k=1 is a sequence of rational numbers such that
limk→∞ rk = 2, and x(k) = (rk , 0, ..., 0), then limk→∞ x(k) = x. Therefore
(x(k) )∞
k=1 is a Cauchy sequence but does not converge in Q .
n
Proof. Since statements (ii)-(v) refer to bounded sets, it will be more conve-
nient to use another form of the boundedness; that is, a set S is bounded in
X if and only if it is contained in an n-cube Q with side-length 2r > 0, i.e.,
S ⊆ Q = {x ∈ Rn : |xj | ≤ r for j = 1, ..., n}.
(i)⇒(ii) Let F be a closed and bounded set and suppose that we are given
that {Aµ }µ∈I is an open cover of F as in (ii). Since F is bounded, there is
an n-cube Q1 containing F . Assume that F cannot be covered by any finite
number of the Aµ ’s. Subdivide Q1 into 2n n-cubes by halving each of the
coordinate intervals of Q1 ; that is, the jth coordinate of a point x in one of
the n-cubes is restricted to either [0, r] or [−r, 0] instead of [−r, r]. If it were
possible to cover those parts of F that lie in each one of the 2n n-cubes with
finitely many Aµ ’s, then the union of all of these Aµ ’s would still be only a
finite number and would cover F . Therefore one of these 2n n-cubes must
contain a subset of F that cannot be covered by any finite number of the
Aµ ’s. Call this n-cube Q2 . Now subdivide Q2 into 2n n-cubes and repeat the
process. The result is a sequence of n-cubes such that Qk+1 ⊆ Qk , and the
jth coordinate of each point in Qk is restricted on an interval whose length is
r22−k . Choose a point sequence (x(k) )∞ k=1 such that x
(k)
is in F ∩ Qk . Then
(k) ∞
(xj )k=1 – the sequence of jth coordinates forms a Cauchy number sequence
in R thanks to
1.2 Completeness 11
(k) (m)
|xj − xj | ≤ r(22−k + · · · + 22−m ) < r23−m → 0 as k > m → ∞.
Consequently, (x(k) )∞ n
k=1 is a Cauchy sequence in R . Assuming (i), we con-
clude that there is a point x ∈ R such that limk→∞ x(k) = x. Since each x(k)
n
Remark 1.29.
(i) In examining the proof of Theorem 1.28, we find that the Euclidean dis-
tance formula was used only in proving that (i) implies (ii). Therefore we
conclude that the other four implications hold in any metric space.
(ii) For x ∈ Rn let X = Rn \ {x} and the distance between points in X
be the same Euclidean distance as in Rn . Now if (x(k) )∞ k=1 is a sequence that
converges to x in Rn and x(k) 6= x, then it is still a Cauchy sequence in X. But
because there is no point in X to which it converges, (x(k) ) is not convergent
in X. Thus X is not complete. Since completeness is implied by the other four
statements, it follows that all five are false.
(iii) Of course, the fact that all five statements in Theorem 1.28 are true
in X = Rn is quite dependent upon the distance formula. The use of the
distance formula in proving that (i) implies (ii) is very subtle. It is inherent
in the notions of boundedness and n-cubes that were used.
Since each point in X can be written as {x} = B1/2 (x), we see that every
singleton set is an open set in X. Also X = B2 (x) for any x ∈ X, so X is
bounded. Therefore X is a bounded and closed set, and ∪x∈X {x} is an open
cover of X that obviously cannot be reduced to a finite number of open sets.
We now assert that X is complete, because it is not hard to show that if
(x(k) )∞
k=1 is a Cauchy sequence, then there is an N such that x
(k)
= x(N )
(k) (N )
whenever k ≥ N . Therefore limk→∞ x = x , so X is complete even
though the Heine-Borel property does not hold.
(iii) Every open cover of K has a finite subcover; that is, if K ⊆ ∪α∈I Gα
where each Gα is an open subset of Rn , then there is a finite subset J of the
index set I such that K ⊆ ∪α∈J Gα .
In fact, suppose (i) holds. Then every convergent sequence in K converges
to a point in K. Using the fact that a subset E of Rn is closed if and only if
E equals its closure, we get that K is closed. If K is not bounded, then for
each k ∈ N, there is an xk ∈ K such that dn (xk , 0) > k. Consequently, no
subsequence of (xk )∞ k=1 can converge and so K is not compact, a contradiction.
Thus, (ii) holds.
If (ii) holds, then an application of Theorem 1.28 (i) yields (iii).
Finally, suppose (iii) holds. To verify (i), it suffices to prove that K is
bounded and closed since Theorem 1.28 (iv) implies that every sequence in
such K has a subsequence which converges to a point in K. To see that K is
bounded, set Gx = B1 (x). Then {Gx : x ∈ K} is an open cover of K and so
has a finite subcover, say, Gx1 , ..., Gxk . Let M = max{dn (xj , 0) : j = 1, ..., k}.
If x ∈ K, then x ∈ Gxj for some j and d(x, 0) ≤ dn (xj , 0) + 1 ≤ M + 1. Thus,
K is bounded. To show that K is closed, fix a point x ∈ Rn \K. For any y ∈ K,
there are disjoint open balls Uy and Vy centered at y and x, respectively. Note
that these two balls will generally depend on x. The family {Uy : y ∈ K}
is an open over of K and so has a finite subcover, say, Uy1 , ..., Uyk . Then
K ⊆ ∪kj=1 Uyj , which is disjoint from the open ball B = ∩kj=1 Vyj centered at
x. This ball B must be contained entirely in Rn \ K: Otherwise, B ∩ K 6= ∅
and hence B ∩ ∪kj=1 Uyj ⊇ B ∩K 6= ∅, a contradiction. It follows that Rn \K
is open and so K is closed.
Example 1.32. The family F = {(1/k, 2) : k ∈ N} is an open cover of (0, 1];
the family G = {(2−k , 2) : k ∈ N} is a subfamily of F. Each point of (0, 1]
belongs to (2−k , 2) for k sufficiently large, and so G is a subcover for (0, 1]. If
H is a finite subfamily of F, then there is a largest value of k, say k1 such that
(2−k1 , 2) ∈ H. Since 2−k1 −1 ∈ (0, 1] and does not belong to any element of H,
H is not a subcover for (0, 1]. A close look at this example reveals that the
problem lies in the fact that (0, 1] is not closed. Other examples, which the
reader should construct, show that similar problems can arise with unbounded
sets.
Remark 1.33. There is an important result generalizing the nested set prop-
erty – if F = {Kj }j∈I is a class of compact subsets of Rn , n ∈ N, and the
intersection of any finite subclass of F is nonempty then ∩j∈I Kj 6= ∅. To see
this, let K ∈ F be such that ∩(∩j∈I Kj ) = ∅. Then Ki ⊆ ∪i6=j∈I (Rn \ Kj ).
Since Ki is compact, there are finitely many indices j1 , ..., jk such that
logical spaces much more general than Rn , and thus serves as the basis for a
definition of the compactness in these more general spaces.
Theorem 1.35. Let (X, d) be a metric space and S ⊆ X. Then the following
statements are equivalent:
(i) S is compact;
(ii) Any infinite subset of S has at least one cluster point in S;
(iii) Any sequence of points in S has a subsequence which is convergent to a
point in S.
Proof. (i)⇒(ii) Suppose (i) is true. If (ii) is false, then there is an infinite
subset S of X that has no cluster point in S. Now for each x ∈ S, we can find
an open ball having x as center and containing only a finite number of points
of S. Note that S is compact and covered by all such open balls centered at
x. So it is subset of the union of a finite number of such open balls of which
each contains only a finite number of points of S. This implies that S is finite,
a contradiction.
(ii)⇒(iii) This implication is trivial.
(iii)⇒(i) Suppose (iii) is valid, but (i) is not true. Given > 0 and x1 ∈ S.
Choose x2 ∈ S such that d(x2 , x1 ) ≥ . For k = 1, 2 choose x3 ∈ S such
that d(x3 , xk ) ≥ . Continuing the previous selection, we can find xn+1 ∈ S
such that d(xn+1 , xk ) ≥ for k = 1, 2, ..., n and n ∈ N. This process can be
done because S is covered by all open balls centered at points in S of radius
but this cover has no a finite subcover thanks to the assumption that S is
not compact. According to (iii), (xn )∞ ∞
n=1 must possess a subsequence (xnj )j=1
which is convergent to a point in S, so d(xnj , xnk ) < whenever nj and nk
are big enough. This contradicts d(xnj , xnk ) ≥ .
Corollary 1.36. Let (X, d) be a complete metric space. Then a closed subset
S of X is compact if and only if S is totally bounded; that is, for any > 0 there
exist finitely many open balls {B (xj )}nj=1 in X such that S ⊆ ∪nj=1 B (xj ).
by any finite sub-collection of {Oα }α∈I . Since S is closed, from the total
boundedness of S it follows that for = 1 there is finitely many open balls
{B1 (xj )}nj=1
1
such that S = ∪nj=1 1
S ∩ B 1 (xj ). According to the assumption,
one of closed sets in {S ∩ B 1 (xj )}nj=1
1
, say, S1 = S ∩ B 1 (x1 ), cannot be covered
by finitely many elements of {Oα }α∈I . Do the same with = 1/2 and S1 in
place of = 1 and S respectively, and continue this process. The result is a
sequence of closed sets (Sk )∞k=1 such that:
(a) S ⊃ S1 ⊃ S2 ⊃ · · ·;
(b) supx,y∈Sk d(x, y) ≤ k −1 ;
(c) each Sk cannot be covered by finitely many elements of {Oα }α∈I .
Choose xk ∈ Sk . By (a), (b) and the fact that X is complete and each Sk
is closed, (xk )∞k=1 is a Cauchy sequence which is convergent to an element
x ∈ ∩∞ k=1 kS . This implies x ∈ Sk ⊂ Oα for some α ∈ I and for sufficiently
large k ∈ N, contradicting (c). Therefore, S must be covered by finitely many
elements of {Oα }α∈I , i.e., S is compact.
One example of a compact metric space will give us many more, by means
of the following result.
Proof. From Theorem 1.35 (iii) it follows that any Cauchy sequence has a
convergent subsequence and therefore is itself convergent. Thus, (X, d) is com-
plete.
(i) Let S be closed. Then S c is open. If S ⊆ ∪α∈I Uα where Uα ⊆ X is
open for each α ∈ I. Then X ⊆ (∪α∈I Uα ) ∪ S c . Since X is compact, we can
find a finite subset J ⊆ I such that X ⊆ (∪α∈J Uα ) ∪ S c . Hence S ⊆ ∪α∈J Uα .
This shows that S is compact.
(ii) If ∩j∈N Sj = ∅, then X = ∅c = ∪j∈N Sjc . Since X is compact, there is a
finite number of the open subsets {Sicj }Jj=1 such that X = ∪Jj=1 Sicj . Note that
since S1c ⊆ S2c ⊆ S3c ⊆ ..., we must have X = Skc for some k, which produces
the contradiction Sk = X c = ∅.
Remark 1.38. Here, we point out that Theorem 1.37 (ii) does not hold if the
word “compact” is replaced by “complete” in the hypothesis – for example,
X = R and Sk = {x ∈ R : x ≥ k}, k ∈ N.
Next we deal with the density and the separability of metric spaces.
Example 1.41.
(i) For n ∈ N, Qn is a countable dense subset of Rn . This fact can be deduced
from the countability of Q. To show that Qn is dense in Rn , let B (x) be any
open ball of radius > 0 and center x = (x1 , .., xn ) ∈ Rn and use the density
of Q in R to choose rational numbers r1 , ..., rn such that
√
|xj − rj | < / n for j = 1, ..., n.
Then
n
X 1/2
dn (x, r) = |xj − rj |2 < and so r = (r1 , ..., rn ) ∈ Qn ∩ B (x).
j=1
∞
(ii)
P∞ Let2 `2 and `∞ be the spaces of real-valued sequences (xk )k=1 with
j=1 xj < ∞ and supk∈N |xk | < ∞, respectively. For two sequences x =
(xj )∞ ∞
j=1 and y = (yj )j=1 , define
P∞ 1
2 2
dp (x, y) = j=1 |x j − yj | , p=2
1≤j<∞ |xj − yj | , p = ∞.
sup
Then (`2 , d2 ) and (`∞ , d∞ ) are metric spaces, but the former is separable
and the latter is not separable. It is enough to check the statement after the
but. Let S be the set of all sequences of rational numbers that are eventually
0. If > 0 is given, then for any x = (xj )∞ j=1 ∈ `2 there is some N ∈ N
P∞ 2 2
such that j=N |x j | < (/2) ; at the same time, for each j ∈ {1, ..., N }
√
there is an rj ∈ Q such that |rj − xj | < /(2 N ). Accordingly, the sequence
r = (r1 , ...., rN −1 , 0, 0, 0, 0....) ∈ S satisfies
1.3 Compactness, Density and Separability 17
−1
NX 12 ∞
X 21
d2 (x, r) ≤ |xj − rj |2 + x2j < .
j=1 j=N
This implies that S is dense in `2 . As for the latter, given n ∈ N let x(n) =
(n) (n) (n)
(x1 , x2 , · · · , xj , · · ·) be an element in `∞ . Define x = (x1 , · · · , xj , · · ·),
where (
(j) (j)
xj + 1 , |xj | ≤ 1
xj = (j)
0 , |xj | > 1.
Thus x ∈ `∞ . Moreover,
where a = t0 < t1 < · · · < tn = b is a partition of [a, b], and the sk , tk are
rational (with the possible exceptions of t0 = a and tn = b). The set S is
enumerable. Suppose g ∈ C[a, b]. For any > 0, there is a δ > 0 such that
|t − s| < δ ⇒ |g(t) − g(s)| < .
4
Let a = t0 < t1 < · · · < tn = b be a partition of [a, b] with t1 , · · · , tn−1 rational
and such that max0≤k≤n−1 (tk+1 − tk ) < δ. Let s0 , · · · , sn be rational numbers
such that max0≤k≤n |g(tk ) − sk | < 4 . Then for tk ≤ t ≤ tk+1 we have
tk+1 − t t − tk
f (t) − g(t) = (sk − g(t)) + (sk+1 − g(t)),
tk+1 − tk tk+1 − tk
and hence
|f (t) − g(t)|
≤ |sk − g(tk )| + |g(tk ) − g(t)| + |sk+1 − g(tk+1 )| + |g(tk+1 ) − g(t)| < .
∞ Nn
set S = ∪n=1 ∪j=1 {xj } is dense in X. As a matter of fact, if x ∈ X then
x ∈ B1/n (xk ) for some k ∈ {1, ..., Nn }. Now for any > 0 there exists an
n ∈ N such that n−1 < and hence xk ∈ B1/n (x) ⊆ B (x), i.e., B (x)∩S 6= ∅.
Therefore, X is separable.
Theorem 1.43. For n ∈ N let X = Rn be equipped with the distance dn (·, ·).
(i) There is a countable collection B of open balls in X such that any open
subset of X can be expressed as the union of some subcollection of B.
(ii) If S is any subset of X and {Aµ }µ∈I is a collection of open sets whose
union contains S, then there is a countable subcollection {Aµk }∞ k=1 whose
union contains S.
Proof. (i) As a matter of fact, if B stands for the collection of all open balls
Br (q) with r > 0 being a rational number and q ∈ Qn , then for a given open
set A we can consider the subcollection BA consisting of those open balls such
that Br (q) ⊆ A. Their union is obviously contained in A, and the collection
is countable because both Qn and Q are countable. Therefore it remains only
to show that A is contained in the union of this collection. If x is a point
in A, then there is an open ball about x with rational radius r such that
Br (x) ⊆ A. Since Qn is dense, there is a point q of Qn in Br/2 (x). Then x
is in Br/2 (q), and Br/2 (q) is in BA because it is contained in Br (x), which
lies in A. This last assertion is verified by observing that if y ∈ Br/2 (q), then
dn (x, y) ≤ dn (x, q) + dn (q, y) < r. We have shown that an arbitrary point
x ∈ A is contained in some open ball in BA , so A is contained in the union of
all balls in BA .
(ii) Suppose S ⊆ ∪µ∈I Aµ . By (i), each of the open sets Aµ can be written
as the union of all the balls Brµk centered at the points of Qn that have rational
radii and are contained in Aµ , say, Aµ = ∪∞ µ
k=1 Brk . If x ∈ S then there is some
rk such that x ∈ Brk ⊆ Aµ . For each of the open balls {Brµk }, choose just
µ
one Aµ that contains Brµk and call it Aµrk . The subcollection of all such Aµrk
is countable because there is only one Aµrk for each Brµk . Also, the union of
this subcollection contains the union of the Brµk ’s, which in turn contains S.
Hence there is a countable subcollection of {Aµ }µ∈I whose union contains S.
Remark 1.44. It is worth remarking that the hypothesis of Theorem 1.43 (ii)
makes no restrictive assumption about D. That is the strength and generality
1.3 Compactness, Density and Separability 19
which implies mid (K) = 0. If (K)◦ = K ◦ were not empty, then K would
contain a finite open interval (a, b), then b − a ≤ mid (K) = 0, and hence
a = b, a contradiction. Thus, K ◦ = ∅.
(ii) Q is of first category in R – this follows from the definition.
(iii) R is of second category in itself – this assertion is not obvious, and in
fact it can only be deduced as a consequence of the so-called Baire category
theorem (named after René-Louis Baire) as follows.
Theorem 1.47. Given a metric (X, d). If X is complete, then it is of second
category, and hence the complement of a first category set in X is of second
category and dense in X
20 1 Metric Spaces
Problems
1.1. For each of the following functions on R2 , determine which of the prop-
erties (i), (ii), and (iii) in Definition 1.1 are satisfied:
(i) d(x, y) = |x1 − y1 | + |x2 − y2 |;
(ii) d(x, y) = max{|x1 − y1 |, |x2 − y2 |};
(iii)
((x1 − y1 )2 + (x2 − y2 )2 )1/2 , x2 ≥ y2
d(x, y) =
|x1 − y1 | + |x2 − y2 | , x2 ≤ y2 .
1.2. Prove thatPif x(1) , ..., x(m) are points in a metric space (X, d), then
m
d(x(1) , x(m) ) ≤ j=2 d(x(j) , x(j−1) ).
1.3. Given n ∈ N. P
n
(i) Let d∗n (x, y) = k=1 |xk − yk | for x, y ∈ Rn . Prove that d∗n satisfies (i),
(ii), and (iii) of Definition 1.1. In addition, show that dn (x, y) ≤ d∗n (x, y) for
x, y ∈ Rn .
(ii) Let d∗,n (x, y) = max1≤k≤n |xk −yk | for x, y ∈ Rn . Prove that d∗,n satisfies
(i), (ii), and (iii) of Definition 1.1. Discuss the relationship between dn and
d∗,n .
1.4. Prove that A◦ and A are the largest open subset of A and the smallest
closed set containing A, respectively.
1.5. In R2 , show that the closure of B1 (0) is B 1 (0).
1.6. Suppose (X, d) is a metric space. Prove that if S is a finite subset of X,
then S is disconnected.
1.7. Prove that, if f, g > 0 on R so that their graphs
1.9. Let X be the interval (0, 1) in R with the usual distance |x − y|. Find a
Cauchy sequence in X that does not converge in X.
1.11. Prove that if X is an infinite set and its distance function d is given by
1 , x= 6 y
d(x, y) =
0 , x = y,
1.13. Let X be a metric space with the distance function d. Show that if
(x(k) )∞
k=1 is a Cauchy sequence in X and has a subsequence (x
(kj ) ∞
)j=1 that
converges to a point x ∈ X, then d(xk , x) → 0.
1.18. Let (X, d) be a metric space and limn→∞ d(xn , x0 ) = 0. Prove that the
set {xj }∞
j=0 is compact.
2
Continuous Maps
Since every metric space is automatically a topological space with the set
of all open sets as the metrizable topology, we have a notion of continuous
mappings between metric spaces. Note that a function between topological
spaces is said to be continuous if the inverse image of every open set is open
– this is an attempt to capture the intuition that there are no breaks or
separations in the function. Without referring to the topology, this notion
can also be directly defined using limits of sequences. In this chapter, we
consider properties of continuous mappings, contractions with straightforward
applications to integral and differential equations, and equivalence of metric
spaces.
Definition 2.1. Let (X, dX ) and (Y, dY ) be two metric spaces, let f : X → Y
be a mapping, and let x0 ∈ X. Then f is said to be continuous at x0 if, given
any > 0, there exists a δ > 0 such that if x ∈ X and dX (x, x0 ) < δ, then
dY f (x), f (x0 ) < . Moreover, if f is continuous at all points in X, then f
is said to be continuous on X.
to the notation δ rather than δ() for notational simplicity, always bearing in
mind that each must have its own δ.
Example 2.2.
(i) The mapping f : R → R given by f (x) = x2 is continuous.
(ii) Let X be any metric space with the distance d, and x0 a fixed point in
X. Then the mapping f : X → R given by f (x) = d(x, x0 ) for all x ∈ X
is continuous. This follows from a use of the triangle inequality of d. The
special case X = R and x0 = 0 shows that the absolute value function |x| is
continuous.
(iii) If f : X → Y is continuous and S is a subspace of X, then the restriction
of f to S is continuous on S. This is clear from Definition 2.1. One of the
curious consequences of Definition 2.1 occurs at a point in x0 ∈ S that is not
a cluster point of S. Since x0 is an isolated point of S, there is an open ball
Bδ (x0 ) that contains no point of S other than x0 . Accordingly, the mapping
is always continuous at x0 .
Then there exists an m×n real-valued matrix A = [akj ] such that y = T (x) =
Ax, i.e.,
x1 y1
· ·
T · = · ,
· ·
xn ym
where
n
X
yj = ajk xk , akj ∈ R, k = 1, ..., n and j = 1, ..., m.
k=1
Thus T is continuous.
To prove this result, we first consider the case m = 1, and see that T :
Rn → R is a function. Let e(k) , k = 1, ..., n be the standard basis vectors
n
of RP : Its kth coordinate is 1 and others are 0. And set Ak = T (e(k) ). Since
n
x = k=1 xk e(k) , by the definition of T we have
n
X n
X
T (x) = xk T (e(k) ) = x k Ak .
k=1 k=1
This set of m linear functions f1 , ..., fm determines the m rows of the matrix
[ajk ]. Thus T (x) = Ax.
In order to prove the continuity of T , let
X n
m X 1/2 l
X 1/2
MT = a2jk ; ||x||l = x2k ; dl (x, y) = ||x − y||l
j=1 k=1 k=1
for
x1 y1
· ·
l
· and y = · in R .
x=
· ·
xl yl
Then !1/2
n
X 2 n
X 2
||T (x)||m = a1k xk + ... + amk xk .
k=1 k=1
and hence ||T (x)||m ≤ MT ||x||n . Consequently, for any fixed point x0 ∈ Rn
we have
26 2 Continuous Maps
dm T (x), T (x0 ) = ||T (x) − T (x0 )||m ≤ MT dn (x, x0 ).
If > 0 is given, then we can define δ = 1+MT , and this implies that
dm T (x), T (x0 ) < whenever dn (x, x0 ) < δ.
Remark 2.4. Note that the choice of δ above does not depend on x0 where the
continuity is established. Thus we conclude that linear mappings, like linear
functions on R, are uniformly continuous on Rn .
Theorem 2.5. Let (X, dX ) and (Y, dY ) be two metric spaces and let f : X →
Y be a mapping. Then f is continuous if and only if for every open subset V
of Y the inverse image f −1 (V ) = {x ∈ X : f (x) ∈ V } is an open subset of
X.
the open ball in X of center x0 and some radius δ > 0. Thus if x ∈ X and
dX (x, x0 ) < δ, then dY f (x), f (x0 ) < . This means that f is continuous at
x0 , and this completes the proof.
Theorem 2.7. Let (X, dX ) and (Y, dY ) be two metric spaces and let f : X →
Y be a mapping. Then f is continuous if and only if for every sequence
∞ (xk )∞
k=1
of points in X that converges to x0 ∈ X, the sequence f (xk ) k=1 converges
to f (x0 ) in Y .
Proof. Suppose first that f is continuous at x0 ∈ X and that (xk )∞ k=1 con-
∞
verges to x0 in X. We have to show that f (xk ) k=1 converges to f (x0 ) in Y .
Given > 0, the continuity of f at x0 implies that there is a δ > 0 such that
dY f (x), f (x0 ) < whenever x ∈ X and dX (x, x0 ) < δ. Because (xk )∞ k=1
converges to x0 , there exists a positive integer N such that dX (xk , x0 ) < δ
for all k > N . Hence if k > N then dY f (xk ), f (x0 ) < , which shows that
∞
f (xk ) k=1 converges to f (x0 ).
We now suppose the statement after the if and only if is valid. If f is
not continuous at x0 , then there is some 0 > 0 such that for no number
δ > 0 is it true that whenever x ∈ X and dX (x, x0 ) < δ then necessarily
dY f (x), f (x0 ) < . Hence for any k ∈ N we can find a point xk ∈ X such
that dX (xk , x0 ) < 1/k and dY f (xk ), f (x0 ) ≥ 0 . Since dX (xk , x0 ) < 1/k,
∞
the sequence (xk )∞ k=1 converges to x0 . However f (xk ) k=1 does not converge
to f (x0 ) since dY f (xk ), f (x0 ) ≥ 0 for all k. This completes the proof.
(k) ∞
Then as (x(k) )∞
k=1 approaches 0 along the axes, we see that f (x ) k=1 con-
verges to 0. But if x(k) = (1/k, 1/k), then f (x(k) ) = 1/2. Hence f is not
continuous at 0.
Theorem 2.9. Let (X, dX ) and (Y, dY ) be two metric spaces and let f : X →
Y be a continuous mapping. If X is compact, so is its image f (X).
Corollary 2.10. Let (X, dX ) and (Y, dY ) be two metric spaces and let f :
X → Y be a continuous mapping. If X is compact, then f is bounded.
Corollary 2.11. Let (X, dX ) be a nonempty compact metric space and let
f : X → R be continuous. Then
(i) sup{f (x) : x ∈ X} = M and inf{f (x) : x ∈ X} = m are finite.
(ii) There are points x, y ∈ X such that f (x) = M and f (y) = m.
Proof. (i) By Theorem 2.9 it follows that f (X) is a compact subset of R, hence
closed and bounded. Since f (X) is not empty, f (X) has both a greatest and
a least element.
To prove (ii), by the definition of M we derive that for each k ∈ N there
is xk ∈ X such that M − 1/k < f (xk ) ≤ M. Since X is compact, (xk )∞ k=1 has
a convergent subsequence (xkj )∞ j=1 , which converges to a point x ∈ X. Since
f is continuous, f (x) = limj→∞ f (xkj ) = M. The argument for the existence
of the point y such that f (y) = m is similar.
Definition 2.12. Let (X, dX ) and (Y, dY ) be two metric spaces and let f :
X → Y be a mapping. Then f is said to be uniformly continuous if, given
exists a δ > 0 such that if x, y ∈ X and dX (x, y) < δ then
any > 0, there
dY f (x), f (y) < .
Theorem 2.13. Let (X, dX ) and (Y, dY ) be two metric spaces and let f : X →
Y be a continuous mapping.
∞ If X is compact, then f is uniformly continuous;
equivalently, f (xj ) j=1 is a Cauchy sequence in Y whenever (xj )∞ j=1 is a
Cauchy sequence in X.
2.2 Continuous Maps on Compact or Connected Spaces 29
In fact, since [a, b] is connected, so is f ([a, b]). Note that any open subset
of R which contains a and b and does not contain some point between a and
b is not connected: for suppose that a < c < b and S is an open subset of R
with a, b ∈ S, c ∈/ S, then
Example 2.18.
(i) Let fk : [0, 1] → R be given by fk (x) = x − x/k. Then limk→∞ fk (x) = x.
(ii) Let fk : [0, 1] → R be given by fk (x) = xk . Then limk→∞ fk (x) = f (x)
where f (x) = 0 or 1 if x ∈ [0, 1) or x = 1. Note that fk is continuous, but f
is not.
Definition 2.19. Let (X, dX ) and (Y, dY ) be two metric spaces, for k ∈ N
let fk : X → Y be a mapping, and let f : X → Y be another mapping. Then
we say that (fk )∞
k=1 converges uniformly to f on X if, given any > 0, there
is an N ∈ N such that dY fk (x), f (x) < whenever k > N for all x ∈ X. If
the restrictions of (fk )∞
k=1 to a certain subset S of X converge uniformly to
some mapping on S, we say that (fk )∞ k=1 converges uniformly on S.
Theorem 2.20. Let (X, dX ) and (Y, dY ) be two metric spaces, let Y be com-
plete, and for k ∈ N let fk : X → Y be a mapping. Then (fk )∞ k=1 converges
uniformly to f on X when and only when for any > 0 there is an N ∈ N
such that dY fk (x), fl (x) < whenever k, l > N for all x ∈ X.
2.3 Sequences of Mappings 31
Theorem 2.21. Let (X, dX ) and (Y, dY ) be two metric spaces and let (fk )∞ k=1
be a uniformly convergent sequence of continuous mappings from X into Y .
Then f = limk→∞ fk is continuous.
Proof. Fix x0 ∈ X. Let > 0. Take an N ∈ N such that dY fk (x), f (x) < /3
for all x ∈ X, which is possible by the uniform convergence. Since each fk is
continuous at x0, there is a δ > 0 such that if x ∈ X and dX (x, x0 ) < δ then
dY fk (x), fk (x0 ) < /3. Hence if x ∈ X and dX (x, x0 ) < δ we have
dY f (x), f (x0 ) ≤ dY f (x), fk (x) + dY fk (x), fk (x0 ) + dY fk (x0 ), f (x0 )
< .
Lemma 2.22. Let (X, dX ) and (Y, dY ) be two metric spaces and let f1
and f2 be continuous
mappings from X into Y . Then the function x →
dY f1 (x), f2 (x) is continuous on X.
Proof. This follows from the triangle inequality based estimate below:
32 2 Continuous Maps
|dY f1 (x), f2 (x) − dY f1 (x0 ), f2 (x0 ) |
≤ |dY f1 (x), f2 (x) − dY f1 (x), f2 (x0 ) |
+|dY f1 (x), f2 (x0 ) − dY f1 (x0 ), f2 (x0 ) |
≤ dY f2 (x), f2 (x0 ) + dY f1 (x), f1 (x0 ) .
Now we consider the set C(X, Y ) of all continuous mappings from X into
Y . If X is compact, then it is clear that for f1 , f2 ∈ C(X, Y ) the following
notion
dC(X,Y ) (f1 , f2 ) = max{dY f1 (x), f2 (x) : x ∈ X}
makes sense since any continuous real-valued function on a compact metric
space attains a maximum. Moreover, it is easy to prove that C(X, Y ) is a
metric space under dC(X,Y ) (·, ·). However, it is abstract in the sense that its
points are mappings on another metric space. A sequence of points in C(X, Y )
is a sequence of continuous mappings (fk )∞ k=1 from X into Y . This sequence
will converge to a point f ∈ C(X, Y ) if and only if limk→∞ dC(X,Y ) (fk , f ) = 0;
in other words, if and only if for each > 0 there is an N ∈ N such that for
any integer k > N one has dC(X,Y ) (fk , f ) < ; that is, dY fk (x), f (x) <
for all x ∈ X. Equivalently, (fk )∞
k=1 converges uniformly to f .
An application of the last two theorems gives the following result cov-
ering the well-known Arzela-Ascoli theorem which comes about as this is a
generalization going back to Cesare Arzelá’s theorem – if fn is a uniformly
bounded sequence of Riemann integrable functions that converge pointwise to
a Riemann integrable function f on the finite interval [a, b], then
Z b Z b
f (x)dx = lim fn (x)dx.
a n→∞ a
Theorem 2.23. Let (X, dX ) and (Y, dY ) be two metric spaces and let X be
compact and Y complete. Then
(i) C(X, Y ) is a complete metric space with respect to the distance dC(X,Y ) (·, ·).
(ii) A subset E of C(X, Y ) is compact if and only if E is bounded, closed, and
equi-continuous in the sense of that for any > 0 there is a δ > 0 such that
dX (x1 , x2 ) < δ ⇒ dY (x1 ) − f (x2 ) < for all f ∈ E.
Proof. (i) Suppose (fk )∞k=1 ⊂ C(X, Y ) is a Cauchy sequence. Then for any
> 0 there is an N ∈ N such that if k, l > N then dC(X,Y ) (fk , fl ) < ; that
is, dY fk (x), fl (x) < for all x ∈ X. Since Y is complete, Theorem 2.20 is
applied to derive that (fk )∞ k=1 converges uniformly on X to some mapping
f : X → Y . However, Theorem 2.21 tells us that f is continuous. Thus
f ∈ C(X, Y ) and limk→∞ fk = f in the sense of points of the metric space
C(X, Y ), dC(X,Y ) . Thus C(X, Y ) is complete under dC(X,Y ) (·, ·).
(ii) Let E be a subset of C(X, Y ). If E is compact, then E is automati-
cally bounded and closed. Moreover, E is totally bounded by Theorem 2.23
2.3 Sequences of Mappings 33
and Corollary 1.36, and hence for > 0 there are finite many open balls
C(X,Y ) Sn C(X,Y )
{B/3 (fk )}nk=1 of C(X, Y ) such that E ⊆ j=1 B/3 (fk ). Since each
fk : X → Y is uniformly continuous, there exists a δk > 0 such that
dX (x1 , x2 ) < δk ⇒ dY fk (x1 ), fk (x2 ) < /3.
Let δ = min{δk : k = 1, 2, ..., n}. Then for a given f ∈ E, choose an fk such
that dC(X,Y ) (f, fk ) < /3. Hence, dX (x1 , x2 ) < δ yields
dY f (x1 ), f (x2 )
≤ dY f (x1 ), fk (x1 ) + dY fk (x1 ), fk (x2 ) + dY fk (x2 ), f (x2 ) < .
Conversely, assume the statement after the if and only if. Since E is closed
and C(X, Y ) is complete, according to Corollary 1.36 and Theorem 1.35 it
remains to check that each sequence (fk )∞k=1 in E has a Cauchy subsequence.
∞
Since X is separable, there is a sequence
∞ (xk )k=1 which is dense in X. Note
that E is bounded and then fk (x1 ) k=1 is bounded in the complete space Y .
(1) ∞
So this sequence has a convergent subsequence, denoted fk (x1 ) k=1 . Simi-
(2) (1) ∞ (2) ∞
larly, suppose (fk )∞k=1 is a subsequence of (fk )k=1 such that fk (x2 ) k=1
(j)
converges. In this way, we can inductively obtain subsequence (fk )∞ k=1 of
∞ (j) (k) ∞
(fk )k=1 such that limk→∞ fk (xl ) exists for l = 1, 2, ..., j. Now (fk )k=1 en-
(k) ∞ (k)
sures that fk (xj ) k=1 is convergent for each j. Since fk ∈ C(X, Y ), for
> 0 there is a δ > 0 such that
(k) (k)
dX (x, y) < δ ⇒ dY fk (x), fk (y) < /3.
Also, note that (xk )∞ k=1 is dense in the compact space X. So, it follows that
X = ∪N X X
l=1 Bδ (xl ) for some natural number N , where Bδ (xl ) is the open ball
(k)
in X centered at xl with radius δ. Furthermore, the definition of fk yields a
natural number M such that
(m)
(xj ), fn(n) (xj ) < /3, j = 1, 2, ..., N.
m, n > M ⇒ dY fm
Finally, upon choosing xj such that x ∈ BδX (xj ) for any x ∈ X, we get that
if m, n > M then
(m)
(x), fn(n) (x)
d Y fm
(m) (m) (m)
(xj ), fn(n) (xj ) + dY fm
(m)
(xj ), fn(n) (x)
≤ d Y fm (x), fm (xj ) + dY fm
< ,
(k)
and consequently, (fk )∞
k=1 is convergent to an element of E and hence E is
compact.
Remark 2.24. Theorem 2.23 extends (from a closed interval to a compact met-
ric space) Guido Ascoli’s theorem – every bounded equi-continuous sequence
of real-valued functions on the unit interval [0, 1] has a uniformly convergent
subsequence.
34 2 Continuous Maps
2.4 Contractions
In general, a fixed point of a mapping f from a set X to itself is a point
x0 ∈ X such that f (x0 ) = x0 . For example, 0 is a fixed point of the mapping
f (x) = x2 + x from R to R. But, not all mappings have fixed points – for
instance, the mapping f (x) = x+1 has no fixed point on R. So, what mapping
has a fixed point is an interesting question. In what follows, we introduce an
important tool – the so-called Banach fixed point theorem (named after Stefan
Banach) – which ensures the existence and uniqueness of fixed points of certain
self mappings of metric spaces, but also provides a constructive method to find
those fixed points.
Definition 2.25. Let (X, dX ) be a metric space. A mapping f from X to
itself is a contraction if there is an α ∈ (0, 1) such that
dX f (x1 ), f (x2 ) ≤ αdX (x1 , x2 ) for all x1 , x2 ∈ X.
A contraction mapping contracts or shrinks the distance between points
by the factor α. Clearly, any contraction map is uniformly continuous on X.
This property actually suggests the following Banach’s theorem on contraction
mappings and fixed points.
Theorem 2.26. Let (X, dX ) be a complete metric space. If f : X → X is a
contraction mapping, then f has a unique fixed point.
Proof. Let x0 ∈ X. Define xk+1 = f (xk ) for k ∈ N ∪ {0}. We claim that
(xk )∞
k=1 is a Cauchy sequence in X. In fact
dX (x2 , x1 ) = dX f (x1 ), f (x0 ) ≤ αdX (x1 , x0 ),
and so
dX (x3 , x2 ) = dX f (x2 ), f (x1 ) ≤ αdX (x2 , x1 ) ≤ α2 dX (x1 , x0 ).
Remark 2.27. The above proof is constructive in the sense that the fixed point
is the limit of the iterates given by xk+1 = f (xk ), where the initial point x0
is an arbitrary point in X. The previous estimate gives the rapidity of the
convergence xk → x0 :
αj
dX (x, xj ) ≤ dX f (x0 ), x0 .
1−α
Below is an interesting example which shows an application of Theorem
2.26 in integral equations.
dC[a,b] (f1 , f2 ) = max |f1 (x) − f2 (x)| for all f1 , f2 ∈ C[a, b].
x∈[a,b]
Solving the desired equation is equivalent to showing that T has a fixed point.
Now
dC[a,b] T (f1 ), T (f2 ) ≤ |λ| max |k(x, y)|(b − a)dC[a,b] (f1 , f2 ).
x,y∈[a,b]
Thus, if |λ| maxx,y∈[a,b] |k(x, y)|(b − a) < 1, then T is a contraction and, there-
fore, has a unique fixed point by Theorem 2.26.
As a consequence of Theorem 2.26, we can obtain a more general fixed
point theorem.
Example 2.31. Consider the following Volterra integral equation (VIE), named
after Vito Volterra:
Z x
f (x) = λ k(x, y)f (y)dy + φ(x),
a
for which the notations are the same as in Example 2.28. If T : C[a, b] →
C[a, b] is defined by
Z x
T (f )(x) = λ k(x, y)f (y)dy + φ(x),
a
To see this assertion, it is enough to show that T in the above has a unique
fixed point in C[a, b]. A simple computation implies that if f1 , f2 ∈ C[a, b] and
M = maxx,y∈[a,b] |k(x, y)| then
2 (x − a)2
|T(2) (f1 )(x) − T(2) (f2 )(x)| ≤ |λ|M dC[a,b] (f1 , f2 ) ,
2
and
n (x − a)n
|T(n) (f1 )(x) − T(n) (f2 )(x)| ≤ |λ|M dC[a,b] (f1 , f2 ) .
n!
Hence
n
|λ|(b − a)M
dC[a,b] T(n) (f1 ), T(n) (f2 ) ≤ dC[a,b] (f1 , f2 ).
n!
2.4 Contractions 37
n
Because limn→∞ |λ|(b − a)M /n! = 0, T(n) is a contraction mapping for
large n and, therefore, has a unique fixed point for any value of the parameter
λ by Corollary 2.29.
As a consequence of Remark 2.27, we consider a family of contractions
that vary continuously with respect to a parameter and prove that the corre-
sponding fixed points also vary continuously.
Corollary 2.32. Let (X, dX ) be a complete metric space and let (Y, dY ) be a
metric space. Suppose f : Y × X → X is such that f (·, x) is continuous on Y
for any point x ∈ X and there exists α ∈ (0, 1) such that
dX f (y, x1 ), f (y, x2 ) ≤ αdX (x1 , x2 ) for all y ∈ Y and x1 , x2 ∈ X.
For y ∈ Y let xy ∈ X be the unique fixed point of the contraction f (y, ·). Then
the mapping y → xy is continuous from Y to X.
Then, there exists a δ ∈ (a, b] such that the initial value problem (IVP):
du(t)
u(a) = y0 ∈ Rn
= f t, u(t) ,
dt
has a unique solution in the interval [a, δ].
38 2 Continuous Maps
Proof. Clearly, solving this IVP is equivalent to solving the integral equation
Z t
u(t) = y0 + f s, u(s) ds.
a
So, it is our aim to show the existence and uniqueness of a solution to this
integral equation.
Assuming that B r (y0 ) ⊆ D is the closed ball centered at y0 with radius
r > 0, we have
M= max kf (t, y)kn < ∞.
(t,y)∈[a,b]×B r (y0 )
Now choose δ ∈ (a, b] such that (δ − a)κ < 1 and (δ − a)M ≤ r. Set S be
the class of continuous mappings from [a, δ] to B r (y0 ). It is clear that S is
complete under the distance defined by
dS (φ1 , φ2 ) = max dn φ1 (t), φ2 (t) for all φ1 , φ2 ∈ S.
t∈[a,δ]
Define a mapping F : S → S by
Z t
F (φ)(t) = y0 + f s, φ(s) ds.
a
This implies F (φ) ∈ S. Now, our problem amounts to showing that F has a
unique fixed point in S. Thus, it suffices to verify that F is a contraction. To
do so, we note that t ∈ [a, δ] and φ1 , φ2 ∈ S imply
Z t
dn F (φ1 )(t), F (φ2 )(t) ≤
f s, φ1 (s) − f s, φ2 (s)
ds
n
a
≤ (δ − a)κdS (φ1 , φ2 ),
yielding
dS F (φ1 ), F (φ2 ) ≤ (δ − a)κdS (φ1 , φ2 ).
Accordingly, the assertion follows.
Remark 2.34.
(i) Note that any existence theorem for the nonlinear IVP in the above must
be local in nature. For example,
y 0 (t) = y 2 (t), y(1) = −1
has the solution y(t) = −1/t, which is not defined at t = 0 even though
f (t, y) = y 2 is satisfied with the required condition on D ⊆ R which is
assumed to be open and bounded.
2.5 Equivalence of Metric Spaces 39
(ii) If the Lipschitz condition is dropped, then it is still possible for the IVP
to have one and even many more solutions – for instance, the IVP:
Definition 2.35. Let (X, dX ) and (Y, dY ) be two metric spaces. Then they
are called:
(i) Topologically isomorphic or homeomorphic provided there exists a home-
omorphism between them – a mapping f : X → Y which is bijective and
continuous, and has continuous inverse;
(ii) Uniformly isomorphic provided there exists a uniform isomorphism be-
tween them – a mapping f : X → Y which is bijective and uniformly contin-
uous, and has uniformly continuous inverse;
(iii) Isometrically isomorphic provided
there exists a bijective mapping f such
that f : X → Y and dY f (x1 ), f (x2 ) = dX (x1 , x2 ) for all x1 , x2 ∈ X;
(iv) Similar provided there exists a positive constant κ > 0 and a bijective
mapping f such that f : X → Y and dY f (x1 ), f (x2 ) = κdX (x1 , x2 ) for all
x1 , x2 ∈ X.
(v) Equivalent provided as sets X = Y and there exist two positive constants
κ1 and κ2 independent of all x1 , x2 ∈ X such that
Example 2.36.
(i) Let f : [− π2 , π2 ] → [−1, 1] be defined by f (x) = sin x. Then it is a uniform
isomorphism, and hence any two finite closed subintervals of R are uniformly
isomorphic.
40 2 Continuous Maps
and let
d∞ (x, y) = kx − yk∞ = max{|xk − yk | : k = 1, · · · , n}.
Then
dn (x, y) ≤ d∞ (x, y) ≤ ndn (x, y).
This means that (R , dn ) and (Rn , d∞ ) are equivalent. Of course, dn and d∞
n
define the same notions of continuity and convergence and do not need to
be distinguished for most purposes. In other words, the identity mapping is
uniformly isomorphic from (Rn , dn ) to (Rn , d∞ ).
(iii) In (Rn , dn ), an isometrically isomorphic mapping is a rotation (the move-
ment of a body in such a way that the distance between a certain fixed point
and any given point of that body remains constant), a reflection (to invert a
geometric figure, respect to a line or plane, but not a point), or a translation
(to move every point by a fixed distance in the same direction).
(iv) (Rn , dn ) is similar to itself. This is because the mapping f (x) =
n
κx + x0 , where κ > 0 is constant and x0n ∈ R , is bijective and satisfies
dn f (x), f (y) = κdn (x, y) for any x, y ∈ R .
Both a uniform isomorphism and an isometric isomorphism are certainly
homeomorphisms which are not only open mappings (from open sets to open
sets) but also closed mappings (from closed sets to closed sets). Intuitively,
a homeomorphism not only maps points in the first object that are close
together to points in the second object that are close together, but also sends
points in the first object that are not close together to points in the second
object that are not close together. Topology is the study of those properties
of objects that do not change when homeomorphisms are applied. Below is a
classical and important result.
Theorem 2.37. For n + 1 ∈ N let Rn+1 be equipped with the Euclidean
distance v
un+1
uX
dn+1 (x, y) = kx − ykn+1 = t (xk − yk )2
k=1
Note that dn+1 (h(y), 0) = 1 and xn+1 6= 1. So, h(Rn ) ⊆ Sn \ {(0, 0, · · · , 1)}.
It is easy to see that both f and h are continuous and satisfy f ◦ h(y) = y
and h ◦ f (x) = x. Thus, h is the inverse of f , and f is a homeomorphism of
Sn \ {(0, · · · , 1)} onto Rn .
Regarding the compactness of Sn , we define a function F : Rn+1 → R
via F (x) = kxkn+1 . The triangle inequality for dn+1 yields that F is con-
tinuous. With this and the closedness of the single point {1}, we see that
Sn = F −1 ({1}) is closed. Since dn+1 (0, x) = 1 for x ∈ Sn , Sn is bounded.
Being closed, Sn is therefore compact.
Problems
2.2. Given f (x) = dX (x, 0) for every point x in the subset X \ {0} of a metric
space X with distance dX , define f (0) so that f is continuous at 0.
Pn
2.3. For n ∈ N let f : Rn → R be given by f (x) = ( i=1 x2i )1/2 for x =
(x1 , ..., xn ) ∈ Rn , and let S = (−3, −2). Find f −1 (S).
2.4. Define D = {x = (x1 , x2 , x3 ) ∈ R3 : 0 < x3 < x21 + x22 } and let f be the
mapping from R3 into R given by
1 , x∈D
f (x) =
0 , x∈ / D.
Show that f (x(k) ) tends to f (0) as x(k) approaches 0 along any linear path,
but f is discontinuous at 0.
42 2 Continuous Maps
2.5. Let T be the linear transformation from R2 into itself such that
T (1, 1) = (3, −1) and T (1, −1) = (1, 7).
is onto and thus satisfies the intermediate value property. Show, however, that
f is continuous only at x = 1/2.
2.8. Define
Z x
T : C[0, 1] → C[0, 1] by T (f )(x) = f (t)dt.
0
| ◦ T{z
prove that T has a unique fixed point, where T(k) = T · · · ◦ T}.
k
prove that there exists precisely one point x ∈ K such that x = T (x).
2.12. Let h ∈ C[0, 1]. Show that there is an f ∈ C[0, 1] such that
Z x
f (x) − f (x − t) exp(−t2 )dt = h(x).
0
Problems 43
f : D ⊆ X → X = f (D)
Prove that there exists exactly one point x ∈ D such that f (x) = x.
2.14. Let (X, dX ) and (Y, dY ) be two metric spaces and f : X → Y bijective.
Prove that the following three conditions are equivalent:
(i) f is a homeomorphism;
(ii) f is continuous, and maps closed subsets of X to closed subsets of Y ;
(iii) f is continuous and maps open subsets of X to open subsets of Y .
2.16. Let (X, dX ) be a metric space. Prove that there is a complete metric
space (Y, dY ) and a mapping f : X → Y such that f is an isometry, i.e.,
dY f (x1 ), f (x2 ) = dX (x1 , x2 ) for all x1 , x2 ∈ X,
2.17. Let (X, dX ) be a metric space and let K be a nonempty closed subset
of X. If f : X → R is defined by f (x) = inf y∈K dX (x, y), prove that f is
continuous, and that f (x) = 0 if and only if x ∈ K.
3
Normed Linear Spaces
Definition 3.1. A linear (or vector) space over F is a set X equipped with
two functions
+: X ×X →X and ·: F×X →X
sup |xj + yj | ≤ sup |xj | + sup |yj |; sup |αxj | = |α| sup |xj | < ∞.
j∈N j∈N j∈N j∈N j∈N
α, β ∈ F and x, y ∈ Y imply αx + βy ∈ Y.
Example 3.4.
(i) For n − 2 ∈ N, the vectors in Rn of the form (x1 , x2 , x3 , 0, ..., 0) form a
linear subspace of Rn .
(ii) For m, n ∈ N and m ≤ n, the set of polynomials of degree ≤ m forms a
linear subspace of the set of polynomials of degree ≤ n.
(iii) If Y = {(x1 , x2 , 1, ..., 1) ∈ Rn } for n − 2 ∈ N, then Y is an affine subset
of Rn .
(iv) In Example 3.2 (iii), let Y be the set of matrices with certain blocks of
1’s. Then Y is an affine subset of Mm,n (C).
(v) In R3 all lines and planes through the origin are linear subspaces, whereas
all lines and planes not passing through the origin are affine subsets.
Definition 3.5. Let n ∈ N. If the linear space X is equal to the space spanned
by a linearly independent set of n vectors in X, then X is said to have the
dimension n. If there is no such set of vectors, then X is infinite-dimensional.
Furthermore, a linearly independent set of vectors that spans X is called a
basis for X.
Example 3.6.
(i) For n ∈ N, the space Rn has dimension n; the standard basis is given by
the vectors e1 = (1, 0, · · · , 0), e2 = (0, 1, 0, · · · , 0), · · · , en = (0, 0, · · · , 0, 1).
(ii) For n ∈ N, the set {1, t, t2 , · · · , tn } is a basis for the linear space of real-
valued polynomials of degree ≤ n which has dimension n + 1.
(iii) All linear spaces given in Example 3.2 (iv)-(vii) are infinite-dimensional.
Next, we equip a linear space with a norm. The norm on a linear space
is a way of measuring the length of a vector and hence the distance between
two vectors.
dX (x1 , x2 ) = kx1 − x2 k, x1 , x2 ∈ X
(ii) For n ∈ N, there are many other norms on Rn , called the (n, p)-norms.
More precisely, if p ∈ [1, ∞), then
n
X p1
kxkn,p = |xj |p
j=1
If p = ∞, then
kxkn,∞ = sup |xj |
1≤j≤n
Note that `np and `nq have exactly the same elements which are just ones of
Rn .
3.1 Linear Spaces, Norms and Quotient Spaces 49
(iii) Recall that `∞ is the linear space of bounded infinite sequences of real
numbers. For x = (xj )∞j=1 ∈ `∞ , define
P∞ 1
p p
kxkp = j=1 |x j | , 1≤p<∞
sup
1≤j<∞ |x j | , p = ∞.
Let p, q ∈ [1, ∞] and p < q. Then there is a strict inclusion: `p ⊂ `q and hence
`p is a linear subspace of `q . Moreover, (`p , k · kp ) is separable for p ∈ [1, ∞)
– for a proof see also Example 1.41 (ii).
(iv) For a finite closed interval [a, b] in R, C[a, b] becomes a normed space
with the p-norm below
Rb p1
p
kf kp = a
|f (t)| dt , 1≤p<∞
a≤t≤b |f (t)| , p = ∞.
sup
Note that the triangle inequality for p ∈ [1, ∞) follows from the Minkowski
inequality kf1 + f2 kp ≤ kf1 kp + kf2 kp – see also Remark ??.
x + Y = {x + y : y ∈ Y } where x ∈ X.
Remark 3.11. The above-defined two operations make sense precisely because
Y is itself a linear space, so for instance,
50 3 Normed Linear Spaces
Y + Y = {y1 + y2 : y1 , y2 ∈ Y } = Y
and
λY = {λy : y ∈ Y } = Y for λ 6= 0.
Moreover, x1 + Y and x2 + Y are equal if and only if as sets x1 + Y = x2 + Y ;
this is true if and only if x1 − x2 ∈ Y , denoted x1 ∼ x2 , which reads: x1 is
equivalent to x2 with respect to Y .
Example 3.12.
(i) Let X = R3 , and let Y be the linear subspace spanned by (1, 1, 0). Then
X/Y is a two-dimensional real vector space since (1, 0, 1) + Y and (0, 0, 1) + Y
generate X/Y .
(ii) The linear space Y of finitely supported sequences (of which all but a
finite number of entries vanish) in `1 is a linear subspace. The quotient space
`1 /Y is quite hard to visualize – its elements are equivalence classes under
the relation (xj )∞ ∞ ∞ ∞
j=1 ∼ (yj )j=1 whenever (xj )j=1 and (yj )j=1 differ in finitely
many coordinates.
(iii) For n ∈ N let Y be the set of `1 -sequences of (0, · · · , 0, xn+1 , · · ·) (first
n coordinates are zero). Then Y is a linear subspace of `1 . Here the quotient
space `1 /Y is isomorphic to Rn .
(iv) Recall that if p, q ∈ [1, ∞] and p < q then `p ⊂ `q – this implies that
`q /`p exists as a linear quotient space.
(v) Let X = C[0, 1] and Y = {f ∈ X : f (0) = 0}. Then X/Y is isomorphic
to R.
(vi) Y = C[0, 1] is a linear subspace of X = R[0, 1]. The quotient X/Y is
again a linear space.
Evidently, these examples indicate that not all linear subspaces are equally
good: Example 3.12 (i), (iii) and (v) are quite reasonable, whereas Example
3.12 (ii), (iv) and (vi) are examples of linear spaces unlike any we have seen.
The reason is the following: The space X/Y is guaranteed to be a normed
space with a norm related to the original norm k · k on X only when the
subspace Y is itself closed. Notice that Example 3.12 (i), (iii) and (v) are
precisely the ones in which the linear subspace is closed whenever the space
X is treated as a metric space with the distance
Proof. On the one hand, suppose that Y is closed and note that
On the other hand, suppose Y is not closed. Let x be a point that is not
in Y but is in the closure of Y with respect to the distance kx1 − x2 kX . If
(yj )∞
j=1 is a sequence of points of Y that converges to x, then
Example 3.14.
(i) If X = R2 and Y = {(0, y) : y ∈ R}, then X/Y consists of lines in X of
the form (s, t) + Y . Furthermore, if√X is equipped with the norm k · k2,2 , then
k(s, t) + Y kX/Y = inf (u,v)∈(s,t)+Y u2 + v 2 = |s|.
(ii) The quotient space may be a little odd. For instance, let `∞,c denote the
space of all sequences (xj )∞j=1 with the property that limj→∞ xj exists. This
is a closed linear subspace of `∞ . It is a good practice to give a representation
of the quotient `∞ /`∞,c .
(iii) Let g : R → R be increasing, E ⊆ R an mg -measurable set, p ∈ [1, ∞].
Then each set LRSgp (E, F) defined in Example 1.3 (iii), is a linear space. If
then LRS pg (E, F) = LRSgp (E, F)/Ng (E) forms a linear quotient space whose
elements can be written as f + Ng (E) where f ∈ LRSgp (E, F). In particular,
if g : R → R is the identity, then LRS pid (E, F) is employed to represent
LRS pg (E, F). More importantly, when 1 ≤ p ≤ ∞ the space LRS pg (E, F) is a
normed linear space under the following norm:
52 3 Normed Linear Spaces
Definition 3.15.
(i) Two linear spaces X1 and X2 over F are called algebraically isomorphic
provided there is a bijection T : X1 → X2 that is linear:
(ii) A pair (X, k · kX ), (Y, k · kY ) of normed linear spaces over F are called
topologically isomorphic provided there is a linear bijection T : X → Y with
the property that there exist two positive constants c1 and c2 independent of
x ∈ X with
c1 kxkX ≤ kT (x)kY ≤ c2 kxkX .
If c1 = c2 = 1 then T is called an isometry and the normed spaces X and Y
are called isometric.
(iii) k · k(1) and k · k(2) defined on a given linear normed space X are equivalent
provided there are two constants c1 , c2 > 0 such that
Example 3.16.
(i) Let Y be the set of all real polynomials of the form f (t) = y1 + y2 t + y23 t2
with the norm kf kY = max{|y1 |, |y2 |, |y3 |},pand X the set of all vectors x =
(x1 , x2 , x3 ) ∈ R3 with the norm kxkX = x21 + x22 + x23 . If T : X → Y is
given by T (x) = x1 + x2 t + x23 t2 , then T is linearly bijective but also enjoys
p
kT (x)kY = max{|x1 |, |x2 |, |x3 |} ≤ |x1 |2 + |x2 |2 + |x3 |2 = kxkX
3.2 Finite Dimensional Spaces 53
and
1
√ kxkX ≤ kT (x)kY ≤ kxkX .
3
So, the two spaces are topologically isomorphic.
(ii) Given p ∈ [1, ∞] and n ∈ N, then the previously-defined norm k · kn,p and
the following norm k · kn,∗ defined via
are equivalent.
It follows that
c1 C1
k · k(1) ≤ k · k(2) ≤ k · k(1) ,
C2 c2
implying the desired equivalence.
Now let k · k denote
Pn either k · k(1) or k · k(2) . We show that k · k is equivalent
to k · k(3) . If x = j=1 αj ej then
n
X
kxk ≤ |αj |kej k ≤ max kej k kxk(3) .
1≤j≤n
j=1
Consequently,
n
X n
X
|αj | = 1 while αj ej = 0.
j=1 j=1
This is impossible, since {ej }nj=1 are linearly independent. This completes the
argument.
(ii) It suffices to show that X is topologically isomorphic to Fn with the
following Euclidean norm:
n
X 12
kαkn,2 = |αk |2 for α = (α1 , ..., αn ) ∈ Fn .
k=1
|||x|||X = kT (x)kn,2 .
Proof. (i) Let {ek }nk=1 be a basis for X. Then each x ∈ X can be written as
Pn Pn 1
2 2
x = j=1 αj ej , αj ∈ F. Set kxkn,2 = j=1 |αj | . This is a norm on X,
and by Theorem 3.17 it is equivalent to the given norm on X. Accordingly, if
Pn (k) ∞ (k) (k) ∞
x(k) = j=1 αj ej k=1 is a Cauchy sequence in X, then (α1 , ..., αn ) k=1
is a Cauchy sequence in Fn which is complete, and hence there exists a point
(k) (k)
(α1 , ..., αn ) ∈ Fn such that (α1 P, ..., αn ) → (α1 , ..., αn ) as k → ∞. Therefore
(k) ∞ n
(x )k=1 is convergent to x = j=1 αj ej in X, whence X is complete.
3.2 Finite Dimensional Spaces 55
The foregoing discussion reveals that the unit sphere of a given normed
linear space is compact if and only if the space is finite dimensional, but also
leads naturally to the following concept.
Definition 3.20. If (X, k · kX ) and (Y, k · kY ) are normed linear spaces over
F, then any one of:
1
(i) k(x, y)k = (kxkpX + kykpY ) p , p ∈ [1, ∞);
(ii) k(x, y)k = max{kxkX , kykY }.
may be defined as a norm on the Cartesian product X × Y .
This certainly does not exhaust all the possible combinations of the norms
kxkX and kykY , and yet these are the most commonly used ones. An extension
to the Cartesian product of n normed linear spaces is defined in the following
manner.
Remark 3.21. For n − 1 ∈ N let {Xk }nk=1 be a collection of linear spaces over
F with norms {k · kXk }nk=1 . If their Cartesian product space is defined as
n
Y
Xk = X1 × · · · × Xn = (x1 , ..., xn ) : x1 ∈ X1 , ..., xn ∈ Xn ,
k=1
for
n
Y
α ∈ F; (x1 , ..., xn ), (y1 , ..., yn ) ∈ Xk .
k=1
Qn
Then it is not hard to show that the function k=1 Xk → R defined by
n
X
k(x1 , · · · , xn )kQn Xk = kxk kXk
k=1
k=1
is a norm on X.
Definition 3.23. Let (X, k · kX ) and (Y, k · kY ) be normed linear spaces over
F. A linear operator T : X → Y is said to be bounded provided there exists a
constant C ≥ 0 such that kT (x)kY ≤ CkxkX for all x ∈ X. Define
kT (x)kY
kT k = kT kX→Y = sup .
x∈X,x6=0 kxkX
Example 3.24.
(i) Equip `2 with the 2-norm, and let T : `2 → `2 be given by T (x) =
(0, x1 , x2 , · · ·) when x = (x1 , x2 , · · ·). Then it is easy to check that T is a
bounded linear operator with kT k = 1.
(ii) Choose for both X and Y the real space C[0, 1] with the sup-norm. Define
T : X → Y by T (f )(x) = ex f (x), x ∈ [0, 1]. Then T is bounded with kT k = e.
The following result tells us that as for a linear operator, the boundedness
amounts to the continuity.
Theorem 3.25. Let (X, k · kX ) and (Y, k · kY ) be normed linear spaces over
F, and let T : X → Y be a linear operator. Then the following statements are
equivalent:
(i) T is continuous in X;
(ii) T is continuous at 0;
(iii) T is bounded;
(iv) T maps bounded subsets of X to bounded subsets of Y .
in Y , let
B1X (0) = {x ∈ X : kxkX < 1}
denote the open unit ball in X. By (iv) it follows that T (B1X ) is bounded in
Y . Thus, there is a λ > 0 such that
Proof. Note that any two norms on a finite dimensional linear space X over
F are equivalent. So, we construct a new norm k · k via k · kX and k · kY :
Definition 3.28. Let (X, k · kX ) and (Y, k · kY ) be two normed linear spaces
over F. Denote by B(X, Y ) the set of all bounded linear operators from X to
Y . In particular, B(X) = B(X, X).
kT (x)kY
kT k = sup for T ∈ B(X, Y ),
x∈X,kxkX 6=0 kxkX
is a norm on B(X, Y ).
(iii) If T ∈ B(X, Y ) and S ∈ B(Y, Z), then the composition ST = S ◦ T
belongs to B(X, Z) with kST k ≤ kSkkT k.
(iv) If T, S ∈ B(X) are invertible; that is, there are two elements T −1 , S −1 ∈
B(X) such that
T T −1 = T −1 T = I and SS −1 = S −1 S = I,
where I stands for the identity element of B(X), then ST ∈ B(X) is invertible
with (ST )−1 = T −1 S −1 .
Proof. (i) This follows from checking those conditions for a linear space with
B(X, Y ).
(ii) We have to verify three conditions required for a norm. First, it is clear
that kT k ≥ 0. If kT k = 0 then kT (x)kY = 0 for all x ∈ X, and hence T (x) = 0
for all x ∈ X. This gives T = 0. Conversely, T = 0 implies kT k = 0.
Next,
kαT (x)kY
kαT k = sup = |α|kT k.
x∈X,kxkX 6=0 kxkX
Finally,
60 3 Normed Linear Spaces
kT (x) + S(x)kY
kT + Sk = sup
x∈X,kxkX 6=0 kxkX
kT (x)kY kS(x)kY
≤ sup + sup
x∈X,kxkX 6=0 kxk X x∈X,kxkX 6=0 kxkX
= kT k + kSk.
k(ST )xkZ = kS(T (x))kZ ≤ kSkkT (x)kY ≤ kSkkT kkxkX , for all x ∈ X.
(ST )T −1 S −1 = I = T −1 S −1 (ST ).
Example 3.30. Define T : `2 → `2 by T (x1 , x2 , · · ·) = (0, x1 , x2 , · · ·). This
map is bounded but not invertible, since it is clearly not onto. If S : `2 → `2 is
given by S (x1 , x2 , · · ·) = (x2 , x3 , · · ·), then it is bounded but not invertible,
because it is clearly not one to one. Note that ST = I 6= T S.
Example 3.32. Let p ∈ (0, 1) and I = [0, 1]. If L is a continuous linear function
on LRS pid (I, R) then L must be the zero functional.
To see this conclusion, suppose, to the contrary, L 6= 0. Then, without loss
of generality we may assume that there exists an f ∈ LRS pid (I, R) such that
Hence, f 1[0,x] is continuous in the metric of LRS pid (I, R), and consequently,
φ(x) = L(f 1[0,x] ) is a continuous function on I since L is continuous. Note
that φ(0) = 0 and φ(1) = 1 and f = f 1[0,x] + f 1[x,1] . So there is an x0 ∈ (0, 1)
such that
Since
kf 1[0,x0 ] kpp,mid ,I + kf 1[x0 ,1] kpp,mid ,I = kf kpp,mid ,I ,
one of two terms of the left-hand side of the last equation is not greater than
2−1 kf kpp,mid ,I , say,
and then
But, the last inequality cannot hold due to 0 < p < 1, 2j(p−1) → 0, and L
being continuous.
Actually, the above question will be answered in great generality using the
Hahn-Banach extension theorem which is stated below and named for Hans
Hahn and Stefan Banach.
gα (x) = f (x) for all x∈Y and gα (x) ≤ p(x) for all x ∈ Yα .
Clearly, every totally ordered subset {(Yλ , gλ )} of K, for which at least one of
Y1 = {y + λy1 : y ∈ Y0 , λ ∈ R}.
and hence
−p(−y1 − x) − g0 (x) ≤ p(y + y1 ) − g0 (y).
It follows that
a = sup − p(−y1 − x) − g0 (x) ≤ inf p(y + y1 ) − g0 (y) = b.
x∈Y0 y∈Y0
Accordingly, we get
This is to say,
then
|F (x)| = αF (x) = F (αx) = U (αx) ≤ p(αx) ≤ p(x).
(ii) If x0 ∈ X satisfies
inf ky − x0 kX = d > 0,
y∈Y
|F (x)|
kxkX = sup = sup |F (x)|.
06=F ∈X ∗ kF k F ∈X ∗ , kF k=1
(viii) If X ∗ is separable, so is X.
(ix) If Y is closed and Y ⊥ = {f ∈ X ∗ : f (y) = 0 for all y ∈ Y }, then
Y ∗ = X ∗ /Y ⊥ and (X/Y )∗ = Y ⊥ in the sense that there exist isometries
between the corresponding normed spaces.
(x) If X is reflexive and Y is closed, then Y is reflexive.
Proof. (i) Given f ∈ B(Y, F), let p(x) = kf kkxkX for all x ∈ X. Then
Hence from Theorem 3.33 it turns out that there exists an extension F ∈
B(X, F) with F = f on Y and |F | ≤ p on X. Clearly, kF k = kf k follows from
kF k ≤ kf k and
f ∈ Y1∗ by f (y + λx0 ) = λ.
kf k = kxk−1
X and f (x) = 1.
thus
sup |F (x)| = kxkX .
F ∈X ∗ , kF k=1
such that kf k = 1 and f (x) = 0 for all x ∈ Y . Now if ∈ (0, 1/2), then there
exists a functional fj0 ∈ {fj }∞j=1 such that xj0 ∈ Y and
Problems
3.1. Let p ∈ (1, ∞) and q = p/(p − 1). Prove Hölder’s and Minkowski’s in-
equalities of the discrete forms:
(i) For vectors (xj )nj=1 and (yj )nj=1 in Rn , one has
n
X n
X n
p1 X q1
|xj yj | ≤ |xj |p |yj |q
j=1 j=1 j=1
and
n
X p1 n
X p1 n
X p1
|xj + yj |p ≤ |xj |p + |yj |p .
j=1 j=1 j=1
∞
X ∞
X ∞
p1 X q1
|xj yj | ≤ |xj |p |yj |q
j=1 j=1 j=1
and
∞
X p1 ∞
X p1 ∞
X p1
|xj + yj |p ≤ |xj |p + |yj |p .
j=1 j=1 j=1
(i) On C[0, 1] the sup-norm is not equivalent to any p-norm where 1 ≤ p < ∞;
(ii) If C[0, 1] is equipped with the sup-norm, then the unit sphere of C[0, 1] is
not compact.
3.4. Prove that if X is a linear space over F then any norm k · k : X → R
is continuous on X, but also vector addition and scalar multiplication are
continuous whenever X × Y is equipped with the norm k · kX + k · kY .
3.5. Recall that `∞ is the space of all bounded infinite sequences x = (xj )∞
j=1
of real numbers with the norm kxk∞ = supj∈N |xj |. Prove that if `0 is the
class of all infinite sequence of real numbers which have only finitely many
non-zero terms, then `0 is not closed in `∞ .
3.6. Prove
R p→∞ kf kp,mid ,[0,1] = kf k∞,mid ,[0,1] whenever kf k∞,mid ,[0,1] < ∞;
(i) lim
(ii) [0,1] |f1 f2 |dmid ≤ kf1 k1,mid ,[0,1] kf2 k∞,mid ,[0,1] with equality if and only if
|f2 (x)| = kf2 k∞,mid ,[0,1] for mid -a.e. x ∈ [0, 1].
3.7. Prove that if C[0, 1] is equipped with the sup-norm and T : C[0, 1] → R
is given by T (f ) = f (0), then T is bounded with kT k = 1.
3.8. P∞
(i) Suppose that the real infinite matrix [akj ] satisfies supk∈N j=1 |akj | < ∞.
Define
∞
X ∞
X
T : x = (x1 , x2 , · · ·) → y = T (x) = a1j xj , a2j xj , · · · .
j=1 j=1
P∞
Prove that T : `∞ → `∞ is bounded and kT k = supk∈N j=1 |akj |.
(ii) Let T : `2 → `2 be defined by
3.11. For a normed linear space X over F, prove the following results:
(i) If T ∈ B(X) and T −1 exists and belongs to B(X), then (T −1 )n = (T n )−1
holds for any n ∈ N.
(ii) If T, S ∈ B(X) and T S has an inverse in B(X), then T and S must have
inverses in B(X).
3.12. Let K be a convex set in a normed linear space (X, k · kX ) over R; that
is,
x1 , x2 ∈ K and λ ∈ (0, 1) ⇒ λx1 + (1 − λ)x2 ∈ K.
If 0 is an interior point of K, then the Minkowski functional p of K is defined
by
p(x) = inf{λ > 0 : x ∈ λK} where λK = {λx : x ∈ K}.
Prove the following results:
(i) The Minkowski functional of the unit ball in X is the norm k · kX .
(ii) The Minkowski functional p is positive homogeneous; that is,
(iii) For each x0 6∈ K there exists an L ∈ B(X, R) such that L(x) ≤ L(x0 ) for
x ∈ K.
M ∩ N = {0} and M + N = X.
3.4 Linear Functionals via Hahn-Banach Extension 69
3.17. Verify the Krein extension theorem: Let X be a normed linear space
over R and Y ⊆ X such that if y1 , y2 ∈ Y and c ≥ 0 then y1 + y2 ∈ Y and
cy1 ∈ Y . Define a partial order on X by declaring that x1 x2 if and only
if x2 − x1 ∈ Y , and call a linear functional f on X Y -positive if f (x) ≥ 0 for
x ∈ Y . Let M be a subspace of X such that for each x ∈ X there exists y ∈ M
with x y. Prove that if f is a (Y ∩ M)-positive linear functional on M,
then there is a Y -positive linear functional F on X such that the restriction
of F on M is equal to f .
4
Banach Spaces via Operators and Functionals
Banach spaces, named after Stefan Banach who investigated them, are one
of the central objects of study in linear functional analysis. Banach spaces
are typically infinite-dimensional spaces containing functions. In this chapter
we are concerned with the definition of a Banach space followed by some
examples, some of the basic algebraic and topological properties of Banach
spaces, continuous linear functionals, as well as bounded and compact linear
operators in Banach spaces.
Definition 4.1. A normed linear space over F which is complete in the metric
generated by the norm is called a Banach space over F.
Clearly, if a linear space is a Banach space under one norm, then it is also
a Banach space under any equivalent norm, and hence for each n ∈ N, Rn is
complete under any norm. One more beginning example of Banach spaces is
listed below.
Example 4.2.
(i) For a compact metric space (X, dX ), suppose C(X, F) is equipped with
the sup-norm kf k∞ = supx∈X |f (x)| for f ∈ C(X, F). Then it is a Banach
space over F, because of Theorem 2.23 (i).
(ii) For each p ∈ [1, ∞], let `p (N, F) be the space of all F-valued sequences
(xj )∞
j=1 satisfying
P∞ 1
p p
∞
k(xj )j=1 kp = j=1 |x j | , p ∈ [1, ∞)
supj∈N |xj | , p = ∞.
72 4 Banach Spaces via Operators and Functionals
Then `p (N, F), k · kp is a Banach space over F. To see this, assume (xj )∞
j=1
(1) (2)
is a Cauchy sequence in `p (N, F), and write xj = (xj , xj , ...). Because of
k · kp ≥ k · k∞ , for any > 0 we may find N ∈ N such that m, n > N ⇒
kxn − xm kp < which in turn implies that kxn − xm k∞ < , so for each k,
(k) (k)
|xn − xm | < . In other words, if (xj )∞
j=1 is a Cauchy sequence in `p (N, F)
(k)
then (xj )∞
j=1 is a Cauchy sequence in F. Since F is complete, for each k
(k)
we have that |xj − y (k) | → 0 as j → ∞. Note that this does not imply by
itself xj → y = (y (1) , y (2) , ...). However, if we know that (xj )∞
j=1 is a Cauchy
sequence, then it does. In fact, we prove this for p < ∞ but the p = ∞ case is
similar. Fix > 0, and use the Cauchy criterion to find an N ∈ N such that
∞
X
n, m > N ⇒ |x(k) (k) p
n − xm | < .
k=1
P∞ (k)
Now fix n ∈ N and let m → ∞ to see k=1 |xn − y (k) |p ≤ . This inequality
1
means kxn − ykp ≤ p and y = (y (1) , y (2) , ...) ∈ `p (N, F).
To give another important class of Banach spaces, we need one more def-
inition and theorem.
Definition 4.3. For a sequence (xj )∞ in a normed linear space (X, k · kX )
P∞j=1
over F, we say that the series x
j=1 P j is absolutely convergent provided
P∞ ∞
j=1 kxj kX < ∞. Moreover,
Pn
a series j=1 xj is said to converge to x in
X provided limn→∞ k j=1 xj − xkX = 0.
Theorem 4.4. Let (X, k · kX ) be a normed linear space over
P∞ F. Then it is a
Banach space if and only if the absolute convergence of j=1 xj implies its
convergence.
Proof. The argument is motivated by that for Theorem ?? (ii). On the one
hand, suppose (X, k · kX ) is a Banach space. Consider the sequence of partial
Pk P∞
sums sk = j=1 xj . Since j=1 xj is absolutely convergent, we conclude that
m
X
ksm − sk kX ≤ kxj kX → 0 as m > k → ∞.
j=k+1
Pl ∞
So k=1 kyk kP
X l=1 is increasing and bounded, and hence is convergent.
∞ Pn
By hypothesis, k=1 yk is convergent. Since k=1 yk = xNn , it follows that
(xNn )∞ ∞
n=1 is convergent in (X, k·kX ). This, together with the fact that (xk )k=1
∞
is a Cauchy sequence, infers that (xk )k=1 is convergent. Therefore, (X, k · kX )
is a Banach space.
Remark 4.5. It is worthwhile to point out that Theorem 4.4 is clearly not true
for general normed linear spaces. For example, for j ∈ N let
0 , x ∈ [−1, −1/j]
hj (x) = 1 + jx , x ∈ [−1/j, 0]
1 , x ∈ [0, 1],
Example 4.6. Let LRS pg (E, F) be as in Example 3.14. Then each LRS pg (E, F)
is a Banach space whenever p ∈ [1, ∞]. Actually, in the case p ∈ [1, ∞), the
result can be demonstrated via Theorems 4.4 and ?? (ii). As with p = ∞,
∞
suppose (fj )∞
j=1 is a Cauchy sequence in LRS g (E, F). Clearly, we may assume
|fj (x)| ≤ kfk k∞,mg ,E for all x ∈ E by replacing fj with an equivalent function
∞
for which this is true. Then fj (x) k=1 is a Cauchy sequence in F for each x ∈
E and we can thus get an F-valued function f on E such that limk→∞ fk (x) =
f (x) for each x ∈ E. This in turns yields limk→∞ kf − fk k∞,mg ,E = 0.
Proof. If Y is not closed, then Theorem 3.13 tells us that kx + Y kX/Y is not
a norm. So, it remains
P∞ to prove that if Y is closed then X/Y is complete. In
doing so, suppose k=1 (xk + Y ) is an absolutely convergent series in X/Y .
For each k ∈ N choose zk ∈ xk + Y such that
74 4 Banach Spaces via Operators and Functionals
Example 4.9.
(i) (C[0, 1], k · k∞ ) is a Banach algebra with (f g)(x) = f (x)g(x).
(ii) LRS 1id (R, F) is a Banach algebra under the convolution
Z
f1 ∗ f2 (x) = f1 (x − t)f2 (t)dmid (t).
R
Beyond Example 4.9 (iv), we can derive more information on all bounded
linear operators on a given Banach space.
(i) If T ∈ B(X) satisfies kT k < 1 and I is the identity element in B(X), then
I − T ∈ B(X) is invertible and
k
| ◦ ·{z
where T is the k-th composition T · · ◦ T} for k ∈ N.
k
(ii) If I stands for the class of all invertible operators in B(X), then I is an
open set in B(X).
Proof. (i) Fix x ∈ X. Since m, n ∈ N and m > n imply
X ∞
≤ kT kj kxkX
j=n+1
kT kn+1
= kxkX
1 − kT k
→0 as n → ∞,
∞
we conclude that (I + T + T 2 + · · · + T n )(x) n=1 is a Cauchy sequence in
X. Note that (X, k · kX ) is a Banach space. So the sequence converges to a
limit y ∈ X. Let y = A(x). It is not hard to show that A : X → X is a linear
operator on X. Furthermore, letting m → ∞, we have
kT kn+1
kA(x) − (I + T + T 2 + · · · + T n )(x)kX ≤ kxkX ,
1 − kT k
kT kn+1
kA − (I + T + T 2 + · · · + T n )k ≤ →0 as n → ∞,
1 − kT k
But
76 4 Banach Spaces via Operators and Functionals
T2 T3
eT = I + T + + + ···,
2! 3!
which makes sense since
kT k2 kT k3
keT k ≤ 1 + kT k + + + · · · = exp kT k.
2! 3!
With this notion, we have that if x(t) = x1 (t), x2 (t), ..., xn (t) ∈ Rn , then
the system
Thus, for each x ∈ B 1 (0) we get kTα (x)kY ≤ 4j0 /r. This yields supα∈I kTα k ≤
4j0 /r, as required.
Definition 4.13. Let (X, k · kX ) and (Y, k · kY ) be normed linear spaces over
F.
(i) A sequence (Tn )∞
n=1 in B(X, Y ) is said to be uniformly convergent provided
there is a T ∈ B(X, Y ) such that limn→∞ kTn − T k = 0.
(ii) A sequence (Tn )∞
n=1 in B(X, Y ) is said to
∞be strongly (or pointwise) con-
vergent on X provided the sequence Tn (x) n=1 is convergent in Y for any
x ∈ X. Moreover, if there is a T ∈ B(X, Y ) such that limn→∞ kTn (x) −
T (x)kY = 0 for all x ∈ X, then (Tn )∞ n=1 is said to be strongly convergent to
T.
Example 4.14. Clearly, the uniform convergence implies the strong conver-
gence, but not conversely. Consider `p (N, F), p ∈ [1, ∞). For each n ∈ N
define
Tn (x) = (xn , xn+1 , ...), x = (x1 , x2 , ...) ∈ `p .
Then Tn is in B `p (N, F) with kTn k ≤ 1. Note that if x = (x1 , x2 , ...) ∈
`p (N, F) then
X∞ p1
lim kTn (x)kp = lim |xj |p = 0.
n→∞ n→∞
j=n
So (Tn )∞
n=1 is strongly convergent to 0. Meanwhile, for n ∈ N let en =
(0, ..., 0, 1, 0, ...). Then ken kp = 1 and Tn (en ) = (1, 0, 0, ...) and hence kTn k ≥
| {z }
n−1
kTn (en )kp = 1. This shows that (Tn )∞
n=1 is not uniformly convergent to 0.
Recall that a continuous map between two normed linear spaces has the
property that the pre-image of any open set is open, but in general the image
of an open set is not open. But, continuous linear operators between Banach
spaces cannot do this. This is the content of the open mapping theorem as
follows.
Proof. We split the argument into three steps. In what follows, we employ
BrX (x) and BrY (y) to denote the balls of radius r > 0 centered at x ∈ X and
y ∈ Y , respectively. In particular, we write BrX and BrY for BrX (0) and BrY (0),
respectively.
Step 1. We prove that for any > 0 there is a δ > 0 such that
4.2 Uniform Boundedness, Open Map and Closed Graph 79
BδY ⊆ T (B2
X ).
Since (Y, k · kY ) is a Banach space, we conclude that from the Baire category
theorem (i.e. Theorem 1.47) that some nT (BX ) is not nowhere dense in Y ,
and so that
for some z ∈ Y and some r > 0. Thus T (BX ) must contain the ball BδY (y0 )
where y0 = z/n and δ = r/n. It follows that the set
is contained in T (U ), where
BδY ⊆ T (B2X
).
P∞
To do so, choose (n )∞n=1 with n > 0 and n=1 n < . By Step 1 there is a
sequence of positive numbers (δn )∞ n=1 such that BδYn ⊆ T (B2
X ). Without loss
n
of generality, we may assume limn→∞ δn = 0.
X ) and hence there is a point x ∈ B X such
Let y ∈ BδY1 . Then y ∈ T (B2 1 1 21
that ky − T (x1 )kY < δ2 . Since y − T (x1 ) ∈ BδY2 , we conclude that there is a
X
point x2 ∈ B2 2
such that ky −T (x1 )−T (x2 )kY < δ3 . Continuing this process,
we obtain a sequence (xn )∞ X
n=1 such that xn ∈ B2n and
n
X
y − T xk
< δn+1 .
Y
k=1
P∞
Since kxn kX < 2n , we conclude that
P∞ n=1 xn is absolutely convergent and
hence by Theorem 4.4 that x = n=1 xn makes sense in (X, k · kX ). This
implies
80 4 Banach Spaces via Operators and Functionals
∞
X ∞
X
kxkX ≤ kxn kX < 2 n < 2.
n=1 n=1
X
In fact, if x ∈ O, then there exists an > 0 such that B2 (x) ⊆ O. By Step 2,
we have BδY ⊆ T (B2 X
) for some δ > 0. Hence
Definition 4.20. Given two normed linear spaces X and Y over F, let T :
X → Y be a linear operator. Then the graph of T is defined as
Theorem 4.21. Let (X, k · kX ) and (Y, k · kY ) be Banach spaces over F. Then
a linear operator T : X → Y is bounded if and only if T is closed.
Consequently, T is bounded.
Remark 4.22. It is worth mentioning that the second part of the argument
for Theorem 4.21 has used the assumption that (X, k · kX ) and (Y, k · kY )
are complete. Whenever this assumption is removed, the conclusion fails, i.e.,
there exist a linear operator T0 and a non-Banach space (X0 , k · kX0 ) such
that T0 is closed but unbounded on (X0 , k · kX0 ); see also Problem 4.8.
When dealing with a normed linear space X over the base field F, one typically
is only interested in the continuous linear functionals from the space into F.
According to Theorem 3.29 and Definition 3.31, these form a normed linear
82 4 Banach Spaces via Operators and Functionals
space. In this section, we will not only see that this new normed linear space is
indeed a Banach space, but also completely describe the dual spaces of three
typical Banach spaces.
We begin with a more general result.
Proof. If (Tj )∞
j=1 is a Cauchy sequence in B(X, Y ), then it is bounded and so
there is a constant C > 0 such that kTj (x)kY ≤ CkxkX for all x ∈ X and
j ∈ N. Since
If j → ∞, then
k > N ⇒ kT (x) − Tk (x)kY ≤ kxkX .
That is to say, kTk − T k ≤ as k > N . Thus, kTk − T k → 0 as k → ∞.
Theorem 4.23 particularly says that the dual space X ∗ = B(X, F) of the
normed linear space X over a given field F is always a Banach space no matter
whether X is complete or not. To see this picture clearly, we will identify the
dual spaces of three Banach spaces. Thanks to their importance, we will state
them as three independent theorems.
The first one is about the sequence spaces.
Theorem 4.24. Let p ∈ [1, ∞). Then `p (N, F)∗ = ` p−1 p (N, F) in the sense
such that
∞
X
L(x) = xk yk for all x = (xk )∞
k=1 ∈ `p (N, F).
k=1
4.3 Dual Banach Spaces by Examples 83
Proof. Since
∞
X
|xk yk | ≤ kxkp kyk p−1
p ,
k=1
∞
X
L(x) = xj yj ,
j=1
p
|yj | p−1 −2 yj
, 6 0
yj =
xj =
0 , yj = 0.
Moreover
n n
X p1
X p p
L(zn ) = |yj | p−1 and |L(zn )| ≤ kLkkzn kp = kLk |yj | p−1 .
j=1 j=1
Hence
n
X p−1
p p
|yj | p−1 ≤ kLk.
j=1
X∞
|L(x)| = xj yj ≤ kxkp kyk p−1
p .
j=1
84 4 Banach Spaces via Operators and Functionals
(ii) If p ∈ (1, ∞), then `p (N, F) is reflexive. However, if c0 (N, F) stands for
the space of all vectors (xj )∞
j=1 in `∞ with limj→∞ xj = 0, then c0 (N, F) =
∗
`1 (N, F) which can be checked in a similar way to proving Theorem 4.24, and
hence c0 (N, F) is not reflexive thanks to `1 (N, F)∗ = `∞ (N, F).
The second one characterizes the Banach dual of the space of all F-valued
continuous functions on a given finite closed interval in R.
Theorem 4.26. For −∞ < a < b < ∞ equip f ∈ C([a, b], F) with the sup-
norm
kf k∞ = sup |f (x)|,
x∈[a,b]
and let BV ([a, b], F) be the class of all F-valued functions having bounded
variation on [a, b]. If
BV0 ([a, b], F)
n Z b o
= g ∈ BV ([a, b], F) : f (x)dg(x) = 0 for all f ∈ C([a, b], F) ,
a
then
C([a, b], F)∗ = BV ([a, b], F)/BV0 ([a, b], F)
in the sense that L ∈ C([a, b], F)∗ when and only when there is a unique
equivalent class
g + BV0 ([a, b], F) ∈ BV ([a, b], F)/BV0 ([a, b], F),
such that L can be written as the Riemann-Stieltjes integral
Z b
L(f ) = f (x)dg(x) for all f ∈ C([a, b], F).
a
4.3 Dual Banach Spaces by Examples 85
Proof. On the one hand, suppose the statement after the when and only when
is true. We may write g = g1 + ig2 where g1 , g2 ∈ BV [a, b], i.e., Vab (gk ) < ∞
for k = 1, 2, in the sense of Definition ??, and consequently,
Z b
|L(f )| = f (x)d g1 (x) − ig2 (x)
a
Z b Z b
= f (x)dg1 (x) − i f (x)dg2 (x)
a a
Vab (g1 ) + Vab (g2 ) .
≤ kf k∞
Thus, this functional L belongs to C([a, b], F)∗ .
Conversely, suppose L ∈ C([a, b], F)∗ and E is a Hahn-Banach extension of
L to the space BD([a, b], F) of all functions f : [a, b] → F satisfying kf k∞ =
sup{|f (x)| : x ∈ [a, b]} < ∞. For t ∈ [a, b] let g(t) = E(1[a,t] ), but also for
{t0 , t1 , · · · , tn }, a partition of [a, b], and for each k ∈ {1, · · · , n} let λk be the
sign of g(tk ) − g(tk−1 ), i.e., the number in F obeying
|g(tk ) − g(tk−1 )| = λk g(tk ) − g(tk−1 ) .
Because of the following representation of 1:
n
X
1= 1[a,tk ] − 1[a,tk−1 ] on [a, b],
k=1
one has
n
X n
X
|g(tk ) − g(tk−1 )| = λk E(1[a,tk ] ) − E(1[a,tk−1 ] )
k=1 k=1
n
X
≤ E λk (1[a,tk ] − 1[a,tk−1 ] )
k=1
Xn
≤ kEk
λk (1[a,tk ] − 1[a,tk−1 ] )
∞
k=1
Xn
≤ kEk
(1[a,tk ] − 1[a,tk−1 ] )
∞
k=1
≤ kEk.
This means that g has bounded variation on [a, b]. Now let f ∈ C([a, b], F)
and tk = a + nk (b − a) for k = 0, ..., n, and define fn ∈ BD([a, b], F) via
n
X
fn (x) = f tk−1 1[a,tk ] (x) − 1[a,tk−1 ] (x) , x ∈ [a, b].
k=1
Now, it follows from the foregoing representation of 1 and the uniform conti-
nuity of f on [a, b] that
86 4 Banach Spaces via Operators and Functionals
kfn − f k∞ ≤ max sup f (x) − f (tk−1 ) → 0 as n → ∞.
1≤k≤n x∈[tk−1 ,tk ]
Note that E is continuous on BD([a, b], F). So it follows from the definition
of the Riemann-Stieltjes integral that
L(f ) = E(f )
= lim E(fn )
n→∞
n
X
= lim f (tk−1 )(g(tk ) − g(tk−1 )
n→∞
k=1
Z b
= f (x)dg(x),
a
as required. Finally, if h + BV0 ([a, b], F) ∈ BV ([a, b], F)/BV0 ([a, b], F) also
Rb
satisfies L(f ) = a f (x)dh(x) for all f ∈ C([a, b], F), then
Z b
f (x)d g(x) − h(x) = 0 for all f ∈ C([a, b], F),
a
and
p
(kf1 + f2 kp,mg ,E + kf1 − f2 kp,mg ,E )p + kf1 + f2 kp,mg ,E − kf1 − f2 kp,mg ,E
and
p
(kf1 + f2 kp,mg ,E + kf1 − f2 kp,mg ,E )p + kf1 + f2 kp,mg ,E − kf1 − f2 kp,mg ,E
Proof. It suffices to verify the first inequalities in (i) and (ii) since the second
ones follow from the first ones via replacing f1 and f2 with f1 + f2 and f1 − f2 .
Note that the cases p = 1 and p = 2 are clear from a direct computation. So,
it remains to check the case p 6= 2. To this end, we may assume
kf2 kp,mg ,E
R= ≤1 and kf1 kp,mg ,E = 1.
kf1 kp,mg ,E
(1 + r)p−1 − (1 − r)p−1
φ(r) = (1 + r)p−1 + (1 − r)p−1 and ψ(r) =
rp−1
and compute
dφ(r) dψ(r) p R p
+ R = (p − 1) (1 + r)p−2 − (1 − r)p−2 1 − ,
dr dr r
which vanishes only at r = R and indicates that φ(r)+ψ(r)Rp has a maximum
or minimum at r = R when p < 2 or p > 2. Since
≤ φ(r) , p < 2
ψ(r)
≥ φ(r) , p > 2,
≤ |a + b|p + |a − b|p
, p<2
φ(r)|a|p + ψ(r)|b|p
≥ |a + b|p + |a − b|p , p > 2,
Z
φ(r)|f1 |p dmg + ψ(r)|f2 |p dmg
E R
≤ RE |f1 + f2 |p + |f1 − f2 |p dmg , p<2
≥ E |f1 + f2 |p + |f1 − f2 |p dmg , p > 2.
Of course, the last estimates, along with r = R, give the desired inequalities
right away.
2δ ≤ khj + hk kp,mg ,E
≤ khj kp,mg ,E + khk kp,mg ,E
→ 2δ as k, j → ∞.
p
Suppose (hn )∞
n=1 is not a Cauchy sequence in LRS g (E, F). Then there is a
constant κ > 0 such that khj − hk kp,mg ,E ≥ κ holds for infinitely many j’s
and k’s.
Case 1: p ∈ [2, ∞). According to the first inequality in Theorem 4.27 (ii),
we have
Case 2: p ∈ (1, 2]. According to the second inequality in Theorem 4.27 (i),
we have
p
khj + hk kp,mg ,E + khj − hk kp,mg ,E
p
+khj + hk kp,mg ,E − khj − hk kp,mg ,E
≤ 2p khj kpp,mg ,E + khk kpp,mg ,E
→ 2p+1 δ p as j, k → ∞,
whence getting
|2δ + κ|p + |2δ − κ|p ≤ 2p+1 δ p .
Since φ(x) = |2δ + x|p is a strictly convex function with x; that is to say,
φ(tx1 + (1 − t)x2 ) < tφ(x1 ) + (1 − t)φ(x2 ) for any t ∈ (0, 1) and all x1 , x2 ∈ R,
we obtain the following contradiction:
4.3 Dual Banach Spaces by Examples 89
|ht |p − |h0 |p
= 2−1 p|h0 |p−2 (h − h0 )h0 + (h − h0 )h0
lim+
t→0 t
and
|ht |p − |h0 |p
|h0 |p − |2h0 − h|p ≤ ≤ |h|p − |h0 |p
t
p
which follows from the convexity tx1 + (1 − t)x2 ≤ txp1 + (1 − t)xp2 of the
function xp on [0, ∞), we use the dominated convergence theorem to obtain
that φ0 (0) exists with
Z
0 ≤ φ0 (0) = < (h − h0 )h0 |h0 |p−2 dmg ,
E
as desired.
Now, we are ready to identify the dual space of each LRS pg (E, F) for
p ∈ [1, ∞) via the Frigyes Riesz representation theorem below.
and p ∈ [1, ∞). Then LRS pg (E, F)∗ = LRS gp−1 (E, F) in the sense that
p
L ∈ LRS pg (E, F)∗ if and only if there is a unique element h ∈ LRS gp−1 (E, F)
such that Z
L(f ) = f hdmg for all f ∈ LRS pg (E, F).
E
Proof. The Hölder inequality (see also Theorem ??) implies the above-
formulated functional is an element of LRS pg (E, F)∗ .
For the reversed conclusion, we may assume 0 6= L ∈ LRS pg (E, F)∗ . Then
we consider two cases as follows.
Case 1: p ∈ (1, ∞). For the above-given L, define
whence finding
Z
< (±h)|f0 − h0 |p−2 (f0 − h0 )dmg ≤ 0 for all h ∈ K.
E
is valid for all f ∈ LRS pg (E, F). However, according to the previous Case 1
p
So, by the inclusion LRS pg2 (E, F) ⊆ LRS pg1 (E, F), which holds for 1 < p1 <
p2 < ∞ and follows from
p
Hölder’s inequality, indicates that h does not depend
on p. Now if f = |h| p−1 −2 h, then f ∈ LRS pg (E, F) and hence
p p−1 p
−1
khk p−1
p
,mg ,E
= L(f ) ≤ kLk mg (E) p khk p−1
p
,mg ,E
.
p−1 p−1
namely, h ∈ LRS ∞
R
g (E, F). This fact further implies that E |h||f |dmg < ∞
for all f ∈ LRS 1g (E, F). With f ∈ LRS 1g (E, F) and j ∈ N, we consider
f (x) , |f (x)| ≤ j
fj (x) =
0 , |f (x)| > j,
and find that |fj | ∈ LRSgp (E), fj (x) → f (x) and |fj (x)| ≤ |f (x)| for each
x ∈ E. By the dominated convergence theorem it follows that
and so that
Z Z
L(f ) = lim L(fj ) = lim fj hdmg = f hdmg .
j→∞ j→∞ E E
Note that f ∈ LRS 1g E ∩[j, j +1), F if and only if f 1E∩[j,j+1) ∈ LRS 1g (E, F).
∗
So, Lj (f ) = L(f 1E∩[j,j+1) ) defines an element in LRS 1g E ∩ [j, j + 1), F ,
and consequently, there exists an element hj ∈ LRS ∞
g E ∩ [j, j + 1), F such
that
92 4 Banach Spaces via Operators and Functionals
Z
L(f 1E∩[j,j+1) ) = f hj dmg
E∩[j,j+1)
and
khj k∞,mg ,E∩[j,j+1) ≤ kLj k ≤ kLk.
Now, if the function h is defined on E by setting h = hj on E ∩ [j, j + 1) for
each j ∈ Z, then h ∈ LRS ∞ g (E, F) and
∞
X X Z Z
L(f ) = L f 1E∩[j,j+1) = f hdmg = f hdmg ,
j=−∞ j=−∞ E∩[j,j+1) E
(for any |f1 |, |f2 | ∈ LRS g (E) and c1 , c2 ∈ F) as well as some elementary
algebraic operations of complex numbers.
(ii) In addition, a sequence may have at most one weak limit for if x 6= y then,
as we saw in the proof of Corollary 3.34 (iv), there exists an element L ∈ X ∗
w w
such that L(x) − L(y) = L(x − y) = 1. If xk → x and xk → y, then there is
an N ∈ N such that k > N implies
1 1
|L(xk ) − L(x)| < and |L(xk ) − L(y)| < ,
2 2
and hence |L(x) − L(y)| < 1, a contradiction.
The following is a natural way to introduce the weak topology where the
basis is formed by all weakly open sets.
Remark 4.34. Note that BX (x) ⊆ N (x; kLk, L) for each L ∈ X ∗ . So Defini-
tion 4.33 is a natural generalization of the idea of a ball in a metric space.
w
From the definition it turns out that xk → x is equivalent to saying that to
any such weak neighborhood of x there corresponds an N ∈ N such that xk
lies in this neighborhood for k ≥ N .
Definition 4.35. Let (X, k·kX ) be a normed linear space over F, and suppose
L, L1 , L2 , ... ∈ X ∗ . Then we say that
(i) (Ln )∞
n=1 is strongly convergent to L provided limn→∞ kLn − Lk = 0;
(ii) (Ln )∞
n=1 is weak* convergent to L provided limn→∞ |Ln (x) − L(x)| = 0 for
all x ∈ X;
(iii) (Ln )∞
n=1 is weakly convergent to L provided limn→∞ kF (Ln ) − F (L)k = 0
for all F ∈ X ∗∗ .
Remark 4.36.
(i) In general, (ii) and (iii) in Definition 4.35 are not equivalent. But, if there
is an isometry between X and X ∗∗ then both concepts coincide.
(ii) From Theorem 4.15 (ii) we can see that if (X, k · kX ) is a Banach space
over F, then any weak* convergent sequence in X ∗ must be bounded and its
converse is also true whenever this sequence is assumed to be convergent on
a dense subset of X.
94 4 Banach Spaces via Operators and Functionals
(iii) Motivated by Definition 4.35 (iii), we say that (Tn )∞ n=1 in B(X, Y) is
weakly convergent to T ∈ B(X, Y ) provided limn→∞ |F Tn (x) − F T (x) | =
0 for all x ∈ X and F ∈ Y ∗ . Here (X, k · kX ) and (Y, k · kY ) are supposed to
be normed linear spaces over F. According to Definition 4.13, we can find out
that the uniform convergence implies the strong convergence which implies
the weak convergence, but not conversely. Since Example 4.14 has shown that
the strong convergence does not yield the uniform convergence, it is enough
to check that the weak convergence does not derive the strong convergence.
To do so, consider X = Y = `2 (N, R) over R, and Tn (x) = (0, ..., 0, x1 , x2 , ...)
| {z }
n
for x = (x1 , x2 , ...) ∈ `2 (N, R). Clearly, kTn k = 1 and if y = (y1 , y2 , ...) ∈
`2 (N, R) then
X∞ ∞ ∞ 21
X X
Tn (x) k yk = xk yk ≤ kxk2 yk2
k=1 k=n+1 k=n+1
Similarly, we can define the weak-∗ topology on X ∗ via all weak* open
subsets of X ∗ .
(ii) An arbitrary union of such neighborhoods is called a weak* open set. The
complement of a weak* open set is called a weak* closed set.
(iii) A subset A of X ∗ is called weak* compact provided every cover of A by
weak* open sets has a finite subcover.
Definition 4.38.
(i) A collection T of subsets of a set X is said to be a topology for X provided:
(a) ∅, X ∈ T ;
(b) ∩nj=1 Ej ∈ T whenever n ∈ N and E1 , E2 , ..., En ∈ T ;
(c) ∪j∈I Ej ∈ T whenever {Ej }j∈I is a family in T .
In this case, (X, T ) is called a topological space. Meanwhile, members of T
and their complements are called open and closed sets of (X, T ) respectively.
4.4 Weak and Weak* Topologies 95
The product topology T on X is defined to be the topology with the fewest open
sets for which every projection pj is continuous; that is, p−1
j (O) is open in X
whenever O is open in Xj . Consequently, the above-defined (X, T ) is called a
product topological space.
Lemma 4.39. Let F be a family of subsets of a given set X. If F enjoys
the finite intersection property (FIP) – each finite collection of subsets in the
family has a nonempty intersection, then F can be extended to a family which
is maximal with respect to having the FIP.
Proof. Let T be the collection of all families of subsets of X that contain F
and satisfy the FIP. Define an order on T via inclusion and let {Tj : j ∈ J}
be any chain in T – a totally ordered subset of T . If S = ∪j∈J Tj , then S
contains F and if {S1 , ..., Sn } is any finite collection of members of S then
{S1 , ..., Sn } ⊆ Tj0 for some j0 since {Tj }j∈J is ordered by inclusion. Now Tj0
enjoys the FIP. So ∩nj=1 Sj 6= ∅; that is to say, S has the FIP but also each
chain in S has an upper bound. From Zorn’s lemma we conclude that S has
a maximal element, whence establishing the desired result.
Using this lemma, we can derive the following Andrey Nikolayevich Ty-
chonoff’s theorem.
Theorem 4.40. Let {(Xj , Tj )}j∈I be a family of compact topological
Q spaces
indexed by the set I. Then their product topological space (X = j∈I Xj , T )
is compact.
Proof. By forming the contrapositive statement and taking complements, we
immediately see that (X, T ) is compact if and only if any collection of closed
subsets of (X, T ) obeying the FIP has a nonempty intersection. For this rea-
son, we are about to make the following consideration where Tj and T will be
dropped from (Xj , Tj ) and (X, T ).
Suppose F is a collection of closed subsets of X which enjoys the FIP. By
Lemma 4.39 we can extend F to G which is maximal with respect to having
the FIP. Moreover, for each j ∈ I the family {pj (B) : B ∈ G} of subsets of
Xj has the FIP since for any finite collection {B1 , ..., Bn } of elements in G
one has
96 4 Banach Spaces via Operators and Functionals
n
\ n
\ n
\
∅=
6 pj Bk ⊆ pj (Bk ) ⊆ pj (Bk ).
k=1 k=1 k=1
Accordingly, xjk ∈ Ojk and thus Ojk ∩pjk (B) 6= ∅ for all B ∈ G. Using the fact
that Ojk is open and pjk is a projection, we can readily find Ojk ∩ pjk (B) 6= ∅
and hence p−1 jk (Ojk ) ∩ B 6= ∅ for all B ∈ G. Because G is maximal with re-
spect to enjoying the FIP, we must then have p−1 jk (Ojk ) ∈ G and similarly
Tn −1
p
k=1 jk (Ojk ) ∈ G, whence obtaining O ∈ G. From this we see that O inter-
sects every element of G and so that f ∈ B for all B ∈ G due to f ∈ O and O
being an arbitrary open set.
Now, we are ready to discuss the well-known theorem of Leonidas Alaoglu
(also known as the Banach-Alaoglu theorem).
Theorem 4.41. Let (X, k · kX ) be a normed linear space over F. Then the
X∗
closed unit ball B 1 (0) = {F ∈ X ∗ : kF k ≤ 1} of X ∗ is weak* compact.
Proof. For x ∈ X let Dx = {λ ∈ F : |λ| ≤ kxkX }. Note that |F (x)| ≤ kxkX for
X∗ X∗
all F ∈ B 1 (0). So B 1 (0) is a subset of the set of all functions f : X → Dx ,
X∗ Q Q
i.e., B 1 (0) ⊆ x∈X Dx . The product topology on x∈X Dx has as open sets
unions of neighborhoods of the form
Proof. Recall
X
B 1 (0) = {x ∈ X : kxkX ≤ 1}.
Since X is reflexive, we conclude
X X
x ∈ X ∗∗ : x ∈ B 1 (0)}
B 1 (0) = {b
d
is exactly the closed unit ball of X ∗∗ . From Theorem 4.41 it follows that
X
d X
B 1 (0) is weak* compact. But since X is reflexive, this says that B 1 (0) is
weakly compact. We are done.
Definition 4.43. Let (X, k · kX ) and (Y, k · kY ) be normed linear spaces over
F. Then a linear operator T : X → Y is called compact provided the closure
T (B) is a compact subset of Y for each bounded subset B of X.
98 4 Banach Spaces via Operators and Functionals
Here we should note that Definition 4.43 does not require T to be continu-
ous or bounded. Nevertheless, the following result tells us that the continuity
or the boundedness of T is an immediate consequence of the definition.
Theorem 4.44. Let (X, k · kX ) and (Y, k · kY ) be normed linear spaces over
F. If a linear operator T : X → Y is compact then it is bounded.
X
Proof. In this case, T (B 1 ) is compact for the closed unit ball
X
B 1 = {x ∈ X : kxkX ≤ 1},
Using the criterion in Remark 4.45 (iii), we obtain that the compactness
is preserved under uniform limits in the Banach space setting.
Tn (xk )∞
k=1 = x1 , x2 /2, · · · , xn /n, 0, · · · , n ∈ N.
It is clear that each Tn `p (N, F) is finite dimensional and so that Tn is
compact. Note that for any x = (xk )∞k=1 ∈ `p (N, F),
1
P∞ |xk |p p k(xk )kp
∞
≤ n+1 , p ∈ [1, ∞)
k(T − Tn ) (xk )k=1 kp = k=n+1 k p
into
C([a, b], F)∗ = BV ([a, b], F)/BV0 ([a, b], F)
by Theorem 4.48, where
Z d
T ∗ (g)(y) = k(x, y)dg(x), y ∈ [a, b],
c
and thus get that if B is a bounded subset of C([a, b], F); that is, there is a
constant M2 > 0 such that kf k∞ ≤ M2 for all f ∈ B, then
Z b
kT (f )k∞ = sup k(x, y)f (y)dy ≤ M1 M2 (b − a).
x∈[c,d] a
This means that T (B) is bounded in C([c, d], F). Since k(·, ·) is indeed uni-
formly continuous on [c, d] × [a, b], we see that for any > 0 there is a δ > 0
such that
4.5 Compact and Dual Operators 101
x1 , x2 ∈ [c, d] and |x1 − x2 | < δ ⇒ |k(x1 , y) − k(x2 , y)| < .
M2 (b − a)
for all f ∈ C([a, b], F). By Theorem 2.23 we can now conclude that the closure
T (B) is compact in C([c, d], F) and so that T is a compact operator.
Problems
4.1. Prove that if C n [0, 1] is the class of all functions f : [0, 1] → R for which
the derivatives f (0) , f (1) ,..., f (n) belong to C[0, 1] and satisfy
4.2. Prove that if (X, k · kX ) and (Y, k · kY ) are Banach spaces over F then so
is X × Y under the norm k(x, y)k = max{kxkX , kykY } for (x, y) ∈ X × Y .
4.5. Show by an example that the uniform bounded principle does not hold
once the completeness is dropped.
4.7.
(i) Prove
Z 2π
sin(n + 1 )x sin x −1 dx = ∞.
lim
n→∞ 0 2 2
(ii) Given a Riemann-integrable function f : [0, 2π] → R, let its Fourier series
be
∞ Z 2π
X 1
s(x) = am e imx
where am = f (y)e−imy dy.
m=−∞
2π 0
102 4 Banach Spaces via Operators and Functionals
1
Z 2π sin(n + 1 )y
2
sn (x) = f (x + y) dy for all x ∈ (0, 2π).
2π 0 sin y2
(iii) Let X be the Banach space of continuous functions f : [0, 2π] → R with
f (0) = f (2π), with the sup-norm. Prove that the linear operator Tn : X → R
defined by
sin(n + 12 )x
Z 2π
1
Tn (f ) = f (x) dx
2π 0 sin x2
is bounded, and
2π sin(n + 12 )x
Z
1
kTn k = dx.
2π 0
sin x2
(iv) Prove that there exists a continuous function f : [0, 2π] → R with f (0) =
f (2π) such that its Fourier series diverges at x = 0.
4.8. Let X = C 1 [0, 1] and Y = C[0, 1] be equipped with the sup-norm. Prove
the following results:
(i) X is not complete;
(ii) The map (d/dx) : X → Y is closed but not bounded.
4.9. Suppose (X, k · kX ) and (Y, k · kY ) are Banach spaces over F and T ∈
B(X, Y ) satisfying kT (x)kY ≥ ckxkX for all x ∈ X and for some constant
c > 0. Verify the following facts:
(i) N (T ) = {x ∈ X : T (x) = 0} = {0};
(ii) R(T ) = {y ∈ Y : y= T (x) for some x ∈ X} is closed;
(iii) T −1 ∈ B R(T ), X satisfies kT −1 k ≤ c−1 .
4.11. Let (X, k·kX ) be a separable Banach space over F and M be a bounded
subset of X ∗ . Prove that every sequence in M contains a weak* convergent
subsequence.
4.16. Let (X, k · kX ) and (Y, k · kY ) be Banach spaces over F. Prove that if
T : X → Y is a linear mapping such that f ◦ T ∈ X ∗ for all f ∈ Y ∗ then
T ∈ B(X, Y ).
4.17. Prove that a topological space X is compact if and only if any collection
of closed subsets of X obeying the FIP has a nonempty intersection.
4.18. Let (X, k · kX ) and (Y, k · kY ) be normed linear and Banach spaces over
F respectively. Prove that the set Bc (X, Y ) of all compact operators from X
to Y is a Banach space under the usual operator norm for B(X, Y ).
4.21. Let (X, k · kX ) be a Banach space over F. Prove the following three
results:
(i) If T : X → X is the zero operator; that is, T (x) = 0 for all x ∈ X, then T
is compact;
(ii) If T ∈ B(X) is compact and X is infinite dimensional, then T is not
invertible;
(iii) If T ∈ B(X) ensures that T (X) is closed and infinite dimensional, then
T is not compact.
5
Hilbert Spaces and Their Operators
Hilbert spaces, named after David Hilbert, are special Banach spaces which
share most of the rich geometric structure of the finite-dimensional Euclidean
spaces. The geometrical wealth of a Hilbert space is produced by a bilinear
form on the space which allows the introduction of a notion of perpendicu-
larity. This chapter is devoted to an introductory study of Hilbert spaces and
their bounded and compact linear operators.
Proof. (i) If hx, yi = 0, then there is nothing to argue. Suppose now hx, yi =
6 0.
Then x 6= 0 and y 6= 0. Let α = hx, yi and z = αy. Then for t ∈ R we have
0 ≤ hx − tz, x − tzi = kxk2 − 2t|hx, yi|2 + t2 |hx, yi|2 kyk2 .
This ensures that the last quadratic function of t takes its absolute minimum
at t = kyk−2 . Substituting this value for t, we get
0 ≤ kx − tzk2 = kxk2 − kyk−2 |hx, yi|2
with equality if and only if x − tz = x − αty = 0, from which the desired result
is immediate.
(ii) It is obvious that kxk = 0 ⇔ x = 0 and kλxk = |λ|kxk. As for the
triangle inequality, (i) is used to imply
kx + yk2 = kxk2 + 2<hx, yi + kyk2 ≤ (kxk + kyk)2 .
(iii) This follows directly from expanding kx + yk2 and kx − yk2 .
The above definition and theorem actually induce the notion of a Hilbert
space.
Definition 5.3. A Hilbert space X over F is a pre-Hilbert space which is
complete with respect to the metric
p
d(x, y) = kx − yk = hx − y, x − yi for all x, y ∈ X.
Furthermore, we will refer to real or complex Hilbert space according to
whether F is R or C.
Example 5.4.
n
Pn n ∈ N, F is a Hilbert space over F under nthe inner product
(i) For each
hx, yi = j=1 xj yj for x = (x1 , ..., xn ), y = (y1 , ..., yn ) ∈ F .
(ii) Regardless of the cardinality of an arbitrarily given set X, we can define
the sequence space
n X o
`2 (X, F) = f : X → F : |f (x)|2 < ∞ ,
x∈X
where
X nX o
|f (x)|2 = sup |f (x)|2 : all nonempty finite subsets Y of X .
x∈X x∈Y
This space becomes a Hilbert space over F with the inner product
X
hf1 , f2 i = f1 (x)f2 (x) for all f1 , f2 ∈ `2 (X, F).
x∈X
(iii) LRS 2id ([a, b], F), often denoted L2 ([a, b], F), is a Hilbert space over F
under the following inner product
Z
hf1 , f2 i = f1 f2 dmid for all f1 , f2 ∈ LRS 2id ([a, b], F).
[a,b]
The following result tells us when a Banach space becomes a Hilbert space.
Theorem 5.5. Let (X, k · k) be a normed linear space over F and satisfy the
parallelogram law above.
(i) If F = C, then
4
X
hx, yi = 4−1 in kx + in yk2
n=1
and
ku − v + wk2 + ku − v − wk2 = 2ku − vk2 + 2kwk2 .
Hence
This infers
<hu + w, vi + <hu − w, vi = 2<hu, vi.
In a similar fashion, we can establish the relation with the real part < replaced
by the imaginary part =:
Accordingly,
108 5 Hilbert Spaces and Their Operators
hu + w, vi + hu − w, vi = 2hu, vi.
When u = w, we have h2u, vi = 2hu, vi. Taking u + w = x, u − w = y, v = z,
we get Dx + y E
hx, zi + hy, zi = 2 , z = hx + y, zi.
2
To reach the latter, we observe that for any m ∈ N,
as desired.
(ii) Two vectors x, y ∈ X are called orthogonal, denoted x⊥y, provided hx, yi =
0.
(iii) For any subset S of X, S ⊥ = {x ∈ X : hx, yi = 0 for all y ∈ S} is
called the orthogonal complement of S.
The following result, which has a root in Corollary 4.28 and will be used
later, has natural geometrical and finite-dimensional antecedents.
Proof. Let δ = inf y∈M kyk. Then by definition there is a sequence (xj )∞
j=1 in
M such that limj→∞ kxj k = δ. Since M is convex, we conclude 2−1 (xj +xk ) ∈
M and consequently,
Next, we introduce the concept of a direct sum which has implicitly ap-
peared in Corollary 3.34.
Definition 5.8. Let X and Y be two linear subspaces of a given linear space
Z over F. Then Z is said to be the direct sum of X and Y , denoted Z = X ⊕Y ,
provided every z ∈ Z can be expressed uniquely in the form z = x + y where
x ∈ X and y ∈ Y and X ∩ Y = {0}.
x ∈ M ∩ M ⊥ ⇒ hx, xi = 0 ⇒ x = 0,
whence deriving x ∈ M ⊥ .
Finally, we consider the dual space of a Hilbert space. Below is the classical
Riesz representation theorem.
Theorem 5.11. Given a Hilbert space X over F and y ∈ X, define Ly : X →
F by Ly (x) = hx, yi. Then Ly ∈ X ∗ and kLy k = kyk. Conversely, for every
f ∈ X ∗ there exists a unique y ∈ X such that f = Ly .
Proof. Clearly, Ly ∈ X ∗ with kLy k ≤ kyk. Note that
f (x) D f (x) E
x− z∈M and x− z, z = 0.
f (z) f (z)
Accordingly, D z E
hx, zi = f (x) ,z .
f (z)
Taking y = f (z)kzk−2 z, we further get f (x) = hx, yi, and then f = Ly . To
see the uniqueness, assume there is another point w ∈ X such that f = Lw .
Then hx, w − yi = 0 for all x ∈ X, and hence
kw − yk2 = hw − y, w − yi = 0 and w = y.
We are done.
Here is an easy consequence of Theorem 5.11 which says that any Hilbert
space is self-reflexive.
Proof. The first part has just been proved in Theorem 5.11. Furthermore it is
easily seen that L(x1 + x2 ) = L(x1 ) + L(x2 ) so the isometry is additive. Since
Thus kyn+1 k =
6 0 since otherwise we would have
because this is true for {ej }nj=1 and {xj }nj=1 . Finally, for j ∈ {1, 2, ..., n}, we
have
hxn+1 , ej i − hxn+1 , ej ihej , ej i
hen+1 , ej i = = 0,
kyn+1 k
and consequently, the set {ej }∞
j=1 obtained by this inductive construction is
an orthonormal set with the desired property.
where the last equality is obtained by expanding in the usual fashion and
using the orthonormality of {ej }nj=1 .
(ii) This follows from
D n
X E
x− hx, ej iej , ek = hx, ek i − hx, ek ihek , ek i = 0.
j=1
Corollary 5.17. Let I be any index set and {ej }j∈I be an orthonormal set of
a Hilbert space X over F. Then S = {ej : hx, ej i 6= 0} is countable for any
x∈X .
Remark 5.18. Given an arbitrary orthonormal set {ej }j∈I Pindexed by2 a set
I, under Lemma 5.16 and Corollary 5.17, the sums |hx, e i| and
P P∞ 2
P∞ j
j∈I
j∈I hx, e ie
j j can be written as the series k=1 |hx, ejk i| and k=1 hx, ejk iejk
respectively, where we restrict ourselves to the countable number of ejk for
which hx, ejk i =
6 0.
The following result assures that the two series in Remark 5.18 are well
defined.
This, together with the completeness of X, implies the if and only if part of
the theorem. If n = 1 and m → ∞, Pthen the desired equality follows.
∞ P∞
To complete the proof, Passume k=1 |ck |2 < ∞ and let y = l=1 ckl xkl
∞
be a rearrangement of x = k=1 ck xk . Then
and
∞
X
hx, xi = hy, yi = |ck |2 .
k=1
If
m
X m
X
sm = ck x k and tm = ckl xkl ,
k=1 l=1
then
∞
X
hx, yi = lim hsm , tm i = |ck |2 .
m→∞
k=1
Note that hy, xi = hx, yi = hx, yi. So it follows that kx − yk = 0 and hence
x = y. We are done.
Here is a statement of Friedrich Wilhelm Bessel about the coefficients of
an element in a Hilbert space with respect to an orthonormal set.
Theorem 5.20. Let I be any index set and {ej }j∈I be an orthonormal set of
a Hilbert space X over P F. Then
(i) Bessel’s inequality: j∈I |hx, ej i|2 ≤ kxk2 for all x ∈ X.
P
(ii) x − j∈I hx, ej iej , ek = 0 for all x ∈ X and k ∈ I.
Proof. (i) This follows from
X n
X
|hx, ej i|2 = lim |hx, ejk i|2 ≤ kxk2 .
n→∞
j∈I k=1
(ii) Using the continuity of the inner product in its left-hand variable that
follows from the Cauchy-Schwarz inequality, we obtain
D X E DX E
x− hx, ej iej , ek = hx, ek i − hx, ej iej , ek
j∈I j∈I
D n
X E
= hx, ek i − lim hx, ejm iejm , ek
n→∞
m=1
5.3 Orthonormal Sets and Bases 115
n
DX E
= hx, ek i − lim hx, ejm iej , ek
n→∞
m=1
= hx, ek i − hx, ek i = 0.
Theorem 5.20 (ii) induces the concept of an orthonormal basis for a Hilbert
space.
Theorem 5.22. Given an index set I, let {ej }j∈I be an orthonormal set in
a Hilbert space X over F. Then the following statements are equivalent:
(i) {ej }j∈I is an orthonormal basis.
(ii) The closed linear span of {ejP
}j∈I is X.
(iii) Parseval’s identity: kxk2 = j∈I |hx, ej i|2 for all x ∈ X.
Proof. First, we prove that (i) implies (ii) and (iii). If (i) is true, then Defini-
tion 5.21 and Theorem 5.20 (ii) yield
X
x= hx, ej iej for all x ∈ X.
j∈I
This is a contradiction.
Example 5.23.
(i) The set {(1, 0, 0), (0, 1, 0), (0, 0, 1)} forms an orthonormal basis of R3 .
(ii) The set {ej : j ∈ N} with the jth entry of ej ∈ `2 being 1 and others
being 0 forms an orthonormal basis of the real sequence space `2 .
2
(iii) The set {eint }∞
n=−∞ is an orthonormal basis for LRS id ([0, 2π], C). Ac-
2
cordingly, C([0, 2π], C) is dense in LRS id ([0, 2π], C). In fact, this basis is
fundamental to the study of Fourier series, named in honor of Joseph Fourier.
Theorem 5.24.
(i) Every Hilbert space over F has an orthonormal basis.
(ii) Any orthonormal basis in a separable Hilbert space over F is countable.
Proof. (i) Let X be a Hilbert space over F with the inner product h·, ·i and
the norm k · k, and consider the collection E of orthonormal subsets of X. It
is clear that E is nonempty and can be partially ordered S under inclusion. If F
is any totally ordered subcollection of E, the set U = S∈F S is member of E
and an upper bound for F . By Zorn’s lemma there is a maximal orthonormal
set M . Since M is maximal, we conclude from Definition 5.21 that M is an
orthonormal basis – otherwise, there would be a nonzero x0 ∈ X \ M such
that hx0 , ej i = 0 holds for any ej ∈ M . Accordingly, {x0 /kx0 k} ∪ M forms an
orthonormal set which properly contains M – this contradicts the maximality
of M .
(ii) If X is separable, and if {ej }j∈I is an uncountable orthonormal basis
indexed by I, then for j, k ∈ I with j 6= k, we have
{f + Y ∈ H : |||f (·) − eit(·) |||2 < 1/2} and {f + Y ∈ H : |||f (·) − eis(·) |||2 < 1/2}
Corollary 5.26. Any two infinite dimensional separable Hilbert spaces over
F are isometrically isomorphic.
Proof. Suppose X and Y are two such spaces equipped with the inner products
h·, ·iX and h·, ·iY . Theorem 5.24 tells us that there are sequences (xj )∞j=1 and
(yj )∞j=1 that form orthonormal bases for X and Y respectively. If x ∈ X and
y ∈ Y , then
X∞ X∞
x= hx, xj iX xj and y = hy, yj iY yj .
j=1 j=1
This pairing actually produces a new operator on X – see also Theorem 4.48.
The three facts (i)-(ii)-(iii) in Theorem 5.28 below are the required condi-
tions for ∗ to be an involution.
Theorem 5.28. Let X be a Hilbert space over F. Then the operation ∗ maps
B(X) to itself, and has the following properties for all T, S ∈ B(X) and
α, β ∈ F:
(i) (αT + βS)∗ = ᾱT ∗ + β̄S ∗ ;
(ii) (T S)∗ = S ∗ T ∗ ;
(iii) T ∗∗ = T ;
(iv) kT ∗ k = kT k;
(v) kT ∗ T k = kT k2 .
Proof. First of all, we verify that ∗ maps B(X) to B(X). The linearity of T ∗
follows from the following calculation:
kT ∗ k = sup kT ∗ (y)k
kyk=1
≤ sup kT (x)k = kT k.
kxk=1
hx, (αT + βS)∗ (y)i = αhT (x), yi + βhS(x), yi = hx, (ᾱT ∗ + β̄S ∗ )(y)i.
kT ∗ T k ≤ kT ∗ kkT k = kT k2 .
In what follows, we deal with five special types of bounded linear operators
on a given Hilbert space. They are self-adjoint, normal, non-negative, unitary,
and projective operators.
Definition 5.30. Let X be a Hilbert space over F. An operator T ∈ B(X) is
said to be self-adjoint or symmetric provided T = T ∗ .
Example 5.31.
(i) On a finite-dimensional Hilbert space over F, a self-adjoint operator is one
that is its own adjoint, or, equivalently, one whose matrix is Hermitian, where
a Hermitian matrix is one which is equal to its own conjugate transpose.
(ii) Given n ∈ N, the operator T defined by T (f )(x) = xn f (x) is a self-adjoint
operator on LRS 2id ([0, 1], C).
The structure of self-adjoint operators on infinite dimensional Hilbert
spaces is complicated somewhat by the following characterization and its im-
mediate remark.
Theorem 5.32. Let X be a Hilbert space over F and T ∈ B(X).
(i) If T is self-adjoint, then kT k = supkxk=1 |hT (x), xi|.
(ii) If F = C, then T is self-adjoint when and only when hT (x), xi is real for
all x ∈ X. In particular, T = 0 when and only when hT (x), xi = 0 for all
x ∈ X.
Proof. (i) The Cauchy-Schwartz inequality implies kT k ≥ supkxk=1 |hT (x), xi|.
For the reversed estimate, we use the triangle inequality and the parallelogram
law to obtain that for any x ∈ X with kxk = 1,
and
f (x + iy) = f (x) + f (y) + ihT (y), xi − ihT (x), yi.
Since f is real-valued, we conclude from the last two identities that there are
r, s ∈ R such that
This yields
r + is r − is
hT (y), xi = and hT (x), yi = ,
2 2
and thus
hT (y), xi = hT (x), yi = hx, T ∗ (y)i = hT ∗ (y), xi.
Consequently, for any y ∈ X, we have
For the second half of (ii), it is enough to prove the sufficiency. Suppose
hT (x), xi = 0 for all x ∈ X. Then T is self-adjoint, and hence
holds for all y = T (x) ∈ X. This gives T (x) = 0 for all x ∈ X, i.e., T = 0.
Remark 5.33. Unfortunately, Theorem 5.32 (i) is not valid for any real Hilbert
space. For instance, if X = R2 and T (x) = T (x1 , x2 ) = (−x2 , x1 ) for
x = (x1 , x2 ) ∈ R2 then hT (x), xi = 0, 0 6= T ∈ B(R2 ) and T is not self-
adjoint since T ∗ (x) = T ∗ (x1 , x2 ) = (x2 , −x1 ) for x = (x1 , x2 ) ∈ R2 .
Example 5.35.
(i) Any bounded operator T on a given Hilbert space over C always generates
a nonnegative operator T ∗ T since hT ∗ T (x), xi = kT (x)k2 ≥ 0.
(ii) The operator defined in Example 5.31 (ii) is nonnegative.
5.4 Five Special Bounded Operators 121
(iii) The operator T on C2 determined
by T (x1 , x2 ) = (x1 + 2x2 , 2x1 + x2 )
is not nonnegative since T (1, −1) = (−1, 1) and the inner product between
two vectors (−1, 1) and (1, −1) is −2.
According to Theorem 5.32 (ii) and Example 5.35 (ii), any nonnegative
operator is self-adjoint, but not conversely. Here is the basic structure of a
nonnegative operator.
Theorem 5.36. Given a Hilbert space X over C, let I ∈ B(X) be the identity
operator.
(i) If −I ≤ T ≤ I or 0 ≤ T ≤ 0, then kT k ≤ 1 or T = 0.
(ii) If T ≥ 0, then T + I is invertible, and the generalized Cauchy-Schwartz
inequality holds:
p
|hT (x), yi| ≤ hT (x), xihT (y), yi ∀ x, y ∈ X.
Proof. (i) Note that T is self-adjoint. So, I ± T ≥ 0 implies ±hT (x), xi ≤ kxk2
and kT k ≤ 1 by Theorem 5.32 (i). Moreover, ±T ≥ 0 implies hT (x), xi = 0
and kT k = 0 by Theorem 5.32 (i).
(ii) Since T ≥ 0, we conclude
0 ≤ hx + λy, T (x + λy)i
= hx, T (x)i + 2t|hy, T (x)i| + t2 hy, T (y)i,
whence deriving
122 5 Hilbert Spaces and Their Operators
h(T ∗ T − I)(x), xi = 0 ⇒ T ∗ T = I ⇒ T T ∗ = I.
Theorem 5.41. Let X be a Hilbert space over C and T ∈ B(X). Then the
following are equivalent:
(i) T is normal.
(ii) T = T1 + iT2 where T1 and T2 are self-adjoint and T1 T2 = T2 T1 .
(iii) kT (x)k = kT ∗ (x)k for all x ∈ X.
T + T∗ T − T∗
T1 = and T2 = .
2 2i
It is easy to see that T = T1 + iT2 holds with T1 and T2 being self-adjoint and
commutative.
(ii)⇒(iii) Using the given decomposition of T plus T1 T2 = T2 T1 , we obtain
Example 5.42. Let T = 2iI on a given Hilbert space X over C, where I is the
identity operator in B(X). Then T ∗ = −2iI and T T ∗ = T ∗ T = 4I. Thus, T
is normal operator but not unitary nor self-adjoint.
The fifth special operator that we take account of is the projection oper-
ator.
x = y + z, y ∈ M = T (X) and z ∈ M ⊥ .
Then
hT (x), xi = hy, y + zi = hy, yi + hy, zi = hy, yi ≥ 0.
Hence T is nonnegative and the proof is complete.
126 5 Hilbert Spaces and Their Operators
Remark 5.46. Here it is worth remarking that there exists a natural one-to-
one correspondence between projection operators T and direct sum decom-
positions X = M ⊕ N where M and N are closed. As a matter of fact, if T
is a projection, then X = M ⊕ N where M = T (X) and N = (I − T )(X).
Conversely, if X = M ⊕ N , then we define T as was done for the orthogonal
projection but using N instead of M ⊥ . The orthogonal projections are exactly
those that arise when N = M ⊥ , and these are generally the projections of
interest in Hilbert space theory.
Example 5.49.
(i) For the closed interval [0, 1] of R, consider LRS 2id ([0, 1], C) and the differ-
ential operator T = −d2 /dx2 defined on the linear subspace of all complex-
valued functions f having derivatives of any order on [0, 1] and satisfy-
ing the boundary conditions: f (0) = f (1) = 0. Then a computation via
integration-by-parts indicates that T is self-adjoint; its eigenfunctions are
fj (x) = sin(jπx), j ∈ N with the real eigenvalues j 2 π 2 ; the well-known or-
thogonality of the sine functions follows as a consequence of the property of
being self-adjoint.
(ii) σ(T ) is compact. In fact, according to Theorem 4.10 (i), |λ| > kT k implies
The following result, which will be used later, gives much more information
on the spectrum of a self-adjoint operator.
(vi) rσ (T ) = kT k.
Proof. (i) This follows from the fact that T (x) = λx for some x 6= 0 implies
λkxk2 = hT (x), xi ∈ R
kxk2 + kyk2 = 2−1 (kx + yk2 + kx − yk2 ) = 2b2 + 2b−2 kT (z)k2 = 4kT (z)k.
Note that |hT u, ui| ≤ MT kuk2 for any u ∈ X. So, the last equation and a
routine calculation give
T − mT I = −(S − MS I)
rσ (T ) ≥ max{|mT |, |MT |} = kT k.
To reach the main result of this section, we introduce the following termi-
nology.
Remark 5.53.
(i) It is clear that N (T − λI) is invariant under T ; that is, if x ∈ N (T − λI)
then T (x) ∈ N (T − λI) since T (T (x)) = T (λx) = λT (x).
(ii) If T is compact and self-adjoint, then the eigenspace corresponding to each
nonzero eigenvalue must be finite dimensional, but also, there can be at most
finitely many eigenvalues λ obeying |λ| > c for a given constant c > 0. The
former clearly follows from Theorem 5.51 (ii). To see the latter, we assume that
there were an infinite sequence of different eigenvalues (λk )∞k=1 obeying |λk | >
c > 0. If (yk )∞
k=1 is a corresponding sequence of norm one eigenvectors, then by
Theorem 5.50 (ii) we see that (yk )∞ k=1 are orthogonal and hence orthonormal.
But, (yk )∞ k=1 cannot be infinite by Theorem 5.51 (ii). Thus, there can be
only finitely many eigenvalues obeying |λk | > c > 0. Consequently, the set
of nonzero eigenvalues of the operator is at most countable. Moreover, if the
operator has infinitely many eigenvalues, then they must converge to zero
since at most finitely many eigenvalues can satisfy an inequality of the form
|λ| > c > 0.
Theorem 5.54. Let X be a Hilbert space over C, and let T ∈ B(X) be com-
pact and self-adjoint. If (λj )∞
j=1 is an enumeration ofPthe distinct nonzero
∞
eigenvalues of T , then for each x ∈ X one has T (x) = k=1 λk Pk (x), where
each Pk is the orthogonal projection of X onto the eigenspace N (T − λk I).
Proof. From the definition of each Pk and Theorem 5.7 it follows that for
x ∈ X,
kPk (x) − xk = inf{ky − xk : y ∈ N (T − λk I)}.
If {yk,1 , · · · , yk,nk } is an orthonormal basis for N (T
P− λk I), then by Remark
nk
5.53 (ii), and Theorems 5.20 and 5.22, Pk (x) = j=1 hx, yk,j iyk,j . Because
eigenvectors
S∞ corresponding to different eigenvalues are orthogonal, the set S =
k=1 {yk,1 , · · · , yk,nk } is an orthonormal set. The vector
5.5 Compact Operators via the Spectrum 131
∞
X ∞ X
X nk
y= Pk (x) = hx, yk,nk iyk,nk
k=1 k=1 j=1
T (x − y) = U PM ⊥ (x) = 0.
Remark 5.55. The argument for Theorem 5.54 produces a linear subspace M ⊥
of X comprising elements mapped to zero by T . So, we can choose an or-
thonormal basis for M ⊥ , each element of which is an eigenvector of T with
eigenvalue 0. If these new eigenvectors are included in the sequence of eigen-
vectors constructed in Theorem 5.54, we obtain an orthonormal basis {ej } for
X. Note that the new eigenvalues are zero, their inclusion does
P not affect the
convergence of the sequence (λk )∞ k=1 . Consequently, if x = j cj ej ∈ X and
{λj } are all eigenvalues corresponding to {ej }, then
X ∞
X
T (x) = cj λj ej = ck λk Pk (x).
j k=1
In some ways, Theorem 5.54 tells us that the self-adjoint compact operators
in a Hilbert space are similar to the symmetric matrices in Rn . Moreover, as
an interesting consequence of Theorem 5.54, we can approximate any compact
operator on a Hilbert space by the operators of finite rank.
132 5 Hilbert Spaces and Their Operators
Theorem 5.56. Let X be a Hilbert space over C. Then for each compact
operator T ∈ B(X) there is a sequence of operators of finite rank converging
to T in norm.
Proof. First, assume X is separable. Then by Theorem 5.24 we see that X has
a countable orthonormal basis {ek }∞k=1 . For each n ∈ N define the operator
Pn via
Xn
Pn (x) = hx, ek iek , x ∈ X.
k=1
and
lim kPn (x) − xk = 0.
n→∞
Now, if the assertion were false, then there would be a compact operator
T ∈ B(X) and a number δ > 0 such that kT − F k ≥ δ for all operators F of
finite rank. Hence, for each n ∈ N there is an xn ∈ X such that kxn k = 1 and
k(T − Fn )xn k ≥ δ/2, where the operator Fn defined by
n
X
Fn (x) = Pn T (x) = hT (x), ek iek , x ∈ X,
k=1
k(T − Fnj )(xnj )k ≤ k(I − Pnj )(T (xnj ) − w)k + k(I − Pnj )(w)k
≤ kT (xnj ) − wk + k(I − Pnj )(w)k.
The foregoing estimates therefore produce a contradiction: δ/2 > δ/2, which
naturally proves the assertion in the case when X is separable.
Next, assume X is any Hilbert space over C. Set S = T ∗ T . Then S is
self-adjoint and compact. By Theorem 5.54, S can be written as the following
form
X∞
S(x) = λk hx, ek iek , x ∈ X,
k=1
for some orthonormal sequence (ek )∞k=1 . Suppose now that X0 is the subspace
of all linear combinations of elements of the form T n (ek ), n + 1 ∈ N; k ∈ N.
Then X0 is a separable closed linear subspace of X, and T maps X0 into itself.
5.5 Compact Operators via the Spectrum 133
that is, T (z) = 0. Thus, if Gn (x) = Fn (y) for n ∈ N, then Gn is of finite rank
and limn→∞ kGn − T k = 0 thanks to
kGn (x) − T (x)k = kFn (y) − T (y)k = kFn (y) − T̃ (y)k ≤ kFn − T̃ kkxk.
We are done.
To close this section, we use Remark 5.55 to produce the forthcoming
Fredholm’s alternative which arises in the study of the following homogeneous
and inhomogeneous Fredholm integral equations:
Z Z
f (x) − k(x, y)f (y)dmid (y) = 0; f (x) − k(x, y)f (y)dmid (y) = g(x)
[a,b] [a,b]
for given g ∈ LRS 2id ([a, b], C) and |k(·, ·)|2 ∈ LRS id ([a, b] × [a, b]).
Theorem 5.57. Let X be a Hilbert space over C and let T be a self-adjoint
compact linear operator on X.
(i) If the only solution to the homogeneous equation f − T (f ) = 0 is f = 0,
then the inhomogeneous equation f − T (f ) = g has a unique solution f for
each g ∈ X.
(ii) If f − T (f ) = 0 has nonzero solutions, then f − T (f ) = g has a solution
for a given g ∈ X only if hg, f i = 0 for any solution f to f − T (f ) = 0, in
which case f − T (f ) = g has infinitely many solutions, the difference of any
two of them being a solution of f − T (f ) = 0.
Proof. (i) From Remark 5.55 it follows that X has an orthonormal basis
{ej } comprising
P eigenvectors of T with eigenvalues {λj }. For g ∈ X, let
g =P j cj ej , and then search a solution to f − T (f ) = g in the form
f = j dj ej . Then
X X X
dj ej − cj λj ej = cj ej
j j j
Example 5.58. On LRS 2id ([0, 1], C) define T (f )(x) = (1 − x)f (x). Then T is
self-adjoint but not compact. The homogeneous equation f − T (f ) has no
solution other than f = 0. Nevertheless, the inhomogeneous equation f −
T (f ) = g has no solution in LRS 2id ([0, 1], C) for any nonconstant function
g ∈ LRS 2id ([0, 1], C) – indeed, the solution to f −T (f ) = g is then proportional
to 1/x, x ∈ (0, 1], which is certainly not in LRS 2id ([0, 1], C).
Problems
5.1. Equip C([−1, 1], C) with the 2-norm and the inner product
Z 1
hf, gi = f (x)g(x)dx.
−1
Prove that C([−1, 1], C) is an inner product space but not a Hilbert space.
5.2. Equip `p (N, C), 1 ≤ p < ∞, with the p-norm. Prove that `2 (N, C) is a
Hilbert space under the inner product
∞
X
hx, yi = xj yj for all x = (xj )∞ ∞
j=1 , y = (yj )j=1 ∈ `2 (N, C),
j=1
(i) Let
Find M ⊥ .
(ii) Let
and
5.6. Equip LRS 2id ([−π, π], C) with the inner product hf, gi = [−π,π] f gdmid .
R
Prove ∞
1
(i) (2π)− 2 einx n=−∞ is an orthonormal set.
(ii)
1 1 1 1 1
(2π)− 2 , π − 2 cos t, π − 2 sin t, π − 2 cos 2t, π − 2 sin 2t, ...}
is a Hilbert-Schmidt operator from LRS 2id ((c, d), C) to LRS 2id ((a, b), C) and
therefore compact.
where h·, ·iX and h·, ·iY stand for the inner products equipped with X and Y
respectively. Prove the following results:
(i) T is a bounded operator when and only when supk∈N |ck | < ∞.
(ii) T is a compact operator when and only when limk→∞ P |ck | = 0.
∞
(iii) T is a Hilbert-Schmidt operator when and only when k=1 |ck |2 < ∞.
(iv) T is an operator of finite rank when and only when there is an N ∈ N
such that |cn | = 0 for n > N .