C.E. Chidume∗
International Centre for Theoretical Physics
Trieste, Italy
Permanent address: Department of Mathematics, University of Nigeria, Nsukka,
To my wife, Ifeoma
and our children
Chukwudi, Uzoamaka, Kenechukwu and Okechukwu.
variety of applications.
The aim of this book is, in summary, to present the above results of Kakutani
and Eberlein-Smul’yan. Furthermore, we shall present two important applications
of these theorems: one, a fundamental theorem in optimization theory, and the
other, the Browder-Göhde-Kirk fixed point theorem for nonexpansive mappings.
In order to understand these theorems of Kakutani and Eberlein-Smul’yan, one
requires a reasonably good knowledge of the weak topology; and to understand the
weak topology and prove some of its basic theorems, one requires virtually all the
fundamental theorems of functional analysis.
Most of the existing books in functional analysis cover a lot more than these fun-
damental theorems (including applications in several diverse fields) and the reader
is left to choose what he or she wants to study. In the Preface of his recent excellent
book on functional analysis, J.B. Conway wrote, “Functional analysis has grown so
much that you can find two mathematicians who claim to be functional analysts but
they do not understand each other”. But whatever direction one wants to pursue,
the following topics covered in this book are fundamental:
Normed Linear Spaces; Hilbert Spaces; Linear Maps; Hahn Banach Theo-
rem (Analytic and Geometric Forms); Uniform Boundedness Principle; Open
Mapping and Closed Graph Theorems; Weak and Weak∗ Topologies; Reflexive
Spaces; the Banach Alaoglu Theorem; the Theorem of Kakutani characterizing
reflexive spaces; Uniformly Convex Spaces, Milman-Pettis Theorem; Eberlein-
Smul’yan Theorem.
I would like to express my profound gratitude to my students over the last
fifteen years or so for their interest in the subject and their numerous questions and
comments which helped me to improve on the exposition. I would like to thank
specially Dr. Habtu Zegeye, my former student, for reading the manuscript, for his
very useful comments and criticisms and for typesetting the final version of the book.
I am also grateful to my former student, Dr. Ani Udomene for typesetting work
and for helping with drawing the pictures, especially those in chapter 10. My son,
Chu-chu, who is a mathematician, read the final version of the book, made useful
suggestions and produced solutions to many of the exercises. I am grateful to him.
Special thanks go to my wife and our children for their patience, understanding and
encouragement. I would also like to thank the University of Nigeria, Nsukka, Nigeria
for granting me leave of absence during which I worked at the International Centre
for Theoretical Physics, Trieste, Italy and had adequate time to concentrate on the
book. It is my pleasure to thank the International Centre for Theoretical Physics
for its hospitality. In particular, my special thanks go to the Publications Section of
the ICTP for the excellent job of typing the original manuscript.
C.E. Chidume
Trieste, 2006
2 Linear Maps
2.1 Definition and some examples
2.2 A basic result concerning linear maps
2.3 Bounded Linear Maps
2.4 Equivalent norms and finite dimensional spaces
5 Open Mapping and Closed Graph Theorems
5.1 The Open Mapping Theorem
5.2 The Closed Graph Theorem
5.2.1 Closed Maps, Closed Graphs
5.2.2 The Closed Graph Theorem
5.3 Some Applications
6 Hilbert Spaces
6.1 Definition and Examples
6.2 Basic Properties
6.3 Completeness of an Inner Product Space
6.4 Jordan Von Neumann Theorem
6.5 Orthonormal Sets
6.6 The Projection Theorem
6.7 The Riesz Representation Theorem
6.8 Complete Orthonormal Sets
10 Application in Optimization
10.1 Convex Sets
10.2 Convex Functions
10.2.1 Notations and further definitions
10.3 Strictly Convex Functions
10.4 Lower Semi-continuity
10.5 A fundamental theorem of optimization
||.|| : X → [0, ∞)
Fig. 1.1
Verification that $\|\cdot\|_2$ is a norm on $\mathbb{R}^2$ requires the use of the following inequality,
called Schwarz's inequality for finite sums: for $x_i, y_i \in \mathbb{C}$ (where $\mathbb{C}$ denotes
the set of complex numbers), $i = 1, 2, \dots, n$, we have
\[
\sum_{i=1}^{n} |x_i y_i| \le \Big(\sum_{i=1}^{n} |x_i|^2\Big)^{1/2} \Big(\sum_{i=1}^{n} |y_i|^2\Big)^{1/2}.
\]
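\[
\begin{aligned}
\|\bar{x} + \bar{y}\|_2^2 &= \|(x_1 + y_1, x_2 + y_2)\|_2^2 = \sum_{i=1}^{2} (x_i + y_i)^2 \\
&= \sum_{i=1}^{2} (x_i^2 + 2x_i y_i + y_i^2) \le \sum_{i=1}^{2} x_i^2 + 2\sum_{i=1}^{2} |x_i y_i| + \sum_{i=1}^{2} y_i^2 \quad \text{(why?)} \\
&\le \sum_{i=1}^{2} x_i^2 + 2\Big(\sum_{i=1}^{2} x_i^2\Big)^{1/2} \Big(\sum_{i=1}^{2} y_i^2\Big)^{1/2} + \sum_{i=1}^{2} y_i^2 \\
&= A^2 + 2AB + B^2,
\end{aligned}
\]
where $A = \big(\sum_{i=1}^{2} x_i^2\big)^{1/2} = \|\bar{x}\|_2$ and $B = \big(\sum_{i=1}^{2} y_i^2\big)^{1/2} = \|\bar{y}\|_2$. Hence
$\|\bar{x} + \bar{y}\|_2^2 \le (A + B)^2$, that is, $\|\bar{x} + \bar{y}\|_2 \le \|\bar{x}\|_2 + \|\bar{y}\|_2$.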
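(b) Consider the sequence $x_n := \frac{1}{n^2 + 1}$, $n = 1, 2, \dots$. Is $\{x_n\} \in l_1$? Again, we compute,
\[
\sum_{n=1}^{\infty} |x_n| = \sum_{n=1}^{\infty} \frac{1}{n^2 + 1} < \sum_{n=1}^{\infty} \frac{1}{n^2} < \infty,
\]
by the $p$-series test with $p = 2 > 1$. Hence $\{x_n\} \in l_1$.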
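Example 1.5 Let $X = l_2 := \{x = (x_1, x_2, \dots) : x_i \in \mathbb{R},\ \sum_{i=1}^{\infty} |x_i|^2 < \infty\}$.
Let $x = (1, \frac{1}{2}, \frac{1}{3}, \dots)$. Is $x \in l_2$?
To answer this question, we again compute the sum
\[
\sum_{i=1}^{\infty} |x_i|^2 = \sum_{i=1}^{\infty} \frac{1}{i^2} < \infty.
\]
Hence, $x \in l_2$. Recall that this $x$ is not in $l_1$.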
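(b) Consider the sequence $x_n := \frac{1}{n^2 + 1}$, $n = 1, 2, \dots$. Is $\{x_n\} \in l_2$? Again, we compute,
\[
\sum_{n=1}^{\infty} |x_n|^2 \le \sum_{n=1}^{\infty} \frac{1}{(n^2 + 1)^2} < \sum_{n=1}^{\infty} \frac{1}{n^4} < \infty,
\]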
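The membership questions in the examples above can be explored numerically. The following Python sketch is an illustration added here, not part of the original text; the helper name `partial_sum` is hypothetical. Finite partial sums only suggest convergence or divergence, they do not prove it.

```python
import math

def partial_sum(term, n_terms):
    """Return term(1) + term(2) + ... + term(n_terms)."""
    return sum(term(i) for i in range(1, n_terms + 1))

# x = (1, 1/2, 1/3, ...): the l1 partial sums are harmonic numbers and keep
# growing, while the l2 partial sums stay below pi^2 / 6.
harmonic_1e3 = partial_sum(lambda i: 1.0 / i, 10**3)
harmonic_1e5 = partial_sum(lambda i: 1.0 / i, 10**5)
squares_1e5 = partial_sum(lambda i: 1.0 / i**2, 10**5)

# x_n = 1/(n^2 + 1): the l1 partial sums are bounded by the p-series with p = 2.
shifted_1e5 = partial_sum(lambda n: 1.0 / (n**2 + 1), 10**5)

print(harmonic_1e3, harmonic_1e5)    # grows roughly like log(N)
print(squares_1e5, math.pi**2 / 6)   # partial sum next to the known limit
print(shifted_1e5)
```

The growth of the harmonic partial sums between $10^3$ and $10^5$ terms is consistent with $x = (1, \frac{1}{2}, \frac{1}{3}, \dots) \notin l_1$, while the other two sums remain bounded.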
i.e., $\ell_\infty$ is the set of all bounded sequences. If addition and scalar multiplication are
defined componentwise as in Example 1.3, then $\ell_\infty$ is a linear space. More generally,
we have the following proposition.
Example 1.8 The function $\|\cdot\| : \ell_p \to [0, \infty)$ defined by
\[
\|\bar{x}\|_{l_p} = \Big(\sum_{i=1}^{\infty} |x_i|^p\Big)^{1/p},
\]
Proof. The verification that $\|\cdot\|_{l_p}$ satisfies N1 to N3 is easily carried out. The details
are left to the reader (see Exercises 1.4, Problem 4). The condition N3 is verified
by using Hölder's inequality for infinite sums, i.e., if $1 < p < \infty$ and $\frac{1}{p} + \frac{1}{q} = 1$, and
if $a_i, b_i$, $i = 1, 2, \dots$, are complex numbers such that $\{a_i\}_{i=1}^{\infty} \in l_p$ and $\{b_i\}_{i=1}^{\infty} \in l_q$,
then
\[
\sum_{i=1}^{\infty} |a_i b_i| \le \Big(\sum_{i=1}^{\infty} |a_i|^p\Big)^{1/p} \Big(\sum_{i=1}^{\infty} |b_i|^q\Big)^{1/q}.
\]
(The proof of Hölder's inequality is given at the end of this chapter, Section 1.4).
Example 1.10 Let X = (C[a, b]) denote the set of all real-valued functions which
are functions of an independent real variable t and are defined and continuous on
a given closed and bounded interval [a, b]. For arbitrary f, g ∈ C[a, b], let addition
and scalar multiplication be defined as follows:
\[
\|f\|_\infty = \sup_{a \le t \le b} |f(t)|;
\]
\[
\|f\|_1 = \int_a^b |f(t)|\,dt;
\]
and
\[
\|f\|_2 = \Big(\int_a^b |f(t)|^2\,dt\Big)^{1/2}.
\]
Then $(X, \|\cdot\|_\infty)$, $(X, \|\cdot\|_1)$, and $(X, \|\cdot\|_2)$ are normed linear spaces.
The proofs that $\|\cdot\|_\infty$ and $\|\cdot\|_1$ are norms on $C[a, b]$ are trivial and are left as an
easy exercise for the reader. For $\|\cdot\|_2$, see Exercises 1.1, Problem 7, where generous
hints are supplied.
Remark 1.13 We first note that c and c0 are clearly proper subspaces of l∞ . Some
other important examples of normed linear spaces are given in Exercises 1.1 and
more will be introduced in what follows.
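1. Let $\mathbb{R}^n := \{x = (x_1, x_2, \dots, x_n),\ x_i \in \mathbb{R}\}$ with addition and scalar multiplication
defined componentwise. Let the real-valued functions $\|\cdot\|_\infty$, $\|\cdot\|_1$ and $\|\cdot\|_2$
be defined on $\mathbb{R}^n$ as follows:
\[
\|x\|_1 = \sum_{i=1}^{n} |x_i|;
\]
\[
\|x\|_2 = \Big\{\sum_{i=1}^{n} |x_i|^2\Big\}^{1/2}.
\]
(b) Verify that $(\mathbb{R}^n, \|\cdot\|_\infty)$, $(\mathbb{R}^n, \|\cdot\|_1)$ and $(\mathbb{R}^n, \|\cdot\|_2)$ are normed linear spaces.
(The space $(\mathbb{R}^n, \|\cdot\|_2)$ is generally called the Euclidean space and the space
$(\mathbb{R}^n, \|\cdot\|_\infty)$ is sometimes denoted by $\ell^n_\infty$.)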
2. Let $\mathbb{R}^n$ be as in Exercises 1.1 (1). Define a real-valued function $\|\cdot\|_p$ ($p \ge 1$)
as follows: for arbitrary $x = (x_1, x_2, \dots, x_n) \in \mathbb{R}^n$,
\[
\|x\|_p = \Big(\sum_{i=1}^{n} |x_i|^p\Big)^{1/p}.
\]
Verify that $\|\cdot\|_p$ is a norm on $\mathbb{R}^n$. The space $(\mathbb{R}^n, \|\cdot\|_p)$ is usually denoted by
$\ell^n_p$.
3. Let $\mathbb{C}^n := \{z = (z_1, z_2, \dots, z_n),\ z_i \in \mathbb{C}\}$ where addition and scalar multiplication
are defined componentwise. Let the real-valued function $\|\cdot\|_2$ be defined on $\mathbb{C}^n$
as follows:
\[
\|z\|_2 = \Big(\sum_{i=1}^{n} |z_i|^2\Big)^{1/2}.
\]
Verify that $(\mathbb{C}^n, \|\cdot\|_2)$ is a normed linear space. This space is usually referred
to as the unitary space.
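4. Verify that the function $\|\cdot\|_{l_p}$ defined in Example 1.8 is a norm on $l_p$.
(Hint: For N3, let $x = \{a_i\}_{i=1}^{\infty} \in l_p$ and $y = \{b_i\}_{i=1}^{\infty} \in l_p$. Then,
\[
\begin{aligned}
\|x + y\|_{l_p}^p &= \sum_{i=1}^{\infty} |a_i + b_i|^p = \sum_{i=1}^{\infty} |a_i + b_i| \cdot |a_i + b_i|^{p-1} \\
&\le \sum_{i=1}^{\infty} |a_i| \cdot |a_i + b_i|^{p-1} + \sum_{i=1}^{\infty} |b_i| \cdot |a_i + b_i|^{p-1}.
\end{aligned}
\]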
Verify that X, with addition and scalar multiplication as in Example 1.10, is
a normed linear space.
6. Let X = C[a, b] be the space of all real-valued continuous functions on [a, b].
For arbitrary f ∈ X, define
\[
\|f\| = \int_a^b |f(t)|\,dt.
\]
Verify that norm is a convex function. What can you say about ||.||2 ?
1.2 Completeness
In this section we shall consider the concept of completeness in normed linear spaces.
For this, it is more convenient to work in metric spaces with which the reader is prob-
ably very familiar. (For a quick review of metric spaces the reader may consult [9]).
We first observe that if $(E, \|\cdot\|)$ is a normed linear space, the norm $\|\cdot\|$ always
induces a metric $\rho$ on $E$ given by $\rho(x, y) = \|x - y\|$ for each $x, y \in E$. With this,
$(E, \rho)$ becomes a metric space. Recall that a sequence $\{x_n\}$ in a metric space $(E, \rho)$
is called Cauchy if $\rho(x_n, x_m) \to 0$ as $n, m \to \infty$, and that a metric space $(E, \rho)$
is said to be complete if every Cauchy sequence in E converges to an element of
E. Completeness is a very important concept in functional analysis and this will
become more evident in subsequent chapters.
Step (a) is easily stated but generally requires some trick to handle. To construct
$x^*$ mentioned in (a), one generally uses the fact that the given sequence $\{x_n\}_{n=1}^{\infty}$
is Cauchy to generate a Cauchy sequence in a complete space that is normally
associated with the normed linear space under consideration. (This complete space is
usually the reals, $\mathbb{R}$, or the complex numbers, $\mathbb{C}$). Once $x^*$ is constructed, steps (b)
and (c) are generally not too difficult to complete. We give examples to illustrate
the above steps. In the sequel we shall assume the well known fact that $\mathbb{R}$ and $\mathbb{C}$
are complete.
Proof. Recall that $l_2 := \big\{x = (x_1, x_2, \dots) : \sum_{i=1}^{\infty} |x_i|^2 < \infty\big\}$.
Let $\{x^{(m)}\}_{m=1}^{\infty}$ be a Cauchy sequence in $l_2$ where,
\[
\begin{array}{rcl}
x^{(1)} &=& (x_1^{(1)}, x_2^{(1)}, x_3^{(1)}, \dots, x_n^{(1)}, \dots) \\
x^{(2)} &=& (x_1^{(2)}, x_2^{(2)}, x_3^{(2)}, \dots, x_n^{(2)}, \dots) \\
&\vdots& \\
x^{(m)} &=& (x_1^{(m)}, x_2^{(m)}, x_3^{(m)}, \dots, x_n^{(m)}, \dots) \\
&\vdots&
\end{array}
\tag{1.1}
\]
$\{x^{(m)}\}_{m=1}^{\infty}$ is Cauchy implies, given $\varepsilon > 0$, there exists an integer $N > 0$ such that
for all $m, t \ge N$,
\[
\rho(x^{(m)}, x^{(t)}) < \varepsilon.
\]
This implies, according to the definition of the norm in $l_2$, that
\[
\|x^{(m)} - x^{(t)}\|^2 = \sum_{i=1}^{\infty} \big|x_i^{(m)} - x_i^{(t)}\big|^2 < \varepsilon^2.
\tag{1.2}
\]
These inequalities imply that each column in the array (1.1) is Cauchy. Since each
column is a sequence in $\mathbb{R}$, and $\mathbb{R}$ is complete, it follows that each column converges
to a point in $\mathbb{R}$. For each $i$, let $x_i^{(m)} \to x_i^*$ as $m \to \infty$ (see array
(1.3) below).
\[
\begin{array}{rcccccc}
x^{(1)} = & (x_1^{(1)}, & x_2^{(1)}, & x_3^{(1)}, & \dots, & x_n^{(1)}, & \dots) \\
x^{(2)} = & (x_1^{(2)}, & x_2^{(2)}, & x_3^{(2)}, & \dots, & x_n^{(2)}, & \dots) \\
\vdots & & & & & & \\
x^{(m)} = & (x_1^{(m)}, & x_2^{(m)}, & x_3^{(m)}, & \dots, & x_n^{(m)}, & \dots) \\
\vdots & & & & & & \\
x^{(t)} = & (x_1^{(t)}, & x_2^{(t)}, & x_3^{(t)}, & \dots, & x_n^{(t)}, & \dots) \\
\vdots & \vdots & \vdots & \vdots & & \vdots & \\
 & \downarrow & \downarrow & \downarrow & & \downarrow & \\
 & (x_1^*, & x_2^*, & x_3^*, & \dots, & x_n^*, & \dots)
\end{array}
\tag{1.3}
\]
Now define $x^* = (x_1^*, x_2^*, x_3^*, \dots, x_n^*, \dots)$. (This completes step (a)).
This is not easy to show. So, we employ the following strategy. Using the Cauchy
condition, we show that $x^{(n)} - x^* \in l_2$. We know $x^{(n)} \in l_2$. Since $l_2$ is a linear
(vector) space, we have $x^{(n)} - (x^{(n)} - x^*) \equiv x^* \in l_2$. We proceed to do this.
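Let $k \in \mathbb{N}$ be arbitrary. It follows from the Cauchy condition (1.2) that for
$m, t \ge N$, we have
\[
\sum_{i=1}^{k} \big|x_i^{(m)} - x_i^{(t)}\big|^2 \le \sum_{i=1}^{\infty} \big|x_i^{(m)} - x_i^{(t)}\big|^2 < \varepsilon^2.
\tag{1.4}
\]
Let $m$ be fixed, and let $t \to \infty$ in inequality (1.4) (a finite sum). Then, we get for
every $m \ge N$,
\[
\sum_{i=1}^{k} \big|x_i^{(m)} - x_i^*\big|^2 \le \varepsilon^2.
\]
Since $k \in \mathbb{N}$ is arbitrary, letting $k \to \infty$ yields, for every $m \ge N$,
\[
\sum_{i=1}^{\infty} \big|x_i^{(m)} - x_i^*\big|^2 \le \varepsilon^2.
\tag{1.5}
\]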
Now from (1.5), we have for $m = N$, $x^{(N)} - x^* \in l_2$ and since $x^{(N)} \in l_2$, we have
$x^* = x^{(N)} - (x^{(N)} - x^*) \in l_2$ (because $l_2$ is a linear space). This completes step (b).
Finally, from inequality (1.5), there exists a positive integer $N$ such that
$\|x^{(m)} - x^*\| \le \varepsilon$ for all $m \ge N$. So, $x^{(m)} \to x^*$ in $l_2$. The proof is complete.
Theorem 1.15 C[a, b] (endowed with “sup norm” in Example 1.10) is complete.
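Proof. Let $\{f_n\}_{n=1}^{\infty}$ be a Cauchy sequence in $C[a, b]$. Given $\varepsilon > 0$, there
exists a positive integer $N$ such that, for each $t \in [a, b]$ and all $n, m \ge N$,
\[
|f_n(t) - f_m(t)| \le \sup_{a \le s \le b} |f_n(s) - f_m(s)| = \|f_n - f_m\| < \varepsilon.
\]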
Hence, for any fixed $t_0 \in [a, b]$, $|f_n(t_0) - f_m(t_0)| < \varepsilon$ for all $n, m \ge N$. This
shows that $\{f_n(t_0)\}_{n=1}^{\infty}$ is a Cauchy sequence of real numbers. Since $\mathbb{R}$ is complete,
$\{f_n(t_0)\}$ converges to a real number, say, $f(t_0)$ as $n \to \infty$, i.e.,
\[
f_n(t_0) \to f(t_0) \quad \text{as } n \to \infty.
\]
This is the same as saying that the sequence of functions $f_n$ converges pointwise to the function $f$.
We prove now that this pointwise convergence is actually uniform in t ∈ [a, b], i.e.,
given ε > 0, there is an integer N ∗ such that sup |fn (t) − f (t)| < ε for all n ≥ N ∗ .
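Now, for any $\varepsilon > 0$, there exists an integer $N > 0$ such that for all $n, m \ge N$, we have
\[
\|f_n - f_m\|_{C[a,b]} < \varepsilon.
\]
This means, for $n, m \ge N$, we have $|f_n(t) - f_m(t)| < \varepsilon$ for all $t \in [a, b]$. Letting
$m \to \infty$, we obtain $|f_n(t) - f(t)| \le \varepsilon$ for all $t \in [a, b]$ and all $n \ge N$,
and so the convergence is uniform. This implies $f \in C[a, b]$ and completes the proof.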
Example 1.17 Let X = C[a, b] be the space of all continuous real–valued functions
defined on the closed and bounded interval [a, b] with norm, k · k, given by
\[
\|f\| = \int_a^b |f(t)|\,dt, \quad t \in [a, b],\ f \in C[a, b].
\]
We show that the space X endowed with this norm is NOT complete.
It suffices to produce a Cauchy sequence in C[a, b] which does not converge to an
element of C[a, b]. Without loss of generality we may take [a, b] = [−1, 1]. Consider
the function $f_n$ given by
\[
f_n(t) =
\begin{cases}
0, & \text{if } -1 \le t \le 0; \\
nt, & \text{if } 0 \le t \le \frac{1}{n}; \\
1, & \text{if } \frac{1}{n} \le t \le 1,
\end{cases}
\]
and sketched in Fig. 1.2. Observe that the expression for $f_n(t)$ is trivially obtained
as follows. The function is 0 for all values of $t$ between $-1$ and 0. Then, for values of
$t$ between 0 and $\frac{1}{n}$, we use the equation of a straight line studied in high school with
the coordinates of $O$ as $(0, 0)$ and the coordinates of $A$ as $(\frac{1}{n}, 1)$. Thus the slope of
$OA$ is $n$ so that the equation of the line $OA$ is $y = nt$. For values of $t$ between $\frac{1}{n}$
and 1, the function is constant with value 1. Observe that no inequality is strict.
This guarantees (by the Pasting Lemma) that $f_n$ is continuous.
Fig. 1.2
\[
\|f_n - f_m\| = \int_{-1}^{1} |f_n(t) - f_m(t)|\,dt.
\]
Fig. 1.3
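But this integral simply represents the area between $f_n$ and $f_m$ and from Fig. 1.3,
we have (taking $m > n$)
\[
\|f_n - f_m\| = \int_{-1}^{1} |f_n(t) - f_m(t)|\,dt = \text{area of Triangle } OCD = \frac{1}{2}\Big(\frac{1}{n} - \frac{1}{m}\Big) \to 0,
\]
as $n, m \to \infty$. Hence $\{f_n\}$ is Cauchy. We can also evaluate $\int_{-1}^{1} |f_n(t) - f_m(t)|\,dt$
directly as follows:
\[
\begin{aligned}
\|f_n - f_m\| &= \int_{-1}^{1} |f_n(t) - f_m(t)|\,dt \\
&= \int_{-1}^{0} |0 - 0|\,dt + \int_{0}^{1/m} (mt - nt)\,dt + \int_{1/m}^{1/n} |1 - nt|\,dt + \int_{1/n}^{1} |1 - 1|\,dt \\
&= \int_{0}^{1/m} (mt - nt)\,dt + \int_{1/m}^{1/n} (1 - nt)\,dt \to 0 \quad \text{as } m, n \to \infty.
\end{aligned}
\]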
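Now consider the function $f : [-1, 1] \to [0, 1]$ defined by
\[
f(t) =
\begin{cases}
0, & \text{if } -1 \le t \le 0; \\
1, & \text{if } 0 < t \le 1.
\end{cases}
\]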
Fig. 1.4
\[
\|f_n - f\| = \int_{-1}^{1} |f_n(t) - f(t)|\,dt = \text{area of Triangle } OAB = \frac{1}{2}\Big(\frac{1}{n}\Big) \to 0 \quad \text{as } n \to \infty,
\]
(see Fig. 1.4). Thus, the sequence $\{f_n\}_{n=1}^{\infty}$ defined above is a Cauchy sequence in
$C[-1, 1]$ which converges to $f$. However, $f$ is not in $C[-1, 1]$ since $f$ is not continuous
at 0 (Verify this!). Thus, $C[-1, 1]$ is not complete.
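The two triangle areas used in this example can be checked numerically. The Python sketch below is an added illustration, not part of the original text; the function names `f_n`, `f_limit` and `l1_distance` are hypothetical, and the integral is approximated by a midpoint rule, so the printed values agree with the exact areas only up to a small discretization error.

```python
def f_n(n, t):
    """Piecewise-linear function f_n of Example 1.17 on [-1, 1]."""
    if t <= 0.0:
        return 0.0
    if t <= 1.0 / n:
        return n * t
    return 1.0

def f_limit(t):
    """The discontinuous function f: 0 on [-1, 0] and 1 on (0, 1]."""
    return 0.0 if t <= 0.0 else 1.0

def l1_distance(g, h, a=-1.0, b=1.0, steps=200_000):
    """Midpoint-rule approximation of the integral of |g - h| over [a, b]."""
    width = (b - a) / steps
    total = 0.0
    for k in range(steps):
        t = a + (k + 0.5) * width
        total += abs(g(t) - h(t))
    return total * width

n, m = 4, 10
d_nm = l1_distance(lambda t: f_n(n, t), lambda t: f_n(m, t))
d_n = l1_distance(lambda t: f_n(n, t), f_limit)
print(d_nm, 0.5 * (1.0 / n - 1.0 / m))   # numerical value next to (1/2)(1/n - 1/m)
print(d_n, 0.5 / n)                      # numerical value next to (1/2)(1/n)
```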
Remark 1.18 The interval [−1, 1] which is employed in the above example has
been used by several authors in showing that the space C[a, b] is not complete when
endowed with the integral norm. This interval is certainly not crucial. Other intervals
can easily be used.
Example 1.19 Let $X = C[-3, 3]$ with $\|f\|_2 = \Big(\int_{-3}^{3} |f(t)|^2\,dt\Big)^{1/2}$. Then $X$ is not
complete.
Solution. It is easy to see that
\[
\int_{-3}^{3} |f(t)|\,dt \le \sqrt{6}\,\Big(\int_{-3}^{3} |f(t)|^2\,dt\Big)^{1/2}.
\]
(Hint: Use the Cauchy-Schwarz inequality given in Exercises 1.7, Problem 7). It follows
from this inequality and the sequence $\{f_n\}$ in Example 1.17 that $X = C[-3, 3]$ is
not complete. Explain how this follows.
Remark 1.20 Incomplete spaces are not at all worthless. In fact every metric space
(complete or not) can be regarded as a dense subset of some complete metric space.
This complete metric space is called the completion of the incomplete space. Given
a metric space (M, ρ) (not necessarily complete) its completion can be constructed.
This process, however, requires the notions of linear maps and isometries which we
have so far not treated. We have therefore treated the question of completion of a
metric space as an Appendix at the end of this book.
With these definitions, we now state the following theorem whose proof the reader
is assumed to be familiar with (see Exercises 1.2, Problem 8).
1. Let X = C[−2, 2] be the space of real–valued continuous functions on the
closed and bounded interval [−2, 2], and let X be endowed with the norms
(i) $\|f\|_1 = \int_{-2}^{2} |f(t)|\,dt$, and
(ii) $\|f\|_2 = \Big(\int_{-2}^{2} |f(t)|^2\,dt\Big)^{1/2}$,
for arbitrary f in X. Justify the following statements:
(i) (X, k · k1 ) is not complete;
(ii) (X, k · k2 ) is not complete.
2. Let $X = C[0, 4]$, with the usual notation, and suppose $X$ is endowed with the norm
\[
\|f\|_2 = \Big(\int_{0}^{4} |f(t)|^2\,dt\Big)^{1/2}.
\]
Fig. 1.5
(ii) $\{g_n\}_{n=1}^{\infty}$ is a Cauchy sequence;
(iii) $g_n \to g$ as $n \to \infty$, where
\[
g(t) =
\begin{cases}
0, & 0 \le t \le 2; \\
1, & 2 < t \le 4.
\end{cases}
\]
Conclude that $C[0, 4]$ with the $\|\cdot\|_2$ norm is not a complete space. Give another
proof (similar to Example 1.19) that $C[0, 4]$ with $\|f\|_2 := \big(\int_0^4 (f(t))^2\,dt\big)^{1/2}$ is
not complete.
Fig. 1.6
4. Let $S$ be the set of real sequences having only a finite number of nonzero terms.
Clearly, $S \subseteq l_\infty$. Take $\{x_n\}_{n=1}^{\infty}$ in $S$ where $x_n = \big(1, \frac{1}{2}, \frac{1}{3}, \dots, \frac{1}{n}, 0, 0, \dots\big)$. Prove
(i) $\{x_n\}$ is a Cauchy sequence in $S$; (ii) $x_n \to x$ where $x := \big(1, \frac{1}{2}, \frac{1}{3}, \dots\big)$.
6. Let $X$ be a normed linear space and let $T : X \to X$ be any map. The map $T$
is called Lipschitz if there exists $k \ge 0$ such that $\|Tx - Ty\| \le k\|x - y\|$
for all $x, y \in X$. The map $T$ is called nonexpansive if $k \le 1$.
Now, let X = c0 (the space of sequences of real numbers which converge to 0).
Define T : c0 → c0 by
7. Let X = l∞ (the space of sequences of real numbers which are bounded). Let
K = {x ∈ l∞ : ||x||∞ ≤ 1}. Define T : K → K by
8. Prove Theorem 1.20. (Hint: For arbitrary x0 ∈ M , define xn+1 = f (xn ), for
all integers n ≥ 0. Prove {xn } is Cauchy and use the continuity of f to obtain
the existence of an x∗ ∈ M such that f (x∗ ) = x∗ . Establish uniqueness by
9. Prove the inequality
\[
\int_{-3}^{3} |f(t)|\,dt \le \sqrt{6}\,\Big(\int_{-3}^{3} |f(t)|^2\,dt\Big)^{1/2}
\]
\[
\sum_{i=1}^{\infty} |x_i y_i| \le \Big(\sum_{i=1}^{\infty} |x_i|^p\Big)^{1/p} \Big(\sum_{i=1}^{\infty} |y_i|^q\Big)^{1/q},
\tag{i}
\]
where $\frac{1}{p} + \frac{1}{q} = 1$.
Fig. 1.7 (the curve $y = x^{p-1}$ with the regions $A_1$ and $A_2$)
By integration we obtain
\[
ab \le \int_0^a x^{p-1}\,dx + \int_0^b y^{q-1}\,dy = A_1 + A_2 = \frac{a^p}{p} + \frac{b^q}{q}.
\tag{ii}
\]
Observe that if $a = 0$ or $b = 0$, this inequality still holds. Let $\{u_i\}$, $\{v_i\}$ be such
that
\[
\sum_{i=1}^{\infty} |u_i|^p = 1, \qquad \sum_{i=1}^{\infty} |v_i|^q = 1.
\tag{iii}
\]
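Define $a = |u_i|$, $b = |v_i|$ and substitute in inequality (ii) to get
\[
|u_i v_i| \le \frac{|u_i|^p}{p} + \frac{|v_i|^q}{q}.
\]
Summing this inequality and using (iii) we obtain,
\[
\sum_{i=1}^{\infty} |u_i v_i| \le \frac{1}{p} \sum_{i=1}^{\infty} |u_i|^p + \frac{1}{q} \sum_{i=1}^{\infty} |v_i|^q = \frac{1}{p} + \frac{1}{q} = 1,
\]
i.e.,
\[
\sum_{i=1}^{\infty} |u_i v_i| \le 1.
\tag{iv}
\]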
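Now, let $x := \{x_i\}$, $y := \{y_i\}$, $x \in \ell_p$, $y \in l_q$, $x \ne 0$, $y \ne 0$. Define
\[
u_i := \frac{x_i}{\Big(\sum_{k=1}^{\infty} |x_k|^p\Big)^{1/p}}, \qquad
v_i := \frac{y_i}{\Big(\sum_{k=1}^{\infty} |y_k|^q\Big)^{1/q}}.
\]
Clearly, $\sum_{i=1}^{\infty} |u_i|^p = 1$, $\sum_{i=1}^{\infty} |v_i|^q = 1$, so that inequality (iv) yields $\sum_{i=1}^{\infty} |u_i v_i| \le 1$,
i.e.,
\[
\sum_{i=1}^{\infty} |x_i y_i| \le \Big(\sum_{i=1}^{\infty} |x_i|^p\Big)^{1/p} \Big(\sum_{i=1}^{\infty} |y_i|^q\Big)^{1/q}.
\]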
Observe that if either x or y is zero this inequality still holds. This completes the
proof of Hölder’s inequality.
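As a sanity check on the two inequalities used in this proof, the following Python sketch (an added illustration; the names `young_gap`, `p_norm` and `holder_gap` are hypothetical) evaluates them on finite sequences. A finite numerical test of this kind illustrates the inequalities and is not a substitute for the proof.

```python
import random

def young_gap(a, b, p):
    """a^p/p + b^q/q - ab for a, b >= 0, with q the conjugate exponent of p."""
    q = p / (p - 1.0)
    return a**p / p + b**q / q - a * b

def p_norm(seq, p):
    """(sum |s_i|^p)^(1/p) for a finite sequence."""
    return sum(abs(s) ** p for s in seq) ** (1.0 / p)

def holder_gap(x, y, p):
    """Right-hand side minus left-hand side of Hölder's inequality."""
    q = p / (p - 1.0)          # conjugate exponent: 1/p + 1/q = 1
    lhs = sum(abs(a * b) for a, b in zip(x, y))
    return p_norm(x, p) * p_norm(y, q) - lhs

random.seed(0)
gaps = []
for p in (1.5, 2.0, 3.0, 7.0):
    x = [random.uniform(-1, 1) for _ in range(50)]
    y = [random.uniform(-1, 1) for _ in range(50)]
    gaps.append(holder_gap(x, y, p))
print(gaps)   # each entry should be nonnegative, up to rounding
```

For $p = 2$ and $x = y$ the gap is zero up to rounding, which corresponds to the equality case of the Cauchy-Schwarz inequality.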
Theorem 1.23 (Cauchy-Schwarz inequality). For arbitrary $x = \{x_i\}_{i=1}^{\infty}$, $y = \{y_i\}_{i=1}^{\infty}$, $x, y \in l_2$, we have,
\[
\sum_{i=1}^{\infty} |x_i y_i| \le \Big(\sum_{i=1}^{\infty} |x_i|^2\Big)^{1/2} \Big(\sum_{i=1}^{\infty} |y_i|^2\Big)^{1/2}.
\]
Proof. Set $p = 2$ in Theorem 1.20 and the result follows.
Linear Maps
Linear maps play a crucial role in Functional Analysis. In this chapter we shall
define linear maps and present some basic results concerning them.
for arbitrary x, y ∈ X and arbitrary scalars α, β ∈ K. Some authors use the term
linear operator or linear transformation instead of linear map. Condition (2.1) is
equivalent to the following two conditions: (i) T (x + y) = T x + T y ∀ x, y ∈ X;
and (ii) T (αx) = αT (x) ∀ x ∈ X, and for each scalar, α.
Definition 2.2 If in Definition 2.1 the linear space Y is replaced by a scalar field
K then the linear map T is called a linear functional on X.
Example 2.3 Differentiation, integration and limits (of convergent sequences) are
examples of linear maps.
Example 2.4 Let $X = \ell_2$. For each $\bar{x} = (x_1, x_2, x_3, \dots)$ in $\ell_2$ define
\[
T\bar{x} = \Big(0, x_1, \frac{x_2}{2}, \frac{x_3}{3}, \dots\Big).
\]
Then $T$ is a linear map on $\ell_2$.
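and
\[
\begin{aligned}
T(\alpha\bar{x} + \beta\bar{y}) &= \Big(0, \frac{\alpha x_1 + \beta y_1}{1}, \frac{\alpha x_2 + \beta y_2}{2}, \frac{\alpha x_3 + \beta y_3}{3}, \dots\Big) \\
&= \Big(0, \alpha x_1, \frac{\alpha x_2}{2}, \frac{\alpha x_3}{3}, \dots\Big) + \Big(0, \beta y_1, \frac{\beta y_2}{2}, \frac{\beta y_3}{3}, \dots\Big) \\
&= \alpha\Big(0, x_1, \frac{x_2}{2}, \frac{x_3}{3}, \dots\Big) + \beta\Big(0, y_1, \frac{y_2}{2}, \frac{y_3}{3}, \dots\Big) \\
&= \alpha T(\bar{x}) + \beta T(\bar{y}),
\end{aligned}
\]
and so $T$ is a linear map.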
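The linearity computation above can be replayed on finite prefixes of sequences. The Python sketch below is an added illustration, not part of the original text; the function names `T` and `combo` are hypothetical, and exact rational arithmetic (`fractions.Fraction`) is used so that both sides can be compared for exact equality.

```python
from fractions import Fraction

def T(x):
    """Apply T(x1, x2, x3, ...) = (0, x1, x2/2, x3/3, ...) to a finite prefix."""
    return [Fraction(0)] + [Fraction(xi) / i for i, xi in enumerate(x, start=1)]

def combo(alpha, x, beta, y):
    """Componentwise alpha*x + beta*y."""
    return [alpha * a + beta * b for a, b in zip(x, y)]

x = [Fraction(1), Fraction(-2), Fraction(3, 4), Fraction(5)]
y = [Fraction(2), Fraction(1, 3), Fraction(0), Fraction(-7)]
alpha, beta = Fraction(3, 2), Fraction(-5)

left = T(combo(alpha, x, beta, y))          # T(alpha x + beta y)
right = combo(alpha, T(x), beta, T(y))      # alpha T(x) + beta T(y)
print(left == right)
```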
Example 2.5 Let $\mathbb{C}$ denote the linear space of complex numbers over $\mathbb{C}$, and define
the map $T : \mathbb{C} \to \mathbb{C}$ by $Tz = \bar{z}$, where $\bar{z}$ denotes the conjugate of $z$. Then $T$ is not
a linear map. (Justify this).
Proposition 2.6 Let X and Y be two linear spaces over a scalar field, K, and let
T : X → Y be a linear map. Then
(i) T (0) = 0;
(ii) The range of T , R(T ) = {y ∈ Y : T x = y for some x ∈ X} is a linear subspace
of Y ;
(iii) T is one-to-one if and only if T (x) = 0 implies x = 0;
(iv) If $T$ is one-to-one, then $T^{-1}$ exists on $R(T)$ and $T^{-1} : R(T) \to X$ is also a
linear map.
Proof. (i) Since T is linear, we have, T (αx) = αT (x) for each x ∈ X and each
scalar α. Take α = 0 and (i) follows immediately.
(ii) We need to show that for y1 , y2 ∈ R(T ) and α, β scalars, αy1 +βy2 ∈ R(T ). Now,
y1 , y2 ∈ R(T ) implies that there exist x1 , x2 ∈ X such that T (x1 ) = y1 , T (x2 ) = y2 .
Moreover, αx1 + βx2 ∈ X (since X is a linear space). Furthermore, by the linearity
of T ,
T (αx1 + βx2 ) = αT (x1 ) + βT (x2 ) = αy1 + βy2 .
Hence αy1 + βy2 ∈ R(T ), and so R(T ) is a linear subspace of Y .
Definition 2.8 Let $X$ and $Y$ be normed linear spaces over a scalar field $K$, and
let $T : X \to Y$ be a linear map. Then $T$ is said to be bounded if there exists some
constant $k \ge 0$ such that for each $x \in X$,
\[
\|Tx\| \le k\|x\|;
\]
the constant $k$ is called a bound for $T$ and in this case, $T$ is called a bounded linear
map.
We now turn our attention to linear maps that are continuous. The notion of
continuity can be stated, for linear maps in several useful equivalent forms. We
state these equivalent forms in the following theorem.
Theorem 2.9 Let X and Y be normed linear spaces and let T : X → Y be a linear
map. Then the following statements are equivalent:
(i) T is continuous;
(ii) T is continuous at the origin (in the sense that if {xn } is a sequence in X such
that xn → 0 as n → ∞, then T xn → 0 in Y as n → ∞);
(iii) $T$ is Lipschitz, i.e., there exists a constant $K \ge 0$ such that, for each $x \in X$,
$\|Tx\| \le K\|x\|$.
(ii) ⇒ (iii) We are given that if $\{x_n\}$ is any sequence in $X$ such that $x_n \to 0$
as $n \to \infty$, then $T(x_n) \to 0$. We want to prove that there exists a constant $K \ge 0$
such that $\|Tx\| \le K\|x\|$ for each $x \in X$. Suppose for contradiction that there is
no such $K$. Then, for any positive integer $n$, there exists some $x = x(n) \in X$, $x(n)$
different from 0, call it $x_n$ (since it depends on $n$) such that
\[
\|T x_n\| > n\|x_n\|.
\]
Define $u_n := \frac{x_n}{n\|x_n\|}$. Then $\|u_n\| = \frac{1}{n}$, so that $u_n \to 0$ as $n \to \infty$.
However, $T u_n \not\to 0$. For,
\[
\|T u_n - 0\| = \Big\|\frac{T(x_n)}{n\|x_n\|}\Big\| = \frac{\|T(x_n)\|}{n\|x_n\|} > 1,
\]
(iii) ⇒ (iv) Given that a linear map T is Lipschitz we want to prove that T (D)
is bounded, (i.e., ||T (x)|| ≤ M for all x ∈ D, and some constant M ≥ 0). Take
y ∈ D arbitrary. Then, ||y|| ≤ 1. By (iii), ||T (y)|| ≤ K||y|| for some constant
K ≥ 0. But ||y|| ≤ 1. So,
||T y|| ≤ K||y|| ≤ K.
Since y was arbitrarily chosen in D, it follows that for all y in D, ||T y|| ≤ K. Now,
take $M = K$ and we are done.
\[
\Big\|T\Big(\frac{x_1 - x_2}{\|x_1 - x_2\|}\Big)\Big\| \le K, \quad \text{or,} \quad \|Tx_1 - Tx_2\| \le K\|x_1 - x_2\|.
\]
If, on the other hand, $x_1 - x_2 = 0$, this inequality clearly also holds. Thus, the
inequality holds for all $x_1, x_2 \in X$. Now, given any $\varepsilon > 0$, choose $\delta = \frac{\varepsilon}{K+1}$. So, if
$\|x_1 - x_2\| < \delta$ we obtain,
\[
\|Tx_1 - Tx_2\| \le K\|x_1 - x_2\| \le K\delta = \Big(\frac{K}{K+1}\Big)\varepsilon < \varepsilon,
\]
Property (iii) of the last theorem is a very important one. In fact, we have the
following definition:
Remark 2.10 In the light of definition 2.8, (and Theorem 2.9) we have that a linear
map T : X → Y is continuous if and only if it is bounded. Thus, for linear maps,
continuity and boundedness (as given in definition 2.8) are equivalent.
Definition 2.12 If T : X → Y is a bounded linear map from a normed linear space
X into a normed linear space Y , we define the norm of T by
Note that from definition 2.12 we obtain immediately that $\|Tx\| \le \|T\|\,\|x\|$ for
each $x \in X$ and that for every $\varepsilon > 0$, there exists $x_\varepsilon \in X$, $x_\varepsilon \ne 0$, such that
$\|Tx_\varepsilon\| > (\|T\| - \varepsilon)\|x_\varepsilon\|$.
Example 2.13 Let X and Y be normed linear spaces and B(X, Y ) denote the
family of all bounded linear maps from X to Y . Define,
Theorem 2.14 Let B(X, Y ) be the family of all bounded linear maps from X to Y .
Then we have the following: For arbitrary T ∈ B(X, Y ),
\[
\|T\| = \sup_{\|x\| \le 1} \|Tx\| = \sup_{\|x\| = 1} \|Tx\| = \sup_{x \ne 0} \frac{\|Tx\|}{\|x\|}.
\]
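\[
\sup_{\|x\| \le 1} \|Tx\| \le \|T\|.
\tag{2.2}
\]
(Otherwise $\|Tx_\varepsilon\| \le (\|T\| - \varepsilon)\|x_\varepsilon\|$ and $\|T\|$ would no longer be the infimum). Let
$u_\varepsilon = \frac{x_\varepsilon}{\|x_\varepsilon\|}$; then $\|u_\varepsilon\| = 1$ and $\|Tu_\varepsilon\| > \|T\| - \varepsilon$.
Consequently, we obtain from inequality (2.2) that
\[
\|T\| \ge \sup_{\|x\| \le 1} \|Tx\| \ge \sup_{\|x\| = 1} \|Tx\| \ge \sup_{x \ne 0} \Big\|T\Big(\frac{x}{\|x\|}\Big)\Big\| > \|T\| - \varepsilon.
\]
Since $\varepsilon > 0$ is arbitrary, it follows that
\[
\|T\| \ge \sup_{\|x\| \le 1} \|Tx\| \ge \sup_{\|x\| = 1} \|Tx\| \ge \sup_{x \ne 0} \frac{\|Tx\|}{\|x\|} \ge \|T\|,
\]
and so equality holds throughout.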
Remark 2.15 Since ||T || is the smallest M such that ||T x|| ≤ M ||x|| for each
x ∈ X, it follows that whatever value M has, we must always have ||T || ≤ M .
Thus, whenever T is a linear map and we have the relation
Proposition 2.17 Let X and Y be normed linear spaces. Then, ||.|| defined by
||T || = sup ||T x|| for each T in B(X, Y ) is a norm on B(X, Y ).
(Hence, B(X, Y ) is a normed linear space with this norm).
Proposition 2.19 Let X, Y, Z be normed linear spaces and let P ∈ B(Y, Z), Q ∈
B(X, Y ). Define (P Q)(x) = P (Qx). Then
(a) P Q ∈ B(X, Z), and
(b) kP Qk ≤ kP k kQk.
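Part (b) can be illustrated in finite dimensions. The Python sketch below is an added example, not part of the original text. It assumes the spaces are $\mathbb{R}^n$ with the max-norm $\|\cdot\|_\infty$, and it uses the standard fact that the operator norm of a matrix with respect to the max-norm equals its largest absolute row sum; the helper names `op_norm_inf` and `matmul` are hypothetical.

```python
def op_norm_inf(A):
    """Operator norm of a matrix on (R^n, max-norm): largest absolute row sum."""
    return max(sum(abs(a) for a in row) for row in A)

def matmul(P, Q):
    """Matrix product PQ, i.e. the matrix of the composition x -> P(Qx)."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]

P = [[1, -2], [0, 3], [4, 1]]        # a map from R^2 to R^3
Q = [[2, 0, -1], [1, 5, 2]]          # a map from R^3 to R^2
PQ = matmul(P, Q)                    # the composition, from R^3 to R^3

print(op_norm_inf(PQ), op_norm_inf(P) * op_norm_inf(Q))
```

In this instance the inequality is strict, which shows that $\|PQ\| = \|P\|\,\|Q\|$ need not hold in general.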
Remark 2.20 If Y = IR (the reals) then B(X, Y ) becomes B(X, IR) and is denoted
by X ∗ and is called the conjugate space or the dual space of X. Thus, f ∈ X ∗ means
that f is a mapping of X into IR, which is linear and bounded. Recall that any map
from a linear space into a scalar field is called a functional. The members of X ∗ are,
therefore, bounded linear functionals. For f ∈ X ∗ we shall define
We now show that for $p > 1$, the dual space of $l_p$ is $l_q$, where $\frac{1}{q} + \frac{1}{p} = 1$.
Proposition 2.21 For $1 < p < \infty$, $\ell_p^* = \ell_q$ where $\frac{1}{p} + \frac{1}{q} = 1$ (i.e., the dual of $\ell_p$ is
$\ell_q$, where $\frac{1}{p} + \frac{1}{q} = 1$, $1 < p < \infty$).
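\[
|f_y(x)| \le \sum_{i=1}^{\infty} |x_i y_i| \le \Big(\sum_{i=1}^{\infty} |x_i|^p\Big)^{1/p} \Big(\sum_{i=1}^{\infty} |y_i|^q\Big)^{1/q} = \|x\|\,\|y\| < \infty.
\]
Hence,
\[
\|f_y\| \le \|y\|.
\]
Linearity of $f_y$ is obvious. Hence, $f_y$ is a bounded linear functional on $\ell_p$ and so
$f_y \in \ell_p^*$.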
\[
e_i = (0, 0, \dots, 0, 1, 0, \dots) \quad \text{(with 1 in the $i$th position)}.
\]
\[
z_h := \{h(e_i)\}_{i=1}^{\infty} = \{\alpha_i\}_{i=1}^{\infty}.
\]
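\[
\begin{aligned}
|h(x_n)| &= \sum_{i=1}^{n} |\alpha_i|^{q-1} (\operatorname{sign} \alpha_i)\,\alpha_i = \sum_{i=1}^{n} |\alpha_i|^q \le \|h\| \cdot \|x_n\| \\
&= \|h\| \cdot \Big\{\sum_{i=1}^{n} |\alpha_i|^{(q-1)p}\Big\}^{1/p} = \|h\| \cdot \Big\{\sum_{i=1}^{n} |\alpha_i|^q\Big\}^{1/p}.
\end{aligned}
\]
Hence, $\Big\{\sum_{i=1}^{n} |\alpha_i|^q\Big\}^{1 - \frac{1}{p}} \le \|h\|$ for all $n = 1, 2, 3, \dots$. Hence $\sum_{i=1}^{\infty} |\alpha_i|^q < \infty$, i.e.,
$z_h := \{\alpha_i\}_{i=1}^{\infty} \in \ell_q$ and $\|z_h\| \le \|h\|$.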
Observe that since $h \in l_p^*$ is arbitrary, if we now take $h = f_y$ (the functional induced
by $y$) then $z_h = y$ and we get that
\[
\|y\| = \|z_h\| = \Big(\sum_{i=1}^{\infty} |\alpha_i|^q\Big)^{1/q} \le \|h\| = \|f_y\|.
\]
Hence, $\|y\| \le \|f_y\|$. But we already have $\|f_y\| \le \|y\|$, so that $\|f_y\| = \|y\|$. This
proves that $\ell_p^*$ is isometrically isomorphic to $\ell_q$. The proof is complete.
Definition 2.22 Suppose $E$ is a vector space over $\mathbb{R}$ and suppose $\|\cdot\|_1$ and $\|\cdot\|_2$ are
two norms on $E$. Then $\|\cdot\|_1$ and $\|\cdot\|_2$ are said to be equivalent (written $\|\cdot\|_1 \sim \|\cdot\|_2$)
if there exist numbers $\alpha > 0$, $\beta > 0$ such that
or equivalently,
\[
\alpha\|x\|_2 \le \|x\|_1 \le \beta\|x\|_2 \quad \forall\, x \in E.
\]
We now prove the following important theorem which is the main theorem of this
Theorem 2.23 Let E be a finite dimensional normed linear space. Then all norms
on E are equivalent.
Proof. Let $\dim E = n < \infty$ and let $\{e_i\}_{i=1}^{n}$ be a basis for $E$. For arbitrary
vector $x \in E$, there exist unique scalars $\{\alpha_i\}_{i=1}^{n}$ such that $x = \sum_{i=1}^{n} \alpha_i e_i$. Define
$\|x\|_0 = \max_{1 \le i \le n} |\alpha_i|$. Clearly, $\|\cdot\|_0$ is a norm on $E$ (Verify). It suffices now to prove
that any norm on $E$ is equivalent to $\|\cdot\|_0$; i.e., that if $\|\cdot\|$ is an arbitrary norm on
$E$, there exist constants $a > 0$, $b > 0$ such that
\[
a\|x\|_0 \le \|x\| \le b\|x\|_0 \quad \forall\, x \in E.
\]
From $x = \sum_{i=1}^{n} \alpha_i e_i$ we obtain, $\|x\| = \Big\|\sum_{i=1}^{n} \alpha_i e_i\Big\| \le \max_{1 \le i \le n} |\alpha_i| \sum_{i=1}^{n} \|e_i\|$. Since $\{e_i\}_{i=1}^{n}$
is a basis, it follows that $\sum_{i=1}^{n} \|e_i\|$ is a positive number. Call it $b$. The last inequality then
becomes
\[
\|x\| \le b\|x\|_0.
\]
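It now only remains to prove $a\|x\|_0 \le \|x\|$. Let $S = \{x \in E : \|x\|_0 = 1\}$. Define the
map
\[
\varphi : (E, \|\cdot\|_0) \to \mathbb{R} \quad \text{by} \quad \varphi(x) = \|x\| \quad \forall\, x \in E.
\]
Observe that $\Big\|\frac{x}{\|x\|_0}\Big\|_0 = 1$ for $x \ne 0$. Moreover, $\varphi$ is continuous. To see this, let $\varepsilon > 0$ be
given. We want to find a $\delta > 0$ such that for arbitrary $x^* \in E$, if $\|x - x^*\|_0 < \delta$ then
$|\varphi(x) - \varphi(x^*)| < \varepsilon$. Recall that in part (a) we proved that $\|x\| \le b\|x\|_0$
for all $x \in E$. Using this we now obtain that
\[
|\varphi(x) - \varphi(x^*)| = \big|\|x\| - \|x^*\|\big| \le \|x - x^*\| \le b\|x - x^*\|_0 \le b\delta < \varepsilon,
\]
where we have chosen $\delta = \frac{\varepsilon}{1+b}$. So, $\varphi$ is continuous. Since $S$ is compact, $\varphi$ attains its
infimum on $S$. Let this infimum be denoted by $a$. Then $0 < a \le \varphi(x) := \|x\|$ for all $x \in S$.
But $\frac{x}{\|x\|_0} \in S$ for $x \ne 0$. So, $0 < a \le \Big\|\frac{x}{\|x\|_0}\Big\|$ for all $x \in E$, $x \ne 0$, and this implies $a\|x\|_0 \le \|x\|$ for all $x \in E$.
Combining this with $\|x\| \le b\|x\|_0$, we obtain that $a\|x\|_0 \le \|x\| \le b\|x\|_0$ for all $x \in E$
as required. □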
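The constants $a$ and $b$ of the proof can be made concrete in a simple case. The Python sketch below is an added illustration, not part of the original text. It assumes $E = \mathbb{R}^n$ with the standard basis, so that $\|\cdot\|_0$ is the max-norm, and takes $\|\cdot\|$ to be either $\|\cdot\|_1$ or $\|\cdot\|_2$; then $b = \sum_{i=1}^{n} \|e_i\| = n$ and $a = 1$ in both cases. The function names are hypothetical, and random sampling only illustrates the inequality.

```python
import math
import random

def norm_max(x):
    """The norm ||x||_0 = max |alpha_i| relative to the standard basis."""
    return max(abs(c) for c in x)

def norm_1(x):
    return sum(abs(c) for c in x)

def norm_2(x):
    return math.sqrt(sum(c * c for c in x))

n = 5
b = float(n)   # sum of ||e_i|| over the standard basis, for both norms below
a = 1.0        # infimum of ||x|| over the unit sphere of the max-norm

random.seed(1)
samples = [[random.uniform(-10, 10) for _ in range(n)] for _ in range(1000)]
tol = 1e-9     # small tolerance for floating-point rounding
ok = all(a * norm_max(x) - tol <= norm(x) <= b * norm_max(x) + tol
         for x in samples for norm in (norm_1, norm_2))
print(ok)
```

For $\|\cdot\|_2$ the constant $b = n$ produced by the proof is not the smallest possible one ($\sqrt{n}$ suffices); the proof only needs some admissible constant.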
1. Prove part (iv) of Proposition 2.6; and also prove Theorem 2.17.
2. Let $X$ be a finite dimensional linear space and let $x_1, x_2, x_3, \dots, x_n$ be a basis.
If $x = \sum_{i=1}^{n} \alpha_i x_i$, define
\[
\|x\| = \Big(\sum_{i=1}^{n} |\alpha_i|^2\Big)^{1/2}.
\]
If $f$ is a bounded linear functional on $X$, compute $\|f\|$. What will $\|f\|$ be if
$\|x\| = \max_i |\alpha_i|$? What will $\|f\|$ be if $\|x\| = \sum_{i=1}^{n} |\alpha_i|$?
11. Prove that any finite dimensional normed linear space is complete.
12. Prove that if E is a normed linear space and M is any finite dimensional
subspace, then M is closed. (Hint: A complete subset of a metric space is
3.1 Introduction
In this chapter we shall study one of the most important theorems in Linear Func-
tional Analysis – the Hahn Banach Theorem, and shall also study some of its many
striking consequences. From now on, all linear spaces are assumed to be real. We
start with the following definition.
Definition 3.1 Let X be a linear space (not necessarily normed), and consider the
p : X → IR
with the following properties: (i) p(x) ≥ 0 for all x ∈ X; (ii) p(x + y) ≤ p(x) + p(y)
for all x, y ∈ X; (iii) p(αx) = αp(x) for all x ∈ X and scalar α, α > 0.
A mapping satisfying all the conditions (i)-(iii) is called a convex functional. A map-
ping satisfying condition (ii) is called subadditive and is called positive homogeneous
if it satisfies (iii). If p satisfies (ii) and (iii), then p is called a sublinear functional.
Definition 3.3 Let M be a proper subset of a real vector space, X. Let f be a map
defined on the subset M into a vector space, W , (see Fig. 3.1).
Fig. 3.1
The mapping f (defined on M ) is said to be extended to X if there exists a map
F : X → W (defined on X into W ) such that F (x) = f (x) for all x ∈ M .
Remark 3.6 If f is a map defined on a subset M of a vector space V , to find an
extension of f to V , in general, is not particularly of interest. What is always of
interest is to establish the existence of an extension F which preserves some or all
of the properties of f. For example, let M = [0, 1) and let f : M → [0, 1] be
defined by $f(x) = \frac{1}{x+1}$ for each x ∈ M. Then, clearly f is continuous on M. If now
we define
$$F_1(x) = \begin{cases} f(x), & \text{if } x \in M; \\ 1, & \text{if } x = 1, \end{cases}$$
it is clear that F1 is an extension of f to [0, 1]. Observe that F1 is not continuous
at x = 1, so that it does not preserve the continuity property of f. If, however, we
define
$$F_2(x) = \begin{cases} f(x), & \text{if } x \in M; \\ \tfrac{1}{2}, & \text{if } x = 1, \end{cases}$$
then F2 is an extension of f which preserves the continuity property of f.
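A quick numerical look at the two extensions of Remark 3.6 may help. The sketch assumes the reading f(x) = 1/(x + 1), which is the reading under which f maps [0, 1) into [0, 1]; the helper `left_gap` is a hypothetical name introduced only for this illustration.

```python
def f(x):
    # Assumed reading of the map in Remark 3.6: f(x) = 1/(x + 1) on M = [0, 1).
    return 1.0 / (x + 1.0)

def F1(x):
    # Extension with value 1 at x = 1.
    return f(x) if x < 1 else 1.0

def F2(x):
    # Extension with value 1/2 at x = 1.
    return f(x) if x < 1 else 0.5

def left_gap(F, steps=12):
    # |F(1) - F(1 - 10^-k)| for k = 1, ..., steps; it tends to 0 exactly when
    # F is continuous from the left at x = 1 along this sequence.
    return [abs(F(1.0) - F(1.0 - 10.0 ** (-k))) for k in range(1, steps + 1)]
```

For F2 the gaps shrink towards 0, while for F1 they stay close to 1/2, which is the jump at x = 1.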
The Hahn Banach Theorem, in its analytic form (there is also a geometric form
which we shall study later in this chapter) is concerned with the extension of a lin-
ear functional defined on a linear subspace to the whole space and preserving certain
properties of the functional. More precisely, we have the following theorem:
Theorem 3.7 (Hahn Banach Theorem, Analytic Form)
Let M be a subspace of a real linear space, V ; p : V → IR a sublinear functional on
V , and f : M → IR a linear functional defined on M such that
f(x) ≤ p(x) ∀ x ∈ M.
Then:
(i) there exists a linear functional F defined on V which is an extension of f;
(ii) F(x) ≤ p(x) for all x ∈ V.
Theorem 3.7 is illustrated pictorially in Fig. 3.4.
[Figure: f : M → ℝ with f(x) ≤ p(x) ∀ x ∈ M, and its extension F to V with F(x) ≤ p(x) ∀ x ∈ V.]
Remark 3.8 Observe that in Theorem 3.7, the linear space V need not be normed.
Consequently, f and F are linear functionals (not necessarily bounded).
A proof of Theorem 3.7 is given at the end of this chapter, Section 3.4. We shall
now obtain, as a consequence of Theorem 3.7, a version of Hahn Banach Theorem
which holds in normed linear spaces X and for bounded linear functionals defined on
subspaces of X. Furthermore, in Section 3.3 we shall give the proofs of two geometric
forms of Hahn Banach Theorem. These geometric forms will be very useful in what
follows. Meanwhile, we consider some consequences of Theorem 3.7.
Proof. The technique of proof is to apply Theorem 3.7. So, we try to obtain the
hypotheses of Theorem 3.7 from those given in Theorem 3.9. If M = {0}, then
f = 0 and the required extension clearly is F = 0. So, we may assume M ≠ {0},
i.e., we may assume that there exists at least one x ∈ M, x ≠ 0. By hypothesis, f is
a bounded linear map on M . This implies
for each x ∈ X.
(i) p(x) ≥ 0 for all x ∈ X; (ii) p(x + y) ≤ p(x) + p(y) for all x, y ∈ X; (iii)
p(αx) = αp(x) for all x ∈ X, α ≥ 0.
The claim is easily verified and so, is left as an easy exercise. Thus, p is a convex
functional. In particular, p is a sublinear functional. Moreover, for every x ∈ M ,
since |f (x)| ≤ ||f ||||x|| = p(x). Thus, by Theorem 3.7, there exists a linear functional
F (not necessarily bounded) defined on X which is an extension of f and such that
From (3.1) and (3.2), we obtain ||F || = ||f ||, completing the proof.
Theorem 3.9 is sometimes referred to as the Hahn Banach Theorem for normed
linear spaces. Our next result deals with another application of Theorem 3.7 to
a question of the existence of a bounded linear functional having certain special
Theorem 3.10 Let X be a normed linear space and let x0 ≠ 0 be an arbitrary
element of X. Then, there exists a bounded linear functional f on X such that
Proof. Our technique of proof here is to apply Theorem 3.9. We start by con-
sidering the subspace M of X consisting of elements of the form x = αx0 , where
α is a scalar, i.e., M = {x : x = αx0 , α is a scalar}. Define f : M → IR by
f (x) ≡ f (αx0 ) = α||x0 ||, for each x ∈ M .
(i) f is a linear functional;
(ii) f (x0 ) = ||x0 ||;
(iii) f is bounded;
(iv) ||f || = 1.
Verifications of (i) and (ii) are immediate, so we verify (iii) and (iv). Since x ∈ M ,
we have,
It follows from (3.3) that f is bounded, establishing (iii). For (iv), observe that
equation (3.3) also implies,
||f || ≤ 1.
But if there were a real constant k < 1 such that |f (x)| ≤ k||x|| for all x ∈ M , then
equation (3.3) would be contradicted. For then, we would have
Proof. Exercise (Try a proof by contradiction, using Theorem 3.10).
Corollary 3.12 If f (x) = 0 for all f ∈ X ∗ , then x = 0.
(i.e., if all bounded linear functionals vanish on a given vector then the vector must
be the zero vector).
Proof. Exercise (Apply Theorem 3.10).
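In ℝⁿ with the Euclidean norm, a functional with the two properties f(x0) = ||x0|| and ||f|| = 1 used in the proof of Theorem 3.10 can be written down explicitly, without any extension argument. The sketch below is a finite dimensional illustration under that assumption; `norming_functional` is a name of our own.

```python
import math
import random

def norm(x):
    # Euclidean norm on R^n.
    return math.sqrt(sum(t * t for t in x))

def norming_functional(x0):
    # f(x) = <x, x0> / ||x0||, defined for x0 != 0.
    n0 = norm(x0)
    if n0 == 0:
        raise ValueError("x0 must be nonzero")
    return lambda x: sum(a * b for a, b in zip(x, x0)) / n0

def max_ratio(f, dim, trials=2000, seed=0):
    # Largest sampled value of |f(x)| / ||x||; it should not exceed 1.
    rng = random.Random(seed)
    best = 0.0
    for _ in range(trials):
        x = [rng.uniform(-1, 1) for _ in range(dim)]
        if norm(x) > 0:
            best = max(best, abs(f(x)) / norm(x))
    return best
```

The bound |f(x)| ≤ ||x|| is the Cauchy-Schwarz inequality in ℝⁿ, and the value 1 is attained at x = x0.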
||m|| = |α||| − − x0 || ≥ |α|δ.
|f (m)| = |α|δ ≤ ||m||,
for all m ∈ M . Observe that this inequality also holds if α is any scalar. Hence, f
is a bounded linear functional on M with (by Remark 2.15)
||f || ≤ 1. (3.4)
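$$|f(y)| = \frac{|f(x - x_0)|}{\|x - x_0\|} = \frac{|0 - \delta|}{\|x - x_0\|} > \frac{\delta}{\delta + \varepsilon}. \tag{3.6}$$
$$\|f\| = \sup_{\|y\| = 1} |f(y)| > \frac{\delta}{\delta + \varepsilon}, \quad \text{i.e.,} \quad \|f\| > \frac{\delta}{\delta + \varepsilon}. \tag{3.7}$$
Since ε > 0 was arbitrary, it follows from (3.7) that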
||f || ≥ 1. (3.8)
From (3.4) and (3.8) we obtain, ||f || = 1. By Theorem 3.9, there exists a bounded
linear functional F defined on X, extending f , such that
||F || = ||f || = 1,
Prove that
(a) p(x) ≥ 0 for each x ∈ X;
(b) p(x + y) ≤ p(x) + p(y) for x, y ∈ X;
(c) p(αx) = αp(x), x ∈ X; α ≥ 0.
5. Let X be a normed linear space. Prove that if f (x) = f (y) for all f ∈ X ∗ then
x = y.
6. Let x be an element of a normed linear space X and let X ∗ denote the dual
space of X. Prove that
H := {x ∈ E : f (x) = α},
where f (not identically zero) is a linear functional on E and α ∈ IR. Following the
notation of Brezis [4] we shall call H a hyperplane of equation [f = α]. We consider
some examples.
Example 3.15 Let E = IR. Define H1 = {x ∈ IR : 3x = −2}. Here f (x) = 3x and
α = −2 and the hyperplane of equation [f = α] is given by
$$H_1 = \{x \in \mathbb{R} : 3x = -2\} = \Big\{x \in \mathbb{R} : x = -\frac{2}{3}\Big\} = \Big\{-\frac{2}{3}\Big\}.$$
H1 is the singleton {−2/3}, a point in ℝ. Observe that H1 is a closed subset of ℝ.
Example 3.16 Let E = ℝ². Define H2 = {(x1, x2) ∈ ℝ² : 3x1 + 2x2 = 5}. Here
the hyperplane of equation [f = α] is simply the straight line in ℝ² whose equation
is 3x + 2y = 5. Is H2 a closed subset of E? Justify your answer.
Example 3.17 Let E = ℝ³ and H3 = {(x1, x2, x3) ∈ ℝ³ : 3x1 − x2 + 4x3 = −1}.
It is clear that the hyperplane H3 is the plane in ℝ³ whose equation is given by
3x − y + 4z = −1. Is H3 a closed subset of ℝ³? Justify your answer.
In order not to obscure our presentation of the main theorems of this section, we
defer the proof of this proposition. It will be given at the end of this section.
We are now ready to state the geometric forms of Hahn Banach Theorem. Recall
that a set K ⊂ E is called convex if λx + (1 − λ)y ∈ K for all x, y ∈ K; λ ∈ [0, 1].
Theorem 3.20 (Hahn Banach Theorem, First geometric form)
Let A ⊂ E, B ⊂ E be two convex sets, nonempty and disjoint. Suppose that A is
open. Then there exists a closed hyperplane that separates A and B in the general
sense. (See Fig. 3.3).
[Fig. 3.3: a closed hyperplane separating A (open, convex, nonempty) from B (convex, nonempty).]
Remark 3.21 Recall that the method we used in the proof of the Hahn Banach
Theorem for normed linear spaces (Theorem 3.9) was to apply Theorem 3.7. To
prove the first geometric form of Hahn Banach Theorem (Theorem 3.20) we shall
use the same method, i.e., we shall apply Theorem 3.7. However, to apply Theorem
3.7, we must have a sublinear functional defined on E, we must have a subspace M
of E, and we must have a linear functional defined on M . To prepare the grounds,
we begin with the following definition.
Definition 3.22 Let E be a normed linear space. Let M ⊂ E be an open, convex
set with 0 ∈ M . For all x ∈ E define
$$p(x) := \inf\{\alpha > 0 : \alpha^{-1}x \in M\}.$$
Then p is called the gauge of M .
Lemma 3.23 The gauge of M satisfies the following conditions:
G1: p(λx) = λp(x) for all x ∈ E, λ > 0;
G2: There exists a constant K > 0 such that 0 ≤ p(x) ≤ K||x|| for all x ∈ E;
G3: M = {x ∈ E : p(x) < 1};
G4: p(x + y) ≤ p(x) + p(y) for all x, y ∈ E.
Proof. G1: This follows from the definition;
G2: Since 0 ∈ M and M is open, there exists r > 0 such that Br(0) ⊂ M. Observe
that for arbitrary ε > 0 and x ∈ E, $\frac{rx}{(1+\varepsilon)\|x\|} = \frac{r}{(1+\varepsilon)\|x\|}\,x \in B_r(0) \subset M$. From the
definition of p we obtain that
$$p(x) \le \frac{1+\varepsilon}{r}\,\|x\|,$$
which establishes G2.
$$\alpha(\alpha^{-1}x) + (1 - \alpha)\,0 \in M$$
i.e., x ∈ M . Thus,
M = {x ∈ E : p(x) < 1}.
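G4: Let x, y ∈ E and ε > 0. By G1 and G3 we know that
$$u := \frac{x}{p(x) + \varepsilon} \in M, \qquad v := \frac{y}{p(y) + \varepsilon} \in M,$$
(since $p(u) = p\Big[\frac{x}{p(x)+\varepsilon}\Big] = \frac{1}{p(x)+\varepsilon}\,p(x) < 1$ it follows that u ∈ M,
similarly, v ∈ M). Since M is a convex set, we have that
$$\lambda\Big(\frac{x}{p(x) + \varepsilon}\Big) + (1 - \lambda)\Big(\frac{y}{p(y) + \varepsilon}\Big) \in M$$
for every λ ∈ [0, 1]. In particular, take
$$\lambda = \frac{p(x) + \varepsilon}{p(x) + p(y) + 2\varepsilon},$$
so that
$$\lambda\Big(\frac{x}{p(x) + \varepsilon}\Big) + (1 - \lambda)\Big(\frac{y}{p(y) + \varepsilon}\Big) = \frac{x + y}{p(x) + p(y) + 2\varepsilon}.$$
Clearly λ ∈ [0, 1] and $\frac{x+y}{p(x)+p(y)+2\varepsilon} \in M$. Again, by G3,
$$p\Big[\frac{x + y}{p(x) + p(y) + 2\varepsilon}\Big] < 1,$$
so that, by G1, p(x + y) < p(x) + p(y) + 2ε. Since ε > 0 is arbitrary, G4 follows.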
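The gauge of Definition 3.22 can be approximated numerically when membership in M can be tested. The following sketch uses bisection and is an illustration only: the open ellipse chosen for M, the starting bound `hi` and the function names are assumptions of ours. For this particular M the gauge is known in closed form, p(x, y) = (x²/4 + y²)^{1/2}, which gives a way to check the output.

```python
def gauge(in_M, x, hi=1e6, iters=200):
    """Approximate p(x) = inf{a > 0 : x/a in M} by bisection.

    Assumes M is open, convex and contains 0, so that the admissible values
    of a form the interval (p(x), infinity), and that x/hi already lies in M.
    """
    lo = 0.0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if mid > 0 and in_M(tuple(t / mid for t in x)):
            hi = mid  # mid is admissible, so p(x) <= mid
        else:
            lo = mid  # mid is not admissible, so p(x) >= mid
    return hi

def in_ellipse(v):
    # Hypothetical choice of M: the open ellipse x^2/4 + y^2 < 1 in R^2.
    return v[0] ** 2 / 4 + v[1] ** 2 < 1

def p(x):
    return gauge(in_ellipse, x)
```

With this choice, p((2, 0)) is approximately 1 and p((0, 3)) is approximately 3, and the sampled values are consistent with G1, G3 and G4.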
Remark 3.24 In order to prove Theorem 3.20 using Theorem 3.7, the gauge func-
tion introduced in Definition 3.22 will be a candidate for a sublinear functional.
Since we still need to find candidates for a subspace M of E and for a linear func-
tional f on M , we first prove the following lemma which is a special case of Theorem
3.20 in which B is a singleton {x0} where x0 ∉ A. In this case, the required subspace
M and functional f are easily defined as can be seen from the following lemma.
M = {x : x = λx0 : λ ∈ IR}.
Define f : M → IR by
f (x) ≡ f (λx0 ) = λ.
(i) f is a linear functional on M ; (ii) f (x) ≤ p(x) for all x ∈ M .
By Theorem 3.7, there exists a linear functional F : E → ℝ extending f and such that
F(x) ≤ p(x) for all x ∈ E.
In particular, by taking λ = 1, we have, since x0 ∈ M , F (x0 ) = f (x0 ) = 1.
Furthermore, from G2, F is continuous (Verify this) and from G3, for all x ∈ U ,
Proof of Theorem 3.20. Our technique here is to apply Lemma 3.25. Put
U := A − B = {a − b : a ∈ A, b ∈ B}.
(i) U is nonempty;
(ii) 0 ∉ U;
(iii) U is open;
(iv) U is convex.
By Lemma 3.25 there exists f ∈ E ∗ such that
Theorem 3.26 (Hahn Banach Theorem; Second Geometric Form) Let A ⊂ E and
B ⊂ E be two convex sets, nonempty and disjoint. Suppose that A is closed and B
is compact. Then, there exists a closed hyperplane that separates A and B in the
strict sense. (See Fig.3.4).
[Fig. 3.4: a closed hyperplane strictly separating A (closed, convex) from B (convex, compact).]
f(x + εz) ≤ α ≤ f(y + εz)
for all x ∈ A, y ∈ B, z ∈ B(0, 1). Consequently, we obtain from this inequality that
(Verify, see Exercises 3.2, Problem 5),
f(x) + ε||f|| ≤ α ≤ f(y) − ε||f||
for all x ∈ A, y ∈ B. Since ||f|| ≠ 0, this implies that A and B are separated in the
strict sense by the hyperplane of equation [f = α]. This completes the proof.
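Strict separation can be seen concretely in ℝ². The sketch below uses two closed discs as a hypothetical choice of A and B (both are convex and compact, so the hypotheses of Theorem 3.26 hold), builds a functional f with ||f|| = 1 along the line of centres, and computes α and ε from sampled boundary points.

```python
import math

# Hypothetical data: A is the closed unit disc centred at (0, 0),
# B is the closed unit disc centred at (4, 0).
CENTER_A, CENTER_B, RADIUS = (0.0, 0.0), (4.0, 0.0), 1.0

def f(x):
    # f(x) = <x, d> / ||d|| with d = CENTER_B - CENTER_A, so that ||f|| = 1.
    d = (CENTER_B[0] - CENTER_A[0], CENTER_B[1] - CENTER_A[1])
    return (x[0] * d[0] + x[1] * d[1]) / math.hypot(d[0], d[1])

def boundary(center, radius, samples=360):
    # Sample points on the bounding circle of a disc.
    return [(center[0] + radius * math.cos(2 * math.pi * k / samples),
             center[1] + radius * math.sin(2 * math.pi * k / samples))
            for k in range(samples)]

# A linear functional attains its extreme values over a disc on the boundary.
sup_f_on_A = max(f(a) for a in boundary(CENTER_A, RADIUS))
inf_f_on_B = min(f(b) for b in boundary(CENTER_B, RADIUS))
alpha = 0.5 * (sup_f_on_A + inf_f_on_B)
eps = 0.5 * (inf_f_on_B - sup_f_on_A)
```

Here the hyperplane [f = α] is the vertical line x = 2, and f(a) + ε ≤ α ≤ f(b) − ε holds at every sampled point with ε close to 1.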
Proof of Proposition 3.18 Let f be continuous. We want to prove that the hy-
perplane H of equation [f = α] is closed. So let {xn } be a sequence in H such
that xn → x as n → ∞. Since f is continuous, we have f (xn ) → f (x). But
xn ∈ H ⇒ f (xn ) = α. Hence f (x) = α and so x ∈ H. Therefore, H is closed.
Conversely, let the hyperplane H of equation [f = α] be closed. We want to prove f
is continuous. H is closed implies $H^c$ (the complement of H) is open. Let $x_0 \in H^c$
and suppose that f(x0) < α. Without loss of generality we may assume α ≥ 0. Let
r > 0 be such that $B_r(x_0) \subseteq H^c$ where Br(x0) = {x ∈ E : ||x − x0|| < r}. (This is
possible since $H^c$ is open).
f (x1 ) > α.
(Note that f(x1) = α is not possible since x1 ∉ H). Then, since Br(x0) is a convex
set, we have that the set
$$f(x_\lambda) \neq \alpha, \tag{3.9}$$
$$\lambda_s := \frac{f(x_1) - \alpha}{f(x_1) - f(x_0)} \in [0, 1]$$
and $f(x_{\lambda_s}) = f\big[\lambda_s x_0 + (1 - \lambda_s)x_1\big] = \lambda_s f(x_0) + (1 - \lambda_s) f(x_1) = \alpha$,
contradicting (3.9). Hence, the verification of claim is complete. As a consequence
of this claim, we obtain
f (x0 + ru) < α
for all u ∈ B(0, 1) (since x0 + ru ∈ Br (x0 )). Then,
(i) f is continuous; (ii) $\|f\| \le \frac{1}{r}(\alpha - f(x_0))$.
(For a verification of (i) and (ii), see Exercises 3.2, Problem 1). The proof is com-
1. With λ, f and the other notations as in the proof of Proposition 3.18, verify
the following assertions:
(i) f (xλs ) = α;
(ii) f is continuous;
(iii) $\|f\| \le \frac{1}{r}(\alpha - f(x_0))$.
2. Let E = ℝ, M = (−1, 1). Let p : E → ℝ denote the gauge of M. Compute
p(−3), p(−1/2), p(0), p(1/2), p(3).
3. Let M = {x : x = λx0 : λ ∈ IR} as in the proof of Lemma 3.25. Define
f : M → IR by
f (x) ≡ f (λx0 ) = λ.
(i) f is linear;
(ii) f (x) ≤ p(x) for all x ∈ M , where p is the gauge of U (see Lemma 3.25);
(iii) Verify that F obtained in the proof of Lemma 3.25 is continuous.
4. In the proof of Theorem 3.20, we set
U := A − B
and made the following assertions:
(i) U is nonempty;
(ii) 0 ∉ U;
(iii) U is convex;
(iv) U is open.
Verify these assertions.
5. In the proof of Theorem 3.26, for any ε > 0, we set
Aε := A + B(0, ε);
Bε := B + B(0, ε).
(i) Aε, Bε are convex, nonempty, and open;
(ii) Aε ∩ Bε = ∅ for sufficiently small ε > 0, (give a detailed proof);
(iii) Using the notations of Theorem 3.26, prove that
f(x + εz) ≤ α ≤ f(y + εz)
for all x ∈ A, y ∈ B, z ∈ B(0, 1) implies
f(x) + ε||f|| ≤ α ≤ f(y) − ε||f||.
[Hint: For part (iii), consider first f(x + εz) ≤ α to obtain
εf(z) ≤ α − f(x).
Proof. The proof will be broken into two parts. Fix x0 ∈ V\M and let N :=
[M ∪ {x0}], i.e., N is the subspace of V generated by M and {x0}.
Part 1. In this part of the proof we shall prove that the map f is extendable
to N and with the desired properties. In short, we first prove Theorem 3.7 in the
case V = N . Once we have done this, we shall then use the result to obtain a proof
of Theorem 3.7.
For the first part, since f (x) ≤ p(x) for x ∈ M and f is linear we have for arbitrary
m1, m2 ∈ M
f(m1 − m2) = f(m1) − f(m2) ≤ p(m1 − m2),
so that since p is sublinear,
f(m1) − f(m2) ≤ p((m1 + x0) + (−m2 − x0)) ≤ p(m1 + x0) + p(−m2 − x0).
This yields
−p(−m2 − x0 ) − f (m2 ) ≤ p(m1 + x0 ) − f (m1 ). (∗)
If m1 is held fixed and m2 is allowed to vary over M , inequality (∗) implies that
the set {−p(−m2 − x0) − f(m2)} of real numbers has an upper bound and hence
has the least upper bound. Let $\alpha = \sup_{m_2 \in M}\{-p(-m_2 - x_0) - f(m_2)\}$. Similarly, by
interchanging the roles of m1 and m2 we can set $\beta = \inf_{m_1 \in M}\{p(m_1 + x_0) - f(m_1)\}$.
Then clearly, α ≤ β and so we can choose a real number r such that
α ≤ r ≤ β. (i)
We observe immediately that if α = β then we take r to be the common value.
Hence, for all m ∈ M , we obtain
Verifications of (a), (b) and (c) are straightforward and are therefore left for the
reader. We verify (d). Here we shall consider separately the three cases λ > 0,
λ = 0 and λ < 0.
But p is a sublinear functional. Hence, the last inequality implies, using also the
linearity of f , that
f (m) + λr ≤ p(m + λx0 ),
and this implies, from the definition of F , that F (x) ≤ p(x) ∀ x ∈ N .
Multiplying this inequality by λ (noting that λ is negative) we get
$$(-\lambda)\,p\Big[\Big(-\frac{1}{\lambda}\Big)(m + \lambda x_0)\Big] \ge F(x).$$
Since p is a sublinear functional and (−1/λ) > 0, we get p(m + λx0) ≥ F(x), i.e.,
F (x) ≤ p(x) ∀ x ∈ N. This completes the proof of the first part.
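The one-step extension of Part 1 can be traced in a small example. The sketch below takes V = ℝ², M = {(t, 0)}, f(t, 0) = t, the ℓ1 norm for p and x0 = (0, 1); all of these choices are ours, made only for illustration. It approximates α and β over a finite grid of points of M (for this example the grid already attains the true supremum and infimum) and tests the domination F ≤ p for several values of r.

```python
def p(x):
    # Sublinear functional on R^2 (hypothetical choice): the l1 norm.
    return abs(x[0]) + abs(x[1])

def f(t):
    # Linear functional on M = {(t, 0)}: f(t, 0) = t, so f <= p on M.
    return t

X0 = (0.0, 1.0)                          # a vector outside M
GRID = [k / 2 for k in range(-40, 41)]   # sample values t, standing for m = (t, 0)

# alpha = sup over m of -p(-m - x0) - f(m); beta = inf over m of p(m + x0) - f(m).
alpha = max(-p((-t - X0[0], -X0[1])) - f(t) for t in GRID)
beta = min(p((t + X0[0], X0[1])) - f(t) for t in GRID)

def dominated(r):
    # F(m + s*x0) = f(m) + s*r on N; test F <= p at the sampled points.
    return all(t + s * r <= p((t, s)) + 1e-12 for t in GRID for s in GRID)
```

In this example α = −1 and β = 1, so any r in [−1, 1] gives an admissible extension, while a value such as r = 1.5 violates F ≤ p at x0 itself.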
Part 2. We now prove the second part which will complete the proof of the theorem.
We shall make use of Zorn’s lemma, with which the reader is assumed to be familiar
(see e.g., Chapter 10, section 10.1.1). Let S denote the set of all linear
functionals G extending f and satisfying G(x) ≤ p(x) ∀ x ∈ D(G), the domain of G.
Clearly S is nonempty for F ∈ S, where F is the functional constructed in the first
part of this proof. Now, partially order S by set-inclusion, i.e., F1 < F2 if and only if
domF1 ⊂ domF2 and F2 |domF1 = F1 . (Here domF means domain of F and F2 |domF1
means F2 restricted to domF1 ). We now show that every chain in S has an upper
bound in S. Let C = {Fα : α ∈ ∆} be a chain in S. Let $D = \bigcup_{\alpha \in \Delta} \operatorname{dom} F_\alpha$. Consider
the function F∗ whose domain is D. If x ∈ D, then x ∈ dom Fα0 for some α0 ∈ ∆.
Define F∗ : D → ℝ by F∗(x) = Fα0(x).
(a) Recall that a linear functional can only be defined on a subspace (why?). This
follows from the fact that domFα is a subspace for each α and that C is a
chain. The details are left to the reader
Claim 2 is immediate and is therefore left for the reader. Hence, for each Fα ∈ C,
Fα ≤ F ∗ , so that F ∗ is an upper bound for C. By Zorn’s lemma, there exists a
maximal element F in S. This implies, F is linear, F extends f , F (x) ≤ p(x),
and F ∗ < F for every F ∗ ∈ S. It now only remains to prove that domF = V .
Clearly domF ⊆ V . We prove V ⊆ domF . Suppose that this is not the case,
i.e., suppose that dom F ⊊ V. Let x0 ∈ V\dom F. By the first part of our proof
there exists F̂ such that F̂ is linear, F̂ (x) = F (x) for each x ∈ domF , F̂ (x) ≤
p(x) ∀ x ∈ [dom F ∪ {x0 }], and F < F̂ . This contradicts the maximality of F .
Hence dom F = V . The proof of the theorem is complete.
Recall that since {Tα } is a collection of bounded linear maps, it follows that
||Tα || < ∞ for each α ∈ I. However, of course, there is no guarantee that
sup ||Tα || < ∞. The Uniform Boundedness Principle provides a condition for deter-
mining when sup ||Tα || < ∞. As an immediate consequence of the uniform bound-
edness principle, we shall prove another important result concerning bounded linear
maps from X into Y – The Banach Steinhaus Theorem. This theorem asserts that
if X is a Banach space and Y is a normed linear space and if {Tn } is a family of
bounded linear maps from X into Y such that for each x ∈ X, {Tn (x)} converges,
then T : X → Y defined by T x := lim Tn (x) is a bounded linear map. For the
study of the uniform boundedness principle and the Banach Steinhaus theorem, we
start with the following preliminaries:
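Lemma 4.1 (Baire’s Lemma) Let X be a nonempty complete metric space. Let
$\{F_n\}_{n=1}^{\infty}$ be a sequence of closed sets such that
$$X = \bigcup_{n=1}^{\infty} F_n.$$
Then there exists n0 such that
$$\operatorname{int}(F_{n_0}) \neq \emptyset.$$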
In other words, Baire’s lemma asserts that if X ≠ ∅ is a complete metric space and if
$X = \bigcup_{n=1}^{\infty} F_n$ where Fn is closed for each n, then at least one of the Fn’s must have a
nonempty interior i.e., must contain a nonempty open set (“int” means “interior”).
Proof. (see fig. 4.1)
Fig. 4.1
Assume, for contradiction, that no Fn contains an open ball. Then, the complement
$F_n^c$ intersects every open ball $S_0$ in X, i.e., $F_n^c \cap S_0 \neq \emptyset$ for each n. Observe that
$F_n^c \cap S_0$ is a nonempty open set. Let $x_1 \in F_1^c \cap S_0$. So, there exists $r_1 < \frac{1}{2}$ such that
$B_1 \equiv B(x_1, r_1) \subset F_1^c \cap S_0$. Since no Fn contains an open ball, $B_1 \not\subseteq F_2$. Hence $F_2^c$
intersects B1, i.e., $F_2^c \cap B_1 \neq \emptyset$. Let $x_2 \in F_2^c \cap B_1$. Since $F_2^c \cap B_1$ is an open set,
there exists $r_2 < \frac{1}{4}$ such that
$$B_2 \equiv B(x_2, r_2) \subset F_2^c \cap B_1.$$
By induction, we obtain a sequence Bn ≡ B(xn, rn) with
$$B_{n+1} \subset F_{n+1}^c \cap B_n \subset B_n; \qquad r_n < \frac{1}{2^n}.$$
Furthermore, $B_{n+1} \subseteq \overline{B}_n$ for each n.
Moreover, $\{x_n\}_{n=1}^{\infty}$ is a Cauchy sequence in X. By completeness of X, $x_n \to
x^* \in B_n$ (since $x^* \in \overline{B}_n$ for each n). Hence $x^* \in F_n^c$ for each n so that $x^* \notin F_n$
for each n. This implies $x^* \notin \bigcup_{n=1}^{\infty} F_n = X$. Contradiction. This completes the proof.
Remark 4.2 Baire’s Lemma is sometimes stated in the following equivalent form:
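Let (X, ρ) be a complete metric space. Let $\{F_n\}_{n=1}^{\infty}$ be a sequence of closed subsets
of X. Assume that
$$\operatorname{int}(F_n) = \emptyset \quad \text{for all } n \ge 1,$$
then
$$\operatorname{int}\Big(\bigcup_{n=1}^{\infty} F_n\Big) = \emptyset.$$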
Example 4.3 Let X = ℝ (with the usual metric), Fn = {qn}, where, for each n,
qn is a rational number. Clearly, {qn} is closed and int({qn}) = ∅ for each n (Verify).
By Baire’s Lemma,
$$\operatorname{int}(\mathbb{Q}) = \operatorname{int}\Big(\bigcup_{n=1}^{\infty} \{q_n\}\Big) = \emptyset,$$
(i) Fn is closed for each n,
(ii) $X = \bigcup_{n=1}^{\infty} F_n$.
Condition (i) is easy to prove and is left as an exercise (See Exercises 4.1, Prob-
lem 3(i)). Condition (ii) follows from the condition sup ||Tα (x)|| < ∞. (Explain
how this happens. See Exercises 4.1, Problem 3(ii)). Hence, by Lemma 4.1, at least
one of the Fn’s must have a nonempty interior. Let int(Fn0) ≠ ∅. Let x0 ∈ int(Fn0).
Hence there exists r > 0 such that B̄r (x0 ) ⊂ Fn0 . Then since, for all z ∈ B̄(0, 1),
x0 + rz ∈ B̄r (x0 ) ⊂ Fn0 , we must have
and this implies (see Exercises 4.1, Problem 4) that
(b) T ∈ B(X, Y );
(c) ||T || ≤ lim inf ||Tn ||.
Proof. {Tn(x)} converges implies it is bounded. Thus, there exists Kx ≥ 0 such that
$$\sup_n \|T_n x\| \le K_x < \infty.$$
Furthermore, since
lim||Tn x|| = ||T x||,
taking limits as n → ∞ in inequality (4.1) we obtain
for each x ∈ X. Hence T ∈ B(X, Y ), establishing (b). Finally, from the first part
of inequality (4.1) we obtain
lim inf ||Tn x|| ≤ (lim inf ||Tn ||)||x||, i.e., ||T x|| ≤ (lim inf ||Tn ||)||x||
for each x ∈ X. This implies (by Remark 2.15) that
||T || ≤ lim inf ||Tn ||,
establishing (c), and completing the proof.
Corollary 4.7 Let E be a Banach space and let K be a subset of E. Suppose for
each f ∈ E∗, the set
$$f(K) = \bigcup_{k \in K} \langle f, k \rangle$$
is bounded in IR. Then, K is a bounded subset of E.
Proof. We apply the Uniform Boundedness Principle. Set
X = E ∗;
Y = IR;
I = K.
Define the map Tk : E∗ → ℝ as follows. For each k ∈ K, set Tk(f) = ⟨f, k⟩, f ∈
E∗. Then, Tk ∈ B(E∗, ℝ) and by hypothesis,
$$\sup_{k \in K} |T_k(f)| = \sup_{k \in K} |\langle f, k \rangle| < \infty.$$
is a bounded subset of IR. Note that in Theorem 4.5 and Theorem 4.6, it is sufficient
that Y be a real normed linear space (completeness of Y is not necessary). (Convince
yourself). Corollary 4.7 is also sometimes expressed by saying that weak and strong
boundedness are equivalent. The reason for this is the following Corollary.
Corollary 4.9 Let E be a Banach space, K∗ be a subset of E∗. Suppose that for
each x ∈ E the set
$$\langle K^*, x \rangle = \bigcup_{f \in K^*} \langle f, x \rangle$$
Proof. As in the proof of Corollary 4.7, we employ the Uniform Boundedness Prin-
ciple. Set
X = E;
Y = IR;
I = K ∗.
Define the map Tf∗ : E → ℝ as follows.
For each f ∗ ∈ K ∗ , put Tf ∗ (x) = hf ∗ , xi, x ∈ E. Then,
is a closed subset of X.
(ii) Verify that $X = \bigcup_{n=1}^{\infty} F_n$.
4. With the notations as in the proof of the Uniform Boundedness Principle
(Theorem 4.5), prove that
r||Tα || ≤ n0 + ||Tα (x0 )||.
Tk (f ) := hf, ki, f ∈ E ∗ .
Prove that
Br (x0 ) = {x0 } + Br (0).
Generalize this result to the following equation:
for fixed x0 , y0 ∈ X.
(We shall use this result in the next section. The result is also a useful tool
for Exercises 3.2, Problems 4 and 5).
7. Apply Baire’s lemma to prove the following version of the uniform boundedness
principle for functionals: Let ∆ be an arbitrary index set and let E be
a complete metric space and {fα }α∈∆ be a family of real–valued continuous
functionals on E. Assume {fα }α∈∆ is pointwise bounded (i.e., for each x ∈ E,
there exists Mx ≥ 0 such that |fα (x)| ≤ Mx ). Prove that there exists an open
set U in E on which {fα }α∈∆ is uniformly bounded (i.e., there exist an open
set U and a constant M ≥ 0 such that
Lemma 5.1 Let X and Y be Banach spaces and let T ∈ B(X, Y ). Assume that T
is surjective. Then, there exists a constant c > 0 such that
We shall use the notations Br (0) ≡ B(0, r), Br (x0 ) ≡ B(x0 , r).
Remark 5.2 Under the hypothesis of Lemma 5.1, the inclusion (5.1) implies that
T maps open sets in X into open sets in Y . This is, in fact, the Open Mapping
Theorem. Its importance will become clear in what follows. For now, we verify
that T maps open sets to open sets. Let U be an open set in X. We shall show
T (U ) is an open set in Y . So, let y0 ∈ T (U ). Then, there exists x0 ∈ U such
that T x0 = y0 . But U is open and contains x0 . So there exists r > 0 such that
Br (x0 ) ⊂ U , i.e., x0 + Br (0) ⊂ U (by Exercises 4.1(6)). Then, T (x0 + Br (0)) ⊂ T (U )
Fig. 5.1
i.e., y0 + T (Br (0)) ⊂ T (U ) which is the same as y0 + T (rB1 (0)) ⊆ T (U ) and by the
linearity of T , y0 +rT (B1 (0)) ⊆ T (U ). But, by property (5.1), BY (0, rc) ⊂ T (Br (0))
since Br (0) = rB1 (0). Hence,
y0 + BY (0, rc) ⊂ T (U )
Theorem 5.3 (The Open Mapping Theorem) Let X and Y be Banach spaces. Suppose that
(i) T ∈ B(X, Y );
(ii) T is surjective.
Then T is an open mapping (i.e., T maps open sets into open sets).
Proof. T ∈ B(X, Y) and T is surjective imply (by Lemma 5.1) there exists c > 0
such that $T(B_X(0, 1)) \supseteq B_Y(0, c)$. Thus, ∀ε > 0 and ∀x ∈ X, $\frac{c\,Tx}{(1+\varepsilon)\|Tx\|} \in B_Y(0, c)$
and this implies $\frac{c\,Tx}{(1+\varepsilon)\|Tx\|} = Tz$ for some $z \in B_X(0, 1)$. Hence, since $T^{-1}$ exists and is
linear, this implies $\frac{c\|x\|}{(1+\varepsilon)\|Tx\|} = \|z\| < 1$ ∀ε > 0. Hence, c||x|| < (1 + ε)||Tx|| ∀x ∈ X.
Since ε > 0 is arbitrary, this implies
$$\|x\| \le \frac{1}{c}\,\|Tx\| \quad \forall\, x \in X,$$
which implies $T^{-1}$ is continuous (Verify this) (See Exercises 5.1, Problem 3).
We shall see other applications of the Open Mapping Theorem later. Meanwhile
we introduce another fundamental theorem of functional analysis.
Example 5.6 Let X = [0, 1], Y = IR, and T : [0, 1] → IR be defined by
Tx = x², x ∈ [0, 1].
Then,
G(S) = {(x, Sx) : x ∈ [0, 1]} = {(x, 0) : x ∈ [−1, 0]} ∪ {(x, 1) : x ∈ (0, 1]}.
Definition 5.8 Let X and Y be normed linear spaces and T : X → Y be any map.
Then T is called a closed map (or, is said to be closed or is said to have closed graph)
if the graph of T , G(T ), is a closed subset of X × Y .
Example 5.9 The graph of T given in Example 5.6 is closed. To see this, it suffices
to take an arbitrary convergent sequence in G(T ) and show that its limit is in G(T ).
So, let {(xn , T xn )} be an arbitrary sequence in G(T ) and suppose (xn , T xn ) → (x, y)
as n → ∞. This implies xn → x and T xn → y as n → ∞. Observe that the map T
is continuous on [0, 1]. Thus, xn → x ⇒ T xn → T x as n → ∞. By uniqueness of
limit, T x = y so that (x, y) ∈ G(T ). Thus, G(T ) is a closed subset of X × Y and so
T is a closed map.
Example 5.10 The graph, G(S), of the map S defined in Example 5.7 is not closed.
To see this, it suffices to produce a convergent sequence in G(S) whose limit is not in
G(S). To this end, consider the sequence {xn} in [0, 1] defined by xn = 1/n, n = 1, 2, ....
Then, Sxn = 1 for each n. So, (xn, Sxn) = (1/n, 1) ∈ G(S) for each n. Moreover,
(xn, Sxn) → (0, 1) as n → ∞. However, (0, 1) ∉ G(S) since S(0) ≠ 1. Hence, G(S)
is not a closed subset of X × Y and so S is not a closed map.
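The failure of closedness in Example 5.10 can be traced in a few lines. The sketch assumes that S is the step map read off from the description of G(S), namely S = 0 on [−1, 0] and S = 1 on (0, 1]; the variable names are ours.

```python
def S(x):
    # Assumed step map: S = 0 on [-1, 0] and S = 1 on (0, 1].
    return 0.0 if x <= 0 else 1.0

# Points (1/n, S(1/n)) of the graph; their second coordinate is always 1.
graph_points = [(1.0 / n, S(1.0 / n)) for n in range(1, 10001)]

# The sequence (1/n, 1) converges to (0, 1) in the plane.
limit_point = (0.0, 1.0)

# The limit belongs to the graph only if S(0) equals 1, which it does not.
limit_in_graph = (S(limit_point[0]) == limit_point[1])
```

Every sampled point lies on the graph, yet the limit (0, 1) does not, since S(0) = 0.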
Proof. Exercise.
Observe that T fn → y
$$\Rightarrow \|T f_n - y\| = \sup_{t \in [0,1]} |(T f_n)(t) - y(t)| = \sup_{t \in [0,1]} |f_n'(t) - y(t)| \to 0$$
as n → ∞. This convergence is uniform and $y(t) = \lim f_n'(t)$. Since the convergence
is uniform, we have:
$$\int_0^t y(s)\,ds = \int_0^t \lim f_n'(s)\,ds = \lim \int_0^t f_n'(s)\,ds = f(t) - f(0),$$
so that
$$f(t) = f(0) + \int_0^t y(s)\,ds.$$
It is now easy to see that $f'(t)$ exists and $f'(t) = y(t)$ for all t ∈ [0, 1]. Thus,
f ∈ D and T f = y so T is closed.
(iii) T is not bounded.
Take $f_n(t) = t^n$. Then, $\|f_n\| = \sup |t^n| = 1$ and $f_n'(t) = n t^{n-1}$ so that
$\|T f_n\| = \sup |n t^{n-1}| = n$, n = 1, 2, .... Thus, T is not bounded.
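The unboundedness argument in (iii) is easy to reproduce numerically. The sketch below approximates the sup norm on [0, 1] by a maximum over a uniform grid; the grid size and the function names are our own choices. Since t ↦ tⁿ and its derivative attain their maxima at t = 1, which is a grid point, the approximation is exact for this family.

```python
def sup_norm(g, samples=1001):
    # Maximum of |g| over a uniform grid on [0, 1], including both endpoints.
    return max(abs(g(i / (samples - 1))) for i in range(samples))

def ratio(n):
    # ||T f_n|| / ||f_n|| for f_n(t) = t^n, where T is differentiation.
    f_n = lambda t: t ** n
    df_n = lambda t: n * t ** (n - 1)
    return sup_norm(df_n) / sup_norm(f_n)
```

The ratio equals n, so no constant K can satisfy ||T f|| ≤ K||f|| for all f in the domain.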
5.2.2 The Closed Graph Theorem
Theorem 5.13 (The Closed Graph Theorem) Let X and Y be Banach spaces. Let
(i) T : X → Y be a linear map;
(ii) The graph of T, G(T ), be closed.
Then, T is continuous.
(Simply put, Theorem 5.13 says that if any linear map from a Banach space to a
Banach space has closed graph then it is continuous).
Method 1. (See Fig.5.2)
[Fig. 5.2: the projection π1 on G(T) and its inverse π1⁻¹.]
Consider the space X × Y endowed with norm $\|(x, y)\|_{X \times Y} = \|x\|_X + \|y\|_Y$. Then,
since X and Y are Banach spaces, it follows that X × Y is a Banach space with this
norm. G(T) is closed in X × Y ⇒ G(T) is a Banach space. Consider the projection
π1 : G(T) → X
defined by
π1((x, Tx)) = x.
Method 2. As in Method 1, consider now the projections π1 and π2 defined in the
usual way (see Fig.5.3), π1 (x, y) = x and π2 (x, y) = y.
[Fig. 5.3: diagram of G(T), X and Y with the maps π1, π1⁻¹ and π2, and the arrow X → Y.]
Proof. See Exercises 5.1, Problem 4.
Example 5.15 Prove that if $\sum_{k=1}^{\infty} \alpha_k \xi_k$ is convergent whenever $\sum_{k=1}^{\infty} |\xi_k| < \infty$, then
T(ξ1, ξ2, . . .) = (α1ξ1, α1ξ1 + α2ξ2, α1ξ1 + α2ξ2 + α3ξ3, . . .)
i.e.,
$$T(\xi_1, \xi_2, \xi_3, \ldots) = \Big\{\sum_{k=1}^{n} \alpha_k \xi_k\Big\}_{n=1}^{\infty}.$$
Then T is: (i) well defined; (ii) linear; (iii) bounded.
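(i) $x = \{\xi_k\}_{k=1}^{\infty} \in \ell_1 \Rightarrow \sum_{k=1}^{\infty} |\xi_k| < \infty$. Then by hypothesis $\sum_{k=1}^{\infty} \alpha_k \xi_k < \infty$. This
implies the sequence $\{S_n\}_{n=1}^{\infty} = \Big\{\sum_{k=1}^{n} \alpha_k \xi_k\Big\}_{n=1}^{\infty}$ converges and so is bounded
i.e., $\sup_{n \ge 1} |S_n| := \sup_{n \ge 1} \Big|\sum_{k=1}^{n} \alpha_k \xi_k\Big| < \infty$, i.e., $Tx \in \ell_\infty$. Hence T is well defined.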
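Therefore, for each i, i = 1, 2, 3, . . . ,
$$\Big|\sum_{k=1}^{i} \alpha_k(\xi_k^n - \xi_k)\Big| \le \sup_{k=1,2,\ldots,i} |\alpha_k| \cdot \sum_{k=1}^{i} |\xi_k^n - \xi_k| \to 0 \text{ as } n \to \infty.$$
Thus, for each i, $\{T x_n\}_{n=1}^{\infty} := \Big\{\sum_{k=1}^{i} \alpha_k \xi_k^n\Big\}_{n=1}^{\infty}$ converges to $\Big\{\sum_{k=1}^{i} \alpha_k \xi_k\Big\}_{i=1}^{\infty}$ in $\ell_\infty$. But
$\{T x_n\}_{n=1}^{\infty}$ also converges (by hypothesis) to $z = \{\eta_k\}_{k=1}^{\infty}$ and so, by the uniqueness
of limit,
$$\eta_i = \sum_{k=1}^{i} \alpha_k \xi_k, \quad i = 1, 2, \ldots,$$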
i.e., z = T x, and so T is a closed linear map. Since `1 and `∞ are Banach spaces, it
follows from the closed graph theorem that T is bounded (continuous).
$$\|Tx\|_{\ell_\infty} \le \|T\| \cdot \|x\|_{\ell_1},$$
i.e.,
$$\sup_i \Big|\sum_{k=1}^{i} \alpha_k \xi_k\Big| \le \|T\| \cdot \sum_{k=1}^{\infty} |\xi_k|.$$
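The partial-sum map T of Example 5.15 can be explored on finitely supported sequences. The sketch below is an illustration under assumptions of ours: the particular bounded sequence `alpha`, the test vector `xi` and the truncation to 50 terms are all hypothetical. It checks the estimate ||Tx||∞ ≤ (sup |αk|)·||x||₁ and the fact that T applied to the k-th unit vector has sup norm |αk|.

```python
from itertools import accumulate

def T(alpha, xi):
    # Partial sums (a1*x1, a1*x1 + a2*x2, ...) for finitely supported sequences.
    return list(accumulate(a * x for a, x in zip(alpha, xi)))

def sup_norm(seq):
    return max(abs(s) for s in seq)

def l1_norm(seq):
    return sum(abs(s) for s in seq)

N = 50
alpha = [(-1) ** k * (1 + 1 / (k + 1)) for k in range(N)]  # bounded, sup |alpha_k| = 2
xi = [1 / (k + 1) ** 2 for k in range(N)]                  # a truncated l1 sequence

def unit_vector(k, n=N):
    e = [0.0] * n
    e[k] = 1.0
    return e
```

Evaluating T on unit vectors shows that any bound ||Tx||∞ ≤ C||x||₁ forces |αk| ≤ C for every k, which is consistent with the boundedness of {αk} that the example is driving at.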
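$$f_n(\xi) = \sum_{i=1}^{n} \alpha_i \xi_i,$$
where $\xi = \{\xi_i\} \in \ell_1$ (so that $\sum_{i=1}^{\infty} |\xi_i| < \infty$). By hypothesis, for each $\xi \in \ell_1$, $\sum_{k=1}^{\infty} \alpha_k \xi_k$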
is convergent. This implies, from our definition of fn that the sequence {fn (ξ)} is
convergent and consequently it is bounded. Thus,
Since l1 is a Banach space, we can now invoke the uniform boundedness principle to
obtain that
$$|f_n(\xi)| = \Big|\sum_{i=1}^{n} \alpha_i \xi_i\Big| \le \max_{1 \le i \le n} |\alpha_i| \sum_{i=1}^{n} |\xi_i| \le \max_{1 \le i \le n} |\alpha_i| \cdot \|\xi\|_{\ell_1}.$$
Consequently, we have
||ξ||l1 = 1.
Moreover, using this equality, we have that
$$|f_n(\xi)| = \Big|\sum_{i=1}^{n} \alpha_i \xi_i\Big| = |\alpha_k| = \max_{1 \le i \le n} |\alpha_i| \cdot \|\xi\|_{\ell_1}. \tag{5.4}$$
From (5.3) and (5.5) we finally obtain that
This implies
sup |αn | < ∞,
|αk | < ∞.
6. Recall that two norms ||·||1 and ||·||2 on a linear space are called equivalent if
there exist numbers α > 0, β > 0 such that α||x||1 ≤ ||x||2 ≤ β||x||1, ∀x ∈ X.
Suppose X is a linear space which is a Banach space for each of these norms.
Suppose that ||x||1 ≤ k||x||2 is valid for some k and all x ∈ X. Prove that the
two norms are equivalent.
(Hint: Apply Proposition 5.14).
We conclude this chapter with the following proof which we deferred in Section 5.1.
h(y) = y − y0 .
which implies that the origin is an interior point of $\overline{T(B_X(0, 2n_0))}$. But $\overline{T(B_X(0, 2n_0))} =
2n_0\,\overline{T(B_X(0, 1))}$ and since multiplication by a nonzero scalar is a homeomorphism
of Y, it follows that the origin is an interior point of $\overline{T(B_X(0, 1))}$, i.e., $0 \in
\operatorname{int}\big(\overline{T(B_X(0, 1))}\big)$. Hence there exists c∗ > 0 such that: (a) $\overline{T(B_X(0, 1))} \supseteq B_Y(0, c^*)$.
Part 2. To conclude the Lemma, it suffices now to show that: (b) $T(B_X(0, 3)) \supset
B_Y(0, c^*)$, which is equivalent to $T(B_X(0, 1)) \supset B_Y(0, c)$, where $c := c^*/3$. We now
prove (b). Let $y \in B_Y(0, c^*)$. By (a), $y \in \overline{T(B_X(0, 1))}$. So, there exists
x1 ∈ X such that ||x1|| ≤ 1 and $\|y - y_1\| < \frac{c^*}{2}$, where y1 = T x1.
Observe that $B_Y(0, \frac{c^*}{2}) \subseteq \overline{T(B_X(0, \frac{1}{2}))}$ (from Part 1, by dividing both sides by 2).
So, since $(y - y_1) \in B_Y(0, \frac{c^*}{2})$, there exists
x2 ∈ X such that $\|x_2\| \le \frac{1}{2}$ and $\|(y - y_1) - y_2\| < \frac{c^*}{2^2}$, where y2 = T x2.
Continuing in this way, we obtain a sequence
{xn} in X such that $\|x_n\| < \frac{1}{2^{n-1}}$ and $\|y - (y_1 + y_2 + \ldots + y_n)\| < \frac{c^*}{2^n}$, where
yn = T xn.
If we put Sn := x1 + x2 + ... + xn, then it follows from $\|x_n\| < \frac{1}{2^{n-1}}$ that {Sn} is
a Cauchy sequence in X for which $\|S_n\| \le \|x_1\| + \|x_2\| + \ldots + \|x_n\| < 1 + \frac{1}{2} +
\ldots + \frac{1}{2^{n-1}} < 2$. By completeness of X, there exists x ∈ X such that Sn → x.
Moreover, ||x|| = || lim Sn|| = lim ||Sn|| ≤ 2 < 3. Hence, x ∈ BX(0, 3). By the
continuity of T, T(x) = T(lim Sn) = lim T(Sn) = lim(y1 + y2 + ... + yn) = y, since
$\|y - (y_1 + y_2 + \ldots + y_n)\| < \frac{c^*}{2^n} \to 0$ as n → ∞. Hence, since x ∈ BX(0, 3)
and T x = y, it follows that y ∈ T(BX(0, 3)). Hence $T(B_X(0, 3)) \supseteq B_Y(0, c^*)$. Set
$c := c^*/3$. Then $T(B_X(0, 1)) \supset B_Y(0, c)$, completing the proof of the lemma.
Hilbert Spaces
6.0 Introduction
In this chapter we introduce a special class of Banach spaces called Hilbert spaces.
These spaces are extremely useful in applications.
The pair (E, ⟨·, ·⟩) is called an inner product space. We shall simply write E
for the inner product space (E, ⟨·, ·⟩) when the inner product ⟨·, ·⟩ is known.
Example 6.2 The linear space ℝⁿ, with the function ⟨·, ·⟩ defined, for arbitrary
vectors x = (x1, x2, . . . , xn), y = (y1, y2, . . . , yn) in ℝⁿ, by
$$\langle x, y \rangle = \sum_{i=1}^{n} x_i y_i$$
is an inner product space.
Example 6.3 The linear space ℂⁿ, with the function ⟨·, ·⟩ defined, for arbitrary
u = (z1, z2, . . . , zn), ω = (w1, w2, . . . , wn) in ℂⁿ, by
$$\langle u, \omega \rangle = \sum_{i=1}^{n} z_i \overline{w_i}$$
Example 6.4 The linear space ℓ2(ℂ), with the function ⟨·, ·⟩ defined, for arbitrary
vectors x = (x1, x2, . . .), y = (y1, y2, . . .) in ℓ2, by
$$\langle x, y \rangle = \sum_{i=1}^{\infty} x_i \overline{y_i}$$
Example 6.5 The linear space C[0, 1] with the function ⟨·, ·⟩ defined, for arbitrary
f, g ∈ C[0, 1], by
$$\langle f, g \rangle = \int_0^1 f(t)\,g(t)\,dt$$
Example 6.6 The linear space L2[0, T] with ⟨·, ·⟩ defined, for arbitrary f, g ∈ L2[0, T], by
$$\langle f, g \rangle = \int_0^T f(t)\,g(t)\,dt$$
is an inner product space. (This example is for readers who are familiar with the
space L2 [0, T ]).
The trivial verifications are left as easy exercises for the reader.
6.2 Basic Properties
In this section we establish some basic properties of inner product spaces.
as stated.
0 ≤ ⟨tzx + y, tzx + y⟩
= t² z z̄⟨x, x⟩ + tz⟨x, y⟩ + tz̄⟨y, x⟩ + ⟨y, y⟩
= at² + 2t|⟨x, y⟩| + c = at² + 2bt + c.
From the theory of quadratic functions this yields the following inequality: b² ≤ ac,
i.e., |⟨x, y⟩|² ≤ ⟨x, x⟩ · ⟨y, y⟩, as required.
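The inequality just derived can be checked numerically. The following is a minimal sketch, assuming the inner product of ℂⁿ that is linear in the first argument and conjugate-linear in the second; the helper names `inner`, `cauchy_schwarz_sides` and `random_vector` are illustrative and not part of the text.

```python
import random

def inner(x, y):
    """Inner product on C^n: linear in x, conjugate-linear in y."""
    return sum(a * b.conjugate() for a, b in zip(x, y))

def cauchy_schwarz_sides(x, y):
    """Return the pair (|<x, y>|^2, <x, x> * <y, y>)."""
    return abs(inner(x, y)) ** 2, inner(x, x).real * inner(y, y).real

random.seed(0)  # fixed seed so that the run is reproducible

def random_vector(n):
    return [complex(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(n)]

x, y = random_vector(5), random_vector(5)
lhs, rhs = cauchy_schwarz_sides(x, y)
print(lhs <= rhs)

# Equality case: z is a scalar multiple of x.
z = [(2 - 3j) * a for a in x]
lhs_eq, rhs_eq = cauchy_schwarz_sides(x, z)
print(abs(lhs_eq - rhs_eq) < 1e-9)
```

The second check illustrates that equality holds when the two vectors are linearly dependent, up to floating-point rounding.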
Lemma 6.8 The function ‖·‖ : E → ℝ defined by ‖x‖ = ⟨x, x⟩^{1/2} is a norm on E.
Proof. That ‖·‖ satisfies N1 and N2 follows from the definition of ‖·‖ and conditions
I1 and I2. It now suffices to verify N3. For arbitrary x, y ∈ E,
‖x + y‖² = ⟨x + y, x + y⟩ = ⟨x, x + y⟩ + ⟨y, x + y⟩ = ⟨x, x⟩ + ⟨x, y⟩ + ⟨y, x⟩ + ⟨y, y⟩
= ‖x‖² + (⟨x, y⟩ + \overline{⟨x, y⟩}) + ‖y‖²
With respect to the norm defined in Lemma 6.8 we can define a Cauchy sequence
in an inner product space E. A sequence {xn}_{n=1}^{∞} in E is called Cauchy if and only
if ‖xn − xm‖ := ⟨xn − xm, xn − xm⟩^{1/2} → 0 as n, m → ∞. Consequently, an
inner product space E is called complete if every Cauchy sequence in E converges
to a point of E. A complete inner product space is called a Hilbert space.
6.3 Completeness of an Inner Product Space
A Hilbert space will be called a complex Hilbert space or a real Hilbert space accord-
ing as the underlying linear space is complex or real, respectively. In what follows
we shall denote Hilbert space by the letter H. Recall that a set K in a vector space
is called convex if for each x, y ∈ K and λ ∈ [0, 1] we have λx + (1 − λ)y ∈ K. In
other words, K is convex if for any two points x, y ∈ K the line segment joining x
and y lies in K.
Theorem 6.13 Let H be a Hilbert space and let K be a closed convex subset of H.
Then K contains a unique vector of minimum norm.
Remark 6.14 Closed convex subsets in arbitrary Banach spaces need not contain
unique vectors of minimum norm (see Exercises 6.1, Problem 20). Moreover, a
closed convex subset of an arbitrary Banach space may have infinitely many vectors
of minimum norm (see Exercises 6.1, Problem 21). Apart from Hilbert spaces, there
are also other Banach spaces in which closed convex subsets have unique vectors of
minimum norm (see e.g., Miscellaneous Exercises, Problem 25).
6.4 Jordan Von Neumann Theorem
Let V be a linear space (not necessarily normed). It is clear that we can always
define a metric on V. For example, the function ρ : V × V → ℝ defined by
ρ(x, y) = 1 if x ≠ y, ρ(x, y) = 0 if x = y,
is a metric on V. In case V is a normed linear space,
the norm on V, ‖·‖, also induces a metric on V given by
ρ(x, y) = ‖x − y‖.
Thus, given a metric on a linear space, it is sometimes of interest to know whether or not the
metric is induced by a norm. It can be shown that a metric ρ on a
linear space M is induced by a norm if the following two conditions are satisfied:
C1 : The metric is homogeneous, i.e.,
ρ(λx, λy) = |λ|ρ(x, y)
for arbitrary x, y ∈ M and scalar λ.
C2 : The metric is translation invariant, i.e.,
ρ(x + z, y + z) = ρ(x, y)
for arbitrary x, y, z ∈ M .
For an example of a metric not induced by a norm, the reader is referred to Problem
13 in the Miscellaneous Exercises (see also the above example, where ρ(λx, λy) =
1 for all λ ≠ 0 and all x ≠ y).
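As a small illustration of conditions C1 and C2, the discrete metric can be tested on the real line. This is only a sketch: the sample points and the scalar `lam` are arbitrary choices made for the example.

```python
def rho(x, y):
    """Discrete metric: 1 if x != y, 0 if x == y."""
    return 0 if x == y else 1

x, y, z, lam = 1.0, 4.0, 2.5, 3.0

# C2 (translation invariance) holds for the discrete metric.
translation_invariant = rho(x + z, y + z) == rho(x, y)

# C1 (homogeneity) fails: rho(lam*x, lam*y) = 1, but |lam| * rho(x, y) = 3.
homogeneous = rho(lam * x, lam * y) == abs(lam) * rho(x, y)

print(translation_invariant, homogeneous)
```

So the discrete metric is translation invariant but not homogeneous, which is why it cannot be induced by a norm.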
In Lemma 6.8, we defined a norm on an inner product space by ‖x‖ = ⟨x, x⟩^{1/2}.
The following question is of interest.
Question. When is the norm on a normed linear space given by an inner product?
Remark 6.16 Theorem 6.15 is useful in showing, for example, when a normed
linear space E is not an inner product space with ‖x‖² = ⟨x, x⟩. It suffices, in view
of this theorem, to show that the norm defined on the space does not satisfy the
parallelogram law. (See Exercises 6.1, Problems 13, 14 and 15).
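The test described in the remark is easy to try numerically. The sketch below assumes the usual form of the parallelogram law, ‖x + y‖² + ‖x − y‖² = 2‖x‖² + 2‖y‖², and compares the "max" norm with the Euclidean norm on ℝ²; the function names are illustrative.

```python
import math

def sup_norm(v):
    return max(abs(c) for c in v)

def euclid_norm(v):
    return math.sqrt(sum(c * c for c in v))

def parallelogram_gap(norm, x, y):
    """||x+y||^2 + ||x-y||^2 - 2||x||^2 - 2||y||^2; zero when the law holds for x, y."""
    s = [a + b for a, b in zip(x, y)]
    d = [a - b for a, b in zip(x, y)]
    return norm(s) ** 2 + norm(d) ** 2 - 2 * norm(x) ** 2 - 2 * norm(y) ** 2

x, y = [1.0, 0.0], [0.0, 1.0]
gap_sup = parallelogram_gap(sup_norm, x, y)
gap_euclid = parallelogram_gap(euclid_norm, x, y)
print(gap_sup, abs(gap_euclid) < 1e-12)
```

A single pair of vectors with a nonzero gap is enough to rule out an inner product for the norm in question; a zero gap for one pair proves nothing by itself.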
6.5 Orthonormal Sets
Definition 6.17 Two vectors x and y in an inner product space E are said to be
orthogonal (written x ⊥ y or y ⊥ x and read x ‘perp’ to y or y ‘perp’ to x) if
⟨x, y⟩ = 0. If M is a subset of E, then we write x ⊥ M if x ⊥ y for every y ∈ M.
Remark 6.18 Since ⟨x, y⟩ = \overline{⟨y, x⟩}, it follows that x ⊥ y if and only if y ⊥ x.
Furthermore, x ⊥ x if and only if x = 0, and x ⊥ 0 for all x ∈ E.
e1 = (1, 0, 0, . . .)
e2 = (0, 1, 0, . . .)
. . .
en = (0, 0, . . . , 0, 1, 0, . . .) (1 in the nth position)
. . .
is an orthonormal set.
Example 6.21 In L2[0, 2π], the set S = {(1/√(2π)) e^{int}}_{n=1}^{∞} is an orthonormal set.
so that,
⟨x − Σ_{i=1}^{n} αi ui, x − Σ_{j=1}^{n} αj uj⟩ ≥ 0,
⟨x, x⟩ − ⟨x, Σ_{j=1}^{n} αj uj⟩ − ⟨Σ_{i=1}^{n} αi ui, x⟩ + ⟨Σ_{i=1}^{n} αi ui, Σ_{j=1}^{n} αj uj⟩ ≥ 0,
‖x‖² − Σ_{j=1}^{n} ᾱj⟨x, uj⟩ − Σ_{i=1}^{n} αi⟨ui, x⟩ + Σ_{i=1}^{n} Σ_{j=1}^{n} αi ᾱj⟨ui, uj⟩ ≥ 0,
which yields
Σ_{j=1}^{n} |αj|² := Σ_{j=1}^{n} |⟨x, uj⟩|² ≤ ‖x‖², as required.
Proposition 6.22 can be generalized to yield the following theorem.
Theorem 6.23 (Bessel’s Inequality) If {ui}_{i=1}^{∞} is an orthonormal set in an inner
product space E, then for arbitrary x ∈ E,
Σ_{i=1}^{∞} |⟨x, ui⟩|² ≤ ‖x‖².
Furthermore, x − Σ_{i=1}^{∞} ⟨x, ui⟩ui is orthogonal to uj for each j.
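Both assertions of Bessel's inequality can be observed in a finite example. The sketch below works in ℝ⁴ with a two-element orthonormal set; the vectors `U` and `x` are hypothetical data chosen for the illustration.

```python
import math

def inner(x, y):
    return sum(a * b for a, b in zip(x, y))

s = 1 / math.sqrt(2)
U = [[1.0, 0.0, 0.0, 0.0], [0.0, s, s, 0.0]]   # an orthonormal set in R^4
x = [3.0, 1.0, -2.0, 5.0]

coeffs = [inner(x, u) for u in U]               # the numbers <x, u_i>
bessel_sum = sum(c * c for c in coeffs)
norm_sq = inner(x, x)

# Residual x - sum_i <x, u_i> u_i, which should be orthogonal to every u_j.
residual = [xk - sum(c * u[k] for c, u in zip(coeffs, U)) for k, xk in enumerate(x)]

print(bessel_sum, norm_sq)
print([inner(residual, u) for u in U])
```

Here the inequality is strict because the two vectors in `U` do not span ℝ⁴.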
6.6 The Projection Theorem
Using the concept of orthogonality, we shall extend the well known elementary fact
that the shortest distance from a point to a plane is given by the perpendicular
from the point to the plane. This, in fact, gives us a very powerful and important
theorem – the Projection Theorem – used, for example, in Optimization Theory.
The answers to these questions are contained in the next two theorems.
‖x − m∗‖ = inf_{m∈M} ‖x − m‖,
‖x − mδ‖² = ‖x − m∗ − δm0‖²
= ‖x − m∗‖² − ⟨x − m∗, δm0⟩ − ⟨δm0, x − m∗⟩ + |δ|²
= ‖x − m∗‖² − |δ|² < ‖x − m∗‖²,
contradicting the hypothesis that m∗ is the unique minimizing vector. Hence
(x − m∗) ⊥ M.
‖x − m‖ > ‖x − m∗‖ for m ≠ m∗,
which shows m∗ is the minimizing vector. Uniqueness follows trivially, completing
the proof of Theorem 6.24.
Remark 6.25 Theorem 6.24 does not guarantee the existence of the minimizing
vector. It only asserts that if it exists, then it is unique and (x − m∗ ) ⊥ M .
If, instead of considering arbitrary inner product spaces we consider Hilbert spaces,
H, and closed subspaces M of H, we obtain the following theorem which guarantees
the existence of the minimizing vector.
Theorem 6.26 (The Projection Theorem) Let H be a Hilbert space and M a closed
subspace of H. For an arbitrary vector x in H, there exists a unique vector m∗ ∈ M
such that ‖x − m∗‖ ≤ ‖x − m‖ for all m ∈ M. Furthermore, m∗ ∈ M is the unique
minimizing vector if and only if (x − m∗) ⊥ M.
δ := inf{‖x − m‖ : m ∈ M}.
Rearrangement yields,
‖mi − mj‖² = 2‖mi − x‖² + 2‖x − mj‖² − 4‖x − (mi + mj)/2‖².
For i and j, the vector (mi + mj)/2 is in M since M is a linear subspace. Hence, by
the definition of δ,
‖x − (mi + mj)/2‖ ≥ δ,
so that,
‖mi − mj‖² ≤ 2‖mi − x‖² + 2‖x − mj‖² − 4δ².
Since ‖mj − x‖ → δ as j → ∞, we have that ‖mi − mj‖ → 0 as i, j → ∞.
Hence {mj}_{j=1}^{∞} is a Cauchy sequence in M and since M is complete (as a closed
subset of a Hilbert space), it follows that {mj}_{j=1}^{∞} has a limit m∗ in M. This
implies x − mj → x − m∗ as j → ∞ and so (by continuity of the norm)
‖x − mj‖ → ‖x − m∗‖ as j → ∞. But ‖x − mj‖ → δ as j → ∞, so that
by the uniqueness of limits, we obtain
‖x − m∗‖ = δ = inf{‖x − m‖ : m ∈ M}.
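In finite dimensions the minimizing vector can be computed explicitly. The following sketch, which is not the construction used in the proof, projects a point of ℝ³ onto a two-dimensional subspace by Gram-Schmidt orthonormalization and then checks the orthogonality condition (x − m∗) ⊥ M; the spanning vectors `a1`, `a2` and the point `x` are hypothetical.

```python
import math

def inner(x, y):
    return sum(a * b for a, b in zip(x, y))

def norm(x):
    return math.sqrt(inner(x, x))

def gram_schmidt(vectors):
    """Orthonormal basis of span(vectors); (nearly) dependent vectors are skipped."""
    basis = []
    for v in vectors:
        w = list(v)
        for q in basis:
            c = inner(w, q)
            w = [wi - c * qi for wi, qi in zip(w, q)]
        n = norm(w)
        if n > 1e-12:
            basis.append([wi / n for wi in w])
    return basis

def project(x, spanning):
    """Closest point m* to x in M = span(spanning)."""
    m = [0.0] * len(x)
    for q in gram_schmidt(spanning):
        c = inner(x, q)
        m = [mi + c * qi for mi, qi in zip(m, q)]
    return m

a1, a2 = [1.0, 1.0, 0.0], [0.0, 1.0, 1.0]   # hypothetical spanning set of M
x = [1.0, 2.0, 6.0]
m_star = project(x, [a1, a2])
residual = [xi - mi for xi, mi in zip(x, m_star)]

print(m_star)
print(inner(residual, a1), inner(residual, a2))
```

For this data the residual is a multiple of (1, −1, 1), the normal to the plane M, so both printed inner products vanish up to rounding.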
Definition 6.27 Let E be a vector space. E is said to be the direct sum of two
subspaces M and N of E, written E = M ⊕ N if each x ∈ E can be represented
uniquely as x = m + n with m ∈ M and n ∈ N . In this case, N is called the
algebraic complement of M in E (and vice versa). The subspaces M and N are
called a complementary pair of subspaces in E.
Definition 6.28 For a Hilbert space H, given a subset S of H, the set of all vectors
orthogonal to S is called the orthogonal complement of S and is denoted by S⊥, i.e.,
S⊥ := {x ∈ H : ⟨x, s⟩ = 0 for all s ∈ S}.
(i) The proof of this part is left as an easy exercise for the reader.
(ii) If x ∈ M then x ⊥ y for all y ∈ M ⊥ . Therefore, x ∈ (M ⊥ )⊥ = M ⊥⊥ .
x = m∗ + (x − m∗ ) = m∗ + n∗ , where n∗ := x − m∗ ,
m∗ + n∗ = m1 + n1 so that m1 − m∗ + (n1 − n∗ ) = 0.
Proof. As in Proposition 2.21, the proof is divided into 2 parts.
|fy(x)| = ||y|| ≤ ||fy|| · ||x|| = ||fy||, i.e., ||y|| ≤ ||fy||. (6.2)
Hence, f(u) = α = ⟨u, x1/‖x1‖²⟩ so that f(u) = ⟨u, y0⟩ ∀ u ∈ H\K where y0 = x1/‖x1‖².
But this also holds if u ∈ K. Hence it holds for all u ∈ H. Moreover by part 1, we
have that ||f|| = ||y0||. Finally, it only remains to show that y0 is unique. But, if
f(x) = ⟨x, y∗⟩ ∀ x ∈ H then
6.7 The Riesz Representation Theorem
In part 2 of Proposition 6.31, we actually proved an important theorem in Hilbert
spaces called the Riesz Representation Theorem. This theorem states that any
bounded linear functional on a Hilbert space H can be represented as an inner product
with a unique vector in H. More precisely, we proved the following theorem.
Theorem 6.32 (Riesz Representation Theorem) Let H be a Hilbert space and let
f be a bounded linear functional on H. Then,
(i) There exists a unique vector y0 ∈ H such that
f(x) = ⟨x, y0⟩ for each x ∈ H; (6.3)
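In the finite-dimensional space ℂⁿ the representing vector can be written down directly. The sketch below assumes a functional of the form f(x) = Σ ck xk with hypothetical coefficients `c`, and the inner product that is conjugate-linear in the second argument; under these assumptions y0 is the entrywise conjugate of c, and the relation ||f|| = ||y0|| noted in the proof above is attained at the unit vector y0/||y0||.

```python
import math

def inner(x, y):
    """Inner product on C^n: linear in x, conjugate-linear in y."""
    return sum(a * b.conjugate() for a, b in zip(x, y))

c = [2 + 1j, -1j, 3 + 0j]            # hypothetical coefficients defining f

def f(x):
    """Bounded linear functional f(x) = sum_k c_k x_k on C^3."""
    return sum(ck * xk for ck, xk in zip(c, x))

y0 = [ck.conjugate() for ck in c]    # representing vector for this f
x = [1 - 2j, 0.5 + 0j, -1 + 1j]

norm_y0 = math.sqrt(inner(y0, y0).real)
unit = [yk / norm_y0 for yk in y0]

print(abs(f(x) - inner(x, y0)) < 1e-12)   # f(x) = <x, y0>
print(abs(f(unit) - norm_y0) < 1e-12)     # the value ||y0|| is attained on the unit sphere
```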
(⇐) Assume now that S = {uα}α∈∆ is complete. We want to show {uα}α∈∆ is a
basis. Now, for every x ∈ E and every α ∈ ∆,
⟨x − Σ_{β∈∆} ⟨x, uβ⟩uβ, uα⟩ = 0 (verify, see Exercises 6.1, Problem 18).
Hence, since S is complete, we must have x − Σ_{α∈∆} ⟨x, uα⟩uα = 0, which yields the
desired result.
Lemma 6.35 Let E be an inner product space. Then S = {uα}α∈∆ is a basis for
E if and only if for arbitrary x ∈ E,
Σ_{α∈∆} |⟨x, uα⟩|² = ‖x‖².
Proof. S is a basis for E if and only if x = Σ_{α∈∆} ⟨x, uα⟩uα for arbitrary x ∈ E, if
and only if
‖x‖² = ⟨Σ_{α∈∆} ⟨x, uα⟩uα, Σ_{β∈∆} ⟨x, uβ⟩uβ⟩,
if and only if
‖x‖² = Σ_{α∈∆} ⟨x, uα⟩ ( Σ_{β∈∆} \overline{⟨x, uβ⟩} ⟨uα, uβ⟩ ),
if and only if
‖x‖² = Σ_{α∈∆} ⟨x, uα⟩ \overline{⟨x, uα⟩}, since ⟨uα, uβ⟩ = 0 for α ≠ β,
if and only if
‖x‖² = Σ_{α∈∆} |⟨x, uα⟩|², as desired.
Remark 6.36 The identity Σ_{α∈∆} |⟨x, uα⟩|² = ‖x‖² is called Parseval’s Identity (or
equation). The expression x = Σ_{α∈∆} ⟨x, uα⟩uα is called the Fourier expansion of x and
the numbers ⟨x, uα⟩ are called the Fourier coefficients.
Consider S = {e^{inx}, n = 0, ±1, ±2, . . .}, where i² = −1. Then,
∫_{0}^{2π} e^{inx} e^{−imx} dx = 0 if m ≠ n, and = 2π if m = n.
Thus, if we define
un = (1/√(2π)) e^{inx}, n = 0, ±1, ±2, . . . ,
then S := {un}_{n=−∞}^{∞} is an orthonormal set in L2[0, 2π]. For any f ∈ L2[0, 2π], the
numbers
αn = ⟨f, un⟩ = (1/√(2π)) ∫_{0}^{2π} f(x) e^{−inx} dx
are its Fourier coefficients; Bessel’s inequality takes the form:
Σ_{n=−∞}^{∞} |αn|² ≤ ∫_{0}^{2π} |f(x)|² dx.
In fact, S is a complete orthonormal set in L2[0, 2π] (so that
Σ_{n=−∞}^{∞} |αn|² = ∫_{0}^{2π} |f(x)|² dx).
Moreover, each f ∈ L2[0, 2π] has Fourier expansion given by
f(x) = (1/√(2π)) Σ_{n=−∞}^{∞} αn e^{inx} = (1/2π) Σ_{n=−∞}^{∞} ∫_{0}^{2π} f(x + y) e^{−iny} dy.
Remark 6.38 The Fourier expansion f(x) = Σ_{n=−∞}^{∞} αn un(x) does not mean that the
RHS converges pointwise to f. The meaning of the expansion is that the partial sums
fn(x) := Σ_{k=−n}^{n} αk uk(x) converge to f in the sense that ‖fn − f‖_{L2[0,2π]}
→ 0 as n → ∞.
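The Bessel and Parseval relations for the trigonometric system can be observed numerically. The sketch below takes the sample function f(x) = x on [0, 2π], approximates the Fourier coefficients with respect to (1/√(2π)) e^{inx} by a midpoint rule, and sums |αn|² over |n| ≤ 50; the function, the number of nodes `N` and the cutoff 50 are all choices made for the illustration.

```python
import cmath
import math

N = 4000                                   # number of midpoint-rule nodes
h = 2 * math.pi / N
nodes = [(k + 0.5) * h for k in range(N)]

def f(x):
    return x                               # sample function in L2[0, 2*pi]

def fourier_coefficient(n):
    """Midpoint-rule approximation of (1/sqrt(2*pi)) * integral of f(x) e^{-inx} dx."""
    total = sum(f(x) * cmath.exp(-1j * n * x) for x in nodes)
    return total * h / math.sqrt(2 * math.pi)

norm_sq = sum(abs(f(x)) ** 2 for x in nodes) * h   # approximates the integral of |f|^2
partial = sum(abs(fourier_coefficient(n)) ** 2 for n in range(-50, 51))

print(norm_sq, partial)
print(partial <= norm_sq)
```

The partial sums increase towards the integral of |f|² as the cutoff grows, in line with Parseval's identity; for this f the gap left by the cutoff at 50 is about 0.25.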
1. Verify that the spaces defined in Examples 6.2 to 6.6 are inner product spaces.
2. Compute the ℓ² inner product of x and y where
x = {1, 1/2, 1/3, 1/4, . . .} and y = {1/3, 1/4, 1/5, . . .}.
8. Prove Lemma 6.10, Propositions 6.11 and 6.12.
9. Verify that the sets S defined in Examples 6.20 and 6.21 are orthonormal sets.
11. Give an example of an inner product space which is not a Hilbert space. Prove
that your example has the desired properties.
13. Verify that C[0, 1] with “sup” norm is not an inner product space.
15. Is ℝⁿ with ‖(x1, x2, . . . , xn)‖∞ = max |xi|, an inner product space? Justify
your answer.
16. Let E = C[0, 1] with ⟨f, g⟩ = ∫_{0}^{1} f(t) \overline{g(t)} dt.
Consider the functions f(t) = sin πt + i sin πt, g(t) = − sin 2πt + i sin 3πt. Are
the functions f and g orthogonal? Justify your answer.
17. Prove that if K is a closed convex subset of a Hilbert space H, then for each
x ∈ H there exists a unique x∗ ∈ K such that
‖x − x∗‖ = inf_{u∈K} ‖x − u‖.
Hint: Use the parallelogram law and follow the argument of Theorem 6.26.
18. Verify the assertion made in the proof of Lemma 6.34 that for every x ∈ E,
⟨x − Σ_{β∈∆} ⟨x, uβ⟩uβ, uα⟩ = 0 where {uα}α∈∆ is an orthonormal set.
Ti(x1, x2, . . .) = xi ∀ (x1, x2, . . .) ∈ ℓ².
(b) Define T : L2[0, 2π] → ℝ by
(T f)(t) = ∫_{0}^{2π} f(t) dt ∀ f ∈ L2[0, 2π].
*20. Let E = C[0, 1], with “sup” norm. Let K consist of all f ∈ E such that
∫_{0}^{1/2} f(s) ds − ∫_{1/2}^{1} f(s) ds = 1.
*21. Let K be the set of all f ∈ L1([0, 1]), relative to Lebesgue measure, such that
∫_{0}^{1} f(s) ds = 1.
Prove that K is a closed convex subset of L1 ([0, 1]) which contains infinitely
many elements of minimum norm.
(d) (A∗ )∗ = A;
(e) ||A|| = ||A∗ ||;
(f) ||A∗ A|| = ||A||2 .
Proof. We verify (e) and (f) and leave the others as easy exercises. For (e), from
the relation ⟨x, A∗y⟩ = ⟨Ax, y⟩, by setting x = A∗y we obtain
so that
Applying (7.2) to A∗ , we have ||(A∗ )∗ || ≤ ||A∗ ||. Using (d), we now have
7.1 Self-adjoint, Normal and Unitary Operators
Definition 7.3 Let H be a Hilbert space. An operator T : H → H is called:
(i) Self-adjoint or Hermitian if T = T ∗ ;
(ii) Normal if T ∗ T = T T ∗ ;
(iii) Unitary if T T ∗ = T ∗ T = I.
Theorem 7.6 Let T : H → H be a bounded linear operator on a complex Hilbert
space, H. Then, T is self-adjoint if and only if ⟨T x, x⟩ is real for all x ∈ H.
⟨T x, x⟩ = ⟨x, T∗x⟩ = \overline{⟨T x, x⟩}
⟨T x, x⟩ = \overline{⟨T x, x⟩} = ⟨T∗x, x⟩
so that
⟨T x − T∗x, x⟩ = 0 or ⟨(T − T∗)x, x⟩ = 0,
T = (1/2)(T + T∗) + i[(1/(2i))(T − T∗)] = A + iB,
where A = (1/2)(T + T∗) and B = (1/(2i))(T − T∗) are self-adjoint operators.
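The decomposition T = A + iB can be verified on a small matrix. The sketch below treats a hypothetical 2 × 2 complex matrix as an operator on ℂ², with the adjoint given by the conjugate transpose; the helper names `adjoint`, `combine` and `close` are illustrative.

```python
def adjoint(M):
    """Conjugate transpose of a square matrix given as a list of rows."""
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def combine(a, P, b, Q):
    """Entrywise a*P + b*Q."""
    return [[a * p + b * q for p, q in zip(rp, rq)] for rp, rq in zip(P, Q)]

def close(P, Q, tol=1e-12):
    return all(abs(p - q) <= tol for rp, rq in zip(P, Q) for p, q in zip(rp, rq))

T = [[1 + 2j, 3 - 1j], [4j, -2 + 0j]]       # hypothetical operator on C^2
T_star = adjoint(T)

A = combine(0.5, T, 0.5, T_star)            # A = (T + T*) / 2
B = combine(1 / (2j), T, -1 / (2j), T_star) # B = (T - T*) / (2i)

print(close(A, adjoint(A)), close(B, adjoint(B)))   # both parts are self-adjoint
print(close(T, combine(1, A, 1j, B)))               # T = A + iB
```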
Then, ||T∗|| = ||T||, (already proved for all bounded linear operators on H). Take
x = (1, 0, 0, ...) ∈ l2. Verify that ||T x|| ≠ 0 whereas ||T∗x|| = 0. (See Exercises 7.1,
Problem 9).
7.1.2 Normal Operators
Proposition 7.10 Let T : H → H be a bounded linear map. If T is normal, then
Theorem 7.11 The set of all normal operators on H is a closed subset of B(H)
which contains the set of all self-adjoint operators, and is closed under scalar mul-
(ii) ⇒ (iii)
||T x||² = ⟨T x, T x⟩ = ⟨x, x⟩ = ||x||²,
as required.
(iii) ⇒ (i) For all x ∈ H, ||x||² = ||T x||² ⇒ ⟨x, x⟩ = ⟨T x, T x⟩ ⇒ ⟨x, x⟩ =
⟨T∗T x, x⟩ ⇒ ⟨(I − T∗T)x, x⟩ = 0
⇒ (by Lemma 7.5) (I − T∗T) = 0 ⇒ T∗T = I as required.
Remark 7.13 It is easy to see that an operator satisfying any of the conditions
(i)-(iii) is not necessarily unitary. It suffices, for example, to consider the right shift
operator on l2
T : l2 → l2
defined by T(x1, x2, x3, ...) = (0, x1, x2, x3, ...). Clearly ||T x|| = ||x||. Observe that if
y = (y1, y2, y3, ...) ∈ l2 then ⟨T x, y⟩ = ⟨(0, x1, x2, ...), (y1, y2, y3, ...)⟩ = x1 y2 + x2 y3 +
x3 y4 + . . . = ⟨(x1, x2, x3, ...), (y2, y3, ...)⟩,
so that T∗y = T∗(y1, y2, y3, ...) = (y2, y3, y4, . . .). Now,
(T T∗)(y) = T(y2, y3, ...) = (0, y2, y3, ...) ≠ Iy whenever y1 ≠ 0,
so T T∗ ≠ I, and so T is not unitary.
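The computation with the right shift can be replayed on finitely supported real sequences, stored as Python lists whose missing trailing entries are understood to be zero. This is only a sketch of the remark's example; the sample sequences are arbitrary.

```python
def right_shift(x):
    """T(x1, x2, ...) = (0, x1, x2, ...)."""
    return [0] + list(x)

def left_shift(y):
    """T*(y1, y2, ...) = (y2, y3, ...)."""
    return list(y[1:])

def inner(x, y):
    # zip stops at the shorter list; the dropped entries would be multiplied by zero.
    return sum(a * b for a, b in zip(x, y))

x = [1, 2, 3]
y = [4, 5, 6, 7]

print(inner(right_shift(x), y), inner(x, left_shift(y)))   # <Tx, y> and <x, T*y>
print(left_shift(right_shift(x)) == x)                     # T*T x = x
print(right_shift(left_shift(y)) == y)                     # T T* y differs from y since y1 != 0
```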
Theorem 7.14 Let T : H → H be a bounded linear operator on a complex Hilbert
space, H. If T is an isometry, then T ∗ T = I.
Proof. This follows from parts (i) and (iii) of Theorem 7.12.
We now give the following characterization of unitary operators:
Corollary 7.15 Let T : H → H be a bounded linear operator on a complex Hilbert
space. Then, T is unitary if and only if T and T ∗ are both isometries.
Proof. Applying Theorem 3.50, the proof follows immediately.
Remark 7.16 In the light of the results of this subsection, we deduce that unitary
operators are those that preserve all the structures of H.
• Set theoretic structure is preserved because they are bijective.
• Linear structure is preserved because they are linear maps.
• Norm structure is preserved because they are isometries.
• Inner product structure is preserved because ⟨T x, T y⟩ = ⟨x, y⟩ for all x, y ∈ H.
(T ∗ )−1 = (T −1 )∗ .
Observe that if E is endowed with the discrete topology (i.e., the topology in which
every subset of E is open) then each φi is continuous (Why ?). Of course, this
topology is far from being the minimum. It is, in fact, the largest.
sets of Yi ’s. Having noted this fact we now ask the following question: Is the family
U = {Ui }i∈I a topology on E, and if it is, is it the smallest topology for which all φi
are continuous? The answer to this question is implicit in the following construction.
The Construction of τ
To construct the desired topology, we consider first, finite intersections of mem-
bers of U . In this way, we obtain a family Φ of subsets of E closed under finite
intersections. We then consider the family of arbitrary unions of members of Φ.
This family is clearly closed under arbitrary unions. We shall denote it by τ . It is,
however, no longer evident that the family τ is closed under finite intersection. We
state this as a lemma.
Proof. Exercise.
Remark 8.2 We cannot reverse the order of the operations in the above construction
of τ. It might seem more natural to begin by considering arbitrary unions of members of
U and then to take finite intersections. If we follow this order, the family
obtained in this way is, of course, closed under finite intersections. It turns out that
it is, in general, not closed under arbitrary unions.
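On a finite set the construction of τ can be carried out by brute force. The sketch below starts from a hypothetical family `U` of subsets of E = {1, 2, 3, 4}, forms all finite intersections (with E itself as the empty intersection), then all unions, and checks that the result is closed under intersections.

```python
from itertools import combinations

def finite_intersections(E, family):
    """All intersections of finitely many members of `family`; E is the empty intersection."""
    result = {frozenset(E)}
    for r in range(1, len(family) + 1):
        for combo in combinations(family, r):
            result.add(frozenset.intersection(*combo))
    return result

def unions(family):
    """All unions of members of `family`; the empty set is the empty union."""
    members = list(family)
    result = {frozenset()}
    for r in range(1, len(members) + 1):
        for combo in combinations(members, r):
            result.add(frozenset.union(*combo))
    return result

E = {1, 2, 3, 4}
U = [frozenset({1, 2}), frozenset({2, 3})]   # hypothetical starting family
Phi = finite_intersections(E, U)             # closed under finite intersections
tau = unions(Phi)                            # closed under unions

closed_under_intersection = all((a & b) in tau for a in tau for b in tau)
print(sorted(sorted(s) for s in tau))
print(closed_under_intersection)
```

The enumeration is exponential in the size of the family, so this is only suitable for very small examples.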
Let us now endow E with the topology τ constructed above and prove two ele-
mentary but important properties of this topology. Before we do this, however, we
make the following remark.
Remark 8.3 Recall that the open sets of the topology τ are obtained by considering
first the finite intersections of sets of the form φi⁻¹(ωi), ωi open in Yi, and then taking
arbitrary unions of members of the resulting family. Thus, given a point x0 ∈ E we
obtain a neighborhood base at x0 for the topology τ by considering sets of the form
⋂_{finite} φi⁻¹(Vi)
where Vi is a neighborhood of φi(x0) in Yi.
let U be a neighborhood of x. We want to produce an N such that for all n ≥ N,
we have xn ∈ U. Since E is endowed with the topology τ we may suppose that U is
of the form
U = ⋂_{i∈J} φi⁻¹(Vi), J ⊂ I, J finite,
φi(xn) ∈ Vi.
N = max_{i∈J} Ni.
We then have, for all n ≥ N, and for each i ∈ J, φi(xn) ∈ Vi, i.e., for all i ∈ J and
for all n ≥ N we have xn ∈ φi⁻¹(Vi). This implies
xn ∈ ⋂_{i∈J} φi⁻¹(Vi) = U
for all n ≥ N. Since U was arbitrary, it follows that xn → x. The proof is complete.
Proposition 8.5 Let E be a topological space with the topology τ and let Z be an
arbitrary topological space. Let ψ be a mapping of Z into E (i.e., ψ : Z → E).
Then, ψ is continuous if and only if φi ◦ ψ is continuous from Z into Yi for each i ∈ I.
[Fig. 8.1: commutative diagram of the maps ψ : (Z, any topology) → (E, ω), φi : E → Yi and φi ◦ ψ : Z → Yi]
Proof. (See Fig. 8.1) (⇒) If ψ is continuous, then clearly φi ◦ ψ is also continuous
for all i ∈ I.
(⇐) Conversely, we want to prove ψ is continuous. Let U be an arbitrary open
subset of E. It suffices to show that ψ −1 (U ) is open in Z. By hypothesis, φi ◦ ψ is
continuous. We note that U is of the form
U = ⋃_{arbitrary} ( ⋂_{finite} φi⁻¹(ωi) )
with ωi open in Yi. Consequently,
ψ⁻¹(U) = ψ⁻¹( ⋃_{arbitrary} ⋂_{finite} φi⁻¹(ωi) ) = ⋃_{arbitrary} ⋂_{finite} (φi ◦ ψ)⁻¹(ωi),
which is open in Z since φi ◦ ψ is continuous and ωi
is open in Yi. Hence ψ is continuous.
φf : E → IR
Remark 8.7 We shall use the notation hf, xi and f (x) interchangeably to denote
the action of f on x.
We observe that if we take Yi = IR for each i, and I = E ∗ , then the weak topology
ω on E becomes a special case of the topology τ defined earlier.
We now restate Proposition 8.4 when E is endowed with the weak topology as
Proposition 8.8 Let (E, ω) denote a Banach space endowed with the weak topology.
Then xn → x in (E, ω) if and only if φf(xn) → φf(x), i.e., if and only if f(xn)
→ f(x) for each f ∈ E∗.
Proof. Let x1, x2 ∈ E, x1 ≠ x2. We shall construct U1, U2 open in the weak topology
such that x1 ∈ U1, x2 ∈ U2 and U1 ∩ U2 = ∅. Observe that x1 ≠ x2 implies that
the sets {x1} and {x2} are disjoint. Clearly, these two sets are also convex, closed
and nonempty. Furthermore, they are compact. By the Hahn Banach Theorem (Second
Geometric Form) there exists a closed hyperplane which separates {x1} and {x2} in
the strict sense. Hence, there exist f ∈ E∗ and α ∈ ℝ such that
We now set
U1 = {x ∈ E : ⟨f, x⟩ < α} = φf⁻¹((−∞, α)).
V = {x ∈ E : |⟨fi, x − x0⟩| < ε, i ∈ J}
x0 ∈ V ⊂ U1 ⊂ U.
Proposition 8.11 Let {xn } be a sequence in E. We have the following results.
(i) xn ⇀ x ⇔ f(xn) → f(x) for each f ∈ E∗;
(ii) xn → x ⇒ xn ⇀ x;
(iii) xn ⇀ x ⇒ {xn} is bounded and ||x|| ≤ lim inf ||xn||;
(iv) xn ⇀ x (in E), fn → f (in E∗) ⇒ fn(xn) → f(x) (in ℝ).
(ii) To prove that xn ⇀ x it suffices to prove that ⟨f, xn⟩ → ⟨f, x⟩ for each
f ∈ E∗. But for each f ∈ E∗ we have:
(iii) Here we shall make use of Corollary 4.7. So, it suffices to prove that for
each f ∈ E∗, the sequence {f(xn)} is bounded. Since xn ⇀ x we have (by Part (i)):
for each f ∈ E∗
⟨f, xn⟩ → ⟨f, x⟩
as n → ∞. Hence, {f(xn)} is bounded for each f ∈ E∗ and by Corollary 4.7, this
implies {xn} is bounded. Now, since f ∈ E∗ we have
Taking “lim inf” in (8.1) we obtain: |⟨f, x⟩| ≤ ||f|| · lim inf ||xn||. By Exercises 3.1,
Problem 6, we obtain
||x|| = sup_{||f||≤1, f∈E∗} |⟨f, x⟩| ≤ sup_{||f||≤1, f∈E∗} (||f|| · lim inf ||xn||) ≤ lim inf ||xn||,
(iv) We have: |⟨fn, xn⟩ − ⟨f, x⟩| ≤ |⟨fn − f, xn⟩| + |⟨f, xn − x⟩| ≤ ||fn − f|| · ||xn|| +
|⟨f, xn − x⟩|. Since xn ⇀ x we have (by (iii)) that there exists a > 0 such that
||xn|| ≤ a for all n ≥ 0 and since xn ⇀ x, we also have ⟨f, xn − x⟩ → 0. Hence
Remark 8.12 Proposition 8.11 (ii) shows that strong convergence implies weak
convergence. The converse, however, is false, i.e., weak convergence does not always
imply strong convergence. We give the following example.
Example 8.13 Let H be the real Hilbert space ℓ² and define the sequence {en} in H by
en = (0, 0, 0, ..., 0, 1, 0, ...)
with 1 in the nth position and zero in every other position. By the Riesz Representation
Theorem, for each f ∈ H∗,
f(en) = ⟨z, en⟩
for some unique z ∈ H, and by Bessel’s inequality,
Σ_{n=1}^{∞} |⟨z, en⟩|² ≤ ||z||²,
and so Σ_{n=1}^{∞} |⟨z, en⟩|² converges. Hence, lim ⟨z, en⟩ = 0, that is, f(en) = ⟨z, en⟩ → 0 for
each f ∈ H∗. Hence, en ⇀ 0 as n → ∞. However, {en} does not converge strongly
to zero since, for n ≠ m,
||en − em|| = √2
and so {en} is not convergent (not even Cauchy!).
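The two behaviours in this example can be seen on truncated sequences. The sketch below fixes one particular z, a truncation of (1, 1/2, 1/3, ...), so it illustrates rather than proves the weak convergence; the truncation length `LENGTH` is an arbitrary choice.

```python
import math

def inner(x, y):
    return sum(a * b for a, b in zip(x, y))

def e(n, length):
    """n-th unit vector (1 in position n, counted from 1), truncated to `length` terms."""
    return [1.0 if k == n - 1 else 0.0 for k in range(length)]

LENGTH = 200
z = [1.0 / (k + 1) for k in range(LENGTH)]   # truncation of (1, 1/2, 1/3, ...)

# <z, e_n> = 1/n tends to zero as n grows ...
pairings = [inner(z, e(n, LENGTH)) for n in (1, 10, 100)]

# ... while distinct unit vectors stay at distance sqrt(2) from each other.
diff = [a - b for a, b in zip(e(3, LENGTH), e(7, LENGTH))]
distance = math.sqrt(inner(diff, diff))

print(pairings)
print(distance)
```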
Proposition 8.14 Let E be a finite dimensional normed linear space. Then strong
and weak topologies coincide. (In particular, a sequence {xn } in E converges weakly
if and only if it converges strongly).
Proof. Let s denote the strong topology of E. The weak topology never has more
open sets than the strong topology. So, if a set is open in the weak topology then
it is open in the strong. Hence ω ⊂ s. Conversely, we must show that (in this case
where E is finite dimensional) a set which is open in the strong topology is also
open in the weak topology. Let the dimension of E be n. Let x0 ∈ E and let U be
a neighborhood of x0 in the strong topology. We shall construct a neighborhood V
of x0 in the weak topology such that x0 ∈ V ⊂ U. In particular, we shall find a set
{fi}i∈J, with J finite, in E∗ and an ε > 0 such that
x0 ∈ V ⊂ B(x0, r) ⊂ U. We choose a basis {ei}_{i=1}^{n} of E with ||ei|| = 1 for all i.
Each x ∈ E can be written uniquely as x = Σ_{i=1}^{n} αi ei. Define the maps
fi : E → ℝ
by
fi(x) = fi(Σ_{i=1}^{n} αi ei) = αi, i = 1, 2, ..., n.
Clearly, (i) fi is linear for each i; (ii) fi is bounded for each i. For, |fi(x)| = |αi| ≤
max_{1≤i≤n} |αi| = ||x|| ∀ x ∈ E. So, {fi}_{i=1}^{n} ⊂ E∗ and J = {1, 2, ..., n}. We take
ε := r/n (where, you recall, r is the radius of B(x0, r)). We now show that with the
set {fi}_{i=1}^{n} and this ε, the set V defined in (8.2) is in B(x0, r) ⊂ U. Let x0 = Σ_{i=1}^{n} βi ei.
Then, for arbitrary x ∈ V, x = Σ_{i=1}^{n} αi ei, we have
||x − x0|| = ||Σ_{i=1}^{n} (αi − βi)ei|| ≤ Σ_{i=1}^{n} |αi − βi| · ||ei||
= Σ_{i=1}^{n} |fi(x) − fi(x0)| = Σ_{i=1}^{n} |⟨fi, x − x0⟩|
< Σ_{i=1}^{n} ε = εn = r.
an x0 ∈ E, ||x0|| < 1, such that x0 ∈ cl Sw. Since such an x0 is not in S this will
complete the proof. So, let x0 ∈ E be such that ||x0|| < 1. Let U be an arbitrary
neighborhood of x0 in the weak topology. It suffices to show U ∩ S ≠ ∅. We can
always suppose that U is of the form
h : [0, ∞) → IR
[Fig. 8.2: graph of h(t) against t; the point (0, 1) is marked]
neighborhood of x0 in the weak topology, we have proved that x0 ∈ cl Sw . This
completes the example.
Example 8.16 The set A = {x ∈ E : ||x|| < 1} is clearly open in the strong
topology (Prove this). It is not open in the weak topology. It suffices to prove that
A has empty interior in the weak topology. We do this by contradiction. Assume that the weak interior of A
is nonempty. Let x0 ∈ int A and let U ⊂ A be a neighborhood (in the weak topology) of
x0. As in Example 8.15, x = x0 + t0 y0 ∈ U ∩ S ⊂ A ∩ S, where S is as defined in
Example 8.15, a contradiction.
Theorem 8.19 Let K ⊂ E be convex and closed (in the strong topology). Then, K
is closed in the weak topology (i.e., if K is strongly closed and convex, then it is
weakly closed).
Proof. The proof is another application of the Second Geometric Form of the Hahn
Banach Theorem. It suffices to prove that K^c (the complement of K) is open in
the weak topology. Let x0 ∈ K^c. Then, x0 ∉ K. Since {x0} is convex, nonempty
and compact and K is nonempty, convex and closed, it follows from the Hahn Banach
Theorem (second geometric form) that there exist f ∈ E∗, α ∈ ℝ such that
for all x ∈ K. Set
U = {x ∈ E : ⟨f, x⟩ < α}.
Then x0 ∈ U, U ∩ K = ∅ so that U ⊂ K^c. Moreover, U is open in the weak
topology. Hence, K^c is open in the weak topology and so K is weakly closed.
We now turn our attention to the weak star topology.
φx : E ∗ → IR
defined by
φx(f) = ⟨f, x⟩.
As x ranges over E we obtain a family of maps {φx }x∈E of E ∗ into IR.
The weak star topology is the smallest topology on E ∗ for which all the maps φx
are continuous.
Notation. We shall write ω–topology for weak topology and ω ∗ (or weak∗ )–topology
for weak star topology.
Proof. Let E∗ be endowed with the weak star topology and let f1, f2 be arbitrary
elements of E∗ with f1 ≠ f2. Then, there exists x ∈ E such that ⟨f1, x⟩ ≠ ⟨f2, x⟩.
Without loss of generality, let ⟨f1, x⟩ < ⟨f2, x⟩. Introduce α ∈ ℝ such that
U1 = {f ∈ E∗ : ⟨f, x⟩ < α} = φx⁻¹((−∞, α)).
Proposition 8.21 Let f0 ∈ E∗. We obtain a neighborhood base at f0 for the ω∗-topology
by considering sets of the form:
V = {f ∈ E∗ : |⟨f − f0, xi⟩| < ε for all i ∈ J},
where J is finite, xi ∈ E, ε > 0.
Proof. Exercise (Use an argument similar to that used in the proof of Proposition
Notation. Given a sequence {fn} in E∗, we shall denote by fn →^{ω∗} f the
convergence of fn to f in the weak star topology.
Proposition 8.22 Let {fn} be a sequence in E∗. Then,
(a) fn →^{ω∗} f iff ⟨fn, x⟩ → ⟨f, x⟩ for all x ∈ E;
(b) fn ⇀ f ⇒ fn →^{ω∗} f;
(c) fn → f in E∗ ⇒ fn →^{ω∗} f in E∗;
(d) fn →^{ω∗} f ⇒ {fn} is bounded and ||f|| ≤ lim inf ||fn||;
(e) fn →^{ω∗} f, xn → x ⇒ ⟨fn, xn⟩ → ⟨f, x⟩ in ℝ.
Proof. Exercise (Follow the argument similar to that used in the proof of Proposi-
tion 8.11).
We have already seen that given any normed linear space E, the space E ∗ of all
bounded linear functionals on E is a Banach space. As a Banach space, E ∗ has its
own dual space which we denote by (E ∗ )∗ or simply by E ∗∗ and often refer to as the
second conjugate space of E or the double dual or bidual of E.
(ii) ||Jx|| = ||x|| for all x ∈ E; i.e., J is an isometry. In fact, for each f ∈ E ∗ ,
(Explain why the last equality follows). In general, the map J need not be onto.
Consequently, we always identify E as a subspace of E ∗∗ . Since an isometry is always
injective, it follows that J is an isomorphism onto J(E) ⊂ E ∗∗ . The mapping J
defined above is called the canonical map (or canonical embedding) of E into E ∗∗ ,
and the space E is said to be embedded in E ∗∗ . This leads to the following definition.
Definition 9.1 Let E be a normed linear space and let J be the canonical embed-
ding of E into E ∗∗ . If J is onto, then E is called reflexive. Thus, a reflexive Banach
space is one in which the canonical embedding is onto.
Proposition 9.2 Let E be a finite dimensional normed linear space. Then the
strong, weak and weak star topologies coincide
Proof. The canonical map J is surjective in this case (since dim E = dim E ∗ =
dim E ∗∗ ). (See also Exercises 9.1, Problem 1).
We now prove one of the main theorems of this chapter. We shall use the following
notation: for an arbitrary normed linear space X, we shall use BX to denote the
closed unit ball in X, i.e., BX = {x ∈ X : ‖x‖ ≤ 1}. In particular, E is a reflexive
normed linear space if and only if J(BE) = BE∗∗.
Lemma 9.3 (Goldstine’s Theorem) Let E be a Banach space. Then, J(BE) is dense
in (BE∗∗, ω∗).
Lemma 9.4 Let X and Y be Banach spaces and let T : (X, s) → (Y, s) be a linear
continuous map. Then,
T : (X, ω) → (Y, ω)
is continuous, and conversely, where s denotes the strong topology and ω denotes the weak
topology.
The following theorem is an important characterization of reflexive spaces. It is, in
some sense, the infinite-dimensional analogue of the Heine Borel Theorem.
[Diagram: J⁻¹ : (E∗∗, ω∗) → (E, ω), f : (E, ω) → ℝ, and the composite f ◦ J⁻¹ : (E∗∗, ω∗) → ℝ]
f ◦ J −1 : (E ∗∗ , ω ∗ ) → IR
the last equality follows from equation (9.1), so that f ◦ J⁻¹ is continuous. (Justify.
See Exercises 9.1, Problem 2).
(⇐) Suppose now BE is weakly compact. We prove E is reflexive. It suffices to prove
J(BE) = BE∗∗. Since J : (E, s) → (E∗∗, s) is an isometry, it is continuous, where
s denotes the strong topology. By Lemma 9.4, this implies J : (E, ω) → (E∗∗, ω) is
continuous. This, in turn, implies J : (E, ω) → (E∗∗, ω∗) is continuous (Prove this.
See Exercises 9.1, Problem 3). Hence, J(BE) is ω∗-compact in E∗∗. Since a compact
set in a Hausdorff topological space is always closed (Prove this. See Exercises 9.1,
Problem 4), it follows that J(BE) is ω∗-closed in E∗∗. But by Lemma 9.3, J(BE) is
dense in BE∗∗. Hence, in the ω∗-topology of E∗∗, we have J(BE) = \overline{J(BE)} = BE∗∗.
Hence, E is reflexive.
We now examine some elementary properties of reflexive spaces.
Proposition 9.6 Let E be a reflexive Banach space and let K be a closed subspace
of E. Then K is reflexive.
Proof. Exercise.
Corollary 9.7 Let E be a Banach space. Then, E is reflexive if and only if E ∗ is.
Proof. (⇒) Let E be reflexive. We prove E ∗ is reflexive. By Kakutani’s Theorem,
it suffices to prove that BE ∗ is compact in the weak topology of E ∗ . By Banach
Alaoglu Theorem, BE ∗ is compact in the weak* topology of E ∗ . Since E is reflexive,
the weak* topology of E ∗ = the weak topology of E ∗ . (Verify). Hence, BE ∗ is
compact in the weak topology of E ∗ . By Kakutani’s theorem, E ∗ is reflexive.
(⇐) Let E ∗ be reflexive. We prove E is reflexive. By the first part we know that
E ∗∗ is reflexive. But J(E) is a closed subspace of E ∗∗ and hence, by Proposition 9.6,
J(E) is reflexive. Since J is an isometry, E is reflexive (see Exercises 9.1, Problem 5).
This completes the proof.
Corollary 9.8 Let E be a reflexive Banach space, K a closed bounded convex
nonempty subset of E. Then K is weakly compact.
Proof. E reflexive ⇒ (by Kakutani’s Theorem) BE is weakly compact. K bounded
⇒ K ⊂ tBE for some t > 0. Furthermore, tBE is weakly compact (Why?). K
closed and convex ⇒ (by Theorem 8.19) K is weakly closed. Hence, K is a weakly
closed subset of a weakly compact set, and so K is weakly compact.
We now give the proofs of Lemma 9.3 and Lemma 9.4. For a proof of Lemma 9.3,
we need the following lemma.
Lemma 9.9 (Helly’s Theorem) Let E be a Banach space, {fi}_{i=1}^{n} ⊂ E∗ and {αi}_{i=1}^{n} ⊂
ℝ fixed. Then the following properties are equivalent:
|Σ_{i=1}^{n} βi αi| ≤ ‖Σ_{i=1}^{n} βi fi‖ ∀ β1, β2, . . . , βn ∈ ℝ.
and hence
|Σ_{i=1}^{n} βi αi| = |−Σ_{i=1}^{n} βi⟨fi, xε⟩ + Σ_{i=1}^{n} βi αi + Σ_{i=1}^{n} βi⟨fi, xε⟩|
≤ Sε + |Σ_{i=1}^{n} βi⟨fi, xε⟩|
≤ Sε + ‖Σ_{i=1}^{n} βi fi‖ · ‖xε‖
≤ Sε + ‖Σ_{i=1}^{n} βi fi‖ (since ‖xε‖ ≤ 1), ∀ ε > 0.
We establish this by contradiction.
Put α⃗ = (α1, . . . , αn) ∈ ℝⁿ and consider the map
φ : E → ℝⁿ
defined by φ(x) = (⟨f1, x⟩, . . . , ⟨fn, x⟩). It suffices to prove α⃗ ∈ \overline{φ(BE)}. Suppose α⃗ ∉
\overline{φ(BE)}. Clearly, {α⃗}, as a singleton in ℝⁿ, is convex, nonempty and compact, and \overline{φ(BE)}
is convex, nonempty and closed. By the Hahn-Banach theorem (second geometric form)
we can separate {α⃗} and \overline{φ(BE)} in the strict sense; i.e., there exist β⃗ = (β1, . . . , βn) ∈
ℝⁿ and δ ∈ ℝ such that
Observe that
Consequently,
|⟨Σ_{i=1}^{n} βi fi, x⟩| ≤ δ < Σ_{i=1}^{n} βi αi ∀ x ∈ BE,
|⟨Jx − ψ, fi⟩| < ε,
since ⟨Jx, fi⟩ = ⟨fi, x⟩. (At this point think of the Condition (ii) of Helly’s lemma.
This may help us to find xε with ||xε|| ≤ 1 which satisfies (9.2). So, we try to
construct Condition (ii) of the lemma which, by Lemma 9.9, is equivalent to condition
Put αi = ⟨ψ, fi⟩ and note that for arbitrary β1, β2, ..., βn ∈ ℝ, we have
|Σ_{i=1}^{n} βi αi| = |⟨ψ, Σ_{i=1}^{n} βi fi⟩| ≤ ||ψ|| · ||Σ_{i=1}^{n} βi fi|| ≤ ||Σ_{i=1}^{n} βi fi||,
|⟨fi, xε⟩ − αi| < ε
Proof of Lemma 9.4. (⇒) Suppose T : (X, s) → (Y, s) is linear and continuous.
We want to prove T : (X, ω) → (Y, ω) is continuous. By Proposition 8.5, it suffices
to verify that for each f ∈ Y∗, the map f ◦ T : (X, ω) → ℝ is continuous (see Fig. 9.2).
Fig. 9.2 (commutative diagram: T : (X, s) → (Y, s), f, and the composite f ◦ T)
We show first that the map f ◦ T : (X, s) → IR is linear and continuous. Linearity
(f ◦ T )(xn ) → (f ◦ T )(y),
T : (X, ω) → (Y, ω)
is continuous.
(⇐) Suppose now that T : (X, ω) → (Y, ω) is linear and continuous. Then,
G(T ) = {(x, T x) : x ∈ X}
is a weakly closed subset of X × Y . Consequently, G(T ) is closed in the strong
topology of X × Y (since weakly closed sets are also strongly closed). Hence, by the
Closed Graph Theorem, T : (X, s) → (Y, s) is continuous. This completes the proof.
2. Give the concluding argument why the map f ◦ J −1 defined in the first part
of the proof of Theorem 9.5 is continuous.
3. Let ω, ω ∗ denote the weak and weak∗ topology respectively on a Banach space.
Prove that if
J : (E, ω) → (E ∗∗ , ω)
is continuous, then
J : (E, ω) → (E ∗∗ , ω ∗ )
is also continuous. (Here J is the canonical map of E into E ∗∗ ).
T : (X, s) → (Y, s)
is also continuous.
(Hint: Use Lemma 9.4).
Recall (Corollary 9.8) that every nonempty closed convex and bounded subset of a
reflexive Banach space is weakly compact. The following result is, in fact, true (see
e.g., [9]).
An immediate consequence of this proposition is that the unit ball (open or closed) in
a reflexive Banach space is always weakly compact. Moreover, every (norm) bounded
sequence in a reflexive Banach space has a weakly convergent subsequence. These
results are embodied in the following very important theorem:
S_r(x0) = {x ∈ E : ‖x − x0‖ = r}.
B_r(x0) = {x ∈ E : ‖x − x0‖ < r}.
A Banach space E is called uniformly convex if for any ε ∈ (0, 2], there exists a
δ = δ(ε) > 0 such that if x, y ∈ E with ‖x‖ ≤ 1, ‖y‖ ≤ 1 and ‖x − y‖ ≥ ε, then
‖½(x + y)‖ ≤ 1 − δ.
Roughly speaking, E is uniformly convex if and only if, for any two points x, y in
the closed unit ball of E centred at the origin (i.e., ‖x‖ ≤ 1, ‖y‖ ≤ 1) which are
at least a distance ε apart (i.e., ‖x − y‖ ≥ ε), the midpoint of x and y
lies inside the ball at a distance of at least δ from the boundary of the ball (see
e.g., Fig. 9.3).
Fig. 9.3 (points X and Y in the unit ball and their midpoint ½(X + Y))
Remark 9.12 Since x, y ∈ B1 (0), the maximum value of ε > 0 is 2. Hence, the
restriction ε ∈ (0, 2]. Geometrically, a Banach space E is uniformly convex if and
only if the unit ball centred at the origin is “uniformly round”. To fix ideas we
consider the following example:
Example 9.13 Let E = IR2 , the plane. Endow IR2 with the following three norms.
For x = (x1 , x2 ) ∈ IR2 ,
Fig. 9.4
It then follows that ℝ² with the norm ‖·‖₂ is uniformly convex whereas the same ℝ²
with either the norm ‖·‖₁ or the norm ‖·‖∞ is not uniformly convex. These assertions
can be established rigorously. Consequently, uniform convexity of a Banach
space E is actually a property of the norm on E. However, we shall follow the abuse
of language (or convention) and call a Banach space uniformly convex whenever the
norm on it is a uniformly convex one.
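The contrast between the three norms on the plane can be probed numerically. The following is a minimal sketch, not part of the original text: it samples the unit sphere of (ℝ², ‖·‖_p) and records the smallest value of 1 − ‖½(x + y)‖_p over sampled pairs with ‖x − y‖_p ≥ ε. The function names and the sample size are illustrative assumptions, and a finite sample can only suggest, not prove, the assertions.

```python
import math

def p_norm(v, p):
    """Norm of a vector in R^2; p = math.inf gives the max norm."""
    if p == math.inf:
        return max(abs(c) for c in v)
    return sum(abs(c) ** p for c in v) ** (1.0 / p)

def unit_sphere_points(p, count=360):
    """Sample the unit sphere of (R^2, ||.||_p) by normalising directions."""
    points = []
    for k in range(count):
        theta = 2.0 * math.pi * k / count
        direction = (math.cos(theta), math.sin(theta))
        scale = p_norm(direction, p)
        points.append((direction[0] / scale, direction[1] / scale))
    return points

def estimated_modulus(p, eps, count=360):
    """Smallest sampled value of 1 - ||(x + y)/2||_p with ||x - y||_p >= eps."""
    pts = unit_sphere_points(p, count)
    best = 1.0
    for x in pts:
        for y in pts:
            diff = (x[0] - y[0], x[1] - y[1])
            if p_norm(diff, p) >= eps:
                mid = (0.5 * (x[0] + y[0]), 0.5 * (x[1] + y[1]))
                best = min(best, 1.0 - p_norm(mid, p))
    return best

if __name__ == "__main__":
    for p in (1, 2, math.inf):
        print(p, estimated_modulus(p, 1.0))
```

For ε = 1 the estimate for p = 2 should come out close to 1 − √(1 − ε²/4), while for p = 1 and p = ∞ it should be essentially zero, reflecting the flat pieces of those unit spheres.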
The following theorem shows that a large class of Banach spaces is uniformly convex.
Theorem 9.14 Every inner product space H is uniformly convex.
Proof. Recall the following identity (the parallelogram law, Proposition 6.11) which
is valid in any inner product space. For each x, y ∈ H,
‖x + y‖² + ‖x − y‖² = 2(‖x‖² + ‖y‖²). (9.3)
Let ε ∈ (0, 2] be given and let x, y ∈ H be such that ‖x‖ ≤ 1, ‖y‖ ≤ 1 and
‖x − y‖ ≥ ε. Then equation (9.3) yields:
\[
\Bigl\|\tfrac12(x+y)\Bigr\|^2 \le \tfrac14\bigl[2(2) - \|x-y\|^2\bigr] = 1 - \Bigl\|\tfrac12(x-y)\Bigr\|^2 \le 1 - \tfrac14\varepsilon^2,
\]
so that
\[
\Bigl\|\tfrac12(x+y)\Bigr\| \le \sqrt{1 - \tfrac14\varepsilon^2}.
\]
We can now choose $\delta = 1 - \sqrt{1 - \tfrac14\varepsilon^2} > 0$ to complete the proof.
More generally, we have the following theorem.
Theorem 9.15 ℓ_p (1 < p < ∞) is uniformly convex.
For a proof of this theorem we need the following lemma.
Lemma 9.16 (R.C. James) Let E = ℓ_p. Then for p, q > 1 such that 1/p + 1/q = 1 and
for each pair x, y ∈ E, the following inequalities hold:
\[
\text{(i)}\quad \Bigl\|\tfrac12(x+y)\Bigr\|^q + \Bigl\|\tfrac12(x-y)\Bigr\|^q \le \bigl[2^{-1}\bigl(\|x\|^p + \|y\|^p\bigr)\bigr]^{q-1}, \quad \text{for } 1 < p \le 2,
\]
\[
\text{(ii)}\quad \|x+y\|^p + \|x-y\|^p \le 2^{p-1}\bigl(\|x\|^p + \|y\|^p\bigr), \quad \text{for } 2 \le p < \infty.
\]
Proof of Theorem 9.15. Given ε ∈ (0, 2], let x, y ∈ ℓ_p be such that ‖x‖ ≤ 1, ‖y‖ ≤ 1
and ‖x − y‖ ≥ ε. Two cases arise:
Case 2: 2 ≤ p < ∞. The result follows as in Case 1 by using Lemma 9.16 (ii).
Although Theorems 9.14 and 9.15 provide large classes of spaces which are uniformly
convex, some well–known spaces are not uniformly convex.
Example 9.17 The space ℓ_1 is not uniformly convex. To see this, take ε = 1 and
choose x = (1, 0, 0, 0, ...), y = (0, −1, 0, 0, ...). Clearly x, y ∈ ℓ_1 and ‖x‖ = 1 = ‖y‖,
‖x − y‖ = 2 > ε (all norms taken in ℓ_1). However, ‖½(x + y)‖ = 1, so that ‖½(x + y)‖ ≤ 1 − δ
cannot hold for any δ > 0, showing that ℓ_1 is not uniformly convex.
Example 9.18 The space ℓ_∞ is not uniformly convex. Consider u = (1, 1, 0, 0, ...),
v = (−1, 1, 0, 0, ...) and take ε = 1. Then
‖u‖ = ‖v‖ = 1 and ‖u − v‖ = 2 > ε (all norms taken in ℓ_∞). However, ‖½(u + v)‖ = 1, and so ℓ_∞ is
not uniformly convex.
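The arithmetic in Examples 9.17 and 9.18 can be checked directly. The sketch below is illustrative only: it truncates the sequences to their first four coordinates (all later coordinates are zero, so the norms are unchanged), and the helper names are assumptions introduced here.

```python
def norm_l1(v):
    """l_1 norm of a finitely supported sequence."""
    return sum(abs(c) for c in v)

def norm_sup(v):
    """l_infinity norm of a finitely supported sequence."""
    return max(abs(c) for c in v)

def midpoint(x, y):
    return [0.5 * (a + b) for a, b in zip(x, y)]

def difference(x, y):
    return [a - b for a, b in zip(x, y)]

# Example 9.17: x and y in l_1
x = [1.0, 0.0, 0.0, 0.0]
y = [0.0, -1.0, 0.0, 0.0]

# Example 9.18: u and v in l_infinity
u = [1.0, 1.0, 0.0, 0.0]
v = [-1.0, 1.0, 0.0, 0.0]

# Each report is (||first||, ||second||, ||difference||, ||midpoint||).
l1_report = (norm_l1(x), norm_l1(y),
             norm_l1(difference(x, y)), norm_l1(midpoint(x, y)))
sup_report = (norm_sup(u), norm_sup(v),
              norm_sup(difference(u, v)), norm_sup(midpoint(u, v)))

if __name__ == "__main__":
    print(l1_report)
    print(sup_report)
```

In both cases the two points have norm 1 and are at distance 2, yet the midpoint still has norm 1, so no δ > 0 can work for ε = 1.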
Example 9.19 The space C[a, b] of all real–valued continuous functions on the
compact interval [a, b] endowed with the “sup norm” is not uniformly convex.
To see this, choose two functions f and g as follows:
f (t) = 1 for all t ∈ [a, b].
g(t) = for each t ∈ [a, b].
Take ε = 1/2. Clearly, f, g ∈ C[a, b], ‖f‖ = ‖g‖ = 1 and ‖f − g‖ = 1 > ε. Also
‖½(f + g)‖ = 1 and so C[a, b] is not uniformly convex.
Proposition 9.20 Let X be a uniformly convex Banach space. Then for any d > 0,
ε > 0 and arbitrary vectors x, y ∈ X with ‖x‖ ≤ d, ‖y‖ ≤ d, ‖x − y‖ ≥ ε there
exists a δ > 0 such that
\[
\Bigl\|\tfrac12(x+y)\Bigr\| \le \Bigl[1 - \delta\Bigl(\frac{\varepsilon}{d}\Bigr)\Bigr] d.
\]
Proof. Let ε > 0 be given and let z_1 = x/d, z_2 = y/d, and suppose we set ε̄ = ε/d.
Obviously ε̄ > 0. Moreover, ‖z_1‖ ≤ 1, ‖z_2‖ ≤ 1 and ‖z_1 − z_2‖ = (1/d)‖x − y‖ ≥ ε/d = ε̄.
Now, by uniform convexity we have
\[
\Bigl\|\tfrac12(z_1+z_2)\Bigr\| \le 1 - \delta(\bar\varepsilon),
\]
i.e.,
\[
\Bigl\|\frac{1}{2d}(x+y)\Bigr\| \le 1 - \delta\Bigl(\frac{\varepsilon}{d}\Bigr),
\]
which implies
\[
\Bigl\|\tfrac12(x+y)\Bigr\| \le \Bigl[1 - \delta\Bigl(\frac{\varepsilon}{d}\Bigr)\Bigr] d.
\]
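As a sanity check of the scaled inequality in Proposition 9.20, one can test it in a Euclidean space, where the proof of Theorem 9.14 gives the explicit choice δ(ε) = 1 − √(1 − ε²/4). The sketch below is an illustration under that assumption only; the dimension, the random sampling and the function names are hypothetical choices made here, and ε is taken to be the actual distance ‖x − y‖ of each sampled pair.

```python
import math
import random

def euclid_norm(v):
    return math.sqrt(sum(c * c for c in v))

def hilbert_delta(eps):
    """delta(eps) = 1 - sqrt(1 - eps^2 / 4), valid for 0 < eps <= 2."""
    return 1.0 - math.sqrt(max(0.0, 1.0 - 0.25 * eps * eps))

def random_vector_in_ball(rng, dim, radius):
    """Random vector with Euclidean norm at most `radius`."""
    v = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    scale = rng.uniform(0.0, radius) / max(euclid_norm(v), 1e-12)
    return [scale * c for c in v]

def check_scaled_inequality(d, trials=2000, dim=3, seed=0):
    """True if ||(x + y)/2|| <= (1 - delta(eps/d)) * d held in every trial."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = random_vector_in_ball(rng, dim, d)
        y = random_vector_in_ball(rng, dim, d)
        eps = euclid_norm([a - b for a, b in zip(x, y)])
        mid = euclid_norm([0.5 * (a + b) for a, b in zip(x, y)])
        bound = (1.0 - hilbert_delta(eps / d)) * d
        if mid > bound + 1e-9:  # small tolerance for rounding
            return False
    return True

if __name__ == "__main__":
    print(check_scaled_inequality(3.5))
```

The check mirrors the proof: dividing x and y by d moves them into the unit ball, where the unscaled definition applies with ε/d in place of ε.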
Fix this δ > 0 (corresponding to the given ε > 0). Since ‖ξ‖ = 1, it follows that
ξ ≠ 0. Hence, by Theorem 3.10, we can choose f ∈ E∗ such that ‖f‖ = 1 and
ξ ∈ Jx0 + ε B_{E∗∗}. (9.6)
(U ∩ V) ∩ J(B_E) ≠ ∅.
|⟨f, x̂⟩ − ⟨ξ, f⟩| < δ/2 (since J(x̂) ∈ U and ⟨J(x̂), f⟩ = ⟨f, x̂⟩).
so that
and consequently, using inequality (9.5), we obtain the following estimate:
\[
\Bigl\|\tfrac12(x_0+\hat{x})\Bigr\| \ge \langle \xi, f\rangle - \delta/2 > 1 - \delta/2 - \delta/2 = 1 - \delta.
\]
We now have the following situation: ‖x0‖ ≤ 1, ‖x̂‖ ≤ 1 and ‖½(x0 + x̂)‖ > 1 − δ.
Then by uniform convexity (contrapositive) we have: ‖x0 − x̂‖ ≤ ε. But then,
‖x0 − x̂‖ > ε, since J(x̂) ∈ V and J(x0) ∉ V, which implies
(The last inequality following from the fact that V = (B(Jx0, ε))^c). Contradiction.
The proof is complete.
Theorem 9.22 Let E be a normed linear space such that E ∗ is separable. Then E
is separable.
≤ |f_n(x_n)| = |f_n(x_n) − f∗(x_n)|
= |(f_n − f∗)x_n| ≤ ‖f_n − f∗‖ · ‖x_n‖ = ‖f_n − f∗‖,
so that ‖f_n − f∗‖ ≥ 1/2. Since f∗ ∈ E∗, the last inequality contradicts the fact that
{f_n} is dense in E∗. Hence F = E. So, E is separable.
Remark 9.23 The converse of Theorem 9.22 is not true. There are separable spaces
E with duals E∗ which are not separable (e.g., ℓ_1 is separable but ℓ_1∗ = ℓ_∞ is not).
Proof. (Trick of proof: we shall use the hypothesis that E is reflexive and separable
to obtain that E ∗∗ is separable; then we employ Theorem 9.22 to conclude that E ∗
is separable). Now, E is separable ⇒ E has a countable dense subset. Call it
{xk }. Let x∗∗ ∈ E ∗∗ be arbitrary. Since E is reflexive, there exists x ∈ E such that
Jx = x∗∗. For any ε > 0, by the density of {x_k} in E, there exists x_k such that
‖x_k − x‖ < ε. Since J is an isometry, we obtain,
Remark 9.25 Theorem 9.24 gives us a criterion for showing that a given space is
not reflexive. For, if E is a normed linear space such that (i) E is separable, and
(ii) E ∗ is not separable, then E is not reflexive. Thus, to show that a normed linear
space E is not reflexive, it suffices to verify that (i) and (ii) are satisfied.
We now prove yet another theorem which will also be useful in showing that certain
spaces are not reflexive.
Theorem 9.26 Let E be a reflexive Banach space. Then for each f∗ ∈ E∗ there
exists some x0 ∈ E, with ‖x0‖ = 1 such that
f∗(x0) = ‖f∗‖.
Remark 9.27 Theorem 9.26 asserts that all bounded linear functionals on a reflexive
Banach space attain their maximum absolute value on the closed unit ball. In
general, the point x0 ∈ E in Theorem 9.26 need not be unique. However, if E is, say,
uniformly convex, then each f∗ ≠ 0 attains its maximum absolute value at a unique
point on the unit ball.
Remark 9.28 The converse of Theorem 9.26 is also true, i.e., if E is a Banach
space such that whenever f∗ ∈ E∗, there exists some x0 ∈ E with ‖x0‖ = 1 such
that f∗(x0) = ‖f∗‖, then E is reflexive.
Example 9.30 c0 is not reflexive.
|f∗(x)| < ‖f∗‖
for all x ∈ c0 such that ‖x‖_{c0} ≤ 1. We now proceed to do this. For each
x = {α_k} ∈ c0, define
\[
f(x) = \sum_{k=1}^{\infty} \frac{\alpha_k}{k!}.
\]
Then f is a linear functional on c0. Furthermore, since |α_k| ≤ ‖{α_k}‖,
k = 1, 2, 3, ..., it follows that if x ∈ c0 with ‖x‖ ≤ 1 then
\[
|f(x)| = \Bigl|\sum_{k=1}^{\infty} \frac{\alpha_k}{k!}\Bigr| \le \sum_{k=1}^{\infty} \frac{1}{k!},
\]
so that f is a bounded linear functional on c0 with $\|f\| \le \sum_{k=1}^{\infty} \frac{1}{k!}$. In fact, $\|f\| = \sum_{k=1}^{\infty} \frac{1}{k!}$, since for each positive integer n, if
x^n = {α_k^n} ∈ c0 is such that α_k^n = 1, k = 1, 2, ..., n and α_k^n = 0, k > n, then
‖x^n‖ = 1 and $f(x^n) = \sum_{k=1}^{n} \frac{1}{k!}$. Hence,
\[
\|f\| = \sup_{\|x\|=1} |f(x)| \ge \sup_{n} f(x^n) \ge \sup_{n} \sum_{k=1}^{n} \frac{1}{k!} = \sum_{k=1}^{\infty} \frac{1}{k!},
\]
so that $\|f\| = \sum_{k=1}^{\infty} \frac{1}{k!}$, as claimed.
However, there exists no x ∈ c0 with ‖x‖ ≤ 1 for which |f(x)| = ‖f‖, as can be seen
from the following argument. Suppose x = {α_k} ∈ c0 is such that ‖x‖ ≤ 1. Since
α_k → 0, there exists an integer N > 0 such that |α_N| < 1. Hence,
\[
|f(x)| = \Bigl|\sum_{k=1}^{\infty} \frac{\alpha_k}{k!}\Bigr| \le \sum_{k \ne N} \frac{|\alpha_k|}{k!} + \frac{|\alpha_N|}{N!} < \sum_{k \ne N} \frac{1}{k!} + \frac{1}{N!} = \sum_{k=1}^{\infty} \frac{1}{k!} = \|f\|.
\]
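The gap between the supremum and any attained value in Example 9.30 can be seen numerically. The sketch below is illustrative and uses the standard fact that the series of 1/k! over k ≥ 1 sums to e − 1; the function name `partial_sum` is an assumption introduced here. It evaluates the functional on the truncated sequences with n leading ones and shows that the values approach the norm from below without reaching it.

```python
import math

def partial_sum(n):
    """Value sum_{k=1}^{n} 1/k!, i.e. the functional applied to (1, ..., 1, 0, 0, ...) with n ones."""
    total = 0.0
    term = 1.0
    for k in range(1, n + 1):
        term /= k          # term is now 1/k!
        total += term
    return total

# The full series sum_{k>=1} 1/k! equals e - 1.
norm_f = math.e - 1.0

# Distance between the norm and the value attained on the truncated sequences.
gaps = [norm_f - partial_sum(n) for n in (1, 2, 5, 10)]

if __name__ == "__main__":
    for n, gap in zip((1, 2, 5, 10), gaps):
        print(n, gap)
```

The gaps shrink rapidly but stay positive, which is consistent with the norm being a supremum that is not attained on the unit ball of c0.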
1. Prove that a subset of a separable space is separable.
2. Prove that the set F defined in the proof of Theorem 9.22 is: (i) separable;
(ii) a closed subspace of E.
7. Prove that, with our usual notations, J(BE ) is always strongly closed in E ∗∗ ,
where E is an arbitrary Banach space and J is the canonical embedding.
Let E denote the set of almost periodic functions. With the usual operations of
addition and scalar multiplication, E is a vector space over ℂ. If f, g ∈ E are
arbitrary, it can be shown that
\[
\langle f, g\rangle = \lim_{T\to\infty} \frac{1}{T}\int_0^T f(s)\,\overline{g(s)}\,ds
\]
Fig. 10.1
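\[
y_k := \sum_{i=1}^{k+1} \alpha_i x_i = \sum_{i=1}^{k} \alpha_i x_i + \alpha_{k+1} x_{k+1}.
\]
Since $\sum_{i=1}^{k+1} \alpha_i = 1$, we have $\alpha_{k+1} = 1 - \sum_{i=1}^{k} \alpha_i$.
Case 2. If α_{k+1} < 1, then $\sum_{i=1}^{k} \alpha_i = 1 - \alpha_{k+1} > 0$. Hence,
\[
\begin{aligned}
y_k &= (1-\alpha_{k+1})\cdot\frac{1}{(1-\alpha_{k+1})}\sum_{i=1}^{k} \alpha_i x_i + \alpha_{k+1} x_{k+1} \\
&= (1-\alpha_{k+1})\sum_{i=1}^{k} \frac{\alpha_i}{(1-\alpha_{k+1})}\, x_i + \alpha_{k+1} x_{k+1}.
\end{aligned}
\]
Observe that $\bigl(\frac{\alpha_1}{(1-\alpha_{k+1})}, \dots, \frac{\alpha_k}{(1-\alpha_{k+1})}\bigr) \in \Lambda_k$ (why?). Thus,
\[
x := \sum_{i=1}^{k} \frac{\alpha_i}{(1-\alpha_{k+1})}\, x_i \in C \quad \text{(by induction hypothesis).}
\]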
Fig. 10.2
Fig. 10.3
D_{f,α} := {x ∈ X : ⟨x, f⟩ ≥ α}.
D_{f,α} := {x ∈ X : ⟨x, f⟩ > α}.
D_{f,α} := {x ∈ X : ⟨x, f⟩ ≤ α}.
D_{f,α} := {x ∈ X : ⟨x, f⟩ < α}.
Proof. Exercise.
(ii) f is concave if
(a) D is convex, and
(b) For each t ∈ [0, 1] and for each x1 , x2 ∈ D, we have
f (tx1 + (1 − t)x2 ) ≥ tf (x1 ) + (1 − t)f (x2 ).
Remark 10.10 f is convex if and only if (−f) is concave, and f is concave if and
only if (−f) is convex. (See Fig. 10.4).
Fig. 10.4
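The defining inequality and the sign flip in Remark 10.10 can be tested on sample points. The following is a minimal sketch for real functions of one variable; the function names, sample points and weights are illustrative assumptions, and a check on finitely many points can refute convexity but cannot prove it.

```python
def is_convex_on_samples(f, points, weights, tol=1e-12):
    """Check f(t*x1 + (1-t)*x2) <= t*f(x1) + (1-t)*f(x2) on the given samples."""
    for x1 in points:
        for x2 in points:
            for t in weights:
                lhs = f(t * x1 + (1.0 - t) * x2)
                rhs = t * f(x1) + (1.0 - t) * f(x2)
                if lhs > rhs + tol:
                    return False
    return True

def is_concave_on_samples(f, points, weights, tol=1e-12):
    """f is concave exactly when -f is convex, so reuse the convexity check."""
    return is_convex_on_samples(lambda x: -f(x), points, weights, tol)

def square(x):
    return x * x

points = [-2.0, -1.0, -0.5, 0.0, 0.5, 1.0, 2.0]
weights = [0.0, 0.25, 0.5, 0.75, 1.0]

if __name__ == "__main__":
    print(is_convex_on_samples(square, points, weights))
    print(is_concave_on_samples(square, points, weights))
```

Implementing the concavity test by negating the function keeps a single inequality in the code, which is exactly the content of the remark.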
[C] Epigraph.
The epigraph of f is the set defined by
Fig. 10.5
Fig. 10.6
[D] Section of f .
Let α ∈ R. We have the following definitions.
Sf,α := {x ∈ X : f (x) ≤ α}
= {x ∈ D(f ) : f (x) ≤ α}.
Fig. 10.7
f is strictly concave if (−f ) is strictly convex.
Fig. 10.8
f(x) = ⟨x, g∗⟩ ∈ R.
f (x) = a(x, x)
is convex.
Proof. Let x1 , x2 ∈ X, t ∈ [0, 1]. Put
Corollary 10.14 If a is strictly positive, i.e., for each x ≠ 0, a(x, x) > 0, then f
is strictly convex.
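For a symmetric bilinear form a, the convexity of f(x) = a(x, x) can be read off from the identity t f(x1) + (1 − t) f(x2) − f(t x1 + (1 − t) x2) = t(1 − t) a(x1 − x2, x1 − x2), whose right side is nonnegative when a is positive and strictly positive when a is strictly positive, x1 ≠ x2 and 0 < t < 1. The sketch below checks this identity numerically in ℝ². The symmetry of a, the particular matrix and the function names are assumptions made for the illustration and are not taken from the text.

```python
# Symmetric, positive definite matrix defining a(x, y) = x^T A y (assumed example).
A = [[2.0, 1.0],
     [1.0, 3.0]]

def a(x, y):
    """Bilinear form a(x, y) = sum_{i,j} x_i A_ij y_j on R^2."""
    return sum(x[i] * A[i][j] * y[j] for i in range(2) for j in range(2))

def f(x):
    """Quadratic functional f(x) = a(x, x)."""
    return a(x, x)

def convexity_gap(x1, x2, t):
    """t*f(x1) + (1-t)*f(x2) - f(t*x1 + (1-t)*x2); nonnegative when f is convex."""
    z = [t * p + (1.0 - t) * q for p, q in zip(x1, x2)]
    return t * f(x1) + (1.0 - t) * f(x2) - f(z)

def predicted_gap(x1, x2, t):
    """The closed form t*(1-t)*a(x1 - x2, x1 - x2) for symmetric a."""
    d = [p - q for p, q in zip(x1, x2)]
    return t * (1.0 - t) * a(d, d)

if __name__ == "__main__":
    x1, x2 = [1.0, -2.0], [-0.5, 4.0]
    for t in (0.0, 0.25, 0.5, 1.0):
        print(t, convexity_gap(x1, x2, t), predicted_gap(x1, x2, t))
```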
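For V ∈ U(x̄), take $\inf_{x \in V\setminus\{\bar{x}\}} f(x)$. (See Fig. 10.9(a) and Fig. 10.9(b)).
This ”inf” may not exist (in which case we say it is −∞). As V ranges over
U(x̄), take
\[
\sup_{V \in \mathcal{U}(\bar{x})} \Bigl\{ \inf_{x \in V\setminus\{\bar{x}\}} f(x) \Bigr\}.
\]
It is clear that
\[
\sup_{V \in \mathcal{U}(\bar{x})} \Bigl\{ \inf_{x \in V\setminus\{\bar{x}\}} f(x) \Bigr\} = \liminf_{x \to \bar{x}} f(x).
\]
Fig. 10.9(a) Fig. 10.9(b)
\[
\sup_{V \in \mathcal{U}(\bar{x})} \Bigl\{ \inf_{x \in V\setminus\{\bar{x}\}} f(x) \Bigr\} = \liminf_{x \to \bar{x}} f(x) \ge f(\bar{x}), \quad\text{or}\quad \liminf_{x \to \bar{x}} f(x) \ge f(\bar{x}).
\]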
Remark 10.16 One easily observes that the function sketched in Fig. 10.9(a) is
lower semi-continuous at x̄, whereas the one sketched in Fig. 10.9(b) is not. Observe
also that the function f sketched in Fig. 10.9(a) is not continuous at x̄. This brings
us to the following ...
{xk : k ≥ n} ⊂ V.
Fig. 10.10
Now taking limit as n → ∞ (why can we do this?) we obtain
Fig. 10.11
\[
f(\bar{x}) - \varepsilon < \liminf_{x \to x_0} f(x).
\]
10.5 A fundamental theorem of optimization
We now prove the main theorem of this chapter.
Theorem 10.21 Let X be a real reflexive Banach space and let K be a closed
convex bounded and nonempty subset of X. Let f : X −→ R ∪ {+∞} be lower
semi-continuous and convex. Then, ∃ x̄ ∈ K such that f (x̄) ≤ f (x) ∀ x ∈ K, i.e.,
Fig. 10.12
and this is impossible. Hence m ∈ R.
We now use the definition of ”inf”. Let n ∈ N and take ε_n = 1/n. Then ∃ x_n ∈ K
such that m ≤ f(x_n) < m + 1/n. {x_n} in K implies {x_n} is bounded and so ∃ {x_{n_k}}_{k∈N},
a subsequence of {x_n}, and x̄ ∈ K such that x_{n_k} ⇀ x̄. Since f is weakly lower
semi-continuous, we have
\[
\begin{aligned}
f(\bar{x}) &\le \liminf_{k \to +\infty} f(x_{n_k}) \\
&\le \liminf_{k \to +\infty} \Bigl(m + \frac{1}{n_k}\Bigr) \\
&= \lim_{k \to +\infty} \Bigl(m + \frac{1}{n_k}\Bigr) = m.
\end{aligned}
\]
Thus, $f(\bar{x}) \le m = \inf_{x \in K} f(x)$. But m ≤ f(x̄). Hence,
Fig. 10.13
f (x̄) ≤ f (x).
Now, let x ∈ X∖K. Then f(x) > f(x0). But x0 ∈ K. So, f(x̄) ≤ f(x0). Hence,
f(x) > f(x̄) ∀ x ∈ X∖K, i.e.,
f(x̄) ≤ f(x), ∀ x ∈ X. □
Remark 10.23 If $\lim_{\|x\| \to +\infty} f(x) = +\infty$, we say that f is coercive. This condition
actually implies that
for each A > 0, ∃ B > 0 such that ∀ x ∈ X, ‖x‖ > B ⇒ f(x) > A.
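To make the quantifiers in the coercivity condition concrete, here is a small sketch on the real line. The function f(x) = x² − 3x, the explicit radius B and the sampling range are illustrative assumptions chosen for this example; for this f, a valid B for a given level A > 0 is the larger root of x² − 3x = A.

```python
import math

def f(x):
    """A coercive function on the real line (assumed example): f(x) = x^2 - 3x."""
    return x * x - 3.0 * x

def radius_for_level(level):
    """A radius B such that |x| > B implies f(x) > level, for level > 0."""
    return 0.5 * (3.0 + math.sqrt(9.0 + 4.0 * level))

def verify(level, samples=1000, span=50.0):
    """Sample points with |x| > B on both sides and check f(x) > level."""
    b = radius_for_level(level)
    for i in range(1, samples + 1):
        r = b + span * i / samples  # radii strictly larger than B
        if f(r) <= level or f(-r) <= level:
            return False
    return True

if __name__ == "__main__":
    for level in (1.0, 10.0, 100.0):
        print(level, radius_for_level(level), verify(level))
```

The point of the condition is that the search for a minimizer can be confined to a bounded set, since f exceeds any prescribed level outside a sufficiently large ball.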
Exercises 10.1
2. Let X be a real normed linear space. Prove that f : X −→ R ∪ {+∞} is lower
semi-continuous if and only if ∀ α ∈ R,
f⁻¹((α, +∞]) is open,
or alternatively,
if and only if f⁻¹((−∞, α]) is closed for each α ∈ R.
Problems concerning the existence of fixed points for Lipschitz maps have been of
considerable interest in Nonlinear Operator Theory. The study of nonlinear opera-
tors had its beginning about the start of the twentieth century with investigations
into the existence properties of solutions to certain boundary value problems aris-
ing in ordinary and partial differential equations. The earliest techniques, largely
devised by E. Picard [12], involved the iteration of an integral operator to obtain
solutions to such problems. In 1922, these techniques of Picard were given precise
abstract formulation by S. Banach [2] and R. Caccioppoli [8] in what is now generally
referred to as the Contraction Mapping Principle (Theorem 1.9).
The classical importance of fixed point theory in functional analysis is due to its
usefulness in the theory of ordinary and partial differential equations. The existence
or construction of a solution to a differential equation is often reduced to the ex-
istence or location of a fixed point for an operator defined on a subset of a space
of functions. Fixed point theorems have also been used to determine the existence
of periodic solutions for functional differential equations when solutions are already
known to exist. Apart from this deep involvement in the theory of differential equa-
tions, fixed point theorems have also been extremely useful in such problems as
finding zeros of nonlinear equations and proving surjectivity theorems. Partly as a
consequence of the importance of its applications, fixed point theory has developed
into an area of independent research.
An earlier fixed point theorem, called the Brouwer Fixed Point Theorem, concerns
continuous mappings and has an advantage over Theorem 1.19 in that it applies to a
much larger class of functions. It is, however, in a sense, weaker than Theorem 1.19
because the sequence of iterates of the function at a given point need not converge
to a fixed point. Furthermore, it is confined to finite dimensional spaces.
Theorem 11.2 (Brouwer Fixed Point Theorem (1910)) Let B be the closed unit
ball of any finite dimensional Euclidean space and f : B → B be continuous.
Then f has a fixed point.
The first analytic attempt at generalizing Theorem 11.2 to infinite dimensional
spaces was made by Birkhoff and Kellogg [3]. They were able to show that a continuous
operator defined from a compact, convex subset of C^n[0, 1] into itself has a
fixed point. This result was then applied in solving certain differential and integral
equations. Further generalizations resulted in the following theorem.
Theorem 11.3 (Schauder–Tychonov Theorem) Let K be a compact convex subset
of a Banach space E. If T : K → K is continuous, then T has a fixed point.
Despite the fact that there is no known constructive technique for determining a
fixed point of T , the Schauder–Tychonov fixed point theorem is extremely impor-
tant in the proofs of many existence theorems of differential equations.
to obtain in applications, considerable research has been done concerning weaker
conditions for the domain which guarantee the existence of a fixed point. In this
connection, the following example shows that if domains which are only bounded,
closed and convex are considered even a Lipschitz map with Lipschitz constant L = 1
may fail to have a fixed point.
Then, (i) B is closed, bounded and convex; (ii) T maps B into B; (iii) ‖Tx − Ty‖ =
‖x − y‖ for x, y ∈ B; (iv) T has no fixed point in B.
Remark 11.5 Observe that c0 is not uniformly convex and that the map T in
Example 11.4 is Lipschitzian with Lipschitz constant L = 1. In some situations,
Lipschitz maps with L = 1 may have a fixed point (which may even be unique!).
Definition 11.7 Let S be a nonempty set. Suppose there is a binary relation between
some (not necessarily all) pairs of elements of S and suppose this relation is
expressed symbolically by x ≤ y, x, y ∈ S, such that the following properties hold:
(i) x ≤ x, x ∈ S; (ii) x ≤ y and y ≤ x ⇒ x = y, x, y ∈ S; (iii) x ≤ y and
y ≤ z ⇒ x ≤ z, x, y, z ∈ S. Then, the set S is said to be partially ordered by “≤”.
If, in addition, (S, ≤) is such that any pair of elements of S are comparable (i.e., for
any pair x, y ∈ S either x ≤ y or y ≤ x holds) then (S, ≤) is said to be completely
(or totally) ordered, or is called a chain.
Lemma 11.9 (Zorn’s Lemma) Let (S, ≤) be a partially ordered set and suppose that
every chain in S has an upper (lower) bound. Then S contains at least one maximal
(minimal) element.
The following well known lemmas will also be needed in what follows.
Lemma 11.10 (Principle of Nested Sequences) Let (M, ρ) be any metric space.
Then the following are equivalent:
(i) (M, ρ) is complete;
we have,
\[
\bigcap_{n} F_n \ne \emptyset.
\]
Lemma 11.11 Let (M, ρ) be a metric space. The following are equivalent:
(i) M is compact;
(ii) Every collection of closed nonempty subsets of M with the finite intersection
property has a nonempty intersection.
point of C if sup{kx0 − xk : x ∈ C} < d. Let E be a Banach space and K a bounded
convex subset of E. Let d(K) denote the diameter of K. The set K will be called
non–trivial if d(K) > 0.
The Banach space E is said to have normal structure if every bounded convex subset
of E has normal structure. Geometrically, K is said to have normal structure if for
every non–trivial convex subset C of K there exists a ball centred at a point of C
and whose radius is less than the diameter of C such that the ball contains C, i.e.,
if C is an arbitrary non–trivial convex subset of K then
C ⊆ B(x0 , r)
Proposition 11.13 Every uniformly convex Banach space E has normal structure.
Proposition 11.14 (Brodskii and Mil’man Theorem [5]) Every compact convex
subset K of a Banach space E has normal structure.
Proof. The proof is by contradiction. So, assume K does not have normal structure.
The method of proof is to construct a sequence in K which does not have a conver-
gent subsequence. This, of course, will contradict the compactness of K. Let C be
a convex subset of K which has at least two points. Since K does not have normal
structure, all points of C are diametral. Let d > 0 denote the diameter of C. We
shall construct a sequence {x_i}_{i=1}^∞ in C such that ‖x_i − x_j‖ = d; i, j = 1, 2, ..., i ≠ j.
(Since C contains at least two distinct points x1, x2, then ‖x1 − x2‖ = d.) To do
this we choose x1 ∈ C arbitrary and assume that x2, x3, ..., xn have already been
chosen such that ‖x_i − x_j‖ = d; i, j = 1, 2, ..., n, i ≠ j. By the convexity of C,
\[
\frac{1}{n}(x_1 + x_2 + \dots + x_n) \in C,
\]
and so it is a diametral point (by assumption). By compactness, sup{‖x − y‖ : x, y ∈
K} is achieved. So, there exists x_{n+1} ∈ C such that
\[
\Bigl\| x_{n+1} - \frac{x_1 + x_2 + \dots + x_n}{n} \Bigr\| = d.
\]
\[
\begin{aligned}
d &= \Bigl\| \frac{1}{n}\bigl[(x_{n+1} - x_1) + (x_{n+1} - x_2) + \dots + (x_{n+1} - x_n)\bigr] \Bigr\| \\
&\le \frac{1}{n}\sum_{j=1}^{n} \|x_{n+1} - x_j\| \le \frac{1}{n}\cdot nd = d.
\end{aligned}
\]
Then, (i) K is bounded and convex; (ii) K is clearly a convex subset of itself; (iii)
K does not have normal structure.
The verifications are trivial. Observe that for arbitrary f ∈ K, ‖f‖ = 1. Moreover,
for f1, f2 ∈ K arbitrary, and λ ∈ [0, 1], let F = λf1 + (1 − λ)f2. Then, F(0) =
0, F(1) = 1 and 0 ≤ F(t) ≤ 1 ∀ t ∈ [0, 1]. Each point of K is diametral since if
f0 ∈ K,
sup{‖f0 − f‖ : f ∈ K} = 1 = diameter of K.
Hence, K does not have normal structure.
which consists of all the convex combinations of elements of S is called the convex
hull of S (and is generally denoted by coS). Clearly, coS is the smallest convex set
that contains S and, in fact, it is the intersection of all convex sets containing S.
Observe that if S is convex then coS = S. The closure of coS, generally denoted by
$\overline{\mathrm{co}}\,S$, is called the closed convex hull of S and it is the intersection of all closed convex
sets containing S (i.e., the smallest closed convex set that contains S). Again if S
is a closed convex set, we have,
$S = \mathrm{co}\,S = \overline{\mathrm{co}}\,S.$
Claim S is inductive (i.e., every chain in S has a lower bound). To verify this, let
Z be an arbitrary chain in S. Consider the intersection
\[
K^{*} = \bigcap_{K_\alpha \in Z} K_\alpha.
\]
Then, clearly, K ∗ is: (i) closed; (ii) convex; (iii) bounded; (iv) nonempty.
Moreover, (v) U (K ∗ ) ⊂ K ∗ .
(i) K ∗ is closed, (as an arbitrary intersection of closed sets).
(ii) Let x, y ∈ K∗ be arbitrary and let λ ∈ [0, 1]. Then x, y ∈ Kα for each Kα ∈ Z.
Since each Kα is convex (by hypothesis), it follows that λx + (1 − λ)y ∈ Kα
for each α, and so $\lambda x + (1-\lambda)y \in \bigcap_{K_\alpha \in Z} K_\alpha = K^{*}$, establishing (ii).
(iv) Since E is reflexive and K is closed convex bounded and nonempty we have
(by a consequence of Kakutani’s theorem (Corollary 9.8)) that K is weakly
compact. Each Kα is closed and convex and so is weakly closed (Theorem
8.19). Furthermore Z is a chain and each Kα is nonempty so Z has finite
intersection property. It now follows (from Lemma 11.10) that K ∗ is nonempty.
(v) Let $x \in \bigcap_{K_\alpha \in Z} K_\alpha = K^{*}$ be arbitrary. It suffices to show Ux ∈ K∗. But
$x \in \bigcap_{K_\alpha \in Z} K_\alpha$ ⇒ x ∈ Kα for each Kα ∈ Z. By hypothesis, Ux ∈ Kα for each
Kα ∈ Z. Hence $Ux \in \bigcap_{K_\alpha \in Z} K_\alpha = K^{*}$.
\[
K^{1} = \{x \in K^{0} : K^{0} \subset B(x, d_1)\} = K^{0} \cap \Bigl(\bigcap_{\omega \in K^{0}} B(\omega, d_1)\Bigr).
\]
Then, K¹ is a closed convex subset of K⁰. Furthermore, K¹ ≠ ∅ (since x∗ ∈ K¹).
The condition d1 < d implies K¹ ≠ K⁰ (verify). We now show U(K¹) ⊆ K¹. Let
y0 ∈ K¹ be arbitrary. We want to show Uy0 ∈ K¹. It suffices to show
(a) Uy0 ∈ K⁰ and (b) K⁰ ⊆ B(Uy0, d1).
‖Uy0 − Uv‖ ≤ ‖y0 − v‖ ≤ d1.
Corollary 11.18 (F.E. Browder [6]; Göhde [10]) Let E be a uniformly convex Banach
space and let K be a closed convex bounded nonempty subset of E. Suppose
that T : K → K is nonexpansive. Then T has a fixed point in K.
Proof. The corollary follows immediately from Theorem 11.17 on observing that
K has normal structure (Proposition 11.13) and E is reflexive (Theorem 9.21).
Remark 11.19 We had observed that the Contraction Mapping Principle is, per-
haps, the most important fixed point theorem. Certainly, it is now one of the most
durable and fruitful methods in mathematical analysis. Recall that a contraction
map T : (M, ρ) → (M, ρ) of a metric space into itself is one that satisfies the
following condition: there exists k ∈ [0, 1) such that for each x, y ∈ M,
It is then clear that nonexpansive maps, (those satisfying this inequality with k ∈
[0, 1]), can be considered as those maps lying at the boundary of the contraction
maps. Apart from being an obvious generalization of contraction mappings, interest
in such mappings stems mainly from the following two facts (see e.g., Bruck [7]):
Fact 1: Nonexpansive mappings appear in many applications as the transition op-
erators for initial–value problems of differential inclusions of the form
f∈ + A(t)u,
where the operators {A(t)} are generally minimally “continuous” and are, in some
sense “positive”.
13.1 Notations
Throughout this Chapter, Ω will represent some nonempty open subset of ℝ^n (n ≥ 1
an integer) with boundary ∂Ω. We will write K ⊂⊂ Ω if K is a compact subset of Ω.
A generic point in ℝ^n is denoted by x = (x1, x2, ..., xn), its Euclidean norm being
\[
\|x\| := \Bigl(\sum_{i=1}^{n} |x_i|^2\Bigr)^{1/2}.
\]
By a multi-index α, we mean an n-tuple α = (α1, α2, ..., αn) of
nonnegative integers αj, i.e., α ∈ ℕ^n. To the multi-index α, we associate the integer
$|\alpha| := \sum_{j=1}^{n} \alpha_j$ and a differential operator
\[
D^{\alpha} := \frac{\partial^{|\alpha|}}{\partial x_1^{\alpha_1}\,\partial x_2^{\alpha_2}\cdots\partial x_n^{\alpha_n}}.
\]
where χ_K is the characteristic function of the set K and L1(Ω) is the space of
classes of measurable functions on Ω which are Lebesgue integrable. In other words,
L1,loc(Ω) := {f : Ω → ℝ such that f is a measurable function on Ω and f is
integrable on every K ⊂⊂ Ω}. To emphasize the “smallest” and “biggest” size of the
spaces D(Ω) and L1,loc(Ω) respectively, we have the following embeddings between
the usual function spaces.
We now recall the definitions of the spaces in Fig 11.1. We first remark that it is
conventional to write C 0 (Ω) :≡ C(Ω). Then for a fixed integer m ≥ 1,
(ii) C0m (Ω) := {f ∈ C m (Ω) such that suppf ⊂⊂ Ω}, where suppf =support of f :=
the closure of the set of points at which f is nonzero.
For 1 ≤ p ≤ ∞, Lp(Ω) is the Lebesgue space defined as the set of classes of measurable
functions f on Ω such that $\int_\Omega |f(x)|^p\,dx < \infty$, if p < ∞, and such that there exists A with
µ(Ω \ A) = 0 and $\sup_{x \in A} |f(x)| < \infty$, if p = ∞, where µ(Ω \ A) denotes the measure of
Ω \ A.
2. The space Lp(Ω) is a Banach space under the norm
\[
\|f\|_{0,p,\Omega} :=
\begin{cases}
\Bigl(\int_\Omega |f(x)|^p\,dx\Bigr)^{1/p}, & \text{if } p < \infty, \\
\operatorname{ess\,sup}_{x\in\Omega} |f(x)|, & \text{if } p = \infty,
\end{cases}
\]
where ess sup_{x∈Ω} |f(x)| := inf{K ≥ 0 such that |f(x)| ≤ K a.e. on Ω} < ∞,
if p = ∞.
We simply write ‖f‖_{0,Ω} when p = 2. In this particular case, L2(Ω) is a Hilbert
space, the inner product being defined by $\langle f, g\rangle_{0,\Omega} := \int_\Omega f(x)\,g(x)\,dx$.
Example 12.1 Let Ω ⊂ ℝ³. We examine what the elements of C²(Ω) look like. Recall,
C²(Ω) := {f : Ω → ℝ such that D^α f is continuous on Ω, ∀ α ∈ ℕ³, |α| ≤ 2}.
Now x ∈ Ω ⊂ ℝ³ so x = (x1, x2, x3) for x1, x2, x3 ∈ ℝ. Hence, a generic α is given
by α = (α1, α2, α3) with |α| = α1 + α2 + α3 ≤ 2. With this restriction the possible
values of α are:
(i) (0, 0, 0), (ii) (1, 1, 0), (iii) (0, 1, 1), (iv) (1, 0, 1), (v) (2, 0, 0), (vi) (0, 2, 0),
(vii) (0, 0, 2), (viii) (0, 1, 0), (ix) (1, 0, 0), (x) (0, 0, 1).
(i) α = (0, 0, 0), |α| = 0 and D^α f = f
(ii) α = (1, 1, 0), |α| = 2 and D^α f = ∂²f/(∂x1^{α1} ∂x2^{α2} ∂x3^{α3}) = ∂²f/(∂x1 ∂x2)
(iii) α = (0, 1, 1), |α| = 2 and D^α f = ∂²f/(∂x1⁰ ∂x2 ∂x3) = ∂²f/(∂x2 ∂x3)
(iv) α = (1, 0, 1), |α| = 2 and D^α f = ∂²f/(∂x1 ∂x3)
(v) α = (2, 0, 0), |α| = 2 and D^α f = ∂²f/∂x1²
(vi) α = (0, 2, 0), |α| = 2 and D^α f = ∂²f/∂x2²
(vii) α = (0, 0, 2), |α| = 2 and D^α f = ∂²f/∂x3²
(viii) α = (0, 1, 0), |α| = 1 and D^α f = ∂f/∂x2
(ix) α = (1, 0, 0), |α| = 1 and D^α f = ∂f/∂x1
(x) α = (0, 0, 1), |α| = 1 and D^α f = ∂f/∂x3
Thus C 2 (Ω) consists of all real-valued functions defined on Ω which are such that
all their partial derivatives up to and including order 2 are continuous on Ω.
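The bookkeeping of multi-indices in this example is easy to automate. The following is a small sketch, with a function name chosen here for illustration, that lists every multi-index α in n variables with |α| ≤ m, so the count of partial derivatives entering the definition of C^m(Ω) can be checked mechanically.

```python
from itertools import product

def multi_indices(n, max_order):
    """All n-tuples of nonnegative integers alpha with |alpha| <= max_order,
    sorted by order |alpha| and then lexicographically."""
    found = [alpha for alpha in product(range(max_order + 1), repeat=n)
             if sum(alpha) <= max_order]
    return sorted(found, key=lambda alpha: (sum(alpha), alpha))

if __name__ == "__main__":
    # Multi-indices for C^2 on an open subset of R^3.
    for alpha in multi_indices(3, 2):
        print(alpha, "order", sum(alpha))
```

For n = 3 and m = 2 the list has ten entries, one of order 0, three of order 1 and six of order 2.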
(i) (0, 0) (ii) (1, 0), (0, 1) (iii) (1, 1), (2, 0), (0, 2) (iv) (3, 0), (0, 3), (1, 2), (2, 1)
(i) α = (0, 0), |α| = 0, D^α f = f.
(ii) α = (1, 0), |α| = 1, D^α f = ∂f/∂x1
α = (0, 1), |α| = 1, D^α f = ∂f/∂x2
(iii) α = (1, 1), |α| = 2, D^α f = ∂²f/(∂x1 ∂x2)
α = (2, 0), |α| = 2, D^α f = ∂²f/∂x1²
α = (0, 2), |α| = 2, D^α f = ∂²f/∂x2²
(iv) α = (3, 0), |α| = 3, D^α f = ∂³f/∂x1³
α = (0, 3), |α| = 3, D^α f = ∂³f/∂x2³
α = (1, 2), |α| = 3, D^α f = ∂³f/(∂x1 ∂x2²)
α = (2, 1), |α| = 3, D^α f = ∂³f/(∂x1² ∂x2)
Again, C 3 (Ω), Ω ⊆ IR2 , consists of all real-valued functions whose partial derivatives
of all orders up to and including order 3 are continuous on Ω.
Remark 12.3 Functions in the smallest space D(Ω) have several nice properties.
They are integrable, differentiable, bounded, .... However, these properties are not
suitable requirements for solutions of most partial differential equations. For example, in the
electrostatic model
−Δu = f in Ω
u = 0 on ∂Ω,
it is not realistic to assume that the solution u is in D(Ω), i.e., that the solution u
is infinitely differentiable on Ω and has compact support in Ω. On the other hand,
the biggest space L1,loc(Ω) contains “pathological” functions which, nonetheless, are
good candidates for solutions of PDEs. To appreciate this fact, let us consider the
beam problem,
−u″ = f on (−1, 1)
u(−1) = u(1) = 0,
The function f, which clearly belongs to Lp,loc(−1, 1), 1 < p < ∞, is pathological
as a non-continuous and hence non-differentiable function.
To overcome these difficulties associated with L1,loc(Ω), we shall define a new space
D′(Ω) which contains L1,loc(Ω) and whose elements enjoy, in a weak sense, several
of the nice properties of the functions in D(Ω).
To introduce what is meant by the weak sense, we first endow L1,loc(Ω) and D(Ω)
with the structure of reflexive, Hausdorff, complete, locally convex topological vector
spaces. For D(Ω), the topology in question is referred to as the canonical topology of
L. Schwartz. Actually, we do not need here the explicit definition of these topologies;
only the sense of convergence of sequences in L1,loc(Ω) and D(Ω) is of interest to us
13.3 Pseudo Topologies
Definition 12.4 (Pseudo topology of C^∞(Ω)). We say that a sequence {φ_n} in
C^∞(Ω) converges to φ ∈ C^∞(Ω) if for any K ⊂⊂ Ω, D^α φ_n → D^α φ uniformly on
K for all multi-indices α.
Definition 12.5 (Pseudo topology of D(Ω) = C0^∞(Ω)). A sequence {φ_n} in D(Ω)
converges to φ ∈ D(Ω) if there exists K ⊂⊂ Ω, such that (a) supp φ_n ⊂ K, ∀ n ≥ 1,
supp φ ⊂ K; and (b) D^α φ_n → D^α φ uniformly on K for all multi-indices α.
Definition 12.6 (Pseudo topology of L1,loc(Ω)). A sequence {f_n} in L1,loc(Ω)
converges to f ∈ L1,loc(Ω) if for any K ⊂⊂ Ω, ‖f_n − f‖_{L1} → 0 as n → ∞ or
equivalently, if for every φ ∈ D(Ω), ‖f_n φ − f φ‖_{L1} → 0 in K as n → ∞.
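More generally, if f ∈ C^{|α|}(Ω), then by integration by parts we obtain the following
\[
\int_\Omega D^{\alpha} f\,\varphi\,dx = (-1)^{|\alpha|} \int_\Omega f\,D^{\alpha}\varphi\,dx \quad \forall\,\varphi \in \mathcal{D}(\Omega). \qquad (12.2)
\]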
We observe immediately that the right side of the above equation is defined even
when f is not differentiable but is just locally integrable. This gives us a method of
generalizing the concept of derivative and leads to the following definition.
Definition 12.8 Let f, g ∈ L1,loc(Ω). Then g is called the distributional (or weak)
derivative of f, of order α, denoted by D_w^α f, if
\[
\int_\Omega g\,\varphi\,dx = (-1)^{|\alpha|} \int_\Omega f\,D^{\alpha}\varphi\,dx \quad \forall\,\varphi \in \mathcal{D}(\Omega). \qquad (12.3)
\]
Compare formulas (12.2) and (12.3) to see that these two notions of derivative
coincide when the derivative exists in the usual sense.
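Example 12.9 Let f(x) = |x| ∀ x ∈ ℝ. In the usual sense, f′ does not exist at
x = 0. Define the function,
\[
\operatorname{sgn}(x) :=
\begin{cases}
1, & \text{if } x > 0 \\
0, & \text{if } x = 0 \\
-1, & \text{if } x < 0.
\end{cases}
\]
Let φ ∈ D(ℝ) and supp φ ⊂ (−n, n) where n is some positive integer. Then,
\[
\begin{aligned}
\int_{\mathbb{R}} f(x)\varphi'(x)\,dx &= \int_{\mathbb{R}} |x|\varphi'(x)\,dx = \int_{-n}^{n} |x|\varphi'(x)\,dx \\
&= \int_{-n}^{0} -x\varphi'(x)\,dx + \int_{0}^{n} x\varphi'(x)\,dx \\
&= \int_{-n}^{0} \varphi(x)\,dx - \int_{0}^{n} \varphi(x)\,dx \quad \text{(integration by parts)} \\
&= -\int_{-n}^{n} \operatorname{sgn}(x)\varphi(x)\,dx = -\int_{\mathbb{R}} \operatorname{sgn}(x)\varphi(x)\,dx.
\end{aligned}
\]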
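The identity derived in this example can be checked numerically for one concrete test function. The sketch below is an illustration only: the off-centre bump function (chosen so that neither integral vanishes by symmetry), its centre 0.3, the trapezoidal rule and the step count are all assumptions made here, and agreement up to quadrature error for a single φ is of course not a proof.

```python
import math

def bump(x, center=0.3):
    """Smooth test function supported in (center - 1, center + 1)."""
    s = x - center
    if abs(s) >= 1.0:
        return 0.0
    return math.exp(-1.0 / (1.0 - s * s))

def bump_derivative(x, center=0.3):
    """Classical derivative of bump, computed by the chain rule."""
    s = x - center
    if abs(s) >= 1.0:
        return 0.0
    q = 1.0 - s * s
    return bump(x, center) * (-2.0 * s) / (q * q)

def sgn(x):
    return (x > 0) - (x < 0)

def trapezoid(g, a, b, steps=20000):
    """Composite trapezoidal rule for the integral of g over [a, b]."""
    h = (b - a) / steps
    total = 0.5 * (g(a) + g(b))
    for i in range(1, steps):
        total += g(a + i * h)
    return total * h

# Left side: integral of |x| * phi'(x); right side: minus the integral of sgn(x) * phi(x).
lhs = trapezoid(lambda x: abs(x) * bump_derivative(x), -2.0, 2.0)
rhs = -trapezoid(lambda x: sgn(x) * bump(x), -2.0, 2.0)

if __name__ == "__main__":
    print(lhs, rhs)
```

The interval [−2, 2] contains the support of the chosen φ, so it plays the role of (−n, n) in the computation.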
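Let φ ∈ D(ℝ) and supp φ ⊂ (−n, n). Then,
\[
\begin{aligned}
\int_{\mathbb{R}} H\varphi'\,dx &= \int_{0}^{n} 1\cdot\varphi'(x)\,dx \\
&= \varphi(n) - \varphi(0) \quad \text{(integration by parts)} \\
&= -\varphi(0), \quad \text{since } \varphi(n) = 0 \\
&:= -\langle \delta, \varphi\rangle,
\end{aligned}
\]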
We observe that the distributional (or weak) derivative of the Heaviside function
does not exist. However, to the Heaviside function H corresponds a distribution TH .
The derivative of this distribution exists and is equal to δ. We do not explore this
here. For more on distributions the reader may consult [1].
14.1 Introduction
In the classical theory of Partial Differential Equations (PDEs), we seek the solution
u which satisfies some relation among its partial derivatives and u in some open sub-
set Ω of IRn . The existence and uniqueness of the solution of well-known problems
of PDEs can be proved if the given boundary conditions are smooth. This classical
theory fails if the boundary conditions fail to be smooth or if the right hand side
is not smooth enough, even at finitely many points. To handle this situation, it is
necessary to generalize the usual notion of differentiation. This generalization is
the so-called distributional (or weak) derivative, which we introduce in this chapter.
Next we show that solutions of PDEs lie in a special class of Banach spaces called
Sobolev spaces. There are several Sobolev spaces associated with any open subset
Ω ⊂ IRn . Two special kinds that appear in most problems are H m (Ω) and H0m (Ω)
and both are separable Hilbert spaces. This gives a way to convert our original
problem related to the existence and uniqueness of solution, into a problem that
can be solved using the theory of Hilbert spaces, e.g., by means of the
Lax-Milgram theorem. In this chapter, we first state and prove this theorem. Then
we introduce Sobolev spaces and use some of their properties, in conjunction with
the Lax-Milgram theorem, to solve the Dirichlet problem, the Neumann problem and the
biharmonic equation.
14.2 Abstract Minimization Problem
We begin with the notion of a bilinear functional.
Definition 13.1 Let $E$ be a complex vector space. A mapping $a(\cdot,\cdot) : E \times E \to \mathbb{C}$
is called a bilinear form if
(a) $a(\alpha x + \beta y, z) = \alpha a(x, z) + \beta a(y, z)$, for all $x, y, z \in E$ and $\alpha, \beta \in \mathbb{C}$; and
(b) $a(x, \lambda u + \mu v) = \bar\lambda a(x, u) + \bar\mu a(x, v)$, for all $x, u, v \in E$ and for all scalars $\lambda, \mu \in \mathbb{C}$.
Note that a bilinear form is linear with respect to the first variable and anti-linear
with respect to the second variable. Of course, in a real vector space, bilinear
functionals are linear in both variables.
Example 13.2 (a) An inner product is a bilinear form.
(b) Let $f, g : E \to \mathbb{R}$ be linear functionals on a vector space $E$. Then $a(x, y) =
f(x)g(y)$ is a bilinear form on $E$.
(c) Let $P, Q : H \to H$ be linear operators on a Hilbert space. Then $a(\cdot,\cdot) : H \times H \to \mathbb{C}$
defined by
a(u, v) = (P u, v)
or = (u, Qv)
or = (P u, Qv),
is a bilinear form.
Let $X$ be a real normed linear space with norm $\|\cdot\|$. Let $a(\cdot,\cdot) : X \times X \to \mathbb{R}$
be a continuous bilinear form and $f : X \to \mathbb{R}$ be a continuous linear form. For
$\emptyset \ne K \subset X$, we, in general, associate an abstract minimization problem as follows:
Find an element $x^* \in X$ such that:
(i) $x^* \in K$;
(ii) $Jx^* = \inf_{x\in K} Jx$, where $J : X \to \mathbb{R}$ is defined by
\[ J(x) := \tfrac12 a(x, x) - f(x) \in \mathbb{R}. \]
For a solution of this abstract minimization problem we have the following theorem.
Theorem 13.4 Assume in addition that the following conditions hold:
(a) $X$ is complete;
(b) $K$ is closed and convex;
(c) the bilinear form $a(\cdot,\cdot)$ is symmetric; and
(d) there exists $\alpha > 0$ such that $a(x, x) \ge \alpha\|x\|^2$ $\forall x \in X$.
Then, the abstract minimization problem has one and only one solution.
Proof. Conditions (c) and (d) imply that the bilinear form $a(\cdot,\cdot)$ is an inner product on $X$ with
the associated norm given by $\|x\|_X := \sqrt{a(x, x)}$, which is equivalent to the given
norm $\|\cdot\|$. Hence $X$ is a complete inner product space and so is a Hilbert space. We
now denote it by $H$. By the Riesz representation theorem, there exists
a unique $\sigma f \in H$ such that $f(x) = a(\sigma f, x)$ for each $x \in H$. This implies that
\[ \begin{aligned}
Jx = \tfrac12 a(x, x) - f(x) &= \tfrac12 a(x, x) - a(\sigma f, x) \\
&= \tfrac12 a(x - \sigma f, x - \sigma f) - \tfrac12 a(\sigma f, \sigma f),
\end{aligned} \]
so that
\[ \min_{x\in K} Jx = \tfrac12 \min_{x\in K}\{a(x - \sigma f, x - \sigma f)\} - \tfrac12 a(\sigma f, \sigma f). \]
Hence, solving the abstract minimization problem amounts to minimizing the distance
between the unique element $\sigma f$ and the set $K$, with respect to the norm
$\sqrt{a(\cdot,\cdot)} = \|\cdot\|_X$. Consequently, the solution is simply the projection of the element
$\sigma f$ onto $K$ with respect to the inner product $a(\cdot,\cdot)$. By the projection theorem, such
a projection exists and is unique, since $K$ is a closed and convex subset of $H$.
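The argument of the proof can be illustrated in $\mathbb{R}^2$. The sketch below is an illustration under stated assumptions, not part of the original text: the form `a`, the functional `f` and the set $K = [0,1]^2$ are hypothetical choices. Because this particular `a` is diagonal, the squared $a$-distance splits coordinate by coordinate, so the projection of $\sigma f$ onto the box is obtained by clipping each coordinate; the result is compared with a brute-force search over a grid in $K$.

```python
def a(x, y):
    """Symmetric, elliptic bilinear form a(x, y) = 2*x1*y1 + x2*y2 on R^2 (illustrative)."""
    return 2.0 * x[0] * y[0] + 1.0 * x[1] * y[1]

def f(x):
    """Continuous linear form f(x) = 4*x1 - 3*x2 (illustrative)."""
    return 4.0 * x[0] - 3.0 * x[1]

def J(x):
    return 0.5 * a(x, x) - f(x)

# Riesz representative with respect to a: a(sigma_f, x) = 4*x1 - 3*x2 = f(x).
sigma_f = (2.0, -3.0)

def clip(t, lo, hi):
    return max(lo, min(hi, t))

# Projection of sigma_f onto K = [0, 1]^2 in the a-inner product (coordinatewise clip,
# valid here only because a is diagonal).
x_star = (clip(sigma_f[0], 0.0, 1.0), clip(sigma_f[1], 0.0, 1.0))

# Brute-force minimizer of J over a grid of points of K, for comparison.
grid = [(i / 20, j / 20) for i in range(21) for j in range(21)]
grid_best = min(grid, key=J)

# Check the completed-square identity from the proof at one sample point.
x = (0.3, 0.7)
diff = (x[0] - sigma_f[0], x[1] - sigma_f[1])
identity_gap = abs(J(x) - (0.5 * a(diff, diff) - 0.5 * a(sigma_f, sigma_f)))

print(x_star, grid_best, identity_gap)
```

For a non-diagonal form the projection would no longer be a coordinatewise clip, so this shortcut should not be read as a general method.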
Definition 13.5 A bilinear form $a(\cdot,\cdot) : X \times X \to \mathbb{C}$ which satisfies the condition
$a(x, x) \ge \alpha\|x\|^2$ $\forall x \in X$ and for some $\alpha > 0$ is said to be $X$-elliptic on $X$.
Variational formulations.
The variational forms of the abstract minimization problem are summarized in the
following theorem.
a(x∗ , v − x∗ ) ≥ f (v − x∗ ),
a(σf − x∗ , v − x∗ ) ≤ 0.
This implies, a(x∗ , v−x∗ ) ≥ a(σf, v−x∗ ) = f (v−x∗ ), since ∀v ∈ H, f (v) = a(σf, v),
by Riesz representation theorem, and clearly, v − x∗ ∈ H. This establishes (a).
14.2.2 The Non-symmetric case
The abstract minimization problem can also be formulated without making explicit
reference to the functional J. In this case, the problem is called an abstract varia-
tional problem and it is formulated as follows:
x∗ ∈ K and ∀v ∈ K, a(x∗ , v − x∗ ) ≥ f (v − x∗ ),
where K is a closed convex subset of H, or, (b) find an element x∗ such that
if K is a subspace of X.
We have seen from Theorem 13.4 that each such problem has one and only one
solution if $X$ is complete, the subset $K$ is closed and convex, and the bilinear form
is $X$-elliptic, continuous and symmetric.
If the assumption of symmetry of the bilinear form is dropped, it has been proved
that the above variational problem still has one and only one solution, but in this case,
there is no longer necessarily an associated minimization problem.
a(v, u) = f (v) ∀v ∈ H;
(b) if, in addition, $a(\cdot,\cdot)$ is symmetric and $a(v, v) \ge 0$ $\forall v \in H$, then the function
$J : H \to \mathbb{R}$ defined by
\[ J(v) := \tfrac12 a(v, v) - f(v) \]
has minimum value at $u$ given in part (a).
Aw0 :H / IR by
A(w) = z.
Combining this definition with equation (13.5) we obtain the following relation,
Claim 2.
(i) A is linear;
(ii) A is one-to-one;
(iii) $A$ is continuous (use $\alpha\|v\|^2 \le a(v, u) \le M\|u\|^2$);
(iv) $A^{-1}$ exists and is continuous on $R(A)$, the range of $A$;
(v) R (A) = H.
Now, f ∈ H ∗ implies, by Riesz representation theorem, that there exists a unique
z ∈ H such that
Since R (A) = H, there exists u ∈ H such that A(u) = z. Using (13.7) and (13.6),
we obtain the following equation:
This completes the proof of existence. We now prove uniqueness. Assume there
exist $u$ and $u^*$ in $H$, $u \ne u^*$, such that
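For part (b), assume now that $a(\cdot,\cdot)$ is symmetric and $a(v, v) \ge 0$ $\forall v \in H$.
Then, using the facts that $f(u) = a(u, u)$ and $f(v) = a(v, u) = a(u, v)$, we have that
\[ \begin{aligned}
0 \le \tfrac12 a(u - v, u - v) &= \tfrac12 a(u, u) - a(u, v) + \tfrac12 a(v, v) \\
&= -\left[\tfrac12 a(u, u) - a(u, u)\right] + \left[\tfrac12 a(v, v) - a(u, v)\right] \\
&= -\left[\tfrac12 a(u, u) - f(u)\right] + \left[\tfrac12 a(v, v) - f(v)\right],
\end{aligned} \]
so that $J(u) \le J(v)$ $\forall v \in H$. Hence, $J$ has minimum value at $u$, i.e., $J(u) = \min_{v\in H} J(v)$.
The proof is complete.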
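A small finite-dimensional illustration may help separate parts (a) and (b). The sketch below is not from the original text: it takes $H = \mathbb{R}^2$, a hypothetical non-symmetric but elliptic form $a(v, u) = v \cdot (Mu)$ and $f(v) = b \cdot v$, solves $a(v, u) = f(v)$ for all $v$ (which here reduces to the linear system $Mu = b$), and then checks that this $u$ does not minimize $J(v) = \frac12 a(v, v) - f(v)$.

```python
# Illustrative data: M has symmetric part 2*I, so a(v, v) = 2*|v|^2 (elliptic),
# but M is not symmetric, so a(v, u) != a(u, v) in general.
M = ((2.0, 1.0), (-1.0, 2.0))
b = (3.0, 1.0)

def a(v, u):
    Mu = (M[0][0] * u[0] + M[0][1] * u[1], M[1][0] * u[0] + M[1][1] * u[1])
    return v[0] * Mu[0] + v[1] * Mu[1]

def f(v):
    return b[0] * v[0] + b[1] * v[1]

def J(v):
    return 0.5 * a(v, v) - f(v)

# a(v, u) = f(v) for all v is equivalent to M u = b; solve by Cramer's rule.
det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
u = ((b[0] * M[1][1] - M[0][1] * b[1]) / det,
     (M[0][0] * b[1] - M[1][0] * b[0]) / det)

e1, e2 = (1.0, 0.0), (0.0, 1.0)
residuals = (a(e1, u) - f(e1), a(e2, u) - f(e2))

# Here J(v) = |v|^2 - b.v, whose minimizer is b/2, which differs from u.
w = (b[0] / 2.0, b[1] / 2.0)
print(u, residuals, J(u), J(w))
```

The variational equation of part (a) is solved uniquely, yet $J(w) < J(u)$, consistent with the remark that without symmetry there need not be an associated minimization problem.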
1. Using the notation of the proof of Theorem 12.7, prove the following assertions:
(i) Aw0 is a bounded linear functional on H;
(ii) A is linear;
(iii) A is continuous;
(iv) A is one-to-one;
(v) A−1 exists and is continuous on R (A);
(vi) R (A) = H.
Hint: $\|A(v)\|^2 = \langle A(v), A(v)\rangle = a(A(v), v) \le M\|A(v)\|\,\|v\|$ $\forall v \in H$; also,
$\alpha\|v\|^2 \le |a(v, v)| = |\langle v, A(v)\rangle| \le \|v\|\,\|A(v)\| \Rightarrow \|A(v)\| \ge \alpha\|v\|$ $\forall v \in H$.
By using the properties of the norm of $L_p(\Omega)$, it follows that $W^{m,p}(\Omega)$ is a normed space.
Definition 13.10 For any $u \in W^{m,p}(\Omega)$ we define the semi-norms, denoted by $|\cdot|_{m,p}$,
as follows:
\[ \|u\|_{m,p} := \Big(\sum_{|\alpha|\le m} \|D^\alpha u\|_p^p\Big)^{1/p}, \quad \text{for } 1 \le p < \infty, \]
Remark 13.11 $H^m(\Omega)$ is an inner product space. For, given $u$ and $v \in H^m(\Omega)$ we
define an inner product as follows:
\[ \langle u, v\rangle := \sum_{|\alpha|\le m} \int_\Omega D^\alpha u(x)\,D^\alpha v(x)\,dx. \]
Theorem 13.13 W m,p (Ω) is a Banach space with respect to the norm given in
Definition 10.2.
Theorem 13.14 W m,p (Ω) is separable for 1 ≤ p < ∞, and it is uniformly convex
(and hence also reflexive) for 1 < p < ∞.
Corollary 13.15 H m (Ω) and H0m (Ω) are separable Hilbert spaces. (H0m (Ω) is sep-
arable, since a closed subspace of a separable space is separable).
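Remark 13.16 $H_0^m(\Omega)$ as the origin of BVP. The distinction between $H_0^m(\Omega)$ and
$H^m(\Omega)$ is the origin of boundary value problems (BVP). In fact, if
$f := \sum_{|\alpha|\le m} (-1)^{|\alpha|} D^{2\alpha} u$ with $u \in H^m(\Omega) := H_0^m(\Omega) \oplus H_0^m(\Omega)^\perp$, then there exists
$v \in H_0^m(\Omega)$, solution of $\sum_{|\alpha|\le m} (-1)^{|\alpha|} D^{2\alpha} v = f$ in $D'(\Omega)$.
Formally, $v \in H_0^m(\Omega)$ implies $v = \frac{\partial v}{\partial n} = \dots = \frac{\partial^{m-1} v}{\partial n^{m-1}} = 0$ on $\partial\Omega$.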
Remark 13.17 The notation $H_0^m(\mathbb{R}^n)$ is redundant because $D(\mathbb{R}^n)$ is dense in $H^m(\mathbb{R}^n)$.
We remark also that $D(\Omega)$ is not dense in $H^m(\Omega)$ if $\Omega$ is bounded.
Definition 13.18 Let $\Omega$ be a nonempty open subset of $\mathbb{R}^n$. We say that $\Omega$ has the
m-extension property if there exists an operator $T : W^{m,p}(\Omega) \to W^{m,p}(\mathbb{R}^n)$ such that:
(i) $T$ is linear;
(ii) $T(u)(x) = u(x)$ a.e. in $\Omega$ $\forall u \in W^{m,p}(\Omega)$;
(iii) $\|Tu\|_{m,p,\mathbb{R}^n} \le M\|u\|_{m,p,\Omega}$ $\forall u \in W^{m,p}(\Omega)$ for some $M \ge 0$.
The map $T$ is called an m-extension operator.
Example 13.19 The half space $\mathbb{R}^n_+ := \{(x_1, x_2, \dots, x_n) = (x', x_n) \in \mathbb{R}^n,\ x_n > 0\}$
has the m-extension property, the m-extension operator $T$ being defined as follows:
\[ Tu(x', x_n) := \begin{cases} u(x', x_n), & \text{if } x_n > 0 \\ u(x', -x_n), & \text{if } x_n < 0. \end{cases} \]
More generally, regular (smooth) domains have the m-extension property. A very
important result on the m-extension property is the following theorem.
Theorem 13.20 If $\Omega$ has the m-extension property, then $C^\infty(\bar\Omega) \cap W^{m,p}(\Omega)$ is dense
in $W^{m,p}(\Omega)$.
We now conclude this section with the following important embedding theorem.
Theorem 13.21 (Sobolev Embedding Theorem) Let Ω ⊆ IRn be an open set having
the m-extension property. Then, the following inclusions are continuous.
Corollary 13.22 Assume that the open set $\Omega$ has the m-extension property. Consider
the number $p^*$ defined by $\frac{1}{p^*} = \frac12 - \frac{m}{n}$. Then, three situations are possible.
(a) If $\frac{1}{p^*} > 0$, i.e., $n > 2m$, then $H^m(\Omega) \hookrightarrow L_{p^*}(\Omega)$.
(b) If $\frac{1}{p^*} = 0$, i.e., $n = 2m$, then $H^m(\Omega) \hookrightarrow L_p(\Omega')$ for any $1 \le p < \infty$ and for
any bounded open subset $\Omega' \subset \Omega$. Moreover, we have the global embedding
$H^m(\Omega) \hookrightarrow L_p(\Omega)$ if $2 \le p < \infty$.
(c) If $\frac{1}{p^*} < 0$, i.e., $n < 2m$, then $H^m(\Omega) \hookrightarrow C(\bar\Omega)$. More precisely, $H^m(\Omega) \hookrightarrow
C^s(\bar\Omega)$, where $s \ge 0$ is the largest nonnegative integer such that $s < m - \frac{n}{2}$.
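The three cases of the corollary amount to a small piece of arithmetic on $n$ and $m$. The helper below is a minimal sketch (the function name `sobolev_case` and its return format are our own, not from the text); it uses exact rational arithmetic so that the borderline case $n = 2m$ is detected reliably.

```python
from fractions import Fraction
import math

def sobolev_case(n, m):
    """Classify the embedding of H^m(Omega), Omega in R^n, following the three cases.

    Returns ("a", p_star), ("b", None) or ("c", s), where s is the largest
    nonnegative integer with s < m - n/2.
    """
    inv_p_star = Fraction(1, 2) - Fraction(m, n)
    if inv_p_star > 0:                      # n > 2m
        return ("a", 1 / inv_p_star)
    if inv_p_star == 0:                     # n = 2m
        return ("b", None)
    # n < 2m: the largest integer strictly below m - n/2
    s = math.ceil(Fraction(m) - Fraction(n, 2)) - 1
    return ("c", s)

print(sobolev_case(3, 1))  # H^1 in dimension 3
print(sobolev_case(2, 1))  # H^1 in dimension 2
print(sobolev_case(1, 1))  # H^1 in dimension 1
```

For instance, $n = 3$, $m = 1$ gives $\frac{1}{p^*} = \frac12 - \frac13 = \frac16$, i.e., $p^* = 6$, while $n = 1$, $m = 1$ falls under case (c) with $s = 0$.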
Remark 13.23 In this short introduction to Sobolev spaces and in the examples
that follow, we shall not require the use of the embedding theorem (Theorem
13.21) and Corollary 13.22.
In particular, for $p = 2$,
\[ \|u\|_{0,\Omega} \le C\left\|\frac{\partial u}{\partial x_n}\right\|_{0,\Omega} \quad \forall u \in H_0^1(\Omega). \]
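Corollary 13.25 Let $\Omega \subset \mathbb{R}^n$ be an open set, bounded along some axis. Then $|\cdot|_{1,p}$
becomes a norm and is equivalent to the norm $\|\cdot\|_{1,p}$ on $W_0^{1,p}(\Omega)$. Consequently, the
Poincare inequality, in this case, can be written as:
\[ \|u\|_{0,\Omega} \le C\left\|\frac{\partial u}{\partial x_n}\right\|_{0,\Omega} \quad \forall u \in H_0^1(\Omega). \]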
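Lemma 13.26 (Green's Identity) Let $\Omega \subset \mathbb{R}^n$ be an open set with smooth boundary
$\partial\Omega$, $u \in C^2(\bar\Omega)$ and $v \in C^1(\bar\Omega)$. Then,
\[ \int_\Omega \nabla u\cdot\nabla v\,dx = -\int_\Omega (\Delta u)v\,dx + \int_{\partial\Omega} v\,\frac{\partial u}{\partial n}\,ds, \]
where $\nabla u := \left(\frac{\partial u}{\partial x_1}, \dots, \frac{\partial u}{\partial x_n}\right)$, $\Delta u := \sum_{i=1}^n \frac{\partial^2 u}{\partial x_i^2}$, $\frac{\partial u}{\partial n} := \nabla u\cdot n$, and $n$ is the unit normal
vector on $\partial\Omega$.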
Example 13.27 (Dirichlet Problem for the Laplace Operator) Let $\Omega \subseteq \mathbb{R}^n$ be a
bounded smooth domain. Let $f$ be a real-valued continuous function on it. The
Dirichlet problem is to find a function $u \in C^2(\bar\Omega)$ such that
\[ -\Delta u = f \ \text{in } \Omega, \qquad u = 0 \ \text{on } \partial\Omega. \]
Let us assume that the solution of the above problem exists. Then, multiplying the
above equation by an arbitrary $\varphi \in D(\Omega)$ and integrating, we obtain
\[ -\int_\Omega \Delta u\,\varphi\,dx = \int_\Omega f\varphi\,dx. \]
This implies, by using the above Green's identity and the fact that $\varphi = 0$ on $\partial\Omega$,
\[ \int_\Omega \nabla u\cdot\nabla\varphi\,dx = \int_\Omega f\varphi\,dx. \]
This observation gives us a clue on how to define a weak solution of the Dirichlet
problem. We say that $u \in H_0^1(\Omega)$ is a weak solution of the Dirichlet problem if
\[ \int_\Omega \nabla u\cdot\nabla v\,dx = \int_\Omega fv\,dx \quad \forall v \in H_0^1(\Omega). \tag{13.9} \]
We now prove the existence and uniqueness of this weak solution for a suitable $f$.
Let $f \in L_2(\Omega)$ and $H = H_0^1(\Omega)$. Define a bilinear form $a(\cdot,\cdot)$ on $H \times H$ and a
continuous linear form $F$ on $H$, respectively, as follows:
\[ a(u, v) := \int_\Omega \nabla u\cdot\nabla v\,dx \quad \forall u, v \in H_0^1(\Omega); \]
\[ F(v) := \int_\Omega fv\,dx \quad \forall v \in H_0^1(\Omega). \]
Then, $|a(u, v)| \le |u|_1|v|_1 \le \|u\|_1\|v\|_1$, by the Cauchy-Schwarz inequality and the
definitions of norm and semi-norm on $H^1(\Omega)$. Furthermore,
\[ |a(v, v)| = a(v, v) = \int_\Omega \nabla v\cdot\nabla v\,dx \ge \alpha\|v\|_1^2, \]
for some $\alpha > 0$, by the Poincare inequality. So, by the Lax-Milgram theorem, equation
(13.9) has a unique solution $u \in H := H_0^1(\Omega)$, and it is characterized by
\[ J(u) = \min_{v\in H_0^1(\Omega)} J(v), \]
where $J(v) := \frac12\int_\Omega \nabla v\cdot\nabla v\,dx - \int_\Omega fv\,dx$.
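To see the weak formulation (13.9) at work, one can restrict it to a finite-dimensional subspace of $H_0^1(\Omega)$. The sketch below is an illustration under our own assumptions, not part of the original text: it takes $\Omega = (0, 1)$, uses continuous piecewise linear "hat" functions on a uniform mesh, approximates $F(\varphi_i)$ by $h f(x_i)$, and solves the resulting tridiagonal system with the Thomas algorithm. For $f \equiv 1$ the exact solution is $u(x) = x(1 - x)/2$.

```python
def solve_dirichlet_1d(f, n):
    """Galerkin approximation of -u'' = f on (0, 1), u(0) = u(1) = 0, with n interior nodes.

    Uses hat functions phi_i on a uniform mesh of width h = 1/(n + 1), n >= 1.
    """
    h = 1.0 / (n + 1)
    nodes = [(i + 1) * h for i in range(n)]
    diag = [2.0 / h] * n              # a(phi_i, phi_i)
    off = -1.0 / h                    # a(phi_i, phi_{i+1})
    rhs = [h * f(x) for x in nodes]   # F(phi_i), approximated by h * f(x_i)

    # Thomas algorithm: forward elimination, then back substitution.
    c = [0.0] * n
    d = [0.0] * n
    c[0] = off / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - off * c[i - 1]
        c[i] = off / denom
        d[i] = (rhs[i] - off * d[i - 1]) / denom
    u = [0.0] * n
    u[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        u[i] = d[i] - c[i] * u[i + 1]
    return nodes, u

nodes, u = solve_dirichlet_1d(lambda x: 1.0, 9)
max_error = max(abs(ui - 0.5 * x * (1.0 - x)) for x, ui in zip(nodes, u))
print(max_error)
```

In this particular one-dimensional case the computed nodal values agree with the exact solution up to rounding; this is a feature of the example and should not be expected in general.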
If we assume that the solution of (13.10) exists, then just as in Example 13.27, we
can define a weak solution. We say that $u \in H_0^1(\Omega)$ is a weak solution of (13.10) if
\[ \int_\Omega \sum_{1\le i,j\le n} a_{ij}(x)\frac{\partial u}{\partial x_j}\frac{\partial v}{\partial x_i} + \int_\Omega a_0uv = \int_\Omega fv, \quad \forall v \in H_0^1(\Omega). \tag{13.11} \]
By using the fact that $a_0$ is nonnegative, and the Poincare inequality, it can be shown
that $a(\cdot,\cdot)$ is an $H$-elliptic bilinear form. Hence, by the Lax-Milgram theorem, there
exists a unique solution of equation (13.11). Furthermore, if $a_{ij} = a_{ji}$, so that $a(\cdot,\cdot)$ is
symmetric, then we can characterize the weak solution $u$ of (13.11) as follows:
\[ J(u) = \min_{v\in H_0^1(\Omega)} J(v), \]
\[ \text{where } J(v) := \frac12\int_\Omega \sum_{1\le i,j\le n} a_{ij}(x)\frac{\partial v}{\partial x_i}\frac{\partial v}{\partial x_j} + \frac12\int_\Omega a_0v^2 - \int_\Omega fv. \]
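Example 13.29 (The Neumann Problem) Let $\Omega \subset \mathbb{R}^n$ be an open set with smooth
boundary $\partial\Omega$, and let $f$ be a real-valued continuous function on $\Omega$. The Neumann problem
is to find a function $u \in C^2(\bar\Omega)$ such that
\[ -\Delta u + u = f \ \text{in } \Omega, \qquad \frac{\partial u}{\partial n} = 0 \ \text{on } \partial\Omega. \tag{13.12} \]
Let $\varphi \in D(\Omega)$. Multiplying the above equation by $\varphi$, integrating and using Green's
formula, it can be shown that
\[ \int_\Omega \nabla u\cdot\nabla\varphi + \int_\Omega u\varphi = \int_\Omega f\varphi. \]
Let $f \in L_2(\Omega)$. We say $u \in H^1(\Omega)$ is a weak solution of (13.12) if
\[ \int_\Omega \nabla u\cdot\nabla v + \int_\Omega uv = \int_\Omega fv, \quad \forall v \in H^1(\Omega). \tag{13.13} \]
Now, define a symmetric bilinear form $a(\cdot,\cdot)$ on $H \times H$, where $H := H^1(\Omega)$, by
\[ a(u, v) := \int_\Omega \nabla u\cdot\nabla v + \int_\Omega uv. \]
Then, $a(u, u) = \|u\|_1^2$, so $a(\cdot,\cdot)$ is trivially $H^1(\Omega)$-elliptic. So, (13.13) has a unique solution
$u \in H^1(\Omega)$ characterized by
\[ J(u) = \min_{v\in H^1(\Omega)} J(v), \]
where $J(v) := \frac12\int_\Omega \nabla v\cdot\nabla v + \frac12\int_\Omega v^2 - \int_\Omega fv$.
We note that to prove the existence and uniqueness of the solution of equation
(13.13), Poincare's inequality has not been used. Hence we require the weak solution
to lie in $H^1(\Omega)$ instead of in $H_0^1(\Omega)$.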
\[ J(u) = \min_{v\in H_0^2(\Omega)} J(v), \]
where $J(v) := \frac12\int_\Omega |\Delta v|^2 - \int_\Omega fv$, $\forall v \in H_0^2(\Omega)$.
Miscellaneous exercises
1. Let $\rho : \mathbb{R} \times \mathbb{R} \to \mathbb{R}$ be defined by
\[ \rho(x, y) = \left|\frac{x}{1 + |x|} - \frac{y}{1 + |y|}\right|, \quad x, y \in \mathbb{R}. \]
\[ \|x + M\| = \inf_{m\in M} \|x + m\|. \]
\[ Tx = x + M. \]
Prove: (a) $\|\cdot\|$ is a norm on $E/M$; (b) $T$ is a bounded linear map; (c) $\|T\| \le 1$.
8. Let $\{0\} \ne E$ be a normed linear space. Prove that $E$ is a Banach space if and
only if $B = \{x \in E : \|x\| \le 1\}$ is a complete metric space.
10. Prove the following special case of Brouwer fixed point theorem. Let f : [0, 1]
/ [0, 1] be continuous. Then f has a fixed point.
(Hint: Consider the function $h : [0, 1] \to \mathbb{R}$ defined by $h(x) = f(x) - x$
and use the Intermediate Value Theorem).
11. Let $E$ be an arbitrary normed linear space and let $T : \ell_p^n \to E$, $1 \le p < \infty$,
be an arbitrary linear map. Prove that $T$ is continuous.
Hint: Use the fact that all norms on $\ell_p^n$ are equivalent. Let $\{e_i\}_{i=1}^n$ be the
canonical basis in $\ell_p^n$ where
$e_i = (0, 0, \dots, 0, 1, 0, \dots, 0)$ (with 1 in the $i$th position), $i = 1, 2, \dots, n$.
For arbitrary $x \in \ell_p^n$, $x = \sum_{i=1}^n \alpha_i e_i$ for some unique $\alpha_i \in \mathbb{R}$. Recall that
$\|x\|_0 = \max_{1\le i\le n} |\alpha_i|$ is a norm on $\ell_p^n$.
12. Let $E$ be a normed linear space with $\dim E = n < \infty$ and let $\{e_1, e_2, \dots, e_n\}$
be a basis for $E$. Then, for each $x \in E$, there exist unique scalars $\alpha_i \in \mathbb{R}$ such
that
\[ x = \sum_{i=1}^n \alpha_i e_i. \]
Let $T : E \to \ell_1^n$ be defined by
\[ T(x) = (\alpha_1, \alpha_2, \dots, \alpha_n). \]
(i) $T$ is 1-1 and linear;
(ii) $T^{-1} : \ell_1^n \to E$ is continuous;
(ii) follows trivially from the previous exercise where $p = 1$. Now prove:
(a) $T : E \to \ell_1^n$ is continuous;
(b) If $Y$ is an arbitrary normed linear space, prove that every linear map
$U : E \to Y$ is continuous.
Hint: For (a), if $T$ is not continuous, then for some $\varepsilon > 0$ there exists a sequence
$\{x_n\}$ in $E$ such that $x_n \to 0$ and $\|Tx_n\| \ge \varepsilon$. If we now set $u_n = \frac{x_n}{\|Tx_n\|}$
we obtain $u_n \to 0$ whereas $\|Tu_n\| = 1$. The subset of $\ell_1^n$ consisting of all
vectors of norm 1 is compact, so $\{T(u_n)\}$ has a convergent subsequence which
converges to a vector of norm 1. Now apply the continuity of $T^{-1}$.
Remark Part (b) shows that any linear map from a finite dimensional
normed linear space into an arbitrary normed linear space is always continuous.
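13. Let $E$ denote the space of all complex sequences $\{x_i\}_{i=1}^\infty$. Define a function
$\rho : E \times E \to \mathbb{R}$ by
\[ \rho(x, y) = \sum_{i=1}^\infty \frac{1}{2^i}\,\frac{|x_i - y_i|}{1 + |x_i - y_i|} \]
where $x = \{x_i\}_{i=1}^\infty$, $y = \{y_i\}_{i=1}^\infty$. Prove: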
Remark. Part (c) shows that while every normed space is a metric space
(by $\rho(x, y) = \|x - y\|$) there are metric spaces which are not normed spaces.
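14. Let $E = \mathbb{R}^n$. Consider the function $\|\cdot\|_\infty : \mathbb{R}^n \to \mathbb{R}$ defined, for arbitrary
$x = (x_1, x_2, \dots, x_n) \in \mathbb{R}^n$, by $\|x\|_\infty = \max_{1\le i\le n} |x_i|$. (a) Prove $\|\cdot\|_\infty$ is a norm
on $\mathbb{R}^n$. With this norm, $\mathbb{R}^n$ is usually denoted by $\ell_\infty^n$. Recall that $\mathbb{R}^n$ with
norm $\|x\|_p = \left(\sum_{i=1}^n |x_i|^p\right)^{1/p}$ is denoted by $\ell_p^n$. (b) Prove $\|x\|_\infty = \lim_{p\to\infty} \|x\|_p$.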
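Before attempting the proof of part (b), it can be instructive to watch the limit numerically. The snippet below is only an illustration for one sample vector (our own choice), not a proof: it evaluates $\|x\|_p$ for increasing $p$ and compares with $\|x\|_\infty$.

```python
def p_norm(x, p):
    """The l_p^n norm (sum of |x_i|^p)^(1/p) for 1 <= p < infinity."""
    return sum(abs(t) ** p for t in x) ** (1.0 / p)

def max_norm(x):
    """The l_infinity^n norm max |x_i|."""
    return max(abs(t) for t in x)

x = [3.0, -4.0, 1.0]          # sample vector, chosen for illustration
exponents = [1, 2, 4, 8, 16, 32, 64]
norms = [p_norm(x, p) for p in exponents]

for p, value in zip(exponents, norms):
    print(p, value)
print("max norm:", max_norm(x))
```

For this vector the printed values decrease towards 4, the value of $\|x\|_\infty$.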
Remark If $P$ is a projection on a normed linear space $E$ then each $x \in E$
can be written as $x = x_1 + x_2$ where $Px_1 = x_1$ and $Px_2 = 0$. In particular,
take $x_1 \equiv Px$ and $x_2 \equiv (I - P)x$.
Conversely, given two closed linear subspaces $M$ and $N$ of $E$ such that every
$x \in E$ can be expressed uniquely as $x = n + m$, $n \in N$, $m \in M$, then the map
$P : E \to E$ defined by $Px = n$ is a closed linear map and hence (by the
Closed Graph Theorem) is bounded, with $P^2 = P$.
It is then clear that there is a 1-1 correspondence between projections on $E$
and direct sum decompositions of $E$ into two closed subspaces $M$ and $N$. The
subspaces corresponding to a projection $P$ are $N := PE$ and $M := (I - P)E$,
characterized respectively by $Px = x$ and $Px = 0$.
17. Prove that if P1 and P2 are hermitian operators such that P1 P2 = 0 then
P2 P1 = 0.
20. Prove that a linear subspace of a normed linear space is closed if and only if it
is weakly closed.
21. Let E be a separable Banach space. Prove that every bounded sequence in E ∗
has a weak∗ convergent subsequence.
$f^*(x_0) = \|f^*\|$.
26. Let E be a uniformly convex Banach space. If K ⊂ E is a nonempty closed
convex set and x0 ∈ E\K, prove there exists a unique y0 ∈ K such that
27. Let $E$ be a uniformly convex Banach space with dual space $E^*$. Suppose
$f^* \in E^*$, $f^* \ne 0$. Prove there exists a unique $x_0 \in E$, $\|x_0\| = 1$, such that
$f^*(x_0) = \|f^*\|$.
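28. Let $E$ be a uniformly convex Banach space and let $\alpha \in (0, 1)$ and $d > 0$. Prove
that for any $\varepsilon > 0$, if $x, y \in E$ are such that
$\|x\| \le d$, $\|y\| \le d$, $\|x - y\| \ge \varepsilon$, then there exists a $\delta = \delta\left(\frac{\varepsilon}{d}\right) > 0$ such that
\[ \|\alpha x + (1 - \alpha)y\| \le \left[1 - 2\delta\left(\frac{\varepsilon}{d}\right)\min\{\alpha, 1 - \alpha\}\right]d. \]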
(Hint: Without loss of generality, take α ∈ (0, 1/2] and apply Proposition
Definition Let E be a normed linear space over IR and let E ∗ denote the
dual of E. We define the single–valued normalized duality map of E as the
map j : E / E ∗ such that for each x ∈ E, j(x) ∈ E ∗ and satisfies
$\langle x - y, j(x) - j(y)\rangle \ge 0$.
31. (a) Find an example of a complete metric space (E, ρ) and a mapping
f :E / E such that
(b) Prove that if the Contraction Mapping Principle applies to f n where n is
a positive integer, then f has a unique fixed point.
(c) Let (E, ρ) be a complete metric space and f, g : E / E be functions.
Suppose f is a contraction and f (g(x)) = g(f (x))
∀ x ∈ E. Prove that g has a fixed point but that such a fixed point need
not be unique.
Prove: (a) K is nonempty, closed, convex and bounded; (b) ϕ maps K into
K; (c) ϕ is nonexpansive; (d) ϕ has no fixed points.
xn+1 = T xn , n = 1, 2, . . . , x0 ∈ E.
|f (x, y)| ≤ M in B.
Prove that the differential equation
= f (x, y),
with the initial condition y(x0 ) = y0 has a unique solution in the interval
[x0 − ε, x0 + ε].
36. Let C[0, 1] be endowed with the “sup” metric. Define
T : C[0, 1] / C[0, 1]
by
\[ (Tf)(t) = \int_0^t f(s)\,ds, \quad f \in C[0, 1],\ t \in [0, 1]. \]
(a) T is not a contraction map;
(b) T 2 is a contraction map.
(Note: “sup” metric ρ is given by ρ(f, g) = sup |f (t) − g(t)|).
(c) Does T have a fixed point? (Refer to Problem 32(b)).
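As a sanity check on Problem 36 (not a solution), one can compare $\rho(f, g)$, $\rho(Tf, Tg)$ and $\rho(T^2f, T^2g)$ for a single pair of functions. The sketch below assumes the pair $f \equiv 1$, $g \equiv 0$ and represents polynomials by exact coefficient lists; the helper names are our own. Since all coefficients involved are nonnegative, the supremum over $[0, 1]$ is attained at $t = 1$.

```python
from fractions import Fraction

def integrate(coeffs):
    """Coefficients of t -> integral from 0 to t of p(s) ds, for p given by coeffs."""
    return [Fraction(0)] + [c / (k + 1) for k, c in enumerate(coeffs)]

def sup_on_unit_interval(coeffs):
    """sup over [0, 1] of a polynomial with nonnegative coefficients: its value at t = 1."""
    assert all(c >= 0 for c in coeffs)
    return sum(coeffs)

f = [Fraction(1)]            # f(t) = 1; g(t) = 0, so rho(f, g) = sup |f|
Tf = integrate(f)            # (Tf)(t) = t
T2f = integrate(Tf)          # (T^2 f)(t) = t^2 / 2

dist_fg = sup_on_unit_interval(f)
dist_T = sup_on_unit_interval(Tf)
dist_T2 = sup_on_unit_interval(T2f)
print(dist_fg, dist_T, dist_T2)
```

For this pair the distance is unchanged by $T$ and halved by $T^2$; the general estimates asked for in (a) and (b) still have to be proved.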
Definition If X and Y are normed linear spaces and T is a linear map with
domain D(T ) in X and range R(T ) in Y then T is called compact (or com-
pletely continuous) if for each bounded sequence {xn } in D(T ), we have that
{T xn } contains a subsequence converging to some element in Y .
Consequently, to prove that a linear map T from X into Y is compact it suffices
to prove that if B is the unit open ball in X we have T (B) is relatively compact
in Y . Now prove the following basic facts about compact maps (Problems 38
to 42).
37. Prove that every compact linear operator is continuous.
38. Let X and Y be Banach spaces. Prove,
(a) if T : X / Y is linear continuous and dim X < ∞ then T is compact
(i.e., every linear map on a finite dimensional Banach space is compact).
(b) If (i) T : X / Y is compact, and (ii) Range of T, R(T ), is closed then
dim R(T ) < ∞.
(c) If T : X / Y is linear and dim R(T ) < ∞ then T is compact.
41. Let $X$ be a Banach space with $\dim X = \infty$ and let $T : X \to X$ be a linear
compact map. Suppose $T^{-1}$ exists. Is $T^{-1}$ compact? Justify your answer.
Appendix: Completion of a metric space
A1.1 Introduction
Let $(M, \rho)$ be a metric space. A metric space $(W, \rho_W)$ is said to be a completion
of (M, ρ) if and only if the following two conditions hold:
Remark A1 The metric space (M, ρ) need not be a subspace of a completion, but
it is at least isometric to a dense subspace. This situation is illustrated in Fig. A1.
[Fig. A1: $(M, \rho) \cong (W^*, \rho_W)$, with $(W^*, \rho_W)$ shown inside $(W, \rho_W)$]
In what follows we shall show that every incomplete metric space has a completion.
Moreover, we shall show that all completions of a given metric space are isometric.
This implies that completions are essentially unique.
Remark A2
We know that if (X, ρX ) and (Y, ρY ) are two metric spaces that are isometric to
each other, then (X, ρX ) is complete if and only if (Y, ρY ) is. Consequently, since
the incomplete metric space (M, ρ) is isometric to a dense subspace (W ∗ , ρW ) of a
complete metric space (W, ρW ), for questions of continuity and convergence, (M, ρ)
may essentially be regarded as a subspace of its completion.
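\[ \delta_s \le \delta_r + \varepsilon, \]
so that
\[ \delta_s - \varepsilon \le \delta_r \le \delta_s + \varepsilon, \]
\[ |\delta_r - \delta_s| < \varepsilon \quad \text{for all } r, s \ge N_0, \]
and hence, $\{\delta_n\}_{n=1}^\infty$ is a Cauchy sequence in $\mathbb{R}$.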
Lemma A3 Let S denote the set of all Cauchy sequences in (M, ρ), i.e.,
Define a relation ∼ in S as follows:
Lemma A5 If $\{x_n\}$ and $\{x'_n\}$ are in the same equivalence class and if $x_n \to x^*$
then $x'_n \to x^*$.
\[ \rho(x'_n, x^*) \le \rho(x'_n, x_n) + \rho(x_n, x^*) \to 0 \quad \text{as } n \to \infty. \]
Remark A6 Lemma A5 asserts that if an element in any class converges, then any
other element in that class must converge, and must converge to the same limit.
Lemma A7 Suppose $\{x_n\}$ and $\{x'_n\}$ are each equivalent to $\{y_n\}$; then $\{x_n\} \sim \{x'_n\}$.
\[ \rho(x_n, x'_n) \le \rho(x_n, y_n) + \rho(y_n, x'_n) \to 0 \quad \text{as } n \to \infty. \]
Remark A8 A consequence of Lemma A7 is that a sequence cannot belong to two
distinct classes.
Lemma A2, $\lim_{n\to\infty} \rho(x_n, y_n)$ exists. We now define the function $\rho_W : W \times W
\to \mathbb{R}$ by
\[ \rho_W(\Gamma_1, \Gamma_2) = \lim_{n\to\infty} \rho(x_n, y_n). \]
Lemma A9 The function $\rho_W$ is well defined, i.e., if $\{x_n\} \sim \{x'_n\}$ and $\{y_n\} \sim \{y'_n\}$, then
\[ \lim_{n\to\infty} \rho(x_n, y_n) = \lim_{n\to\infty} \rho(x'_n, y'_n). \]
Proof. Let $k = \lim_{n\to\infty} \rho(x_n, y_n)$, $k' = \lim_{n\to\infty} \rho(x'_n, y'_n)$. By Lemma A2 these
limits exist. Let $\varepsilon > 0$ be given. $\{x_n\} \sim \{x'_n\} \Rightarrow \exists$ an integer $N_1 > 0$ such that
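\[ \rho_W(\Gamma, \gamma) = 0 \Leftrightarrow \Gamma = \gamma. \]
To prove the triangle inequality, let $\beta \in W$ and let $\{z_n\} \in \beta$ be arbitrary. Then,
\[ \begin{aligned}
\rho_W(\Gamma, \gamma) &= \lim_{n\to\infty} \rho(x_n, y_n) \\
&\le \lim_{n\to\infty} \rho(x_n, z_n) + \lim_{n\to\infty} \rho(z_n, y_n) \\
&= \rho_W(\Gamma, \beta) + \rho_W(\beta, \gamma).
\end{aligned} \]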
This completes the proof.
With reference to Fig. A1, we have so far constructed the metric space (W, ρW ).
We are yet to prove that it is complete. Meanwhile let us construct the metric space
(W ∗ , ρW ) as a subspace of W , show that it is isometric to (M, ρ) and that it is dense
in (W, ρW ). For the construction of (W ∗ , ρW ), suppose x∗ is an arbitrary element of
M . We immediately construct the constant sequence {x∗ , x∗ , x∗ , . . .}. This sequence
is clearly a Cauchy sequence in M (it is, in fact, a convergent sequence with limit
x∗ ). Since W consists of equivalence classes of all Cauchy sequences in M , this
sequence belongs to some equivalence class in W . Denote this equivalence class by
Γx∗ . Consequently, for all points x in M , we can construct such constant Cauchy
sequences in M and each such sequence belongs to some equivalence class Γx in W .
Let W ∗ denote the set of all such equivalence classes constructed in this way, i.e.,
\[ W^* = \{\Gamma_x : x \in M \text{ and } \Gamma_x \text{ contains the constant sequence } \{x, x, x, \dots\}\}. \]
ϕ(x) = Γx ,
for each x ∈ M . We show ϕ is an isometry. Let x, y ∈ M . Then ϕ(x) = Γx ,
$\varphi(y) = \Gamma_y$ and
\[ \rho_W(\Gamma_x, \Gamma_y) = \lim_{n\to\infty} \rho\left(x^{(n)}, y^{(n)}\right) = \rho(x, y), \]
i.e., ρW (ϕ(x), ϕ(y)) = ρ(x, y), and so ϕ is an isometry. This completes the proof.
Lemma A12. (W ∗ , ρW ) is dense in (W, ρW ).
\[ \rho(x_n, x_m) \to 0 \quad \text{as } n, m \to \infty. \]
Lemma A14. Let $\{x_n\}$ be a Cauchy sequence in a metric space $(M, \rho)$ and let
$\{y_n\}$ be any sequence in $(M, \rho)$ such that $\rho(x_n, y_n) < \frac{1}{n}$ for each $n$. Then,
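(a) We want to prove $\{y_n\}$ is Cauchy. So let $\varepsilon > 0$ be given. Since $\{x_n\}$ is Cauchy,
there exists an integer $N_1 > 0$ such that
\[ \rho(x_n, x_m) < \varepsilon/3 \]
\[ \frac{1}{m} < \varepsilon/3 \quad \text{and} \quad \frac{1}{n} \le \varepsilon/3. \]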
Then, for all m, n ≥ N ,
(b) Assume first that $y_n \to y^*$ as $n \to \infty$. From the inequality
it follows that
\[ \lim_{n\to\infty} \rho(x_n, y^*) \le \lim_{n\to\infty} \left(\frac{1}{n}\right) + \lim_{n\to\infty} \rho(y_n, y^*) = 0. \]
\[ \rho_W(\Gamma_n, \Gamma_n^*) < \frac{1}{n}. \]
By Lemma A14, $\{\Gamma_n^*\}_{n=1}^\infty$ is also Cauchy. However, we determined earlier that $\{\Gamma_n^*\}$
has a limit, say $\Gamma^*$, in $(W, \rho_W)$. By part (b) of Lemma A14, $\Gamma^*$ is also the limit of $\Gamma_n$.
Hence $(W, \rho_W)$ is complete.
Comment Let us now once more return to Fig. A1. We have shown that for any
metric space (M, ρ) we can construct a metric space (W, ρW ) of equivalence classes
of Cauchy sequences that is complete. Furthermore, we have constructed a subspace
(W ∗ , ρW ) of (W, ρW ) which is dense and which is isometric to (M, ρ). Hence (W, ρW )
is a completion of (M, ρ).
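The construction can be made concrete for $M = \mathbb{Q}$ with $\rho(x, y) = |x - y|$. The sketch below is an illustration with sequences of our own choosing, not part of the original text: it builds two different Cauchy sequences of rationals, both approaching $\sqrt{2}$, and checks that $\rho(x_n, y_n)$ becomes small, i.e., that they behave like members of one equivalence class in $W$. Neither sequence has a limit in $\mathbb{Q}$, so the class is not of the form $\Gamma_x$ with $x \in \mathbb{Q}$.

```python
from fractions import Fraction

def newton_sqrt2(steps):
    """Rational Newton iterates x_{k+1} = (x_k + 2/x_k)/2, starting from x_0 = 1."""
    x = Fraction(1)
    seq = [x]
    for _ in range(steps):
        x = (x + 2 / x) / 2
        seq.append(x)
    return seq

def continued_fraction_sqrt2(steps):
    """Rational iterates y_{k+1} = 1 + 1/(1 + y_k), starting from y_0 = 1."""
    y = Fraction(1)
    seq = [y]
    for _ in range(steps):
        y = 1 + 1 / (1 + y)
        seq.append(y)
    return seq

xs = newton_sqrt2(6)
ys = continued_fraction_sqrt2(6)

# rho(x_n, y_n) = |x_n - y_n|, computed exactly in Q.
distances = [abs(x - y) for x, y in zip(xs, ys)]
for n, d in enumerate(distances):
    print(n, float(d))
```

All terms are exact rationals (`Fraction`), so every computation takes place inside the incomplete space $(\mathbb{Q}, \rho)$ itself.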
For the rest of this section we shall prove that all completions of (M, ρ) are isomet-
ric to (W, ρW ). This will then imply that the completion of (M, ρ) is unique. To
accomplish this, we shall make use of an important lemma in analysis. This lemma
concerns the extension of uniformly continuous maps defined on metric spaces.
Lemma A16. Let (M, ρ) be a metric space, not necessarily complete, and let
f : (M, ρ) / (E, ρE )
Proof. We start by observing that if $\{x_n\}$ is a Cauchy sequence in $(M, \rho)$, uniform
continuity of $f$ implies $\{f(x_n)\}$ is also a Cauchy sequence in $(E, \rho_E)$. Since
$(E, \rho_E)$ is complete, $f(x_n) \to e^*$ for some $e^* \in E$. Furthermore $\{x_n\} \subseteq (M, \rho)$
and $(M, \rho)$ is isometric to $(W^*, \rho_W) \subseteq (W, \rho_W)$, where $(W^*, \rho_W)$ is the subspace of
Lemma A17. Let $(W', \rho'_W)$ be a completion of $(M, \rho)$. Then $(W, \rho_W)$ is isometric
to $(W', \rho'_W)$.
This implies that if $u^* \in h_2(x)$ and $v^* \in h_2(y)$, and if $h_1(x) = h(u^*)$ and
$h_1(y) = h(v^*)$ then
\[ \rho_W(h(u^*), h(v^*)) = \rho'_W(u^*, v^*), \]
and so $(h_2(M), \rho'_W)$ is isometric to $(h_1(M), \rho_W)$. Now, using Lemma A16, we can
construct continuous extensions H1 and H2 of h1 and h2 respectively. Set H =
H1 ◦ H2 , the composition of H1 and H2 . Then, it is not difficult to see that
As a consequence of Lemma A2 through to Lemma A17 we have proved the follow-
ing theorem:
Theorem A18 Every metric space (M, ρ) has a completion and all of its com-
pletions are isometric (i.e., completions are essentially unique).
Example A19 We saw in Example 1.15 that the metric space $C[0, 1]$ with metric induced by
$\|f\|_1 = \int_0^1 |f(t)|\,dt$ $\forall f \in C[0, 1]$ is not complete. With this metric, the completion
of $C[0, 1]$ is the space $(L_1[0, 1], \|\cdot\|_1)$ (see e.g., [1],[2],[4],[10] or [13]).