Stoll - Introduction to Real Analysis
Stoll - Introduction to Real Analysis
Real Analysis
Textbooks in Mathematics
Series editors:
Al Boggess, Kenneth H. Rosen
Ordinary Differential Equations
An Introduction to the Fundamentals, Second Edition
Kenneth B. Howell
Differential Geometry of Manifolds, Second Edition
Stephen Lovett
The Shape of Space, Third Edition
Jeffrey R. Weeks
Differential Equations
A Modern Approach with Wavelets
Steven Krantz
Advanced Calculus
Theory and Practice, Second Edition
John Srdjan Petrovic
Advanced Problem Solving Using Maple
Applied Mathematics, Operations Research, Business Analytics, and Decision Analysis
William P Fox, William Bauldry
Nonlinear Optimization
Models and Applications
William P. Fox
Linear Algebra
James R. Kirkwood, Bessie H. Kirkwood
Train Your Brain
Challenging Yet Elementary Mathematics
Bogumil Kaminski, Pawel Pralat
Real Analysis
With Proof Strategies
Daniel W. Cunningham
Introduction to Real Analysis, 3rd Edition
Manfred Stoll
https://www.routledge.com/Textbooks-in-Mathematics/book-series/
CANDHTEXBOOMTH
Introduction to
Real Analysis
Third edition
Manfred Stoll
third edition published 2021
by CRC Press
6000 Broken Sound Parkway NW, Suite 300, Boca Raton, FL 33487-2742
Reasonable efforts have been made to publish reliable data and information, but the author and
publisher cannot assume responsibility for the validity of all materials or the consequences of
their use. The authors and publishers have attempted to trace the copyright holders of all material
reproduced in this publication and apologize to copyright holders if permission to publish in this
form has not been obtained. If any copyright material has not been acknowledged, please write and
let us know so we may rectify in any future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced,
transmitted, or utilized in any form by any electronic, mechanical, or other means, now known or
hereafter invented, including photocopying, microfilming, and recording, or in any information
storage or retrieval system, without written permission from the publishers.
For permission to photocopy or use material electronically from this work, access www.copyright.
com or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood Drive, Danvers, MA
01923, 978-750-8400. For works that are not available on CCC please contact mpkbookspermissions@
tandf.co.uk
Trademark notice: Product or corporate names may be trademarks or registered trademarks and are
used only for identification and explanation without intent to infringe.
Typeset in CMR10
by KnowledgeWorks Global Ltd.
Contents
To the Student xv
Acknowledgments xvii
v
vi Contents
5 Differentiation 181
5.1 The Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . 182
5.2 The Mean Value Theorem . . . . . . . . . . . . . . . . . . . 192
5.3 L’Hospital’s Rule . . . . . . . . . . . . . . . . . . . . . . . . 206
5.4 Newton’s Method . . . . . . . . . . . . . . . . . . . . . . . . 214
Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
Miscellaneous Exercises . . . . . . . . . . . . . . . . . . . . . 221
Supplemental Reading . . . . . . . . . . . . . . . . . . . . . . 222
6 Integration 223
6.1 The Riemann Integral . . . . . . . . . . . . . . . . . . . . . . 224
6.2 Properties of the Riemann Integral . . . . . . . . . . . . . . 240
6.3 Fundamental Theorem of Calculus . . . . . . . . . . . . . . . 248
6.4 Improper Riemann Integrals . . . . . . . . . . . . . . . . . . 258
6.5 The Riemann-Stieltjes Integral . . . . . . . . . . . . . . . . . 264
6.6 Numerical Methods . . . . . . . . . . . . . . . . . . . . . . . 279
6.7 Proof of Lebesgue’s Theorem . . . . . . . . . . . . . . . . . . 291
Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 296
Miscellaneous Exercises . . . . . . . . . . . . . . . . . . . . . 297
Supplemental Reading . . . . . . . . . . . . . . . . . . . . . . 299
Bibliography 531
Index 559
Preface to the Third Edition
The major changes to the third edition involve the exposition of the topological
concepts required in the study of analysis. In the first two editions metric
spaces are scattered throughout the text. For example, the Notes section of
Chapter 3 contained the definitions of a metric space, ǫ-neighborhoods, as
well as open and compact sets. The Miscellaneous Exercises of this chapter
also contained several relevant exercises. The concept of a metric and norm,
as well as ǫ-neighborhoods are also included in the Text in Section 7.4.
In this third edition, I have decided to include the relevant topological
concepts in Chapter 2, beginning with a discussion of metric spaces in Section
2.1. The usual concepts of open and closed sets, as well as limit points of
a set are included in Section 2.2. While the emphasis is on the general, the
examples emphasize these topological concepts on the real line. Section 2.3
contains a brief introduction to compact sets and their properties that require
only the definition of compactness. Finally, the characterization of compact
subsets of R is included in Section 2.4. Chapter 3 now contains all the topics
previously included in Chapter 2, with the exception that the convergence
of a sequence is defined in metric spaces. Likewise, limits of functions and
continuity in Chapter 4 are also defined in metric spaces. Norms and normed
linear spaces are still introduced in Chapter 7, with examples of the usual
normed linear spaces in the remaining chapters.
Even with these changes, the emphasis is still on sequences of real numbers,
the compact subsets of R, as well as real-valued functions. The major theorems
remain unchanged. The main advantages of the revision is that it unifies the
subject matter, provides students with an introduction to metric spaces and
abstract topological concepts, and provides a better preparation for advanced
studies in analysis.
The third edition also includes additional exercises and expanded hints
and solutions. I have also attempted to correct some of the errors that were
present in the earlier editions. The Supplemental Readings sections have also
been updated. The one item that has been deleted is the appendix on Logic
and Proofs previously included in the second edition. In the author’s opinion
these are topics that are best covered in greater detail in a seperate course.
Manfred Stoll
ix
Preface to the First Edition
xi
xii Preface to the First Edition
My second goal was to make the text understandable to the typical stu-
dent enrolled in the course, taking into consideration the variations in abilities,
background, and motivation. For this reason, chapters one through six have
been written with the intent to be accessible to the average student, while at
the same time challenging the more talented student through the exercises.
The basic topological concepts of open, closed, and compact sets, as well as
limits of sequences and functions are introduced for the real line only. However,
the proofs of many of the theorems, especially those involving topological con-
cepts, are presented in a manner that permit easy extensions to more abstract
settings. These chapters also include a large number of examples and more
routine and computational exercises. Chapters seven through ten assume that
the students have achieved some level of expertise in the subject. In these
chapters, function spaces are introduced and studied in greater detail. The
theorems, examples, and exercises require greater sophistication and mathe-
matical maturity for full understanding. From my own experiences, these are
not unrealistic expectations.
The book contains most of the standard topics one would expect to find
in an introductory text on real analysis—limits of sequences, limits of func-
tions, continuity, differentiation, integration, series, sequences and series of
functions, and power series. These topics are basic to the study of real analy-
sis and are included in most texts at this level. In addition I have also included
a number of topics that are not always included in comparable texts. For in-
stance, Chapter 6 contains a section on the Riemann-Stieltjes integral, and
a section on numerical methods. Chapter 7 also includes a section on square
summable sequences and a brief introduction to normed linear spaces. Both
of these concepts appear again in later chapters of the text.
In Chapter 8, to prove the Weierstrass approximation theorem, I use the
method of approximate identities. This exposes the student to a very impor-
tant technique in analysis that is used again in the chapter on Fourier series.
The study of Fourier series, and the representation of functions in terms of
series of orthogonal functions, has become increasingly important in many
diverse areas. The inclusion of Fourier series in the text allows the student
to gain some exposure to this important subject, without the necessity of
taking a full semester course on partial differential equations. In the final
chapter I have also included a detailed treatment of Lebesgue measure and
the Lebesgue integral. The approach to measure theory follows the original
method of Lebesgue, using inner and outer measure. This provides an intuitive
and leisurely approach to this very important topic.
The exercises at the end of each section are intended to reinforce the
concepts of the section and to help the students gain experience in developing
their own proofs. Although the text contains some routine and computational
problems, many of the exercises are designed to make the students think about
the basic concepts of analysis and to challenge their creativity and logical
thinking. Solutions and hints to selected exercises are included at the end of
the text. These problems are marked by an asterisk (*).
Preface to the First Edition xiii
At the end of each chapter I have also included a section of notes on the
chapter, miscellaneous exercises, and a supplemental reading list. The notes in
many cases provide historical comments on the development of the subject, or
discuss topics not included in the chapter. The miscellaneous exercises are in-
tended to extend the subject matter of the text or to cover topics that although
important, are not covered in the chapter itself. The supplemental reading list
provides references to topics that relate to the subject under discussion. Some
of the references provide historical information; others provide alternate solu-
tions of results or interesting related problems. Most of the articles appear in
the American Mathematical Monthly or Mathematics Magazine, and should
easily be accessible for students’ reference.
To cover all the chapters in a one-year sequence is perhaps overly ambi-
tious. However, from my own experience in teaching the course, with a judi-
cious choice of topics it is possible to cover most of the text in two semesters.
A one-semester course should at a minimum include all or most of the first
five chapters, and part or all of Chapter 6 or Chapter 7. The latter chapter
can be taught independently of Chapter 6; the only dependence on Chapter
6 is the integral test, and this can be covered without a theoretical treatment
of Riemann integration. The remaining topics should be more than sufficient
for a full second semester. The only formal prerequisites for reading the text
is a standard three- or four-semester sequence in calculus. Even though an
occasional talented student has completed one semester of this course during
their sophomore year, some mathematical maturity is expected and the aver-
age student might be advised to take the course during their junior or senior
year.
Manfred Stoll
To the Student
xv
xvi To the Student
library. They are included to encourage you to develop the habit of looking
into the mathematical literature.
On reading the text you will inevitably encounter topics, formulas, or ex-
amples that may appear too technical and difficult to comprehend. Skip them
for the moment; there will be plenty for you to understand in what follows.
Upon later reading the section, you may be surprised that it is not nearly as
difficult as previously imagined. Concepts that initially appear difficult be-
come clearer once you develop a greater understanding of the subject. It is
important to keep in mind that many of the examples and topics that appear
difficult to you were most likely just as difficult to the mathematicians of the
era in which they first appeared.
The material in the text is self-contained and independent of the calculus.
I do not use any results from calculus in the definitions and development of
the subject matter. Occasionally, however, in the examples and exercises, I do
assume knowledge of the elementary functions and of notation and concepts
that should have been encountered elsewhere. These concepts will be defined
carefully at the appropriate place in the text.
Manfred Stoll
Acknowledgments
I would like to thank the students at the University of South Carolina who have
learned this material from me, or my colleagues, from preliminary versions of
this text. Your criticisms, comments, and suggestions were appreciated. I am
also indebted to those colleagues, especially the late Jeong Yang, who agreed
to use the manuscript in their courses.
Special thanks are also due to the many reviewers who examined the
manuscripts for the first and second editions and provided constructive criti-
cisms and suggestions for improvements. I would also like to thank the many
readers who over the years have informed me of errors in the text. Hopefully
all of the errors of the first and second edition have been corrected.
Finally, I would like to thank the staff at Addison-Wesley for their assis-
tance in the publication of the first two editions. I would also like to thank
Bob Ross of CRC Press for encouraging me to prepare a third edition, and
his staff for their assistance in the preparation of the third edition.
Manfred Stoll
xvii
1
The Real Numbers
1
2 Introduction to Real Analysis
Set Operations
There are a number of elementary operations which may be performed on
sets. If A and B are sets, the union of A and B, denoted A ∪ B, is the set of
all elements that belong either to A or to B or to both A and B. Symbolically,
A ∪ B = { x : x ∈ A or x ∈ B }.
The intersection of A and B, denoted A ∩ B, is the set of elements that
belong to both A and B; that is
A ∩ B = {x : x ∈ A and x ∈ B}.
Two sets A and B are disjoint if A ∩ B = ∅. The relative complement
B \ A, is the set of all elements which are in B but not in A. In set notation,
B \ A = {x : x ∈ B and x ∈
/ A}.
If the set A is a subset of some fixed set X, then X \ A is usually referred to
as the complement of A and is denoted by Ac . These basic set operations
are illustrated in Figure 1.1 with the shaded areas representing A ∪ B, A ∩ B,
and B \ A, respectively.
FIGURE 1.1
A ∪ B, A ∩ B, B \ A
There are several elementary set theoretic identities which will be encoun-
tered on numerous instances throughout the text. We state some of these in
the following theorem; others are given in the exercises.
4 Introduction to Real Analysis
The identities (a) and (b) are referred to as the distributive laws,
whereas (c) and (d) are De Morgan’s laws. If A and B are subsets of a
set X, then De Morgan’s laws can also be expressed as
(A ∪ B)c = Ac ∩ B c , (A ∩ B)c = Ac ∪ B c .
A more general version of both the distributive laws and De Morgan’s laws
will be stated in Theorems 1.7.12 and 1.7.13.
Proof. We will provide the proof of (a) to illustrate the method used in
proving these results. The proofs of (b) – (d) are relegated to the exercises
(Exercise 7). Suppose x ∈ A ∩ (B ∪ C). Then x ∈ A and x ∈ B ∪ C. Since
x ∈ B ∪ C, x ∈ B or x ∈ C. If x ∈ B, then x ∈ A ∩ B and therefore
x ∈ (A ∩ B) ∪ (A ∩ C). Similarly, if x ∈ C then x ∈ A ∩ C and thus again
x ∈ (A ∩ B) ∪ (A ∩ C). This then proves that
A ∩ (B ∪ C) ⊂ (A ∩ B) ∪ (A ∩ C).
denoted A × B, is defined as the set of all ordered pairs1 (a, b), where the
first component a is from A and the second component b is from B, i.e.,
A × B = {(a, b) : a ∈ A, b ∈ B}.
A × B = {(1, −1), (1, 2), (1, 4), (2, −1), (2, 2), (2, 4)}.
FIGURE 1.2
The Cartesian product A × B
Exercises 1.1
1. Let A = {−1, 0, 1, 2}, B = {−2, 3}, and C = {−2, 0, 1, 5}.
a. Find each of the following: (A ∪ B), (B ∪ C), (A ∩ B), (B ∩ C),
A ∩ (B ∪ C), A \ B, C \ B, A \ (B ∪ C).
b. Find each of the following: (A × B), (C × B), (A × B) ∩ (C × B),
(A ∩ C) × B.
c. On the basis of your answer in (b), what might you conjecture about
(A ∩ C) × B for arbitrary sets A, B, C?
1 A set theoretic definition of ordered pair can be given as follows: (a, b) = {{a}, {a, b}}.
With this definition two ordered pairs (a, b) and (c, d) are equal if and only if a = c and
b = d (Miscellaneous Exercise 1).
6 Introduction to Real Analysis
1.2 Functions
We begin this section with the fundamental concept of a function. In many
texts a function or a mapping f from a set A to a set B is described as a rule
The Real Numbers 7
FIGURE 1.3
A function as a graph
y = f (x) or f : x → y.
8 Introduction to Real Analysis
FIGURE 1.4
A function as a mapping
FIGURE 1.5
The function of Example 1.2.2(a)
Since both (−2, 4) and (−2, 1) are two elements of g with the same first com-
ponent, g is not a function from A to B.
(c) In this example, we let A = B = R, and let h be defined by
h = {(x, y) ∈ R × R : y = x2 + 2}.
h(x) = x2 + 2, Dom h = R.
This specifies both the equation defining the function and the domain of the
function. For this example, Range h = {y ∈ R : y ≥ 2}2 .
(d) Let A be any nonempty set and let
i = {(x, x) : x ∈ A}.
h(x) = x2 + 2, Dom h = R,
and
equality, we require that for every y > 2 there exists an x ∈ R such that x2 + 2 = y. The
existence of such a y will follow as a consequence of Example 1.4.6.
10 Introduction to Real Analysis
EXAMPLES 1.2.4 (a) As in Example 1.2.2 let A = {−3, −2, −1, 0, 1},
B = Z, and f : A → Z the function given by
f = {(−3, 2), (−2, −2), (−1, 4), (0, −6), (1, 4)}.
Since both f (−1) and f (1) are equal to 4, f −1 (4) = {−1, 1}. On the other
hand, since (x, 0) 6∈ f for any x ∈ A, f −1 (0) = ∅.
(b) Consider the function g : Z → Z given by g(x) = x2 , and let E =
{−1, −2, −3, · · · }. Then
h−1 (E) = {x ∈ R : 2x + 3 ∈ E} = {x : −2 ≤ x ≤ − 21 }.
The operations of finding the image or inverse image of a set usually pre-
serve the basic set operations of union and intersection. There is one important
exception which is presented in part (b) of the next theorem.
Proof. To prove (a), let y be an element of f (A1 ∪A2 ). Then y = f (x) for some
x in A1 ∪A2 . Thus x ∈ A1 or x ∈ A2 . Suppose x ∈ A1 . Then y = f (x) ∈ f (A1 ).
Similarly, if x ∈ A2 , y ∈ f (A2 ). Therefore y ∈ f (A1 ) ∪ f (A2 ). Thus
Since it is clear that f (A1 ) and f (A2 ) are subsets of f (A1 ∪ A2 ), the reverse
inclusion also holds, thereby proving equality.
Since f (A1 ∩ A2 ) is a subset of both f (A1 ) and f (A2 ), the relation stated
in (b) is also true.
To see that equality need not hold in (b), consider the function g(x) =
x2 , Dom g = Z, of Example 1.2.4(b). If A1 = {−1, −2, −3, · · · } and A2 =
{1, 2, 3, · · · }, then f (A1 ) = f (A2 ) = {1, 4, 9, · · · }, but A1 ∩ A2 = ∅. Thus
f (A1 ∩ A2 ) = f (∅) = ∅ =
6 f (A1 ) ∩ f (A2 ) = {1, 4, 9, ...}.
Proof. The proof of Theorem 1.2.6 is left to the exercises (Exercise 8).
12 Introduction to Real Analysis
Inverse Function
g = {(y, x) ∈ B × A : f (x) = y}
FIGURE 1.6
The inverse function
The Real Numbers 13
EXAMPLES 1.2.9 (a) Let h be the function of Example 1.2.4 (c); that is,
h(x) = 2x + 3, Dom h = R. The function h is clearly one-to-one and onto R
with
x = h−1 (y) = 21 (y − 3), Dom h−1 = R.
Composition of Functions
Suppose f is a function from A to B and g is a function from B to C. If
x ∈ A, then f (x) is an element of B, the domain of g. Consequently we can
apply the function g to f (x) to obtain the element g(f (x)) in C. This process,
illustrated in Figure 1.7, gives a new function h which maps x ∈ A to g(f (x))
in C.
FIGURE 1.7
Composition of g with f
√
EXAMPLES 1.2.11 (a) If f (x) = 1 + x with Dom f = {x ∈ R : x ≥ −1}
and g(x) = x2 , Dom g = R, then
√
(g ◦ f )(x) = g(f (x)) = ( 1 + x)2 = 1 + x, Dom(g ◦ f ) = {x ∈ R : x ≥ −1}.
Even though the equation (g◦f )(x) = 1+x is defined for all real numbers x, the
domain of the composite function g ◦ f is still only the set {x ∈ R : x ≥ −1}.
For this example, since Range g ⊂ Dom f , we can also find f ◦ g; namely,
p
(f ◦ g)(x) = f (g(x)) = 1 + x2 , Dom f ◦ g = R.
and for y ≥ 0,
p
(f ◦ f −1 )(y) = f (f −1 (y)) = (y 2 − 1) + 1 = y.
Exercises 1.2
1. Let A = {−1, 0, 1, 2} and B = N. Which of the following subsets of A × B
is a function from A into B.
a. f = {(−1, 2), (0, 3), (2, 5)}
*b. g = {(−1, 2), (0, 7), (1, −1), (1, 3), (2, 7)}
c. h = {(−1, 2), (0, 2), (1, 2), (2, −1)}
*d k = {((x, y) : y = 2x + 3, x ∈ A}
The Real Numbers 15
The proof of this theorem depends on the fact that the positive integers
are well-ordered; namely, every nonempty subset of N has a smallest ele-
ment. This statement is usually taken as a postulate or axiom for the positive
integers: we do so in this text. Since it will be used on several other occasions,
we state it both for completeness and emphasis.
WELL-ORDERING PRINCIPLE Every nonempty subset of N has a
smallest element.
The well-ordering principle can be restated as follows: If A ⊂ N, A 6= ∅,
then there exists n ∈ A such that n ≤ k for all k ∈ A.
To prove Theorem 1.3.1 we will use the method of proof by contradiction.
Most theorems involve showing that a statement P implies the statement Q;
namely, if P is true, then Q is true. In a proof by contradiction one assumes
that P is true and Q is false, and then shows that these two assumptions lead
to a logical contradiction; namely show that some statement R is both true
and false.
Proof of Theorem 1.3.1 Assume that the hypothesis of Theorem 1.3.1 are
true, but that the conclusion is false; that is, there exists a positive integer n
such that the statement P (n) is false. Let
A = {k ∈ N : P (k) is false }.
The Real Numbers 17
Thus the identity is valid for n = 1. Assume P (k) is true for k ≥ 1, i.e.,
r − rk+1
r + · · · + rk = , r 6= 1.
1−r
We must now show that the statement P (k + 1) is true; that is
r − r(k+1)+1
r + · · · + rk+1 = , r 6= 1.
1−r
But
r + · · · + rk+1 = r + · · · + rk + rk+1
≥ (1 + k h)(1 + h) = 1 + (k + 1)h + k h2
≥ 1 + (k + 1)h.
Q(n) = P (n + no − 1), n ∈ N,
n + 1 = n, n ∈ N.
This is clearly false! However, if we assume that P (k) is true, then we also
obtain that P (k + 1) is true. Thus it is absolutely essential that P (no ) be true
for at least one fixed value of no .
There is a second version of the principle of mathematical induction which
is also quite useful.
= k · (k − 1)! = k!.
Therefore the identity holds for n = (k + 1), and thus by the principle of
mathematical induction for all n ∈ N.
(b) For our second example, consider the function f : N → R defined by
f (1) = 0, f (2) = 13 , and for n ≥ 3 by f (n) = ( n−1
n+1 )f (n − 2), Computing the
values of f for n = 3, 4, 5, and 6, we have
1 1
f (3) = 0, f (4) = , f (5) = 0, f (6) = .
5 7
From these values we conjecture that
0, if n is odd,
f (n) = 1
, if n is even.
n+1
To prove our conjecture we will use the second principle of mathematical
induction. Our conjecture is certainly true for n = 1, 2. Suppose n > 2, and
suppose our conjecture holds for all k < n. If n is odd, then so is (n − 2),
and thus by the induction hypothesis f (n − 2) = 0. Therefore f (n) = 0. On
the other hand if n is even, so is (n − 2). Thus by the induction hypothesis
f (n − 2) = 1/(n − 1). Therefore
n−1 n−1 1 1
f (n) = f (n − 2) = = .
n+1 n+1 n−1 n+1
20 Introduction to Real Analysis
Exercises 1.3
1. Use mathematical induction to prove that each of the following identities
are valid for all n ∈ N.
n(n + 1)
a. 1 + 2 + 3 + · · · + n = .
2
2
*b. 1 + 3 + 5 + · · · + (2n − 1) = n .
n(n + 1)(2n + 1)
c. 12 + 22 + · · · + n2 = .
6
2
*d. 13 + 23 + · · · + n3 = 12 n(n + 1) .
e. 2 + 22 + 23 + · · · + 2n = 2(2n − 1).
*f. For x, y ∈ R, xn+1 − y n+1 = (x − y)(xn + xn−1 y + · · · + y n ).
1 1 1 n
g. + + ··· + = .
1(2) 2(3) n(n + 1) n+1
2. Use mathematical induction to establish the following inequalities for
n ∈ N.
*a. 2n > n for all n ∈ N b. 2n > n2 for all n ≥ 5
n
*c. n! > 2 for all n ≥ 4 *d 13 + 23 + · · · + n3 < 12 n4 for all n ≥ 3
3. Prove Theorem 1.3.3.
4. *Let f : N → N be defined by f (1) = 5, f (2) = 13, and for n ≥ 3,
f (n) = 2f (n − 2) + f (n − 1). Prove that f (n) = 3 · 2n + (−1)n for all
n ∈ N.
5. For each of the following functions with domain N, determine a formula
for f (n) and use mathematical induction to prove your conclusion.
1
a. f (1) = 12 , and for n > 1, f (n) = (n − 1)f (n − 1) − .
n+1
*b. f (1) = 1, f (2) = 4, and for n > 2, f (n) = 2f (n − 1) − f (n − 2) + 2.
(n + 1)
c. f (1) = 1, and for n > 1, f (n) = f (n − 1).
3n
f (n − 2)
*d. f (1) = 1, f (2) = 0, and for n > 2, f (n) = − .
n(n − 1)
e. For a1 , a2 ∈ R arbitrary, let f (1) = a1 , f (2) = a2 , and for n > 2,
f (n − 2)
f (n) = − .
n(n − 1)
a1 , a2 ∈R arbitrary, let f (1) = a1 , f (2) = a2 , and for n > 2,
*f. For
n−1
f (n) = f (n − 2).
n+1
6. Let f : N → N be defined by f (1) = 1, f (2) = 2, and
1
f (n + 2) = (f (n + 1) + f (n)).
2
Use Theorem 1.3.3 to prove that 1 ≤ f (n) ≤ 2 for all n ∈ N.
r − rn+1
7. *Prove that r + r2 + · · · + rn = , r 6= 1, n ∈ N, without using
1−r
mathematical induction.
The Real Numbers 21
a+b=b+a and a · b = b · a.
a + (b + c) = (a + b) + c and a · (b · c) = (a · b) · c.
a · a−1 = 1.
a · (b + c) = a · b + a · c.
22 Introduction to Real Analysis
The element 0 is called the zero of F and the element 1 is called the unit
of F. For a 6= 0, the element a−1 is customarily written as a1 or 1/a. Similarly
we write a − b instead of a + (−b), ab instead of a · b, and a/b or ab instead of
a · b−1 .
The real numbers R contain a subset P known as the positive real num-
bers satisfying the following:
(O1) If a, b ∈ P, then a + b ∈ P and a · b ∈ P.
(O2) If a ∈ R then one and only one of the following hold:
a ∈ P, −a ∈ P, a = 0.
Properties (O1) and (O2) are called the order properties of R. Any field F
with a nonempty subset satisfying (O1) and (O2) above is called an ordered
field. For the real numbers we assume the existence of a positive set P. For
the rational numbers Q, the set of positive rational numbers is given by
P ∩ Q which can be proved to be equal to { p/q : p, q ∈ Z, q 6= 0, pq ∈ N}.
Let a, b be elements of R. If a − b is positive, i.e., a − b ∈ P, then we write
a > b or b < a. In particular, the notation a > 0 (or 0 < a) means that a is a
positive element. Also, a ≤ b (or b ≥ a) if a < b or a = b.
The following useful results are immediate consequences of the order prop-
erties and the axioms for addition and multiplication. Let a, b, c be elements
of R.
(a) If a > b, then a + c > b + c.
(b) If a > b and c > 0, then ac > bc.
(c) If a > b and c < 0, then ac < bc.
(d) If a 6= 0, then a2 > 0.
(e) If a > 0, then 1/a > 0; if a < 0, then 1/a < 0.
To illustrate the method of proof, we provide the proof of (b). Suppose a > b;
i.e, a − b is positive. If c is positive, then by (O1) (a − b)c is positive. By the
distributive law,
(a − b)c = ac − bc.
Therefore ac − bc is positive; that is, ac > bc. The proofs of the other results
are left as exercises.
The concepts bounded below and lower bound are defined similarly.
A set E is bounded if E is bounded both above and below. We now consider
several examples to illustrate these concepts.
Since p2 < 2, q > p and q 2 < 2. Thus B has no largest element. Similarly, the
set
{p ∈ Q : p > 0, p2 > 2}
has no minimum or smallest element. Intuitively, the largest element of B
would satisfy p2 = 2. However, as was shown in the introduction, there is no
rational number p for which p2 = 2.
β < x ≤ α.
EXAMPLES 1.4.5 In the following examples, let’s consider again the three
sets of the previous examples.
(a) As in Example 1.4.2(a), let A = {0, 12 , 23 , 34 , · · · }. Since 0 is a lower
bound of A and 0 ∈ A, inf A = 0. We now prove that sup A = 1. Since
1 − n1 < 1 for all n = 1, 2, . . . , 1 is an upper bound. To show that 1 = sup A
we need to show that if β ∈ R with β < 1, then β is not an upper bound of
A. Clearly if β ≤ 0, then β is not an upper bound of A. Suppose as in Figure
1.8, 0 < β < 1. Then our intuition tells us that there exists an integer no such
that
1 1
no > , or β <1− .
1−β no
FIGURE 1.8
Proof that sup A = 1 in Example 1.4.5(a)
The Real Numbers 25
EXAMPLE 1.4.6 In this example, we show that for every positive real num-
ber y > 0, there exists a unique positive real number α such that α2 = y; i.e.,
√
α = y. The uniqueness of α was established in Example 1.2.9(b).
We only prove the result for y > 1, leaving the case 0 < y ≤ 1 to the
exercises (Exercise 6). Let
y − α2
y(α + 1)
β =α+ = . (1)
α+y α+y
Then
y(y − 1)(α2 − y)
β2 − y = . (2)
(α + y)2
If α2 < y, then by (1) β > α, and by (2) β 2 < y. This contradicts that α is an
upper bound for C. On the other hand, if α2 > y, then by (1) β < α and by
(2), β 2 > y. Thus if x ∈ R with x ≥ β, then x2 > y. Therefore β is an upper
bound of C. This contradicts that α is the least upper bound of C. Since β
defined by (1) may not be rational, the same proof will not work for the set B
of Example 1.4.5(c). However, using Theorem
√ 1.5.2 of the following section, it
is possible to also prove that sup B = 2.
Intervals
Using the order properties of R, we can define certain subsets of R known as
intervals.
[a, b] = {x ∈ R : a ≤ x ≤ b}.
with analogous definitions for (−∞, b) and (−∞, b]. The intervals (a, ∞), (−∞, b)
and (−∞, ∞) = R are also referred to as open intervals, whereas the intervals
[a, ∞) and (−∞, b] are called closed intervals.
In the above, when b = a, (a, a) = ∅ and [a, a] = {a}. Although the empty
set ∅ and the singleton {a} do not fit our intuitive definition of an interval,
we will include them as the degenerate case of open and closed intervals re-
spectively. It should be noted that the intervals of the form (a, b), (a, b], [a, b),
and [a, b] with a, b ∈ R, a ≤ b, are all bounded subsets of R.
An alternate way of defining intervals without use of the adjectives open
and closed is as follows:
Exercises 1.4
1. Use the axioms for addition and multiplication to prove the following: if
a ∈ R, then
a. a · 0 = 0. b. (−1) · a = −a. c. −(−a) = a.
2. Let a, b ∈ R. Prove the following:
1
a. If a 6= 0, then a
6= 0. b. If a · b = 0, then either a = 0 or b = 0.
3. Let a, b, c ∈ R. Prove the following:
a. If a > b, then a + c > b + c. b. If a 6= 0, then a2 > 0.
c. If a > b and c > 0, then ac > bc.
d. If a > 0 then 1/a > 0, and if a < 0 then 1/a < 0.
4. *If a, b ∈ R, prove that ab ≤ 12 (a2 + b2 ).
5. Find the supremum and the infimum of each of the following sets:
1
*a. A = {1, 12 , 14 , 81 , ....} = : n ∈ N
2n−1
b. B = {cos n π4 : n ∈ N}
*c. C = {(1 − (−1)n )n : n ∈ N}
d. D = {sin n π2 ; n ∈ N}
e. E = {n cos nπ : n ∈ N
2+n
*f. F = :n∈N
n
g. G = (−1)n − n1 : n ∈ N
n x > y.
Proof. If y ≤ 0, then the result is true for all n. Thus assume that y > 0. We
will again use the method of proof by contradiction. Let
A = { nx : n ∈ N }.
If the result is false, that is, there does not exist an n ∈ N such that nx > y,
then nx ≤ y for all n ∈ N. Thus y is an upper bound for A. Thus since A 6= ∅,
A has a least upper bound in R. Let α = sup A. Since x > 0, α − x < α.
Therefore α − x is not an upper bound and thus there exists an element of A,
say mx such that
α − x < mx.
But then α < (m + 1)x, which contradicts the fact that α is an upper bound
of A. Therefore, there exists a positive integer n such that nx > y.
Remark. One way in which the previous result is often used is as follows: given
ǫ > 0, there exists a positive integer no such that no ǫ > 1. As a consequence,
1
<ǫ
n
for all integers n, n ≥ no .
x < r < y.
The Real Numbers 31
m − 1 ≤ nx < m.
Therefore
nx < m ≤ 1 + nx < ny,
or dividing by n,
m
x < < y.
n
If x < 0 and y > 0, then the result is obvious. Finally, if x < y < 0, then by
the above there exists r ∈ Q such that −y < r < −x, i.e., x < −r < y.
The conclusion of Theorem 1.5.2 is often expressed by the statement that
the rational numbers are dense in the real numbers, that is, between any two
real numbers there exists a rational number. A precise definition of “dense”
is given in Definition 2.2.19.
Another consequence of the least upper bound property is the following
theorem concerning the existence of nth roots.
THEOREM 1.5.3 For every real number x > 0 and every positive integer
n, there exists a unique positive real number y so that y n = x
√ 1
The number y is written as n x or x n and is called the nth root of x.
The uniqueness of y is obvious. Since the existence of y will be an immediate
consequence of the intermediate value theorem (Theorem 4.2.11), we omit the
details of the proof in the text. A sketch of the proof of Theorem 1.5.3 using
the least upper bound property is included in the miscellaneous exercises. It
should be emphasized that the proof of Theorem 4.2.11 also depends on the
least upper bound property.
ab = αn β n = (αβ)n .
Exercises 1.5
1. *If r and s are positive rational numbers, prove directly (without using
the supremum property) that there exists an n ∈ N such that nr > s.
2. Given any x ∈ R, prove that there exists a unique n ∈ Z such that
n − 1 ≤ x < n.
3. If r 6= 0 is a rational number and x is an irrational number, prove that
r + x and rx are irrational.
4. *Prove directly (without using Theorem 1.5.2) that between any two
rational numbers there exists a rational number.
5. If x, y ∈ R with x < y, show that x < tx+(1−t)y < y for all t, 0 < t < 1.
6. *a. Prove that between any two rational numbers there exists an irra-
tional number.
*b. Prove that between any two real numbers there exists an irrational
number.
7. If x > 0, show that there exists n ∈ N such that 1 2n < x.
If we set nn
1 nk o
E= + ··· +
k
: k = 1, 2, ... ,
3 3
then E 6= ∅ and E is bounded above by x. As we will shortly see, sup E = x.
In terms of series, which will be covered in detail later, we have
∞
X nk
x= .
3k
k=1
EXAMPLES 1.6.2 (a) We now use the above definition to obtain the
ternary expansion of 31 . At the first step, we must choose n1 as the largest
integer in {0, 1, 2} such that
n1 1
< .
3 3
This inequality fails for n1 = 1, 2. Thus n1 = 0. To find n2 , we choose the
largest integer n2 ∈ {0, 1, 2} such that
0 n2 1
+ 2 < ,
3 3 3
which is satisfied by n2 = 2. To find n3 we must have
0 2 n3 1
+ + 3 < .
3 32 3 3
It is left as an exercise to show that this is satisfied for n3 = 0, 1, 2. Thus we
take n3 = 2. At this stage we conjecture that nk = 2 for all k ≥ 2, and that
1
= .02222... (base 3).
3
34 Introduction to Real Analysis
To see that this indeed is the case we use the fact that for the geometric series
∞
X 1
rk = , 0 < r < 1.
(1 − r)
k=0
Thus
∞ ∞ k
X 2 2 X 1 2 1 1
.0222... = = = 1 = .
3k 9 3 9 1− 3
3
k=2 k=0
In (c) we will illustrate how mathematical induction may also be used to prove
such a result. The above ternary expansion is not unique. The number 13 also
has a finite expansion
1
= .1000... (base 3).
3
We will discuss this in more detail at the end of this section.
(b) The binary expansion of 31 is given by
1
= .010101... (base 2).
3
This expansion can be obtained using the definition and induction (Exercise
4). Alternately, using the geometric series we have
∞ ∞ k
X 1 1 X 1 1
.010101... = = = .
22k 4 4 3
k=1 k=0
1
(c) The ternary expansion of 2 is given by
1
= .1111..... (base 3).
2
We now show in detail how this expansion is derived. Since 31 < 12 and 32 > 12 ,
n1 = 1. We will use the second principle of mathematical induction to prove
that nk = 1 for all k ∈ N. By the above, the result is true for k = 1. Let k > 1
and assume that nj = 1 for all j < k. By definition nk is the largest integer
in {0, 1, 2} such that
1 1 nk 1
+ · · · + k−1 + k < . (3)
3 3 3 2
r − rn+1
Using the identity r + r2 + · · · + rn = , r 6= 1, (Example 1.3.2(a))
1−r
with n = k − 1, we obtain
1 1
−
1 1 3 3k 1 1
+ · · · + k−1 = 2 = 1− .
3 3 3
2 3k−1
The Real Numbers 35
and 3 does not divide a, then x has a finite expansion of the form
a1 am
x= + ··· + m, am ∈ {1, 2},
3 3
or an infinite expansion of the form
∞
a1 0 X 2
x= + ··· + m + when am = 1, or
3 3 3k
k=m+1
∞
a1 1 X 2
x= + ··· + m + , when am = 2.
3 3 3k
k=m+1
Exercises 1.6
1. Find the ternary expansion of each of the following.
1 1
*a. 4
b. 13
2. Find the real number determined by each of the following finite or infinite
binary expansions. In the case of an infinite expansion use the geometric
series as in Example 1.6.2 (c) to determine your answer.
a. .1010 b. .101010 · · · *c. .0101
*d. .010101 · · · e. .001001 *f. .001001 · · ·
3. Find the real number determined by each of the following finite or infinite
ternary expansions.
*a. .0022 b. .00222 · · · c. .1010
*d. .101010 · · · e. .001001001 · · · *f. .121212 · · ·
1
4. *Find the binary expansion of 3
. Use induction to prove the result.
3
5. *Find both the finite and infinite binary expansions of 16
.
6. If 0 < x ≤ 1, prove that the infinite ternary expansion of x is unique.
a
7. If x = m , with a ∈ N odd, and 0 < a < 2m , show that x has a finite
2
binary expansion of the form
a1 am
x= + ··· m, am = 1,
2 2
and an infinite expansion
a1 0 1
+ ··· + m + ∞
P
x= k=m+1 k .
2 2 2
the sets A = {2, 7, 11, 21} and B = {7, 3, 19, 32} both have the same number
of elements; namely, four. We accomplish this by counting the number of
elements in each of the two sets. Alternately, the same can be accomplished,
without counting, by simply pairing up the elements; i.e.,
7↔2
3↔7
19 ↔ 11
32 ↔ 21
For infinite sets, the concept of two sets being of the same size or having
the same number of elements is vague. For example, let S denote the squares
of the positive integers; namely,
S = {12 , 22 , 32 , ....}.
1 ←→ 12
2 ←→ 22
3 ←→ 32
..
.
DEFINITION 1.7.2 For each positive integer n, let Nn = {1, 2, ..., n}. As
in Section 1, N denotes the set of all positive integers. If A is a set, we say:
(a) A is finite if A ∼ Nn for some n, or if A = ∅.
(b) A is infinite if A is not finite.
(c) A is countable if A ∼ N.
(d) A is uncountable if A is neither finite nor countable.
(e) A is at most countable if A is finite or countable.
The reader might want to ponder Exercise 20 at this point. Countable sets
are often called denumerable or enumerable sets. It should be pointed out
that some textbooks, when using the term countable, include the possibility
that the set is finite.
EXAMPLES 1.7.3 (a) As above, let S = {12 , 22 , 32 , ...}. Then the function
g(n) = n2 is a one-to-one mapping of N onto S. Thus S ∼ N and S is countable.
(b) For our second example, we show that Z ∼ N. To see this, consider
the function f : N → Z defined by
n
, (n even),
2
f (n) =
− (n − 1) , (n odd).
2
The following diagram illustrates what the mapping f does to the first few
integers.
1 −→ 0
2 −→ 1
3 −→ −1
4 −→ 2
5 −→ −2
..
.
Sequences
f maps N onto A, each a ∈ A is of the form f (n) for some n ∈ N. For each
a ∈ A, by the well ordering principle
f −1 ({a}) = {n ∈ N : f (n) = a}
Ex = {r ∈ Q : 0 ≤ r < x}.
[ k
[
En is denoted by En ,
n∈Nk n=1
Suppose not, then there exist x ∈ R such that x ∈ In for all n, i.e.,
1
0<x< , for all n ∈ N.
n
This however contradicts Theorem 1.5.1 which guarantees the existence of a
positive integer n such that nx > 1. For the union, since In ⊂ I1 for all n ≥ 1,
∞
[
In = I1 = {x ∈ R : 0 < x < 1}.
n=1
As for a finite number of sets, we also have analogs of the distributive laws
and De Morgan’s laws for arbitrary unions and intersections.
The Countability of Q
The countability of the set of rational numbers Q will follow as a corollary of
the following theorem.
∞
[
S= En ,
n=1
then S is countable.
En = { xn,k : k = 1, 2, .... }.
The Uncountability of R
In November 1873, in a letter to Dedekind, Cantor asked whether the set R
itself was countable. A month later he answered his own question by prov-
ing that R was not countable. We now prove, using Cantor’s elegant “di-
agonal” argument, that the closed interval [0, 1] is uncountable, and thus R
itself is uncountable. For the proof we will use the fact that as in Section
6, every x ∈ [0, 1] has a decimal expansion of the form x = .n1 n2 · · · with
ni ∈ {0, 1, 2, ..., 9}. As for the binary and ternary expansions, the decimal
1
expansion is not necessarily unique. Certain numbers such as 10 have two
expansions; namely
1 1
= .100 · · · and = .0999 · · · .
10 10
This however will not be crucial in the proof of the following theorem.
44 Introduction to Real Analysis
Proof. Since there are infinitely many rational numbers in [0, 1], the set is
not finite. To prove that it is uncountable, we only need to show that it is not
countable. To accomplish this, we will prove that every countable subset of
[0, 1] is a proper subset of [0, 1]. Thus [0, 1] cannot be countable.
Let E = {xn n = 1, 2, ...} be a countable subset of [0, 1]. Then each xn has
a decimal expansion
xn = .xn,1 xn,2 xn,3 · · ·
where for each k ∈ N, xn,k ∈ {0, 1, ..., 9}. We now define a new number
y = .y1 y2 y3 · · ·
Remark. The set A is the set of all functions f from N into {0, 1}. Thus a
sequence f ∈ A if and only if f (n) = 0 or 1 for all n.
Proof. As in the previous theorem, we will also prove that every countable
subset of A is a proper subset of A, and thus A cannot be countable.
Let E be a countable subset of A and let {sn : n = 1, 2, ... } be an enumer-
ation of the set E. For each n, sn is a sequence of 0’s and 1’s. We construct a
new sequence s as follows: for each k ∈ N, let
s(k) = 1 − sk (k).
Exercises 1.7
1. a. Prove that the function f of Example 1.7.3(b) is a one-to-one function
of N onto Z.
b. Find a one-to-one function of Z onto N.
*c. Find a one-to-one function from N onto O, the set of all odd positive
integers.
2. For each of the following determine whether the set is finite, infinite,
countable, or uncountable.
a. {1 − (−1)n : n ∈ N} b. {n cos nπ : n ∈ N}
c. {2n : n ∈ Z} d. 2mn : m, n ∈ N, m odd
e. [0, 1] \ Q f. {x ∈ [0, 1] : x has an infinite ternary expansion}
3. Prove that the function f of Theorem 1.7.4 is a one-to-one function of
N × N onto N.
4. *a. If a, b ∈ R with a < b, prove that (a, b) ∼ (0, 1).
b. Prove that (0, 1) ∼ (0, ∞).
5. Suppose X, Y, Z are sets. If X ∼ Y and Y ∼ Z, prove that X ∼ Z.
6. *a If A ∼ X and B ∼ Y , prove that (A × B) ∼ (X × Y ).
b. If A and B are countable sets, prove that A × B is countable.
7. If X ∼ Y , prove that P(X) ∼ P(Y ).
∞
[ ∞
\
8. Find An and An for each of the following sequences of sets {An }.
n=1 n=1
*a. An = {x ∈ R : −n < x < n}, n ∈ N
b. An = {x ∈ R : − n1 < x < 1}, n ∈ N
*c. An = {x ∈ R : − n1 < x < 1 + n1 }, n ∈ N
d. An = {x ∈ R : 0 ≤ x ≤ 1 − n1 } n ∈ N
*e. An = n1 , 1 − n1 , n ∈ N, n ≥ 2
f. An = {x ∈ R : n ≤ x < ∞}
9. For each x ∈ (0, 1), let Ex = {r ∈ Q : 0 ≤ r < x}. Prove that
\ [
Ex = {0} and Ex = {r ∈ Q : 0 ≤ r < x}.
x∈(0,1) x∈(0,1)
Notes
The most important concept of this chapter is the least upper bound property of the
real numbers. This property will be fundamental in the development of the underly-
ing theory of calculus. In the present chapter we have already seen its application in
proving the Archimedian property (Theorem 1.5.1). For the rational number system
this property can be proved directly. For the real number system it was originally
assumed as an axiom by Archimedes (287–212 B.C.). Cantor however proved that
the Archimedian property was no axiom, but a proposition derivable from the least
upper bound property.
In subsequent chapters the least upper bound property will occur in proofs of
theorems, either directly or indirectly, with regular frequency. It will play a crucial
role in the characterization of the compact subsets of R and in the study of sequences
of real numbers. It will also be required in the proof of the intermediate value theorem
for continuous functions. One of the corollaries of this theorem is Theorem 1.5.3 on
the existence of nth roots of positive real numbers. Many other results in the text
will depend on previous theorems which required the least upper bound property in
their proofs.
Miscellaneous Exercises 47
The emphasis on the least upper bound property is not meant to overshadow
the importance of the concepts of countable and uncountable sets. The fact that
the rational numbers are countable, and thus can be enumerated, will be used on
several occasions in the construction of examples. In all the examples and exercises,
every infinite subset of R turns out to be either countable or equivalent to [0, 1].
Cantor also made this observation and it led him to ask whether this result was true
for every infinite subset of R. Cantor was never able to answer this question; nor
has anyone else. The assertion that every infinite subset of R is either countable or
equivalent to [0, 1] is known as the continuum hypothesis. In 1938 Kurt Gödel
proved that the continuum hypothesis is consistent with the standard axioms of set
theory; that is, Gödel showed that continuum hypothesis cannot be disproved on the
basis of the standard axioms of set theory. On the other hand, Paul Cohen in 1963
showed that the continuum hypothesis is undecidable on the basis of the current
axioms of set theory.
Cantor’s creation of the theory of infinite sets was motivated to a great extend by
problems arising in the study of convergence of Fourier series. We will discuss some
of these problems in greater detail in Chapters 9 and 10. Cantor’s original work on
the theory of infinite sets can be found in his monograph listed in the Supplemental
Readings.
Miscellaneous Exercises
1. Let A and B be nonempty sets. For a ∈ A, b ∈ B, define the ordered pair
(a, b) by
(a, b) = {{a}, {a, b}}.
Prove that two ordered pairs (a, b) and (c, d) are equal if and only if a = c
and b = d.
The following two exercises are detailed and lengthy. The first exercise is
a sketch of the proof of Theorem 1.5.3. The second exercise shows how the
least upper bound property may be used to define the exponential function
bx , b > 1.
2. Let
E = {t ∈ R : t > 0 and tn < x}.
a. Show that E 6= ∅ by showing that x/(x + 1) ∈ E.
b. Show that 1 + x is an upper bound of E.
Let
y = sup E.
The remaining parts of the exercise are to show that y n = x. This will be
accomplished by showing that y n < x and y n > x lead to contradictions,
leaving y n = x. To accomplish this, the following inequality will prove
useful. Suppose 0 < a < b, then
bn − an = (b − a)(bn−1 + abn−2 + · · · + an−1 ) < n(b − a)bn−1 . (*)
48 Introduction to Real Analysis
z + w = (a + c, b + d)
z · w = (ac − bd, ad + bc).
The set of ordered pairs (a, b) of real numbers with the above operations
of addition and multiplication is denoted by C.
a. Prove that (C, +, ·) with zero 0 = (0, 0) and unit 1 = (1, 0) is a field.
b. Set i = (0, 1). Show that i2 = −1.
c. Prove that C is not an ordered field.
Supplemental Reading 49
Supplemental Reading
51
52 Introduction to Real Analysis
For example, |4| = 4 and | − 5| = 5. From the definition, |x| ≥ 0 for all
x ∈ R and |x| = 0 if and only if x = 0. This last statement follows from the fact
that if x 6= 0, then −x 6= 0 and thus |x| > 0. The following theorem, the proof
of which is left to the exercises, summarizes several well known properties of
absolute value.
|x + y| ≤ |x| + |y|.
0 ≤ (x + y)2 = x2 + 2xy + y 2
≤ |x|2 + 2|x||y| + |y|2 = (|x| + |y|)2 .
EXAMPLE 2.1.5 Determine the set of all real numbers x that satisfy the
inequality |2x + 4| < 8. By Theorem 2.1.2(d), |2x + 4| < 8 if and only if
−8 < 2x + 4 < 8, or equivalently, −12 < 2x < 4. Thus the given inequality is
satisfied if and only if −6 < x < 2.
Topology of the Real Line 53
d(x, y) = |x − y|.
for all x, y, z ∈ R. We now extend the notion of distance to sets other than R.
d(x, y) = |x − y|.
The distance between the points x and y is just the usual euclidean distance
between x and y as points in R. However, it is important to remember that
our space is the set X, and not R.
54 Introduction to Real Analysis
R2 = R × R = {(x1 , x2 ) : x1 , x2 ∈ R}.
A geometric proof of the triangle inequality follows from the simple fact
that in a triangle, the length of any one side does not exceed the sum of the
lengths of the other two sides. This inequality can also be proved algebrically
(Exercise 9)
(e) Again we let X = R2 . In the previous example the distance between
points was measured as the length of the straight line segment joining the
two points. This metric however is of little use if one is in a city, such as New
York, where the streets are laid out in a rectangular pattern. In such a setting
a more appropriate way to measure distance is along the actual path one needs
to traverse to get from one point to another. Specifically, for p, q ∈ R2 set
That d1 and d∞ are indeed metrics on R2 is left to the exercises (Exercise 10).
Rn = {(x1 , . . . , xn ) : xi ∈ R, i = 1, . . . , n}.
Clearly, since f and g are bounded, {|f (x) − g(x)| : x ∈ A} is bounded above,
and thus d(f, g) < ∞.
Since this is our first example of a space where the elements are functions
we proceed to show that d is a metric. Clearly d is nonnegative. Furthermore,
since |f (x)−g(x)| ≤ d(f, g) for all x ∈ A, d(f, g) = 0 if and only if f (x) = g(x)
for all x ∈ A. That d(f, g) = d(g, f ) follows from the fact that |f (x) − g(x)| =
|g(x)−f (x)|. It only remains to be shown that d satisfies the triangle inequality.
Let f, g, h be bounded functions on A. Then for x ∈ A
Therefore d(f, h)+d(h, g) is an upper bound for the set {|f (x)−g(x)| : x ∈ A},
and as a consequence,
Exercises 2.1
1. Prove Theorem 2.1.2.
2. *Prove Corollary 2.1.4
3. Prove that for x1 , . . . , xn ∈ R, |x1 + · · · + xn | ≤ |x1 | + · · · + |xn |.
DEFINITION 2.2.1 Let (X, d) be a metric space and let p ∈ X. For ǫ > 0,
the set
Nǫ (p) = {x ∈ X : d(p, x) < ǫ}
is called an ǫ-neighborhood of the point p.
N 21 ( 41 ) = {x ∈ [0, ∞) : |x − 14 | < 12 }
= {x ∈ [0, ∞) : − 14 < x < 43 } = [0, 34 ).
This is easily recognized as the interior of a circle with center a and radius ǫ.
(Figure 2.1) Although ǫ-neighborhoods in the plane R2 are typically drawn as
circular regions, this is only the case for the metric d2 . For other metrics this
need not be the case as is illustrated in Exercise 5 of this section.
FIGURE 2.1
Nǫ (a) for the metric d2
Topology of the Real Line 59
(d) Let A = [a, b] and X the set of real-valued functions on A with the
metric as given in Example 2.1.7(h). For a fixed f ∈ X and ǫ > 0,
EXAMPLES 2.2.4 (a) Let X = R and let E = (a, b] with a < b. Every p
satisfying a < p < b is an interior point of E. If ǫ is chosen such that
then Nǫ (p) ⊂ E. The point b however is not an interior point. For every ǫ > 0,
Nǫ (b) = (b − ǫ, b + ǫ) contains points which are not in E. Any x satisfying
b < x < b + ǫ is not in E. This is illustrated in Figure 2.2. For this example,
Int(E) = (a, b).
FIGURE 2.2
Epsilon neighborhoods of p and b in Example 2.2.4(a)
an interior point of E. The only point about which there may be some doubt
is p = 0. However, if 0 < ǫ < 1, then as in Example 2.2.2(b)
0 < ǫ < min{|p1 − 1|, |p1 − 3|, |p2 − 1|, |p2 − 2|},
DEFINITION 2.2.5
(a) A subset O of R is open if every point of O is an interior point of O.
(b) A subset F of R is closed if F c = R \ F is open.
Remark. From the definition of an interior point it should be clear that a set
O ⊂ R is open if and only if for every p ∈ O there exists an ǫ > 0 (depending on
p) so that Nǫ (p) ⊂ O. Both the definition of interior point and open depend
on the metric of the given set. In situations where there is more than one
metric defined on a given set, we will use the phrase open with respect to d to
emphasize the metric.
EXAMPLES 2.2.6 (a) The entire set R is open. For any p ∈ R and ǫ > 0,
Nǫ (p) ⊂ R. Since R is open, by definition the empty set ∅ is closed. However,
the empty set is also open. Since ∅ contains no points at all, Definition 2.2.5(a)
is vacuously satisfied. Consequently R is also closed.
(b) Every ǫ-neighborhood is open. Suppose p ∈ R and ǫ > 0. If q ∈ Nǫ (p),
then |p − q| < ǫ. Choose δ so that 0 < δ ≤ ǫ − |p − q|. If x ∈ Nδ (q), then
|x − p| ≤ |p − q| + |x − q|
< |p − q| + δ ≤ |p − q| + ǫ − |p − q| = ǫ.
Therefore Nδ (q) ⊂ Nǫ (p) (see Figure 2.3) Thus q is an interior point of Nǫ (p).
Since q ∈ Nǫ (p) was arbitrary, Nǫ (p) is open.
Topology of the Real Line 61
FIGURE 2.3
A delta neighborhood of q in Example 2.2.6(d)
(c) Let E = (a, b], a < b, be as in Example 2.2.4(a). Since the point b ∈ E
is not an interior point of E, the set E is not open. The complement of E is
given by
E c = (−∞, a] ∪ (b, ∞).
An argument similar to the one given in Example 2.2.4(a) shows that a is not
an interior point of E c . Thus E c is not open and hence by definition E is not
closed. Hence E is neither open nor closed.
(d) Let F = [a, b], a < b. Then
and this set is open. This can be proved directly, but also follows as a conse-
quence of Theorem 2.2.9(a) below.
(e) Consider the set Q. Since no point of Q is an interior point of Q
(Example 2.2.4(b)), the set Q is not open. Also, Q is not closed.
The use of the adjective open in describing the intervals (a, b), (a, ∞), (−∞, b)
and (−∞, ∞) is justified by the following theorem:
Proof. Exercise 1.
Then p ∈ Oi for all i = 1, ..., n. Since Oi is open, there exists an ǫi > 0 such
that
Nǫi (p) ⊂ Oi .
Let ǫ = min{ǫ1 , ..., ǫn }. Then ǫ > 0 and Nǫ (p) ⊂ Oi for all i. Therefore
Nǫ (p) ⊂ O, i.e., p is an interior point of O. Since p ∈ O was arbitrary, O is
open.
For closed subsets we have the following analogue of the previous result.
Proof. The proofs of (a) and (b) follow from the previous theorem and De-
Morgan’s laws:
!c c
\ [ n
[ n
\
Fα = Fαc , Fj = Fjc .
α∈A α∈A j=1 j=1
Remark. The fact that the intersection of a finite number of open sets is
open is due to the fact that the minimum of a finite number of positive num-
bers is positive. This guarantees the existence of an ǫ > 0 such that the
ǫ-neighborhood of p is contained in the intersection. For an infinite number
of open sets, the choice of a positive ǫ may no longer be possible. This is
illustrated by the following two examples.
EXAMPLES 2.2.11 We now provide two examples to show that part (b)
of Theorem 2.2.9 is in general false for a countable collection of open sets.
Likewise, part (b) of Theorem 2.2.10 is in general also false for an arbitrary
union of closed sets (Exercise 15).
(a) For each n = 1, 2, ..., let On = (− n1 , n1 ). Then each On is open, but
∞
\
On = {0},
n=1
Limit Points
EXAMPLES 2.2.13 (a) E = (a, b), a < b. Every point p, a < p < b, is a
limit point of E. This follows from the fact that for any ǫ > 0 there exists a
point x ∈ (a, b) satisfying p < x < p + ǫ. These however are not the only limit
points. Both a and b are limit points of E, but they do not belong to E.
(b) E = { n1 : n = 1, 2, . . . }. Each n1 is an isolated point of E. If ǫ is chosen
so that
1 1 1
0<ǫ< = − ,
n(n + 1) n n+1
Then Nǫ ( n1 ) = { n1 }. Hence no point of E is a limit point of E. However, 0 is
a limit point of E which does not belong to E. To see that 0 is a limit point,
given ǫ > 0 choose n ∈ N so that 1/n < ǫ. Such a choice of n is possible by
Theorem 1.5.1. Then 1/n ∈ Nǫ (0) ∩ E, and thus 0 is a limit point of E.
(c) Let E = Q ∩ [0, 1]. If p ∈/ [0, 1] then p is not a limit point of E. For if
p > 1, then for ǫ = 21 (p − 1) we have Nǫ (p) ∩ E = ∅. Likewise when p < 0.
On the other hand, every p ∈ [0, 1] is a limit point of E. Let ǫ > 0 be given.
Suppose first that 0 ≤ p < 1. Then by Theorem 1.5.2 there exists r ∈ Q such
that p < r < min{p + ǫ, 1}. When p = 1, Theorem 1.5.2 also guarantees the
existence of an r ∈ Q ∩ [0, 1] with p − ǫ < r < p. Thus for every ǫ > 0, Nǫ (p)
contains a point r ∈ E with r 6= p. The same argument also proves that every
point of R is a limit point of Q.
Proof. Suppose F is closed. Then by definition F c is open and thus for every
p ∈ F c there exists ǫ > 0 such that Nǫ (p) ⊂ F c , that is, Nǫ (p) ∩ F = ∅.
Consequently no point of F c is a limit point of F . Therefore F must contain
all its limit points.
Conversely, let F be a subset of X that contains all its limit points. To
show F is closed we must show F c is open. Let p ∈ F c . Since F contains all
its limit points, p is not a limit point of F . Thus there exists an ǫ > 0 such
that Nǫ (p) ∩ F = ∅. Hence Nǫ (p) ⊂ F c and p is an interior point of F c . Since
p ∈ F c was arbitrary, F c is open and therefore F is closed.
ǫ = min{d(qi , p) : i = 1, . . . , n}.
Closure of a Set
E = E ∪ E′.
(x − ǫ, x + ǫ) ⊂ U.
In particular (s, x] and [x, t) are subsets of U for some s < x and some t > x.
Define rx and lx as follows:
EXAMPLE 2.2.22 Let X = [0, ∞) and let U = [0, 1). Then U is not open
in R but is open in X. (Why?)
Connected Sets2
Our final topic of this section involves the notion of a connected set. The idea
of connectedness is just one more of the many mathematical concepts which
have their roots in the studies of Cantor on the structure of subsets of R. When
we use the term connected subset of R, intuitively we are inclined to think of
an interval as opposed to sets such as the positive integers N or (0, 1) ∪ {2}.
We make this precise with the following definition.
The definition for a connected set differs from most definitions in that it
defines connectedness by negation; i.e., defining what it means for a set not
to be connected. According to the definition, a set A is not connected if there
exist disjoint open sets U and V satisfying both (a) and (b). As an example of
a subset of R which is not connected, consider the set of positive integers N.
If we let U = ( 12 , 23 ) and V = ( 32 , ∞), then U and V are disjoint open subsets
of R with
U ∩ N = {1} and V ∩ N = {2, 3, ....}
that also satisfy (U ∩ N) ∪ (V ∩ N) = N. That the interval (a, b) is connected
is a consequence of the following theorem, the proof of which is left to the
exercises (Exercise 27).
Exercises 2.2
1. Prove Theorem 2.2.7.
2. *Prove Theorem 2.2.9(a).
3. *a Show that every finite subset of R is closed.
b. Show that the intervals (−∞, a] and [a, ∞) are closed subsets of R.
4. Let X be the metric space of Example 2.1.7(g). Let p = ( 21 , 0). Describe
the ǫ-neigborhoods of p for each of the following values of ǫ: ǫ = 14 , ǫ =
1
2
, ǫ = 43 .
2 This concept, although important and used implicitly in several instances in the text,
will not be required specifically in subsequent chapters except in a few exercises. Thus the
topic of connectedness can be omitted upon first reading of the text.
68 Introduction to Real Analysis
which is a contradiction.
(c) The closed set F = [0, ∞) is not compact. For the open cover U =
{(−1, n)}n∈N of F , no finite sub-collection can cover F . If there exist a finite
number of sets in U which cover F , then there exists N ∈ N such that F ⊂
(−1, N ). (Why?) This however is a contradiction.
THEOREM 2.3.5
(a) Every compact subset of a metric space is closed.
(b) Every closed subset of a compact set is compact.
Proof. (a) To show that K is closed, we need to show that K c is open. Let
p ∈ K c be arbitrary. For each q ∈ K, choose ǫq > 0 such that
Any ǫq satisfying 0 < ǫq < 21 d(p, q) will work. Then {Nǫq (q)}q∈K is an open
cover of K. Since K is compact, there exists q1 , ..., qn such that
n
[
K⊂ Nǫqj (qj ).
j=1
Exercises 2.3
1
1. Let A = n
: n = 1, 2, ... .
a. Show that the set A is not compact.
*b. Prove directly (using the definition) that K = A ∪ {0} is compact.
2. Show that (0, 1] is not compact by constructing an open cover of (0, 1]
that does not have a finite subcover.
3. Suppose A and B are compact subsets of a metric space X.
*a. Prove (using only the definition) that A ∪ B is compact.
b. Prove that A ∩ B is compact.
4. *Let K be a nonempty compact subset of R. Prove that sup K and inf K
exist and are in K.
5. Construct a compact subset K of R with an infinite number of isolated
points. Justify that your set K is compact.
6. Suppose K is an infinite compact subset of a metric space (X, d). Prove
that there exists a countable subset D of K such that D = K.
theorems rank very high among the many important advances in the foun-
dations of analysis during the nineteenth century. The importance of these
results will become evident in later chapters. As is to be expected, the least
upper bound property of R will play a crucial role in the proofs of these
theorems.
|pi | ≤ M, i = 1, . . . , n
Exercises 2.4
* Find a countable collection {Kn }∞
1. S n=1 of compact subsets of R such that
∞
n=1 K n is not compact.
2. a. Suppose I and J are closed and bounded intervals in R. Prove that
I × J is a compact subset of R2 .
4. Let d denote the usual metric on R and let ρ be the metric on R given
by
|x − y|
ρ(x, y) = .
1 + |x − y|
a. Prove that a subset of R is open with respect to the metric d if and
only if it is open with respect to ρ.
b. Show that [0, ∞) is closed and bounded in the metric ρ but that [0, ∞)
is not a compact subset of the metric space (R, ρ).
5. Let X = Q with metric d(p, q) = |p − q|. Let E = {p ∈ Q : p ≥ 0, p2 < 2}.
Show that E is closed and bounded in Q, but not compact.
6. This exercise outlines an alternate proof of the Heine-Borel Theorem.
Suppose [a, b] is not compact. Then there exists an open cover U =
{Uα }α∈A of [a, b] such that no finite sub-collection of U covers [a, b]. We
now proceed to show that this leads toa+b
a contradiction. Divide [a, b] into
two closed subintervals a, a+b
2
and 2 , b each of length (b − a)/2. At
least one of these, call it I1 cannot be covered by a finite number of the
Uα . Repeating this process obtain a sequence {In } of closed and bounded
intervals satisfying (a) [a, b] ⊃ I1 ⊃ I2 ⊃ · · · ⊃ In ⊃ · · · , (b) length of
In = (b − a)/2n , and (c) for each n, In is not covered by a finite number
of the Uα . Now use Exercise 3 to obtain a contradiction.
7. Prove the Bolzano-Weierstrass Theorem using the nested intervals prop-
erty.
Topology of the Real Line 77
of length 312 . This leaves 22 disjoint closed intervals J2,1 , J2,2 , J2,3 , J2,4 of
length 312 ; namely
1 2 3 6 7 8 9
0, 32 , 32 , 32 , 32 , 32 , 32 , 32 .
Set P2 = J2,1 ∪ J2,2 ∪ J2,3 ∪ J2,4 . In Figure 2.4, the shaded intervals indicate
the open intervals that are removed at each stage of the construction.
FIGURE 2.4
Construction of the Cantor set
We continue this process inductively. At the nth step, each Pn is the union
of 2n disjoint closed intervals each of length 1/3n , i.e.,
n
2
[
Pn = Jn,j ,
j=1
is a nonempty compact subset of [0, 1]. The set P is called the Cantor ternary
set.
We now consider some of the properties of the set P .
Property 1 P is compact and nonempty.
Property 2 P contains all the endpoints of the closed intervals {Jn,k },
n = 1, 2, ..., k = 1, 2, ..., 2n .
Property 3 Every point of P is a limit point of P .
Proof. Let p ∈ P and let ǫ > 0 be given. Choose m ∈ N such that 1/3m < ǫ.
Since p ∈ Pm , p ∈ Jm,k for some k, 1 ≤ k ≤ 2m . But
xk xk + 1
Jm,k = m , .
3 3m
Since length of Jm,k = 1/3m < ǫ, Jm,k ⊂ Nǫ (p). Thus both endpoints of Jm,k
are in P ∩ Nǫ (p), and at least one of these is distinct from p.
Property 4 The sum of the lengths of the intervals removed is 1.
Proof. At step 1, we removed one interval of length 1/3. At the second step,
we removed two intervals of length 1/32 . At the nth step, to obtain Pn , we
removed 2n−1 intervals of length 1/3n . Thus we obtain that
1 1 1
Sum of the lengths of the intervals removed = + 2 2 + · · · + 2n−1 n + · · ·
3 3 3
∞ ∞ n
X 2n−1 1 X 2
= =
n=1
3n 3 n=0 3
1 1
= 2 = 1.
3 1− 3
As a consequence of Property 4,
Property 5 P contains no intervals.
For x ∈ [0, 1] let x = .n1 n2 n3 ... be the ternary expansion. As we indicated
in Section 1.6, this expansion is unique except when
a
x= , a∈N with 0 < a < 3m ,
3m
where 3 does not divide a. In this case x has two expansions: a finite expansion
a1 am
x= + ··· + m, am ∈ {1, 2}
3 3
and an infinite expansion. If am = 2, we will use the finite expansion. If
am = 1, we will use the infinite expansion
∞
a1 0 X 2
x= + ··· + m + .
3 3 3k
k=m+1
Topology of the Real Line 79
Proof. Exercise 2.
As a consequence of Property 6 and Theorem 1.7.18,
Property 7 P is uncountable.
For each n, the set Pn has only a finite number of endpoints. As a con-
sequence, the set of points of P which are endpoints of some open interval
removed in the construction is countable. Since P is uncountable, P contains
points other than endpoints. By Exercise 1 of Section 1.6, the ternary expan-
sion of 41 is
1
= .020202.....
4
1 1
Thus 4 ∈ P , but 4 is not an endpoint of P .
Remark. By Property 4, the sum of the lengths of the intervals removed is 1.
This seems to imply that P is in some sense very “small.” On the other hand,
by Property 7 P is uncountable, which seems to imply that P is “large.”
Exercises 2.5
1
1. Determine whether 13
is in the Cantor set.
2. Prove Property 6 of the Cantor set.
3. Let 0 < α < 1. Construct a closed subset F of [0, 1] in a manner similar
to the construction of the Cantor set such that the sum of the lengths of
all the intervals removed is α.
4. *Prove that the Cantor set P is equivalent to [0, 1].
Notes
Without a doubt, the most important concept of this chapter is compactness. The
fact that every open cover of a compact set has a finite subcover will be crucial in
the study of continuous functions, especially uniform continuity. As we will see in
many instances, the applications of compactness depend on the ability to choose a
finite subcover from a particular open cover. A good example of this is the proof of
Theorem 2.3.5. Other instances will occur later in the text.
Since compactness is the most important concept, Theorems 2.4.1 and 2.4.2
are the two most important results. In the Heine-Borel theorem we proved that
80 Introduction to Real Analysis
Miscellaneous Exercises
The first two exercises involve the geometric and euclidean metric structure of Rn .
For n ≥ 2, Rn = {(x1 , ..., xn ) : xi ∈ R, i = 1, ..., n}. For p = (p1 , ..., pn ), q =
(q1 , ..., qn ) in Rn and c ∈ R, define
Also, let 0 = (0, ..., 0). For p, q ∈ Rn , the inner product of p and q, denoted
hp, qi, is defined as
hp, qi = p1 q1 + · · · pn qn .
quantity kpk2 is called the norm or the euclidean length of the vector
p.
a. Use the result of 1(d) to prove that kp + qk2 ≤ kpk2 + kqk2 for all
p, q ∈ Rn .
b. Using the result of (a), prove that d2 (p, q) = kp − qk2 is a metric on
Rn .
3. If E is an uncountable subset of R, prove that some point of E is a limit
point of E.
The following exercise is designed to prove the converse of Theorem 2.3.7;
namely, if K is a subset of a metric space (X, d) having the property that
every infinite subset of K has a limit point in K, then K is compact.
Miscellaneous Exercises 81
4. Let K be a subset of a metric space (X, d) that has the property that
every infinite subset of K has a limit point in K.
a. Prove that there exists a countable subset D of K which is dense
in K. (Hint: Fix n ∈ N. Let p1 ∈ K be arbitrary. Choose p2 ∈ K, if
possible, such that d(p1 , p2 ) ≥ n1 . Suppose p1 , ..., pj have been chosen.
Choose pj+1 , if possible, such that d(p1 , pj+1 ) ≥ n1 for all i = 1, ..., n. Use
the assumption about K to prove that this process must terminate after
a finite numberS of steps. Let Pn denote this finite collection of points,
and let D = n∈I Pn . Prove that D is countable and dense in K.)
b. Let D be as in (a), and let U be an open subset of X such that U ∩K 6=
∅. Prove that there exists p ∈ D and n ∈ N such that N1/n (p) ⊂ U .
c. Using the result of (b), prove that for every open cover U of K, S there
exists a finite or countable collection {Un }n ⊂ U such that K ⊂ n Un .
d. Prove that every countable open cover of K has a finite subcover.
(Hint: SIf {Un }∞n=1 is a countable open cover of K, for each n ∈ N let
Wn = n j=1 Uj . Prove that K ⊂ Wn for some n ∈ N. Assume that the
result is false, and obtain an infinite subset of K with no limit point in
K which is contradiction.)
Supplemental Reading
Asic, M. D. and Adamovic, D. D., tory of the Cantor set and Cantor func-
“Limit points of sequences in met- tion,” Math. Mag. 67 (1994), 136–140.
ric spaces,” Amer. Math. Monthly 77 Geissinger, L., “Pythagoras and
(1970) 613–616. the Cauchy Schwarz inequality,” Amer.
Corazza, P., “Introduction to met- Math. Monthly 83 (1976) 40–41.
ric preserving functions,” Amer. Math. Kaplansky, I., Set Theory and Met-
Monthly 106(1999) 309–323. ric Spaces Chelsea Publ. Co., New York,
Dubeau, F., “Cauchy-Bunyakowski- 1977
Schwarz inequality revisited,” Amer. Labarre, Jr., A. E., “Structure the-
Math. Monthly 99 (1990) 419–421. orem for open sets of real numbers,”
Espelie, M. S. and Joseph, J. E., Amer. Math. Monthly 72 (1965) 1114.
“Compact subsets of the Sorgenfrey Nathanson, M. B., “Round met-
line,” Math. Mag. 49 (1976) 250–251. ric spaces,” Amer. Math. Monthly 82
Fleron, Julian F., “A note on the his- (1975) 738–741.
3
Sequences of Real Numbers
Now that we have covered the basic topological concepts required for the study
of analysis, we begin with limits of sequences. This topic will be our first serious
introduction to the limit process. The notion of convergence of a sequence
dates back to the early nineteenth century and the work of Bolzano (1817)
and Cauchy (1821). Some of the concepts and results included in this chapter
have undoubtedly been encountered previously in the study of calculus. Our
presentation however will be considerably more rigorous—emphasizing proofs
rather than computations.
Although our primary emphasis will be on sequences of real numbers,
these are not the only sequences which are typically encountered. It is not
at all unusual to talk about sequences of functions, sequences of vectors, etc.
For this reason we will begin our study of sequences in the general setting of
metric spaces. Most of the examples however will come from the real numbers.
A good understanding of sequences in R will prove helpful in providing insight
into properties of sequences in more general settings.
We begin the chapter by introducing the notion of convergence of a se-
quence in a metric space, and then by proving the standard limit theorems
for sequences of real numbers normally encountered in calculus. In Section
3.3 we will use the least upper bound property of R to prove that every
bounded monotone sequence of real numbers converges in R. The study of
subsequences and sub-sequential limits will be the topic of Section 3.4. In this
section, we also prove the well known result of Bolzano and Weierstrass that
every bounded sequence of real numbers has a convergent subsequence. This
result will then be used to provide a short proof of the fact that every Cauchy
sequence of real numbers converges. Although the study of series of real num-
bers is the main topic of Chapter 7, some knowledge of series will be required
in the construction of certain examples in Chapters 4 and 6. For this reason
we include a brief introduction to series as the last section of this chapter.
83
84 Introduction to Real Analysis
As a general rule, the integer no will depend on the given ǫ. This will be
illustrated in the following examples.
EXAMPLES 3.1.2 (a) For our first example we show that the sequence
{1/n}∞n=1 converges to 0 in R. The proof of this is the remark following The-
orem 1.5.1; namely, given ǫ > 0, there exists a positive integer no such that
no ǫ > 1. Thus for all n ≥ no ,
| n1 − 0| = 1
n < ǫ.
1
Therefore lim = 0. In this example, the integer no must be chosen so that
n→∞ n
no > 1/ǫ.
(b) If p ∈ R, the sequence {pn } defined by pn = p for all n ∈ N is called the
constant sequence p. Since |pn − p| = 0 for all n ∈ N, we have lim pn = p.
n→∞
∞
2n + 1
(c) Consider the sequence . We will show that
3n + 2 n=1
2n + 1 2
lim = .
n→∞ 3n + 2 3
Since
2n + 1 2 1 1
− = < ,
3n + 2 3 3(3n + 2) 9n
1
given ǫ > 0, choose no ∈ N such that no > 9ǫ . Then for all n ≥ no ,
2n + 1 2
− < ǫ.
3n + 2 3
Sequences of Real Numbers 85
which is a contradiction.
(e) Consider the sequence
1 2n + 1
pn = 1− ,
n 3n + 2
which proves the results. Thus with respect to the usual metric a sequence
{pn } is bounded if and only if the set {pn : n = 1, 2, . . . } is a bounded subset
of R. This however is not always the case for other metrics (see Example
3.1.5(c)).
lim pn = p.
n→∞
Proof. (a) Suppose the sequence {pn } converges to two distinct points p, q ∈
X. Let ǫ = 13 d(p, q). Since pn → p, there exists an integer n1 such that
d(pn , p) < ǫ for all n ≥ n1 . Also, since pn → q, there exists an integer n2 such
that d(pn , q) < ǫ for all n ≥ n2 . Thus if n ≥ max{n1 , n2 }, by the triangle
inequality
d(p, q) ≤ d(pn , p) + d(pn , q) < 2ǫ = 32 d(p, q)
which is a contradiction.
(b) Let {pn } be a convergent sequence in X that converges to p ∈ X. Take
ǫ = 1. For this ǫ, there exists an integer no such that d(pn , p) < 1 for all
n > no . Let
M = max{d(p, p1 ), . . . , d(p, pno ), 1 }.
Then d(p, pn ) ≤ M for all n ∈ N. Therefore the sequence is bounded.
(c) We construct the sequence {pn } in E as follows: Since p is a limit point
of E, for each positive integer n, by the definition of limit point, there exists
pn ∈ E with pn 6= p, such that
1
d(pn , p) < .
n
This sequence clearly satisfies pn → p.
|x − y|
ρ(x, y) = .
1 + |x − y|
With this metric every sequence {pn } in R satisfies ρ(pn , p) < 1 for any p ∈ R.
Thus every sequence {pn } in (R, ρ) is bounded according to Definition 3.1.3.
Obviously however there exist sequences in R for which the range is not a
bounded subset of R. It is interesting to note that a sequence {pn } converges
to p in the usual metric if and only if pn → p in the metric ρ (Exercise 11).
Sequences of Real Numbers 87
√ (d) In this example, we illustrate part (c) of the previous theorem. Since
2 is a limit point of Q, the previous theorem guarantees
√ the existence of
a sequence {rn } of rational numbers such that
√ r n → 2. Note however that
this sequence need not be unique. If rn → 2, then the same is true for the
sequence {rn + n1 }.
Exercises 3.1
1. For each of the following sequences, prove, using an ǫ, no argument that
the sequence converges to the given limit p; that is, given ǫ > 0 determine
no such that |pn− p| < ǫ for all n ≥ no .
3n + 5 3 2n + 5 1
*a. ,p= b. ,p=
2n
2 + 7 2 6n − 3 3
(−1)n
n +1 1
*c. 2
, p = d. 1 − ,p=1
2n n 2 n
(−1) n √ √
e. , p = 0. *f. n+1− n , p=0
n2 + 1
n q o 1
g. n 1 + n1 − 1 , p =
2
2. Show that each of the following sequences
diverge in R.
n n 1 n nπ o
*a. {n (1 + (−1) )}. b. (−1) + . *c. sin .
n 2
(−1)n
n nπ o
d. n sin . e. .
2 n+1
1
3. If b > 0, prove that lim = 0.
n→∞ 1 + nb
11. Consider R with the usual metric, and also with the metric
|x − y|
ρ(x, y) = .
1 + |x − y|
88 Introduction to Real Analysis
THEOREM 3.2.1 If {an } and {bn } are convergent sequences of real num-
bers with
lim an = a and lim bn = b,
n→∞ n→∞
then
(a) lim (an + bn ) = a + b, and
n→∞
(b) lim an bn = a b.
n→∞
bn b
(c) Furthermore, if a 6= 0, and an 6= 0 for all n, then lim = .
n→∞ an a
Proof. The proof of (a) is left to the exercises (Exercise 1). To prove (b), we
add and subtract the term an b to obtain
|an bn − ab| = |(an bn − an b) + (an b − ab)| ≤ |an | |bn − b| + |b| |an − a|.
Since {an } converges, by Theorem 3.1.4(b), {an } is bounded. Thus there exists
a constant M > 0 such that |an | ≤ M for all n. Therefore
Let ǫ > 0 be given. Since an → a, there exists a positive integer n1 such that
ǫ
|an − a| <
2(|b| + 1)
Sequences of Real Numbers 89
for all n ≥ n1 . Also, since bn → b, there exists a positive integer n2 such that
ǫ
|bn − b| <
2M
for all n ≥ n2 . Thus if n ≥ max{n1 , n2 },
ǫ
ǫ
|an bn − ab| < M + |b| < ǫ.
2M 2(|b| + 1)
Therefore lim an bn = ab.
n→∞
To prove (c) it suffices to show that lim 1/an = 1/a. The result (c) then
n→∞
follows from (b). Since a 6= 0 and an → a, there exists a positive integer no
such that
|an − a| < 21 |a|
for all n ≥ no . Also, since
1
|a| ≤ |a − an | + |an | < 2 |a| + |an |
for n ≥ no , we have
1
|an | ≥ 2 |a|
for all n ≥ no . Therefore,
1 1 |a − an | 2
− = < 2 |an − a|.
an a |an ||a| |a|
Let ǫ > 0 be given. Since an → a, we can choose an integer n1 ≥ no so that
|a|2
|an − a| < ǫ
2
for all n ≥ n1 . Therefore
1 1
− <ǫ
an a
for all n ≥ n1 , and as a consequence
1 1
lim = .
n→∞ an a
COROLLARY 3.2.2 If {an } is a convergent sequence of real numbers with
lim an = a, then for any c ∈ R,
n→∞
(a) lim (an + c) = a + c, and
n→∞
(b) lim c an = c a.
n→∞
THEOREM 3.2.3 Let {an } and {bn } be sequences of real numbers. If {bn }
is bounded and lim an = 0, then
n→∞
lim an bn = 0.
n→∞
Proof. Exercise 3.
Remark. Since the sequence {bn } may not converge, Theorem 3.2.1(c) does
not apply. The fact that the sequence {bn } is bounded is crucial. For example,
consider the sequences { n1 } and {3n}.
THEOREM 3.2.4 Suppose {an }, {bn }, and {cn } are sequences of real num-
bers for which there exists no ∈ N such that
a n ≤ bn ≤ c n for all n ∈ N, n ≥ no ,
and that lim an = lim cn = L. Then the sequence {bn } converges and
n→∞ n→∞
lim bn = L.
n→∞
Proof. Exercise 4
The above result, commonly called the squeeze theorem , is very useful
in applications. Quite often to show that a given sequence {an } in R converges
to a, we will first prove that
|an − a| ≤ M bn
for some positive constant M and a nonnegative sequence {bn } with lim bn =
n→∞
0. Since the above inequality is equivalent to
−M bn ≤ an − a ≤ M bn ,
by Theorem 3.2.4 the sequence {an − a} converges to 0, or equivalently that
lim an = a.
n→∞
where !
n n!
=
k k!(n − k)!
is the binomial coefficient.
Sequences of Real Numbers 91
n! = n · (n − 1) · · · 2 · 1,
with the usual convention that 0! = 1. Since the binomial theorem is ancillary
to our main topic of discussion, we leave the proof, using mathematical in-
duction, to the exercises (Exercise 13). An alternate proof using Taylor series
will be provided in Section 8.7.
THEOREM 3.2.6 .
1
(a) If p > 0, then lim = 0.
n→∞ np
√
(b) If p > 0, then lim n p = 1.
n→∞
√
(c) lim n n = 1.
n→∞
nα
(d) If p > 1 and α is real, then lim = 0.
n→∞ pn
Proof. The proofs of (a) and (b) are left to the exercises (Exercise 5). The
proof of (a) is straightforward and the proof of√(b) (for p > 1) is similar to
the proof of (c). For the proof of (c), let xn = n n − 1. Since xn is positive,
by the binomial theorem
!
n n n(n − 1) 2
n = (1 + xn ) ≥ x2n = xn
2 2
n(n − 1) · · · (n − k + 1) nk
> k ,
k! 2 k!
92 Introduction to Real Analysis
and as a consequence,
nα 2k k!
1
0≤ ≤ .
pn qk nk−α
The result now follows by part (a) and Theorem 3.2.4.
±1
(e) Write p as p = , where q > 1. Then
q
1
|pn | = |p|n =
qn
which by part (d) (with α = 0) converges to 0 as n → ∞.
(f) Fix k ∈ N such that k > |p|. For n > k,
n
pn |p|n k (k−1) |p|
= < .
n! n! (k − 1)! k
Since |p|/k < 1, the result follows by (e).
2n + 1 n(2 + n1 ) 2+ 1
n
= 2 = 2 .
3n + 2 n(3 + n ) 3+ n
1 2
Since lim = lim = 0, by Corollary 3.2.2(a),
n→∞ n n→∞ n
1 2
lim 2 + = 2 and lim 3 + = 3.
n→∞ n n→∞ n
Therefore by Theorem 3.2.1(b) and (c),
1
2+ 1 lim 2 + n 1 2
n n→∞
lim 2 = 2
=2· = .
n→∞ 3+ n lim 3 + n
3 3
n→∞
(−1)n
(b) Consider the sequence √ . We first note that
2 n+7
(−1)n 1 1
0≤ √ ≤ √ .
2 n+7 2 n
Thus by Theorems 3.2.4 and 3.2.6(a) with p = 21 ,
(−1)n
lim √ = 0.
n→∞ 2 n + 7
Sequences of Real Numbers 93
2n + n3
(c) For our next example we consider the sequence . As in (a),
3n + n2
we first factor out the dominant power in both the numerator and denomi-
nator. By Theorem 3.2.6(d), lim nα /pn = 0 for any α ∈ R and p > 1. This
n→∞
simply states that pn (p > 1) grows faster than any power of n. Therefore
the dominant terms in both the numerator and denominator are 2n and 3n ,
respectively. Thus
3 n 3
2n + n3 2n (1 + 2nn ) 2 (1 + 2nn )
= 2 = 2 .
3n + n2 3n (1 + 3nn ) 3 (1 + 3nn )
By Theorems 3.2.1 and 3.2.6(d)
n3
(1 + 2n )
lim n2
= 1.
n→∞ (1 + 3n )
2 n
Finally, since lim 3 = 0 (Theorem 3.2.6(e)), we have
n→∞
2n + n 3
lim = 0.
n→∞ 3n + n2
(e) As out final example we consider the sequence {n((1 + n1 )−2 − 1)}.
Before we can evaluate the limit of this sequence we must first simplify the
nth term of the sequence. This is accomplished as follows:
n2
1 −2 1
xn = n((1 + n ) − 1) = n −1 =n −1
(1 + n1 )2 (n + 1)2
−2n2 − n
−2n − 1
=n = .
(n + 1)2 (n + 1)2
Now we can factor out an n2 from both the numerator and the denominator.
This gives
−2 − n1
xn = .
(1 + n1 )2
Using the limit theorem we now conclude that lim xn = −2.
n→∞
Exercises 3.2
1. Prove Theorem 3.2.1(a).
2. Let {an } and {bn } be sequences of real numbers.
a. If {an } and {an + bn } both converge, prove that the sequence {bn }
converges.
b. Suppose bn 6= 0 for all n ∈ N. If {bn } and {an /bn } both converge,
prove that the sequence {an } also converges.
94 Introduction to Real Analysis
E = { an : n = 1, 2, .... }.
96 Introduction to Real Analysis
lim an = a.
n→∞
a − ǫ < ano ≤ a.
FIGURE 3.1
Nested intervals property
In fact, one can show that equality holds. (See Exercise 3 of Section 2.4.)
EXAMPLES 3.3.4 (a) Our first example shows that the conclusion of
Corollary 3.3.3 is false if the intervals In are not closed. As in Example
1.7.11(b), for each n ∈ N set In = (0, n1 ). Then In ⊃ In+1 for all n, but
∞
\
In = ∅.
n=1
The conclusion of Corollary 3.3.3 may also be false if the intervals In are
unbounded (Exercise 1).
(b) Consider the sequence {pn } with 0 < p < 1. Even though Theorem
3.2.6(e) applies, we use the results of this section to prove that lim pn = 0.
n→∞
For n ∈ N set sn = pn . Since p > 0, sn > 0 for all n ∈ N. Thus {sn } is
bounded below. Also, since 0 < p < 1,
Thus the sequence {sn } is monotone decreasing, bounded below, and hence
by Theorem 3.3.2, is convergent. Let s = lim sn . Then
n→∞
1 1
a = lim an+1 = lim (2an + 5) = (2a + 5).
n→∞ n→∞ 6 6
Solving the equation a = 61 (2a + 5) for a gives a = 45 .
√
(d) Let a1 = 1, and for n > 1, set an+1 = 2an . To investigate the
convergence of the sequence {an }, we will establish by induction that
Thus the statement is true for n = 1. Assume that it is true for n = k. Then
√ p
1 ≤ ak+1 = 2ak < 2ak+1 = ak+2 ,
and p √
ak+2 = 2ak+1 < 4 = 2.
Thus the sequence {an } is monotone increasing, bounded above by 2, and
hence by Theorem 3.3.2 is convergent. It is possible to prove directly that
sup{an : n = 1, 2, ...} = 2.
Euler’s Number e
EXAMPLE 3.3.5 In this example, we consider in detail the very important
sequence {tn }∞
n=1 , where for each n ∈ N,
n
1
tn = 1 + .
n
We will show that the sequence {tn } is monotone increasing and bounded
above, and thus has a limit. The standard notation for this limit is e (in
honor of Leonhard Euler); i.e.,
n
1
e = lim 1 + .
n→∞ n
By the binomial theorem
n
1 1 n(n − 1) 1 n(n − 1) · · · 1 1
tn = 1 + =1+n· + 2
+ ··· + · n . (1)
n n 1·2 n 1 · 2···n n
For k = 1, ..., n, the (k + 1)-st term on the right side is
n(n − 1) · · · (n − k + 1) 1
· k
1 · 2···k n
which is equal to
1 1 2 k−1
1− 1− ··· 1 − . (2)
1 · 2···k n n n
If we expand tn+1 in the same way, we obtain n+2 terms, and for k = 1, 2, ..., n
the (k + 1)-st term is
1 1 2 k−1
1− 1− ··· 1 − ,
1 · 2···k n+1 n+1 n+1
which is greater than the corresponding term in (2). Thus tn < tn+1 for all n.
From (1) we also obtain
1 1 1
tn ≤ 1 + 1 + + + ··· +
1·2 1·2·3 1 · 2···n
1 1 1
≤ 1 + 1 + + 2 + · · · + n−1 ,
2 2 2
n−1
= (r − rn ) (1 − r), r 6= 1,
which by the identity 1 + r + · · · + r
1 − ( 12 )n 1
= 1+ <1+ = 3.
1 − 21 1− 1
2
Thus {tn } is bounded above by 3, and we can apply Theorem 3.3.2. Since tn ≤
3 for all n ∈ N we also have that e ≤ 3. To five decimal places, e = 2.71828....
The number e is the base of the natural logarithm function which will be
defined in Example 6.3.5 as a definite integral.
100 Introduction to Real Analysis
Infinite Limits
If a monotone increasing sequence {an } is bounded above then by Theorem
3.3.2 the sequence converges. If the sequence {an } is not bounded above, then
for each positive real number M there exists no ∈ N such that an ≥ M for all
n ≥ no . Since the real number M can be taken to be arbitrarily large, this is
usually expressed by saying that the sequence {an } diverges to ∞. We make
this concept precise, not only for monotone sequences, but for any sequence
of real numbers with the following definition.
EXAMPLE 3.3.8 Consider the sequence {n(2 + (−1)n )}. If n is even, then
n(2 + (−1)n ) = 3n; if n is odd, then n(2 + (−1)n ) = n. In either case,
n(2 + (−1)n ) ≥ n,
and thus the sequence diverges to ∞. The sequence however is clearly not
monotone.
Sequences of Real Numbers 101
Exercises 3.3
1. *Show by example that the conclusion of Corollary 3.3.3 is false if the
intervals In with In ⊃ In+1 are not bounded.
2. Show that each of the following sequences are monotone. find a lower or
upper
√bound ifit exists.
Find the limit
if you
can.
n2 + 1 1 1
*a. b. a+ a− , a > 1.
n n n
π 2π nπ
c. {an } , a > 1 d. {sn } where sn = cos3 + cos2 + · · · + cos2
n 2 2 2
n
e.
n!
√ √
3. Define the sequence {an } as follows: a1 = 2, and an+1 = 2 + an .
a. Show that an ≤ 2 for all n.
b. Show that the sequence {an } is monotone increasing.
c. Find lim an .
n→∞
4. *Let a1 > 1, and for n ∈ N, n ≥ 1, define an+1 = 2 − 1 an . Show that
the sequence {an } is monotone and bounded. Find lim an .
n→∞
5. Let 0 < a < 1. Set t1 = 2, and for n ∈ N, set tn+1 = 2 − a/tn . Show that
the sequence {tn } is monotone and bounded. Find lim tn .
n→∞
√
6. Let α > 0. Choose x1 > α. For n = 1, 2, 3, ..., define
1 α
xn+1 = xn + .
2 xn
*a. Show that the sequence {xn } is monotone and bounded.
√
b. Prove that lim xn = α
n→∞
√
c. Prove that 0 ≤ xn − α ≤ (x2n − α)/xn .
7. In Exercise
√ 6, let α = 3 and x1 = 2. Use part (c) to find xn such that
|xn − 3| < 10−5 .
8. For each of the following prove that the sequence {an } converges and find
the limit. √
1
a. an+1 = √ 6
(2an + 5), a1 = 2 b. an+1 = √2an , a1 = 3
*c. an+1 = √2an + 3, a1 = 1 d. an+1 =√ 2an + 3, a1 = 4
*e. an+1 = 3an − 2, a1 = 4 f. an+1 = 3an − 2, a1 = 23
9. Let A be a nonempty subset of R that is bounded above and let α =
sup A. Show that there exists a monotone increasing sequence {an } in
A such that α = lim an . Can the sequence {an } be chosen to be strictly
increasing?
10. Use (
Example 3.3.5)to find the
(limit of each )
of the following sequences.
2n n+1
1 1
*a. 1+ b. 1+
n n
( 3n ) n
1 1
*c. 1+ d. 1−
2n n
102 Introduction to Real Analysis
1 1
11. *For each n ∈ N, let sn = 1 + + · · · + . Show that {sn } is monotone
2 n
increasing but not bounded above.
1 1
12. For each n ∈ N, let sn = 1 + √ + · · · + √ . Show that the sequence
2 n
{sn } is monotone increasing but not bounded above.
1 1 1
13. *For each n ∈ N, let sn = 2 + 2 + · · · + 2 . Show that the sequence
1 2 n
{sn } is monotone increasing and bounded above by 2.
14. Let 0 < b < 1. For each n ∈ N, let sn = 1 + b + b2 + · · · + bn . Prove
that the sequence {sn } is monotone increasing and bounded above. Find
lim sn .
n→∞
Our first result assures us that for convergent sequences, every subsequence
also converges to the same limit.
104 Introduction to Real Analysis
THEOREM 3.4.3 Let (X, d) be a metric space and let {pn } be a sequence
in X. If {pn } converges to p, then every subsequence of {pn } also converges
to p.
Proof. Let {pnk } be a subsequence of {pn }, and let ǫ > 0 be given. Since
pn → p, there exists a positive integer no such that d(pn , p) < ǫ for all n ≥ no .
Since {nk } is strictly increasing, nk ≥ no for all k ≥ no . Therefore,
d(pnk , p) < ǫ
for all n, the sequence {pn } is monotone decreasing, bounded below, and hence
converges. Let
a = lim pn .
n→∞
By Theorem 3.4.3 the subsequence {p2n } also converges to a. But p2n = (pn )2 ,
and thus
a = lim p2n = lim (pn )2 = a2 .
n→∞ n→∞
2
Thus a = a. Since 0 ≤ a < 1, we must have a = 0.
(b) In our second example we show how the previous theorem may be used
to prove divergence of a sequence. Consider the sequence {sin nθπ}, where θ
is a rational number with 0 < θ < 1. Write θ = a/b, with a, b ∈ N and
b ≥ 2. When a = kb, k ∈ N, then sin nθπ = sin kaπ = 0. Therefore 0 is a
subsequential limit of the sequence. On the other hand, if n = 2kb + 1, k ∈ N,
then
a a
sin nθπ = sin (2kb + 1) π = sin 2kaπ + π
b b
a a
= cos(2kaπ) sin π = sin π.
b b
Since 0 < a/b < 1, sin ab π 6= 0. Thus sin ab π is another distinct subsequential
limit of {sin nθπ}. Hence as a consequence of Theorem 3.4.3 the sequence
{sin nθπ} diverges. The result is still true if θ is irrational. The proof however
is much more difficult.
THEOREM 3.4.5 Let K be a compact subset of a metric space (X, d). Then
every sequence in K has a convergent subsequence which converges in K.
Sequences of Real Numbers 105
Exercises 3.4
1. a. Prove that 0 and 2 are the only subsequential limits of the sequence
{1 − (−1)n }.
b. Prove that 0 and ∞ are the only subsequential limits of the sequence
{n(1 + (−1)n )}.
2. a. Construct a sequence {sn } for which the subsequential limits are
{−∞, −2, 1}.
b. Construct a sequence {sn } for which the set of subsequential limits is
countable.
3. Findnall the osubsequential limitsn of the following sequences.
nπ nπ o
*a. sin . b. n sin .
2 4
(−1)n
*c. 1 − d. {(1.5 + (−1)n )n }.
n n nπ o
*e. (−1)n + 2 sin nπ
2
f. n sin
4
4. Use Example 3.3.5 to find the limit of each of the following sequences.
Justify your answer.
( 6n ) n n
1 1 2
*a. 1+ b. 1+ c. 1+
3n 2n n
√
5. Suppose p > 1. Use the method of Example 3.4.4 to show that lim n p =
n→∞
1.
6. For n ∈ N set pn = n1/n .
a. Show that 1 < pn+1 < pn for all n ≥ 3.
b. Let p = lim pn . Use the fact that the subsequence {p2n } also con-
n→∞
verges to p to conclude that p = 1.
7. Let {pn } be a bounded sequence of real numbers and let p ∈ R be such
that every convergent subsequence of {pn } converges to p. Prove that the
sequence {pn } converges to p.
8. Prove Theorem 3.4.7.
9. Prove that every sequence in R has a monotone subsequence.
10. *Prove that every bounded sequence in Rn has a convergent subsequence.
11. Use the Bolzano-Weierstrass theorem to prove the nested intervals prop-
erty (Corollary 3.3.3).
12. Prove that every uncountable subset of R has a limit point in R.
limit of a sequence, the limit superior and limit inferior of a sequence always
exist. The concept of the limit superior and limit inferior will also be important
in our study of both series of real numbers and power series.
Let {sn } be a sequence in R. For each k ∈ N, we define ak and bk as
follows:
ak = inf{ sn : n ≥ k },
bk = sup{ sn : n ≥ k }.
Recall that for a nonempty subset E of R, sup E is the least upper bound of
E if E is bounded above, and ∞ otherwise.
From the definition, ak ≤ bk for all k. Furthermore, the sequences {ak }
and {bk } satisfy the following:
for all k. To prove (3), let Ek = {sn : n ≥ k}. Then Ek+1 ⊂ Ek . Therefore, if
bk = sup Ek , sn ≤ bk for all n ≥ k. In particular
s n ≤ bk for all n ≥ k + 1.
Therefore bk+1 = sup Ek+1 ≤ bk . A similar argument will show that the
sequence {ak } is nondecreasing.
As a consequence of (3) the sequence {ak } is monotone increasing and
the sequence {bk } is monotone decreasing. Thus by Theorems 3.3.2 and 3.3.7,
these two sequences always have limits in R ∪ {−∞, ∞}.
We now give several examples for which we will compute the limit inferior
and limit superior. As will be evident, these computations are very tedious.
An easier method will be given in Theorem 3.5.7.
(c) {(−1)n + n1 }∞ n
n=1 . Set sn = (−1) + 1/n. Then
(
−1 + n1 , n odd,
sn =
1 + n1 , n even.
To compute the limit superior and inferior of the sequence {sn }, we set Ek =
{sn : n ≥ k}. If k is even, then
n o
1
Ek = 1 + k1 , −1 + k+1 1
, 1 + k+2 ,··· .
Remark. The statement “sn < β + ǫ for all n ≥ no ” means that sn < β + ǫ
for all but finitely many n. On the other hand, the statement “given n, there
exists k ∈ N with k ≥ n such that sk > β − ǫ” means that sn > β − ǫ for
infinitely many indices n.
Proof of Theorem 3.5.3 We will only proof (a). The proofs of (b) and (c)
are left to the exercises (Exercise 5).
(a) Suppose β = lim sn = lim bk where
k→∞
bk = sup{sn : n ≥ k}.
Let ǫ > 0 be given. Since lim bk = β there exists a positive integer no such
k→∞
that bk < β + ǫ for all k ≥ no . Since sn ≤ bk for all n ≥ k,
sk > bn − ǫ ≥ β − ǫ,
Conversely, assume that (i) and (ii) hold. Let ǫ > 0 be given. By (i) there
exists no ∈ N such that sn < β + ǫ for all n ≥ no . Therefore
bno = sup{sn : n ≥ no } ≤ β + ǫ.
sn = (−1)n + 1/n
of Example 3.5.2(c). For this sequence, lim sn = 1 and lim sn = −1. Given
ǫ > 0, then
sn < 1 + ǫ
for all n ∈ N with n ≥ 1/ǫ. Since the odd terms get close to −1, we can never
have the existence of an integer no such that sn > 1 − ǫ for all n ≥ no . On
the other hand, given any n ∈ N, there exists an even integer k ≥ n such that
sk > 1 − ǫ.
An immediate consequence of the previous two theorems is as follows:
Proof. Suppose lim sn = lim sn = α ∈ R. Let ǫ > 0 be given. By (a) and (a’)
of the previous two theorems, there exist positive integers n1 and n2 , such
that
Thus if no = max{n1 , n2 },
α − ǫ < sn < α + ǫ
Proof. Exercise 6
The following theorem relates the limit superior and inferior of a sequence
to the subsequential limits of the sequence, and is in fact very useful for finding
lim sn and lim sn of a sequence {sn }.
Again, such an integer exists by (i) and (ii) of Theorem 3.5.3(a). Then {snk }
is a subsequence of {sn } which clearly converges to s. Therefore s ∈ E. The
case s = ∞ is treated similarly. If s = −∞, then by (c) of Theorem 3.5.3,
sn → −∞ as n → ∞.
Since s ∈ E, s ≤ sup E. It remains to be shown that sup E = s. If s = ∞
we are done. Otherwise, suppose sup E = β > s. Suppose β 6= ∞. Then there
exists α ∈ E such that
s < α ≤ β.
Since α ∈ R, we can choose ǫ > 0 such that s + ǫ < α − ǫ. For this ǫ, there
exists no ∈ N such that sn < s + ǫ for all n ≥ no . Hence there can exist only
finitely many k such that
|sk − α| < ǫ.
Consequently no subsequence of {sn } can converge to α. This contradiction
shows that sup E = s. The case β = ∞ is treated similarly.
112 Introduction to Real Analysis
nπ
(c) Let sn = sin . If n is even, i.e., n = 2k, then s2k = sin kπ = 0. On
2
the other hand, if n is odd, i.e., n = 2k+1, then s2k+1 = sin(2k+1) π2 = (−1)k .
Hence the set of subsequential limits of the sequence {sn } is {−1, 0, 1}. As a
consequence,
lim sn = −1 and lim sn = 1.
Exercises 3.5
1. Find the limit inferior and limit superior of each of the following se-
quences.
n nπ o n nπ o
*a. n sin b. (1 + (−1)n ) sin
4 4
n + (−1)n n2
n n
*c. d. {[1.5 + (−1) ] }
n2 + 1
1 − 2(−1)n n
1
e. + n(1 + cos nπ) *f.
n 3n + 2
2. Let {an } be a sequence in R. If lim |an | = 0, prove that lim an = 0.
n→∞
3. *Let {rn } be an enumeration of the rationals in (0, 1). Find lim rn and
lim rn .
4. Let {sn } be a sequence in R. If s ∈ R satisfies that for every ǫ > 0, there
exists no ∈ N such sn < s + ǫ for all n ≥ no , prove that lim sn ≤ s.
5. a. Prove Theorem 3.5.3(b).
b. Prove Theorem 3.5.3(c).
6. *a. Let {an } and {bn } be bounded sequences in R. Prove that
lim an + lim bn ≤ lim(an + bn ) ≤ lim an + lim bn .
b. Give an example to show that equality need not hold in (a).
7. a. If an and bn are positive for all n, prove that
lim(an bn ) ≤ (lim an ) (lim bn ),
provided the product on the right is not of the form 0 · ∞.
b. Need equality hold in (a)?
Sequences of Real Numbers 113
In order to apply the definition to prove that a given sequence {pn } converges,
it is required that we know the limit of the sequence {pn }. For this reason,
theorems that provide sufficient conditions for convergence, such as Theorem
3.3.2, are particularly useful. The drawback to Theorem 3.3.2 is that it applies
only to monotone sequences of real numbers. In this section, we consider
another criterion that for sequences in R is sufficient to ensure convergence of
the sequence.
Remark. In the above definition, the criterion d(pn , pm ) < ǫ for all integers
n, m ≥ no is equivalent to
d(pn+k , pn ) < ǫ
for all n ≥ no and all k ∈ N. Thus if {pn } is a Cauchy sequence in X,
lim d(pn+k , pn ) = 0
n→∞
Proof. (a) Suppose that {pn } converges to p ∈ X. Let ǫ > 0 be given. Then
for the given ǫ, there exists a positive integer no such that
d(pn , p) < 21 ǫ
1 1 1 1
1
d( n1 , m )= − ≤ + < ǫ.
n m n m
Thus the sequence {1/n} is Cauchy but does not converge in X. In R the
sequence converges to 0, but 0 ∈
/ X. Intuitively, a Cauchy sequence that fails
to converge does so because of the absence of a point or element in the space
to which it can converge.
(b) Let X = Q with d(p, q) = |p − q|. If {pn } is any sequence of rational
numbers that converges to an irrational number, then the sequence {pn } is a
Cauchy sequence in (Q, d), which however does not converge in Q.
Since pnk → p, for the given ǫ, there exists an integer k1 such that
EXAMPLES 3.6.7 (a) For our first example we consider the sequence {sn }
where for n ∈ N
1 1
sn = 1 + 2 + · · · + 2 .
2 n
For k ∈ N,
1 1
|sn+k − sn | = + ··· +
(n + 1)2 (n + k)2
1 1 1 1
≤ − + ··· + −
n n+1 n+k−1 n+k
1 1
= − .
n n+k
In the above we have used the inequality
1 1 1
≤ −
(n + m)2 n+m−1 n+m
As a consequence, for n ≥ 2,
This last statement is most easily verified by induction (Exercise 5). For m ≥ 1,
consider |an+m − an |. By the triangle inequality,
m−1
X m−1
X
|an+m − an | = an+k+1 − an+k ≤ |an+k+1 − an+k |,
k=0 k=0
m−1 m
X 1 1 X 1
≤ |a2 − a1 | = |a − a1 |
n−2 2
.
2n+k−1 2 2k
k=0 k=1
By Example 1.3.2(a)
m
X r − rm+1
rk = , r 6= 1. (5)
1−r
k=1
Thus with r = 21 ,
m 1
X 1 2 − ( 12 )m+1 1
= = 1 − m < 1.
2k
k=1
1 − 21 2
Therefore,
1
|an+m − an | ≤ |a2 − a1 |
2n−2
for all n ≥ 2 and m ∈ N. Let ǫ > 0 be given. Choose no such that
|a2 − a1 |/2n−2 < ǫ for all n ≥ no . Then by the above,
|an+m − an | < ǫ
Sequences of Real Numbers 117
exists in R.
Can we find the limit a here? If we take the same approach as in Example
3.3.4(c), by taking the limit of both sides of equation (5) we only get a = a.
To find the value of a, let us observe that
n
X
= (a2 − a1 ) (− 21 )k−1
k=1
= 23 (a2 − a1 )[1 − (− 21 )n ].
The last equality follows from formula (5). Since an+1 → a and (− 21 )n → 0,
upon taking the limit of both sides we obtain
a − a1 = 32 (a2 − a1 ) or a = a1 + 23 (a2 − a1 ).
Contractive Sequences
One of the key properties of the sequence {an } of the previous example was
that
|an+1 − an | ≤ 21 |an − an−1 |
for all n ≥ 2. This property was used to show that the sequence {an } was a
Cauchy sequence and thus converged. Sequences that satisfy a criterion such
as the above are commonly referred to as contractive sequences. We make this
precise in the following definition.
for all n ∈ N, n ≥ 2.
118 Introduction to Real Analysis
bn−1
d(pn+m , pn ) ≤ bn−1 d(p2 , p1 )(1 + b + · · · + bm−1 ) < d(p2 , p1 )
1−b
for all n, m ∈ N. As a consequence, every contractive sequence is a Cauchy
sequence. Therefore, if (X, d) is a complete metric space, every contractive
sequence in X converges to a point in X. We summarize this in the following
theorem.
THEOREM 3.6.9 Let (X, d) be a complete metric space. Then every con-
tractive sequence in X converges in X. Furthermore, if the sequence {pn } is
contractive and p = lim pn , then
bn−1
(a) d(p, pn ) ≤ d(p2 , p1 ), and
1−b
b
(b) d(p, pn ) ≤ d(pn , pn−1 ), where 0 < b < 1 is the constant in
1−b
Definition 3.6.8.
Proof. We leave the details of the proof to the exercises (Exercise 9).
Exercises 3.6
1. If {an } and {bn } are Cauchy sequences in R, prove (without using The-
orem 3.6.5) that {an + bn } and {an bn } are also Cauchy sequences.
2. For each of the following determine whether the given sequence is a
Cauchy
sequence.
(−1)n
n+1
*a. b. {(−1)n } c. n +
n n
n
1 + (−1)n n 1 + (−1)n n2
1
*d. e. f. 1 + √ .
n2 + 3 2n2 + 3 n
1 1 1
3. For n ∈ N let sn = 1 + + + · · · + . Prove that {sn } is a Cauchy
2! 3! n!
sequence.
1 1
4. Consider the sequence {sn } defined by sn = 1 + + · · · + .
2 n
*a. Show that {sn } is not a Cauchy sequence.
b. Even though {sn } is not a Cauchy sequence, show that
lim |sn+k − sn | = 0 for all k ∈ N.
n→∞
6. If K is a compact subset of a metric space (X, d), prove that every Cauchy
sequence in K converges to a point in K.
7. Prove that (R2 , d2 ) is complete.
8. Let {an } be the sequence of Example 3.6.7(b).
a. Use mathematical induction to prove that
1 1 1
a2k+1 = 2k−1 (a1 + a2 ) + (a1 + 2a2 ) 1 − k−1 .
2 3 4
b. Use the result of (a) to find lim an .
9. Prove Theorem 3.6.9.
10. *Let a1 > 0, and for n ≥ 2, define an = (2 + an−1 )−1 . Prove that {an }
is contractive, and find lim an .
n→∞
11. Let c1 ∈ (0, 1) be arbitrary, and for n ∈ N set cn+1 = 51 (c2n + 2).
a. Show that {cn } is contractive.
b. Let c = lim cn Show that c is a solution of x2 − 5x + 2 = 0.
n→∞
14. Prove that if every Cauchy sequence in R converges, then every nonempty
subset of R that is bounded above has a supremum.
For each n ∈ N, sn is called the nth partial sum of the series and an is called
the nth term of the series.
∞
P
The series ak converges if and only if the sequence {sn } of nth partial
k=1
sums converges in R. If lim sn = s, then s is called the sum of the series,
n→∞
and we write
X∞
s= ak .
k=1
∞
P
If the sequence {sn } diverges, then the series ak is said to diverge.
k=1
EXAMPLES 3.7.2 (a) For |r| < 1, consider the geometric series
∞
X
rk .
k=1
For n ∈ N,
n
X
sn = rk = r + r2 + · · · + rn .
k=1
Thus
(1 − r) sn = sn − r sn = r − rn+1 ,
and as a consequence
r − rn+1
sn = .
1−r
Sequences of Real Numbers 121
n
(
X
k 0, if n is even,
sn = (−1) =
k=1
−1, if n is odd.
∞
P
THEOREM 3.7.3 (Cauchy Criterion) The series ak converges if and
k=1
only if given ǫ > 0, there exists a positive integer no , such that
m
X
ak < ǫ
k=n+1
Proof. Since
m
X
ak = |sm − sn |,
k=n+1
∞
P
EXAMPLE 3.7.4 In this example, we show that the series 1/k diverges.
k=1
We accomplish this by showing that the sequence {sn } of partial sums is not
a Cauchy sequence. Consider
1 1
s2n − sn = + ··· + , n ∈ N.
n+1 2n
There are exactly n terms in the sum on the right, and each term is greater
than or equal to 1/2n. Therefore
1 1
s2n − sn ≥ n = .
2n 2
The sequence {sn } therefore fails to be a Cauchy sequence and thus the series
diverges. The divergence of this series appears to have been first established by
Nicole Oresme (1323?–1382) using a method of proof similar to that suggested
in the solution of Exercise 11 of Section 3.3
∞
P
COROLLARY 3.7.5 If ak converges, then lim ak = 0.
k=1 k→∞
∞
P
THEOREM 3.7.6 Suppose ak ≥ 0 for all k ∈ N. Then ak converges if
k=1
and only if {sn } is bounded above.
Proof. Since ak ≥ 0 for all k, the sequence {sn } is monotone increasing. Thus
by Theorem 3.3.2, the sequence {sn } converges if and only if it is bounded
above.
Sequences of Real Numbers 123
Exercises 3.7
1 1 1 1
1. *Using the inequality ≤ = − , prove that the series
k2 k(k − 1) k−1 k
∞
X 1
converges.
k2
k=1
∞
X 1
2. Prove that the series converges.
k2 + k
k=1
∞
rk diverges.
P
3. If |r| ≥ 1, show that the series
k=1
∞ 1
P
4. Prove that the series converges. (See Exercise 13 of Section 3.6.)
k=1 k!
P
5. *Suppose ak ≥ 0 for all k. Prove that if ak converges, then
∞ √
X ak
converges.
k
k=1
∞
P ∞
P
6. If ak and bk both converge, prove each of the following:
k=1 k=1
∞
P
a. cak converges for all c ∈ R.
k=1
∞
P
b. (ak + bk ) converges.
k=1
∞
P
7. If (ak + bk ) converges, does this imply that the series
k=1
a1 + b1 + a2 + b2 + · · · converges?
8. Suppose bk ≥ ak ≥ 0 for all k ∈ N.
∞
P P∞
a. If bk converges, prove that ak converges.
k=1 k=1
∞
P ∞
P
b. If ak diverges, prove that bk diverges.
k=1 k=1
∞
X 1
9. Consider the series , p ∈ R.
kp
k=1
Notes
This chapter provided our first serious introduction to the limit process. In subse-
quent chapters we will encounter limits of functions, the derivative, and the integral,
all of which are further examples of the limit process. Of the many results proved
124 Introduction to Real Analysis
in this chapter, it is difficult to select one or two for special emphasis. They are all
important! Many of them will be encountered again—either directly or indirectly—
throughout the text.
Some of the concepts and results of this chapter have certainly been encountered
previously; others undoubtedly are new. Two concepts which may not have been
previously encountered are limit superior (inferior) of a sequence of real numbers and
complete metric spaces. The primary importance of the limit superior and inferior
of a sequence is that these two limit operations always exist in R ∪ {−∞, ∞}. As we
will see in Chapter 7, this will allow us to present the correct statements of the root
and ratio test for convergence of a series. The limit superior will also be required to
define the radius of convergence of a power series. There will be other instances in
the text where these two limit operations will be encountered.
In the chapter we have proved several important consequences of the least up-
per bound property of R. The least upper bound property was used to prove that
every bounded monotone sequence converges. This result was subsequently used to
prove the nested intervals property, which in turn can be used to provide a proof
of the Bolzano-Weierstrass theorem. The nested intervals property can also be used
to prove the supremum property of R (Exercise 21 of Section 3.3). Another prop-
erty of the real numbers that is equivalent to the least upper bound property is the
completeness property of R; namely, every Cauchy sequence of real numbers con-
verges. Other consequences of the least upper bound property will be encountered
in subsequent chapters.
Cauchy sequences were originally studied by Cantor in the middle of the nine-
teenth century. He referred to them as “fundamental sequences” and used them in
his construction of the real number system R (See Miscellaneous Exercises 4 – 11).
The main reason that these sequences are attributed to Cauchy, rather than Cantor,
is due to the fact that his 1821 criterion for convergence of a series (Theorem 3.7.3) is
equivalent to the statement that the sequence of partial sums is a Cauchy sequence.
The fact that Cauchy was a more prominent mathematician than Cantor may also
have been a factor.
Miscellaneous Exercises
The first three exercises involve the concept of an infinite product. Let {ak } be a
sequence of nonzero real numbers. For each n = 1, 2, ..., define
n
Y
pn = ak = a1 · a2 · · · an .
k=1
Given a Cauchy sequence {an } in Q, let [{an }] denote the set of all
Cauchy sequences in Q equivalent to {an } . The set [{an }] is called the
equivalence class determined by {an }.
5. Given two Cauchy sequences {an } and {bn } in Q, prove that [{an }] =
[{bn }] provided {an } ∼ {bn }, and [{an }] ∩ [{bn }] = ∅ otherwise.
Let R denote the set of equivalence classes of Cauchy sequences in Q. We
denote the elements of R by lower case Greek letters α, β, γ, .... Thus if
α ∈ R, α = [{an }] for some Cauchy sequence {an } in Q. The sequence
{an } is called a representative of the equivalence class α. Suppose α =
[{an }] and β = [{bn }]. Define −α, α + β, and α · β as follows:
−α = [{−an }],
α + β = [{an + bn }],
α · β = [{an bn }].
126 Introduction to Real Analysis
One needs to show that these operations are well defined; that is, inde-
pendent of the representative of the equivalence class. For example, to
prove that −α is well defined, we suppose that {an } and {bn } are two rep-
resentatives of α; i.e., {an } ∼ {bn }. But then by 4(d), {−an } ∼ {−bn }.
Therefore, [{−an }] = [{−bn }]. This shows that −α is well defined.
6. Prove that the operations + and · are well defined on R.
For each p ∈ Q, let {p} denote the sequence all of whose terms are equal
to p. If p ∈ Q, let αp = [{p}]. Also, we set
θ = [{0}], ι = [{1}].
As we will see, the element θ will be the zero of R and ι will be the unit
of R. A Cauchy sequence {bn } in Q belongs to θ if and only if bn → 0.
Similarly, {an } ∈ ι if and only if (an − 1) → 0. The following problem
provides us with the multiplicative inverse of α 6= θ.
7. If α 6= θ, prove that there exists {an } ∈ α such that an 6= 0 for all n ∈ N,
and that { a1n } is a Cauchy sequence. Define α−1 = [{ a1n }].
8. Prove that R with operations + and · is a field.
αp + αq = αp+q
αp · αq = αpq
The above exercises prove that R is an ordered field which satisfies the least
upper bound property. One can show that any two complete ordered fields are in
fact isomorphic, that is, there exists a one-to-one map of one onto the other which
preserves the operations of addition, multiplication, and the order properties. Thus
R is isomorphic to the real numbers R.
Sequences of Real Numbers 127
Supplemental Reading
The concept of limit dates back to the late seventeenth century and the work
of Isaac Newton (1642–1727) and Gottfried Leibniz (1646–1716). Both of
these mathematicians are given historical credit for inventing the differential
and integral calculus. Although the idea of “limit” occurs in Newton’s work
Philosophia Naturalis Principia Mathematica of 1687, he never expressed the
concept algebraically; rather he used the phrase “ultimate ratios of evanescent
quantities” to describe the limit process involved in computing the derivatives
of functions.
The subject of limits lacked mathematical rigor until 1821 when Augustin-
Louis Cauchy (1789–1857) published his Cours d’Analyse in which he offered
the following definition of limit: “If the successive values attributed to the
same variable approach indefinitely a fixed value, such that finally they differ
from it by as little as desired, this latter is called the limit of all the others.”
Even this statement does not resemble the modern delta-epsilon version of
limit given in Section 1. Although Cauchy gave a strictly verbal definition of
limit, he did use epsilons, deltas, and inequalities in his proofs. For this reason
Cauchy is credited for putting calculus on the rigorous basis with which we
are familiar today.
Based on the previous study of calculus, the student should have an in-
tuitive notion of what it means for a function to be continuous. This most
likely compares to how mathematicians of the eighteenth century perceived
a continuous function; namely one that can be expressed by a single formula
or equation involving a variable x. Mathematicians of this period certainly
accepted functions that failed to be continuous at a finite number of points.
However, even they might have difficulty envisaging a function that is contin-
uous at every irrational number and discontinuous at every rational number
in its domain. Such a function is given in Example 4.2.2(g). An example of an
increasing function having the same properties will also be given in Section 4
of this chapter.
In Section 1 we define the limit at a point of a real-valued function defined
on a subset of a metric space, and provide numerous examples to illustrate this
idea. In Sections 2 and 3 we consider the closely related theory of continuity
and investigate some of the consequences of this very important concept.
129
130 Introduction to Real Analysis
Our intuition should convince us that v(t) approaches −64 ft/sec as t ap-
proaches 2, and that this is the velocity upon impact.
As another example, consider the function f (x) = x sin x1 , x 6= 0. Here the
function f is not defined at x = 0. Thus to investigate the behavior of f at 0
we need to consider the values f (x) for x close to, but not equal to 0. Since
for all x 6= 0, our intuition again should tell us that f (x) approaches 0 as x
approaches 0. This indeed is the case as will be shown in Example 4.1.10(c).
We now make this idea of f (x) approaching a value L as x approaches a
point p precise. In order that the definition be meaningful, we must require
that the point p be a limit point of the domain of the function f .
|f (x) − L| < ǫ
for all points x ∈ E satisfying 0 < d(x, p) < δ. If this is the case, we write
This is illustrated graphically in Figure 4.1 for the case where E is a subset
of R.
FIGURE 4.1
lim f (x) = L
x→p
Remarks. (a) In the definition of limit, the choice of δ for a given ǫ may
depend not only on ǫ and the function, but also on the point p. This will be
illustrated in Example 4.1.2(f) below.
(b) If p is not a limit point of E, then for δ sufficiently small, there do not
exist any x ∈ E so that 0 < |x − p| < δ. Thus if p is an isolated point of E,
the concept of the limit of a function at p has no meaning.
(c) In the definition of limit, it is not required that p ∈ E, only that p is
a limit point of E. Even if p ∈ E, and f has a limit at p, we may very well
have that
lim f (x) 6= f (p).
x→p
an ǫ > 0, such that for every δ > 0, there exists an x ∈ E with 0 < |x − p| < δ,
for which
|f (x) − L| ≥ ǫ.
We will illustrate this in Example 4.1.2(d).
x2 − 4
|f (x) − 4| = − 4 = |x + 2 − 4| = |x − 2|.
x−2
Thus given ǫ > 0, the choice δ = ǫ works in the definition.
(b) Consider the following variation of (a). Let g be defined on R by
2
x − 4,
x 6= 2,
g(x) = x−2
2, x = 2.
For this example, 2 is a point in the domain of g, and it is still the case that
lim g(x) = 4. However, the limit does not equal g(2) = 2. The graph of g is
x→2
given in Figure 4.2.
FIGURE 4.2
Graph of g
We claim that lim h(x) = 1/2. This result is obtained as follows: For x 6= 0,
x→0
√ √ √
x+1−1 x+1−1 x+1+1 x 1
= √ = √ =√ .
x x x+1+1 x( x + 1 + 1) x+1+1
From this last term we now conjecture that h(x) → 1/2 as x → 0. By the
above,
√
1 1 1 1− x+1
h(x) − = √ − = √
2 x+1+1 2 2( x + 1 + 1)
√ √
(1 − x + 1)(1 + x + 1) −x
= √ = √
2( x + 1 + 1)2 2( x + 1 + 1)2
1 |x|
= √ .
2 ( x + 1 + 1)2
√
For x ∈ E we have ( x + 1 + 1)2 > 1, and thus
1 |x|
h(x) − < .
2 2
Given ǫ > 0, let δ = min{1, ǫ}. Then for 0 < |x| < δ,
1 |x| δ
h(x) − < < < ǫ,
2 2 2
and thus lim h(x) = 1/2.
x→0
(d) Let f be defined on R as follows:
(
1, x ∈ Q,
f (x) =
0, x∈6 Q.
We will show that for this function, lim f (x) fails to exist for every p ∈ R.
x→p
Fix p ∈ R. Let L ∈ R and let
|f (x) − L| = |1 − L| = ǫ.
If ǫ = |L|, then by Exercise 6, Section 1.5, for any δ > 0, there exists an
irrational number x with 0 < |x − p| < δ. Again, for such an x, |f (x) − L| = ǫ.
Thus with ǫ as defined, for any δ > 0, there exists an x with 0 < |x − p| < δ
such that |f (x) − L| ≥ ǫ. Since this works for every L ∈ R, lim f (x) does not
x→p
exist.
134 Introduction to Real Analysis
Then lim f (x) = 0. Since |f (x)| ≤ |x| for all x, given ǫ > 0, any δ, 0 < δ ≤ ǫ,
x→0
will work in the definition of the limit. A modification of the argument given
in (d) shows that for any p 6= 0, lim f (x) does not exist. An alternate proof
x→p
will be provided in Example 4.2.2(b)
(f ) This example shows dramatically how the choice of δ will in general
depend not only on ǫ, but also on the point p. Let E = (0, 1) and let f : E → R
be defined by
1
f (x) = .
x
We will prove that for p ∈ (0, 1],
1 1
lim = .
x→p x p
1 1
− <1
x p
for all x, p ∈ (0, 1) with 0 < |x − p| < δ. Since any smaller δ will also work,
we can assume that 0 < δ < 21 . But now if we take p = 21 δ and x = δ, then
0 < |x − p| < δ and thus
1 2 1
|f (x) − f (p)| = − = > 1.
δ δ δ
This contradiction proves that the choice of δ must depend on both p and ǫ.
Limits and Continuity 135
(g) For our final example we consider the function f defined on R2 \ (0, 0)
xy
given by f (x, y) = 2 . We will prove that the limit lim f (x, y) = 52 .
x + y2 (x,y)→(1,2)
Consider
2 5xy − 2x2 − 2y 2
f (x, y) − =
5 5(x2 + y 2
(x − 2y)(y − 2) + (4y − 2x)(x − 1)
=
5(x2 + y 2 )
(|x| + 2|y|)|y − 2| + (4|y| + 2|x|)|x − 1|
≤ .
5(x2 + y 2
If (x, y) ∈ N 12 (1, 2), then 12 < |x| < 32 and 23 < |y| < 52 . (Verify!) Therefore for
(x, y) ∈ N 21 (1, 2) we have |x| + 2|y| < 13 26 2 2 25
2 , 4|y| + 2|x| < 2 , and 5(x + y ) > 2 .
Therefore
2 26
f (x, y) − < (|y − 2| + |x − 1|).
5 25
Hence if δ is chosen so that 0 < δ ≤ 21 , then for (x, y) ∈ Nδ (1, 2) we have
2 52
f (x, y) − < δ.
5 25
25
Thus given ǫ > 0, if we choose δ such that 0 < δ < min{ 21 , 52 ǫ}, then (x, y) ∈
2
Nδ (1, 2) implies that |f (x, y) − 5 | < ǫ.
Since lim pn = p, for the above δ, there exists a positive integer no such that
n→∞
Theorem 4.1.3 is often applied to show that a limit does not exist. If one
can find a sequence {pn } with pn → p, such that {f (pn )} does not converge,
then lim f (x) does not exist. Alternately, if one can find two sequences {pn }
x→p
and {rn } both converging to p, but for which
then again lim f (x) does not exist. We illustrate this with the following two
x→p
examples.
FIGURE 4.3
Graph of f (x) = sin(1/x), x > 0
Limit Theorems
then
(a) lim [f (x) + g(x)] = A + B,
x→p
(b) lim f (x)g(x) = AB, and
x→p
f (x) A
(c) lim = , provided B 6= 0.
x→p g(x) B
138 Introduction to Real Analysis
Proof. For (a) and (b) apply Theorem 4.1.3 and Theorem 3.2.1, respectively.
We leave the details to the exercises (Exercise 11).
Proof of (c). By (b) it suffices to show that
1 1
lim = .
x→p g(x) B
The proofs of the following two theorems are easy consequences of Theo-
rem 4.1.3 and the corresponding theorems for sequences (Theorems 3.2.3 and
3.2.4). First however we give the following definition.
x3 + 2x2 − 2x − 4 (x + 2)(x2 − 2) x2 − 2
2
= = .
x −4 (x + 2)(x − 2) x−2
1
lim x sin = 0.
x→0 x
The graph of f (x) = x sin x1 is given in Figure 4.4.
(d) Let E = (0, ∞) and let f be defined on E by
sin t
f (t) = .
t
140 Introduction to Real Analysis
FIGURE 4.4
Graph of f (x) = x sin(1/x), x 6= 0
As we will see in the next chapter, this limit will be crucial in computing the
derivative of the sine function. From Figure 4.5, we have
sin t
lim = 1.
t→0 t
Limits and Continuity 141
FIGURE 4.5
Triangles and sector of Example 4.1.10(d)
Limits at Infinity
Up to this point we have only considered limits at points p ∈ R. We now
extend the definition to include limits at ∞ or −∞. The definition of the
limit at ∞ is very similar to lim f (n) where f : N → R; that is, f is a
n→∞
sequence in R.
|f (x) − L| < ǫ
lim f (x) = L.
x→∞
lim f (x) = L
x→−∞
if and only if given ǫ > 0, there exists a real number M such that
|f (x) − L| < ǫ
Exercises 4.1
1. Use the definition to establish each of the following limits.
*a. lim (2x − 7) = −3. b. lim (3x + 5) = −1
x→2 x→−2
x 1
*c. lim = . d. lim 2x2 − 3x − 4 = 1.
x→1 1 + x 2 x→−1
x3 + 1 x3 − 2x − 4 5
*e. lim = 3. f. lim =
x→−1 x + 1 x→2 x2 − 4 2
2. Use the definition to establish each of the following limits.
a. lim c = c b. lim x = p.
x→p x→p
*c. lim x3 = p3 d. lim xn = pn , n ∈ N
x→p x→p
√ √
√ √ x+p− p 1
*e. lim x= p, p>0 f. lim = √ ,p>0
x→p x→p x 2 p
3. For each of the following, determine whether the indicated limit exists in
R. Justify your answer!
x x2 − 1
*a. lim b. lim
x→0 |x| x→1 x + 1
1 p 1
*c. lim cos d. lim |x| cos .
x→0 x x→0 x
(x + 1)2 − 1 x4 − 2x2 + 1
*e. lim f. lim 3
x→0 x x→1 x − x2 − x + 1
Limits and Continuity 143
4. *Define f : (−1, 1) → R by
x2 − x − 2
f (x) = .
x+1
Determine the limit L of f at −1 and prove, using ǫ and δ, that f has
limit L at −1.
5. *a. Using Figure 4.5, prove that | sin h| ≤ |h| for all h ∈ R.
b. Using the trigonometric identity 1 − cos h = 2 sin2 h2 , prove that
(i) lim cos h = 1.
h→0
1 − cos h
(ii) lim =0
h→0 h
6. Let E be a subset of a metric space (X, d), p a limit point of E, and
f : E → R. Suppose there exists a constant M > 0 and L ∈ R such that
|f (x) − L| ≤ M d(x, p) for all x ∈ E. Prove that lim f (x) = L.
x→p
8. Use the limit theorems, examples, and previous exercises to find each of
the following limits. State which theorem, examples, or exercises are used
in each case.
5x2 + 3x − 2 x3 − x2 + 2
*a. lim b. lim
x→−1
r x−1 x→−1 x+1
3x + 1 |x + 2|3/2
*c. lim d. lim
x→1
√ 2x + 5 x→−2
x+2
x−2 1 1 √
*e. lim f. lim √ − f rac1 p , p > 0
x→4 x − 4 x→o x x+p
sin 2x |x − 2| − |x + 2|
*g. lim h. lim
x→0 x x→0 x
9. *Suppose f : (a, b) → R, p ∈ [a, b], and lim f (x) > 0. Prove that there
x→p
exists a δ > 0 such that f (x) > 0 for all x ∈ (a, b) with 0 < |x − p| < δ.
10. Suppose E is a subset of a metric space (X, d), p is a limit point of E,
and f : E → R. Prove that if f has a limit at p, then there exists a
positive constant M and a δ > 0 such that |f (x)| ≤ M for all x ∈ E, 0 <
d(x, p) < δ.
11. a. Prove Theorem 4.1.6(a).
b. Prove Theorem 4.1.6(b).
12. *Prove Theorem 4.1.8.
13. Prove Theorem 4.1.9.
14. Let f, g be real-valued functions defined on E ⊂ R and let p be a limit
point of E.
*a. If lim f (x) and lim (f (x) + g(x)) exist, prove that lim g(x) exists.
x→p x→p x→p
144 Introduction to Real Analysis
b. If lim f (x) and lim (f (x)g(x)) exist, does it follow that lim g(x)
x→p x→p x→p
exists?
15. Let E be a subset of a metric space, p a limit point of E. Suppose f is
a bounded real-valued function on E having the property that lim f (x)
x→p
does not exist. Prove that there exist distinct sequences {pn } and {qn }
in E with pn → p and qn → p such that lim f (pn ) and lim f (qn ) exist,
n→∞ n→∞
but are not equal.
16. *Let f be a real-valued function defined on (a, ∞) for some a > 0. Define
g on (0, a1 ) by g(t) = f ( 1t ). Prove that
lim f (x) = L if and only if lim g(t) = L.
x→∞ t→0
FIGURE 4.6
An illustration of Definition 4.2.1
f is continuous at p = 0. On the other hand, since lim f (x) fails to exists for
x→p
every p 6= 0, f is discontinuous at every p ∈ R, p 6= 0.
(
1, x ∈ Q,
(c) The function f defined by f (x) = of Example 4.1.2(d)
0, x 6∈ Q,
is discontinuous at every p ∈ R.
(d) As in Example 4.1.2(f), the function f (x) = 1/x is continuous at every
p ∈ (0, 1).
(e) Let f be defined by
(
0, x = 0,
f (x) =
x sin x1 , x 6= 0.
By Example 4.1.10(c),
lim f (x) = 0 = f (0).
x→0
Thus f is continuous at x = 0.
Limits and Continuity 147
FIGURE 4.7
Graph of the function of Example 4.2.2(g)
lim f (x) = 0
x→p
for every p ∈ (0, 1). As a consequence, since f (p) = 0 for every irrational
number p ∈ (0, 1), f is continuous at every irrational number. Also, since
148 Introduction to Real Analysis
Proof. If p is an isolated point of E, then the result is true since every function
on E is continuous at p. If p is a limit point of E, then the conclusions follow
from Theorem 4.1.6.
Proof. Let ǫ > 0 be given. Since g is continuous at f (p), there exists a δ1 > 0
such that
Therefore h is continuous at p.
p(x)
r(x) = , x ∈ R \ E,
q(x)
is continuous on R \ E.
(c) By Example 4.2.2(f), f (x) = sin x is continuous on R. Hence if p is
a polynomial function on R, by Theorem 4.2.4 (f ◦ p)(x) = sin(p(x)) is also
continuous on R.
150 Introduction to Real Analysis
EXAMPLES√ 4.2.7 (a) We illustrate the previous theorem for the function
f (x) = x, Dom f = [0, ∞). Suppose first that V is an open interval (a, b)
with a < b. Then
∅,
b ≤ 0,
−1 2
f (V ) = [0, b ), a ≤ 0 < b,
2 2
(a , b ), 0 < a.
Clearly ∅ and (a2 , b2 ) are open subsets of R and hence also of [0, ∞). Although
[0, b2 ) is not open in R,
Therefore,
n
[ n
[
K= (Uαj ∩ K) = f −1 (Vαj ),
j=1 j=1
n
Since f (f −1 (Vαj )) ⊂ Vαj , f (K) ⊂
S
Vαj . Thus f (K) is compact.
j=1
As a corollary of the previous theorem we obtain the following general-
ization of the usual maximum-minimum theorem normally encountered in
calculus.
152 Introduction to Real Analysis
Thus if 0 < β < 1, there exists an integer n such that f (xn ) > β. Hence
sup{f (x) : x ∈ E} = 1.
(b) If E is not closed, let xo be a limit point of E which is not in E. Then
1
g(x) =
1 + (x − xo )2
FIGURE 4.8
Intermediate value theorem
f (c) − ǫ < f (x) < f (c) + ǫ for all x ∈ Nδ (c) ∩ [a, b].
Since f (c) < γ, c 6= b, and thus (c, b] ∩ Nδ (c) 6= ∅. But for any x ∈ (c, b] with
c < x < c + δ,
1 1 1
f (x) < f (c) + ǫ = f (c) + γ − f (c) = (f (c) + γ) < γ.
2 2 2
But then x ∈ A and x > c which contradicts c = sup A. Therefore f (c) ≥ γ.
Since c = sup A, either c ∈ A or c is a limit point of A. If c ∈ A, then
f (c) ≤ γ. If c is a limit point of A, then by Theorem 3.1.4 there exists a
sequence {xn } in A such that xn → c. Since xn ∈ A, f (xn ) ≤ γ. Since f is
continuous,
f (c) = lim f (xn ) ≤ γ.
n→∞
Proof. Let s, t ∈ f (I) with s < t, and let a, b ∈ I with a 6= b be such that
f (a) = s and f (b) = t. Suppose γ satisfies s < γ < t. If a < b, then since f is
continuous on [a, b], by the intermediate value theorem theorem there exists
c ∈ (a, b) such that f (c) = γ. Thus γ ∈ f (I). A similar argument also holds if
a > b.
There is an alternate way to state the previous corollary using the ter-
minology of connected sets. If I is a connected subset of R and f : I → R
is continuous on I, then f (I) is connected. This result can be proved using
only properties of continuous functions and the definition of a connected set
(Exercise 28). The corollary now follows from the fact that a subset of R
is connected if and only if it is an interval (Theorem 2.2.25). The proof of
Theorem 2.2.25 however also requires the least upper bound property of R.
Consequently, the supremum property of the real numbers cannot be avoided
in proving Corollary 4.2.12.
The following two corollaries are additional applications of the intermedi-
ate value theorem. Our first result is the proof of Theorem 1.5.3.
COROLLARY 4.2.13 For every real number γ > 0 and every positive in-
teger n, there exists a unique positive real number y so that y n = γ.
f (y) = y n = γ.
Proof. Let g(x) = f (x) − x. Then g(0) = f (0) ≥ 0 and g(1) = f (1) − 1 ≤ 0.
Thus there exists y ∈ [0, 1] such that g(y) = 0; i.e., f (y) = y.
Limits and Continuity 155
Then f (0) = −1, f ( π2 ) = 1, and for every γ, −1 < γ < 1, there exists an
x ∈ (0, π2 ) such that f (x) = γ. However, the function f is not continuous at
x = 0 (see Figure 4.3).
(b) In this example, we show that the conclusion of the intermediate value
theorem is false if the interval [a, b] of real numbers is replaced by an interval
of rational numbers. Let E = {x ∈ Q : 0 ≤ x ≤ 2}, and let f (x) = x2 . Then f
is continuous on E with f (0) < 2 < f (2). However, there does not exist r ∈ E
such that f (r) = 2.
Exercises 4.2
1. For each of the following, determine whether the given function is con-
tinuous at the indicated point xo .
2
2x − 5x − 3
, x 6= 3,
a. f (x) = x−3 at xo = 3
6, x = 3,
√
x−2
x 6= 4
b. h(x) = x−4 at xo = 4
4 x=4
1 − cos x ,
x 6= 0,
*c. g(x) = x at xo = 0
0, x = 0,
(
x2 , x ≤ 2,
*d k(x) = at xo = 2
4 − x, x > 2,
2. Let f : R → R be defined by
(
8x, when x is rational,
f (x) =
2x2 + 8, when x is irrational.
a. Prove, using ǫ and δ, that f is continuous at 2.
*b. Is f continuous at 1? Justify your answer.
3. Let f : R → R be defined by
(
x2 , x ∈ Q,
f (x) =
x + 2, x∈/ Q.
Find all points (if any) where f is continuous.
156 Introduction to Real Analysis
√
4. *Prove (without using Example 4.2.7) that f (x) = x is continuous on
[0, ∞).
r
1 x+1
5. Define f : (0, 1] → R by f (x) = √ − .
x x
a. Justify that f is continuous on (0, 1].
*b. Can one define f (0) so that f is continuous on [0, 1]?
6. Let E ⊂ R, and suppose f : E → R is continuous at p ∈ E.
*a. Prove that |f | is continuous at p. Is the converse true?
p
b. Set g(x) = |f x)|. Prove that g is continuous at p.
7. *Let E be a subset of a metric space X, and suppose f : E → R is
continuous on E. Prove that f n defined by f n (x) = (f (x))n is continuous
on E for each n ∈ N.
8. *a. Prove that f (x) = cos x is continuous on R.
b. If E ⊂ R and f : E → R is continuous on E, prove that g(x) =
cos(f (x)) is continuous on E.
9. For each of the following equations, determine the largest subset E of R
such that the given equation defines a continuous function on E. In each
case state which theorems or examples are used to show that the function
is continuous on E.
x3 + 4x − 5 1
*a. f (x) = . b. g(x) = sin .
x(x2 − 4) x
1 cos x
*c. h(x) = √ . d. k(x) = .
x2 + 1 sin x
(
x2 , x ≤ 2,
10. Prove that f (x) = is continuous on R and that
6 − x, x > 2,
Range f = R.
11. As in Example 4.2.7, use Theorem 4.2.6 to prove that each of the following
functions is continuous on the given domain.
1
a. f (x) = , Dom f = (0, ∞). b. g(x) = x2 , Dom g = R.
x
12. *a. Let f : R2 → R2 be defined by f (x, y) = (x + y, x − y). Show that f
is continuous on R2 .
b. Let E be a subset of R, and suppose f, g are continuous real-valued
functions on E. Prove that f (x, y) = (f (x), f (y)) is continuous on E × E.
xy
13. Prove that the function f defined on D = R2 \ (0, 0) by f (x, y) = 2
x + y2
is continuous and bounded on D.
14. Suppose E is a subset of R and f, g : E → R are continuous at p ∈ E.
Prove that each of the functions defined below is continuous at p.
*a. max{f, g}(x) = max{f (x), g(x)}, x ∈ E.
b. min{f, g}(x) = min{f (x), g(x)}, x ∈ E.
+
c. f (x) = max{f (x), 0}.
Limits and Continuity 157
The key point in the definition of uniform continuity is that the choice
of δ must depend only on ǫ, the function f , and the set E; it has to be inde-
pendent of any x ∈ E. To illustrate this, we consider the following examples.
Lipschitz Functions
Both of the functions in Example 4.3.2 (a) and (b) are examples of an extensive
class of functions. If E is a subset of a metric space (X, d), a function f : E →
R satisfies a Lipschitz condition on E if there exists a positive constant M
such that
|f (x) − f (y)| ≤ M d(x, y)
for all x, y ∈ E. Functions satisfying the above inequality are usually referred
to as Lipschitz functions. As we will see in the next chapter, functions for
which the derivative is bounded are Lipschitz functions. As a consequence
of the following theorem, every Lipschitz function is uniformly continuous.
However, not every uniformly continuous
√ function is a Lipschitz function. For
example, the function f (x) = x is uniformly continuous on [0, ∞), but f
does not satisfy a Lipschitz condition on [0, ∞) (see Exercise 5).
160 Introduction to Real Analysis
Proof. Exercise 1.
Proof. Let ǫ > 0 be given. Since f is continuous, for each p ∈ K, there exists
a δp > 0 such that
ǫ
|f (x) − f (p)| < (3)
2
for all x ∈ K ∩ Nδp (p).
The collection Nδp /2 (p) p∈K is an open cover of K. Since K is compact,
a finite number of these will cover K. Thus there exist a finite number of
points p1 , ..., pn in K such that
n
[
K⊂ Nδpi /2 (pi ).
i=1
Let
1
δ= 2 min{δpi : i = 1, ..., n}.
Then δ > 0. Suppose x, y ∈ K with d(x, y) < δ. Since x ∈ K, x ∈ Nδpi /2 (pi )
for some i. Furthermore, since d(x, y) < δ ≤ δpi /2 ,
x, y ∈ Nδpi (pi ).
EXAMPLE 4.3.6 In this example, we show that both the properties closed
and bounded are required. The interval [0, ∞) is closed, but not bounded. The
function f (x) = x2 is continuous on [0, ∞), but not uniformly continuous on
[0, ∞) (Exercise 2). On the other hand, the interval (0, 1) is bounded, but not
closed. The function f (x) = 1/x is continuous on (0, 1), but f is not uniformly
continuous on (0, 1).
Exercises 4.3
1. Prove Theorem 4.3.3.
2. Show that the following functions are not uniformly continuous on the
given domain.
1
*a. f (x) = x2 , Dom f = [0, ∞) b. g(x) = 2 , Dom g = (0, ∞)
x
1 1
*c. h(x) = sin , Dom h = (0, ∞) d. k(x) = , Dom k = (0, π)
x sin x
3. Prove that each of the following functions is uniformly continuous on the
indicated set.
x
*a. f (x) = , x ∈ [0, ∞) b. g(x) = x2 , x ∈ N.
1+x
1
*c. h(x) = 2 , x ∈ R. d. k(x) = cos x, x ∈ R
x +1
2
x sin x
e. e(x) = , x ∈ (0, ∞) *f. f (x) = , x ∈ (0, 1)
x+1 x
4. Show that each of the following functions is a Lipschitz function.
1
*a. f (x) = , Dom f = [a, ∞), a > 0.
x
x
b. g(x) = 2 , Dom g = [0, ∞).
x +1
1
*c. h(x) = sin , Dom h = [a, ∞), a > 0.
x
d. p(x) a polynomial, Dom p = [−a, a], a > 0.
√
5. *a. Show that f (x) = x satisfies a Lipschitz condition on [a, ∞),
a > 0.
√
b. Prove that x is uniformly continuous on [0, ∞) .
*c. Show that f does not satisfy a Lipschitz condition on [0, ∞).
6. Suppose E ⊂ R and f, g are Lipschitz functions on E.
a. Prove that f + g is a Lipschitz function on E.
b. If in addition f and g are bounded on E, prove that f g is a Lipschitz
function on E.
7. Suppose E ⊂ R, and f, g are uniformly continuous real valued functions
on E.
a. Prove that f + g is uniformly continuous on E.
162 Introduction to Real Analysis
lim f (x).
f (p+) = lim+ f (x) = x→p
x→p
x>p
lim f (x).
f (p−) = lim− f (x) = x→p
x→p
x<p
Types of Discontinuities
By the previous theorem a function f is continuous at p ∈ (a, b) if and only if
(a) f (p+) and f (p−) both exist, and
(b) f (p+) = f (p−) = f (p).
A real-valued function f defined on an interval I can fail to be continuous
at a point p ∈ I (the closure of I) for several reasons. One possibility is that
lim f (x) exists but either does not equal f (p), or f is not defined at p. Such a
x→p
function can easily be made continuous at p by either defining, or redefining,
f at p as follows:
f (p) = lim f (x).
x→p
FIGURE 4.9
Jump discontinuity of f at p
FIGURE 4.10
Graph of [x]
g(x) = sin(2πx[x]).
For x ∈ (n, n + 1), n ∈ Z, x[x] = nx, and thus g(x) is continuous on every
interval (n, n + 1), n ∈ Z. On the other hand, for n ∈ Z,
Monotone Functions
lim f (x) = A.
x→p−
168 Introduction to Real Analysis
The proof of this is similar to the proof of Theorem 3.3.2. Let ǫ > 0 be given.
Since A is the least upper bound of {f (x) : x < p}, there exists xo < p such
that
A − ǫ < f (xo ) ≤ A.
Thus if xo < x < p, A − ǫ < f (xo ) ≤ f (x) ≤ A. Therefore,
FIGURE 4.11
Graph of I(x − a)
Then Ik has a unit jump at each ak and is right continuous at ak (see Fig-
ure 4.11).
Suppose {c1 , ..., cn } are positive real numbers. Define f on [a, b] by
n
X
f (x) = ck I(x − ak ).
k=1
THEOREM 4.4.10 Let a, b ∈ R with a < b, and let {xn }n∈N be a countable
subset of (a, b). Let {cn }∞
n=1 be any sequence of positive real numbers such that
∞
P
cn converges. Then there exists a monotone increasing function f on [a, b]
n=1
such that
∞
P
(a) f (a) = 0 and f (b) = cn ,
n=1
(b) f is continuous on [a, b] \ {xn : n = 1, 2, ... },
(c) f (xn +) = f (xn ) for all n; i.e. f is right continuous at all xn , and
(d) f is discontinuous at each xn with
f (xn ) − f (xn −) = cn .
Thus for each x ∈ [a, b], the sequence {sn (x)} of partial sums is monotone
increasing and bounded above and hence by Theorem 3.7.6 converges. Since
for all x, y with x < y, f is monotone increasing on [a, b]. Furthermore, since
xn > a for all n, I(a − xn ) = 0 for all n. Therefore f (a) = 0. Also, since
I(b − xn ) = 1 for all n,
∞
X
f (b) = ck .
k=1
With this choice of δ, if xk ∈ Nδ (p)∩E, we have k > N . Suppose p < x < p+δ.
Then
I(p − xk ) = I(x − xk ) for all k = 1, 2, ..., N.
Furthermore, for any x > p, we always have
Therefore,
f (xn ) − f (y) = cn for all y, xn − δ < y < xn .
∞
P
Suppose xn is a limit point of E. Given ǫ > 0, choose N such that ck <
k=N +1
ǫ. For this N , choose δ > 0 such that if xk ∈ (xn − δ, xn ) ∩ E then k > N .
Then for all y ∈ [a, b] with xn − δ < y < xn ,
∞
X
cn ≤ f (xn ) − f (y) ≤ cn + ck < cn + ǫ.
k=N +1
In this example, the sequence {xn } satisfies 0 < x1 < x2 < · · · < 1. If
0 ≤ x < x1 = 21 , then I(x − xn ) = 0 for all n. Thus
f (x) = 0, x ∈ [0, 21 ).
2
If x1 ≤ x < x2 = 3, then I(x − x1 ) = 1 and I(x − xk ) = 0 for all k ≥ 2.
Therefore,
1
f (x) = c1 =, x ∈ [ 21 , 13 ).
2
If x2 ≤ x < x3 = 34 , then I(x − xk ) = 1 for k = 1, 2 and I(x − xk ) = 0 for
k ≥ 3. Therefore,
1 1 3
f (x) = c1 + c2 = + = , x ∈ [ 23 , 34 ),
2 22 4
and so forth. The graph of f is depicted in Figure 4.12.
(b) Let cn = 2−n and let {xn } be an enumeration of the rationals in (0, 1).
Theorem 4.4.10 guarantees the existence of a nondecreasing function on [0, 1],
which is discontinuous at each rational number in (0, 1), and continuous at
every irrational number in (0, 1).
(c) If in the proof of Theorem 4.4.10 we take {xn }n∈N to be a countable
P∞
subset of R and choose {cn }n∈N (cn > 0) such that cn = 1, then we obtain
n=1
a nondecreasing real-valued function f on R satisfying lim f (x) = 0 and
x→−∞
lim f (x) = 1. (See Exercise 21) Such a function is called a distribution
x→∞
function on R. Such functions arise naturally in probability theory.
Limits and Continuity 173
FIGURE 4.12 P
Graph of f (x) = cn I(x − xn )
Inverse Functions
Suppose f is a strictly increasing real-valued function on an interval I. Let
x, y ∈ I with x 6= y. If x < y, then since f is strictly increasing, f (x) < f (y).
Similarly, if x > y then f (x) > f (y). Thus f (x) 6= f (y) for any x, y ∈ I with
x 6= y. Therefore f is one-to-one and consequently has an inverse function f −1
defined on f (I). In the following theorem we prove that if f is continuous on
I then f −1 is also continuous on f (I).
FIGURE 4.13
Continuity of the inverse function
Exercises 4.4
1. Prove Theorem 4.4.3.
2. For each of the following functions f defined on R \ {0}, find lim f (x)
x→0−
and lim f (x), provided the limits exist.
x→0+
x[x], x < 0,
x
a. f (x) = . *b. f (x) = [x]
|x| , x > 0.
x
2
− x ]
c. f (x) = [1 *d. f (x) = [x2 − 1]
1 1
e. f (x) = . *f. f (x) = x .
x x
3. For each of the functions f in Exercise 2, determine whether f has a
removable discontinuity, a jump discontinuity, or a discontinuity of second
kind, at x = 0. If f has a removable discontinuity at 0, specify how f (0)
should be defined in order that f is continuous at 0.
4. *Investigate continuity of g(x) = (x − 2)[x] at xo = 2.
5. Let f (x) = x − [x]. Discuss continuity of f . Sketch the graph of f .
6. For each of the following determine the value of b such that f has a
removable discontinuity at the indicated point xo .
(
x − 2, x < 1,
a. f (x) = 3
xo = 1
bx + 4, x > 1,
(
−x2 [x], x < −2,
*b. f (x) = xo = −2
x + b, x > −2,
7. a. Sketch the graph of g(x) = sin(2πx[x]) for x ∈ (−4, 4).
*b. Prove that g(x) is not uniformly continuous on R.
8. Prove that the function f of Example 4.4.11(a) is continuous at x = 1.
9. Let E ⊂ R and let f be a real-valued function on E. Suppose p ∈ R is a
limit point of E ∩ (p, ∞). Prove that
lim f (x) = L if and only if lim f (pn ) = L
x→p+ n→∞
Notes
The limit of a function at a point is one of the fundamental tools of analysis. Not
only is it crucial to continuity, but also to many subsequent topics in the text. The
limit process will occur over and over again. We will encounter it in the next chapter
in the definition of the derivative. It will occur again in the chapters on integration,
series, etc.
Another very important concept that will be encountered on many other oc-
casions in the text is uniform continuity. Uniform continuity is important in that
given ǫ > 0, it guarantees the existence of a δ > 0 such that |f (x) − f (y)| < ǫ for all
x, y ∈ Dom f with |x − y| < δ. In Chapter 6 we will use this to prove that every
continuous real-valued function on [a, b], a, b ∈ R, is Riemann integrable on [a, b].
Other applications of uniform continuity will occur in many other theorems and in
the exercises.
One of the most important results of this chapter is the intermediate value
theorem (Theorem 4.2.11). The intermediate value theorem has already been used
in Corollary 4.2.13 to prove the existence of nth roots; namely, for every positive real
number x and n ∈ N, there exists a unique positive real number y such that y n = x.
Even though the existence of nth roots can be proved without the intermediate
value theorem, any such proof however is simply the statement that the function
178 Introduction to Real Analysis
f (x) = xn satisfies the intermediate value property on [0, a] for every a > 0. Other
applications of the intermediate value theorem will occur elsewhere in the text.
The proof of the intermediate value theorem depended on the fact that the
connected subsets of R are the intervals (Theorem 2.2.25) and that the continuous
image of a connected set is connected (Exercise 28 of Section 4.2). Assuming these
two results, the intermediate value theorem is an immediate consequence as follows:
Suppose f is continuous on [a, b]. Let I = f ([a, b]). Then I is connected and thus
must be an interval. Thus if f (a) < γ < f (b), γ ∈ I and hence there exists c ∈ [a, b]
such that f (c) = γ. That the continuous image of a connected set is connected
follows from the definition. However, the proof that the connected subsets of R are
the intervals requires the least upper bound property.
Miscellaneous Exercises
1. Let f be a continuous real-valued function on [a, b] with f (a) < 0 < f (b).
Let c1 = 12 (a + b). If f (c1 ) > 0, let c2 = 21 (a + c1 ). If f (c1 ) < 0, let
c2 = 12 (c1 + b). Continue this process inductively to obtain a sequence
{cn } in (a, b) which converges to a point c ∈ (a, b) for which f (c) = 0.
2. Let E ⊂ R, p a limit point of E, and f a real-valued function defined on
E. The limit superior of f at p, denoted lim f (x), is defined by
x→p
The following set of exercises involve the Cantor ternary function. Let
P denote the Cantor ternary set of Section 2.3. For each x ∈ (0, 1], let
x = .a1 a2 a3 .... denote the ternary expansion of x. Define N as follows:
(
∞, if an 6= 1 for all n ∈ N,
N=
min{n : an = 1}, otherwise.
Define bn = 21 an for n < N , and bN = 1, if N is finite. (Note: bn ∈ {0, 1}
for all n.)
N b
P n
4. If x ∈ (0, 1] has two ternary expansions, show that n
is independent
n=1 2
of the expansion of x.
Supplemental Reading
the epsilon? Cauchy and the origins of function uniformly continuous,” Math.
rigorous calculus,” Amer. Math. Monthly Mag. 57 (1994), 169–173.
90 (1983), 185–194. Straffin, Jr., Philip, D., “Periodic
points of continuous functions,” Math.
Radcliffe, D. G., “A function that
Mag. 51 (1978) 99–105.
is surjective on every interval,” Amer.
Velleman, D. J., “Characterizing con-
Math. Monthly 123 (2016), 88–89.
tinuity,” Amer. Math. Monthly 104
Snipes, Ray F., “Is every continuous (1997), 318-322.
5
Differentiation
181
182 Introduction to Real Analysis
s(t) − s(to )
.
t − to
The limit of this quantity as t approaches to , again provided that the limit
exists, is taken as the definition of the velocity of the object at time to .
Differentiation 183
FIGURE 5.1
Secant line between two points on the graph of f
provided of course that the limit exists. The analogous formula also holds if
p ∈ I is the right endpoint of I. In analogy with the right and left limit of a
function we also define the right and left derivative of a function.
DEFINITION 5.1.2 Let I ⊂ R be an interval and let f be a real-valued
function with domain I. If p ∈ I is such that I ∩ (p, ∞) 6= ∅, then the right
′
derivative of f at p, denoted f+ (p), is defined as
′ f (p + h) − f (p)
f+ (p) = lim+ ,
h→0 h
provided the limit exists. Similarly, if p ∈ I satisfies (−∞, p) ∩ I 6= ∅, then the
′
left derivative of f at p, denoted f− (p), is given by
′ f (p + h) − f (p)
f− (p) = lim ,
h→0− h
provided the limit exists.
(x + h)2 − x2
f ′ (x) = lim = lim (2x + h) = 2x.
h→0 h h→0
1 1
f ′ (x) = lim √ √ = √ .
h→0 x+h+ x 2 x
we obtain
sin(x + h) − sin x cos h − 1 sin h
= sin x + cos x .
h h h
sin h cos h − 1
lim =1 and lim = 0.
h→0 h h→0 h
Therefore,
sin(x + h) − sin x
f ′ (x) = lim
h→0 h
cos h − 1 sin h
= sin x lim + cos x lim
h→0 h h→0 h
= cos x.
d
In Exercise 3 you will be asked to prove that (cos x) = − sin x.
dx
186 Introduction to Real Analysis
Then
′ |h| h
f+ (0) = lim+ = lim+ = 1, and
h→0h h→0 h
′ |h| −h
f− (0) = lim− = lim− = −1.
h→0 h h→0 h
′ ′
Thus f+ (0) and f− (0) both exist, but are unequal. Therefore f ′ (0) does not
exist.
(e) In this example, let g(x) = x3/2 with Dom g = [0, ∞). Then for p = 0,
x3/2 √
g ′ (0) = g+
′
(0) = lim+ = lim+ x = 0.
x→0 x x→0
1 1 1
For x 6= 0, f ′ (x) = − cos + sin (Exercise 8). When x = 0,
x x x
f (h) − f (0) 1
f ′ (0) = lim = lim sin ,
h→0 h h→0 h
which by Example 4.1.5(a) does not exist. Therefore f ′ (0) does not exist.
(g) Consider the following variation of (f). Let
x2 sin 1 , x 6= 0,
g(x) = x .
0, x = 0.
This example is very important! In Exercise 9 you will be asked to show that
g ′ (x) exists for all x ∈ R with g ′ (0) = 0, but that the derivative g ′ is not
continuous at 0.
Proof. For t 6= p,
f (t) − f (p)
f (t) − f (p) = (t − p).
t−p
Since
f (t) − f (p)
lim
t→p t−p
exists and equals f ′ (p), by Theorem 4.1.6(b),
f (t) − f (p)
lim (f (t) − f (p)) = lim lim (t − p) = f ′ (p) · 0 = 0.
t→p t→p t−p t→p
Proof. The proof of (a) is left as an exercise (Exercise 4). For the proof of
(b), by adding and subtracting the term f (x + h)g(x), we have for h 6= 0,
(f g)(x + h) − (f g)(x) g(x + h) − g(x)
= f (x + h)
h h
f (x + h) − f (x)
+ g(x).
h
By Theorem 5.1.4, since f is differentiable at x, lim f (x + h) = f (x). Thus
h→0
since each of the limits exist, by Theorem 4.1.6
(f g)(x + h) − (f g)(x)
(f g)′ (x) = lim
h→0 h
g(x + h) − g(x)
= lim f (x + h) lim
h→0 h→0 h
f (x + h) − f (x)
+ g(x) lim
h→0 h
= f (x)g ′ (x) + g(x)f ′ (x).
1 1
′ −
1 g(x + h) g(x)
(x) = lim
g h→0 h
g(x + h) − g(x) 1
= − lim lim
h→0 h h→0 g(x)g(x + h)
−g ′ (x)
= 2 .
g (x)
Therefore, for t 6= x,
h(t) − h(x)
= [f ′ (x) + u(t)] [g ′ (y) + v(f (t))] .
t−x
Since v(f (t)) and u(t) both have limit 0 as t → x,
h(t) − h(x)
lim = f ′ (x) g ′ (y) = g ′ (f (x)) f ′ (x).
t→x t−x
(b) By Exercise 2
d n
x = nxn−1 for all n ∈ N.
dx
Thus if f : I → R is differentiable on the interval I, then by the chain rule
g(x) = [f (x)]n , n ∈ N, is differentiable on I with g ′ (x) = n[f (x)]n−1 f ′ (x).
This formula can also be obtained from Theorem 5.1.5(b) using mathematical
induction.
Exercises 5.1
1. Use the definition to find the derivative of each
√ of the following functions.
*a. f (x) = x3 , x ∈ R b. g(x) = x + 2, x > −2
1 1
*c. h(x) = , x 6= 0 d. k(x) = √ , x > −2
x x+2
x x
*e. f (x) = , x 6= −1 f. g(x) = 2 x∈R
x+1 x +1
d n
2. *Prove that for all integers n, x = n xn−1 (x 6= 0 if n is negative).
dx
d
3. *a. Prove that (cos x) = − sin x.
dx
sin x
b. Find the derivative of tan x = .
cos x
4. Prove Theorem 5.1.5(a).
5. For each of the following, determine whether the given function is differ-
entiable at the indicated point xo . Justify your answer!
*a. f (x) = x|x| at xo = 0
(
x2 , x ∈ Q,
b. f (x) = at xo = 0
0, x∈/ Q,
*c. g(x) = (x − 2)[x], at xo = 2
√
d. h(x) = x + 2, x ∈ [−2, ∞), at x0 = −2
(
sin x, x ∈ Q,
*e. f (x) = at xo = 0
x, x∈/ Q,
(
x4/3 cos x, x 6= 0,
f. g(x) = at xo = 0
0, x = 0,
Differentiation 191
6. Let f (x) = |x|3 . Compute f ′ (x), f ′′ (x), and show that f ′′′ (0) does not
exist.
7. Determine where each of the following functions from R to R is differen-
tiable and find the derivative.
*a. f (x) = x [x]. b. g(x) = |x
− 2| + |x + 1|.
x2 1 , 0 < x ≤ 1,
*c. h(x) = | sin x|. d. k(x) = x
0, x=0
8. Use the product rule, quotient rule, and chain rule to find the derivative
of each of the following.
a f (x) = x sin x1 , x 6= 0 b. f (x) = (cos(sin x)n )m , n, m ∈ N
p √
c. x + 2 + x d. f (x) = x4 (x + sin x1 ), x 6= 0
2
sin x
e. f (x) = 2 f. f (x) = cos4 x−1
x+1
, x 6= 1
1 + sin x
9. Let g be defined by
x2 sin 1 , x 6= 0,
g(x) = x
0, x = 0.
a. Prove that g is differentiable at 0 and that g ′ (0) = 0.
*b. Show that g ′ (x) is not continuous at 0.
10. Let f be defined by
(
x2 + 2, x ≤ 2,
f (x) =
ax + b, x > 2.
*a. For what values of a and b is f continuous at 2 ?
*b. For what values of a and b is f differentiable at 2?
11. Let f be defined by
ax + b,
x < −1,
f (x) = x3 + 1, −1 ≤ x ≤ 2,
cx + d, x > 2.
FIGURE 5.2
Absolute maxima and minima on the graph of f
f (t) − f (p)
≥ 0,
t−p
′ ′ ′
and therefore f− (p) ≥ 0. Finally, since f+ (p) = f− (p) = f ′ (p), we have
′
f (p) = 0. The proof of the case where f has a local minimum at p is similar.
As a consequence of the previous theorem we have the following corollary.
194 Introduction to Real Analysis
Rolle’s theorem
Prior to stating and proving the mean value theorem we first state and prove
the following theorem due to Michel Rolle (1652–1719).
Since the derivative of f at c gives the slope of the tangent line at (c, f (c)), a
geometric interpretation of Rolle’s theorem is that if f satisfies the hypothesis
of the theorem, then there exists a least one value of c ∈ (a, b) where the
tangent line to the graph of f is horizontal. For the function f depicted in
Figure 5.3, there are exactly two such points.
Proof. If f is constant on [a, b], then f ′ (x) = 0 for all x ∈ [a, b]. Thus, we
assume that f is not constant. Since the closed interval [a, b] is compact, by
Corollary 4.2.9, f has a maximum and a minimum on [a, b]. If f (t) > f (a) for
some t, then f has a maximum at some c ∈ (a, b). Thus by Theorem 5.2.2,
f ′ (c) = 0. If f (t) < f (a) for some t, then f has a minimum at some c ∈ (a, b),
and thus again f ′ (c) = 0.
Remarks. (a) Continuity of f on [a, b] is required in the proof of Rolle’s
theorem. The function
(
x, 0 ≤ x < 1,
f (x) =
0, x = 1,
Differentiation 195
FIGURE 5.3
Rolle’s theorem
is differentiable on (0, 1) and satisfies f (0) = f (1) = 0; yet f ′ (x) 6= 0 for all
x ∈ (0, 1). The function f fails to be continuous at 1.
(b) For Rolle’s theorem, differentiability
√ of f at a and b is not required. For
example, the function f (x) = 4 − x2 , x ∈ [−2, 2] satisfies the hypothesis of
Rolle’s theorem, yet the derivative does not exist at −2 and 2. For x ∈ (−2, 2),
−x
f ′ (x) = √ ,
4 − x2
and the conclusion of Rolle’s theorem is satisfied with c = 0.
Graphically, the mean value theorem states that there exists at least one
point c ∈ (a, b) such that the slope of the tangent line to the graph of the
function f is equal to the slope of the straight line passing through (a, f (a))
and (b, f (b)). For the function of Figure 5.4, there are two such values of c,
namely c1 and c2 .
Proof. Consider the function g defined on [a, b] by
f (b) − f (a)
g(x) = f (x) − f (a) − (x − a).
b−a
196 Introduction to Real Analysis
Then g is continuous on [a, b], differentiable on (a, b), with g(a) = g(b). Thus
by Rolle’s theorem there exists c ∈ (a, b) such that g ′ (c) = 0. But
f (b) − f (a)
g ′ (x) = f ′ (x) −
b−a
f (b) − f (a)
for all x ∈ (a, b). Taking x = c gives f ′ (c) = , from which the
b−a
conclusion now follows.
FIGURE 5.4
Mean value theorem
EXAMPLE 5.2.7 In this example, we illustrate how the mean value theo-
rem may be used in proving elementary inequalities. We will use it to prove
that
x
≤ ln(1 + x) ≤ x for all x > −1,
1+x
where ln denotes the natural logarithm function. This function is considered
in greater detail in Example 6.3.5 of the next chapter. Let f (x) = ln(1 + x),
x ∈ (−1, ∞). Then f (0) = 0. If x > 0, then by the mean value theorem, there
Differentiation 197
But f ′ (c) = (1+c)−1 and (1+x)−1 < (1+c)−1 < 1 for all c ∈ (0, x). Therefore
x
< f ′ (c)x < x,
1+x
and as a consequence
x
≤ ln(1 + x) ≤ x for all x ≥ 0.
1+x
Now suppose −1 < x < 0. Then again by the mean value theorem there exists
c ∈ (x, 0) such that
x
ln(1 + x) = f (x) − f (0) = .
1+c
But since x < c < 0, we have 1 < (1 + c)−1 < (1 + x)−1 , and since x is
negative,
x
< ln(1 + x) < x.
1+x
Hence the desired inequality holds for all x > −1, with equality if and only if
x = 0.
Proof. Let
h(x) = [f (b) − f (a)] g(x) − [g(b) − g(a)] f (x).
Then h is continuous on [a, b], differentiable on (a, b) with
Thus by Rolle’s theorem, there exists c ∈ (a, b) such that h′ (c) = 0, which
gives the result.
The geometric interpretation of the Cauchy mean value theorem is very
similar to that of the mean value theorem. If g ′ (x) 6= 0 for all x ∈ (a, b), then
g(a) 6= g(b) and the conclusion of Theorem 5.2.8 can be written as
f (b) − f (a) f ′ (c)
= ′ .
g(b) − g(a) g (c)
198 Introduction to Real Analysis
FIGURE 5.5
Cauchy mean value theorem
′
f+ (a) = lim+ f ′ (x).
x→a
′
Proof. Let L = lim+ f (x), that is assumed to exist. Given ǫ > 0, there exists
x→a
a δ > 0 such that
FIGURE 5.6
Graph of f ′ (x) = 4x3 (2 + sin(1/x)) − x2 cos(1/x), x 6= 0
Suppose 0 < h < δ is such that a+h < b. Since f is continuous on [a, a+h] and
differentiable on (a, a+h), by the mean value theorem f (a+h)−f (a) = f ′ (ζh ) h
for some ζh ∈ (a, a + h). Therefore
f (a + h) − f (a)
− L = |f ′ (ζh ) − L| < ǫ
h
′
for all h, 0 < h < δ. Thus f+ (a) = L.
Similarly,
′
f+ (1) = lim −2x = −2.
x→1+
of Example 5.1.3(g) has the property that g ′ (0) exists but lim g ′ (x) does not.
x→0
Differentiation 201
f −1 (yn ) − f −1 (yo ) xn − xo
lim = lim
n→∞ y n − yo n→∞ f (xn ) − f (xo )
1
= ′ .
f (xo )
Remark. The hypothesis that f ′ (x) 6= 0 for all x ∈ I is crucial. For example,
the function f (x) = x3 is strictly increasing on [−1, 1] with f ′ (0) = 0. The
inverse function f −1 (y) = y 1/3 however is not differentiable at y = 0.
Exercises 5.2
1. Graph the function f (x) = (x − 21 )2 , 0 ≤ x ≤ 2. Show that f+
′
(0) = −1
′
and f− (2) = 3.
2. Which of the following functions satisfy the hypothesis of the mean value
theorem. For those to which the mean value theorem applies, calculate a
suitable c.
a. f (x) = |x|, −2 ≤ x ≤ 2 b. f (x) = 2x − x3 , 0 ≤ x ≤ 2
x
c. f (x) = , −1 ≤ x ≤ 2. d. f (x) = 1 − x2/3 , −2 ≤ x ≤ 1
x+2
3. For each of the following functions determine the interval(s) where the
function is increasing, decreasing, and find all local maxima and minima.
*a. f (x) = x3 + 6x − 5, x ∈ R. b. g(x) = 4x − x4 , x ∈ R.
2
x √
*c. h(x) = , x ∈ R. d. k(x) = x − 12 x, x ≥ 0.
1 + x2
4 x−a
*e. l(x) = x + 2 , x 6= 0. f. f (x) = , a 6= b, x 6= b
x x−b
Xn
4. Let f (x) = (x−ai )2 , where a1 , a2 , ..., an are constants. Find the value
i=1
of x where f is a minimum.
5. As in Example 5.2.7 use the mean value theorem to establish each of the
following inequalities.
√
*a. 1 + x < 1 + 21 x, x > −1
b. ex ≥ 1 + x, x ∈ R
*c. xα − aα < αaα−1 (x − a), 0 < a < x, 0 < α < 1
d.1 (1 + x)α ≥ 1 + αx, x > −1, α > 1
1 For α ∈ N, this inequality was proved by mathematical induction in Example 1.3.2(b).
d α
In this exercise and in Exercise 6(b) you may assume that for α ∈ R, x = αxα−1 .
dx
204 Introduction to Real Analysis
f (x + h) − f (x)
a. Show that for any h > 0, lim = L.
x→∞ h
f (x)
b. Show that lim = L.
x→∞ x
Infinite Limits
Since l’Hospital’s rule allows for infinite limits, we provide the following defi-
nitions.
lim f (x) = ∞,
x→p
Differentiation 207
Since these definitions are similar to Definitions 4.1.11 and 4.4.1 they are left
to the exercises (Exercise 1).
Remark. Since we now allow the possibility of a function having infinite
limits, it needs to be emphasized that when we say that a function f has a
limit at p ∈ R (or at ±∞), we mean a finite limit.
L’Hospital’s Rule
L’Hospital’s rule is useful for evaluating limits of the form
f (x)
lim
x→p g(x)
where either (a) lim f (x) = lim g(x) = 0 or (b) f and g tend to ±∞ as x → p.
x→p
x→p
If (a) holds, then lim (f (x) g(x)) is usually referred to as indeterminate of
x→p
form 0/0, whereas in (b) the limit is referred to as indeterminate of form ∞/∞.
The reason that (a) and (b) are indeterminate are that previous methods may
no longer apply.
In (a), if either lim f (x) or lim g(x) is nonzero, then previous methods
x→p x→p
discussed in Section 4.1 apply. For example, if both f and g have limits at p
and lim g(x) 6= 0, then by Theorem 4.1.6(c)
x→p
lim f (x)
f (x) x→p
lim = .
x→p g(x) lim g(x)
x→p
On the other hand, if lim f (x) = A 6= 0 and g(x) > 0 with lim g(x) = 0, then
x→p x→p
as x → p, f (x) g(x) tends to ∞ if A > 0, and to −∞ if A < 0 (Exercise 5).
However, if lim f (x) = lim g(x) = 0, then unless the quotient f (x)/g(x) can
x→p x→p
somehow be simplified, previous methods may no longer be applicable.
208 Introduction to Real Analysis
f (x)
lim+ = L.
x→a g(x)
Remark. The analogous result where x → b− is obviously also true. A more
elementary version of l’Hospital’s rule that relies only on the definition of
the derivative is given in Exercise 2. Also, Exercise 7 provides
examples of
two functions f and g satisfying (a) for which lim (f (x) g(x)) exists but
x→a
lim (f ′ (x) g ′ (x)) does not exist.
x→a
Proof. Suppose (a) holds. We first prove the case where a is finite. Let {xn }
be a sequence in (a, b) with xn → a and xn 6= a for all n. Since we want to
apply the generalized mean value theorem to f and g on the interval [a, xn ],
we need both f and g continuous at a. This is accomplished by setting
f (a) = g(a) = 0.
Then by hypothesis (a), f and g are continuous at a. Thus by the generalized
mean value theorem, for each n ∈ N there exists cn between a and xn such
that
[f (xn ) − f (a)]g ′ (cn ) = [g(xn ) − g(a)]f ′ (cn ),
or
f (xn ) f ′ (cn )
= ′ .
g(xn ) g (cn )
Note, since g ′ (x) 6= 0 for all x ∈ (a, b), g(xn ) 6= g(a) for all n As n → ∞,
cn → a+ . Thus by Theorem 4.1.3 and the hypothesis,
f (xn ) f ′ (cn ) f ′ (x)
lim = lim ′ = lim+ ′ = L.
n→∞ g(xn ) n→∞ g (cn ) x→a g (x)
Since the above holds for every sequence {xn } with xn → a+ , the result
follows.
Suppose a = −∞. To handle this case, we make the substitution x = −1/t.
Then as t → 0+ , x → −∞. Define the functions ϕ(t) and ψ(t) on (0, c) for
some c > 0 by
ϕ(t) = f (−1/t) and ψ(t) = g(−1/t).
Differentiation 209
Therefore
f (y) g(y)
lim+ +r 1− =r<β
x→a g(x) g(x)
210 Introduction to Real Analysis
If L = −∞, then for any β ∈ R, there exists c3 such that (6) holds for all
x, a < x < c3 . Thus by definition,
f (x)
lim = −∞.
x→a+ g(x)
If L is finite, then given ǫ > 0, by taking β = L + ǫ, there exists c3 such that
f (x)
<L+ǫ for all x, a < x < c3 . (7)
g(x)
Remarks. (a) The proof of case (a) could have been done similarly to that of
(b), treating the case where a is finite and −∞ simultaneously. I chose not to
do so since making the substitution x = −1/t is a useful technique, reducing
problems involving limits at −∞ to right limits at 0. Conversely, limits at 0
can be transformed to limits at ±∞ with the substitution x = 1/t. These
new limits are in many instances easier to evaluate than the original. This is
illustrated in Example 5.3.4(c).
Differentiation 211
(b) In hypothesis (b) we only required that lim g(x) = ±∞. If lim f (x)
x→a+ x→a+
is finite, then it immediately follows that
f (x)
lim = 0,
x→a+ g(x)
and l’Hospital’s rule is not required (Exercise 4). Thus in practice, hypothesis
(b) of l’Hospital’s rule is used only if both f and g have infinite limits.
For convenience we stated and proved l’Hospital’s rule in terms of right
limits. Since the analogous results for left limits are also true, combining the
two results gives the following corollary.
f ′ (x)
lim = L, where L ∈ R ∪ {−∞, ∞}.
x→p g ′ (x)
If
(a) lim f (x) = lim g(x) = 0, or
x→p x→p
(b) lim g(x) = ±∞, then
x→p
f (x)
lim = L.
x→p g(x)
ln(1 + x)
EXAMPLES 5.3.4 (a) Consider lim , where ln is the natural
x
x→0+
logarithm function on (0, ∞). This limit is indeterminate of form 0/0. With
f (x) = ln(1 + x) and g(x) = x,
f ′ (x) 1
lim+ ′
= lim+ = 1.
x→0 g (x) x→0 1 + x
ln(1 + x)
Thus by l’Hospital’s rule lim = 1.
x→0+ x
Although l’Hospital’s rule provides an easy method for evaluating this
limit, the result can also be obtained by using previous techniques. In Example
5.2.7 we proved that
x
≤ ln(1 + x) ≤ x
1+x
for all x > −1. Thus
1 ln(1 + x)
≤ ≤1
1+x x
212 Introduction to Real Analysis
ln(1 + x)
for all x > 0. Thus by Theorem 4.1.9 lim = 1.
x
x→0+
1 − cos x
(b) In this example, we consider lim . This is indeterminate of
x→0 x2
form 0/0. If we apply l’Hospital’s rule we obtain
sin x
lim
x→0 2x
which is again indeterminate of form 0/0. However, applying l’Hospital’s rule
one more time gives
cos x 1
lim =
x→0 2 2
1 − cos x
. Therefore lim = 21 .
x→0 x2
(c) Consider
1
e− x
lim+ .
x→0 x
Since lim+ e−1/x = 0, the above limit is indeterminate of form 0/0. If we
x→0
apply l’Hospital’s rule we obtain
1
e− x
lim ,
x→0+ x2
and this limit is more complicated than the original limit. However, if we let
t = 1/x, then
1
e− x t
lim+ = lim t .
x→0 x t→∞ e
Exercises 5.3
1. Provide definitions for each of the following limits:
a. lim f (x) = ∞. b. lim f (x) = ∞.
x→p+ x→∞
3. Let h(x) be defined on (−∞, b). Show that there exists a c > 0 such that
ϕ(t) = h(−1/t) is defined on (0, c), and that lim h(x) = lim ϕ(t).
x→−∞ t→0+
4. *Let f, g be real-valued functions defined on (a, b). If lim f (x) exists in
x→a+
R and lim g(x) = ∞, prove that
x→a+
f (x)
lim = 0.
x→a+ g(x)
5. Suppose f, g are real-valued functions on (a, b) satisfying lim f (x) =
x→a+
A 6= 0, lim g(x) = 0, and g(x) > 0 for all x ∈ (a, b). If A > 0, prove
x→a+
that
f (x)
lim = ∞.
x→a+ g(x)
6. Use l’Hospital’s rule and any of the differentiation formulas from calculus
to find each of the following limits. In the following, ln x, x > 0 denotes
the natural logarithm function.
x5 + 2x − 3 x5 + 2x − 3
*a. lim 3 2
. b. lim .
x→1 2x − x − 1 x→−1 2x3 + x2 + 1
ln x 1 − cos 2x
*c. lim . d. lim .
x→∞ x x→0 sin x
ln(1 + x)
*e. lim xa ln, x where a > 0. f. lim .
x→0+ x→0 sin x
a p
(ln x) (ln x)
*g. lim , where a > 0. h. lim , p, q ∈ R.
x→∞ ex x→∞ xq
1 1
*i. lim − . j. lim x1/x
x→0+ x sin x x→∞
f (x)
7. Let f (x) = x2 sin(1/x) and g(x) = sin x. Show that lim exists but
x→0 g(x)
′
f (x)
that lim ′ does not exist.
x→0 g (x)
p(x)
8. Investigate lim , where p and q are polynomials of degree n and m,
x→∞ q(x)
respectively.
9. Let f (x) = (sin x)/x for x 6= 0, and f (0) = 1.
*a. Show that f ′ (0) exists, and determine its value.
*b. Show that f ′′ (0) exists, and determine its value.
10. Let f be defined on R by
− 1
(
e x2 , x 6= 0,
f (x) =
0, x = 0.
Prove that f (n) (0) = 0 for all n = 1, 2, ....
214 Introduction to Real Analysis
FIGURE 5.7
Newton’s method
f (x) = x2 − α.
√
If α > 1, then f has exactly one zero on [0, α], namely √α. If 0 < α < 1, then
the zero of f lies in [0, 1]. Let c1 be an initial guess to α. Then by formula
(8), for n ≥ 1,
c2n − α
1 α
cn+1 = cn − = cn + .
2cn 2 cn
This is exactly the sequence of Exercise 6 of Section
√ 3.3, where the reader
was asked to prove that the sequence converges to α. With α = 2, taking
c1 = 1.4 as an initial guess, yields
c2 = 1.4142857,
c3 = 1.4142135,
f (cn )
cn+1 = cn − , n ∈ N,
f ′ (cn )
is in I, and lim cn = c. Furthermore,
n→∞
M
|cn+1 − c| ≤ |cn − c|2 . (9)
2m
Prior to proving Theorem 5.4.2 we first state and prove the following
lemma. The result is in fact a special case of Taylor’s theorem (8.7.16) that
will be discussed in detail in Chapter 8.
Define g on [a, b] by
Therefore α = 21 f ′′ (ζ)
Proof of Theorem 5.4.2 Since f (a)f (b) < 0 and f ′ (x) 6= 0 for all x ∈ [a, b],
f has exactly one zero c in the interval (a, b).
Let x0 ∈ [a, b] be arbitrary. By Lemma 5.4.3 there exists a point ζ between
c and x0 such that
1
0 = f (c) = f (x0 ) + f ′ (x0 )(c − x0 ) + f ′′ (ζ)(c − x0 )2 ,
2
or
1
−f (x0 ) = f ′ (x0 )(c − x0 ) + f ′′ (ζ)(c − x0 )2 . (10)
2
If x1 is defined by
f (x0 )
x1 = x0 − ′ ,
f (x0 )
then by equation (10)
1 f ′′ (ζ)
x1 = x0 + (c − x0 ) + (c − x0 )2 .
2 f ′ (x0 )
Therefore
1 |f ′′ (ζ)| M
|x1 − c| = |c − x0 |2 ≤ |c − x0 |2 . (11)
2 |f ′ (x0 )| 2m
Choose δ > 0 so that δ < 2m/M and I = [c − δ, c + δ] ⊂ [a, b]. If cn ∈ I, then
|c − cn | < δ. If cn+1 is defined by (8), then by (11)
M 2
|cn+1 − c| ≤ δ < δ.
2m
218 Introduction to Real Analysis
FIGURE 5.8
An illustration of Remark (c)
M
But by our choice of δ, δ < 1, and as a consequence cn → c.
2m
Remarks. (a) For a given function f satisfying the hypothesis of the theorem,
the constants M and m, and thus δ can be determined. To determine the
interval I, one can use the method of bisection to find an approximation xn
to c satisfying |xn − xn−1 | < δ. If c1 is taken to be xn , then |c1 − c| < δ.
In practice however, one usually makes a judicious guess for c1 and proceeds
with the computations.
(b) Let en = c − cn be the error in approximating c, and let K = M/2m.
Then inequality (9) can be expressed as
|en+1 | ≤ K |en |2 .
Consequently, if |en | < 10−m , then |en+1 | < K10−2m . Thus, except for the
constant factor K, the accuracy actually doubles at each step. For this reason,
Newton’s method is usually referred to as a second order or quadratic
method.
(c) Even though Newton’s method is very efficient, there are a number of
things that can go wrong if c1 is poorly chosen. For example, in Figure 5.8,
the initial choice of c1 gives a c2 outside the interval, and the subsequent cn
tend to −∞. Such a function is given by f (x) = x/(x2 + 1). In Figure 5.9, the
initial choice of c1 causes the subsequent values to oscillate between c1 and
c2 . A function having this property is given by g(x) = x − 51 x3 . Taking c1 = 1
Differentiation 219
FIGURE 5.9
An example of oscillating c1 and c2
gives c2 = −1, c3 = 1, etc. For this reason, the initial choice of c1 for many
functions has to be sufficiently close to c in order to be sure that the method
works.
Exercises 5.4
1. *For α > 0, apply Newton’s method to f (x) = x3 −α to obtain a sequence
{cn } that converges to the cube root of α.
2. Use Newton’s method to find approximations to the roots, accurate to
six decimal places, of the given functions on the interval [0, 1].
*a. f (x) = x3 − 3x + 1. b. f (x) = 3x3 − 5x2 + 1
3 2
c. f (x) = 8x − 8x + 1.
3. Use Newton’s method to approximate the real zeros of f (x) = x4 − 4x − 3
accurate to four decimal places
4. Show that f (x) = ln x−x+3, x ∈ (0, ∞) has two real zeros. Use Newton’s
method to approximate them accurate to four decimal places.
5. Let f : [a, b] → R be differentiable on [a, b] with f (a) < 0 < f (b). Suppose
there exist constants m and M such that 0 < m ≤ f ′ (x) ≤ M for all
x ∈ [a, b]. Let c1 ∈ [a, b] be arbitrary, and define
f (cn )
cn+1 = cn − .
M
Prove that the sequence {cn } converges to the unique zero of f on [a, b].
(Hint: Consider the function g(x) = x − f (x)/M .)
Notes
Without question the most significant result of this chapter is the mean value theo-
rem. The simplicity of its proof disguises the importance and usefulness of the result.
The theorem allows us to obtain information about the function from its derivative.
220 Introduction to Real Analysis
This has many applications as was illustrated by the subsequent theorems and ex-
ercises. Additional applications will be encountered throughout the text.
Although the mean value theorem is attributed to Lagrange, his proof, that ap-
peared around 1772, was based on the false assumption that every function could be
expanded in a power series. Cauchy, in his 1823 text Résumé des Lecons donnees a
L’École Royale Polytechnique sur le Calcul Infinitésimal, used the modern definition
of the derivative to provide a proof of the mean value theorem. His statement and
proof of the theorem however differs from the version in the text in that he assumed
continuity of the derivative. What Cauchy actually
proved was that if f ′ is contin-
uous on [a, b], then the quantity {f (b) − f (a)} {b − a} lies between the minimum
and maximum values of f ′ on [a, b]. (See Miscellaneous Exercise 4.) Then by the
intermediate value theorem (Theorem 4.2.11) applied to the continuous function f ′ ,
there exists c ∈ (a, b) such that
It is worth noting that our proof of the mean value theorem depends ultimately on
the completeness property of R (through Rolle’s theorem and Corollary 4.2.9).
The mean value theorem can justifiably be called the fundamental theorem of
differential calculus. It allowed the development of rigorous proofs of many results
that were previously taken as fact or “proved” from geometric constructions. Al-
though the modern proof of l’Hospital’s rule uses the mean value theorem, it should
be remembered that the original version for calculating the limit of a quotient where
both the numerator and denominator become zero first appeared in 1696, 70 years
before Lagrange’s proof of the mean value theorem. The original version was stated
and “proved” in a purely geometric manner without reference to limits. For further
details, including a history of calculus, the reader is referred to the text by Katz
listed in the Bibliography.
The Bernoulli brothers, Jakob (1654–1705) and Johann (1667–1748) were among
the first mathematicians in Europe to use the new techniques of Newton and Leib-
niz in the study of curves and related physical problems. Among these were finding
the equations of the catenary and isochrone.3 Both brothers also contributed to the
study of differential equations by solving the Bernoulli equation y ′ +P (x)y = Q(x)y n .
Through their numerous publications and correspondence with other mathemati-
cians the Bernoulli brothers helped to establish the utility of the new calculus. The
first text on differential calculus by l’Hospital also contributed significantly to pop-
ularizing the subject.
Leonhard Euler (1707–1783), one of the most prolific mathematicians in his-
tory, contributed significantly to establishing calculus as an independent science.
Even though the calculus of exponential and logarithmic functions was basically
developed by Johann Bernoulli, it was Euler’s expositions on these topics in the
eighteenth century that brought them into the mainstream of mathematics. Much
of what we know today about the exponential, logarithmic and trigonometric func-
tions is due to Euler. He was also among the first mathematicians to define the
concept of a function. However to Euler, as for the other mathematicians of that
3 The catenary problem involves finding the equation of a freely hanging cable, whereas
the isochrone problem involves finding the equation of a curve along which an object would
fall with uniform vertical velocity.
Differentiation 221
period, a function was one that had a power series expansion. It is important to note
that most mathematicians of the eighteenth century, including Euler, were primarily
concerned with computations needed for the applications of calculus; proofs did not
gain prominence until the nineteenth century. For this reason, numerous results of
that era that were assumed to be true were subsequently proved to be true only
under more restrictive conditions.
Miscellaneous Exercises
1. Let f : (a, b) → R and suppose f ′′ exists at xo ∈ (a, b). Prove that
f (xo + h) + f (xo − h) − 2f (xo )
f ′′ (xo ) = lim .
h→0 h2
Give an example where this limit exists at xo but f ′′ does not exist at
xo .
2. Let f be a real-valued differentiable function on R.
a. If there exists a constant b < 1 such that |f ′ (x)| < b for all x ∈ R,
prove that f has a fixed point in R. (See Exercise 13 of Section 4.3).
b. Show that the function f (x) = x + (1 + ex )−1 satisfies |f ′ (x)| < 1 for
all x ∈ R but that f has no fixed point in R.
3. A function f is convex (or concave up) on the interval (a, b) if for any
x, y ∈ (a, b), and 0 < t < 1, f (tx + (1 − t)y) ≤ t f (x) + (1 − t) f (y).
a. If f is convex on (a, b), prove that f is continuous on (a.b).
′ ′
b. If f is convex on (a, b), prove that f+ (p) and f− (p) exist for every
p ∈ (a, b). Show by example that a convex function on (a, b) need not be
differentiable on (a, b).
c. Suppose f ′′ (x) exists for all x ∈ (a, b). Prove that f is convex on (a, b)
if and only if f ′′ (x) ≥ 0 for all x ∈ (a, b).
4. Suppose f is differentiable on [a, b] and that f ′ is continuous on [a, b].
Without using the mean value theorem, prove that
f (b) − f (a)
min{f ′ (x) : x ∈ [a, b]} ≤ ≤ max{f ′ (x) : x ∈ [a, b]}.
b−a
5. (T.M. Flett) If f is differentiable on [a, b] and f ′ (a) = f ′ (b), prove that
there exists ζ ∈ (a, b) such that
f (ζ) − f (a)
= f ′ (ζ).
ζ −a
6. Suppose f : (a, b) → R is differentiable at c ∈ (a, b). If {sn } and {tn } are
sequences in (a,b) with sn < c < tn and lim (tn − sn ) = 0, prove that
n→∞
f (tn ) − f (sn )
lim = f ′ (c).
n→∞ tn − s n
222 Introduction to Real Analysis
Supplemental Reading
Baxley, J. V. and Hayashi, E. K., “In- Miller, A. D. and Vyborny, R., “Some
determinate forms of exponential type,” remarks on functions with one-sided
Amer. Math. Monthly 85 (1978), 484– derivatives” Amer. Math. Monthly 93
486. (1986), 471–475.
Cajori, F., “Historical note on the Pan, D., “A maximum principle for
Newton-Raphson method of approxima- high-order derivatives,” Amer. Math.
tion,” Amer. Math. Monthly 18 (1911), Monthly 120 (2013), 846–848.
29–33. Range, R. M., “Where are lim-
Corless, R. M., “Variations on a its needed in Calculus,” Amer. Math.
theme of Newton,” Math. Mag. 71 Monthly 118 (2011), 404–417.
(1998), 34–41. Rosenholtz, “There is no differen-
Flett, T.M., “A mean value theo- tiable metric for Rn ,” Amer. Math.
rem,” Math. Gaz. 42 (1958), 38 – 39. Monthly 86 (1979), 585–586.
Hall, W. S. and Newell, M. L., “The Rotando, L. M. and Korn, H., “The
mean value theorem for vector valued indeterminate form 00 ,” Math. Mag. 50
functions,” Math. Mag. 52 (1979), 157– (1977), 41–42.
158. Sahoo, M. R., “Example of a mono-
Hartig, Donald, “L’Hopitals rule via tonic everywhere differentiable function
integration,” Amer. Math. Monthly 98 on R whose derivative is not continuous,”
(1991), 156–157. Amer. Math. Monthly 120 (2013), 566–
Katznelson, Y. and Stromberg, K., 568.
“Everywhere differentiable, nowhere Tandra, H., “A yet simpler proof of
monotone function,” Amer. Math. the chain rule,” Amer. Math. Monthly
Monthly 81 (1974), 349–354. 120 (2013), 900.
Langlois, W. E. and Holder, L. I., Thurston, H. A., “On the defini-
′
“The relation of f+ (a) to ”f ′ (a+),” tion of the tangent line,” Amer. Math.
Math. Mag. 39 (1966), 112–120. Monthly 71 (1964), 1099–1103.
Lynch, M., “A continuous function Tong, J. and Braza, P. A., “A con-
that is differentiable only at the ratio- verse to the mean value theorem,” Amer.
nals,” Math. Mag. 86 (2013), 132–135. Math. Monthly 104 (1997), 939–942.
6
Integration
When Newton and Leibniz developed the calculus, both considered integration
as the inverse operation of differentiation. For example, in the De analysi1 ,
Newton proved that the area under the curve y = axm/n (m/n 6= −1) is given
by
m
an x n +1
m+n
by using his differential calculus to prove that if A(x) represents the area from
0 to x then A′ (x) = axm/n . Even though Leibniz arrived at the concept of the
integral by using sums to compute the area, integration itself was always the
inverse operation of differentiation. Throughout the eighteenth century, the
Rb
definite integral of a function f (x) on [a, b], denoted a f (x)dx, was defined
as F (b) − F (a) where F was any function whose derivative was f (x). This
remained as the definition of the definite integral until the 1820’s.
The modern approach to integration is again due to Cauchy, who was the
first mathematician to construct a theory of integration based on approxi-
mating the area under the curve. Euler had previously used sums of the form
Pn
f (xk−1 )(xk − xk−1 ) to approximate the integral of a function f (x) in situ-
k=1
ations where the function F (x) could not be computed. Cauchy however used
limits of such sums to develop a theory of integration that was independent of
the differential calculus. One of the difficulties with Cauchy’s definition of the
integral was that it was very restrictive; only functions that were continuous or
continuous except at a finite number of points were proved to be integrable.
However, one of the key achievements of Cauchy was that using his defini-
tion he was able to prove the fundamental theorem of calculus; specifically,
if f is continuous on [a, b], then there exists a function F on [a, b] such that
F ′ (x) = f (x) for all x ∈ [a, b].
The modern definition of integration was developed in 1853 by Georg
Bernhard Riemann (1826–1866). Riemann was led to the development of the
integral by trying to characterize which functions were integrable accord-
ing to Cauchy’s definition. In the process, he modified Cauchy’s definition
and developed the theory of integration which now bears his name. One of
his achievements was that he was able to provide necessary and sufficient
1 The Mathematical Works of Isaac Newton, edited by D. T. Whiteside, Johnson Reprint
223
224 Introduction to Real Analysis
L(P, f ) ≤ U(P, f )
for any partition P of [a, b]. The upper sum for a nonnegative continuous
function f is illustrated in Figure 6.1. In this case, U (P, f ) represents the
circumscribed rectangular approximation to the area under the graph of f .
Similarly the lower sum represents the inscribed rectangular approximation
to the area under the graph of f .
FIGURE 6.1
Upper sum U (P, f )
226 Introduction to Real Analysis
for any partition P of [a, b]. To see that (1) holds, let P = {x0 , ..., xn } be any
partition of [a, b]. Since Mi ≤ M for all i = 1, ..., n,
n
X n
X
U (P, f ) = Mi ∆xi ≤ M (xi − xi−1 ) = M (b − a).
i=1 i=1
Similarly L(P, f ) ≥ m (b−a). Thus the set {U (P, f ) : P is a partition of [a, b]}
is bounded above and below, as is the set {L(P, f )}.
Since the sets {U (P, f )} and {L(P, f )} are nonempty and bounded, the
lower and upper integrals of a bounded function f : [a, b] → R always exist.
Rb Rb
Our first goal is to prove that a f ≤ a f for any bounded real-valued function
f on [a, b]. To this end we make the following definition.
Since f (t) ≤ Mk for all t ∈ [xk−1 , xk ], we have that f (t) ≤ Mk for all t ∈
[xk−1 , x⋆ ] and also for all t ∈ [x⋆ , xk ]. Thus both Mk1 and Mk2 are less than or
equal to Mk . Now
k−1
X n
X
U (P ⋆ , f ) = Mj ∆xj + Mk1 (x⋆ − xk−1 ) + Mk2 (xk − x⋆ ) + Mj ∆xj .
j=1 j=k+1
But
Mk1 (x⋆ − xk−1 ) + Mk2 (xk − x⋆ ) ≤ Mk ∆xk .
Therefore
U (P ⋆ , f ) ≤ U(P, f ).
The proof for the lower sum is similar. If P ⋆ contains k more points than P,
we need only repeat the above argument k times to obtain the result.
for any partition Q. Taking the infimum over Q gives the result.
Suppose a < b. If P = {x0 , ..., xn } is any partition of [a, b], then mi = 0 and
Mi = 1 for all i = 1, ..., n. Thus
FIGURE 6.2
The function of Example 6.1.6(b)
Thus
n
X
L(P, f ) = ∆xi = (1 − xk ), and
i=k+1
Xn
U (P, f ) = ∆xi = (1 − xk−1 )
i=k
1
Since 1 − xk ≤ 2< 1 − xk−1 , L(P, f ) ≤ 21 ≤ U (P, f ) for all partitions
R1 R1
P of [0, 1]. Thus 0 f ≤ 12 ≤ 0 f . Hence if f is integrable on [0, 1], then
R1
f = 1/2. We conclude by proving that f is indeed integrable on [0, 1]. Since
0
1 − xk−1 = (1 − xk ) + (xk − xk−1 ), we have
U (P, f ) = L(P, f ) + (xk − xk−1 ).
230 Introduction to Real Analysis
Let ǫ > 0 be arbitrary. If P is any partition of [0, 1] with ∆xi < ǫ for all i,
then U (P, f ) < L(P, f ) + ǫ. Thus
Z 1
f = inf U (Q, f ) ≤ U(P, f )
0 Q
Z 1
≤ L(P, f ) + ǫ ≤ sup L(Q, f ) + ǫ ≤ f + ǫ,
Q 0
where the infimum and supremum are taken over all partitions Q of [0, 1].
R1 R1
Since ǫ > 0 was arbitrary, we have 0 f = 0 f . Therefore f is integrable on
R1
[0, 1] with f = 1/2.
0
(c) We now provide another example to illustrate how tedious even a trivial
integral can be if one relies only on the definition of the integral. Luckily, the
fundamental theorem of calculus (Theorem 6.3.2) will allow us to avoid such
tedious computations. Let f (x) = x, x ∈ [a, b], where for the purpose of
illustration we take a ≥ 0 (Figure 6.3). Interpreting the integral as the area
under the curve, we intuitively see that
Z b
1 1
x dx = (b − a)(b + a) = (b2 − a2 ).
a 2 2
This is obtained from the formula for the area of a parallelogram. Let P =
{xo , x1 , ..., xn } be any partition of [a, b]. Since f (x) = x is increasing on [a, b],
mi = f (xi−1 ) = xi−1 , and Mi = f (xi ) = xi .
FIGURE 6.3
Example 6.1.6(c)
Therefore
n
X n
X
L(P, f ) = xi−1 ∆xi and U (P, f ) = xi ∆xi .
i=1 i=1
Integration 231
But
n
X 1 1 2 1 2
x2i − x2i−1 = xn − x2o = b − a2 .
i=1
2 2 2
Finally, if we take the supremum and infimum over all partitions P of [a, b],
we have
Z b Z b
1 2 2
x dx ≤ b −a ≤ x dx.
a 2 a
To prove that f is integrable on [a, b], we note that for any partition P of
[a, b],
Z b Z b n
X
0≤ x dx − x dx ≤ U(P, f ) − L(P, f ) = (xi − xi−1 )∆xi
a a i=1
≤ max ∆xi (b − a).
1≤i≤n
Let ǫ > 0 be given. If we choose a partition P such that ∆xi < ǫ (b−a+1) for
Rb Rb
all i, then a x dx − a x dx < ǫ. Since this holds for every ǫ > 0, we conclude
Rb Rb
that a x dx = a x dx. Thus f (x) = x is Riemann integrable on [a, b] with
Rb
x dx = 12 (b2 − a2 ).
a
(d) Consider the function f (x) = x2 , x ∈ [0, 1]. For n ∈ N, let Pn be
the partition {0, n1 , n2 , ..., 1}. Since f is increasing on [0, 1], its infimum and
supremum on each interval [ i−1 i
n ,n ] are attained at
the left and right endpoint
respectively, with mi = (i − 1) n2 and Mi = i2 n2 . Since ∆xi = n1 for all i,
2
1 2
L(Pn , f ) = [1 + 22 + · · · + (n − 1)2 ], and
n3
1
U (Pn , f ) = 3 [12 + 22 + · · · + n2 ].
n
232 Introduction to Real Analysis
R1
Therefore f (x) = x2 is integrable on [0, 1] with x2 dx = 1/3.
0
Proof. (a) Let ǫ > 0 be given. Choose η > 0 such that (b − a)η < ǫ. Since f
is continuous on [a, b], by Theorem 4.3.4 f is uniformly continuous on [a, b].
Thus there exists a δ > 0 such that
for all x, t ∈ [a, b] with |x − t| < δ. Choose a partition P of [a, b] such that
∆xi < δ for all i = 1, 2, ..., n. Then by (3),
Mi − mi ≤ η
(b − a)
[f (b) − f (a)] < ǫ.
n
For this n and corresponding partition P, U (P, f ) − L(P, f ) < ǫ. Thus f is
integrable on [a, b].
Proof. Since ϕ is continuous on the closed and bounded interval [c, d], ϕ is
bounded and uniformly continuous on [c, d]. Let K = sup{|ϕ(t)| : t ∈ [c, d]},
and let ǫ > 0 be given. Set ǫ′ = ǫ/(b − a + 2K).
Since ϕ is uniformly continuous on [c, d], there exists δ, 0 < δ < ǫ′ , such
that
|ϕ(s) − ϕ(t)| < ǫ′ (4)
for all s, t ∈ [c, d] with |s − t| < δ. Furthermore, since f ∈ R[a, b], by Theorem
6.1.7 there exists a partition P = {x0 , x1 , ..., xn } of [a, b] such that
m∗k = inf{ϕ(f (t)) : t ∈ [xk−1 , xk ]} and Mk∗ = sup{ϕ(f (t)) : t ∈ [xk−1 , xk ]}.
We partition the set {1, 2, ..., n} into disjoint sets A and B as follows:
Lebesgue’s Theorem
In Theorem 6.1.8(a) we proved that every continuous function on [a, b] is
Riemann integrable on [a, b]. By Exercise 16, this is also true for every bounded
function on [a, b] that is continuous except at a finite number of points. On the
other hand, as a consequence of Theorem 6.1.8(b), every monotone function on
[a, b] is Riemann integrable. Hence for example, if {rn }∞ n=1 is P
an enumeration
of the rational numbers in [0, 1] and cn > 0 are such that cn converges,
then by Theorem 4.4.10
∞
X
f (x) = cn I(x − rn )
n=1
236 Introduction to Real Analysis
is monotone increasing on [0, 1], and thus is Riemann integrable on [0, 1]. By
Theorem 4.4.10, the function f is continuous at every irrational number and
discontinuous at every rational number in [0,1].
We now state the beautiful result of Lebesgue which provides necessary and
sufficient conditions that a bounded real-valued function on [a, b] be Riemann
integrable. To properly state Lebesgue’s result we need to introduce the idea
of a set of measure zero. The concept of measure of a set will be treated in
detail in Chapter 10. The basic idea is that the measure of an interval is its
length. This is then used to define what we mean by measurable set and the
measure of a measurable set. At this point we only need to know what it
means for a set to have measure zero.
DEFINITION 6.1.11 A subset E of R has measure zero if given any
ǫ > 0, there exists a finite or countable collection {In }n∈A of open intervals
such that [ X
E⊂ In and ℓ(In ) < ǫ,
n∈A n∈A
where ℓ(In ) denotes the length of the interval In .
EXAMPLES 6.1.12 (a) Every finite set E has measure zero. Suppose E =
{x1 , ..., xN } is a finite subset of R. For each n = 1, 2, ..., N , as in Figure 6.4
let ǫ ǫ
In = xn − , xn + .
2N 2N
FIGURE 6.4
Example 6.1.12(a)
Then
N
[ N
X
E⊂ In and ℓ(In ) = ǫ.
n=1 n=1
Therefore E has measure zero.
(b) Every countable subset of R has measure zero. Suppose E = {xn }∞
n=1
is a countable subset of R. Let ǫ > 0 be given. For each n ∈ N, let
ǫ ǫ
In = xn − n+1 , xn + n+1 .
2 2
∞
In . Thus since ℓ(In ) = ǫ/2n ,
S
Since xn ∈ In for all n, E ⊂
n=1
∞ ∞
X X 1
ℓ(In ) = ǫ n
= ǫ.
n=1 n=1
2
Integration 237
We now state the following theorem of Henri Lebesgue, the proof of which
will be given in Section 6.7. This result appeared in 1902 and provides the most
succinct form of necessary and sufficient conditions for Riemann integrability.
(b) Let f be the Riemann integrable function on [0, 1] given in (a), and
let g : [0, 1] → R be defined by
(
0, x = 0,
g(x) =
1, x ∈ (0, 1].
Exercises 6.1
1. Let f (x) = 1 − x2 , x ∈ [−1, 2]. Find L(P, f ) and U (P, f ) for each of the
following partitions of [−1, 2].
*a. P = {−1, 0, 1, 2} b. P = {−1, − 21 , 0, 12 , 1 32 , 2}
2. Show that each of the following
R 2functions is Riemann integrable on [0, 2]
and use the definition to find 0 f .
(
1
0 ≤ x < 12 ,
−1, 0 ≤ x < 1, 1
*a. f (x) = b. f (x) = −3, 2
≤ x < 23 ,
2, 1≤x≤2 3
−2, ≤x≤2
2
3. Show that each of the following functions is Riemann integrable on [a, b],
Rb
and find a f .
0,
a≤x<c
*a. f (x) = c, (c a constant) b. f (x) = 12 , x = c,
1, c<x≤b
R1
4. Use one of the methods of Examples 6.1.6 to find 0 f for each of the
following functions f . In the following, [x] denotes the greatest integer
function.
*a. f (x) = [3x] b. f (x) = x[2x]
*c. f (x) = 3x + 1 d. f (x) = 1 − x2
Ra
5. Prove that f = 0 for any real-valued function f on [a, a].
a
Rb Rb
6. *If f, g ∈ R[a, b] with f (x) ≤ g(x) for all x ∈ [a, b], prove that f≤ g.
a a
13. *Let f be a bounded function on [a, b]. Suppose there exists a sequence
{Pn } of partitions of [a, b] such that
lim L(Pn , f ) = lim U(Pn , f ) = L, L ∈ R.
n→∞ n→∞
Rb
Prove that f is Riemann integrable on [a, b] with f = L.
a
14. *a. If f ∈ R[a, b], prove directly (without using Theorems 6.1.9 or 6.1.13)
that |f | ∈ R[a, b].
b. Let P denote the Cantor ternary set in [0, 1] and let f be defined on
[0, 1] by f (x) = 0 if x ∈ P , and f (x) = 1 elsewhere. Prove that f is
R1
Riemann integrable on [0, 1] and that f = 1.
0
Let ǫ > 0 be given. Since f, g ∈ R[a, b], there exist partitions Pf and Pg
of [a, b] such that
Z b Z b
U (Pf , f ) < f + 12 ǫ and U (Pg , g) < g + 21 ǫ.
a a
Therefore
Z b Z b Z b
(f + g) < f+ g + ǫ.
a a a
Since the above holds for all ǫ > 0,
Z b Z b Z b
(f + g) ≤ f+ g.
a a a
To prove the reverse inequality, let ǫ > 0 be given. Then there exist parti-
tions P1 and P2 of [a, c] and [c, b] respectively such that
c b
ǫ ǫ
Z Z
U (P1 , f ) < f+ and U (P2 , f ) < f+ .
a 2 c 2
Integration 243
Let P = P1 ∪ P2 . Then
Z b Z c Z b
f ≤ U(P, f ) = U (P1 , f ) + U (P2 , f ) < f+ f + ǫ.
a a c
which when combined with the previous inequality proves (7). A similar ar-
gument also proves
Z b Z c Z b
f= f+ f. (8)
a a c
Therefore f ∈ R[a, b] and identity (6) holds. Conversely, if f ∈ R[a, b], then
by (7) and (8),
Z c Z b Z c Z b
f+ f= f+ f.
a c a c
Since the lower integral of f is always less than or equal to the upper integral
of f , the above holds if and only if
Z c Z c Z b Z b
f= f and f= f.
a a c c
is called a Riemann sum of f with respect to the partition P and the points
{ti }.
244 Introduction to Real Analysis
FIGURE 6.5
A Riemann sum S(P, f ) of f
The quantity kPk is called the norm or the mesh of the partition P.
lim S(P, f ) = I
kPk→0
for all partitions P of [a, b] with kPk < δ, and all choices of ti ∈ [xi−1 , xi ].
lim S(P, f ) = I ,
kPk→0
Integration 245
Rb
then f ∈ R[a, b] and f = I. Conversely, if f ∈ R[a, b], then lim S(P, f )
a kPk→0
exists and Z b
lim S(P, f ) = f.
kPk→0 a
Proof. Suppose lim S(P, f ) = I. Let ǫ > 0 be given, and let δ > 0 be such
kPk→0
that (9) holds for all partitions P = {x0 , x1 , ..., xn } of [a, b] with kPk < δ, and
all ti ∈ [xi−1 , xi ]. By the definition of Mi , for each i = 1, ..., n, there exists
ζi ∈ [xi−1 , xi ] such that f (ζi ) > Mi − ǫ. Thus
n
X
U (P, f ) = Mi ∆xi
i=1
Xn n
X
< f (ζi ) ∆xi + ǫ ∆xi
i=1 i=1
< I + ǫ + ǫ [b − a] = I + ǫ [1 + b − a].
Suppose Q = {x0 , ..., xN }. Let δ = ǫ/N M , and let P = {y0 , ..., yn } be any
partition of [a, b] with kPk < δ. As in the definition of the integral, let
Consider any interval [yk−1 , yk ], k = 1, ..., n. This interval may or may not
contain points xi ∈ Q. Since Q contains N + 1 points, there are at most N − 1
intervals [yk−1 , yk ] which contain an xi ∈ Q, i 6= 0, N . Suppose as in Figure 6.6
{xj , ..., xj+m } ⊂ [yk−1 , yk ]. If xj 6= yk−1 , set Mk1 = sup{f (x) : x ∈ [yk−1 , xj ]}.
Similarly, if xj+m 6= yk , set Mk2 = sup{f (x) : x ∈ [xj+m , yk ]}.
Let tk ∈ [yk−1 , yk ] be arbitrary. Since |f (t) − f (s)| ≤ 2M for all t, s ∈
[yk−1 , yk ],
FIGURE 6.6
Partition of the interval [yk−1 , yk ]
Therefore
m
X
f (tk )∆yk = f (tk )(xj − yk−1 ) + f (tk )∆xj+s + f (tk )(yk − xj+m )
s=1
m
X
≤ 2M ∆yk + Mk1 (xj − yk−1 ) + Mj+s ∆xj+s + Mk2 (yk − xj+m )
s=1
< 2M δ + U (Pk , f ),
Therefore,
Z b
S(P, f ) − f < 3ǫ.
a
Since this holds for any partition P = {y0 , ..., yn } of [a, b] with kPk < δ and
any choice of tk ∈ [yk−1 , yk ], k = 1, ..., n,
Z b
lim S(P, f ) = f.
kPk→0 a
Integration 247
EXAMPLE 6.2.7 In this example, we will use the method of Riemann sums
Rb
to evaluate x dx. Since f (x) = x is Riemann integrable on [a, b],
a
Z b
x dx = lim S(P, f ).
a kPk→0
Since the limit exists for any ti ∈ [xi−1 , xi ], we can take ti = (xi−1 + xi )/2.
With this choice of ti ,
n n
1X 1X 2 1
S(P, f ) = (xi−1 + xi )(xi−1 − xi ) = (xi−1 − x2i ) = (b2 − a2 ),
2 i=1 2 i=1 2
Exercises 6.2
1. Prove Theorem 6.2.1(b).
Rb
2. *a. Use the method of Riemann sums to evaluate x2 dx.
a
Rb
b. Use the method of Riemann sums to evaluate xn dx, n ∈ N, n ≥ 3.
a
3. *a. Let f be a real-valued function on [a, b] such that f (x) = 0 for all
Rb
x 6= c1 , ..., cn . Prove that f ∈ R[a, b] with f = 0.
a
*b. Let f, g ∈ R[a, b] be such that f (x) = g(x) for all but a finite number
Rb Rb
of points in [a, b]. Prove that f = g.
a a
c. Is the result of (a) still true if f (x) = 0 for all but countably many
points in [a, b]?
4. Let f ∈ R[−a, a], a > 0. Prove each of the following:
Ra Ra
a. If f is even (i.e. f (−x) = f (x) for all x ∈ [−a, a]), then f =2 f.
−a 0
Ra
b. If f is odd (i.e. f (−x) = −f (x) for all x ∈ [−a, a]), then f = 0.
−a
7. Use the previous exercise to evaluate each of the following limits. (You
may use any applicable methods from calculus to evaluate the definite
integrals.)
n n n
1 X 2 X k X n
*a. lim 3 k b. lim 2 2
*c. lim
n→∞ n n→∞ n +k n→∞ n2 + k 2
k=1 k=1 k=1
m
1 X p 2
d. lim 3 k k + n2
n→∞ n
k=1
R1
where xn = n/(n + 1), n ∈ N. Find f (x)dx. (Leave your answer in the
0
form of an infinite series.)
9. Suppose f ∈ R[a, b] and c ∈ R. Define fc on [a − c, b − c] by fc (x) =
f (x + c). Prove that fc ∈ R[a − c, b − c] with
Z b−c Z b
fc (x)dx = f (x)dx.
a−c a
provides a major tool for the evaluation of Riemann integrals. We begin with
the following definition.
Since
n
X
L(P, f ) ≤ f (ti )∆xi ≤ U(P, f ),
i=1
we have
Z b Z b
f (x) dx ≤ F (b) − F (a) ≤ f (x) dx.
a a
Rx
If 0 ≤ x ≤ 1, then F (x) = 1 dt = x. On the other hand, if 1 < x ≤ 2, then
0
1 x
1 2 3
Z Z
F (x) = 1 dt + (1 − t) dt = x −x+ .
0 1 2 2
Thus (
x, 0 ≤ x ≤ 1,
F (x) = 1 2 3
2x −x+ 2, 1 < x ≤ 2.
Even though f is not continuous at x = 1, the function F is continuous
everywhere. This in fact is always the case. (See Exercise 2.)
Integration 251
F ′ (c) = f (c).
Therefore,
c+h
F (c + h) − F (c) 1
Z
− f (c) = f (t) dt − f (c)
h h c
c+h
1
Z
= [f (t) − f (c)] dt.
h c
Let ǫ > 0 be given. Since f is continuous at c, there exists a δ > 0 such that
Thus
F (c + h) − F (c)
lim+ = f (c),
h→0 h
F+′ (c)
i.e., = f (c). Similarly, if f is continuous at c ∈ (a, b], F−′ (c) = f (c),
which proves the result.
252 Introduction to Real Analysis
′
Since f is continuous, F (x) = f (x) for all x ∈ [a, b]. As a consequence,
we obtain the following more elementary version of Theorem 6.3.2 normally
encountered in the study of calculus: If f is continuous on [a, b] and G is any
Rb
antiderivative of f , then f = G(b) − G(a).
a
(b) Integrability of a function f on [a, b] does not imply the existence of
an antiderivative
Rx of f . For example, if f is monotone increasing on [a, b] and
F (x) = a f , then for any c ∈ (a, b), F+′ (c) = f (c+) and F−′ (c) = f (c−)
(Exercise 15). Thus if f is not continuous at c, the derivative of F does not
exist at c.
Since f (t) = 1/t is continuous on (0, ∞), by Theorem 6.3.4 L(x) satisfies
L′ (x) = 1/x for all x > 0. Furthermore, since L′ (x) > 0 for all x ∈ (0, ∞), L
is strictly increasing on (0, ∞).
We now prove that the function L(x) satisfies the usual properties of a
logarithm function; namely
(a) L(a b) = L(a) + L(b) for all a, b > 0,
(b) L(1/b) = −L(b), b > 0, and
(c) L(br ) = r L(b), b > 0, r ∈ R.
To prove (a), consider the function L(ax), x > 0. By the chain rule (The-
orem 5.1.6).
d 1 1
L(ax) = a = = L′ (x).
dx ax x
Thus by Theorem 5.2.9, L(ax) = L(x) + C for some constant C. From the
definition of L we have L(1) = 0. Therefore,
L(a) = L(1) + C = C.
Hence L(ax) = L(a) + L(x) for all x > 0, which proves (a). The proof of (b)
Integration 253
proceeds analogously. It is worth noting that for the proof of (a) and (b) we
only used the fact that L′ (x) = 1/x and L(1) = 0.
To prove (c), if n ∈ N, then by (a) L(bn ) = n L(b). Also by (b),
1
L(b−n ) = n L = −n L(b).
b
√
Therefore,
√ L(bn ) = n√ L(b) for all n ∈ Z. Consider L( n b) where n ∈ N. Since
n L( n b) = L(b), L( n b) = n1 L(b). Therefore
L(br ) = r L(b)
for all r ∈ Q. Since L is continuous the above holds for all r ∈ R.
Our final step will be to prove that L(e) = 1, where e is Euler’s number
of Example 3.3.5. To accomplish this we use the definition to compute the
derivative of L at 1. Since L′ (1) exists,
L(1 + n1 ) − L(1)
1 = L′ (1) = lim 1
n→∞
n
= lim n L(1 + n1 )
n→∞
= lim L((1 + n1 )n ) = L(e).
n→∞
The last equality follows by the continuity of L and the definition of e. There-
fore L(e) = 1 and the function L(x) is the logarithm function to the base e.
This function is usually denoted by loge x or ln x, and is called the natural
logarithm function.
Proof. Since f, g are differentiable on [a, b], they are continuous and thus also
integrable on [a, b]. Therefore by Theorem 6.2.1(c), f g ′ and gf ′ are integrable
on [a, b]. Since
(f g)′ = g f ′ + f g ′ ,
the function (f g)′ ∈ R[a, b]. By the fundamental theorem of calculus (Theo-
rem 6.3.2),
Z b Z b Z b
′ ′
f (b)g(b) − f (a)g(a) = (f g) = gf + f g′ ,
a a a
d
F (ϕ(t)) = F ′ (ϕ(t))ϕ′ (t) = f (ϕ(t))ϕ′ (t)
dt
for all t ∈ I. Therefore by Theorem 6.3.2
Z b Z ϕ(b)
′
f (ϕ(t))ϕ (t) dt = F (ϕ(b)) − F (ϕ(a)) = f (s) ds.
a ϕ(a)
2 2
t 1
Z Z
dt = f (ϕ(t))ϕ′ (t) dt,
0 1 + t2 2 0
5
1 1 1
Z
= dx = ln 5.
2 1 x 2
√ √ √
(b) For integrals involving a2 − x2 , x2 − a2 , and x2 + a2 , an appro-
priate trigonometric substitution isZuseful in evaluating the integral. We illus-
ap
trate this with the following. Find a2 − x2 dx. We make the substitution
0
π
x = a sin t, 0≤t≤ .
2
√
Then dx = a cos t dt and a2 − x2 = a cos t. Thus
Z ap Z π2
a2 − x2 dx = a2 cos2 t dt,
0 0
π π
a2 a2 a2 π
1
Z 2 2
= (1 + cos 2t)dt = t + sin 2t = .
2 0 2 2 0 4
Exercises 6.3
1. Sketch both the graph of f (x) and the graph of F (x) of Example 6.3.3
(c).
Rx
2. *Let f ∈ R[a, b]. For x ∈ [a, b], set F (x) = f . Prove that F is continuous
a
on [a, b].
Rx
3. For x ∈ [0, 1], find F (x) = f (t)dt for each of the following functions f
0
defined on [0, 1]. In each case verify that F is continuous on [0, 1], and
that F ′ (x) = f (x) at all points where f is continuous.
(
3 1, 0 ≤ x < 21 ,
a. f (x) = x − 3x + 5 *b. f (x) = 1
−2, 2
≤x<1
c. f (x) = x − [3x] *d f (x) = x[3x].
256 Introduction to Real Analysis
(
t, 0 ≤ t < 1,
4. Let f (t) be defined by f (t) = , and let F (x) be
b − t2 , 1 ≤ t ≤ 2,
Rx
defined by F (x) = f (t) dt, 0 ≤ x ≤ 2.
0
a. Find F (x).
b. For what value of b in the definition of f is F (x) differentiable for all
x ∈ [0, 2].
5. Let f be a continuous real-valued function on [a, b] and define H on [a, b]
Rb
by H(x) = f . Find H ′ (x).
x
R2
x
*d. F (x) = f (t) dt, where f is continuous on [0, 1]
0
Find H ′ (x).
10. *Let f be a continuous real-valued function on [a, b], g ∈ R[a, b] with
g(x) ≥ 0 for all x ∈ [a, b]. Prove that there exists c ∈ [a, b] such that
Rb Rb
f (x)g(x)dx = f (c) g(x)dx.
a a
provided the limit exists. If the limit exists, then the improper integral is said
to be convergent. Otherwise, the improper integral is said to be divergent.
Integration 259
EXAMPLES 6.4.2 (a) Consider the function f (x) = 1/x on (0, 1]. This
function is clearly unbounded at 0. Since f is continuous on (0, 1], f ∈ R[c, 1]
for every c ∈ (0, 1). By Example 6.3.5
Z 1
1
dx = ln c.
c x
Therefore, Z 1
lim+ ln x dx = −1.
c→0 c
Hence the improper integral of ln x converges on (0, 1] with
Z 1
ln x dx = −1.
0
260 Introduction to Real Analysis
Thus the improper integral of f converges on (0, 1]. However, f 2 (x) = 1/x,
and the improper integral of 1/x on (0, 1] diverges. Also, if f ∈ R[a, b], then
|f | ∈ R[a, b]. This again is false for improper Riemann integrals. A function f
for which the improper integral of f converges, but the improper integral of
|f | diverges is given in Exercise 4.
Infinite Intervals
We now turn our attention to functions defined on infinite intervals.
provided the limit exists. If the limit exists, then the improper integral is said
to be convergent. Otherwise, the improper integral is said to be divergent.
provided the limit exists. If f is defined on (−∞, ∞), then the improper inte-
gral of f is defined as Z p Z ∞
f+ f,
−∞ p
Integration 261
EXAMPLES 6.4.4 (a) Let f (x) = 1/x2 , x ∈ [1, ∞). Since f is continuous
on [1, ∞), f ∈ R[1, c] for every c > 1. Therefore,
Z ∞ Z c
1 1
f = lim dx = lim − + 1 = 1.
1 c→∞ 1 x2 c→∞ c
Thus the improper integral converges to the value 1.
(b) In this example, we consider the function f (x) = (sin x)/x, x ∈ [π, ∞).
Since f is continuous, f ∈ R[π, c] for every c > π. This function has the
property that the improper integral of f on [π, ∞) converges, but the improper
integral of |f | diverges. The proof of the convergence of the improper integral
of f is left as an exercise (Exercise 7). Here we will show that
Z ∞ Z ∞
| sin x|
|f | = dx = ∞.
π π x
For n ∈ N, consider
Z (n+1)π n (k+1)π
| sin x| | sin x|
X Z
dx = dx.
π x kπ x
k=1
262 Introduction to Real Analysis
and as a consequence
(n+1)π
√ n
| sin x| 2X 1
Z
dx ≥ .
π x 4 k+1
k=1
∞
P
By Example 3.7.4 the series 1/k diverges. Therefore,
k=1
∞ (n+1)π
| sin x| | sin x|
Z Z
dx = lim dx = ∞.
π x n→∞ π x
Exercises 6.4
1. For each of the following functions f defined on (0, 1], determine whether
R1
the improper integral of f exists. If it exists, find f .
0
1 x ln x
*a. f (x) = p , 0 < p < 1 b. f (x) = √ c. f (x) =
x 1−x x
1
*d. f (x) = x ln x e. f (x) =
π (1 + x) ln(1 + x)
*f. f (x) = tan x
2
2. For each of the following determine whether the improper integral con-
verges
Z ∞or diverges. If it Zconverges, evaluate the integral.
∞ Z ∞
−x 1
*a. e dx b. dx *c. x−p dx, p > 1
Z0 ∞ Z 1∞ x Z 1∞
ln x 1 1
d. dx *e. f. ,p>1
Z ∞1 x 2
Z ∞ x ln x 2 x(ln x)p
x x
*g. 2 +1
dx h. 2 + 1)p
dx, p > 1
0 x 0 (x
3. For each of the following, determine the values of p and q for which the
improper integral converges.
Z 1 Z ∞ Z ∞
2
*a. xp | ln x|q dx b. xp (ln x)q dx c. xp [ln(1 + x)]q dx
0 2 0
4. Let f be defined on (0, 1] by
d 1 1 2 1
f (x) = x2 sin 2 = 2x sin 2 − cos 2 .
dx x x x x
Show that the improper Riemann integral of f converges on (0, 1], but
that the improper integral of |f | diverges on (0, 1].
5. *If f is absolutely integrable on [a, ∞), and integrable on [a, c] for every
c > a, prove that the improper Riemann integral of f on [a, ∞) exists.
6. Prove Theorem 6.4.5.
cos x
7. Let f (x) = , x ∈ [π, ∞).
x2
*a. Show that the improper integral of |f | converges on [π, ∞).
R∞ sin x
*b. Use integration by parts on [π, c], c > π, to show that dx
π x
exists.
R∞
8. Show that x−p sin xdx converges for all p, 0 < p < 2.
0
9. For x > 0, set
R∞
Γ(x) = e−t tx−1 dt.
0
On the other hand, if we have a wire of length ℓ along the x-axis with one
end at the origin, then the moment of inertia I is given my
Z ℓ
I= x2 ρ(x) dx,
0
where for each x ∈ [0, ℓ], ρ(x) denotes the cross-sectional density at x.
FIGURE 6.7
Example 6.5.1
Although these two problems are totally different, the first being discrete
and the second continuous, the Riemann-Stieltjes integral will allow us to
express both of these formulas as a single integral. In the definition of the
3 Since the results of this section are not specifically required in subsequent chapters, this
Riemann integral we used the length ∆xi of the ith interval to define the
upper and lower Riemann sums of a bounded function f . The only difference
between the Riemann and Riemann-Stieltjes integral is that we replace ∆xi
by
∆αi = α(xi ) − α(xi−1 ),
where α is a nondecreasing function on [a, b]. Taking α(x) = x will give the
usual Riemann integral. Although the modification in the definition is only
minor, the consequences however are far reaching. Not only will we obtain
a more extensive theory of integration, we also obtain an integral which has
broad applications in the mathematical sciences.
for all partitions P of [a, b]. Let P be any partition of [a, b]. Since Mi ≤ M for
all i and ∆αi ≥ 0,
n
X n
X n
X
Mi ∆αi ≤ M ∆αi = M ∆αi = M [α(b) − α(a)].
i=1 i=1 i=1
266 Introduction to Real Analysis
In analogy with the Riemann integral, the upper and lower Riemann-
Rb
Stieltjes integrals of f with respect to α over [a, b], denoted a f dα and
Rb
a
f dα respectively, are defined by
Z b
f dα = inf {U (P, f, α) : P is a partition of [a, b]} ,
a
Z b
f dα = sup {L(P, f, α) : P is a partition of [a, b]} .
a
for any partition Q. Taking the infimum over Q gives the result.
Integration 267
As was indicated previously, the special case α(x) = x gives the usual
Riemann integral on [a, b].
xk−1 < c ≤ xk .
Then
Therefore
for all t ∈ [a, b] with |t−c| < δ. If P is any partition of [a, b] with xj −xj−1 < δ
for all j, then
f (c) − ǫ ≤ mk ≤ Mk ≤ f (c) + ǫ.
Therefore f (c) − ǫ ≤ L(P, f, α) ≤ U(P, f, α) ≤ f (c) + ǫ. As a consequence
Z b Z b
f (c) − ǫ ≤ f dα ≤ f dα ≤ f (c) + ǫ.
a a
Since ǫ > 0 was arbitrary, the upper and lower integrals of f are equal, and
thus f is integrable with respect to α on [a, b] with
Z b
f dα = f (c).
a
We now use the previous theorem to prove the following analogue of The-
orem 6.1.8. Except for some minor differences, the two proofs are very similar.
Integration 269
Proof. (a) The proof of (a) is identical to the proof of Theorem 6.1.8(a) except
that given ǫ > 0, we choose η > 0 such that
The remainder of the proof now follows verbatim the proof of Theorem
6.1.8(a).
(b) For any positive integer n, choose a partition P = {x0 , x1 , ..., xn } of
[a, b] such that
1
∆αi = α(xi ) − α(xi−1 ) = [α(b) − α(a)].
n
Since α is continuous, such a choice is possible by the intermediate value
theorem. Assume f is monotone increasing on [a, b]. Then Mi = f (xi ) and
mi = f (xi−1 ). Therefore,
n
X
U (P, f, α) − L(P, f, α) = [f (xi ) − f (xi−1 )]∆αi
i=1
n
[α(b) − α(a)] X
= [f (xi ) − f (xi−1 )]
n i=1
[α(b) − α(a)]
= [f (b) − f (a)].
n
Given ǫ > 0, choose n ∈ N such that
[α(b) − α(a)]
[f (b) − f (a)] < ǫ.
n
For this n and corresponding partition P, U (P, f, α) − L(P, f, α) < ǫ, which
proves the result.
Remark. In part (b) above, the result may be false if α is not continuous.
For example, if a < c ≤ b, then the monotone function Ic is not integrable
with respect to Ic on [a, b] (Exercise 2(b)).
THEOREM 6.5.8
(a) If f, g ∈ R(α), then f + g and cf are in R(α) for every c ∈ R, and
Z b Z b Z b Z b Z b
(f + g) dα = f dα + g dα and cf dα = c f dα.
a a a a a
(d) If f, g ∈ R(α) with f (x) ≤ g(x) for all x ∈ [a, b], then
Z b Z b
f dα ≤ g dα.
a a
Proof. We provide the proofs of (b) and (e). The proofs of (a), (c), and (d),
along with other properties of the Riemann-Stieltjes integral are left to the
exercises.
(b) Since f ∈ R(αi ), given ǫ > 0, there exists a partition Pi , i = 1, 2, such
that
ǫ
U (Pi , f, αi ) − L(Pi , f, αi ) < . (11)
2
Let P = P1 ∪ P2 . Since P is a refinement of both P1 and P2 , inequality (11)
is still valid for the partition P. Thus since
(e) Suppose f ∈ R(α) and P = {x0 , x1 , ..., xn } is a partition of [a, b]. For
each i = 1, ..., n, let
If t, x ∈ [xi−1 , xi ], then
As for the Riemann integral, we also have the following mean value theorem
and integration by parts formula for the Riemann-Stieltjes integral.
272 Introduction to Real Analysis
Proof. Let m and M denote the minimum and maximum of f on [a, b] re-
spectively. Then by Theorem 6.5.8(d),
Z b
m [α(b) − α(a)] ≤ f dα ≤ M [α(b) − α(a)].
a
If α(b) − α(a) = 0, then any c ∈ [a, b] will work. If α(b) − α(a) 6= 0, then by
the intermediate value theorem there exists c ∈ [a, b] such that
b
1
Z
f (c) = f dα,
α(b) − α(a) a
Therefore,
Hence,
Z b Z b
α dβ ≤ U(P, α, β) < α(b)β(b) − α(a)β(a) − β dα + ǫ.
a a
Integration 273
A similar argument using the lower sum proves the reverse inequality.
We conclude this section with two results that represent the extremes
encountered in Riemann-Stieltjes integration. As in Example 6.5.4(a), let Ic
be the unit jump function at c ∈ R. Suppose {sn }Nn=1 is a finite subset of (a, b]
and {cn }N
n=1 are non-negative real numbers. Define the monotone increasing
function α on [a, b] by
XN
α(x) = cn I(x − sn ).
n=1
Suppose {sn }∞ ∞
Pof (a, b] and {cn }n=1 is a sequence of
n=1 is a countable subset
nonnegative real numbers for which cn converges. As in Theorem 4.4.10
define α on [a, b] by
X∞
α(x) = cn I(x − sn ). (13)
n=1
Since 0 ≤ I(x−sn ) ≤ 1 for all n, the series in (13) converges for every x ∈ [a, b],
and α is a monotone increasing function on [a, b]. For such a function α we
have the following theorem.
Proof. Since f is continuous on [a, b], f ∈ R(α) (Theorem 6.5.6(a)). Let ǫ > 0
be given. Choose a positive integer N such that
∞
X
cn < ǫ.
n=N +1
Let M = max{|f (x)| : x ∈ [a, b]}. Then by Theorem 6.5.8(b) and (e),
Z b N
X Z b
f dα − cn f (sn ) = f dβ2 ≤ M [β2 (b) − β2 (a)]
a n=1 a
∞
X
≤M cn < M ǫ,
n=N +1
Proof. Since both f and α′ are Riemann integrable on [a, b], by Theorem
6.2.1(c), f α′ ∈ R[a, b]. Let ǫ > 0 be given. Since α′ ∈ R[a, b], by Theorem
6.1.7 there exists a partition P of [a, b] such that
By the mean value theorem, for each i = 1, ..., n, there exists ti ∈ [xi−1 , xi ]
such that
∆αi = α(xi ) − α(xi−1 ) = α′ (ti )∆xi
Therefore,
n
X n
X
f (si )∆αi = f (si )α′ (ti )∆xi . (16)
i=1 i=1
Let M = sup{|f (x)| : x ∈ [a, b]}, and for i = 1, ..., n, let mi and Mi denote the
infimum and supremum respectively of α′ over the interval [xi−1 , xi ]. Then
Since f α′ ∈ R[a, b], there exists a partition Q of [a, b], which is a refinement
of P, such that
Z b
U (Q, f α′ ) < f α′ + ǫ.
a
Thus
Z b Z b
f dα < f α′ + (M + 2)ǫ.
a a
Since this holds for any ǫ > 0,
Z b Z b
f dα ≤ f α′ .
a a
A similar argument using lower sums proves the reverse inequality. Thus f ∈
R(α) and
Z b Z b
f dα = f α′ .
a a
We now give several examples to illustrate the previous two theorems.
EXAMPLES 6.5.13 (a) For our first example we illustrate the finite ver-
sion of Theorem 6.5.11; namely, identity (12). Consider
Z 2
ex d[x].
0
(b) To illustrate Theorem 6.5.12, if α(x) = x2 on [0, 1], then for any
Riemann integrable function f on [0, 1],
Z 1 Z 1
f (x) dx2 = 2 f (x)x dx.
0 0
1 1 1 1
Z
= − x cos πx + cos πxdx
π 0 π 0
1 1 1 1
= − cos π + 2 sin πx = .
π π 0 π
where m(x) denotes the mass of the wire or system from 0 to x. Clearly the
function m(x) is nondecreasing, and thus since x2 is continuous, the above
integral exists.
Integration 277
In the first case, since 0 < r1 < r2 < · · · < rn , and the masses mi are
located at ri (see Figure 6.7), m(x) is given by
n
X
m(x) = mi I(x − ri ).
i=1
Riemann-Stieltjes Sums
We conclude this section with a few remarks concerning Riemann-Stieltjes
sums. As in Definition 6.2.4, let f be a bounded real-valued function on [a, b],
α a monotone increasing function on [a, b], and P = {x0 , ..., xn } a partition of
[a, b]. For each i = 1, 2, ..., n, choose ti ∈ [xi−1 , xi ]. Then
n
X
S(P, f, α) = f (ti )∆αi
i=1
Exercises 6.5
Z 1
1. *Evaluate f (x) dα(x) where f is bounded on [−1, 1] and continuous
−1
at 0, and α is given by
−1,
x < 0,
α(x) = 0, x = 0,
1, x > 0.
*a. Prove that |F (x)−F (y)| ≤ M |α(x)−α(y)| for some positive constant
M and all x, y ∈ [a, b].
b. Prove that if α is continuous at xo ∈ [a, b], then F is also continuous
at xo .
4. a. Prove Theorem 6.5.8(a).
b. Prove Theorem 6.5.8(c).
c. Prove Theorem 6.5.8(d).
5. Use the theorems from the text to compute each of the following integrals:
Z π/2 Z 3 Z 3
*a. x d(sin x). b. [x] dx2 . *c. x2 d[x].
0
Z 3 0
Z 1 0
Z 4
d. ([x] + x) d(x2 + ex ) e. sin πxd[4x]. *f. (x − [x]) dx3
1 0 1
Integration 279
6. Verify the integration by parts formula with (b) and (c) of the previous
exercise.
Z 1
7. Find f dα, where f is continuous on [0, 1] and
0
∞
X 1 1
α(x) = n
I(x − n
).
n=1
2
Leave your answer in the form of an infinite series.
8. Let α be as in Exercise 7. Evaluate each of the following integrals. Leave
your answers in the form of an infinite series.
Z 1 Z 1
*a. x dα(x) b. α(x) dx
0 0
9. Suppose α and β are monotone increasing on [a, b], and P is a partition
of [a, b]. Prove that
U (P, α, β) = α(b)β(b) − α(a)β(a) − L(P, β, α).
10. Prove that if f is a continuous real-valued function on [a, b] and α is
monotone increasing on [a, b], then lim S(P, f, α) exists and
kPk→0
Rb
lim S(P, f, α) = f dα.
kPk→0 a
12. *a. Let α be a monotone increasing function on [a, b]. If f ∈ R(α), prove
that f 2 ∈ R(α).
b. If f, g ∈ R(α), prove that f g ∈ R(α).
13. If f ∈ R(α) on [a, b] with Range f ⊂ [c, d], and ϕ is continuous on [c, d],
prove that ϕ ◦ f ∈ R(α) on [a, b].
14. Suppose f is a nonnegative continuous function on [a, b], and α is non-
Rx
decreasing on [a, b] Define the function β on [a, b] by β(x) = f dα. If g
1
is continuous on [a, b], prove that
Z b Z b
g dβ = f g dα.
1 a
though the fundamental theorem of calculus provides an easy method for eval-
uating definite integrals, it is useful only if we can find an antiderivative of
the function being integrated. To illustrate this, in Example 6.3.9 we showed
that Z 2
t
2
dt = 12 ln 5.
0 1+t
This however is not particular useful if we do not know the value of ln 5. By
Example 6.3.5, Z 5
1
ln 5 = dt.
1 t
To obtain an approximation to ln 5, we can choose from several available
methods for obtaining numerical approximations to the definite integral.
as an approximation to the integral. This is nothing but the Riemann sum for
the partition P = {x0 , x1 , ..., xn } of [a, b] with tk = xk−1 for all k.
In using Riemann sums to approximate the integral of f it is convenient
to take equally spaced partitions. Let n ∈ N, and set h = (b − a)/n. Define
x0 = a, x1 = a + h, x2 = a + 2h, · · · , xn = a + nh = b. (17)
Thus if Pn = {x0 , x1 , ..., xn } with xi as defined, we always have ∆xi = h for
all i = 1, ..., n, and
n
X n
X
S(Pn , f ) = f (ti )∆xi = h f (ti ),
i=1 i=1
where for each i = 1, ..., n, ti ∈ [xi−1 , xi ]. If we take ti = xi−1 for all i, then
we obtain the above formula of Euler. Similarly, we could take ti to be the
right endpoint xi of the interval [xi−1 , xi ]. Another choice of ti would be the
midpoint; i.e., ti = (xi−1 + xi )/2. For monotone functions, it is intuitively
obvious that the midpoint gives a better approximation than either the right
or left endpoint to the integral of f over the interval [xi−1 , xi ] (see Figure 6.8).
With xi as defined by (17) and ti = (xi−1 + xi )/2 the above formula becomes
n
X
Mn (f ) = h f (a + (i − 12 )h). (18)
i=1
Integration 281
FIGURE 6.8
Midpoint approximation
for any choice of tk ∈ [xk−1 , xk ]. Inequality (19) follows from the fact that
both the Riemann sum of f and the integral of f lie between the lower and
upper sum of f . If f is monotone increasing on [a, b], then by the proof of
Theorem 6.1.8(b),
U (Pn , f ) − L(Pn , f ) = h[f (b) − f (a)].
Thus by inequality (19),
Z b n
X
f (x)dx − h f (ti ) ≤ h|f (b) − f (a)| (20)
a i=1
for any choice of ti ∈ [xi−1 , xi ]. Thus with n = 8, the error is less than
2/5 = 0.4; n = 160 only guarantees an error of less than 1/50 = 0.02. We
would be required to take very large values of n to be guaranteed a sufficiently
small error.
With n = 8, h = 1/2 and xi = 1 + i/2, i = 0, 1, ..., 8, and ti = xi−1 =
(1 + i)/2 (the left end point),
8
X 1
S(P8 , f ) =
i=1
1+i
1 1 1 1 1 1 1 1
= + + + + + + +
2 3 4 5 6 7 8 9
= 1.8290 (to four decimal places).
To four decimal places, ln 5 = 1.6094. Thus the error is 0.2196 which is less
than the predicted error of 0.4.
If we use the midpoint approximation (18) (again with n = 8), then ti =
1 + (i − 21 ) 12 which upon simplification equals (3 + 2i)/4. Thus
8
1X 4
M8 (f ) =
2 i=1 3 + 2i
1 1 1 1 1 1 1 1
=2 + + + + + + +
5 7 9 11 13 15 17 19
= 1.5998 (to four decimal places).
Using the midpoint approximation the error is less than 0.01. This is consid-
erably better than predicted by inequality (20). As we will shortly see, this
improved accuracy is not a coincidence.
Inequality (20) provides an error estimate for monotone functions. For arbi-
trary real-valued functions, under the additional hypothesis that the derivative
is bounded, we have the following theorem.
Since f is continuous on [a, b], for each i there exist si , s′i ∈ [xi−1 , xi ] such
that
Mi − mi = f (si ) − f (s′i ).
By the mean value theorem, f (si ) − f (s′i ) = f ′ (ti )(si − s′i ) for some ti between
si and s′i . Therefore, since |f ′ (ti )||si − s′i | ≤ M kPk,
n
X
U (P, f ) − L(P, f ) ≤ M kPk ∆xi = kPk M (b − a).
i=1
Remark. Since the error between the true value and the approximate value in-
volves the term h2 , the midpoint approximation is a second order method.
Proof. To prove the result, we first prove that for each i = 1, ..., n,
xi
M 3
Z
f (x)dx − h f (a + (i − 12 )h) ≤ h . (21)
xi−1 24
284 Introduction to Real Analysis
|gi′′ (t)| ≤ 2M t
for all t ∈ [0, h/2]. Since gi′ (0) = 0, by the fundamental theorem of calculus,
Z t Z t Z t
|gi′ (t)| = gi′′ (x)dx ≤ |gi′′ (x)|dx ≤ 2M x dx = M t2 .
0 0 0
FIGURE 6.9
Trapezoidal approximation
The above error estimate can also be expressed as follows: If f satisfies the
hypothesis of the theorem, then for n ∈ N,
b
M (b − a)3 1
Z
f (x)dx − Tn (f ) ≤ . (23)
a 12 n2
By computing gi′ (t) and gi′′ (t) one obtains from the hypothesis on f that
|gi′′ (t)| ≤ 21 tM . Since gi (0) = gi′ (0) = 0, applying the fundamental theorem
1
of calculus twice we obtain as in Theorem 6.6.3 that |gi (h)| ≤ 12 M h3 . The
remainder of the proof is identical to Theorem 6.6.3.
Integration 287
Simpson’s Rule
The trapezoidal approximation Tn (f ) amounts to approximating the func-
tion f with a piecewise linear function gn that passes through the points
{(xi , f (xi ))}, i = 0, . . . , n. Our intuition should convince us that one way to
obtain a better approximation to the integral of f over [a, b] is to use smoother
curves. This is exactly what is done in Simpson’s rule which uses parabolas
to approximate the integral. To use quadratic approximations we will need to
use three successive points of the partition of [a, b]. This is due to the fact
that three points are required to uniquely determine a parabola.
Prior to deriving Simpson’s rule, we first establish the following formula:
If p(x) = Ax2 + Bx + C is the quadratic function passing through the points
(0, y0 ), (h, y1 ), (2h, y2 ), then
2h
h
Z
p(x)dx = [y0 + 4y1 + y2 ]. (24)
0 3
One way to derive this formula would be to first determine the coefficients
A, B, C so that p(0) = y0 , p(h) = y1 , p(2h) = y2 , and then integrate p(x).
This however is not necessary. By integrating first,
2h
A B
Z
p(x) dx = (2h)3 + (2h)2 + C(2h)
0 3 2
h
= [8Ah2 + 6Bh + 6C]
3
h h
= [p(0) + 4p(h) + p(2h)] = [y0 + 4y1 + y2 ].
3 3
Let f ∈ R[a, b] and let n ∈ N be even. Set h = (b − a)/n. On each of the
intervals [a + 2(i − 1)h, a + 2ih], i = 1, ..., n/2, we approximate the integral of
f by the integral of the quadratic function that agrees with f at the points
h
[f (a + 2(i − 1)h) + 4f (a + (2i − 1)h) + f (a + 2ih)]
3
as an approximation to the integral of f over the interval [a+2(i−1)h, a+2ih].
Summing these terms from i = 1 to n/2 gives
h
Sn (f ) = [f (a)+4f (a + h) + 2f (a + 2h)+
3
+ 4f (a + 3h) + · · · + 4f (a + (n − 1)h) + f (b)]
called the nth Simpson approximation to the integral of f over [a, b]. If we
set yi = f (a + ih), then Sn (f ) is given by
h
Sn (f ) = [y0 + 4y1 + 2y2 + 4y3 + · · · + 2yn−2 + 4yn−1 + yn ]. (25)
3
The following theorem, again under suitable restrictions on f , provides an
error estimate for Simpsons’s rule.
To prove inequality (27), we are required to show that |gi (h)| ≤ M h5 /90.
Upon computing the successive derivatives of gi we obtain
t ′ 2 4
gi′ (t) =
[f (ci − t) + f ′ (ci + t)] + [f (ci − t) + f (ci + t)] − f (ci ),
3 3 3
′′ t ′′ ′′ 1 ′ ′
gi (t) = [−f (ci − t) − f (ci + t)] + [−f (ci − t) + f (ci + t)], and
3 3
′′′ t ′′′ ′′′
gi (t) = [f (ci − t) − f (ci + t)].
3
Integration 289
By the mean value theorem f ′′′ (ci − t) − f ′′′ (ci + t) = f (4) (ζ)(2t) for some
ζ ∈ (ci − t, ci + t). Therefore
2
|gi′′′ (t)| ≤ M t2 .
3
As in the previous two theorems, since gi (0) = gi′ (0) = gi′′ (0) = 0, upon three
integrations we have |gi (h)| ≤ M h5 /90. This proves inequality (27). Finally,
Z b n/2 n/2
X X
f (x)dx − Sn (f ) = gi (x) ≤ |gi (x)|
a i=1 i=1
M h5 n M (b − a) 4
≤ = h .
90 2 180
EXAMPLE 6.6.6 In this example, we will use the trapezoidal rule and
Simpson’s rule with n = 8 to obtain approximations to ln 5. For f (x) = x−1 ,
f ′′ (x) = 2x−3 and f (4) (x) = 24x−5 . Therefore
sup{|f ′′ (x)| : x ∈ [1, 5]} = 2 and sup{|f (4) (x)| : x ∈ [1, 5]} = 24.
y0 = 1, y1 = 23 , y2 = 12 , y3 = 52 , y4 = 31 , y5 = 27 , y6 = 41 , y7 = 29 , y8 = 51 .
Therefore,
7
" #
1 X
T8 (f ) = y0 + 2 yi + y8
4 i=1
1 4 4 2 4 1 4 1
= 1+ +1+ + + + + +
4 3 5 3 7 2 9 5
= 1.6290 (to four decimal places).
290 Introduction to Real Analysis
Since ln 5 = 1.6094 to four decimal places, the error is less than 0.02, well
within the tolerance predicted by the theory. With Simpson’s rule,
1
S8 (f ) = [y0 + 4y1 + 2y2 + 4y3 + 2y4 + 4y5 + 2y6 + 4y7 + y9 ]
6
1 8 8 2 8 1 8 1
= 1+ +1+ + + + + +
6 3 5 3 7 2 9 5
= 1.6108 (to four decimal places).
2 · 43 1
|En (f )| ≤ .
12 n2
Thus to obtain accuracy to within 0.001 we need
2 · 43 3
n2 > 10 = 10666.66..
12
which is accomplished with n ≥ 104. On the other hand, using Simpson’s rule,
24 · 45 1
|En (f )| ≤ .
180 n4
Thus to obtain accuracy to within 0.001, we need n ∈ N even with
24 · 45 3
n4 > 10 .
180
This will be satisfied with n ≥ 20.
Exercises 6.6
1. a. Use the midpoint rule, trapezoidal rule, and Simpson’s rule to ap-
R2
proximate ln 2 = (1/x) dx with n = 4. For each method, determine
1
the estimated error and compare your answer to ln 2 = 0.69315 (to five
decimal places).
b. Repeat (a) with n = 8.
c. For each of the three methods (midpoint rule, trapezoidal rule, Simp-
son’s rule) determine how large n must be chosen to assure accuracy in
the approximation of ln 2 to within 0.0001.
Integration 291
LEMMA 6.7.1 A finite or countable union of sets of measure zero has mea-
sure zero.
Proof. We will prove the lemma for the case of a countable sets of measure
zero. The result for a finite union is an immediate consequence.
Suppose
S {En }n∈N is a countable collection of sets of measure zero. Set
E = n En and let ǫ > 0 be given. Since each set En is a set of measure
zero, for each n ∈ N there exists S a finite orPcountable collection {In,k }k of
open intervals such that En ⊂ k In,k and k ℓ(In,k ) < ǫ/2k . Since we can
always take In,k to be the empty set, there is no loss of generality in assuming
that the collection {In,k }k is countable. S Then {In,k }n,k is again a countable
collection of open intervals with E ⊂ n,k In,k .
Since N × N is countable, there exists a one-to-one function f from N onto
N×N. For each m ∈ N, set JS m = If (m . Then {Jm }m∈N is a countable collection
of open intervals with E ⊂ m Jm . Since f is one-to-one, for each N ∈ N, the
set FN = f ({1, . . . , N }) is a finite subset of N × N. Hence there exists positive
integers N1 and K1 such that for all (n, k) ∈ FN we have 1 ≤ n ≤ N1 and
1 ≤ k ≤ K1 . Hence
N
X X X
ℓ(Jm ) = ℓ(In,k ) ≤ ℓ(In,k ).
m=1 (n,k)∈FN (n,k)∈N1 ×K1
But
K1
N1 X N1 X
∞ N1
X X X X ǫ
ℓ(In,k ) = ℓ(In,k ) ≤ ℓ(In,k ) < < ǫ.
n=1 k=1 n=1 k=1 n=1
2n
(n,k)∈N1 ×K1
∞
P
Thus ℓ(Jm ) < ǫ. Therefore E has measure zero.
m=1
Proof. We first prove that for each c > 0, the set Ec = {x ∈ [a, b] : f (x) ≥ c}
Rb
has measure zero. Let ǫ > 0 be given. Since f = 0, there exists a partition
a
P = {x0 , x1 , . . . , xn } of [a, b] with U (P, f ) < c ǫ, where
n
X
U (P, f ) = Mi ∆xi ,
i=1
by the above each En has measure zero. Thus by Lemma 6.7.1, the set E has
measure zero.
The graphs of the lower function Lf and upper function Uf are depicted in
Figure 6.10. Since mi ≤ f (x) ≤ Mi for all x ∈ [xi−1 , xi ),
FIGURE 6.10
Graphs of the lower and upper function of f
Proof. Assume first that f is Riemann integrable on [a, b]. Then for each
n ∈ N there exists a partition Pn of [a, b] with kPn k < 1/n, such that
b b
1 1
Z Z
0≤ f − L(Pn , f ) < and 0 ≤ U(Pn , f ) − f< . (28)
a n a n
Then
Ln (x) ≤ L(x) ≤ f (x) ≤ U (x) ≤ Un (x)
for all x ∈ [a, b]. Hence
Z b Z b Z b Z b Z b Z b Z b
Ln ≤ L≤ L≤ f≤ U≤ U≤ Un .
a a a a a a a
Integration 295
Since U (x) − L(x) ≥ 0 for all x ∈ [a, b], by Lemma 6.7.2, {x ∈ [a, b] : U (x) 6=
L(x)} has measure zero. Furthermore, since each Pn has measure zero, by
Lemma 6.7.1, the set
!
[ [
E = {x ∈ [a, b] : U (x) 6= L(x)} Pn
n
for all x ∈ [a, b] with |x − xo | < δ. Finally, since Lk (x) ≤ f (x) ≤ Uk (x) for all
x ∈ [a, b),
P = {a = t0 , t1 , . . . , tN = b}
Exercises 6.7
1. Let f be a real-valued function on [a, b] and P a partition of [a, b]. Prove
that the lower function Lf and upper function Uf for the partition P are
Riemann integrable on [a, b] with
Z b Z b
Lf = L(P, f ) and Uf = U (P, f ).
a a
2. R*Let P be that Cantor set in [0, 1]. Show that χP ∈ R[0, 1] and find
1
0
χP (x)dx.
Rx
3. Let f ∈ R[a, b], and for x ∈ [a, b], set F (x) = f (t)dt. Prove that there
a
exists a subset E of [a, b] of measure zero such that F ′ (x) = f (x) for all
x ∈ [a, b] \ E,
4. *Suppose f ∈ R[a, b] and g is a bounded real-valued function on [a, b]. If
{x ∈ [a, b] : g(x) 6= f (x)} has measure zero, is g Riemann integrable on
[a, b]?
Notes
The fundamental theorem of calculus is without question the key theorem of cal-
culus; it relates the Cauchy-Riemann theory of integration with differentiation. For
Newton, Leibniz, and their successors, integration was the inverse operation of differ-
entiation. When Cauchy however defined integration independent of differentiation,
the fundamental theorem of calculus became a necessity. It was crucial in proving
that Cauchy’s integral was the inverse of differentiation, thereby providing both a
convenient tool for the evaluation of definite integrals and proving that every con-
tinuous real-valued function defined on an interval I has an antiderivative on I.
Integration 297
Miscellaneous Exercises
A real-valued function f on [a, b] is a step function if there exist a finite number
of disjoint intervals {Ij }n j=1 with ∪Ij = [a, b] such that f is constant on each of the
intervals Ij , j = 1, ..., n.
298 Introduction to Real Analysis
Supplemental Reading
Although the study of series has a long history in mathematics,1 the modern
definition of convergence dates back only to the beginning of the nineteenth
century. In 1821, Cauchy, in his text Cours d’Analyse, used his definition of
limit to provide the first formal definition of convergence of a series in terms
of convergence of the sequence of partial sums. The Cauchy criterion (Theo-
rem 3.7.3) was the first significant result to provide necessary and sufficient
conditions for convergence of a series. Cauchy not only stated and proved the
result, he also applied his result to prove convergence and divergence of given
series. Many of the early convergence tests, such as the root and ratio test, are
due to him. Cauchy, with his formal development of series, placed the subject
matter on a rigorous mathematical foundation.
In this chapter, we will continue our study of series of real numbers begun
earlier in the text. Our primary emphasis in Section 7.1 will be on deriving
several tests that are useful in determining the convergence or divergence of a
given series. In Section 7.3 we will study the concepts of absolute convergence,
conditional convergence, and rearrangements of series. One of the key results
of this section is that every rearrangement of an absolutely convergent series
not only converges, but converges to the same sum. As we will also see, this
fails dramatically if the series converges but fails to converge absolutely.
In Section 7.4 we give a brief introduction to the topic of square summable
sequences. These play an important role in the study of Fourier series. One
of the main result of this section will be the Cauchy-Schwarz inequality for
series. This section also contains a brief introduction to normed linear spaces.
301
302 Introduction to Real Analysis
P
ak diverges. On the other hand however, if lim ak = 0, then nothing P can
be ascertained concerning convergence or divergence of the series ak . In
this section, we will state and prove several useful results that can be used
to establish convergence or divergence of a given series. Additional tests for
convergence will also be given in the exercises and subsequent sections. With
the exception of Theorem 7.1.1, all of our results in this section will be stated
for series of nonnegative terms.
∞
P
As in Definition 3.7.1, given an infinite series ak of real numbers,
k=1
{sn }∞
n=1 will denote the associated sequence of partial sums defined by
n
X
sn = ak .
k=1
P
The series ak converges if and only if the sequence {sn } of nth partial
sums converges. Furthermore, if lim sn = s, (s ∈ R), then s is called the
n→∞
sum of the series, and we write
∞
X
ak = s.
k=1
P
If the sequence {sn } diverges, then the series aP
k is said to diverge. Fur-
thermore, if lim sn = ∞ (or −∞), then we write ak = ∞ (−∞) to denote
n→∞
that the series diverges to ∞ (or −∞). P
If ak ≥ 0 for all k, then by Theorem 3.7.6 ak converges if and only if
lim sn < ∞. Thus for series of nonnegative terms we adopt the notation
n→∞
∞
X
ak < ∞
k=1
∞
P
Also, from the Cauchy criterion (Theorem 3.7.3) it is clear that ak con-
k=1
∞
P
verges if and only if ak converges for some, and hence every, ko ∈ N.
k=ko
Series of Real Numbers 303
∞
P (b) Given any sequence {sn }n=1 of real numbers we can always find a series
ak whose nth partial sum is sn . If we set a1 = s1 and ak = sk −sk−1 , k > 1,
then
Xn
ak = s1 + (s2 − s1 ) + · · · + (sn − sn−1 ) = sn .
k=1
∞
P ∞
P
THEOREM 7.1.1 If ak = α and bk = β, then
k=1 k=1
∞
P
(a) c ak = c α, for any c ∈ R, and
k=1
∞
P
(b) (ak + bk ) = α + β.
k=1
Proof. The proof of (a) is similar to (b) and thus is omitted. To prove (b),
for each n ∈ N, let
n
X n
X
sn = ak and tn = bk .
k=1 k=1
Comparison Test
One of the most important convergence tests is the comparison test. Although
very elementary, it provides one of the most useful tools in determining con-
vergence or divergence of a series. It is useful both in applications and theory.
Several of the proofs of subsequent theorems rely on this test. In applications,
by comparing the terms of a given series with the terms of a series for which
convergence or divergence is known, we are then able to determine whether
the given series converges or diverges.
P P
THEOREM 7.1.2 (Comparison Test) Suppose ak and bk are two
given series of nonnegative real numbers satisfying
0 ≤ ak ≤ M bk
304 Introduction to Real Analysis
for some positive constant M and all integers k ≥ ko , for some fixed ko ∈ N.
∞
P ∞
P
(a) If bk < ∞, then ak < ∞.
k=1 k=1
∞
P ∞
P
(b) If ak = ∞, then bk = ∞.
k=1 k=1
Proof. Suppose the terms {ak } and {bk } satisfy ak ≤ M bk for all k ≥ ko and
some positive constant M . Then for n > m ≥ ko
n
X n
X
0≤ ak ≤ M bk .
k=m+1 k=m+1
P
Suppose bk converges. Then given ǫ > 0, by the Cauchy criterion (3.7.3)
there exists an integer no ≥ ko such that
n
X ǫ
bk <
M
k=m+1
Pn
for all n > m ≥ no . Thus 0 ≤ ak < ǫ for all n > m ≥ no , and hence
P k=m+1 P
by thePCauchy criterion ak converges. On the other hand, if ak diverges,
then bk must also diverge.
As a corollary of the previous theorem we also have the following version
of the comparison test.
P P
COROLLARY 7.1.3 (Limit Comparison Test) Suppose ak and bk
are two given series of positive real numbers.
an P
(a) If lim = L with 0 < L < ∞, then ak converges if and only if
P n→∞ b n
bk converges.
an P P
(b) If lim = 0 and bk converges, then ak converges.
n→∞ bn
Proof. The proof, the details of which are left to the exercises (Exercise 6),
follows immediately from the definition of the limit and the comparison test.
P
Remark. If lim an /bn = 0 and ak converges, then nothing can be con-
n→∞ P
cluded about the convergence of the seriesP bk . In Example 7.1.4(d),
P we
provide an example of a divergent series bk and a convergent series ak
for which lim an /bn = 0. On the other hand, in Exercise 23, given a con-
n→∞ P
vergentP series ak with ak > 0, you will be asked to construct a convergent
series bk with bk > 0 such that lim an /bn = 0.
n→∞
Series of Real Numbers 305
We
P willk compare the given series with the convergent geometric series
(1/2) . Thus we wish to show that there exists ko ∈ N such that k 3k ≤
k
1 2 for all k ≥ ko . Since 2/3 < 1, by Theorem 3.2.6(d)
k
2
lim k = 0.
k→∞ 3
Thus by taking ǫ = 1, there exists an integer ko such that k(2/3)k ≤ 1 for all
k ≥ ko . As a consequence,
k 1
≤ k for all k ≥ ko .
3k 2
Since (1/2)k < ∞, by the comparison testP
P
the given series also converges. A
k n ak converges for any n ∈ Z
similar argument can be used to prove that
and a ∈ R with aP > 1 (Exercise 2(l)). This is accomplished by comparing the
given series with b−k , where 1 < b < a.
(b) As our second example consider the series
∞ r
X k+1
.
2k 3 + 1
k=1
In order to use the comparison test we have to determine what series we want
to compare with. Since
s s
1 1 + k1 1 + k1 √
r
k+1
3
= 1 and lim 1 = 21 2,
2k + 1 k 2 + k3 k→∞ 2 + k3
P
we will compare the given series with the series
√ 1/k. This series is known
1
to diverge (Example 3.7.4). If we take ǫ = 4 2, then we can conclude that
there exists ko ∈ N such that
s
1 + k1 1
√ 1
√
1 ≥ 2 2−ǫ= 4 2
2 + k3
for all k ≥ ko . Thus there exists a positive constant M and ko ∈ N such that
r
k+1 1
3
≥M
2k + 1 k
for all k ≥ ko , and as a consequence, the given series diverges.
306 Introduction to Real Analysis
r
∞
k+1
P
(c) The divergence of the series can also be obtained by the
k=1 2k 3 + 1
limit comparison
P test. Comparing the terms of the given series to the terms
of the series 1/k we have
√
q
k+1
s
2k3 +1 1 + k1 2
lim 1 = lim 1 = 2 .
k→∞
k
k→∞ 2 + k3
P
Thusrsince the series 1/k diverges, by the limit comparison test the series
P∞ k+1
also diverges.
k=1 2k 3 + 1
(d) Let ak = 2−k and bk = 1/k. Then
P P
ak converges, bk diverges, and
an n
lim = lim n = 0.
n→∞ bn n→∞ 2
an P
Thus if lim = 0, convergence of the series ak does not imply conver-
n→∞ bn P
gence of the series bk .
Integral Test
Our second major convergence test is the integral test. Recall from Section
6.4, if f is a real-valued function defined on [a, ∞) with f ∈ R[a, c] for every
c > a, then the improper Riemann integral of f is defined by
Z ∞ Z c
f (x) dx = lim f (x) dx,
a c→∞ a
FIGURE 7.1
Integral test
P = {1, 2, ..., n} of [1, n]. Since f is decreasing, for each k = 2, 3, ..., n, (see
Figure 7.1)
sup{f (t) : t ∈ [k − 1, k]} = f (k − 1) = ak−1 , and
inf{f (t) : t ∈ [k − 1, k]} = f (k) = ak .
Therefore
n
X Z n n−1
X
ak = L(P, f ) ≤ f (x) dx ≤ U(P, f ) = ak ,
k=2 1 k=1
EXAMPLES 7.1.6 (a) As our first application of the integral test we con-
sider the p-series
∞
X 1
.
kp
k=1
When p = 1 this series is referred to as the harmonic series. If p ≤ 0,
then {k −p } does not converge to zero, and thus by Corollary 3.7.5 the series
diverges. Suppose p > 0, p 6= 1. Let f (x) = x−p , which is decreasing on [1, ∞).
Then Z c
−p 1 1
x dx = 1 − p−1 .
1 p−1 c
Therefore,
1 ,
Z ∞ Z c
−p −p p > 1,
x dx = lim x dx = p − 1
c→∞
1 1 ∞, p < 1.
308 Introduction to Real Analysis
Thus by the integral test the series diverges for p < 1 and converges for p > 1.
When p = 1, then by Example 6.3.5,
Z c
1
dx = ln c.
1 x
(b) As our second application of the integral test we consider the series
∞
X 1
.
k ln k
k=2
We first consider the case p = 1. Since f (x) = (ln x)/x is decreasing on [e, ∞)
and Z c
ln x
lim dx = lim (ln c)2 − 1 = ∞,
c→∞ e x c→∞
P∞
by the integral test the series k=2 (ln k)/k diverges.
Suppose now that p > 1. Write p = q + r, where q > 1 and r > 0.
By l’Hospital’s rule, lim (ln x)/xr = 0. Thus there exists ko ∈ N such that
x→∞
(ln k)/k r ≤ 1 for all k ≥ ko . As a consequence
ln k 1 ln k 1
= q r ≤ q
kp k k k
1/k q converges. Hence by the compar-
P
for all k ≥ ko . SinceP
q > 1 the series
ison test the series (ln k)/k p also converges when p > 1.
Series of Real Numbers 309
Before proving the theorems we give several examples that illustrate these
two convergence tests.
P∞ 1
EXAMPLES 7.1.9 (a) Consider the p-series p
for p > 0. For this
k=1 k
series,
r = R = α = 1
for any p > 0; thus both tests are inconclusive. The series however diverges if
0 < p ≤ 1 and converges for p > 1.
(b) As our second example we consider
∞
X pk
, 0 < p < ∞.
k!
k=1
ak+1 pk+1 k! 1
lim = lim = p lim = 0.
k→∞ ak k→∞ pk (k + 1)! k→∞ k + 1
By the ratio test the series converges for all p, 0 < p < ∞. In this example,
the presence of k! makes the root test difficult to use.
P
(c) Consider an where
1
k,
if n = 2k,
2
an =
1,
if n = 2k + 1.
3k
Here
∞
X 1 1 1 1
an = 1 + + + 2 + 2 + ··· .
n=1
2 3 2 3
Series of Real Numbers 311
By computation,
k
2
, n = 2k,
an+1
3
= k
an
1 3
, n = 2k + 1.
2 2
For the subsequence of {an+1 /an } with even n,
k
a2k+1 2
lim = lim = 0.
k→∞ a2k k→∞ 3
an ≤ M cn for all n ≥ no ,
no
where
P n M = ano /c . ThusP since 0 < c < 1, by Example 3.7.2(a) the series
c converges. Therefore an also converges by the comparison test.
Suppose
an+1
lim = r > 1.
n→∞ an
312 Introduction to Real Analysis
Again choose c so that r > c > 1. As above, there exists a positive integer no
n
P n that an ≥ M c for some constant M and
such P all n ≥ no . But since c > 1,
c = ∞, and thus by the comparison test, an = ∞.
Proof of Root Test. Let
√
α = lim n an .
n→∞
Suppose α < 1. Choose c so that α < c < 1. Again by Theorem 3.5.3 there
exists a positive integer no such that
√
n
an < c for all n ≥ no .
n
P
But then an ≤ c for all n ≥ no , and an < ∞ by the comparison test.
√
If α > 1, then n an > 1 for infinitely many n. Thus an > 1 for infinitely
many n, and as a consequenceP {an } does not converge to zero. Hence by
Corollary 3.7.5 the series an diverges.
Example 7.1.9(c) provides an example of a series for which the ratio test
is inconclusive, but the root test worked. Thus it appears that the root test is
a stronger test, a fact which is confirmed by the following theorem.
If R = ∞, then there is nothing to prove. Thus assume that R < ∞, and let
β > R be arbitrary. Then there exists a positive integer no such that
an+1
≤β for all n ≥ no .
an
As in the proof of the ratio test, this gives an ≤ M β n for all n ≥ no , where
M = ano /β no . Hence
√ √
n
n
an ≤ β M for all n ≥ no .
√
n
Since lim M = 1, we have
n→∞
√
lim n
an ≤ β.
n→∞
√
Since β > R was arbitrary, lim n an ≤ R, which proves the result. The
n→∞
inequality for the limit inferior is proved similarly.
Series of Real Numbers 313
Exercises 7.1
1. If a and b are positive real numbers, prove that
∞
X 1
(ak + b)p
k=1
trigonometric series. The key to the Dirichlet test is the following summation
by parts formula of Neils Abel (1802–1829). This formula is the analogue for
series of the integration by parts formula.
Then if 1 ≤ p ≤ q,
q
X q−1
X
a k bk = Ak (bk − bk+1 ) + Aq bq − Ap−1 bp .
k=p k=p
THEOREM 7.2.2 (Dirichlet Test) Suppose {ak } and {bk } are sequences
of real numbers satisfying
n
P
(a) the partial sums An = ak form a bounded sequence,
k=1
(b) b1 ≥ b2 ≥ b3 ≥ · · · ≥ 0, and
(c) lim bk = 0.
k→∞
∞
P
Then ak bk converges.
k=1
Proof. Since {An } is a bounded sequence, we can choose M > 0 such that
|An | ≤ M for all n. Also, since bn → 0, given ǫ > 0, there exists a positive
Series of Real Numbers 317
≤ 2 M bp < ǫ.
∞
P
Hence by the Cauchy criterion (Theorem 3.7.3), ak bk converges.
k=1
Alternating Series
Our first application of the Dirichlet P
test is to alternating series. An alter-
nating series is a series of the form (−1)k bk or (−1)k+1 bk , with bk ≥ 0
P
for all k.
Proof. Let ak = (−1)k+1 . Then |An | ≤ 1 for all n, and the Dirichlet test
applies.
Remark. The hypothesis (a) that {bk } is decreasing is required. If we only
assume that bk ≥ 0 and lim bk = 0, then the conclusion is false (Exercise 2).
k→∞
Trigonometric Series
Our next application of the Dirichlet test is to the convergence of trigonometric
series. These types of series will be studied in much greater detail in Chapter
9.
Proof. To prove the result, we require the following two identities: For t 6=
2pπ, p ∈ Z,
n
X cos 12 t − cos(n + 12 )t
sin kt = , (1)
k=1
2 sin 12 t
n
X sin(n + 12 )t − sin 21 t
cos kt = . (2)
k=1
2 sin 12 t
Series of Real Numbers 319
n
P
We will prove (1), leaving (2) for the exercises (Exercise 4). Set An = sin kt.
k=1
1
Using the trigonometric identity sin A sin B = 2 [cos(A − B) − cos(A + B)] we
obtain
n
X
(sin 21 t) An = sin 21 t sin kt
k=1
n
1 X
cos(k − 21 )t − cos(k + 21 )t
=
2
k=1
1
cos 21 t − cos(n + 12 )t .
=
2
Thus for t 6= 2pπ, p ∈ Z,
cos 12 t − cos(n + 21 )t
An = ,
2 sin 21 t
and therefore
| cos 12 t| + | cos(n + 21 )t| 1
|An | ≤ ≤ ,
2| sin 12 t| | sin 21 t|
Exercises 7.2
P
1. *(Abel’s Test)PProve that if ak converges, and {bk } is monotone and
bounded, then ak bk converges.
2. *Show by example that the hypothesis of Theorem 7.2.3 cannot be re-
placed by bk ≥ 0 and lim bk = 0.
k→∞
P P 2
3. If ak converges, does ak always converge?
4. *Prove that
n
X sin(n + 12 )t − sin 21 t
cos kt = , t 6= 2pπ, p ∈ Z.
k=1
2 sin 21 t
5. Test each of the following series for convergence:
P∞ (−1)k+1 ∞ (−1)k ln k
P ∞ (−1)k
P
*a. p
,p>0 b. *c.
k=1 k k=2 k k=2 k ln k
∞
P k+1 kk P∞
k+1 kk P∞ sin k
*d. (−1) k
e. (−1) k+1
*f.
k=1 (k + 1) k=1 (k + 1) k=2 ln k
P∞ sin kt ∞ cos kt
P
g. p
, t ∈ R, p > 0 *h. , t ∈ R, p > 0
k=1 k k=1 kp
∞
(−1)k+1 sin(π/k)
P
i.
k=1
6. Given that
∞ (−1)k+1
= π 2 12,
P
k 2
k=1
P
DEFINITION 7.3.1 A series ak of real numbers
P is said to be abso-
lutely convergent (or converges absolutely) if |ak | converges. The series
is said to be conditionally convergent if it is convergent but not absolutely
convergent.
EXAMPLES 7.3.2 P (a) Since the sequence {1/k} decreases to zero, by The-
orem 7.2.3 the series (−1)k+1 /k converges. However,
∞ ∞
X (−1)k+1 X 1
= = ∞.
k k
k=1 k=1
q
X q
X
ak ≤ |ak |,
k=p k=p
and the result now follows by the Cauchy criterion (Theorem 3.7.3).
P
Remark. To test a series ak for absolute convergence we can
P apply any of
the appropriate convergence tests of Section 7.1 to the series |ak |. There
P is
however one important fact which needs to be emphasized. If the series |ak |
diverges by virtue of the ratio or root test, i.e.,
|an+1 | p
r = lim >1 or α = lim n |an | > 1,
n→∞ |an | n→∞
P P
then not only does |ak | diverge, but ak also diverges.
To see this, suppose α > 1. Then as in the proof of the root test, |ak | > 1
for infinitely many k. Hence the sequence {ak } does not converge to zero, and
322 Introduction to Real Analysis
P
thus by Corollary 3.7.5, ak diverges. Similarly, if r > 1 and if 1 < c < r,
then as in the proof of Theorem 7.1.7(b), there exists a positive integer no
and constant M such that
|an | ≥ M cn
for all n ≥Pno . Thus again, since c > 1, {an } does not converge to zero, and
the series ak diverges. We summarize this as follows:
P
THEOREM 7.3.4 (Root and Ratio Test) Let ak be a series of real
numbers, and let p
α = lim k |ak |.
k→∞
Also, if ak 6= 0 for all k ∈ N, let
ak+1 ak+1
R = lim and r = lim .
k→∞ ak k→∞ ak
P
(a) If α < 1 or R < 1, then the series ak is absolutely convergent.
P
(b) If α > 1 or r > 1, then the series ak is divergent.
(c) If α = 1 or r ≤ 1 ≤ R, then the test is inconclusive.
Rearrangements of Series
We nextP take up the topic of rearrangements P of series. Loosely speaking, a
series P a′k is a rearrangement of the series P
ak if all the terms in the original
series ak appear exactly once in the series a′k , but not necessarily in the
same order. For example, the series
1 1 1 1 1 1 1
+ 2 + 2 + 2 + 2 + 2 + 2 + ···
12 3 2 4 5 7 6
is a rearrangement of the series
∞
X 1
.
k2
k=1
which converges, but not absolutely. Consider also the following series which
is a rearrangement of the above:
1 1 1 1 1 1 1 1
1+ − + + − + + − + ··· . (3)
3 2 5 7 4 9 11 6
Let
∞
X 1
s= (−1)k+1 .
k
k=1
Since
1 1 1 8k − 3
+ − =
4k − 3 4k − 1 2k 2k(4k − 1)(4k − 3)
we have
1 1 1 1
0< + − ≤M 2
4k − 3 4k − 1 2k k
for some constant M . Thus the sequence {s′3n } is strictly increasing, and by
the comparison test converges. Let s′ = lim s′3n . Since
n→∞
1
s′3n+1 = s′3n + , and
4n + 1
1 1
s′3n+2 = s′3n + + ,
4n + 1 4n + 3
324 Introduction to Real Analysis
the sequences {s′3n+1 } and {s′3n+2 } also converge to s′ . Therefore lim s′n = s′ .
n→∞
Thus the series (3) also converges. However, since
5
= s′3 < s′6 < s′9 < · · · ,
6
s′ = lim s′n > 65 . Thus the series (3) does not converge to the same sum as
n→∞
the original series. This, as we will see, is due to the fact that the given series
does not converge absolutely.
P
THEOREM P 7.3.8 If the series ak converges absolutely, then every rear-
rangement of ak converges to the same sum.
P ′ P P
Proof. Let ak be a rearrangement of ak . Since |ak | < ∞, given ǫ > 0,
there exists a positive integer N such that
m
X
|ak | < ǫ (4)
k=n
If n ≥ p,
n
X n
X
sn − s′n = ak − aj(k) .
k=1 k=1
By the choice of p, the numbers a1 , ..., aN appear in both sums and conse-
quently will cancel. Thus the only terms remaining will have index k or j(k)
greater than or equal to N . Therefore by (4),
|sn − s′n | ≤ 2ǫ
for all n ≥ p. Therefore lim s′n = lim sn , and thus the rearrangement con-
n→∞ n→∞
verges to the same sum as the original series.
For conditionally convergent series we have the following very interesting
result of Riemann.
P
THEOREM 7.3.9 Let ak be a conditionally convergent P
series ofPreal
numbers. Suppose α ∈ R. Then there exists a rearrangement a′k of ak
which converges to α.
Series of Real Numbers 325
Before proving the theorem, we illustrate the method of proof with the
alternating series
∞
X (−1)k+1 1 1 1
=1− + − + ··· .
k 2 3 4
k=1
This series converges, but fails to converge absolutely. The positive terms of
this series are Pk = 1/(2k − 1), k ∈ N, and the absolute value of the negative
P1 P P
terms are Qk = 1/2k, k ∈ N. Since = ∞, we also have Pk = Qk =
k
∞. Suppose for purposes of illustration α = 1.5. Our first step is to add “just
enough” positive terms to exceed α. More precisely, we let m1 be the smallest
integer such that P1 + · · · + Pm1 > α. For α = 1.5, m1 = 3; i.e., 1 + 31 + 15 > 1.5
whereas 1 + 13 < 1.5. Our next step is to go in the other direction; namely, we
let n1 be the smallest integer such that
the convergence of one implies the convergence of the other. Thus they both
must diverge. P
Let P1 , P2 , P3 , ... denote the positive terms of ak in the original order,
and let Q1 , Q2 , Q3 , ... be the
P absolute
P values of the negative
P terms,
P also in the
original order. The series Pk and Qk differ from pk and qk only by
zero terms and thus are also divergent.
326 Introduction to Real Analysis
We will inductively construct sequences {mk } and {nk } such that the series
P1 +· · ·+Pm1 −Q1 −· · ·−Qn1 +Pm1 +1 +· · · Pm2 −Qn1 +1 −· · ·−Qn2 +· · · (5)
has the desired property. Clearly (5) is a rearrangement of the original series.
Let m1 be the smallest integer such that
X1 = P1 + · · · + Pm1 > α.
P
Such an m1 exists since Pk = ∞. Similarly, let n1 be the smallest integer
such that
Y1 = X1 − Q1 − · · · − Qn1 < α.
Suppose {m1 , ..., mk } and {n1 , ..., nk } have been chosen. Let mk+1 and nk+1
be the smallest integers greater than mk and nk respectively such that
Xk+1 = Yk + Pmk +1 + · · · + Pmk+1 > α, and
Yk+1 = Xk+1 − Qnk +1 − · · · − Qnk+1 < α.
P P
Such integers always exist due to the divergence of the series Pk and Qk .
Since mk+1 was chosen to be the smallest integer such that the above holds,
Xk+1 − Pmk+1 ≤ α.
Therefore
0 < Xk+1 − α ≤ Pmk+1 .
Similarly,
0 < α − Yk+1 ≤ Qnk+1 .
P
Since an converges, we have lim Pk = lim Qk = 0. Therefore,
k→∞ k→∞
lim Xk = lim Yk = α.
k→∞ k→∞
Let Sn be the nth partial sum of the series (5). If the last term of Sn is a
Pn , then there exists a k such that
Yk < Sn ≤ Xk+1 .
If the last term of Sn is −Qn , then there exists a k such that
Yk+1 ≤ Sn < Xk+1 .
In either case we obtain lim Sn = α, which proves the result.
n→∞
Remark.
P By a variation of the above proof one can show that if the series
ak is conditionally convergent, then given α, β with
−∞ ≤ α ≤ β ≤ ∞,
P ′ P
there exists a rearrangement ak of ak such that
lim Sn = α, and lim Sn = β,
n→∞ n→∞
Exercises 7.3
1. Prove the following:
a. If lim kp ak = A for some p > 1, then
P
ak converges absolutely.
k→∞
P
b. If lim k ak = A 6= 0, then ak diverges.
k→∞
|p(k + 1)|
8. Prove that lim = 1 for any polynomial p.
k→∞ |p(k)|
1 1 1 1 1
9. *Show that the series 1 + − + + − + · · · diverges.
2 3 4 5 6
1 1 1 1 1 1
10. Determine whether the series 1 − − + + − − + · · · converges
2 3 4 5 6 7
or diverges.
∞
X sin k
11. *Prove that the series is conditionally convergent.
k
k=1
P P ′
12. If ak ≥ 0 for allPk ∈ N,P and ak = ∞, prove that ak = ∞ for any
′
rearrangement ak of ak .
328 Introduction to Real Analysis
P
13. Suppose that the series ak is conditionally convergent. Given
P ′α, βPwith
−∞ ≤ α ≤ β ≤ ∞, show that there exists a rearrangement ak of ak
such that lim Sn = α and lim Sn = β, where Sn is the nth partial sum
n→∞
P ′ n→∞
of ak .
P
14. P
Suppose every rearrangement of the series ak converges, prove that
ak converges absolutely.
2
The set ℓ is called the space of square summable sequences, and the
quantity k{ak }k2 is called the norm of the sequence {ak }.
2 The topic of square summable sequences, although important in the study of Fourier
series, is not specifically required in Chapter 9 and thus can be omitted on first reading. The
concept of a normed linear space occurs on several occassions in the discussion of subsequent
topics in the text. At that point the reader can study this topic more carefully.
Series of Real Numbers 329
√ ∞
X 1
k{1/ k}k22 = = ∞,
k
k=1
we have {1 k} 6∈ ℓ2 .
√
∞
X 1
k{1/k q }k22 = .
k 2q
k=1
By Example 7.1.9(a)
this series converges for all q > 1/2 and diverges for all
q ≤ 1/2. Thus {1 k q } ∈ ℓ2 if and only if q > 1/2.
Cauchy-Schwarz Inequality
Our main goal in this section is to prove the Cauchy-Schwarz inequality for
sequences in ℓ2 . First however we prove the finite version of this inequality.
0 ≤ A − 2λC + λ2 B,
n n n
a2k , C = b2k . If B = 0, then bk = 0 for
P P P
where A = |ak bk |, and B =
k=1 k=1 k=1
all k = 1, ..., n and the Cauchy-Schwarz inequality certainly holds. If B 6= 0,
we take λ = C B which gives
C2
0≤A−
B
330 Introduction to Real Analysis
Taking the square root of both sides gives the desired result.
As a consequence of the previous result we have the following corollary.
∞
X
|ak bk | ≤ k{ak }k2 k{bk }k2 .
k=1
Minkowski’s Inequality
Our next result shows that the norm k k2 satisfies the “triangle inequality” on
ℓ2 .
we have
∞
X ∞
X
k{ak + bk }k22 = 2
(ak + bk ) ≤ k{ak }k22 +2 |ak bk | + k{bk }k22 ,
k=1 k=1
Taking the square root of both sides gives the desired result.
Although not specifically stated, Minkowski’s inequality is also true for
finite sums. In particular if n ∈ N and a1 , ..., an , b1 , ..., bn , are real numbers,
then v v v
u n u n u n
uX uX uX
t (ak + bk )2 ≤ t 2
ak + t b2k . (6)
k=1 k=1 k=1
Proof. The results (a) and (b) are obvious from the definition, and (d) is just
a restatement of Minkowski’s inequality. The verification of (c) is left as an
exercise (Exercise 5).
DEFINITION 7.4.7 A set X with two operations “+”, vector addition, and
“·”, scalar multiplication, satisfying
x+y ∈X for all x, y ∈ X, and
c·x∈X for all x ∈ X, c ∈ R
is a vector space over R if the following are satisfied:
(a) x + y = y + x. (commutative law)
(b) x + (y + z) = (x + y) + z. (associative law)
(c) There is a unique element in X called the zero element, denoted 0,
such that
x+0=x for all x ∈ X.
Exercises 7.4
1. Determine which of the following sequences are in ℓ2 .
∞ ∞ ∞ ∞
1 1 ln k sin k
*a. b. √ *c. √ a. .
ln k k=2 k ln k k=2 k k=2 k k=1
5. If {ak } ∈ ℓ2 and c ∈ R, prove that {cak } ∈ ℓ2 and k{cak }k2 = |c|k{ak }k2 .
6. For each n ∈ N, let en be the sequence in ℓ2 defined by
(
0, k 6= n,
en (k) = .
1, k = n.
334 Introduction to Real Analysis
√
Show that ken − em k2 = 2 if n 6= m. (Remark. The sequence {en }
is a bounded sequence in ℓ2 with no convergent subsequence. Thus the
Bolzano-Weierstrass theorem (3.4.6) fails in ℓ2 . )
7. Show that equality holds in the Cauchy-Schwarz inequality if and only if
for all k ∈ N, bk = cak for some c ∈ R.
8. For a, b ∈ Rn , let ha, bi denote the inner product of a and b. Prove each
of the following:
a. ha, ai ≥ 0.
b. ha, ai = 0 if and only if a = (0, ..., 0).
c. ha, b + ci = ha, bi + ha, ci.
d. ha, bi = hb, ai.
e. ha, bi = 21 kak22 + kbk22 − ka − bk22 .
Notes
The geometric series is perhaps one of the most important series in analysis. In the
seventeenth and eighteenth century, the convergence of many series was established
by comparison with the geometric series. It forms the basis for the proof of the
root and ratio test, and thus also for the study of convergence of power series. The
geometric series dates back to Euclid in the 3rd century B.C. The formula for a finite
sum of a geometric progression appeared in Euclid’s Elements, and Archimedes, in
his treatise Quadrature of the Parabola, indirectly used the geometric series to find
the area under a parabolic arc.3 Even though Greek mathematicians knew how to
3 See p. 105 of the text by Katz.
Series of Real Numbers 335
in this text. Cauchy was undoubtedly influenced by the works of d’Alembert and
Lagrange in his definition of convergence of a series. However, unlike the results of
d’Alembert, which applied only to the binomial series, and those of Lagrange for
Taylor series, Cauchy’s definition of convergence was entirely general; applicable to
any series of real numbers.
Using his definition of convergence, Cauchy proved that the statement now
known as the Cauchy criterion was necessary for the convergence of a series of
real numbers. This was was also known earlier to Bolzano. However, without the
completeness property of the real number system, neither Bolzano nor Cauchy were
able to prove that the Cauchy criterion was also sufficient for convergence of the
series.
Miscellaneous Exercises
∞
P ∞
P n
P
1. Given two series ak and bk , set cn = ak bn−k , n = 0, 1, 2, ....
k=0 k=0 k=0
P∞ P P
The series cn is called the Cauchy product of ak and bk .
n=0
P P P
a. If ak and bk converge absolutely, prove that cn converges ab-
solutely and that
∞
∞ ∞
P P P
cn = ak bk .
n=0 k=0 k=0
k
√
b. Let ak =P bk = (−1)
P k + 1, k = 0, 1, 2... Prove that the Cauchy
product of ak and bk diverges.
c. Prove that the result of (a) is still true if only one of the two series
converge absolutely; the other series must still converge.
2. Let X be a non-empty set. If f is a real-valued function on X, define
P
kf k1 = sup{ |f (x)| : F is a finite subset of X}.
x∈F
In the above, the supremum is taken over all finite subsets F of X. Denote
by ℓ1 (X) the set of all real-valued functions f on X for which kf k1 < ∞.
a. Suppose X is infinite. If f ∈ ℓ1 (X), prove that {x ∈ X : f (x) 6= 0} is
at most countable.
∈ X : f (x) 6= 0} = {xn : n ∈ A}, where A ⊂ N, prove that
b. If {x P
kf k1 = |f (xn )|.
n∈A
Rb
a. Prove that for f, g ∈ R[a, b], |f (x)g(x)| dx ≤ kf k2 kgk2 .
a
Series of Real Numbers 337
Supplemental Reading
Ali, S. A., “The mth ratio test: century analysis,” Math. Mag. 72 (1999),
new convergence tests for series,” Amer. 347–355.
Math. Monthly 115 (2008), 514–524. Grabinger, Judith V., The Origins of
Behforooz, G. H., “Thinning out the Cauchy’s Rigorous Calculus, The MIT
harmonic series,” Math. Mag. 68 (1995), Press, Cambridge, Massachusetts, 1981.
289–293. Hoang, N. S., “A limit comparison
Boas, R. P., “Estimating remain- test for general series,” Amer. Math.
ders,” Math. Mag. 51 (1978), 83–89. Monthly 122 (2015), 893–896.
Boas, R. P., “Partial sums of infinite Jungck, G. “An alternative to the in-
series and how they grow,” Amer. Math. tegral test,” Math. Mag. 56 (1983), 232–
Monthly 84 (1977), 237–258. 235.
Bradley, D. M., “An infinite series Katz, V. J., “Ideas of Calculus in Is-
that displays the concavity of the nat- lam and India,” Math. Mag. 68 (1995),
ural logarithm,” Math. Mag. 90 (2017), 163–174.
353–354. Kortram, R. A., “Simple proofs
∞
Cowen, C. C., Davidson, K. R., and for
P
1/k2 = 61 π 2 and sin x =
Kaufman, R.P., “Rearranging the alter- k=1
∞ x2
nating harmonic series,” Amer. Math. Q
x 1 − 2 2 ,” Math. Mag. 69
Monthly 87 (1980), 817–819. k=1 k π
Creswell, S. H.. “A continuous bi- (1996), 122-125.
jection of ℓ2 onto a subset of ℓ2 whose Krantz, S. G. and McNeal, J.
inverse is discontinuous everywhere,” D., “Creating more convergent series,”
Amer. Math. Monthly 117 (2010), 823– Amer. Math. Monthly 111 (2004), 32–38.
828. Maher, P., “Jensens inequality by
Daners,
P D.,2 “A 2short elementary differentiation,” Math. Gaz. 73 (1989),
proof of 1/k = π /6,” Math. Mag. 139–140.
85 (2012), 361–364. P Passare, M., “How to compute
Dence, T. P. and Dence, J. P., “A 1/n2 by solving triangles,” Amer.
survey of Euler’s constant,” Math. Mag. Math. Monthly 115 (2008), 745–752.
82 (2009), 255–265. Tolsted, E., “An elementary deriva-
Efthimiou, C. J., “Finding exact val- tion of the Cauchy, Hölder, and
ues for infinite series,” Math. Mag. 72 Minkowski inequalities from Young’s in-
(1999), 45–51. equality,” Math. Mag. 37 (1964), 2–12.
Goar, M., “Olivier and Abel on series Young, R. M., “Euler’s constant,”
convergence: An episode from early 19th Math. Gazette 75 (1991), 187–190.
8
Sequences and Series of Functions
339
340 Introduction to Real Analysis
If as above, {fn }∞
n=1 is a sequence of real-valued functions on a nonempty
set E, then with the sequence {fn } we can associate the sequence {Sn } of
nth partial sums, where for each n ∈ N, Sn is the real-valued function on E
defined by
n
X
Sn (x) = f1 (x) + · · · + fn (x) = fk (x).
k=1
P∞
The sequence {Sn } is called a series of functions on E denoted by fk or
P P k=1
simply fk . The series fk converges pointwise on E if for each x ∈ E
∞
P
the sequence {Sn (x)} of partial sums converges; that is, the series fk (x)
k=1
converges for each x ∈ E. If the sequence {Sn } converges
Ppointwise to the
function S on E, then S is called the sum of the series fk and we write
∞
P
S= fk , or if we wish to emphasize the variable x,
k=1
∞
X
S(x) = fk (x), x ∈ E.
k=1
Suppose fn : [a, b] → R for all n ∈ N, and fn (x) → f (x) for each x ∈ [a, b].
Among the questions we want to consider are the following:
(a) If each fn is continuous at p ∈ [a, b], is the function f continuous at
p? Recall that the function f is continuous at p if and only if
lim f (t) = f (p).
t→p
EXAMPLES 8.1.2 (a) Let E = [0, 1], and for each x ∈ E, n ∈ N, let
fn (x) = xn . Clearly each fn is continuous on E. Since fn (1) = 1 for all n,
lim fn (1) = 1. If 0 ≤ x < 1, then by Theorem 3.2.6(e), lim fn (x) = 0.
n→∞ n→∞
Therefore (
0, 0 ≤ x < 1,
lim fn (x) = f (x) =
n→∞ 1, x = 1.
The function f however is not continuous on [0, 1]. (In Exercise 1 you will be
asked to sketch the graphs of f1 , f2 , f4 .)
(b) Consider the sequence {fk }∞ k=0 defined by
x2
fk (x) = , x ∈ R.
(1 + x2 )k
∞
P
For each k = 0, 1, 2, .., fk is continuous on R. Consider the series fk which
k=0
for each x ∈ R is given by
∞ ∞ k
X X
2 1
fk (x) = x .
1 + x2
k=0 k=0
We now show that this series converges for all x ∈ R and also find its sum
f . If x = 0, then fk (0) = 0 for all k, and thus f (0) = 0. If x 6= 0, then
1/(1 + x2 ) < 1 and hence by Example 3.7.2(a)
∞ k " #
2
X 1 2 1
x =x 1 = 1 + x2 = f (x).
1 + x2 1 − 1+x2
k=0
Therefore (
0, x = 0,
f (x) =
1 + x2 , x 6= 0,
which again is not continuous on R.
(c) Let {xk } be an enumeration of the rational numbers in [0, 1]. For each
n ∈ N, define fn as follows:
(
0, if x = xk , 1 ≤ k ≤ n,
fn (x) =
1, otherwise.
sin nx
(e) As our final example, consider fn (x) = , x ∈ R. Since | sin nx| ≤
n
1 for all x ∈ R and n ∈ N,
f (x) = lim fn (x) = 0 for all x ∈ R.
n→∞
′
Therefore f (x) = 0 for all x. On the other hand,
fn′ (x) = cos nx.
In particular fn′ (0) = 1 so that lim fn′ (0) = 1 6= f ′ (0). This example shows
n→∞
that in general
d
lim fn (x) 6= lim fn′ (x).
dx n→∞ n→∞
Additional examples are also given in the exercises.
Exercises 8.1
1. Let fn be as in Example 8.1.2. Sketch the graphs of f1 , f2 , and f4 .
2. Find the pointwise limits of each of the following sequences of functions
on the
given set.
nx sin nx
*a. , x ∈ [0, ∞) b. , x ∈ [0, ∞)
1 + nx n 1 + nx 2 o
*c. (cos x)2n , x ∈ R. d. nxe−nx , x ∈ R.
3. Determine the values of x for which each of the following series converge.
∞ ∞
X nxn X xn
*a. n
b. , x=6 1
n=1
2 n=1
(1 − x)n
∞ ∞
X 1 X 2n (sin x)n
*c. d.
n=1
3nx n=1
n
344 Introduction to Real Analysis
*b. Given ǫ > 0, does there exist an integer no ∈ N such that |fn (x)| < ǫ
for all x ∈ [0, ∞) and all n ≥ no . (Hint: determine the maximum of fn
on [0, ∞))
c. Answer the same question as in (b) for x ∈ [0, a], a > 0.
8. *If an,m ≥ 0, n, m ∈ N, prove that
X∞ X ∞ ∞ X
X ∞
an,m = an,m ,
n=1 m=1 m=1 n=1
with the convention that if one of the sums is finite, the other is also and
equality holds, and if one is infinite, so is the other.
for all n ≥ no . The key here is that the choice of the integer no may depend
Sequences and Series of Functions 345
We now show that the convergence is not uniform. If the convergence were
uniform, then given ǫ > 0, there would exist a positive integer no such that
|fn (x) − f (x)| < ǫ for all n ≥ no . In particular,
Since the series is a telescoping series, the nth partial sum Sn (x) is given by
2
Sn (x) = nxe−nx .
346 Introduction to Real Analysis
FIGURE 8.1
Graphs of S4 , S8 , S16
Proof. If {fn } converges uniformly to f on E, then the proof that (1) holds
is similar to the proof that every convergent sequence is Cauchy. Conversely,
suppose that the sequence {fn } satisfies (1). Then for each x ∈ E, the sequence
{fn (x)} is a Cauchy sequence in R, and hence converges (Theorem 3.6.5).
Therefore,
f (x) = lim fn (x)
n→∞
for all x ∈ E. Since the above holds for all m ≥ no , the sequence {fn } converges
uniformly to f on E.
The analogous result for series is as follows:
∞
P
COROLLARY 8.2.4 The series fk of real-valued functions on E con-
k=1
verges uniformly on E if and only if given ǫ > 0, there exists a positive integer
no , such that
Xm
fk (x) < ǫ
k=n+1
Proof. The proof of the Corollary followsP by applying the previous theorem
to the partial sums Sn (x) of the series fk (x).
Proof. Exercise 1.
348 Introduction to Real Analysis
we have
Mn = sup |fn (x) − f (x)| = 1.
x∈[0,1]
Thus since {Mn } does not converge to zero, the sequence {fn } does not con-
verge uniformly to f on [0, 1]. On the other hand, if 0 < a < 1 is fixed,
then
Mn = sup |fn (x)| = an .
x∈[0,a]
n
Since lim a = 0, by Theorem 8.2.5 the sequence {fn } converges uniformly
n→∞
to the zero function on [0, a] for every fixed a, 0 < a < 1,
n
X n
X n
X
|Sn (x) − Sm (x)| = fk (x) ≤ |fk (x)| ≤ Mk .
k=m+1 k=m+1 k=m+1
P
Uniform convergence now follows by the Cauchy Criterion. That |fk (x)|
also converges is clear.
P
EXAMPLES 8.2.8 (a) If ak converges absolutely, then P since |ak cos kx| ≤
|ak | for all x ∈ R, by the Weierstrass M-test
P the series ak cos kx converges
uniformly on R. Similarly for the series ak sin kx. In particular, the series
∞ ∞
X cos kx X sin kx
, , p > 1,
kp kp
k=1 k=1
converge uniformly on R.
∞
(x/2)k . This is a geometric series that converges
P
(b) Consider the series
k=1
for all x ∈ R satisfying |x| < 2. If 0 < a < 2 and |x| ≤ a, then
x k a k
≤ .
2 2
Since a/2 < 1 the series (a/2)k converges. Thus by the Weierstrass M-test,
P
∞
(x/2)k converges uniformly on [−a, a] for any a, 0 < a < 2. The
P
the series
k=1
series however does not converge uniformly on (−2, 2) (Exercise 11).
Thus by Theorem 7.2.3, the series (−1)k+1 ak (x) converges for all x ∈ [0, 1].
P
Let
∞
X
S(x) = (−1)k+1 ak (x).
k+1
350 Introduction to Real Analysis
If Sn (x) is the nth partial sum of the series, then by Theorem 7.2.4
1
|S(x) − Sn (x)| ≤ an+1 (x) ≤ , for all x ∈ [0, 1].
n+1
Thus {Sn } converges uniformly to S on [0, 1]. However, the given
series does
not converge absolutely when x = 1. The series (−1)k+1 xk k, x ∈ [0, 1],
P
also provides an example of a series that converges uniformly on [0, 1] but for
which the Weierstrass M-test fails.
(b) The converse is also false; absolute convergence need not imply uniform
∞
x2 (1 + x2 )−k of Example
P
convergence! As an example, consider the series
k=1
8.1.2(b). Since all the terms are nonnegative, the series converges absolutely
to (
0, x = 0,
f (x) = 2
1+x , x 6= 0,
on R. However, as a consequence of Corollary 8.3.2 of the next section, since
f is not continuous at 0, the convergence cannot be uniform on any interval
containing 0.
Exercises 8.2
1. Prove Theorem 8.2.5.
2. a. If {fn } and {gn } converge uniformly on a set E, prove that {fn + gn }
converges uniformly on E.
*b. If {fn } and {gn } converge uniformly on a set E, and if in addition
there exist constants M and N such that |fn (x)| ≤ M and |gn (x)| ≤ N
for all n ∈ N and all x ∈ E, prove that {fn gn } converges uniformly on E.
c. Find examples of sequences {fn } and {gn } that converge uniformly
on a set E, but for which {fn gn } does not converge uniformly on E.
3. Show that if {fn } converges uniformly on (a, b) and {fn (a)} and {fn (b)}
converge, then {fn } converges uniformly on [a, b].
4. *Let fn (x) = n x(1 − x2 )n , 0 ≤ x ≤ 1. Show that {fn } does not converge
uniformly to 0 on [0, 1].
xn
5. Let fn (x) = , 0 ≤ x ≤ 1.
1 + xn
*a. Show that {fn } converges uniformly to 0 on [0, a] for any a, 0 < a < 1.
*b. Does {fn } converge uniformly on [0, 1] ?
2
6. Show that the sequence {nxe−nx } converges uniformly to 0 on [a, ∞)
for every a > 0.
x
7. For each n ∈ N, set fn (x) = x + sin nx, x ∈ R. Show that the sequence
n
{fn } converges uniformly to f (x) = x for all x ∈ [−a, a], a > 0. Does
{fn } converge uniformly to f on R?
Sequences and Series of Functions 351
8. Show that each of the following series converge uniformly on the indicated
interval.
∞ 1 ∞
e−kx xk , 0 ≤ x < ∞.
P P
*a. 2 + x2
, 0 ≤ x < ∞. b.
k=1 k k=1
∞ ∞ (−1)k+1
k2 e−kx , 1 ≤ x < ∞.
P P
*c. d. , 0 ≤ x < ∞.
k=1 k=1 k + x
9. Test each of the following series for uniform convergence on the indicated
interval.
∞
P sin 2kx P∞ xk
*a. 3/2
, x ∈ R. b. 2
, |x| ≤ 1.
k=1 (2k + 1) k=2 k (ln k)
∞ (−1) k+1 2k+1
P x
c. , |x| ≤ 1.
k=0 2k + 1
∞
sin(x/kp ), p > 1, |x| ≤ 2.
P
*d.
k=1
∞
P 1 1
*e. − , 0 ≤ x ≤ 1.
k=0 kx + 2 (k + 1)x + 2
10. Show that each of the following series converge uniformly on [a, ∞) for
any a > 0, but do not converge uniformly on (0, ∞).
P∞ 1 P∞ 1
*a. 2
. b. 1+x
.
k=0 1 + k x k=1 k
∞
(x/2)k does not converges uniformly on (−2, 2).
P
11. Show that the series
k=1
12. *If ∞
P P∞ k
k=0 ak converges absolutely, prove that k=0 ak x converges uni-
formly on [−1, 1].
13. Let {fn } be a sequence of functions that converges uniformly to a con-
tinuous function f on (−∞, ∞). Prove that
lim fn x + n1 = f (x) for all x ∈ (−∞, ∞).
n→∞
P
14. Let {ck } be a sequence of real numbers satisfying |ck | < ∞, and let
{xk } be a countable subset of [a, b]. Prove that the series
X∞
ck I(x − xk )
k=1
converges uniformly on [a, b]. Here I is the unit jump function defined in
Definition 4.4.9.
15. (Dirichlet Test for Uniform Convergence) Suppose {fk } and {gk }
are sequences of functions on a set E satisfying
n
P
(a) the partial sums Sn (x) = gk (x) are uniformly bounded on E,
k=1
i.e., there exists M > 0 such that |Sn (x)| ≤ M for all n ∈ N and x ∈ E.
(b) fk (x) ≥ fk+1 (x) ≥ 0 for all k ∈ N and x ∈ E, and
(c) lim fk (x) = 0 uniformly on E.
k→∞
P
Prove that fk (x)gk (x) converges uniformly on E.
P∞ sin kx ∞ cos kx
P
16. *Prove that p
, (p > 0) converge uniformly on any
k=1 k k=1 kp
closed interval which does not contain an integer multiple of 2π.
352 Introduction to Real Analysis
lim fn (x) = An .
x→p
for all n, m ≥ no and all x ∈ E. Since (2) holds for all x ∈ E, letting x → p
gives
|An − Am | ≤ ǫ, for all n, m ≥ no .
Thus {An } is a Cauchy sequence in R, which as a consequence of Theorem
3.6.5 converges. Let A = lim An .
n→∞
It remains to be shown that lim f (x) = A. Again, let ǫ > 0 be given. First,
x→p
by the uniform convergence of the sequence {fn (x)} and the convergence of
the sequence {An }, there exists a positive integer m such that
ǫ
|f (x) − fm (x)| <
3
for all x ∈ E, and also that
ǫ
|A − Am | < .
3
Since lim fm (x) = Am , there exists a δ > 0 such that
x→p
ǫ
|fm (x) − Am | < for all x ∈ E, 0 < d(x, p) < δ.
3
By the triangle inequality,
|f (x) − A| < ǫ;
∞
X
S(x) = fn (x)
n=1
is continuous on E.
354 Introduction to Real Analysis
Therefore f is continuous at p.
(b) For the proof of (b), let
n
X
Sn (x) = fk (x).
k=1
EXAMPLE 8.3.3 The sequence {xn }∞ n=1 , x ∈ [0, 1], of Example 8.1.2(a)
does not converge uniformly on [0, 1] since the limit function
(
0 0 ≤ x < 1,
f (x) =
1, x = 1,
Dini’s Theorem1
A natural question to ask is whether the converse of Corollary 8.3.2 is true.
Namely, if f and fn are continuous for all n and fn → f pointwise, is the
convergence necessarily uniform? The following example shows that this need
not be the case. However, in Theorem 8.3.5 we will prove that with the addi-
tional assumption that the sequence {fn (x)} is monotone for all x, then the
convergence is indeed uniform.
1 This topic is not required in subsequent sections and thus can be omitted on first
reading.
Sequences and Series of Functions 355
However, by Theorem 2.3.8 this can only be the case if Kno = ∅ for some
no ∈ N. Thus for all n ≥ no ,
0 ≤ gn (x) < ǫ for all x ∈ K.
Therefore the sequence {gn } converges uniformly to 0 on K.
356 Introduction to Real Analysis
2 This topic can also be omitted on first reading. The concept of a complete normed
Proof. Let {fn } be a Cauchy sequence in C(K); i.e., given ǫ > 0, there exists
a positive integer no such that kfn − fm ku < ǫ for all n, m ≥ no . But then
for all x ∈ [a, b] and all n, m ≥ no . Thus by Theorem 8.2.3 and Corollary
8.3.2 the sequence {fn } converges uniformly to a continuous function f on K.
Finally, since the convergence is uniform, given ǫ > 0, there exists an integer
no such that
Contraction Mappings
In Exercise 13 of Section 4.3 we defined the notion of a contractive function
on a subset E of R. We now extend this to normed linear spaces.
THEOREM 8.3.13 Let (X, k k) be a complete normed linear space and let
T : X → X be a contraction mapping. Then there exists a unique point x ∈ X
such that T (x) = x.
Proof. Suppose T : X → X satisfies kT (x) − T (y)k ≤ ckx − yk for fixed
c, 0 < c < 1, and all x, y ∈ X. We now define a sequence {xn } in X as
follows: Let xo ∈ X be arbitrary. For n ∈ N set xn = T (xn−1 ). That is,
x1 = T (xo ), x2 = T (x1 ), etc. Since
kxn+1 − xn k = kT (xn ) − T (xn−1 )k ≤ ckxn − xn−1 k,
the sequence {xn } is a contractive sequence in X (see Definition 3.6.8). An
argument similar to the one used for a contractive sequence in Section 3.6
shows that
cn
kxn+m − xn k ≤ kx1 − xo k
1−c
for all n, m ∈ N. The details are left as an exercise (Exercise 10). Since cn → 0,
the sequence {xn } is a Cauchy sequence in X. By completeness, the sequence
{xn } converges (in the norm) to some x ∈ X. But by continuity of the mapping
T,
x = lim xn+1 = lim T (xn ) = T (x),
n→∞ n→∞
i.e., T (x) = x. Suppose y ∈ X also satisfies T (y) = y. But then
ky − xk = kT (y) − T (x)k ≤ cky − xk.
Since 0 < c < 1, the above can be true if and only if ky − xk = 0, that is
y = x. Thus x is unique.
Sequences and Series of Functions 359
Exercises 8.3
∞
x(1 − x)k cannot converge uniformly for 0 ≤
P
1. *Show that the series
k=0
x ≤ 1.
2. For n ∈ N, let fn (x) = xn /(1 + xn ), x ∈ [0, 1]. Prove that the sequence
{fn } does not converge uniformly on [0, 1].
3. Give an example of a sequence of functions that are not continuous at
any point but which converges uniformly to a continuous function.
4. *Suppose that f is uniformly continuous on R. For each n ∈ N, set
fn (x) = f (x + n1 ). Prove that the sequence {fn } converges uniformly to
f on R.
5. Let {fn } be a sequence of continuous real-valued functions that converges
uniformly to a function f on a set E ⊂ R. Prove that lim fn (xn ) = f (x)
n→∞
for every sequence {xn } ⊂ E such that xn → x ∈ E.
6. * Let E ⊂ R and let D be a dense subset of E. If {fn } is a sequence of
continuous real-valued functions on E, and if {fn } converges uniformly
on D, prove that {fn } converges uniformly on E. (Recall that D is dense
in E if every point of E is either a point of D or a limit point of D)
7. Find a sequence {fn } in C[0, 1] with kfn ku = 1 such that no subsequence
of {fn } converges (in norm) in C[0, 1].
8. Suppose {fn } is a sequence of continuous functions on [a, b] that converges
uniformly on [a.b]. For each x ∈ [a, b], set g(x) = sup{fn (x)}.
n
Rx
11. Define T : C[0, 1] → C[0, 1] by (T ϕ)(x) = ϕ(t) dt, 0 ≤ x ≤ 1,
0
2
ϕ ∈ C[0, 1], and set T = T ◦ T .
*a. Prove that |(T 2 ϕ)(x)| ≤ 21 x2 kϕku .
b. Show that T 2 is a contraction mapping on C[0, 1] and thus has a fixed
point in C[0, 1].
c. Prove that T has a fixed point in C[0, 1].
12. Prove that (C(K), k ku ) is a normed linear space.
13. Prove that (ℓ2 , k k2 ) is a complete normed linear space.
360 Introduction to Real Analysis
R1 R1
Thus lim fn (x) 6= lim fn (x)dx. Hence, pointwise convergence, even if
n→∞ 0 0 n→∞
the limit function is Riemann integrable, is also not sufficient for the inter-
change of limits.
In this section, we will prove that uniform convergence of a sequence {fn }
of Riemann integrable functions is again sufficient for the limit function f
to be Riemann integrable, and for convergence of the definite integrals of fn
to the definite integral of f . The analogous result for the Riemann-Stieltjes
integral is left to the exercises (Exercise 2).
THEOREM 8.4.1 Suppose fn ∈ R[a, b] for all n ∈ N, and suppose that the
sequence {fn } converges uniformly to f on [a, b]. Then f ∈ R[a, b] and
Z b Z b
f (x) dx = lim fn (x) dx.
a n→∞ a
Since fn → f uniformly on [a, b], by Theorem 8.2.5, lim ǫn = 0. Also, for all
n→∞
x ∈ [a, b],
fn (x) − ǫn ≤ f (x) ≤ fn (x) + ǫn .
Hence
Z b Z b Z b Z b
(fn − ǫn ) ≤ f ≤ f ≤ (fn + ǫn ). (3)
a a a a
Therefore
Z b Z b
0≤ f − f ≤ 2ǫn [b − a].
a a
Sequences and Series of Functions 361
and thus Z b Z b
lim fn (x) dx = f (x) dx.
n→∞ a a
where the series converges uniformly on [a, b], then f ∈ R[a, b] and
Z b ∞ Z b
X
f (x) dx = fk (x) dx.
a k=1 a
n
P
Proof. Apply the previous theorem to Sn (x) = fk (x), which by Theorem
k=1
6.2.1 is integrable for each n ∈ N.
Although uniform convergence is sufficient for the conclusion of Theorem
8.4.1; it is not necessary. For example, if fn (x) = xn , x ∈ [0, 1], then {fn }
converges pointwise, but not uniformly, to the function
(
0, 0 ≤ x < 1,
f (x) =
1, x = 1.
In Section 10.6, using results from the Lebesgue theory of integration, we will
be able to prove a stronger convergence result that does not require uniform
convergence of the sequence {fn }. However, it does require that the limit
function f is Riemann integrable. For completeness we include a statement of
that result at this point.
It is easily checked that the sequence {fn } on [0, 1], where for each n ∈ N
fn (x) = xn , satisfies the hypothesis of the previous theorem. Also, since the
limit function f is continuous except at x = 1, f ∈ R[0, 1]. On the other hand,
the sequence {fn } of Example 8.1.2(d) does not satisfy the hypothesis of the
theorem.
Exercises 8.4
P
1. *If |ak | < ∞, prove that
∞
Z 1 ! ∞
X k
X ak
ak x dx = .
0 k+1
k=0 k=0
4. *If f is Riemann integrable on [0, 1], use the bounded convergence theo-
rem to prove that
Z 1
lim xn f (x) dx = 0.
n→∞ 0
5. Let {fn } be a sequence in R[a, b] that converges uniformly to f ∈ R[a, b].
Rx Rx
For n ∈ N set Fn (x) = fn , and let F (x) = f , x ∈ [a, b]. Prove that
0 0
{Fn } converges uniformly to F on [a, b].
R1
6. *Suppose f : [0, 1] → R is continuous. Prove that lim f (xn )dx = f (0).
n→∞ 0
7. *Let {rn } be an enumeration of the rational numbers in [0, 1], and let
f : [0, 1] → R be defined by
P∞ 1
f (x) = k
I(x − xk ),
k=1 2
where I is the unit jump function of Definition 4.4.9. Prove that
f ∈ R[0, 1].
8. Define g on R by g(x) = x − [x], where [x] denotes the greatest integer
function. Prove that the function
P∞ g(nx)
f (x) =
n=1 n2
is Riemann integrable on [0, 1]. (This function was given by Riemann as
an example of a function that is not integrable according to Cauchy’s
definition.)
Sequences and Series of Functions 363
for all x ∈ [a, b]. The result can now be proved using Corollary 8.3.2 and
Theorem 8.4.1. The details are left to the exercises (Exercise 2).
Proof. Let ǫ > 0 be given. Since {fn (xo )} converges and {fn′ } converges
uniformly, there exists no ∈ N such that
ǫ
|fn (xo ) − fm (xo )| < , for all n, m ≥ no , (4)
2
and
ǫ
|fn′ (t) − fm
′
(t)| < , for all t ∈ [a, b], and all n, m ≥ no . (5)
2(b − a)
Apply the mean value theorem to the functions fn − fm with n, m ≥ no fixed.
Then for x, y ∈ [a, b], there exists t between x and y such that
Thus by (5),
ǫ ǫ
|(fn (x) − fm (x)) − (fn (y) − fm (y))| ≤ |x − y| < . (6)
2(b − a) 2
Take y = xo in (6). Then by (4) and (6), for all x ∈ [a, b] and n, m ≥ no ,
|fn (x) − fm (x)| ≤ |(fn (x) − fm (x)) − (fn (xo ) − fm (xo ))| + |fn (xo ) − fm (xo )|
ǫ ǫ
< + = ǫ.
2 2
Hence by Theorem 8.2.3 the sequence {fn } converges uniformly on [a, b]. Let
for all x ∈ [a, b]. Fix p ∈ [a, b], and for t 6= p, t ∈ [a, b], define
Then gn (t) → g(t) for each t ∈ [a, b], t 6= p, and for each n
Since 2−k sin kx ≤ 2−k for all x ∈ R, by the Weierstrass M-test this series
converges uniformly to a function S on R. For n ∈ N, let
n
X sin kx
Sn (x) = .
2k
k=1
Then
n
X k cos kx
Sn′ (x) = .
2k
k=1
P −k
Since k2 converges, by the Weierstrass M-test the sequence {Sn′ } con-
verges uniformly on R. Thus by Theorem 8.5.1,
∞
X k cos kx
S ′ (x) = lim Sn′ (x) = .
n→∞ 2k
k=1
FIGURE 8.2
Graphs of S1 , S2 , S3
we obtain
n−1
X 1 cos ak π(x + h) − cos ak πx
Sn (x + h) − Sn (x)
≤ (8)
h 2k h
k=0
n−1 n
π 1 − a2
X a k
≤π =
2 1 − a2
k=0
2π a n
< .
a−2 2
We now proceed to obtain a lower estimate on the term involving Rn . To
do so we write
a n x = k n + δn ,
where kn is an integer, and − 12 ≤ δn < 21 . Set
1 − δn
hn = .
an
Since − 21 ≤ δn < 21 , we have 3
2 ≥ 1 − δn > 21 . Therefore,
2 n 1
a ≤ < 2 an . (9)
3 hn
For k ≥ n,
Therefore,
As a consequence,
∞
Rn (x + hn ) − Rn (x) X 1 cos ak π(x + hn ) − cos ak πx
=
hn 2k hn
k=n
∞
X (−1)kn +1 [1 + cos ak−n δn π]
=
hn 2k
k=n
∞
1 X 1 + cos ak−n δn π 1 1 + cos δn π
= ≥ .
hn 2k hn 2n
k=n
Rn (x + hn ) − Rn (x) 1 1 2 a n
≥ ≥ . (10)
hn hn 2n 3 2
2 a n 2π a n
≥ −
3 2 a−2 2
a n 2 2π
= − .
2 3 a−2
f (x + hn ) − f (x)
→∞ as n → ∞,
hn
Remark. The above proof is based on the proof of a more general result given
Sequences and Series of Functions 369
Exercises 8.5
1. For n ∈ N, set fn (x) = xn /n, x ∈ [0, 1]. Prove that the sequence {fn }
converges uniformly to f (x) = 0 on [0, 1], that the sequence {fn′ (x)}
converges pointwise on [0, 1], but that {fn′ (1)} does not converge to f ′ (1).
2. *Let {fn } be a sequence of differentiable functions on [a, b] for which
fn′ is continuous on [a, b] for all n ∈ N. If {fn′ } converges uniformly on
[a, b], and {fn (xo )} converges for some xo ∈ [a, b], use the fundamental
theorem of calculus to prove that {fn } converges uniformly to a function
f on [a, b] and that f ′ (x) = lim fn′ (x) for all x ∈ [a, b].
n→∞
3. Let {ak }∞
P
k=0 be a sequence of real numbers satisfying k|ak | < ∞. Show
∞
P k
that the series ak x converges uniformly to a function f on |x| ≤ 1
k=0
and that
∞
f ′ (x) = kak xk−1
P
k=1
FIGURE 8.3
Graph of a periodic function
The canonical examples of periodic functions are the functions sin x and
cos x, both of which are periodic of period 2π. The graph of a periodic function
of period p is illustrated in Figure 8.3. The graphs of a periodic function of
period p on any two successive intervals of length p are identical. It is clear
that if f is periodic of period p, then
Proof. Exercise 2.
Approximate Identities
for every δ > 0. This seems to indicate that the functions are concentrated
near 0 and must become very large near 0 (see Exercise 6). The graphs of the
first few functions Q1 , Q2 , and Q3 of a typical approximate identity {Qn } are
given in Figure 8.4.
THEOREM 8.6.5 Let {Qn } be an approximate identity on [−1, 1], and let f
be a bounded real-valued periodic function on R of period 2 with f ∈ R[−1, 1].
For n ∈ N, x ∈ R, define
Z 1
Sn (x) = f (x + t)Qn (t) dt. (11)
−1
If f is continuous at x ∈ R, then
Proof. We first note that since f is periodic and integrable on [−1, 1], f is
integrable on every finite subinterval of R. Thus the integral in (11) is defined
Sequences and Series of Functions 373
FIGURE 8.4
Graphs of Q1 , Q2 , Q3
for all x ∈ R. Also, since f is bounded, there exists a constant M > 0 such
that |f (x)| ≤ M for all x ∈ R.
Suppose first that f is continuous at x ∈ R. By (a) of Definition 8.6.4
Z 1
f (x) = f (x)Qn (t) dt.
−1
Therefore,
Z 1
|Sn (x) − f (x)| = [f (x + t) − f (x)]Qn (t)dt
−1
Z 1
≤ |f (x + t) − f (x)|Qn (t) dt. (12)
−1
Let ǫ > 0 be given. Since f is continuous at x, there exists a δ > 0 such that
ǫ
|f (x + t) − f (x)| <
2
for all t, |t| < δ. Therefore
δ δ 1
ǫ ǫ ǫ
Z Z Z
|f (x + t) − f (x)|Qn (t) dt < Qn (t) dt ≤ Qn (t) dt = (13)
−δ 2 −δ 2 −1 2
for all x ∈ R and all n ≥ no . This proves that the sequence {Sn } converges
uniformly to f on R.
f (0) = f (1) = 0.
f (x) = f (x − 2k), k ∈ Z,
Qn (t) = cn (1 − t2 )n ,
Thus the sequence {Qn } satisfies hypothesis (a) of Definition 8.6.4. To show
that it also satisfies (b) we need an estimate on the magnitude of cn . Since
Z 1 Z 1
2 n
1 = cn (1 − t ) dt = 2cn (1 − t2 )n dt
−1 0
√
Z 1/ n
≥ 2cn (1 − t2 )n dt
0
√
1/ n
1 1
Z
2
≥ 2cn (1 − nt ) dt = 2cn √ − √
0 n 3 n
4cn
= √ ,
3 n
we obtain √ √
3
cn ≤ 4 n< n.
In the above we have used the inequality (1 − t2 )n ≥ 1 − nt2 valid for all
t ∈ [0, 1] (Example 1.3.2(b)). Finally, for any δ, 0 < δ < 1,
√
Qn (t) = cn (1 − t2 )n ≤ n(1 − δ 2 )n , for all t, δ ≤ |t| ≤ 1.
This is the function Sn (x) of Theorem 8.6.5 except restricted to x ∈ [0, 1]. Let
x ∈ [0, 1]. Since f (t) = 0 for t ∈ [−1, 0] ∪ [1, 2],
Z 1 Z 1−x
f (x + t)Qn (t) dt = f (x + t)Qn (t) dt,
−1 −x
Therefore Pn (x), for x ∈ [0, 1], is a polynomial of degree less than or equal to
2n. As a consequence of Theorem 8.6.5,
A very interesting result, whose proof is beyond the scope of the text, is the
Müntz-Szasz Theorem3 as follows: The set Pn is dense in C([0, 1]) if and
only if
∞
X 1
= ∞.
nk
k=1
Hence, the set of all polynomials with even exponents is dense, whereas the
set of all polynomials of the form
2 2 2
P (x) = a0 + a1 x2 + a2 x2 + a3 x3 + · · · + an xn
is not.
3 A proof of the Müntz-Szasz theorem may by found in the following text by Walter
Exercises 8.6
1. If f : R → R is periodic of period 2 and continuous on [−1, 1], prove that
f is uniformly continuous on R.
2. *Prove Theorem 8.6.3.
3. For n ∈ N, define Qn on [−1, 1] as follows:
(
n
, − n1 ≤ x ≤ n1 ,
Qn (x) = 2 1
.
0, n
< |x| ≤ 1.
Show that {Qn } is an approximate identity on [−1, 1].
4. For n ∈ N, set Qn (x) = cn (1 − |x|)n , x ∈ [−1, 1].
R1
*a. Determine cn > 0 so that Qn (t)dt = 1.
−1
b. Prove that with the above choice of cn the sequence {Qn } is an ap-
proximate identity on [−1, 1].
6. *If {Qn } is an approximate identity on [−1, 1], prove that
lim sup{Qn (x) : x ∈ [−δ, δ]} = ∞
n→∞
RProve that f (x) = 0 for all x ∈ [0, 1]. (Hint: First show that
1
0
f (x)P (x)dx = 0 for every polynomial P , then use the Weierstrass
R1
theorem to show that 0 f 2 (x)dx = 0.)
nature, power series possess certain properties which are not valid for series
of functions. We begin with the following definition.
DEFINITION 8.7.1 Let {ak }∞ k=0 be a sequence of real numbers, and let
c ∈ R. A series of the form
∞
X
ak (x − c)k = a0 + a1 (x − c) + a2 (x − c)2 + a3 (x − c)3 + · · ·
k=0
1 |ak+1 |
= lim .
R k→∞ |ak |
Proof. Statements (a) and (b) were proved in the discussion preceding the
statement of the theorem. Suppose 0 < ρ < R. Choose β such that ρ < β < R.
Since
p 1 1
lim k |ak | = < ,
k→∞ R β
there exists no ∈ N such that
p
k 1
|ak | < for all k ≥ no .
β
But (ρ/β) < 1 and thus (ρ/β)k < ∞. Therefore by the Weierstrass M-test,
P
the series converges uniformly on |x − c| ≤ ρ.
The previous theorem provides no suggestion as to what happens when
|x − c| = R. As the following examples (with c = 0) illustrate, the series may
either converge or diverge when |x| = R.
∞
xk has radius of convergence R = 1.
P
EXAMPLES 8.7.4 (a) The series
k=0
This series diverges at both x = 1 and −1.
∞ xk
P
(b) The series also has radius of convergence R = 1. In this case,
k=1 k
when x = 1 the series diverges; whereas when x = −1, the series is an alter-
nating series that converges by Theorem 7.2.3.
380 Introduction to Real Analysis
∞ xk
P
(c) Consider the series 2
. Again the radius of convergence is R = 1.
k=1 k
In this example, the series converges at both x = 1 and −1.
(d) Consider the series
∞
X
1 + 2x + 32 x2 + 23 x3 + 34 x4 + · · · = ak xk
k=0
where (
3k , if k is even,
ak =
2k , if k is odd.
p
Hence lim k |ak | = 3, and therefore R = 1/3. The series diverges at both
x = 1/3 and x = −1/3.
k!xk . Here ak = k!, and
P
(e) Finally, consider the series
ak+1
lim = lim (k + 1) = ∞.
k→∞ ak k→∞
p
Thus by Theorem 7.1.10 k |ak | → ∞, and R = 0. Therefore the power series
converges only for x = 0.
Abel’s Theorem
ak (x − c)k with radius of convergence
P
Suppose we are given a power series
R > 0. By setting
X∞
f (x) = ak (x − c)k , (14)
k=0
∞
X
lim f (x) = ak .
x→1−
k=0
Sequences and Series of Functions 381
n
P
Proof. Set s−1 = 0, and for n = 0, 1, 2, ... let sn = ak . Then by the partial
k=0
summation formula (7.2.1)
n
X n−1
X
ak xk = sk (xk − xk+1 ) + sn xn
k=0 k=0
n−1
X
= (1 − x) s k xk + s n xn .
k=0
Since the sequence {sn } converges, if we let n → ∞, then for all x, |x| < 1,
∞
X
f (x) = (1 − x) s k xk .
k=0
Let s = lim sn , and let ǫ > 0 be given. Choose no ∈ N such that |s−sn | <
n→∞
ǫ/2 for all n ≥ no . Since
∞
X
(1 − x) xk = 1, |x| < 1,
k=0
The obvious question to ask is, what is the radius of convergence of the dif-
∞
ak (x − c)k ,
P
ferentiated series (15)? Furthermore, if f is defined by f (x) =
k=0
|x − c| < R, does the series (15) converge to f ′ (x)? The answers to both of
these questions are provided by the following theorem.
∞
ak (x − c)k has radius of convergence R > 0,
P
THEOREM 8.7.7 Suppose
k=0
and
∞
X
f (x) = ak (x − c)k , |x − c| < R.
k=0
Then
∞
k ak (x − c)k−1
P
(a) has radius of convergence R, and
k=1
∞
(b) f ′ (x) = k ak (x − c)k−1 ,
P
for all x, |x − c| < R.
k=1
|x|
q
lim k |x|k−1 = lim p = |x|.
k→∞ k→∞ k
|x|
p
Therefore lim k
k|x|k−1 = |x|. By Exercise 10 of Section 3.5, for x 6= 0,
k→∞
q p q
k k
lim |ak | k |x|k−1 = lim k
|ak | lim k |x|k−1
k→∞ k→∞ k→∞
pk
= |x| lim |ak |.
k→∞
Sequences and Series of Functions 383
∞
ak (x − c)k has radius of convergence R >
P
COROLLARY 8.7.8 Suppose
k=0
0, and
∞
X
f (x) = ak (x − c)k , |x − c| < R.
k=0
In particular,
f (n) (c) = n! an . (17)
Proof. Clearly, if ak = bk for all k, then the two power series are equal and
converge to the same function. Conversely, set
∞
X ∞
X
f (x) = ak (x − c)k and g(x) = bk (x − c)k .
k=0 k=0
If f (x) = g(x) for all x, |x−c| < R, then f (n) (x) = g (n) (x) for all n = 0, 1, 2, ...,
and all x, |x − c| < R. In particular, f (n) (c) = g (n) (c) for all n = 0, 1, 2, ....
Thus by (17), an = bn for all n.
for all x, |x − c| < ǫ, with ak = f (k) (c) k! for all k = 0, 1, 2, ... The following
2
2 e−1/x
f ′ (x) = .
x3
When x = 0, we have
2
′ f (h) − f (0) e−1/h t
f (0) = lim = lim = lim t2 = 0.
h→0 h h→0 h t→∞ e
where P is a polynomial of degree 3n. The details are left to the exercises
(Exercise 16). Thus the function f is infinitely differentiable on R. If there
∞
ak xk for all x, |x| < R, then ak = 0 for all
P
exists R > 0 such that f (x) =
k=0
k. As a consequence, f cannot be presented by a power series which converges
to f in a neighborhood of 0.
386 Introduction to Real Analysis
For the special case c = 0, the Taylor series of a function f is often referred
to as the Maclaurin series. The first three Taylor polynomials T0 , T1 , T2 ,
are given specifically by
for all k = 0, 1, ..., n. Since f (n) (c) might possibly be zero, Tn (as the next
example shows) could very well be a polynomial of degree strictly less than n.
Sequences and Series of Functions 387
Thus
1
T3 (f, π2 )(x) = 1 − (x − π2 )2 ,
2!
which is a polynomial of degree 2. In general, if n is odd, f (n) ( π2 ) = 0, and if
n = 2k is even, f (2k) ( π2 ) = (−1)k . Therefore, if n is even,
n/2
X (−1)k
Tn (f, π2 )(x) = Tn+1 (f, π2 ) = (x − π2 )2k .
(2k)!
k=0
π
The Taylor expansion of f (x) = sin x about c = 2 is given by
∞
X (−1)k
(x − π2 )2k .
(2k)!
k=0
2
(b) For the function f (x) = e−1/x , by Example 8.7.12
for all x, |x − 2| < 2. By uniqueness, the given series must be the Taylor series
of f (x) = 1/x. In this instance, the power series also converges to the function
f (x) for all x satisfying |x − 2| < 2.
Remainder Estimates
To investigate when the Taylor series of a function f converges to f (x), we
consider
Rn (x) = Rn (f, c)(x) = f (x) − Tn (f, c)(x). (18)
The function Rn is called the remainder or error function between f and
Tn (f, c). Clearly,
Since the Taylor polynomial Tn is the nth partial sum of the Taylor series of f ,
the Taylor series converges to f at a point x if and only if lim Rn (f, c)(x) = 0.
n→∞
To emphasize this fact, we state it as a theorem.
The formula
f ′′ (c)
f (x) = f (c) + f ′ (c)(x − c) + (x − c)2 + · · ·
2!
f (n) (c)
+ (x − c)n + Rn (f, x)(x)
n!
is known as Taylor’s formula with remainder. We now proceed to derive several
formulas for the remainder term Rn . These can be used to show convergence
of Tn to f .
Sequences and Series of Functions 389
f (n+1) (ζ)
Rn (x) = Rn (f, c)(x) = (x − c)n+1 . (19)
(n + 1)!
To prove the result we need to show that (n + 1)! M = f (n+1) (ζ) for some
ζ between x and c. To accomplish this, set
Thus g ′ (x1 ) = 0 for some x1 , c < x1 < x. Since g ′ (c) = 0, by the mean value
theorem applied to g ′ on the interval [c, x1 ], g ′′ (x2 ) = 0 for some x2 , c < x2 <
x1 . Continuing in this manner we obtain a point xn satisfying c < xn < x,
such that g (n) (xn ) = 0. Applying the mean value theorem one more time to
the function g (n) on the interval [c, xn ], we obtain the existence of a ζ ∈ (c, xn )
such that
0 = g (n) (xn ) − g (n) (c) = g (n+1) (ζ)(xn − c).
Thus g (n+1) (ζ) = 0; i.e., f (n+1) (ζ) − (n + 1)! M = 0, for some ζ between x
and c.
390 Introduction to Real Analysis
In 8.7.20 we will give several examples to show how the remainder estimates
may be used to prove convergence of the Taylor series to its defining function.
In the following example we show how the previous theorem may be used to
derive simple estimates and inequalities.
f (3) (ζ) 3 1
R2 (f, 0)(x) = x = (1 + ζ)−5/2 x3
3! 16
for some ζ between 0 and x. If x > 0, then ζ > 0 and in this case (1 + ζ)−5/2 <
1. Therefore we have
√ 1 3
| 1 + x − T2 (f, 0)(x)| < x
16
for
√ any x > 0. If we let x = 0.4, then T2 (f, 0)(.4) = 1.18, and by the above
| 1.4 − 1.18| < 0.004, √
so that two decimal place accuracy is assured. In fact,
to five decimal places 1.4 = 1.18322.
(b) The error estimates can also be used to derive inequalities. As in the
previous example,
√
1 + x = 1 + 21 x − 18 x2 + R2 (f, x).
1 3
For x > 0 we have 0 < R2 (f, 0)(x) < 16 x . Thus
1 1 √ 1 1 1
1 + x − x2 < 1 + x < 1 + x − x2 + x3
2 8 2 8 16
for all x > 0.
f (n+1) (ζ)
Rn (x) = Rn (f, c)(x) = (x − c) (x − ζ)n . (21)
n!
Proof. Since f (n+1) (t)(x−t)n is continuous on the interval from c to x, by the
mean value theorem for integrals (Theorem 6.3.6), there exists a ζ between c
and x such that
Z x
f (n+1) (t)(x − t)n dt = (x − c)f (n+1) (ζ) (x − ζ)n .
c
EXAMPLES 8.7.20 (a) As our first example we prove the binomial the-
orem (Theorem 3.2.5). For n ∈ N let f (x) = (1 + x)n , x ∈ R. Since f is a
polynomial of degree n, if k > n then f (k) (x) = 0 for all x ∈ R. Therefore by
Theorem 8.7.16,
n
X f (k) (0) k
f (x) = x .
k!
k=0
(k)
By computation, f (0) = n!/(n − k)! for k = 0, 1, ..., n. Therefore
n
X n!
(1 + x)n = xk .
k!(n − k)!
k=0
Thus f (n) (0) = 0 for all even n ∈ N, and f (n) (0) = (−1)k , whenever n =
2k + 1, k = 0, 1, 2, .... Therefore the Taylor series of f at c = 0 is given by
∞
X (−1)k 2k+1
x .
(2k + 1)!
k=0
|x|n+1
|Rn (x)| ≤ .
(n + 1)!
∞
X (−1)k 2k+1
sin x = x , x ∈ R.
(2k + 1)!
k=0
The sine function, as well as the cosine function, can be defined strictly in
terms of power series. For further details, the reader is encouraged to look at
Miscellaneous Exercise 3.
(c) As our third example we derive the Taylor series for f (x) = ln(1 + x),
where as in Example 6.3.5
Z x
1
ln x = dt, x > 0,
1 t
denotes the natural logarithm function on (0, ∞). Then f (0) = ln(1) = 0, and
by the fundamental theorem of calculus f ′ (x) = 1/(1+x). Thus for n = 1, 2, ...,
(n − 1)!
f (n) (x) = (−1)n+1 .
(1 + x)n
Although we have already proved that this series converges to ln(1 + x) for all
x, −1 < x ≤ 1 (Example 8.7.6), we will prove this again to illustrate the use
of the remainder formulas.
394 Introduction to Real Analysis
(−1)n+2 xn+1
Rn (x) = Rn (f, 0)(x) =
(n + 1)(1 + ζ)n+1
We next consider the more difficult case −1 < x < 0. By the Cauchy form
of the remainder, if −1 < x < 0, there exists ζ, x ≤ ζ ≤ 0, such that
1 (n+1) (x − ζ)n x
Rn (x) = f (ζ)(x − ζ)n x = (−1)n+2 .
n! (1 + ζ)n+1
Therefore, n
|x| (ζ − x)
|Rn (x)| ≤ .
(1 + ζ) 1 + ζ
Consider the function ϕ(t) defined on [x, 0] by
t−x
ϕ(t) = .
1+t
Then ϕ′ (t) = (1 + x)/(1 + t)2 which is positive on [x, 0] provided x > −1.
Therefore
ϕ(t) ≤ ϕ(0) = −x = |x|
for all t, x ≤ t ≤ 0. Thus
|x|
|Rn (x)| ≤ |x|n .
(1 + ζ)
Since |x| < 1, lim Rn (x) = 0. Therefore the Taylor series converges to
n→∞
ln(1 + x) for all x, −1 < x ≤ 1; i.e.,
∞
X (−1)n+1 n
ln(1 + x) = x , −1 < x ≤ 1.
n=1
n
which again has only radius of convergence R = 1. The reason for this is that
even though g is well behaved on R, if we extend g to the complex plane C
by
1
g(z) = ,
1 + z2
then g is not defined when z 2 = −1; i.e., z = ±i, where i is the complex
number that satisfies i2 = −1.
Exercises 8.7
1. Find the radius of convergence of each of the following power series:
∞ 3k ∞ 1
xk (x + 1)2k
P P
a. 3
*b. k
k=1 k k=0 4
∞
k ∞
k 2
1 1
xk *d. xk
P P
c. 1− 1−
k=1 k k=1 k
1
∞ x k ∞ , when k is even,
k
ak xk where ak = 2 1
P P
e. k *f.
k=1 2 k=0 , when k is odd.
2k+2
2. For each of the following, determine all values of x for which the given
series converges:
k
∞ k ∞ 3 k xk ∞
P P P 1+x
a. k
, (x 6
= 0) *b. k k
, (x 6
= 1) c. , (x 6= 1)
k=0 x k=1 2 (1 − x) k=0 1−x
3. Using the power series expansion of 1 (1 − x) and its derivatives, find
∞ ∞ ∞ k
kxk , |x| < 1 b. k2 xk , |x| < 1. c.
P P P
*a. k
.
k=1 k=1 k=1 2
Prove that
∞
X ak
F (x) = (x − c)k+1 , |x − c| < R.
k+1
k=0
d 1
8. *a. Use the previous exercise and the fact that arctan x = to
dx 1 + x2
obtain the Taylor series expansion of arctan x about c = 0.
b. Use part (a) to obtain a series expansion for π.
*c. How large must n be chosen so that the nth partial sum of the series
in (b) provides an approximation of π correct to four decimal places?
P∞ (−1)k π
9. Use Exercise 8 and Abel’s theorem to prove that = .
k=0 2k + 1 4
10. *Find constants a0 , a1 , a2 , a3 , and a4 such that
x4 + 3x2 − 2x + 5 = a4 (x − 1)4 + a3 (x − 1)3 + a2 (x − 1)2 + a1 (x − 1) + a0 .
11. Prove that the Taylor series of ex (with c = 0) converges to ex for all
x ∈ R.
12. Using any applicable method, find the Taylor series of each of the follow-
ing functions at the indicated point, and specify the interval on which
the series converges to the function
a. f (x) = cosx, c = 0. *b. f (x) = ln x, c = 1
1+x
c. f (x) = ln , c = 0 *d. f (x) = (1 − x)−1/2 , c = 0
1−x
x2
e. f (x) = , c=0 *f. f (x) = arcsin x, c = 0
√1 − x2
g. f (x) = x, c = 1 *h. f (x) = (1 + x)p , c = 0, p real
∞
ak xk , |x| < R, where R > 0. Prove the following:
P
13. Suppose f (x) =
k=0
DEFINITION 8.8.1 For 0 < x < ∞, the Gamma function Γ(x) is de-
fined by Z ∞
Γ(x) = tx−1 e−t dt. (22)
0
When 0 < x < 1, the integral in (22) is an improper integral not only at ∞,
but also at 0. The convergence of the improper integral defining Γ(x), x > 0,
was given as Exercise 9 in Section 6.4. The Graph of Γ(x) for 0 < x < 5 is
given in Figure 8.5. The following properties of the Gamma function show
that it is closely related to factorials.
x ′ −t
With u = t and v = e ,
Z R R Z R
x −t x −t
t e dt = − t e +x tx−1 e−t dt
c c c
R
Rx
Z
=− + cx e−c + x tx−1 e−t dt.
eR c
Sequences and Series of Functions 399
FIGURE 8.5
Graph of Γ(x), 0 < x < 5
Since lim cx e−c = 0 and lim Rx e−R = 0, taking the appropriate limits in
c→0+ R→∞
the above yields
Z ∞ Z ∞
Γ(x + 1) = tx e−t dt = x tx−1 e−t dt = x Γ(x).
0 0
This proves (a). For the proof of (b) we first note that
Z ∞
Γ(1) = e−t dt = 1.
0
x = r cos θ, y = r sin θ,
Therefore, √
∞
π
Z
−x2
e dx = ,
0 2
from which the result follows.
We will now prove that this formula is still valid for all α ∈ R with α > 0.
Proof. We first show that the radius of convergence of the series (23) is R = 1.
Set an = Γ(n + α)/n!. Then
an+1 Γ(n + 1 + α) n!
= .
an (n + 1)! Γ(n + α)
But by Theorem 8.8.2, Γ(n + 1 + α) = (n + α)Γ(n + α). Therefore,
an+1 n+α
lim = lim = 1,
n→∞ an n→∞ n + 1
Multiplying by (1 − x) gives
∞
1 X n Γ(n + α)
(1 − x) fα (x) = (1 − x) xn−1
Γ(α) n=1 n!
"∞ ∞
#
1 X Γ(n + α) X n Γ(n + α)
= xn−1 − xn
Γ(α) n=1 (n − 1)! n=1
n!
∞
1 X Γ(n + 1 + α) n Γ(n + α)
= − xn .
Γ(α) n=0 n! n!
As a consequence,
d
[(1 − x)α fα (x)] = −α (1 − x)α−1 fα (x) + (1 − x)α fα′ (x)
dx
= −α (1 − x)α−1 fα (x) + α (1 − x)α−1 fα (x) = 0.
Therefore (1−x)α fα (x) is equal to a constant for all x, |x| < 1. But fα (0) = 1.
Thus (1 − x)α fα (x) = 1; that is
fα (x) = (1 − x)−α ,
The function
Γ(x)Γ(y)
B(x, y) = , x, y > 0,
Γ(x + y)
is called the Beta function.
Exercises 8.8
1. *a. Compute Γ( 23 ), Γ( 72 ).
√
(2n)! π
b. Prove that for n ∈ N, Γ(n + 21 ) = .
4n n!
2. *By making a change of variable, prove that
Z 1 x−1
1
Γ(x) = ln dt, 0 < x < ∞.
0 t
3. Evaluate each of the following definite integrals:
Z ∞ Z 1 n
3 1
*a. e−t t 2 dt b. ln dt, n ∈ N.
0 0 t
4. By making the change of variable t = sin2 u in Theorem 8.8.5, prove that
Z π
2 1 Γ(n)Γ(m)
(sin u)2n−1 (cos u)2m−1 du = , n, m > 0.
0 2 Γ(n + m)
5. Evaluate each of the following integrals:
Z π Z π
2 2
*a. (sin x)2n dx, n ∈ N. b. (sin x)2n+1 dx, n ∈ N.
0 0
6. Use the binomial series and term-by-term integration to find the power
series expansion of
Z x
arcsin x = (1 − t2 )−1/2 dt.
0
α
7. Let α > 0. Set = 1 and for k ∈ N set
0
α α(α − 1)(α − 2) · · · (α − k + 1)
= .
k k!
Sequences and Series of Functions 403
Note, if m ∈ N,
m m! m
= , k ≤ m, and = 0 for k > m.
k k!(m − k)! k
∞
α
xk converges uniformly and absolutely
P
a. Prove that the series
k=0 k
for x ∈ [−1, 1].
∞
α
xk = (1 + x)α , x ∈ [−1, 1].
P
b. Prove that
k=0 k
Notes
Without question the most important concept of this chapter is that of uniform
convergence of a sequence or series of functions. It is the additional hypothesis
required in proving that the limit function of a sequence of continuous or integrable
functions is again continuous or integrable. As was shown by numerous examples,
pointwise convergence is not sufficient. For differentiation, uniform convergence of
{fn } is not sufficient; what is also required is uniform convergence of the sequence
of derivatives {fn′ }.
The example of Weierstrass (Example 8.5.3) is interesting for several reasons.
First, it provides an example of a continuous function which is nowhere differentiable
on R. Furthermore, it also provides an example of a sequence of infinitely differen-
tiable functions which converges uniformly on R, but for which the limit function is
nowhere differentiable. Exercise 7 of Section 8.5 provides another construction of a
continuous function f that is nowhere differentiable. Although this construction is
much easier, the partial sums of the series defining the function f are themselves not
differentiable everywhere. Thus it is not so surprising that f itself is not differentiable
anywhere on R.
The proof of the Weiserstrass approximation theorem presented in the text is
only one of the many poofs available. A constructive proof by S. N. Bernstein using
the so-called Bernstein polynomials can be found on p. 107 of the text by Natanson
listed in the Bilbiography. The proof in the text using approximate identities was
chosen because the technique involved is very important in analysis and will be en-
countered later in the text. In Theorem 9.4.5 we will prove a variation of the Weier-
strass approximation theorem. At that point we will show that every continuous
real-valued function on [−π, π] with f (−π) = f (π) can be uniformly approximated
to within a given ǫ by a finite sum of a trigonometric series.
404 Introduction to Real Analysis
Miscellaneous Exercises
1. Using Miscellaneous Exercise 1 of Chapter 6 and the Weierstrass approx-
imation theorem, prove the following: If f ∈ R[a, b] and ǫ > 0 is given,
then there exists a polynomial P such that
Z b
|f − P | < ǫ.
a
2. Define f on R by
−1
c exp , |x| < 1,
f (x) = 1 − x2 ,
0, |x| ≥ 1,
R∞
where exp(x) = ex , and c > 0 is chosen so that f (x) dx = 1. For
−∞
1 1
λ > 0, set fλ (x) = f ( x).
λ λ
a. Prove that fλ ∈ C ∞ (R) for all λ > 0.
R∞
b. Prove that fλ (x) = 0 for all x ∈ R, |x| ≥ λ, and that fλ (x)dx = 1.
−∞
R
c. Prove that for every δ > 0, lim fλ (t) dt = 0.
λ→0+ {δ≤|t|}
a. Show that the power series defining S and C converge for all x ∈ R.
b. Show that S ′ (x) = C(x) and C ′ (x) = −S(x), x ∈ R.
c. Show that S ′′ (x) = −S(x) and C ′′ (x) = −C(x).
d. Show that if f : R → R satisfies f ′′ (x) = −f (x) with f (0) = 0, f ′ (0) =
1, then f (x) = S(x) for all x ∈ R.
e. If f : R → R satisfies f ′′ (x) = −f (x), prove that there exist constants
c1 , c2 such that f (x) = c1 S(x) + c2 C(x).
f. Show that (S(x))2 + (C(x))2 = 1. (Hint: Consider the function f (x) =
(S(x))2 + (C(x))2 .)
g. Show that C(x+y) = C(x)C(y)−S(x)S(y) and S(x+y) = S(x)C(y)+
C(x)S(y) for all x, y ∈ R.
Sequences and Series of Functions 405
Supplemental Reading
where the an and bn are real numbers. As we will see, such series afford
much greater generality in the type of functions that can be represented as
opposed to Taylor series. The study of trigonometric series has its origins in
the monumental work of Joseph Fourier (1768–1830) on heat conduction in
solids. His 1807 presentation to the French Academy introduced a whole new
subject area in mathematics while at the same time providing very useful
techniques for solving physical problems.
Fourier’s work is the source of all modern methods in mathematical physics
involving boundary value problems and has been a source of new ideas in
mathematical analysis for the past two centuries. To see how greatly mathe-
matics has been influenced by the studies of Fourier one only needs to look
at the two volume work Trigonometric Series by A. Zygmund (Cambridge
University Press, 1968). In addition to trigonometric series, Fourier’s origi-
nal method of separation of variables leads very naturally to the study of
orthogonal functions and the representation of functions in terms of a series
of orthogonal functions. All of these have many applications in mathematical
physics and engineering.
Fourier initially claimed and tried to show, with no success, that the
Fourier series expansion of a function actually represented the function. Al-
though his claim is false, in view of the eighteenth century concept of a func-
tion this was not an unrealistic expectation. Fourier’s claim had an immediate
impact on nineteenth century mathematics. It caused mathematicians to re-
consider the definition of “function.” The question of what type of function has
a Fourier series expansion also led Riemann to the development of the theory
of the integral and the notion of an integrable function. The first substantial
progress on the convergence of a Fourier series to its defining function is due
to Dirichlet in 1829. Instead of trying to prove like Fourier that the Fourier
series always converged to its defining function, Dirichlet considered the more
restrictive problem of trying to find sufficient conditions on the function f for
which the Fourier series converges pointwise to the function.
407
408 Introduction to Real Analysis
rules of addition and scalar multiplication, R[a, b], the set of Riemann inte-
grable functions on [a, b], is also a vector space over R. If for f, g ∈ R[a, b] we
define Z b
hf, gi = f (x)g(x) dx,
a
then it is easily shown that h , i satisfies (a), (c), and (d) of the definition of
an inner product. It however does not satisfy (b). If a < b and c1 , ..., cn are a
finite number of points in [a, b], then the function
(
0, x 6= ci ,
f (x) =
1, x = ci ,
is in R[a, b] satisfying hf, f i = 0, but f is not the zero function. Thus tech-
nically h , i is not an inner product on R[a, b], a minor difficulty which can
easily be overcome by defining two Riemann integrable functions f and g to
be equal if f (x) = g(x) for all x ∈ [a, b] except on a set of measure zero. This
will be explored in greater detail in Chapter 10. Alternately, if we restrict
ourself to the subset C[a, b] of R[a, b], then hf, gi as defined above is an inner
product on C[a, b] (Exercise 11).
Orthogonal Functions
We now define orthogonality with respect to the above inner product on
R[a, b].
Z b
hφn , φm i = φn (x)φm (x) dx = 0, for all n 6= m.
a
EXAMPLES 9.1.2 (a) For our first example we consider the two functions
φ(x) = 1 and ψ(x) = x, x ∈ [−1, 1]. Since
Z 1 Z 1
φ(x)ψ(x)dx = x dx = 0,
−1 −1
for n 6= m,
Z π
1 π
Z
sin nx sin mx dx = [cos(n − m)x − cos(n + m)x] dx
−π 2 −π
π
1 sin(n − m)x sin(n + m)x
= − = 0.
2 (n − m) (n + m) −π
Thus the functions in the collection {cos nπxL } are all orthogonal on [−L, L] as
are the function sin nπx
L and cos mπx
L for all n, m ∈ N with n 6= m. For m = n
Z L L
2 nπx
sin nπx
L cos nπx
L dx = L
2nπ sin L = 0.
−L −L
nπx
This last identity shows that the functions sin and cos nπx
L L are also or-
thogonal on [−L, L] for all n ∈ N. Finally, since
Z L Z L
nπx
sin L dx = cos nπx
L dx = 0
−L −L
If in Example 9.1.2(b) we define φn (x) = √1π sin nx, then the sequence
{φn (x)}∞
n=1 satisfies
Z π (
0, when n 6= m,
φn (x)φm (x) dx =
−π 1, when n = m.
A natural question is, given a real-valued function f on [a, b], how must the
coefficients cn be chosen so that SN gives the best approximation to f on [a, b]?
In the Weierstrass approximation theorem we have already encountered one
form of approximation; namely uniform approximation or approximation in
the uniform norm. However, for the study of orthogonal functions there is
another type of norm approximation that turns out to be more useful.
If X is a vector space over R with inner product h , i, then there is a natural
norm on X associated with this inner product. If for x ∈ X we define
p
kxk = hx, xi,
Thus for f ∈ R[a, b], the natural problem to consider is how must the constants
cn be chosen in order to minimize the quantity
Z b
2
kf − SN k2 = [f (x) − SN (x)]2 dx ?
a
Also,
N N
!
Z b Z b X X Z b
2
SN (x) dx = SN (x) cn φn (x) dx = cn SN (x)φn (x) dx.
a a n=1 n=1 a
But
Z b N
X Z b
SN (x)φn (x) = ck φk (x)φn (x) dx,
a k=1 a
which by orthogonality,
Z b
= cn φ2n (x) dx.
a
Therefore,
Z b N
X Z b
2
SN (x) dx = c2n φ2n (x) dx.
a n=1 a
FIGURE 9.1
Graphs of f and S2
The coefficients cn occur only in the middle term. Since this term is nonneg-
ative, the right side is a minimum if and only if
Rb
f φn
cn = Ra b , n = 1, 2, ..., N.
φ 2
a n
and
R1
f (x)φ2 (x)dx 1
3 3
Z
−1
c2 = R1 = (x4 + x) dx = .
φ2 (x)dx 2 −1 5
−1 2
∞
X
f (x) ∼ cn φn (x). (6)
n=1
Remark. The notation “∼” in (6) only means that the coefficients {cn } in
the series are given by formula (5). Nothing is implied about convergence of
the series!
In this case, the series converges for all x, but clearly not to f (x) = 1.
Bessel’s Inequality
For each N ∈ N, let SN (x) denote the N th partial sum of the Fourier series
of f ; i.e.,
N
X
SN (x) = cn φn (x),
n=1
416 Introduction to Real Analysis
where the cn are the Fourier coefficients of f with respect to the sequence
{φn } of orthogonal functions on [a, b]. Then by identity (3) of Theorem 9.1.4,
Z b N
X Z b
0≤ f 2 (x) dx − c2n φ2n (x) dx.
a n=1 a
Therefore
N
X Z b Z b
c2n φ2n (x) dx ≤ f 2 (x) dx.
n=1 a a
X∞ Z b Z b
c2n φ2n (x) dx ≤ f 2 (x) dx.
n=1 a a
Rπ
In Example 9.1.8 with f (x) = 1, f 2 (x)dx = 2π, and cn = 0 for all
−π
n = 1, 2, ..... Thus it is clear that equality need not hold in Bessel’s inequality.
However, there is one consequence of Theorem 9.1.9 which will prove useful
later.
Rb
Proof. Since f ∈ R[a, b], f 2 (x) dx is finite. Thus by Bessel’s inequality, the
a
P 2 Rb 2
series cn a φn converges. As a consequence,
Z b
lim c2n φ2n (x) dx = 0.
n→∞ a
and thus
s Rb
b
f (x)φn (x) dx
Z
a
lim cn φ2n (x) dx = lim qR = 0.
n→∞ a n→∞ b 2
φ
a n
(x) dx
Fourier Series 417
Exercises 9.1
1. *Let f (x) = sin πx, φ1 (x) = 1, and φ2 (x) = x. Find c1 and c2 so that
S2 (x) = c1 φ1 (x) + c2 φ2 (x) gives the best approximation in the mean to
f on [−1, 1].
2. a. Show that the polynomials P0 (x) = 1, P1 (x) = x, and P2 (x) = 32 x2 − 12
are orthogonal on [−1, 1].
b. Let
(
0, −1 ≤ x < 0,
f (x) =
1, 0≤x≤1
Find the constants c0 , c1 , and c2 , such that S2 (x) = c0 P0 (x) + c1 P1 (x) +
c2 P2 (x) gives the best approximation in the mean to f on [−1, 1].
3. *a. Let φ0 (x) = 1, φ1 (x) = x − a1 , φ2 (x) = x2 − a2 x − a3 . Determine the
constants a1 , a2 , and a3 , so that {φ0 , φ1 , φ2 } are orthogonal on [0, 1].
*b. Find the polynomial of degree less than or equal to 2 that best
approximates f (x) = sin πx in the mean on [0, 1].
4. Let {φn }∞
n=1 be aPsequence of orthogonal
P functions on [a, b]. For f, g ∈
R[a, b] with f ∼ an φn and g ∼ bn φn , show that for α, β ∈ R,
∞
P
(αf + βg) ∼ (αan + βbn )φn .
n=1
b. For f ∈ R[0, π], show that the Fourier series of f with respect to the
sequence {sin nx} is given by
∞
P 2 Rπ
bn sin nx where bn = f (x) sin nx dx.
n=1 π 0
*c. Find the Fourier series of f (x) = x on [0, π] in terms of the orthogonal
sequence {sin nx}.
6. a. Show that the sequence {1, cos nx}∞
n=1 is orthogonal on [0, π].
b. For f ∈ R[0, π], show that the Fourier series of f with respect to the
above orthogonal sequence is given by
∞
1
P
a +
2 0
an cos nx
n=1
2 Rπ 2 Rπ
where a0 = f (x) dx and an = f (x) cos nx dx, n = 1, 2, ....
π0 π0
*c. Find the Fourier series of f (x) = x on [0, π] in terms of the orthogonal
sequence {1, cos nx}.
7. If f ∈ R[0, π], prove that
Rπ Rπ
lim f (x) sin nx dx = lim f (x) cos nx dx = 0.
n→∞ 0 n→∞ 0
9. Let {an } be a sequence in (0, 1) satisfying 0 < an+1 < an < 1 for all
n ∈ N. Define φn on [0, 1] by
0, 0 ≤ x < an+1 ,
2(x − a )
n+1
, an+1 ≤ x < 12 (an+1 + an )
an − an+1
φn (x) =
−2(x − an ) 1
, 2
(an+1 + an ) ≤ x ≤ an
an − an+1
0, an < x ≤ 1.
Show that {φn } is orthogonal in R[0, 1] and compute kφn k2 for each
n ∈ N.
10. Let P0 (x) = 1 and for n ∈ N let
1 dn
Pn (x) = n (1 − x2 )n , x ∈ [−1, 1].
2 n! dxn
The polynomials Pn are called the Legendre polynomials on [−1, 1].
a. Find P1 , P2 , and P3 .
b. Show that the sequence {Pn }∞ n=0 is orthogonal on [−1, 1]. (Hint: Use
repeated integration by parts.)
Rb
11. For f, g ∈ C[a, b] define hf, gi = a f (x)g(x)dx. Prove that h , i is an
inner product on C[a, b].
12. Let X be a vector space over R with inner product h , i. Prove each of
the following:
*a. |hx, yi| ≤ kxk kyk for all x, y ∈ X.
b. k k is a norm on X.
13. (Cauchy-Schwarz Inequality) Use the previous exercise to prove that
if f, g ∈ R[a, b], then
2
Rb
b b
R 2 R 2
f (x)g(x)dx ≤ f (x) dx g (x) dx .
a a a
14. Let X be a vector space over R with inner product h , i. If {y1 , ..., yn }
are non-zero orthogonal vectors in X and x ∈ X, prove that the quantity
kx − (c1 y1 + · · · cn yn )k is a minimum if and only if
hx, yi i
ci =
kyi k2
for all i = 1, ..., n. (Imitate the proof of Theorem 9.1.4)
Proof. Since the proof of this result is similar to the proof of Theorem 8.4.1,
we leave it as an exercise (Exercise 1).
The first four functions f1 , f2 , f3 , and f4 are given as follows: f1 (x) = 1, and
(
1, 0 ≤ x ≤ 21 ,
f2 (x) = 1
0, 2 < x ≤ 1,
(
0, 0 ≤ x < 12
f3 (x) = 1
1 2 ≤ x ≤ 1,
(
1, 0 ≤ x ≤ 14 ,
f4 (x) = 1
0, 4 < x ≤ 1.
From this it follows immediately that {SN } converges in the mean to f if and
only if Parseval’s equality holds.
Fourier Series 421
Proof. The hypothesis implies that the Fourier coefficients cn of f are zero
for all n ∈ N. Thus by Parseval’s equality,
Z b
f 2 (x) dx = 0.
a
For the orthogonal system {sin nx}∞ n=1 on [−π, π] and f (x) = 1, we have
cn = 0 for all n = 1, 2, .... Thus as a consequence of Theorem 9.2.6 the
orthogonal system {sin nx} is not complete on [−π, π]. However, as we will
see in Exercise 3 of Section 9.4, this system will be complete on [0, π].
422 Introduction to Real Analysis
for all n = 1, 2, ..., then f (x) = g(x) for all x ∈ [a, b]. The above assumption
simply means that f and g have the same Fourier coefficients. To prove the
result, apply Theorem 9.2.6 to h(x) = f (x) − g(x).
∞
X ∞
X
f (x) ∼ cn φn (x) and g(x) ∼ bn φn (x).
n=1 n=1
Then,
Z b ∞
X Z b
f (x)g(x) dx = c n bn φ2n (x) dx.
a n=1 a
Proof. Exercise 5.
Exercises 9.2
1. Prove Theorem 9.2.2.
2. For n ∈ N, define the function fn on [0, 1] by
√n,
1
0<x< ,
fn (x) = n
0, elsewhere.
Show that {fn } converges to 0 pointwise, but not in the mean.
3. Consider the orthogonal system {1, cos nπx
L
, sin nπx
L
}∞
n=1 on [−L, L].
a. Show that if f ∈ R[−L, L], then the Fourier series of f with respect
to the above orthogonal system is given by
∞
f (x) ∼ 21 ao + (an cos nπx + bn sin nπx
P
L L
),
n=1
where
1 RL nπx
an = f (x) cos dx, n = 0, 1, 2, ..., and
L −L L
1 RL nπx
bn = f (x) sin dx, n = 1, 2, ...
L −L L
b. Show that Bessel’s inequality becomes
1 2
∞ 1 RL 2
(a2n + b2n ) ≤
P
a
2 o
+ f (x) dx.
n=1 L −L
Fourier Series 423
4. *a. Assuming that the orthogonal system {sin nx} is complete on [0, π],
show that Parseval’s equality becomes
∞ 2 Rπ 2 Rπ
b2n = [f (x)]2 dx where bn =
P
f (x) sin nx dx.
n=1 π0 π0
*b. Use Parseval’s equality and the indicated function to find the sum of
the given series.
P∞ 1 P∞ 1
(i) 2
, f (x) = 1. (ii) 2
, f (x) = x.
k=1 (2k − 1) k=1 k
are nonnegative and decrease to zero, by Theorem 7.2.6 the first series con-
verges for all x ∈ R, whereas the second converges for all x ∈ R, except
x = 2pπ, p ∈ Z.
424 Introduction to Real Analysis
Fourier Series
For the orthogonal system {1, cos nx, sin nx}∞
n=1 on [−π, π] we have
Z π Z π Z π
sin2 nx dx = cos2 nx dx = π, and 12 dx = 2π,
−π −π −π
For the orthogonal system {1, cos nx, sin nx}, by Exercise 3(b) of the pre-
vious section, Bessel’s inequality of Theorem 9.1.9 becomes
∞
1 π
X Z
1 2 2 2
2 a0 + (an + bn ) ≤ [f (x)]2 dx. (Bessel’s Inequality)
n=1
π −π
Thus for f ∈ R[a, b] the sequences {an } and {bn } of Fourier coefficients of
f are square summable sequences. In Theorem 10.8.7 we will prove that if
{an } and {bn } are square summable sequences, then there exists a Lebesgue
integrable function f such that {an } and {bn } are the Fourier coefficients of
f.
Remark on Notation. In subsequent sections we will primarily be interested
in real-valued functions defined on all of R that are periodic of period 2π. As a
consequence, in the examples and exercises, rather than defining our functions
on [−π, π], we only define the function f on [−π, π) with the convention that
f (π) = f (−π). This then allows us to extend f to all of R as a 2π periodic
function according to the following definition.
Fourier Series 425
(
0, −π ≤ x < 0,
EXAMPLES 9.3.3 (a) Let f (x) = . Then
1, 0≤x<π
π π
1 1
Z Z
a0 = f (x) dx = 1 dx = 1,
π −π π 0
If SN (x) denotes the N th partial sum of the Fourier series, then S1 and S3
are given by
1 2 1 2 1
S1 (x) = + sin x and S3 (x) = + sin x + sin 3x .
2 π 2 π 3
The graph of f, S1 , S3 , S5 , and S15 are given in Figure 9.2.
(b) Let f (x) = x. Before we compute the Fourier coefficients, we will make
several observations which simplify this task. Recall that a function g(x) is
even on [−a, a] if g(−x) = g(x) for all x, and g(x) is odd if g(−x) = −g(x)
for all x. By Exercise 4 of Section 6.2, if g(x) is even on [−a, a], then
Z a Z a
g(x) dx = 2 g(x) dx,
−a 0
FIGURE 9.2
Graphs of f, S1 , S3 , S5 , and S15
Therefore
∞
X (−1)n+1
x∼2 sin nx.
n=1
n
Riemann-Lebesgue Lemma
There is one additional result from the general theory that will be needed later.
For the orthogonal system {1, cos nx, sin nx}, Corollary 9.1.10 is as follows:
Then π π nπ
sin nx sin x
Z Z Z
f (x) sin nx dx = dx = dx.
−π 0 x 0 x
Hence,
π nπ ∞
sin x sin x π
Z Z Z
lim f (x) sin nx dx = lim dx = dx = .2
n→∞ −π n→∞ 0 x 0 x 2
with {An } and {Bn } converging to zero, does there exist a function f on
[−π, π] such that the coefficients An and Bn are given by Definition 9.3.1?
As we will see, the answer is no! First however, in the positive direction, we
prove the following.
Since the series converges uniformly on [−π, π], and Sn is continuous for each
n,
f (x) = lim Sn (x)
n→∞
2 The value of π/2 for the improper integral is most easily obtained by contour integration
and the theory of residues of complex analysis. A real variables approach that computes
the value of this integral is given in the article by K. S. Williams listed in the supplemental
readings.
428 Introduction to Real Analysis
Since for each m, the sequence {Sn (x) cos mx} converges uniformly to
f (x) cos mx on [−π, π], the above interchange of limits and integration is valid
by Theorem 8.4.1. If n > m, then
Z π Z π n
X Z π
Sn (x) cos mx = 21 A0 cos mx dx + Ak cos kx cos mx dx
−π −π k=1 −π
n
X Z π
+ Bk sin kx cos mx dx.
k=1 −π
Thus by orthogonality,
Z π
Sn (x) cos mx dx = Am π.
−π
Letting n → ∞ gives
π
1
Z
Am = f (x) cos mx dx.
π −π
The analogous formula also holds for Bm , and thus the given series is the
Fourier series of f .
P P
Remarks. (a) If |Ak | and |Bk | both converge, then by the Weierstrass
M-test the series
∞
1 X
A0 + (Ak cos kx + Bk sin kx)
2
k=1
is continuous (in fact, measurable) then Ak and Bk are the Fourier coefficients
of the function f .3 (See also Miscellaneous Exercise 1.)
We now turn to the negative results. Consider the series
∞
X sin nx
, (7)
n=2
ln n
which by Theorem 7.2.6 converges for all x ∈ R. However, there does not exist
a Riemann integrable function f on [−π, π] such that
1 1 π
Z
= bn = f (x) sin nx dx.
ln n π −π
But, since f is Riemann integrable on [−π, π], so is f 2 , and thus the integral
is finite. On the other hand however,
∞
X 1
= ∞,
n=2
(ln n)2
3 Sur les series trigonometric, Annales Scientifiques de l’École Normale Supérieure, (3)
20 (1903), 453–485.
430 Introduction to Real Analysis
DEFINITION 9.3.7 For f ∈ R[0, π], the Fourier sine series of f is given
by
X∞
f (x) ∼ bn sin nx,
n=1
where
2 π
Z
bn = f (x) sin nx dx, n = 1, 2, ...
π 0
are the Fourier sine coefficients of f . Similarly, the Fourier cosine series
of f ∈ R[0, π] is given by
∞
X
f (x) ∼ 21 a0 + an cos nx,
n=1
where
π π
2 2
Z Z
a0 = f (x) dx and an = f (x) cos nx dx, n = 1, 2, ...
π 0 π 0
There is a simple connection between Fourier series and the Fourier sine
and cosine series. As in Example 9.3.3(b), we first note that if f is an even
function on [−π, π], then by Exercise 1,
∞
2 π
X Z
1
f (x) ∼ 2 a0 + an cos nx, where an = f (x) cos nx dx
n=1
π 0
Thus the coefficients {an } and {bn } depend only on the values of the function
f on [0, π]. Conversely, given a function f on [0, π), the following definition
extends f both as an even and an odd function to the interval (−π, π):
It is easily seen that the functions fe (x) and fo (x) are even and odd re-
spectively on (−π, π), and agree with the given function f on (0, π). From
the above discussion it is easily seen that if f ∈ R[0, π], then the Fourier sine
series of f is equal to the Fourier series of fo , and the Fourier cosine series of
f is equal to the Fourier series of fe (Exercise 2).
Remark. In the definition of the even and odd extension we only assumed
that f was defined on [0, π), and then defined fe and fo on (−π, π). If
exists, then for the even extension we define fe (−π) = f (π−). Thus fe is
now defined on [−π, π) and hence can be extended to all of R as a 2π-periodic
function. For the odd extension fo , we set fo (−π) = −f (π−), thereby defining
fo on [−π, π).
Exercises 9.3
1. Let f ∈ R[−π, π]. Prove the following:
*a. If f is even on [−π, π], then the Fourier series of f is given by
∞ 2 Rπ
f (x) ∼ 21 a0 +
P
an cos nx, where an = f (x) cos nx dx,
n=1 π 0
n = 0, 1, 2, ...
b. If f is odd on [−π, π], then the Fourier series of f is given by
P∞ 2 Rπ
f (x) ∼ bn sin nx where bn = f (x) sin nx dx, n = 1, 2, ...
n=1 π0
2. Let f ∈ R[0, π]. Prove the following:
a. The Fourier sine series of f is equal to the Fourier series of the odd
extension fo of f .
b. The Fourier cosine series of f is equal to the Fourier series of the even
extension fe of f .
3. Find the Fourier
( series of each of the following functions
( f on [−π, π).
−1, −π ≤ x < 0, 0, −π ≤ x ≤ 0,
*a. f (x) = . b. f (x) =
1, 0 ≤ x < π. x, 0 < x < π.
*c. f (x) = |x|. d. f (x) = x2 .
*e. f (x) = 1 + x .
−π ≤ x < − π2 ,
0,
− π2 ≤ x < 0,
−1,
f. f (x) =
1, 0 ≤ x < π2 ,
π
0, ≤ x < π.
2
4. On the interval [−2π, 2π] sketch the graph of the 2π-periodic extension
of each of the functions in Exercise 3.
5. Find the Fourier sine and cosine series on [0, π) of each of the following
functions.
432 Introduction to Real Analysis
11. Using the previous exercise, show that each of the following hold.
Rπ Rπ
a. lim ln x sin nx dx = lim ln x cos nx dx = 0.
n→∞ 0 n→∞ 0
Rπ Rπ
b. lim xα sin nx dx = lim xα cos nx dx = 0 for all α > −1.
n→∞ 0 n→∞ 0
1 π
Z
Sn (x) = f (t)Dn (x − t) dt,
π −π
π
1
Z
Sn (x) = f (t)Dn (x − t) dt.
π −π
To conclude the proof, it remains to derive the formula for Dn (s). If s = 2pπ,
p ∈ Z, then
n
1 X 1
Dn (2pπ) = + 1=n+ .
2 2
k=1
FIGURE 9.3
Graphs of D1 , D3 , and D5
the sum defining Dn (s) term by term. The only nonzero term will be the
integral involving 1/2, and thus
1 π 1 π 1
Z Z
Dn (t) dt = dt = 1.
π −π π −π 2
The graphs of D1 , D3 , and D5 are illustrated in Figure 9.3.
However, it is possible that the sequence {Sn (x)} diverges for a particular x,
whereas the sequence {σn (x)} may converge.
n
LEMMA 9.4.2 For n ∈ N, let Sn (x) = 12 a0 +
P
ak cos ks + bk sin kx. Then
k=1
n
1 X j
(a) σn (x) = a0 + 1− (aj cos jx + bj sin jx), and
2 j=1
n+1
Z π Z π
(b) [f (x) − Sn (x)]2 dx ≤ [f (x) − σn (x)]2 dx.
−π −π
436 Introduction to Real Analysis
FIGURE 9.4
Graphs of F1 , F3 , and F5
Proof. The proof of (a) is left as an exercise (Exercise 1). Since σn is itself
the partial sum of a trigonometric series, the result (b) follows by Corollary
9.1.5.
Our next step is to express σn (x) as an integral analogous to that of Sn (x)
in the previous theorem.
where
n
1 X
Fn (t) = Dk (t).
n+1
k=0
1
If t = 2pπ, p ∈ Z, then Dk (2pπ) = k + 2, and thus
n
1 X (n + 1)
Fn (2pπ) = (k + 12 ) = .
n+1 2
k=0
If t 6= 2pπ, p ∈ Z, then
n
1 X
Fn (s) = sin(k + 12 )t
2(n + 1) sin 2t k=0
n
1 X
= sin 2t sin(k + 21 )t.
2(n + 1) sin2 t
2 k=0
THEOREM 9.4.4
(a) Fn is periodic of period 2π with Fn (−t) = Fn (t).
(b) Fn (t) ≥ 0 for all t.
1 Rπ
(c) Fn (t)dt = 1.
π −π
(d) For 0 < δ < π, lim Fn (t) = 0 uniformly for all t, δ ≤ |t| ≤ π.
n→∞
and sin 21 (t + 2π) = − sin 2t , substituting into the formula for Fn gives
Fn (t + 2π) = Fn (t). Therefore Fn is periodic of period 2π. The proof of (b) is
obvious.
Rπ
(c) Since π1 Dk (t)dt = 1, we have
−π
π n π
1 1 X1
Z Z
Fn (t)dt = Dk (t) dt = 1.
π −π n+1 π −π
k=0
1
Fn (t) ≤ for all t, δ ≤ |t| ≤ π.
2(n + 1) sin2 δ
2
1 π
Z
σn (x) = f (t)Fn (x − t)dt,
π −π
π−x
1
Z
= f (s + x)Fn (s) ds.
π −π−x
such that
|f (x) − Tn (x)| < ǫ
for all x ∈ [−π, π]. The function Tn (x) is called a trigonometric polynomial
since in complex form it can be rewritten as
n
X
Tn (x) = ck eikx ,
k=−n
LEMMA 9.4.8 Let f ∈ R[a, b] with |f (x)| ≤ M for all x ∈ [a, b]. Then
given ǫ > 0, there exists a continuous function g on [a, b] with g(a) = g(b) and
|g(x)| ≤ M for all x ∈ [a, b], such that
Z b
|f (x) − g(x)| dx < ǫ.
a
440 Introduction to Real Analysis
Before proving the lemma, we will first use the result to prove Theorem
9.4.7, then consider some consequences of this theorem.
Proof of Theorem 9.4.7 Suppose |f (x)| ≤ M with M > 0, and let ǫ > 0
be given. By the lemma, there exists a continuous function g on [−π, π] with
g(−π) = g(π) and |g(x)| ≤ M for all x ∈ [−π, π] such that
Z π
ǫ
|f (x) − g(x)| dx < . (9)
−π 8M
Let Sn (g)(x) be the function defined by
1 π
Z
Sn (g)(x) = g(t)Dn (x − t) dt,
π −π
where Dn is the Dirichlet kernel. Then since g is continuous, by Corollary
9.4.6 the sequence {Sn (g)} converges in the mean to g on [−π, π]. Thus there
exists no ∈ N such that
Z π
ǫ
[g(x) − Sn (g)(x)]2 dx <
−π 4
Rπ
for all n ≥ no . Consider −π [f (x) − Sn (g)(x)]2 dx. Since
|f (x) − Sn (g)(x)| ≤ |f (x) − g(x)| + |g(x) − Sn (g)(x)|,
we have
|f (x) − Sn (g)(x)|2 ≤ 2 |f (x) − g(x)|2 + |g(x) − Sn (g)(x)|2 .
However, for each n ∈ N, Sn (g) is the nth partial sum of a trigonometric series.
Thus if Sn is the nth partial sum of the Fourier series of f , by Corollary 9.1.5
Z π Z π
[f (x) − Sn (x)]2 dx ≤ [f (x) − Sn (g)(x)]2 dx.
−π −π
which gives
∞
X 1 π2
= .
n=1
n2 6
Proof of Lemma 9.4.8. Let f ∈ R[a, b] with |f (x)| ≤ M for all x ∈ [a, b].
Since f1 (x) = f (x) + M is nonnegative, if we can prove the result for the
function f1 , the result also follows for the function f . Thus we can assume
that f satisfies 0 ≤ f (x) ≤ M for all x ∈ [a, b].
Let ǫ > 0 be given. Since f is Riemann integrable on [a, b], there exists a
partition P = {x0 , x1 , ..., xn } of [a, b] such that
Z b
ǫ
0≤ f (x) dx − L(P, f ) < ,
a 2
where
n
X
L(P, f ) = mi ∆xi ,
i=1
and mi = inf{f (t) : t ∈ [xi−1 , xi ]}, i = 1, 2, ..., n. For each i = 1, 2, ..., n, define
the functions hi on [a, b] by
(
mi , xi−1 ≤ x < xi ,
hi (x) =
0, elsewhere,
n
P
and let h(x) = hi (x) (see Figure 9.5). The function h is called a step
i=1
function on [a, b]. Since h is continuous except at a finite number of points,
h is Riemann integrable and
Z b Xn
h(x) dx = mi ∆xi = L(P, f ).
a i=1
442 Introduction to Real Analysis
FIGURE 9.5
Graph of the Step Function h
Therefore,
b
ǫ
Z
0≤ [f (x) − h(x)] dx < .
a 2
Also, since mi = inf{f (t) : t ∈ [xi−1 , xi ]}, 0 ≤ h(x) ≤ f (x) for all x ∈ [a, b]. By
taking slightly shorter intervals, and connecting the endpoints with straight
line segments, we leave it as an exercise (Exercise 9) to show that there exists
a continuous function g on [a, b] with g(a) = g(b) = 0, 0 ≤ g(x) ≤ h(x), such
that Z b
ǫ
[h(x) − g(x)] dx < .
a 2
Then 0 ≤ g(x) ≤ f (x), and
Z b Z b Z b
0≤ [f (x) − g(x)] dx = [f (x) − h(x)] dx + [h(x) − g(x)] dx
a a a
ǫ ǫ
< + = ǫ.
2 2
Exercises 9.4
1. Prove Lemma 9.4.2(a).
2. *Using the Fourier series of f (x) = x2 and Parseval’s equality, find
P∞ 1
4
.
n=1 n
9. Complete the proof of Lemma 9.4.8; namely, given the step function h
on [a, b] and ǫ > 0, there exists a continuous function g on [a, b] with
Rb
0 ≤ g(x) ≤ h(x), g(a) = g(b) = 0, and [h(x) − g(x)] dx < ǫ.
a
10. * Let f ∈ R[a, b]. Given ǫ > 0, prove that there exists a polynomial p(x)
Rb
such that |f (x) − p(x)|dx < ǫ.
a
1 x−π 1 x+π
Z Z
=− f (x − s)Dn (s) ds = f (x − s)Dn (s) ds.
π x+π π x−π
Since both f and Dn are periodic of period 2π, by Theorem 8.6.3
1 x+π 1 π
Z Z
f (x − s)Dn (s) ds = f (x − s)Dn (s) ds.
π x−π π −π
Therefore,
0 π
1 1
Z Z
Sn (x) = f (x − s)Dn (s) ds + f (x − s)Dn (s) ds.
π −π π 0
where g1 (t) = g(t) cos 2t and g2 (t) = g(t) sin 2t . Since g is Riemann integrable
on [−π, π], so are both g1 and g2 . Thus by Theorem 9.3.4,
Z π Z π
lim g1 (t) sin nt dt = lim g2 (t) cos nt dt = 0,
n→∞ −π n→∞ −π
Dirichlet’s Theorem
Before we state and prove Dirichlet’s theorem, we briefly review some notation
introduced in Chapter 4. For a real-valued function f defined in a neighbor-
hood of a given point p, the right and left limits of f at p, denoted f (p+)
and f (p−) respectively, are defined by
For i = 0 and n, we of course only require the existence of the right and left
limit respectively. Also, we do not require that f be defined at x0 , x1 , ..., xn .
In addition to piecewise continuous functions, it will also be useful to
consider functions for which the derivative is piecewise continuous. If f is
piecewise continuous on [a, b], then the derivative f ′ is piecewise continuous
on [a, b] if there exist a finite number of points a = x0 < x1 < · · · < xn = b
such that
(a) f ′ (x) exists and is continuous on each interval (xi−1 , xi ), i = 1, 2, ..., n,
and
(b) for each i = 0, 1, 2, ..., n, the quantities f ′ (xi +) and f ′ (xi −) both exist as
finite limits. Again, for the endpoints a and b we obviously only require the
existence of f ′ (a+) and f ′ (b−).
(
x, 0 < x < 1,
EXAMPLES 9.5.5 (a) Consider the function f (x) =
x2 + 2, 1 < x < 2.
Since f is continuous on (0, 1) and (1, 2) and f (0+), f (1−), f (1+), and f (2−)
all exist, f is piecewise continuous on [0, 2]. Also, since
(
1, 0 < x < 1,
f ′ (x) =
2x, 1 < x < 2,
and f ′ (0+), f ′ (1−), f ′ (1+), and f ′ (2−) all exist, f ′ is also piecewise contin-
uous on [0, 2].
(b) As our second example, consider the function
(
0, x = 0,
f (x) = 2 1
x sin x , 0 < x ≤ 1.
448 Introduction to Real Analysis
However, since f ′ (0+) does not exist, f ′ is not piecewise continuous on [0, 1].
there exists x1 > xo such that f and f ′ are continuous on (xo , x1 ). If f is not
continuous at xo , redefine f at xo as f (xo +). Then f (redefined if necessary)
is continuous on [xo , x1 ), and thus by Theorem 5.2.11
f (xo + t) − f (xo +)
lim = f ′ (xo +).
t→0+ t
As a consequence, there exists a constant M and a δ > 0 such that
for all t, 0 < t < δ. Similarly, the existence of f ′ (xo −) implies the existence
of a constant M and a δ > 0 such that
for all t, 0 < t < δ. Thus f satisfies the hypothesis of Dirichlet’s theorem. On
combining the above discussion with Theorem 9.5.3, we obtain the following
corollary.
for all x ∈ R.
FIGURE 9.6
Graph of 12 [f (x+) + f (x−)]
The graph of 21 [f (x+) + f (x−)] on the interval [−2π, 2π] is given in Figure
9.6.
To discuss convergence of the Fourier series of f , we first note that f is
piecewise continuous on [−π, π] and differentiable on (−π, − π2 ), (− π2 , π2 ), and
( π2 , π) with f ′ (x) = 0 in the respective intervals. Thus f ′ is also piecewise
continuous on [−π, π]. As a consequence of Corollary 9.5.6 the Fourier series
of f converges to 21 [f (x+) + f (x−)] for all x ∈ R.
The Fourier series of f is obtained as follows:
Z π2
1
a0 = 3 dx = 3,
π − π2
Z π2
1 6 nπ
an = 3 cos nx dx = sin , n = 1, 2, ...
π − π2 nπ 2
and since f is an even function, bn = 0 for all n ∈ N. Therefore,
∞
3 X 6 nπ
f (x) ∼ + sin cos nx.
2 n=1 nπ 2
If n is even, sin nπ
2 = 0. If n is odd, i.e., n = 2k + 1,
π π
sin(2k + 1) π2 = sin kπ + = cos kπ sin = (−1)k .
2 2
Thus
∞
3 6 X (−1)k
f (x) ∼ + cos(2k + 1)x.
2 π 2k + 1
k=0
When x = 0, the series converges to f (0) = 3. As a consequence
∞
3 6 X (−1)k
3= + ,
2 π 2k + 1
k=0
450 Introduction to Real Analysis
(b) Dirichlet’s theorem can also be applied to the Fourier sine and cosine
series of a real-valued function f defined on [0, π). As an example, consider
the cosine series of f (x) = x on (0, π). By Exercise 5(b) of Section 9.3, the
cosine series of f (x) = x is given by
∞
1 4X 1
x∼ π− cos(2k − 1)x.
2 π (2k − 1)2
k=1
Since the even extension fe of f to [−π, π] is given by fe (x) = |x|, the cosine
series of x is the Fourier series of |x| on [−π, π]. Since both fe and fe′ are
piecewise continuous on [−π, π], the Fourier series converges to the 2π-periodic
extension of |x| for all x ∈ R.
By Dirichlet’s theorem, the Fourier series converges to |x| for all x ∈
[−π, π]. Taking x = 0 gives
∞
π 4X 1
0= − ,
2 π (2k − 1)2
k=1
∞
P 1 π2
or 2
= .
k=1 (2k − 1) 8
Remarks. Although the inequalities (10) of Theorem 9.5.3 are sufficient, they
are by no means necessary. There are variations of Dirichlet’s theorem which
provide sufficient conditions on f to guarantee convergence of the series to
1
2 [f (x0 +) + f (x0 −)]. For example, the inequalities of Theorem 9.5.3 can be
replaced by the following:
for all t, 0 < t < δ, and some α, 0 < α ≤ 1. If f satisfies the above at xo ,
then the conclusion of Dirichlet’s theorem is still valid. An even more general
condition is due to Ulisse Dini (1845–1918) who proved that if f ∈ R[−π, π]
satisfies
Z δ Z δ
|f (xo + t) − f (xo +)| |f (xo − t) − f (xo −)|
dt < ∞ and dt < ∞
0 t 0 t
for some δ > 0, then the Fourier series of f converges to 21 [f (xo +) + f (xo −)]
at xo . Both of the above hold if f satisfies (11) at xo .
Fourier Series 451
does not converge since its nth term fails to approach zero as n → ∞. The 2π-
periodic extension of f has discontinuities at x = ±(2k − 1)π, k ∈ N. It turns
out that continuity of the periodic function is important for differentiability
of the Fourier series. Sufficient conditions are given by the following theorem.
THEOREM 9.5.8 Let f be a continuous function on [−π, π] with f (−π) =
f (π), and let f ′ be piecewise continuous on [−π, π]. If
∞
1 X
f (x) = a0 + (an cos nx + bn sin nx), x ∈ [−π, π],
2 n=1
Remark. At a point x where f ′′ (x) does not exist but f ′′ (x−) and f ′′ (x+)
both exist, the series (12) converges to 21 [f ′ (x−) + f ′ (x+)].
Proof. Suppose x ∈ (−π, π) is such that f ′′ (x) exists. Since f ′ is continuous
at x, by Dirichlet’s theorem
∞
1 X
f ′ (x) = α0 + (αn cos nx + βn sin nx).
2 n=1
Exercises 9.5
1. For each of the functions of Exercise 3, Section 9.3, sketch the graph of the
function on the interval [−2π, 2π] to which the Fourier series converges.
2. *Use the Fourier series of f (x) = x2 to find each of the following sums:
P∞ (−1)n+1 P∞ 1
a. , b. .
n=1 n2 n=1 n
2
6 “Une série de Fourier–Lebesgue divergente partout,” Compte Rendus, 183 (1926), 1327–
1328.
7 “On convergence and growth of partial sums of Fourier Series,” Acta Math., 116 (1966),
135–157.
454 Introduction to Real Analysis
8. Suppose f ∈ R[−π, π], and xo ∈ (−π, π) is such that f (xo −) and f (xo +)
exist, and that
f (xo + t) − f (xo +) f (xo − t) − f (xo −)
lim , lim
t→0+ t t→0+ t
both exist as finite limits. Prove that f satisfies the hypothesis of Dirich-
let’s theorem (9.5.3).
9. Suppose f ∈ R[−π, π] and xo ∈ (−π, π) is such that f (xo −) and f (xo +)
both exist. If f satisfies the inequalities (11) at xo for some α, 0 < α ≤ 1,
prove that the Fourier series of f at xo converges to 21 [f (xo +) + f (xo −)].
Notes
There is no doubt about the significance of Fourier’s contributions to the areas of
mathematical physics and applied mathematics; one only needs to consult a text on
partial differential equations. The methods which he developed in connection with
the theory of heat conduction are applicable to a large class of physical phenom-
ena, including problems in acoustics, elasticity, optics, and the theory of electrical
networks, among others. Fourier’s work however is even more significant in that it
inaugurated a new area of mathematics.
The study of Fourier series led to the development of the fundamental concepts
and methods of what is now called real analysis. The study of the concept of a
function by Dirichlet and others was directly linked to their interest in Fourier series.
The study of Fourier series by Riemann led to his development of the Riemann
integral. He was concerned with the question of finding sufficient conditions for the
existence of the integrals which gave the Fourier coefficients of a function f , that is
1 π 1 π
Z Z
an = f (x) cos nx dx and bn = f (x) sin nx dx.
π −π π −π
The quest for an understanding of what types of functions possessed Fourier series is
partly responsible for the development of Lebesgue’s theory of integration. The need
for a more extensive theory of integration was illustrated by Lebesgue in 1903 in the
paper “Sur les series trigonometric.”8 In this paper, he constructed an example of a
function that is not Riemann integrable but that is representable everywhere by its
Fourier series. Such a function is f (x) = − ln |2 sin(x/2)| whose Fourier series is
∞
X cos kx
.
k
k=1
This series converges everywhere to the function f on [−π, π], but since f is un-
bounded, it is not Riemann integrable on [−π, π]. It is however integrable in the
sense of Lebesgue. The article by Alan Gluchoff provides an excellent exposition
on the influence of trigonometric series to the theories of integration of Cauchy,
Riemann, and Lebesgue.
There are several important topics that we have not touched upon in the course
of this chapter. One of these is the Gibbs phenomenon, named after Josiah Gibbs
(1839–1903). To explain this phenomenon, we consider the Fourier series of f (x) =
0, x ∈ [−π, 0), and f (x) = 1, x ∈ [0, π) of Example 9.3.3(a). By Dirichlet’s theorem
we have
∞ 0,
−π < x < 0,
1 2 X 1
+ sin(2k − 1)x = 21 , x = 0,
2 π (2k − 1)
k=1
1, 0 < x < π.
A careful examination of the graphs (see Figure 9.2) of the partial sums S3 , S5 ,
and S15 , shows that near 0, each of the functions Si , i = 3, 5, 15, has an absolute
maximum at a point ti , where the ti get closer to 0, but Si (ti ) is bounded away from
1.
We now consider the behavior of the partial sums Sn more closely. For n =
(2k − 1), k ∈ N,
1 2 1 1
S2k−1 (x) = + sin x + sin 3x + · · · + sin(2k − 1)x ,
2 π 3 (2k − 1)
and
′ 2
S2k−1 (x) = [cos x + cos 3x + · · · + cos(2k − 1)x] .
π
′
If we multiply S2k−1 (x) by sin x and use the identity
we obtain
′ 1
sin x S2k−1 (x) = sin kx.
π
From this it now follows that S2k−1 (x) has relative maxima and minima at the points
These points are equally spaced in (−π, π). Consider the points xk = π/(2k), at
which each S2k−1 has an (absolute) maximum with
1 2 π 1 3π 1 (2k − 1)π
S2k−1 (xk ) = + sin + sin + ··· + sin .
2 π 2k 3 2k 2k − 1 2k
To find lim S2k−1 (xk ), we write the above sum as a Riemann sum of the function
k→∞
g(x) = (sin x)/x. This function is Riemann integrable on [0, π]. With the partition
P of [0, π] given by yj = j πk , j = 0, 1, 2, ..., k, and tj = 12 (yj−1 + yj ),
k
2 π 1 3π 1 (2k − 1)π 1X
sin + sin + ··· + sin = g(tj )∆yj .
π 2k 3 2k 2k − 1 2k π j=1
Therefore,
1 π sin x
Z
1
lim S2k−1 (xk ) =
+ dx.
k→∞ 2 π 0 x
To approximate the integral we use the Taylor series expansion of sin x. This gives
sin x x2 x4 x6 x8
=1− + − + − ··· , x ∈ R.
x 3! 5! 7! 9!
456 Introduction to Real Analysis
Therefore,
π
π3 π5 π7 π9
Z
1 sin x 1
dx = π− + − + − ···
π 0 x π 3 · 3! 5 · 5! 7 · 7! 9 · 9!
π2 π4 π6 π8
=1− + − + − ···
3 · 3! 5 · 5! 7 · 7! 9 · 9!
≈ 1 − .54831 + .16235 − .02725 + .00291
≈ .59 (to two decimal places).
The notation “≈” denotes approximately equal to. Therefore lim S2k−1 (xk ) ≈ 1.09.
k→∞
Even though lim Sn (x) = 1 for all x ∈ (0, π), if we approach 0 along the points xk ,
n→∞
then S2k−1 (xk ) overshoots the value 1 by approximately .09, i.e.,
for all x ∈ [−π, π], must Ak = Ck and Bk = Dk for all k = 0, 1, 2, ...? Alternately, if
∞
X
(Ak cos kx + Bk sin kx) = 0 (13)
k=0
for all x ∈ [−π, π], must Ak = Bk = 0 for all k = 0, 1, 2, ...? By (13) we mean that
lim Sn (x) = 0 for all x ∈ [−π, π], where Sn is the nth partial sum of the series.
n→∞
In 1870 Eduard Heine proved that if the sequence {Sn } converges uniformly
to 0 on [−π, π] then Ak = Bk = 0 for all k. This is Theorem 9.3.6. The general
uniqueness problem was solved by Cantor in the early 1870’s. He was also able to
prove uniqueness if (13) holds for all but a finite number of x in [−π, π]. This then led
Cantor to consider the uniqueness problem for infinite subsets of [−π, π]; specifically,
if E ⊂ [−π, π] is infinite and (13) holds for all x ∈ [−π, π] \ E, must Ak = Bk = 0
for all k? Since point set theory was undeveloped at this time, this question also led
Cantor to devote much of his time and effort to studying point subsets of R. For a
thorough discussion of the uniqueness problem the reader is referred to the article
by Marshall Ash listed in the supplemental readings or to the text by A. Zygmund
listed in the Bibliography.
Fourier Series 457
Micellaneous Exercises
∞
1
P
1. Suppose f is continuous on [−π, π) and f (x) = A
2 0
+ (Ak cos kx +
k=1
Bk sin kx). Let Sn be the nth partial sum of the series. If there exists a
positive constant M such that |Sn (x)| ≤ M for all x ∈ [−π, π] and n ∈ N,
prove that Ak and Bk are the Fourier coefficients of f .
Rπ
2. Prove that lim |Dn (t)| dt = ∞, where Dn is the Dirichlet kernel. (See
n→∞ 0
Example 6.4.4(b).)
3. Let f (x) = − ln |2 sin(x/2)|. Show that the Fourier series of f is given
∞
P
by (cos kx)/k. (Note: Since f is unbounded at x = 0, the integrals
k=1
defining ak and bk have to be interpreted as improper integrals.)
Supplemental Reading
The concept of measure plays a very important role in the theory of real
analysis. On the real line the idea of measure generalizes the length of an
interval, in the plane, the area of a rectangle, and so forth. It allows us to talk
about the measure of a set in the same way that we talk about the length of
an interval. The development of the Riemann integral of a bounded function
on a closed and bounded interval [a, b] depended very much on the fact that
we partitioned [a, b] into intervals. The notion of measure and measurable set
will play a prominent role in the development of the Lebesgue integral in that
we will partition [a, b] not into intervals, but instead into pairwise disjoint
measurable sets.
The theory of measure is due to Henri Lebesgue (1875–1941) who in his
famous 1902 thesis defined measure of subsets of the line and the plane, and
also the Lebesgue integral of a nonnegative function. Like Riemann, Lebesgue
was also led to the development of his theory of integration by the problem
of finding sufficient conditions on a function f for which the integrals defining
the Fourier coefficients of f exist. In this chapter, we will develop the theory of
Lebesgue measure of subsets of R following the original approach of Lebesgue
using inner and outer measure. Although this approach is somewhat more
tedious than the modern approach due to Constantin Carathéodory (1873–
1950), it has the advantage of being more intuitive and conceptually easier to
visualize.
In the first section we will illustrate the need for the concept of measure
of a set and measurable function by considering an alternate approach to in-
tegration developed by Lebesgue in 1928. Although this ultimately will not
be how we define the Lebesgue integral, the approach is instructive in empha-
sizing the concepts required for the development of the Lebesgue theory of
integration. In Section 10.2, we use the fact that every open subset of R can
be expressed as a finite or countable union of disjoint open intervals to define
the measure of open sets, and then of compact sets. These are then used to
define the inner and outer measure of subsets of R. A bounded subset of R is
then said to be measurable if these two quantities are the same.
In Section 10.6, we develop the theory of the Lebesgue integral of a
bounded real-valued function using upper and lower sums as in the devel-
opment of the Riemann integral. However, rather than using point partitions
of the interval, we will use measurable partitions. They key result of the sec-
tion is that a bounded real-valued function on [a, b] is Lebesgue integrable if
459
460 Introduction to Real Analysis
FIGURE 10.1
Partition of the range of f
In Figure 10.1,
β β
E4 = x ∈ [a, b] : 3 ≤ f (x) < 4 = [a, x1 ) ∪ [x2 , x3 ) ∪ [x4 , x5 ).
8 8
Assuming that we can do this for each of the sets Ej , a lower approximation
to the area under the graph of f is given by
n
X β
(j − 1) m(Ej ),
j=1
n
Taking limits as n → ∞, assuming that they exist and are equal, would in
fact provide another approach to integration. That this indeed is the case for
a large class of functions, including the Riemann integrable functions, will be
proved in Section 10.6.
462 Introduction to Real Analysis
For nice functions, namely those for which the sets Ej are finite unions of
intervals, the above is perfectly reasonable. However, suppose our function f
is defined on [0, 1] by
(
0, x ∈ Q ∩ [0, 1],
f (x) =
x, elsewhere.
E + x = {y + x : y ∈ E}, and
S P
(c) λ( Ek ) = λ(Ek ) for any finite or countable family {Ek } of pairwise
disjoint sets in M.
Thus if J is (a, b), (a, b], [a, b), or [a, b], a, b ∈ R, then
m(J) = b − a.
If J is R, (a, ∞), [a, ∞), (−∞, b) or (−∞, b], we set m(J) = ∞. In dealing with
the symbols ∞ and −∞, it is customary to adopt the following conventions:
Lebesgue Measure and Integration 463
where the In are pairwise disjoint bounded open intervals, we may still have
∞
X
m(U ) = m(In ) = ∞,
n=1
due to the divergence of the series to ∞. Since m(In ) ≥ 0 for all n, the
sequence of partial sums is monotone increasing and thus will either converge
to a real number or diverge to ∞.
464 Introduction to Real Analysis
(c) As in Section 2.5, let U denote the complement of the Cantor set in
[0, 1]. Since U is the union of the open intervals that have been removed, by
Property 4 of the Cantor set,
m(U ) = 1.
We now state and prove several results concerning the measure of open
sets.
m(U ) ≤ m(V ).
where {In }n and {Jm }m are finite or countable collections of pairwise disjoint
open intervals. Since U ⊂ V , each interval In ⊂ Jm for some m. For each m,
let
Nm = {n : In ⊂ Jm }.
Since the collection {Jm }m is pairwise disjoint, so is the collection {Nm }m ,
and [ [ [
U= In = In .
n m n∈Nm
Lebesgue Measure and Integration 465
Therefore X X
m(U ) = m(In ).
m n∈Nm
But by Exercise 1, X
m(In ) ≤ m(Jm ),
n∈Nm
Uk ⊂ Uk+1 ⊂ U
U ∩ (−k, k) = U
for all k ≥ ko . Hence m(Uk ) = m(U ) for all k ≥ ko , and thus equality holds
in (1).
Suppose that U is an unbounded open subset of R with
[
U= In ,
n
(−∞, an ) or (an , ∞)
and thus
∞ = lim m(In ∩ (−k, k)) ≤ lim m(Uk ) ≤ m(U ).
k→∞ k→∞
Therefore equality holds in (1). The other two cases follow similarly.
Suppose m(In ) < ∞ for all n. Since U is unbounded, the collection {In }
must be infinite. If the collection were finite, then since each interval has
finite length, each interval is bounded, and as a consequence U must also be
bounded. Let α ∈ R with α < m(U ). Since
∞
X
m(In ) = m(U ) > α,
n=1
But V ∩(−k, k) ⊂ Uk for all k ∈ N. Hence by Theorem 10.2.4, m(V ∩(−k, k)) ≤
m(Uk ) and as a consequence
and thus
m
X m Z
X b Z b
m(U ) = m(Jk ) = χJk (x) dx = χU (x) dx. (2)
k=1 k=1 a a
Proof of Lemma 10.2.7 Let {In }N n=1 be a finite collection of bounded open
SN
intervals and let U = n=1 In . Choose a, b ∈ R such that U ⊂ [a, b]. Then χU
is continuous except at a finite number of points, and
N
X
χU (x) ≤ χIn (x).
n=1
468 Introduction to Real Analysis
Proof of Theorem 10.2.6 Since the result for a finite collection follows ob-
viously from that of a countable collection, we suppose {Un }∞
n=1 is a countable
collection of open sets and
[∞
U= Un .
n=1
S
Since U is open, U = m Jm where {Jm }m is a finite or countable collection
of pairwise disjoint open intervals. Also, for each n,
[
Un = In,k ,
k
Thus
N
X
m(U ) < m(Jm ) + ǫ.
m=1
X X mi
J X
≤ m(Ini ,kij ) = m(Ini ,kij )
i,j i=1 j=1
J
X ∞
X
≤ m(Uni ) ≤ m(Un ).
i=1 n=1
Since ǫ > 0 was arbitrary, the result follows for the case where U is bounded.
If U is unbounded, then for each k ∈ N,
∞
X ∞
X
m(U ∩ (−k, k)) ≤ m(Un ∩ (−k, k)) ≤ m(Un ).
n=1 n=1
Proof. (a) If both U and V are unions of a finite number of bounded open
intervals, then so are U ∩ V and U ∪ V . Thus the functions
χU , χV , χU ∪V , χU ∩V
where the collections {In } and {Jn } consist of pairwise disjoint open intervals
respectively. If one of m(U ) or m(V ) is ∞, then by Theorem 10.2.4, m(U ∪V ) =
∞ and the conclusion holds. Thus we can assume that both m(U ) and m(V )
are finite.
Let ǫ > 0 be given. Choose N ∈ N such that
∞
X ∞
X
m(In ) < ǫ and m(Jn ) < ǫ.
n=N +1 n=N +1
Since the sets U ∗ and V ∗ are finite unions of open intervals, by part (a)
≤ m(U ∪ V ) + m(U ∩ V ).
U ∪ V = (U ∗ ∪ V ∗ ) ∪ (U ∗∗ ∪ V ∗∗ ),
and as a consequence
U ∩ V = (U ∗ ∪ U ∗∗ ) ∩ (V ∗ ∪ V ∗∗ )
= (U ∗ ∩ V ∗ ) ∪ (U ∗∗ ∩ V ∗ ) ∪ (U ∗ ∩ V ∗∗ ) ∪ (U ∗∗ ∩ V ∗∗ )
⊂ (U ∗ ∩ V ∗ ) ∪ U ∗∗ ∪ V ∗∗ .
Therefore,
m(U ∩ V ) < m(U ∗ ∩ V ∗ ) + 2ǫ.
Combining the above gives
= m(U ∗ ) + m(V ∗ ) + 4ǫ
≤ m(U ) + m(V ) + 4ǫ.
Again since ǫ > 0 was arbitrary, this proves the reverse inequality.
U = K ∪ (U \ K).
Using the fact that U \ K is also open and bounded, and thus has finite
measure, we define the measure of K as follows:
EXAMPLES 10.2.12 (a) In our first example we show that for a closed
and bounded interval [a, b], a, b ∈ R, Definition 10.2.10 is consistent with
Definition 10.2.1; namely
m([a, b]) = b − a.
Let U = (a − ǫ, b + ǫ), ǫ > 0. Then
U \ [a, b] = (a − ǫ, a) ∪ (b, b + ǫ).
Therefore,
m([a, b]) = m(U ) − m(U \ [a, b])
= (b − a) + 2ǫ − 2ǫ = b − a.
THEOREM 10.2.13
(a) If K is compact and U is open with K ⊂ U , then m(K) ≤ m(U ).
(b) If K1 and K2 are compact with K1 ⊂ K2 , then m(K1 ) ≤ m(K2 ).
Since m(In ∩ [a, b]) = m(In ∩ (a, b)) for all n, we have
Remark. What the above definition really defines is the measure of relatively
open subsets of [a, b] (see Definition 2.2.21). By Theorem 2.2.23, a subset G
of [a, b] is open in [a, b] if and only if there exists an open subset U of R such
that G = U ∩ [a, b]. Since the set U may not be unique, we leave it as an
exercise (Exercise 4) to show that if U, V are open subsets of R with
But
V \ K = V ∩ (U c ∩ [a, b])c = (V ∩ U ) ∪ (V ∩ [a, b]c ) ⊃ V ∩ U.
Therefore, m(V ∩ U ) ≤ m(V \ K). Since U ∩ [a, b] ⊂ U ∩ V ,
m(U ∩ [a, b]) + m(K) ≤ m(U ∩ V ) + m(V ) − m(V \ K) ≤ m(V ).
Given ǫ > 0, take V = (a − ǫ, b + ǫ). Then
m(U ∩ [a, b]) + m(U c ∩ [a, b]) ≤ b − a + 2ǫ.
Since ǫ > 0 is arbitrary, this proves
m(U ∩ [a, b]) + m(U c ∩ [a, b]) ≤ b − a.
To prove the reverse inequality, let Iǫ = [a+ǫ, b−ǫ], where 0 < ǫ < 21 (b−a).
Then
m(U ∩ [a, b]) + m(U c ∩ [a, b]) ≥ m(U ∩ (a, b)) + m(U c ∩ Iǫ ).
Since (a, b) is an open set containing U c ∩ Iǫ ,
m(U c ∩ Iǫ ) = b − a − m((a, b) \ (U c ∩ Iǫ )).
But
m((a, b) \ (U c ∩ Iǫ )) = m(((a, b) ∩ U ) ∪ ((a, b) ∩ Iǫc ))
= m(((a, b) ∩ U ) ∪ (a, a − ǫ) + (b − ǫ, b)),
Exercises 10.2
1. If {In }n is a finite or countable collection of disjoint open intervals with
S P
n In ⊂ (a, b), prove that m(In ) ≤ m((a, b)).
n
2. *If U 6= ∅ is an open subset of R, prove that m(U ) > 0.
3. *Let P denote the Cantor set in [0, 1]. Prove that m(P ) = 0.
4. Suppose U, V are open subsets of R, a, b ∈ R with U ∩ [a, b] = V ∩ [a, b].
Prove that m(U ∩ [a, b]) = m(V ∩ [a, b]).
5. If A, B are subsets of R, prove that
χA∩B (x) = χA (x)χB (x),
χA∪B (x) = χA (x) + χB ( x) − χA∩B (x),
χAc (x) = 1 − χA (x).
6. *If K1 and K2 are disjoint compact subsets of R, prove that
m(K1 ∪ K2 ) = m(K1 ) + m(K2 ).
Lebesgue Measure and Integration 475
THEOREM 10.3.2
(a) For any subset E of R,
0 ≤ λ∗ (E) ≤ λ∗ (E).
0 ≤ m(K) ≤ m(U ).
If we fix K, then m(K) ≤ m(U ) for all open sets U containing E. Taking the
infimum over all such U gives
0 ≤ m(K) ≤ λ∗ (E).
Taking the supremum over all compact subsets K of E proves (a). The proof
of (b) is similar and is left as an exercise (Exercise 7).
λ∗ (E) = λ∗ (E) = 0.
Suppose E = {xn }∞
n=1 . Let ǫ > 0 be arbitrary. For each n, let
ǫ ǫ
I n = x n − n , xn + n ,
2 2
476 Introduction to Real Analysis
S∞
and set U = n=1 In . Then U is open with E ⊂ U . By Theorem 10.2.6,
∞ ∞
X X ǫ
m(U ) ≤ m(In ) = n−1
= 2 ǫ.
n=1 n=1
2
Therefore, λ∗ (E) < 2ǫ. Since ǫ > 0 was arbitrary, λ∗ (E) = 0. As a conse-
quence, we also have λ∗ (E) = 0.
(b) If I is any bounded interval, then
λ∗ (I) ≤ m(I) = b − a.
On the other hand, if 0 < ǫ < (b − a), then [a + ǫ/2, b − ǫ/2] is a compact
subset of I, and as a consequence
h ǫ ǫ i
b − a − ǫ = m a + ,b − ≤ λ∗ (I).
2 2
Therefore,
b − a − ǫ ≤ λ∗ (I) ≤ λ∗ (I) ≤ b − a.
Since ǫ > 0 was arbitrary, equality holds. A similar argument proves that if I
is any closed and bounded interval, then
λ∗ (U ) = λ∗ (U ) = m(U ).
Measurable Sets
In both of the previous examples, the inner and outer measure of the sets
are equal. As we shall see, all subsets of R build out of open sets or closed
sets by countable unions, intersections, and complementation will have this
property, and this includes most sets encountered in practice. In fact, the
explicit construction of a set whose inner and outer measure are different
requires use of an axiom from set theory, the Axiom of Choice, which we have
not discussed. The construction of such a set is outlined in the miscellaneous
exercises.
DEFINITION 10.3.4
(a) A bounded subset E of R is said to be Lebesgue measurable or
measurable if
λ∗ (E) = λ∗ (E).
If this is the case, then the measure of E, denoted λ(E), is defined to be
exists.
(c) There is no discrepancy between the two parts of the definition. We
will shortly prove in Theorem 10.4.1 that if E is a bounded measurable set,
then E ∩ I is measurable for every interval I. Conversely, if E is a bounded
set for which
λ∗ (E ∩ [a, b]) = λ∗ (E ∩ [a, b])
478 Introduction to Real Analysis
for all a, b ∈ R, then by choosing a and b sufficiently large such that E ⊂ [a, b],
we have λ∗ (E) = λ∗ (E). The two separate definitions are required due to the
existence of unbounded nonmeasurable sets E for which
λ∗ (E) = λ∗ (E) = ∞.
Proof. Suppose E ⊂ R with λ∗ (E) = 0. Then for any closed and bounded
interval I,
λ∗ (E ∩ I) ≤ λ∗ (E) = 0
Thus λ∗ (E ∩I) = λ∗ (E ∩I) = 0, and hence E ∩I is measurable for every closed
and bounded interval I. Since λ(E ∩ [−k, k]) = 0 for every k ∈ N, λ(E) = 0.
As a consequence of the previous theorem and Example 10.3.3(a), every
countable set E is measurable with λ(E) = 0. In particular, Q is measurable
with λ(Q) = 0. Another consequence of Theorem 10.3.5 is that every subset
of a set of measure zero is measurable.
The last equality follows by Theorem 10.2.15. Taking the infimum over all
open sets U containing E ∩ [a, b] gives
λ∗ (E ∩ [a, b]) + m(K ∩ [a, b]) ≤ m(K c ∩ [a, b]) + m(K ∩ [a, b]) = b − a.
The last equality again follows by Theorem 10.2.15 with U = K c . Thus taking
the supremum over all compact subsets K of E ∩ [a, b] gives
= m(U1 ∪ U2 ) + m(U1 ∩ U2 )
≥ λ∗ (E1 ∪ E2 ) + λ∗ (E1 ∩ E2 ).
The last inequality follows from the definition of outer measure. Since ǫ > 0
was arbitrary, inequality (a) follows.
(b) Let a, b ∈ R be arbitrary. By (a) applied to [a, b] ∩ Eic , we have
Therefore,
Exercises 10.3
1. a. If E ⊂ R, a, b ∈ R, prove that λ∗ (E ∩ (a, b)) = λ∗ (E ∩ [a, b]), and
λ∗ (E ∩ (a, b)) = λ∗ (E ∩ [a, b]).
*b. If E ⊂ R, prove that λ∗ (E + x) = λ∗ (E) and λ∗ (E + x) = λ∗ (E) for
every x ∈ R, where E + x = {a + x : a ∈ E}.
2. Prove that every subset of a set of measure zero is measurable.
3. *Let E1 ⊂ R with λ∗ (E1 ) = 0. If E2 is a measurable subset of R, prove
that E1 ∩ E2 and E1 ∪ E2 are measurable.
4. Let E = [0, 1] \ Q. Prove that E is measurable and λ(E) = 1.
5. Let P denote the Cantor set in [0, 1].
a. Prove that λ∗ (P c ∩ [0, 1]) = 1.
b. Prove that λ∗ (P ) = 0.
6. *If E ⊂ R, prove that there exists a sequence
T {Un } of open sets with
E ⊂ Un for all n ∈ N such that λ∗ (E) = λ∗ ( n Un ).
7. Prove Theorem 10.3.2(b).
8. *Prove Lemma 10.3.8.
9. a. Prove that every compact set K is measurable with λ(K) = m(K).
b. Prove that every closed set is measurable.
E1 ∩ E 2 and E1 ∪ E2
Proof. (a) We consider first the case where both E1 and E2 are bounded
measurable sets, in which case, E1 ∩ E2 and E1 ∪ E2 are also bounded. Since
by Theorem 10.3.9,
Therefore,
and as a consequence,
But for any bounded set E, λ∗ (E) − λ∗ (E) ≥ 0. Thus equality can hold in the
above if and only if both sides are zero; namely,
(b) Suppose one or both of the measurable sets E1 and E2 are unbounded.
Let I = [a, b] with a, b ∈ R. If both E1 and E2 are unbounded, then E1 ∩ I and
E2 ∩ I are measurable by definition. If one of the two sets, say E1 , is bounded,
then by part (a) E1 ∩ I is measurable. Thus in both cases, E1 ∩ I and E2 ∩ I
are bounded measurable sets. But then
(E1 ∩ E2 ) ∩ I = E1 ∩ E2 .
are measurable.
for every a, b ∈ R.
we obtain
λ∗ (E ∩ [a, b]) − λ∗ (E ∩ [a, b]) ≤ 0.
Since λ∗ (E ∩ [a, b]) ≤ λ∗ (E ∩ [a, b]), the above can hold if and only if
THEOREM 10.4.4
(a) If {En }∞
n=1 is a sequence of subsets of R, then
∞ ∞
!
[ X
∗
λ En ≤ λ∗ (En ).
n=1 n=1
(b) If {En }∞
n=1 is a sequence of pairwise disjoint subsets of R, then
∞ ∞
!
[ X
λ∗ En ≥ λ∗ (En ).
n=1 n=1
∞
λ∗ (En ) = ∞, then the conclusion in (a) is certainly true.
P
Proof. (a) If
n=1
∞
λ∗ (En ) < ∞. Let ǫ > 0 be given. For each n ∈ N,
P
Thus we assume that
n=1
there exists an open set Un with En ⊂ Un such that
ǫ
m(Un ) < λ∗ (En ) + .
2n
S∞ S∞
Let U = n=1 Un . Then U is an open subset of R with E = n=1 En ⊂ U .
Thus
∞
!
[
∗
λ (E) ≤ m Un ,
n=1
By induction,
n n ∞
! !
X [ [
λ∗ (Ek ) ≤ λ∗ Ek ≤ λ∗ Ek .
k=1 k=1 k=1
Since the above holds for all n ∈ N, letting n → ∞ gives the desired result.
THEOREM
S∞ 10.4.5 Let {En }∞
n=1 be a sequence of measurable sets. Then
T ∞
n=1 En and n=1 En are measurable with
∞ ∞
!
[ X
(a) λ En ≤ λ(En ).
n=1 n=1
(b) If in addition !
the sets En , n = 1, 2, ..., are pairwise disjoint, then
[∞ X∞
λ En = λ(En ).
n=1 n=1
S∞
Proof. Let E = n=1 En . Without loss of generality we can assume that E
(and hence all the sets En ) is bounded. If not, we consider
∞
[
E ∩ [a, b] = (En ∩ [a, b])
n=1
for a, b ∈ R.
Set A1 = E1 , and for each n ∈ N, n ≥ 2, set
n−1
! n−1
!c
[ \ [
An = En \ Ek = E n Ek .
k=1 k=1
∞ ∞
!c
\ [
c
En = En ,
n=1 n=1
THEOREM 10.4.6
(a) If {En }∞
n=1 is a sequence of measurable sets with E1 ⊂ E2 ⊂ · · · , then
∞
!
[
λ En = lim λ(En ).
n→∞
n=1
(b) If {En }∞
n=1 is a sequence of measurable sets with E1 ⊃ E2 ⊃ · · · and
λ(E1 ) < ∞, then
∞
!
\
λ En = lim λ(En ).
n→∞
n=1
S∞
Proof. (a) Let E = n=1 En . By the previous theorem, E is measurable. If
λ(Ek ) = ∞ for some k, then λ(En ) = ∞ for all n ≥ k and λ(E) = ∞, thus
proving the result. Hence we assume that λ(En ) < ∞ for all n.
Set Eo = ∅, and for n ∈ N, let An = En \ En−1 . Then each An is measur-
able, the collection {An } is pairwise disjoint, and
∞
[
An = E.
n=1
But En = (En \ En−1 ) ∪ En−1 . Since the sets En \ En−1 and En−1 are disjoint,
Therefore
N
X N
X
λ(En \ En−1 ) = [λ(En ) − λ(En−1 )] = λ(EN ) − λ(Eo ) = λ(EN ),
n=1 n=1
λ(U ) = m(U ).
THEOREM 10.4.7
(a) If E ∈ M, then E c ∈ M.
(b) ∅, R ∈ M.
(c) If En ∈ M, n = 1, 2, ..., then
∞
[ ∞
\
En ∈ M and En ∈ M.
n=1 n=1
λ(E + x) = λ(E).
Proof. The result (a) is Corollary 10.4.3, whereas (b) follows from Theorem
10.3.6 and Corollary 10.4.3. The statement (c) is Theorem 10.4.5, whereas (d)
is Theorem 10.3.6. The proof of (e) follows from Exercise 1(b) of the previous
section.
Any collection A of subsets of a set X satisfying (a), (b), and (c) of the
previous theorem is called a sigma-algebra (σ-algebra) of sets. The σ denotes
that the collection A is closed under countable unions.
Remark. An alternate approach to the theory of measure is due to Constantin
Carathéodory (1873–1950) in which a subset E of R is said to be measurable
if
λ∗ (E ∩ T ) + λ∗ (E c ∩ T ) = λ∗ (T ) (5)
for every subset T of R. Since T = (E ∩ T ) ∪ (E c ∩ T ), by Theorem 10.3.9 one
always has λ∗ (T ) ≤ λ∗ (E ∩ T ) + λ∗ (E c ∩ T ). Thus E satisfies (5) if and only
if
λ∗ (E ∩ T ) + λ∗ (E c ∩ T ) ≤ λ∗ (T ).
The advantage to this approach is that it does not require the concept of inner
measure, and it includes both unbounded and bounded sets simultaneously.
If a subset E of R satisfies (5), taking T = [a, b], a, b ∈ R gives
Exercises 10.4
1. Find a sequence {En }∞
n=1 of measurable sets with E1 ⊃ E2 ⊃ · · · such
that
λ ∞
T
n=1 En 6= lim λ(En ).
n→∞
{x ∈ E : f (x) > s}
Then
[0, 1],
if s < 0,
{x ∈ [0, 1] : f (x) > s} = Qc ∩ (s, 1), if 0 ≤ s < 1,
∅, if s ≥ 1.
Proof. The set of (d) is the complement of the set in (a). Thus by Corollary
10.4.3, one is measurable if and only if the other is. Similarly for the sets of
(b) and (c). Thus it suffices to prove that (a) is equivalent to (b).
Suppose (a) holds. For each n ∈ N, let
1
En = x : f (x) > s −
n
490 Introduction to Real Analysis
Proof. Exercise 7.
Remark. The assertion that a set is of measure zero includes the assertion
that it is measurable. This however is not necessary. If instead we only require
that λ∗ ({x : P does not hold }) = 0, then by Theorem 10.3.5 the set
{x : P does not hold } is in fact measurable.
We will illustrate the concept of a property holding almost everywhere by
means of the following examples.
Then {x ∈ [0, 1] : f (x) 6= g(x)} = [0, 1] ∩ Q which has measure zero. Therefore
f = g a.e.
(b) In Theorem 10.5.4 we proved that if g is a real-valued measurable
function on [a, b] with g(x) 6= 0 for all x ∈ [a, b], then 1/g is also measurable
on [a, b]. Suppose we replace the hypothesis g(x) 6= 0 for all x ∈ [a, b] with
g 6= 0 a.e.; that is, the set
E = {x ∈ [a, b] : g(x) = 0}
492 Introduction to Real Analysis
then f (x) 6= 0 for all x ∈ [a, b] and f (x) = g(x) except for x ∈ E, which has
measure zero. Thus f = g a.e. on [a, b]. As a consequence of our next theorem,
the function f will also be measurable on [a, b].
(c) A real-valued function f on [a, b] is continuous almost everywhere
if
{x ∈ [a, b] : f is not continuous at x}
has measure zero. As in Example 6.1.14, consider the function f on [0, 1]
defined by
1,
x = 0,
f (x) = 0, if x is irrational
1
if x = m
n, n in lowest terms , x 6= 0.
has measure zero. To illustrate this, consider the sequence {fn } defined in
Example 8.1.2(c) as follows: Let {xk } be an enumeration of Q ∩ [0, 1]. For
each n ∈ N, define fn on [0, 1] by
(
0, x = xk , 1 ≤ k ≤ n,
fn (x) =
1, otherwise.
Then (
0, if x is rational,
lim fn (x) = f (x) =
n→∞ 1, if x is irrational.
Thus {x ∈ [0, 1] : {fn (x)} does not converge to 1} = Q ∩ [0, 1], which has
measure zero. Hence fn → 1 a.e. on [0, 1].
Proof. The result follows by Theorem 10.4.5, and the fact that for every
s ∈ R,
∞
[
{x : ϕ(x) > s} = {x : fn (x) > s} and
n=1
[∞
{x : ψ(x) < s} = {x : fn (x) < s}.
n=1
Exercises 10.5
1. *Let f be defined on [0, 1] by
0, x = 0,
1
f (x) = , 0 < x < 1,
x
2, x = 1.
lim λ({x ∈ [a, b] : |fn (x)| > t}) = λ({x ∈ [a, b] : |f (x)| > t}).
n→∞
where the infimum and supremum are taken over all measurable partitions Q
and P of [a, b], if and only if f is measurable on [a, b].
Remark. Although the previous theorem is stated for a closed interval [a, b],
the result is also true for f defined an any measurable subset A of R with
λ(A) < ∞.
As a consequence of the previous theorem, which we will shortly prove, we
make the following definition.
Lebesgue Measure and Integration 497
where the supremum is taken over all measurable partitions of [a, b]. IfR A is a
measurable subset of [a, b], the Lebesgue integral of f over A, denoted A f dλ,
is defined by Z Z
f dλ = f χA dλ.
A [a,b]
R
Remarks. (a) In defining A f dλ it was implicitly assumed that f is defined
on all of [a, b]. If f is only defined on the measurable set A, A ⊂ [a, b], then
f χA can still be defined on all of [a, b] in the obvious manner; namely,
(
f (x), x ∈ A,
(f χA )(x) =
0, x 6∈ A.
LL (Q, f ) = LL (P, f χA ).
Riemann integrable function on [a, b] is also Lebesgue integrable, and the two
integrals are equal.
Prior to proving Theorem 10.6.2, we first need the analogue of Theorem
6.1.4.
{Ei ∩ Aj }n,m
i=1,j=1
As a consequence,
sup LL (P, f ) ≤ inf UL (Q, f ).
P Q
Proof. The proof of the lemma is almost verbatim the proof of Lemma 6.1.3
and Theorem 6.1.4, and thus is left to the exercises (Exercise 2).
where the supremum and infimum are taken over all measurable partitions Q
of [a, b]. Since P = {E1 , ..., En } is a measurable partition of [a, b] and s(x) = αj
for all x ∈ Ej ,
Xn
LL (P, f ) = UL (P, f ) = αj λ(Ej ).
j=1
But then
n
X n
X
αj λ(Ej ) ≤ sup LL (Q, f ) ≤ inf UL (Q, f ) ≤ αj λ(Ej ).
Q Q
j=1 j=1
R n
P
Therefore s is Lebesgue integrable on [a, b] with s dλ = αi λ(Ei ).
[a,b] i=1
where the supremum is taken over all simple functions ϕ on [a, b] satisfying
ϕ(x) ≤ f (x) for all x ∈ [a, b].
Proof of Theorem 10.6.2. Suppose f is a measurable function on [a, b] with
m ≤ f (x) < M for all x ∈ [a, b]. Let β = M − n, and for n ∈ N, partition
[m, M ) into n subintervals of length β/n. For each k = 1, ..., n, set
β β
Ek = x ∈ [a, b] : m + (k − 1) ≤ f (x) < m + k .
n n
Then Pn = {E1 , ..., En } is a measurable partition of [a, b]. Also, if mk =
inf{f (x) : x ∈ Ek } and Mk = sup{f (x) : x ∈ Ek }, then
β β
mk ≥ m + (k − 1) and Mk ≤ m + k .
n n
Therefore
Define ϕ and ψ on [a, b] by ϕ(x) = supn ϕn (x) and ψ(x) = inf n ψn (x). By
Theorem 10.5.9 the functions ϕ and ψ are measurable functions on [a, b], with
Q = {Bj ∩ An,k }N
j=1 ∪ {[a, b] \ An,k }
N N
X 1X λ(An,k )
LL (Q, sn ) = m∗j λ(Bj ∩ An,k ) > λ(Bj ∩ An,k ) = .
j=1
k j=1 k
502 Introduction to Real Analysis
Combining the above two inequalities gives λ(An,k ) < k/n for all k, n ∈ N.
Since Ek ⊂ An,k for all n, for each k,
k
λ(Ek ) <
n
for all n ∈ N. Therefore λ(Ek ) = 0. Finally since
∞
X
λ(E) ≤ λ(Ek ),
k=1
If f is Riemann integrable on [a, b], then the upper and lower Riemann inte-
grals of f are equal, and thus
Z b Z b
f (x) dx ≤ sup LL (Q, f ) ≤ inf UL (Q, f ) ≤ f (x) dx,
a Q Q a
Lebesgue Measure and Integration 503
where the supremum and infimum are taken over all measurable partitions Q
of [a, b]. As a consequence of Theorem 10.6.2, this proves the following result.
Proof. Since the proof of (a) is similar to the proof of the corresponding result
for the Riemann integral we leave it as an exercise (Exercise 4). For the proof
of (b), by definition
Z Z
f dλ = f χA1 ∪A2 dλ.
A1 ∪A2 [a,b]
Since A1 ∩ A2 = ∅, f χA1 ∪A2 = f χA1 + f χA2 , and the result now follows by
(a).
(c) Consider the function h(x) = f (x) − g(x). By hypothesis h ≥ 0 a.e. on
[a, b]. Let
E1 = {x : h(x) ≥ 0} and E2 = [a, b] \ E1 .
Consider the measurable partition P = {E1 , E2 } of [a, b]. Then
Z
h dλ ≥ LL (P, f ) = m1 λ(E1 ) + m2 λ(E2 ).
[a,b]
Remark. Although we state and prove the bounded convergence theorem for
a closed and bounded interval [a, b], the conclusion is still valid if the sequence
Lebesgue Measure and Integration 505
Furthermore, gn (x) → g(x) for all x ∈ [a, b]. Let ǫ > 0 be given. For m ∈ N,
set
Em = {x ∈ [a, b] : |g(x) − gn (x)| < ǫ for all n ≥ m}.
S∞
Then E1 ⊂ E2 ⊂ · · · with n=1 Em = [a, b]. Therefore
∞
\
c
Em = ∅.
m=1
c c
Here Em = [a, b] \ Em . Thus by Theorem 10.4.6 lim λ(Em ) = 0. Choose
m→∞
c
m ∈ N such that λ(Em ) < ǫ. Then |g(x) − gn (x)| < ǫ for all n ≥ m and all
x ∈ Em . Therefore
Z b Z b Z b Z b Z
f dλ − fn dλ = g dλ − gn dλ ≤ |g − gn | dλ
a a a a [a,b]
Z Z
= |g − gn | dλ + |g − gn | dλ
Em c
Em
c
< ǫ λ(Em ) + 2M ǫ[b − a + 2M ].
λ(Em ) <
R R
Since ǫ > 0 was arbitrary, we have lim [a,b] fn dλ = [a,b] f dλ.
n→∞
Combining the bounded convergence theorem with Corollary 10.6.8, we
obtain the bounded convergence theorem for Riemann integrable functions
previously stated in Chapter 8. The theorem does require the additional hy-
pothesis that the limit function f is Riemann integrable.
THEOREM 8.4.3 Let f and fn , n ∈ N, be Riemann integrable functions on
[a, b] with lim fn (x) = f (x) for all x ∈ [a, b]. Suppose there exists a positive
n→∞
constant M such that |fn (x)| ≤ M for all x ∈ [a, b] and all n ∈ N. Then
Z b Z b
lim fn (x) dx = f (x) dx.
n→∞ a a
506 Introduction to Real Analysis
EXAMPLES 10.6.12 (a) In the first example we show that the conclu-
sion of the bounded convergence theorem is false if the sequence {fn } is
not bounded; that is, there does not exist a finite constant M such that
|fn (x)| ≤ M for all n ∈ N, and all x ∈ [a, b]. For each n ∈ N, define fn
on [0, 1] by (
n, 0 < x ≤ n1 ,
fn (x) =
0, otherwise.
Then {fn }∞
n=1 is a sequence of measurable functions on [0, 1] that is not
bounded but which satisfies
1
1
Z
fn (x) dx = .
0 2k
R1
Thus lim fn (x) dx = 0. On the other hand, if x ∈ [0, 1], then the sequence
n→∞ 0
{fn (x)} contains an infinite number of 0’s and 1’s, and thus does not converge.
Exercises 10.6.
1. *Let f be a bounded real-valued measurable function on [a, b] with m ≤
f (x) < M for all x ∈ [a, b]. Set β = M − m. For n ∈ N and j = 1, ..., n,
β β
let Ej = {x ∈ [a, b] : m + (j − 1) n ≤ f (x) < m + j n }.
The Lebesgue sums for f are defined by
n
P β
Sn (f ) = (m + (j − 1) n )λ(Ej ). Prove that
j=1
Z
lim Sn (f ) = f dλ.
n→∞ [a,b]
Lebesgue Measure and Integration 507
6. Let f be a bounded measurable function on [a, b]. For each c > 0, prove
that
Z
1
λ({x ∈ [a, b] : |f (x)| > c}) ≤ |f | dλ.
c [a,b]
7. *Let
R f be a nonnegative bounded measurable function on [a, b] satisfying
[a,b]
f dλ = 0. Use the previous exercise to prove that f = 0 a.e. on [a, b].
8. (Fundamental Theorem of Calculus for the Lebesgue Integral) If
f is differentiable on [a, b] and f ′ is bounded on [a, b], then f ′ is Lebesgue
integrable, and
Z
f ′ dλ = f (b) − f (a).
[a,b]
∞ π/2 √
(1 − cos x)n sin x dx converges to a finite limit, and
P R
9. Prove that
n=0 0
find that limit.
12. *If f is a bounded real-valued measurable function on [a, b], prove that
there exists a sequence {sn } of simple functions on [a, b] such that
lim sn (x) = f (x) uniformly on [a, b].
n→∞
13. Modify the proof of the bounded convergence theorem where the interval
[a, b] is replaced by a bounded measurable set A.
14. Use Egorov’s theorem (Exercise 15, Section 10.5) to provide an alternate
proof of the bounded convergence theorem.
15. Let f be a bounded measurable function on [a, b].
a. Given
R ǫ > 0, prove that there exists a simple function ϕ on [a, b] such
that |f − ϕ| dλ < ǫ.
[a,b]
*b. If ϕ is a simple function on [a, b] and ǫ > 0, prove that there exists a
508 Introduction to Real Analysis
step function h on [a, b] such that ϕ(x) = h(x) except on a set of measure
less than ǫ.
c. Given
R ǫ > 0, prove that there exists a step function h on [a, b] such
that |f − h| dλ < ǫ.
[a,b]
n
1
P
16. Let Sn (x) = A
2 0
+ Ak cos kx + Bk sin kx. If |Sn (x)| ≤ M for all
k=1
x ∈ [−π, π] and n ∈ N and f (x) = lim Sn (x) exists a.e. on [−π, π],
n→∞
prove that f is measurable and that the Ak and Bk are the Fourier
coefficients of f .
DEFINITION 10.7.1
(a) Let f be a nonnegative measurable function defined on a bounded
R mea-
surable subset A of R. The Lebesgue integral of f over A, denoted A f dλ,
is defined by
Z Z Z
f dλ = lim fn dλ = sup min{f, n} dλ.
A n→∞ A n A
EXAMPLES √ 10.7.3 (a) For our first example we consider the function
f (x) = 1 x defined on (0, 1). Then for each n ∈ N,
1
n,
0 < x < 2,
fn (x) = min{f (x), n} = n
1 1
√ ,
≤ x ≤ 1.
x n2
Therefore
Z 1 1/n2 1
1 1 2 1
Z Z
fn dλ = n dλ + √ dx = + 2 − =2− .
0 0 1/n2 x n n n
510 Introduction to Real Analysis
As a consequence
Z Z
f dλ = lim fn dλ = lim (2 − n1 ) = 2.
(0,1) n→∞ (0,1) n→∞
Therefore
1 1/n 1
1
Z Z Z
min{g, n} dλ = n dλ + dx = 1 + ln n.
0 0 1/n x
Thus Z 1
g dλ = lim (1 + ln n) = ∞.
0 n→∞
Proof. We will indicate the method of proof by proving part of (a). The
remaining proofs are left to the exercises (Exercise 2). Suppose first that the
set A is bounded. Let h = f + g. Since
min{f (x) + g(x), n} ≤ min{f (x), n} + min{g(x), n} ≤ min{f (x) + g(x), 2n},
Therefore, since
Z Z Z
lim hn dλ = lim h2n dλ = (f + g) dλ,
n→∞ A n→∞ A A
the result R
follows fromRthe above by letting n → ∞. If one, or both of the
sequences A fn dλ R, A gn dλ diverges to ∞, then so does their sum. In
this case, we obtain A (f + g)dλ = ∞. If A is unbounded, then by the above,
for each n ∈ N,
Z Z Z
(f + g) dλ = f dλ + g dλ,
A∩[−n,n] A∩[−n,n] A∩[−n,n]
Fatou’s Lemma
Our first major convergence theorem for integrals of nonnegative measurable
functions is the following result of Fatou.
Proof. Suppose first that the set A is bounded. For each k ∈ N, let
EXAMPLE 10.7.6 In this example, we show that equality need not hold in
Fatou’s lemma. Consider the sequence {fn } on [0, 1] of Example 10.6.12(a).
For each n ∈ N, fn (x) = n if 0 < x ≤ n1 , and fn (x) = 0 elsewhere. This
sequence satisfies
Z Z
0= ( lim fn ) dλ < 1 = lim fn dλ.
[0,1] n→∞ n→∞ [0.1]
Remark. Fatou’s lemma is often used to prove that the limit function f of a
convergent sequence ofR nonnegative Lebesgue integrable functions is Lebesgue
integrable. For if lim A fn dλ < ∞
R and if fn → f a.e. on A with fn ≥ 0 a.e.
for all n, then by Fatou’s lemma A f dλ < ∞. Thus f is integrable on A.
If f (x) > 0, then f + (x) = f (x) and f − (x) = 0. On the other hand, if f (x) < 0,
then f + (x) = 0 and f − (x) = −f (x). If f is measurable on A, then f + and
f − are nonnegative measurable functions on A with
for all x ∈ A.
Our natural inclination is to define the integral of f over A by
Z Z Z
f dλ = f + dλ − f − dλ.
A A A
Lebesgue Measure and Integration 513
R +
RThe −only problem with this definition is that it is possible that A f dλ =
A
f dλ = ∞ giving the undefined ∞−∞ in the above. However, if we assume
that both f + and f − are integrable on A, then the above definition makes
sense. Furthermore, if f is measurable, and f + and f − are both integrable on
A, then |f | is also integrable on A. Conversely, if f is measurable and |f | is
integrable on A, then since f + ≤ |f | and f − ≤ |f |, by Theorem 10.7.4(c) both
f + and f − are integrable on A. Therefore we make the following definition.
= f dλ + g dλ.
A A
from which the result follows. (c) The proof of (c) also follows from (a) and
the fact that since A1 ∩ A2 = ∅, f χA1 ∪A2 = f χA1 + f χA2 .
Lebesgue Measure and Integration 515
Therefore, Z Z
f dλ ≤ lim fn dλ.
A n→∞ A
Similarly, by applying Fatou’s lemma to the sequence {g − fn }n∈N , which is
again a sequence of nonnegative functions on A,
Z Z Z Z
(g − f ) dλ ≤ lim (g − fn ) dλ = g dλ + lim (−fn )dλ.
A n→∞ A A n→∞ A
R R
But lim A
(−fn )dλ = − lim fn dλ. Therefore,
n→∞ n→∞ A
Z Z
f dλ ≥ lim fn dλ.
A n→∞ A
Exercises 10.7
1. RLet f be a nonnegative measurable function on a measurable set A. If
A
f dλ = 0, prove that f = 0 a.e. on A.
2. *a. Prove Theorem 10.7.4(b).
b. Prove Theorem 10.7.4(c).
3. Let A be a measurable subset of R.
a. If f is integrable on A and g is bounded and measurable on A, prove
that f g is integrable on A.
b. If f and g are integrable on A, is the function f g integrable on A?
4. *Let fp (x) = x−p , x ∈ (0, 1). Prove that fp is integrable on (0, 1) for all
p, 0 < p < 1, and that
Z
1
fp dλ = .
(0,1) 1−p
5. Define f on [1, ∞) by
(√
n if x ∈ [n, n + 1/n2 ), n = 1, 2, . . . ,
f (x) =
0 otherwise.
Show that f ∈ L([1, ∞)) but f 2 6∈ L([1, ∞)).
6. Let P denote the Cantor set of Section 2.5. Define f on [0, 1] as follows:
f (x) = 0 for every x ∈ P , and f (x) = k for each x in the open interval
of length 1/3k on [0, 1] \ P . Prove that f is integrable on [0, 1] and that
R1
f dλ = 3.
0
Lebesgue Measure and Integration 517
9. *Let f be an integrable function on [a, b]. Given ǫ > 0, prove that there
Rb
exists a bounded measurable function g on [a, b] such that |f −g| dλ < ǫ.
a
20. Show that the monotone convergence theorem is false for decreasing se-
quences of measurable functions.
21. *a. Let f be a nonnegative measurable function on a measurable set A,
S {An } be a sequence of pairwise disjoint measurable subsets of A
and let
with n An = A. Prove that
Z X∞ Z
f dλ = f dλ.
A n=1 An
b. Prove that the conclusion of part (a) is still valid for arbitrary
f ∈ L(A).
22. Let {fn } be a sequence of measurable functions on [a, b] satisfying
|fn (x)| ≤ g(x) a.e., where g is integrable on [a, b]. If lim fn (x) = f (x)
n→∞
exists a.e. on [a, b], and h is any bounded measurable function on [a, b],
prove that
Z b Z b
f h dλ = lim fn h dλ.
a n→∞ a
23. *As in Exercise 4, Section 6.4, let f be defined on (0, 1) by
d
x2 sin x12 .
f (x) =
dx
Prove that f is not Lebesgue integrable on (0, 1).
24. Let f be a nonnegative measurable function on [a, b]. For each t ≥ 0, let
mf (t) = λ({x ∈ [a, b] : f (x) > t}).
a. Prove that mf (t) is monotone decreasing on [0, ∞).
Rb R∞
b. Prove that f dλ = mf (t) dt.
a 0
EXAMPLES 10.8.2
√
(a) For our first example let f (x) = 1 x, x ∈ (0, 1). By Exercise 4 of
R1
Section 10.7, f is integrable on (0, 1) with 0 f dλ = 2. Since f 2 (x) = 1/x, by
Example 10.7.3(a), f 2 is not integrable on (0, 1) and thus f 6∈ L2 ((0, 1)). On
the other hand, if g(x) = x−1/3 , then g 2 (x) = x−2/3 , which by Exercise 4 of
the previous section is integrable. Thus g ∈ L2 ((0, 1)) with
Z 1
kgk22 = x−2/3 dx = 3.
0
(b) Consider the function f (x) = 1 x for x ∈ [1, ∞). For any n ∈ N, n ≥ 2,
Z n Z n
1
|f |2 dλ = x−2 dx = 1 − .
1 1 n
Thus Z ∞ Z n
kf k22 = 2
|f | dλ = lim |f |2 dλ = 1
1 n→∞ 1
Cauchy-Schwarz Inequality
Our first result will be the analogue of the Cauchy-Schwarz inequality for ℓ2 .
The following inequality is sometimes also referred to as Hölder’s inequality.
520 Introduction to Real Analysis
( |f g| dλ)2
R
0 ≤ kf k22 − A ,
kgk22
Our next result is Minkowski’s inequality for the space L2 . Since the proof
of this is identical to the proof of Theorem 7.4.5, we leave the details to the
exercises.
kf + gk2 ≤ kf k2 + kgk2 .
Proof. Exercise 4.
THEOREM 10.8.5 The normed linear space (L2 ([a, b]), k k2 ) is complete.
Remark. Although we state and prove the result for a closed and bounded
interval, the same method of proof will work for L2 (A) where A is any mea-
surable subset of R.
Before proving the theorem, we first state and prove the following lemma.
and for some finite constant C. Then f (x) = lim fn (x) is finite a.e. on A.
n→∞
Proof. Since {fn (x)} is monotone increasing for each x ∈ A, the sequence
either converges to a real number or diverges to ∞. Let f (x) = lim fn (x),
n→∞
and let
E = {x ∈ A : f (x) = ∞}.
We will prove that λ(E) = 0. For each k ∈ N, let
But
1 1 C
Z Z Z
λ(An,k ) = 1 dλ ≤ fn dλ ≤ fn dλ ≤ .
An,k k An,k k A k
Therefore λ(Ek ) ≤ C/k for all k ∈ N. Since λ(E1 ) < ∞, by Theorem 10.4.6(b),
By Minkowski’s inequality,
2
Z k
X
gk2 dλ = kgk k22 ≤ kfn1 k2 + kfnj+1 − fnj k2
[a,b] j=1
2
k
X 1 ≤ (kfn1 k2 + 1)2 .
≤ kfn1 k2 + k
j=1
2
Thus the sequence {gk2 } satisfies the hypothesis of Lemma 10.8.6. Therefore
lim gk2 is finite a.e. on [a, b]. But then lim gk is also finite a.e. on [a, b]. As a
k→∞ k→∞
consequence the series
∞
X
|fn1 (x)| + |fnj+1 (x) − fnj (x)|
j=1
But the kth partial sum of this series is fnk+1 (x). Therefore the sequence
{fnk }k∈N converges a.e. on [a, b]. Let E denote the set of x ∈ [a, b] for which
this sequence converges. Then λ([a, b] \ E) = 0. Define
(
lim fnk (x), x ∈ E,
f (x) = k→∞
0, otherwise.
by Fatou’s lemma
Z Z
|f |2 dλ ≤ lim gk2 < ∞.
[a,b] k→∞ [a,b]
Therefore kf − fnk k2 < 1/2k for all k ∈ N. Thus the subsequence {fnk }k∈N
converges to f in the norm of L2 . Finally by the triangle inequality,
1
kf − fn k2 ≤ kf − fnk k2 + kfnk − fn k2 < + kfnk − fn k2 .
2k
From this it now follows that the original sequence {fn } also converges to f
in the norm of L2 .
Fourier Series
We close this chapter by making a few observations about Fourier series and
the space L2 ([−π, π]). As in Definition 9.3.1, if f is Lebesgue integrable on
[−π, π], the Fourier coefficients of f with respect to the orthogonal system
{1, cos nx, sin nx}∞
n=1 are given by
1 π
Z
an = f (x) cos nx dx, n = 0, 1, 2, ...,
π −π
Z π
1
bn = f (x) sin nx dx, n = 1, 2, ...
π −π
Since f is Lebesgue integrable, the functions f (x) sin nx and f (x) cos nx are
measurable on [−π, π], and in absolute value less than or equal to |f (x)|. Thus
the functions f (x) cos nx and f (x) sin nx are all integrable on [−π, π].
The same method of proof used in proving Bessel’s inequality for Riemann
integrable functions proves the following (Exercise 8): If f ∈ L2 ([−π, π]) and
{ak } and {bk } are the Fourier coefficients of f , then
∞ π
1 2 X 2 1
Z
a + ak + b2k ≤ f 2 dλ. (Bessel’s Inequality)
2 0 π −π
k=1
then there exists f ∈ L2 ([−π, π]) whose Fourier coefficients are precisely {ak }
and {bk }.
which by orthogonality
n
X
=π (a2k + b2k ).
k=m+1
Therefore
π π
1 1
Z Z
f (x) cos mxdx − am = (f (x) − Sn (x)) cos mx dx
π −π π −π
1 1
≤ kf − Sn k2 k cos mxk2 = √ kf − Sn k2 .
π π
Since this holds for all n > m, letting n → ∞ gives
1 π
Z
am = f (x) cos mx dx.
π −π
A similar argument proves that the bm are the sine coefficients of f .
Lebesgue Measure and Integration 525
THEOREM 10.8.8
∞
X bn
(a) If bn > 0 for all n and = ∞, then
n=1
n
∞
X
bn sin nx
n=1
P∞ sin nx
Since the sequence 1 (ln n) satisfies hypothesis (a), the series
n=2 ln n
is not the Fourier series of any integrable
function on [−π, π]. However, it is
interesting to note that the sequence 1 (ln n) also satisfies hypothesis (b)
(Exercise 13), and thus the series
∞
X cos nx
n=2
ln n
Exercises 10.8
1. *For x ∈ (0, 1) let fp (x) = x−p , p > 0. Determine all values of p such
that fp ∈ L2 ((0, 1)).
2. For each n ∈ N, let In = (n, n + 1/n2 ). For a given sequence {cn } of
∞
P
real numbers, define f on [1, ∞) by f (x) = cn χIn (x). Show that
n=1
∞ c2
2 P n
f ∈ L ([1, ∞)) if and only if 2
< ∞.
n=1 n
8. If f ∈ L2 ([−π, π]) and {ak } and {bk } are the Fourier coefficients of f ,
prove that
∞
1 π 2
Z
1 2 X 2
a0 + ak + b2k ≤ f dλ.
2 π −π
k=1
Notes
Lebesgue’s development of the theory of measure and integration was one of the
great mathematical achievements of the twentieth century. His proof that every
bounded measurable function is Lebesgue integrable was based on the new idea of
partitioning the range of a function, rather than its domain. Lebesgue’s theory of
integration also permitted him to provide necessary and sufficient conditions for
Riemann integrability of a bounded function f .
In addition to the fact that the Lebesgue integral enlarged the family of integrable
functions, the power of the Lebesgue integral results from the ease with which it
handles the interchange of limits and integration. For the Riemann integral, uniform
convergence of the sequence {fn } is required. Otherwise, the limit function may not
be Riemann integrable. On the other hand, if {fn } is a sequence of measurable
functions on [a, b], then its pointwise limit f is also measurable. Hence if f is also
bounded, then f is integrable. The bounded convergence theorem is notable for its
simplicity of hypotheses and proof. It only requires that {fn } be uniformly bounded
and converge a.e. on [a, b]. This is sufficient to ensure that
Z Z
lim fn dλ = ( lim fn ) dλ.
n→∞ [a,b] [a,b] n→∞
With the additional hypothesis that the pointwise limit f is Riemann integrable,
the bounded convergence theorem is also applicable to a sequence {fn } of Riemann
integrable functions.
The bounded convergence theorem, or the dominated convergence theorem, are
also the tools required to prove the fundamental theorem of calculus for the Lebesgue
integral.
Theorem AR If f is differentiable and f ′ is bounded on [a, b], then f ′ is Lebesgue
b
integrable, and a
f ′ dλ = f (b) − f (a).
The proof of this result was requested in Exercise 8 of Section 10.6 . It fol-
lows simply by applying the bounded convergence theorem to the sequence {gn }
defined by gn (x) = n[f (x + n1 ) − f (x)]. Since f is differentiable, the sequence {gn }
converges pointwise to f . Also, by the mean value theorem the sequences {gn } is uni-
formly bounded on [a, b]. This then establishes the analogue of Theorem 6.3.2 for the
528 Introduction to Real Analysis
for all x ∈ [−π, π], then f , being the pointwise limit of a sequence of continuous
functions, is automatically measurable on [−π, π]. If f is also bounded, then f is in-
tegrable. If the sequence {Sn } of partial sums of the trigonometric series is uniformly
bounded, then by the bounded convergence theorem, the trigonometric series is the
Fourier series of f . In his 1903 paper, “Sur les series trigonometric,” 4 Lebesgue
showed that uniform boundedness of the partial sums may be removed; that bound-
edness of the function f itself was sufficient. This result was extended in 1912 by de
la Vallée-Poussin5 to the case were the function f is integrable on [−π, π]. The reader
is referred to the article by Alan Gluchoff for an overview of how trigonometric series
has influenced the various theories of integration.
3 Seepage 208 of the text by Natanson.
4 Annales Scientifiques de l’École Normale Supérieure, (3)20 (1903), 453–485.
5 “Sur l’unicité du développement trigonométrique,” Bull de l’Acad. Royale de Belgique
Micellaneous Exercises
1. Let A be a measurable subset of R. For f ∈ L(A) set
Z
kf k1 = |f | dλ.
A
b.
S∞Show that 3the collection {Ak } is pairwise disjoint with [− 12 , 21 ] ⊂
3
k=0 Ak ⊂ [− 2 , 2 ].
c. Use the above to show that λ∗ (A) = 0 and λ∗ (A) > 0, thus proving
that A is nonmeasurable.
Rx
5. Suppose f is Lebesgue integrable on [a, b]. If a f dλ = 0 for every x ∈
[a, b], prove that f = 0 a.e..
Supplemental Reading
Botts, T., “Probability theory and ence between Cantor sets,” Amer. Math.
the Lebesgue integral,” Math. Mag. 42 Monthly 101 (1994), 640–650.
(1969), 105–111. Maligranda, L., “A simple proof of
Burkill, H., “The periods of a pe- the Hölder and Minkowski inequality,”
riodic function,” Math Mag. 47 (1974), Amer. Math. Monthly 102 (1995), 256–
206–210. 259.
Darst, R. B., “Some Cantor sets Mc Shane, E. J., “A unified theory
and Cantor functions,” Math. Mag. 45 of integration,” Amer. Math. Monthly 80
(1972), 2–7. (1973), 349–359.
Priestly, W. M., “Sets thick and
Dressler, R. E. and Stromberg, K.
thin,” Amer. Math. Monthly 83 (1976),
R., “The Tonelli integral,” Amer. Math.
648–650.
Monthly 81 (1974), 67–68.
Thompson, B. S., “Monotone conver-
Gluchoff, A. D., “Trigonometric se-
gence theorem for the Riemann integral,”
ries and theories of integration,” Math.
Amer. Math. Monthly 117 (2010), 547–
Mag. 67 (1994), 3–20.
550.
Katznelson, Y. and Stromberg, K., Varberg, D. E., “On absolutely
“Everywhere differentiable, nowhere continuous functions,” Amer. Math.
monotone function,” Amer. Math. Monthly 72 (1965), 831–841.
Monthly 81 (1974), 349–354. Xiang, J. X., “A note on the
Koliha, J. J., “A fundamental the- Cauchy-Schwarz inequality,” Amer.
orem of calculus for Lebesgue integra- Math. Monthly 120 (2013), 456–459.
tion,” Amer. Math. Monthly 113 (2006), Wade, W. R., “The bounded conver-
551–555. gence theorem,” Amer. Math. Monthly
Kraft, R. L., “What’s the differ- 81 (1974), 387–389.
Bibliography
531
Hints and Solutions
Chapter 1
Exercises 1.1 page 5
2. (a) A ∩ B = B, A ∩ Z = {−1, 0, 1, 2, 3, 4, 5}, B ∩ C = {x : 2 ≤ x ≤ 3}. (b)
A × B = {(x, y) : −1 ≤ x ≤ 5, 0 ≤ y ≤ 3}. 4. (a) Suppose x ∈ A ∩ (B ∩ C).
Then x ∈ A and x ∈ B ∩ C. Since x ∈ B ∩ C, x ∈ B and x ∈ C. Thus x ∈ A ∩ B
and x ∈ C. Therefore x ∈ (A ∩ B) ∩ C. This proves A ∩ (B ∩ C) ⊂ (A ∩ B) ∩ C.
The reverse containment is proved similarly. 7.(a) Let x ∈ A ∪ (B ∩ C). Then
x ∈ A or x ∈ B ∩ C. If x ∈ A, then x ∈ A ∪ B and x ∈ A ∪ C. Therefore
x ∈ (A ∪ B) ∩ (A ∪ C). If x ∈ B ∩ C, then x ∈ B and x ∈ C. Hence x ∈ A ∪ B and
x ∈ A ∪ C, i.e., x ∈ (A ∪ B) ∩ (A ∪ C). Thus A ∪ (B ∩ C) ⊂ (A ∪ B) ∩ (A ∪ C). The
reverse containment is proved similarly. (c) Let x ∈ C \ (A ∩ B). Then x ∈ C and
x∈/ A ∩ B. Since x ∈ / A ∩ B we have x ∈ / A or x ∈/ B . If x ∈ / A then x ∈ C \ A.
Likewise, if x ∈/ B, then x ∈ C \ B. In either case, x ∈ (C \ A) ∪ (C \ B). Therefore
C \ (A ∩ B) ⊂ (C \ A) ∪ (C \ B). The reverse containment is proved similarly.
8. If A = {1, 2, 3} then P(A) = {∅, {1}, {2}, {3}, {1, 2}, {1, 3}, {2, 3}, A}. 11. Let
(x, y) ∈ A × (B1 ∪ B2 ). Then x ∈ A and y ∈ B1 ∪ B2 . Thus x ∈ A and y ∈ B1 or
y ∈ B2 . If y ∈ B1 then (x, y) ∈ A × B1 . If y ∈ B2 , then (x, y) ∈ A × B2 . In either
case, (x, y) ∈ (A × B1 ) ∪ (A × B2 ). The reverse containment is proved similarly.
Exercises 1.2 page 14
1. (b) No. The ordered pairs (1, −1) and (1, 3) contradict the definition of function.
(d) In terms of ordered pairs k = {−1, 1), (0, 3), (1, 5), (4, 7)} and thus is a function
from A into B. 2. (a) No! The ordered pairs (0, 1) and (0, −1) are both elements
of A. This however contradicts the definition of function. 3. (a) f ({1, 2, 3, 4}) =
{1, 3, 5, 7} and f −1 ({1, 2, 3,√4}) = {1.2}. 4. (a) f (A) = √ {y : 0 ≤ y ≤ 9}; f −1 (A) =
3 3 −1
{x : x + 1 ∈ A} = {x : − 3 ≤ x ≤ 1}. (c) f (x) = 3 x − 1, x ∈ R.
5. (b) For k ∈ N, (f ◦ g)(k) = 2k + 3. Therefore (f ◦ g)(N) = {2k + 3 : k = 1, 2, 3, . . . }
which is the set of odd integers greater or equal to 5. 6. (b) Range f = R, f is
one-to-one, and x = f −1 (y) = 31 (y + 2). (e) Range f = R, f is not one-to-one: If
y1 6= y2 , then (x, y1 ) 6= (x, y2 ) for any x, yet f (x, y1 ) = f (x, y2 ).
(f ) Range f = {y : 21 ≤ y ≤ 1}. f is not one-to-one. If 0 < x ≤ 1, then f (−x) =
f (x). 7. (a) Range f = {(x, y) ∈ R × R : x2 + y 2 = 1}. (b) f −1 ((1, 0)) =
−1
0, f ((0, −1)) = 3π 2
. 11. Assume f is not one-to-one and show that this leads to
a contradiction.
Exercises 1.3 page 20
1. (b) For n = 1, 1 = 12 . Assume the result is true for n = k. Then for n = k + 1,
1+3+5+· · ·+(2k −1)+(2(k +1)−1) = k2 +(2k +1) = (k +1)2 . (d) When n = 1,
13 = [ 21 ·1·2]2 . Assume true for n = k. Then for n = k+1, 13 +23 +· · ·+k3 +(k+1)3 =
533
534 Hints and Solutions
Sn −rSn = r−rn+1 , from which the result follows. 8. Hint: Let A = n1 (a1 +· · ·+an )
and write an+1 = xA for some x ≥ 0. Use the induction hypothesis to prove that
(a1 · · · an · an+1 )1/(n+1) ≤ x1/(n+1) A. Now use Bernoulli’s inequality to prove that
x1/(n+1) ≤ (n + x)/(n + 1). From this it now follows that
n+x 1
x1/(n+1) A ≤ A= (a1 + · · · + an + an+1 ).
n+1 n+1
Exercises 1.4. page 28
4. (a) Consider (a − b)2 . 5. (a) inf A = 0, sup A = 1. (c) inf C = 0, sup C = ∞.
(f ) inf F = 1, sup E = 3. (h) inf H = −2, sup H = 2 8. Apply Theorem
1.4.4. 14. (b) Since A and B are non-empty and bounded above, α = sup A and
β = sup B both exist in R. Since α = sup A we have a ≤ α for all a ∈ A. Similarly
b ≤ β for all b ∈ B. Therefore a + b ≤ α + β for all a ∈ A, b ∈ B. Thus α + β
is an upper bound for A + B, and thus γ = sup(A + B) ≤ α + β. To prove the
reverse inequality, we first note that since γ is an upper bound for A + B, a + b ≤ γ
for all a ∈ A, b ∈ B. Let b ∈ B be arbitrary, but fixed. Then a ≤ γ − b for all
a ∈ A. Thus γ − b is an upper bound for A and hence α ≤ γ − b. Since this holds
for all b ∈ B, we also have that b ≤ γ − α for all b ∈ B. Thus β ≤ γ − α; i.e.,
α + β ≤ γ. 15. (a) Let α = sup{f (x) : x ∈ X}, β = sup{g(x) : x ∈ X}. since
the range of f and g are bounded, α and β are finite with f (x) + g(x) ≤ α + β
for every x ∈ X. Therefore α + β is an upper bound for {f (x) + g(x) : x ∈ X}.
Thus sup{f (x) + g(x) : x ∈ X} ≤ α + β. (d) Let α = sup{g(x) : x ∈ X}. Hence
g(x) ≤ α for all x ∈ X. Thus by hypothesis f (x) ≤ α for all x ∈ X. Therefore α
is an upper bound for {f (x) : x ∈ X}. As a consequence sup{f (x) : x ∈ x} ≤ α.
16. (a) F (x) = 3x + 2, sup{F (x) : x ∈ [0, 1]} = 5. (c) Range f = [0, 5]. Therefore
sup{f (x, y) : (x, y) ∈ X × Y } = 5. 20. (a) F (x) = 3x + 2 and H(y) = 2y. Thus
sup{H(y) : y ∈ [0, 1]} = 2 and inf{F (x) : x ∈ [0, 1]} = 2.
Exercises 1.5. page 32
1. First prove that if p and q are positive integers then there exists n ∈ N such that
np > q. If p > q then n = 1 works. If p ≤ q, consider (q + 1)p. 4. Suppose r1 , r2
are rational with r1 < r2 . Then r2 − r1 > 0 and r = r1 + 12 (r2 − r1 ) is rational with
√
r1 < r < r2 . 6. (a) Use the fact 2/2 is irrational. (b) Use Theorem 1.5.2 and
Hints and Solutions 535
(a). 8. Since x < y and u > 0, we have x/u < y/u. Now apply Theorem 1.5.2 and
(a).
Exercises 1.6. page 36
1. (a) .0202020 . . . 2. (c) .0101 = 212 + 214 = 83 . (d) .010101 · · · = 212 + 214 +
∞ h i
1
· · · = 212 ( 14 )n = 14 1−1 1 = 13 . (f ) .001001 · · · = 213 + 216 + 219 + · · · =
P
26
n=0 4
∞ h i
1 1 n 1
( 8 ) = 8 1− 1 = 7 . 3. (a) .0022 = 30 + 302 + 323 + 324 = 27
1 1 2 2 8
P
23
+ 81 = 81 .
n=0 8
∞ h i
(d) .101010 · · · = 31 + 302 + 313 + 304 + 315 + · · · = 13 ( 91 )n = 13 1−1 1 = 38 . (f )
P
n=0 9
∞ ∞ ∞ ∞
1 2 1 1 n 2 1 n 1 9 2 9
= 58 .
P P P P
.121212 · · · = 32k+1
+ 32k
= 3 9
+ 9 9
= 3
· 8
+ 9
· 8
k=0 k=1 n=0 n=0
4. .010101 · · · . 5. Finite binary expansion is .0011 whereas the infinite binary
expansion is .0010111 · · · .
Exercises 1.7. page 45
1. (c) Let f : N → O be defined by f (n) = 2n−1. 4. (a) g(x) = a+x(b−a) is a one-
to-one mapping of (0, 1) onto (a, b). 6. (a) Since A ∼ X, there exists a one-to-one
function h from A onto X. Similarly, there exists a one-to-one function g from B onto
Y . To prove the result, show that F S : A×B →TX ×Y defined by F (a, b) = (h(a), g(b))
is one-to-one and onto. 8. (a) An = S R, An = {x :T−1 < x < 1}.
(c) An = (−1, 2), An = [0, 1]. (e) An = (0, 1), An = { 21 }. 12. (a) Let
S T
y ∈ f (∪α Eα ). Then y = f (x) for some x ∈ ∪α Eα . But then x ∈ Eα for some α.
Therefore y = f (x) ∈ f (Eα ). Thus f (∪α Eα ) ⊂ ∪α f (Eα ). The reverse containment
follows similarly. 13. (b) Since the set of rational number Q is countable and
the set of real numbers R is uncountable, by part (a) the set of irrational numbers,
namely R \ Q, is uncountable. 15. (a) For n ∈ N let Pn denote the set of all
polynomials in x of degree less than or equal to n with rational coefficients, and let
Qn+1 = Q × · · · × Q (n + 1 times). By repeated application of Exercise 6 (b), Qn+1 is
countable. Define f : Qn+1 → Pn by f (a0 , a1 , . . . , an ) = an xn + · · · + a1 x + a0 . Since
f maps Qn+1 onto Pn and Pn is infinite, Pn is countable. (b) By part (a) Pn is
countable. Thus by Theorem 1.7.15, ∪n Pn , the set of all polynomials with rational
coefficients, is countable. 18. (a) Consider the function on (0, 1) that for each
n ∈ N, n ≥ 2, maps n1 to n−1 1
, and is the identity mapping elsewhere. 19. For a
polynomial p(x) = an xn + · · · + a1 x + a0 , consider the height h of the polynomial
defined by h = n + |a0 | + |a1 | + · · · |an |. Prove that there are only a finite number of
polynomials with integer coefficients of a given height h, and therefore only a finite
number of algebraic numbers arising from polynomials of a given height h. 22. If
f is a function from A → P(A), show that f is not onto by considering the set
{x ∈ A : x 6∈ f (x)}. 23. For a, b ∈ [0, 1] with decimal expansion a = .a1 a2 . . . and
b = .b1 b2 . . . , consider the function f : [0, 1]×[0, 1] → [0, 1] by f (a, b) = .a1 b1 a2 b2 . . . .
Chapter 2
Exercises 2.1. page 56
2. (b) We first note that |x| = |x − y + y| ≤ |x − y| + |y|. Therefore |x| − |y| ≤ |x − y|.
Interchanging x and y gives |y| − |x| ≤ |y − x| = |x − y|. Now use the definition of
||x| − |y||. 5. (a) −3 ≤ x ≤ 13/3. (c) −1 < x < 2. 7. (c) This is a metric.
536 Hints and Solutions
The only nontrivial part is the triangle inequality. This follows from the following.
Since the ln function is increasing on (0, ∞), for a, b positive we have
ln(1 + a + b) ≤ ln((1 + a)(1 + b)) = ln(1 + a) + ln(1 + b).
11. (a)(i) Since the points are collinear, √ the distance is just the usual euclidean
distance, i.e. d(( 21 , 41 ), (− 12 , − 41 )) = 21 5. (ii) The points are not collinear. So
√
d(( 21 , 21 ), (0, 1)) = 21 2 + 1. 12. (b) For x ∈ [0, 1], |x − x2 | = x − x2 , which has
a maximum at x = 21 . Therefore d(f, g) = 41 . 14. Again the only non-trivial part is
the triangle inequality and it follows from the following: for a, b positive,
a+b a b a b
= + ≤ + .
1+a+b 1+a+b 1+a+b 1+a 1+b
Exercises 2.2 page 67
S
2. Let p ∈ O = α Oα be arbitrary. Then p ∈ Oαo for some αo ∈ A. Since Oαo
is open, there exists ǫ > 0 such that Nǫ (p) ⊂ Oαo . But then Nǫ (p) ⊂ O; i.e., p is
an interior point of O. 3. (a) By Corollary 2.2.16 a finite set has no limit points.
Now apply Theorem 2.2.14. This can also be proved directly by showing that the
complement ofpa finite set is the finite union of open intervals.
5. (b) Since x21p + x22 ≤ |x1 | + |x2 | we obtain Nǫ1 (p) ⊂ Nǫ2 (p). Likewise, since
max{|x1 |, |x2 |} ≤ x21 + x22 it follows that Nǫ2 (p) ⊂ Nǫ∞ (p). The last containment
follows since |x1 | + |x2 | ≤ 2 max{|x1 |, |x2 |}. 8. (a) For E = (0, 1) ∪ {2}, Int E =
(0, 1), E ′ = [0, 1], isolated points = {2}, E = [0, 1] ∪ {2}. (d) For E = { n1 : n ∈
N}, Int(E) = ∅, E ′ = {0}, isolated points = E, E = E ∪ {0}. 9. (a) {a, b}.
13. (a) Closed in X. (c) Neither. 16. Since α ∈ / A, for every ǫ > 0 there exists
an a ∈ A such that α − ǫ < a < α. Therefore α is a limit point of A. 17. (a)
Let p ∈ Int(E) be arbitrary. Since p is an interior point of E, there exists an ǫ > 0
such that Nǫ (p) ⊂ E. To show that Nǫ (p) ⊂ Int(E) it remains to be shown that
every q ∈ Nǫ (p) is an interior point of E. 19. (a) Since A ∪ B is a subset of A ∪ B
(which is closed), by Theorem 2.2.18(c) A ∪ B ⊂ A ∪ B. The reverse containment
follows analogously. 22. Let {rn }∞ ∞
n=1 be an enumeration of Q, and {ǫn }n=1 an
enumeration of the positive rational numbers. Take I = {Nǫj (rn ) : j, n ∈ N}.
23. Suppose U ⊂ R is open and suppose E ⊂ U is open in U . Let p ∈ E be arbitrary.
Use the fact that E is open in U and that U is open to show that there exists an ǫ > 0
such that Nǫ (p) ⊂ E. Thus p is an interior point of E. Since p ∈ E was arbitrary,
E is open in R. The converse is obvious. 25. (a) Let U = (0, 1) and V = ( 23 , 52 ).
26. Hint: Use the fact that A = A ∪ A′ , and if U open satisfies U ∩ A′ 6= ∅ then
U ∩ A 6= ∅. 27. Let A be a non-empty subset of R. If A contains at least two points
and is not an interval, then there exist r, s ∈ A with r < s and t ∈ R with r < t < s,
but t 6∈ A. The open sets U = (−∞, t) and V = (t, ∞) will prove that A is not
connected. Therefore, every connected set is an interval. Conversely, suppose A is
an interval and A is not connected. Then there exist disjoint open sets U and V
with A ∩ U 6= ∅, A ∩ V 6= ∅, and A ⊂ U ∪ V . Suppose a ∈ A ∩ U and b ∈ A ∩ V . By
Theorem 2.2.20 applied to U and V there exist disjoint open intervals I and J such
that a ∈ I and b ∈ J. Suppose a < b and J = (t, s). Show that t 6∈ U ∪ V , but t ∈ A.
This contradiction proves that A is connected.
Exercises 2.3. page 73
1. (b) Use the fact that 0 is a limit point of A. 3. (a) Let U = {Uα }α∈A be an
open cover of A ∪ B. Then U is also an open cover of A and B, respectively. Now
use the compactness of A and B to obtain a finite subcover of A ∪ B. 4. Since
K is compact, by Theorem 2.3.5 K is closed. Now show that K is bounded. Let
Hints and Solutions 537
α = sup K. This exists since K is non-empty and bounded. Now use the fact that
K is closed to show that α ∈ K.
Exercises 2.4. page 76
1. Take Kn = [0, n]. 3. Suppose In = [an , bn ]. Since In ⊃ Im for all m ≥ n, we
have an ≤ am ≤ bm ≤ bn for all m ≥ n. Let a = sup{an } and b = inf{bn }. Now
show that ∩In = [a, b].
Exercises 2.5. page 79
1
4. If x ∈ P with x = .a1 a2 · · · , an ∈ {0, 2}, set bn = a .
2 n
Consider the function
x → .b1 b2 · · · .
Chapter 3
Exercises 3.1. page 87
1. (a) Let an = (3n + 5) (2n + 7). Then |an − 32 | = 4 (4n + 7) < 1/n. Given ǫ > 0,
choose no ∈ N such that no ≥ 1/ǫ. Then for all n ≥ no , |an − 23 | < ǫ. Therefore
lim an = 23 . (c) Set an = (n2 + 1)/2n2 . Then |an − 21 | = 2n1 2 < 2n 1
. Given ǫ > 0,
1
choose no ∈ N such that no ≥ 1/2ǫ. √ Then for all n ≥
√ n o , |a n − | < ǫ. Thus
1 √ √ 2 √
lim an = 2 . (f ) First show that n + 1 − n = 1 ( n + 1 + n) < 1 (2 n).
√
Now given ǫ > 0 choose no such that 1 (2 n) < ǫ for all n ≥ no . 2. (a) If n is
even then n(1 + (−1)n ) = 2n. Thus the sequence is unbounded and diverges in R.
(c) When n is even, i.e., n = 2k, then sin nπ
2
= sin kπ = 0. On the other hand, when
n is odd, i.e. n = 2k+1, then sin nπ
2
= sin(2k+1) π
2
= (−1)k . Thus sin nπ 2
assumes the
values −1, 0, and 1 for infinitely many values of n. Therefore the sequence diverges.
4. (a) Write b = 1 + a where a > 0. By Example 1.3.2(b), bn ≥ 1 + na. Now use
the previous exercise. 5. First show that |a2n − a2 | ≤ (|an | + |a|)|an − a|. Now use
the fact that since {an } converges, there exists a positive constant M such that
|an | ≤ M for all n ∈ N. 6. Consider a = 0 and a > 0 separately. For a > 0,
√ √ √ √
an − a = (an − a) ( an + a). 8. Take ǫ = a/2. Then for this ǫ, there exists
no ∈ N such that |an − a| < a/2 for all n ≥ no . From this it now follows that
an > a/2 for all n ≥ no .
Exercises 3.2. page 93
√
5. (b) If p > 1, let xn = n p − 1. Apply the inequality of Example 1.3.2(b) to
n
(1 + xn ) . 6. (a) 3/5. (c) −1. (e) 0. (g) a/2. 7. (a) converges to 1.
(c) converges to 0. (e) converges
to 2/3. 8. Use the fact that | cos x| ≤ 1.
10. (a) Suppose lim an+1 an = L < 1. Choose
ǫ > 0 such that L + ǫ < 1. For
this ǫ there exists no ∈ N such that an+1 an < L + ǫ for all n ≥ no . From this
one obtains that nfor n > no , 0 < an ≤ (L + ǫ)n−no ano = M (L + ǫ)n , where
M = ano (L + ǫ) o . Since (L + ǫ) < 1, by Theorem 3.2.6(e), lim (L + ǫ)n = 0.
n→∞
The result now follows by Theorem 3.2.4. 11. (a) With an = n2 an , 0 < a < 1,
an+1
L = lim = a lim (1+ n1 )2 = a. Thus since L = a < 1, the sequence converges.
n→∞ an n→∞
12. Set xn = (an − 1) (an + 1) and solve for an 13. (b) Verify the result for n = 1.
Assume the result holds for n = k. For n = k + 1, (1 + a)(k+1) = (1 + a)(1 + a)k
which by the induction hypothesis
k k k+1
X k j
X k j
X k j
= (1 + a) a = a + a .
j j j−1
j=0 j=0 j=1
538 Hints and Solutions
k+1
P k+1 j
Using part (a) show that the above is equal to a . 14. (a) By con-
j=0 j
sidering the sequence {ak − a} show first that one can assume a = 0.
Exercises 3.3. page 101
√
1. Take In = [n, ∞). 2. (a) Set an = n2 + 1 n = 1 + 1/n2 . Since 1/(n+1)2 <
p
1/n2 we have an+1 < an and lim an = 1. 4. Use mathematical induction to show
that an > 1 for all n ∈ N. From the inequality 2ab ≤ a2 + b2 , a, b ≥ 0, we have
2an ≤ a2n + 1 or an+1 = 2 − 1/an ≤ an . Therefore {an } is monotone decreasing.
Finally, if a = lim an , then a = lim an+1 = lim (2 − 1/an ) = 2 − 1/a. Therefore
n→∞ n→∞ √
a = 1. 6. (a) Use induction to show that xn > α for all n. The inequality
1 2 2 √
ab ≤ 2 (a + b ), a, b ≥ 0 should prove useful. Use the fact that xn > α to prove
that {xn } is monotone decreasing. Consider xn+1 − xn and simplify. 8. (c) {an }
is monotone increasing with an ≤ 3 for all n. If a = lim an then a2 = 2a + 3. Thus
a = 3. (e) {an } is monotone decreasing with an > 2 for all n and lim an = 2.
10. (a) e2 (c) e3/2 . 11. To show that {sn } is unbounded show that s2n > 1 + n2 .
Hint: First show it for n = 1, 2, and 3, then use mathematical induction to prove the
result for all n ∈ N. 13. Hint: For k ≥ 2, k12 ≤ k(k−1) 1
= k−11
− k1 . 15. (a) Write
a = 1 + b with b > 0. Now use the binomial theorem to show that an /n ≥ cn for
n
some positive constant c and n sufficiently large. (c) n + (−1) ≥ n − n1 ≥ (n − 1).
n√
n
16. (d) The sequence is not monotone: If xn = n + (−1) n, then x2n+1 < x2n .
21. This problem is somewhat tricky. It is not sufficient to just choose a monotone
increasing sequence in the set; one also has to guarantee that the sequence converges
to the least upper bound of the set. Let E be a non-empty subset of R that is bounded
above. Let U denote the set of upper bounds of E. Since E 6= ∅, we can choose an
element x1 ∈ E. Also, since E is bounded above, U = 6 ∅. Choose β1 ∈ U . Let
α1 = 12 (x1 + β1 ), and consider the two intervals [x1 , α1 ] and (α1 , β1 ]. Since x1 ∈ E,
one, or both of these intervals have non-empty intersection with E. If (α1 , β1 ]∩E 6= ∅,
choose x2 ∈ E such that α1 < x2 ≤ b1 . In this case set β2 = β1 . If (α1 , β1 ] ∩ E = ∅,
choose x2 ∈ E such that x1 ≤ x2 ≤ α1 , and set β2 = α1 . In this case β2 ∈ U .
Proceeding inductively construct two monotone sequences {xn } and {βn } such that
(a) {xn } ⊂ E with xn ≤ xn+1 for all n, (b) {βn } ⊂ U with βn ≥ βn+1 for all n,
and (c) 0 ≤ βn − xn ≤ 2−n+1 (b1 − x1 ). Assuming that every bounded monotone
sequence converges, let β = lim βn . By (c) we also have β = lim xn . It only remains
to be shown that β = sup E. 22. Suppose A = {xn : n ∈ N} is a countable subset
of [0, 1]. To show that A $ [0, 1] proceed as follows: At least one of the three closed
intervals [0, 31 ], [ 13 , 23 ], [ 23 , 1] does not contain x1 . Call it I1 . Divide I1 into three closed
intervals of length 1/32 . At least one of these, say I2 , does not contain x2 .
Exercises 3.4. page 106
3. (a) {−1, 0, 1}. (c) {1}. (e) {1, −3}. 4. (a) e2 . 10. For convenience we
take n = 2. Let {pn } be a bounded sequence in R2 where for each n, pn = (an , bn ).
But then the sequences {an } and {bn } are also bounded. Since {an } is bounded, by
the Bolzano-Weierstrass theorem there exists a subsequence {ank } that converges,
to say a. Since the sequence {bnk } is also bounded, it has a convergent subsequence,
say {bnkj } that converges to say b. But then the subsequence {pnkj } converges to
(a, b).
Hints and Solutions 539
Chapter 4
Exercises 4.1. page 142
1. (a) If f (x) = 2x − 7, L = −3, then |f (x) − L| = 2|x − 2|. Given ǫ > 0 take
δ = ǫ/2. Then for all x with |x − 2| < δ, |f (x) − L| = 2|x − 2| < 2δ = ǫ. (c) If
|x−1|
f (x) = x/(1+x), then |f (x)− 21 | = 2|x+1| < 12 |x−1| for all x > 0. Hence given ǫ > 0,
choose δ = min{2ǫ, 1}. With this choice of δ, x > 0 and thus |f (x) − 21 | < 21 δ ≤ ǫ.
(e) Let f (x) = (x3 + 1)/(x + 1). Since x3 + 1 = (x + 1)(x2 − x + 1), we have
f (x) = x2 −x+1 for x 6= −1. Therefore |f (x)−3| = |x+1||x−2|. But for −2 < x < 0
we have |x − 2| < 4. Therefore for all such x we have |f (x) − 3| < 4|x − (−1)|. Given
ǫ > 0 take δ = min{ 4ǫ , 1}. Then for |x − (−1)| < δ we have |f (x) − 3| < ǫ.
2. (c) We first note that x3 − p3 = (x − p)(x2 + xp + p2 ). For |x − p| < 1 we have
|x| < |p| + 1. Therefore |x3 − p3 | < (3|p|2 + 3|p| + 1)|x − p|. Hence given ǫ > 0
choose δ so that 0 < δ ≤ min{1, ǫ/(3|p|2 + 3|p| + 1). Then for |x − p| < δ we
√ √ √ √
have |x3 − p3 | < ǫ. (e) Note that for x > 0, x − p = (x − p)/( x + p).
√ √ √ √
Therefore | x − p| < |x − p|/ p. Given ǫ > 0 choose δ so that 0 < δ < pǫ.
√ √
Then if |x − p| < δ we have x > 0 and | x − p| < ǫ. 3. (a) The limit does not
exist. For x > 0, x/|x| = 1, whereas for x < 0, x/|x| = −1. (c) The limit does
not exist. Consider the sequence {1/nπ} which has limit 0 as n → ∞. (e) Since
540 Hints and Solutions
|f (y)| ≤ Mx for all y ∈ Nǫx (x) ∩ K. The collection {Nǫx (x)}x∈K is an open cover of
K. Now use compactness of K to show that there exists a positive constant M such
that |f (y)| ≤ M for all y ∈ K.
Exercises 4.3. page 161
2. (a) Suppose f (x) = x2 is uniformly continuous on [0, ∞). Then with ǫ = 1, there
exists a δ > 0 such that |f (x) − f (y)| < 1 for all x, y ∈ [0, ∞) satisfying |x − y| < δ.
Set xn = n and yn = n + n1 . If no ∈ N is such that no δ > 1, then |yn − xn | = n1 < δ
for all n ≥ no . But |f (yn ) − f (xn )| = 2 + n12 ≥ 2 for all n. This is a contradiction!
1 1
(c) Take pn = (2n+1) π and qn = nπ . Then |h(pn )| = 1 and h(qn ) = 0 for all
2
n. But {pn } and {qn } both converge to 0. Hence given any δ > 0 there exists
an integer no such that |pn − qn | < δ for all n ≥ no and |h(pn ) − h(qn )| = 1.
Hence h is not uniformly continuous on (0, ∞). 3. (a) For all x, y ∈ [0, ∞),
x y |x−y|
|f (x) − f (y)| = 1+x
− 1+y
= (1+x)(1+y)
< |x − y|. Thus given ǫ > 0, the choice
2 2
δ = ǫ will work. (c) We first note that |h(x)−h(y)| = (x2|y −x |
+1)(y 2 +1)
< |y −x| |x|+|y|
x2 +1
.
2
But for |y − x| < 1, |y| < |x| + 1. As a consequence (|x| + |y|)/(x + 1) < 2.
(Verify!) Therefore |h(x) − h(y)| < 2|y − x|. Hence given ǫ > 0, choose δ so that
0 < δ < min{1, ǫ/2}. (f ) Set g(x) = sin x x, x ∈ (0, 1] and g(0) = 1. Then g
is continuous on [0, 1], and thus by Theorem 4.3.4 uniformly continuous on [0, 1].
From this it now follows that f is uniformly continuous on (0, 1). 4. (a) Show
that |f (x) − f (y)| = | x1 − y1 | ≤ a12 |x − y| for all x, y ∈ [a, ∞), a > 0. (c) Using a
trigonometric identity forsin A − sin B we obtain
1 1 1 1 1 1 1 1 1 y−x
sin − sin = 2 sin − cos + ≤ 2 sin . Now us-
x y 2 x y 2 x y 2 xy
ing the | sin h| ≤ |h| we have
inequality
1 y−x |y − x| 1
2 sin ≤ ≤ |y − x| for all x, y ∈ [a, ∞).
2 xy xy a2
√ √ √ √ 1
5. (a) | x − y| = |x − y| ( x + y) ≤ 2√ a
|x − y| provided x, y ∈ [a, ∞).
(c) Assume that it does and obtain a contradiction. 7. (b) Suppose |f | and |g|
are bounded by M1 and M2 respectively. Then
|f (x)g(x) − f (y)g(y)| ≤ |f (x)||g(x) − g(y)| + |g(y)||f (x) − f (y)| ≤ M1 |g(x) − g(y)| +
M2 |f (x) − f (y)|.
Now use the uniform continuity of f and g. 10. Suppose f is not bounded. Then
there exists a sequence {xn } in E such that |f (xn )| → ∞. Since E is bounded,
{xn } has a convergent subsequence {xnk } in R. Thus {xnk } is Cauchy. Since f is
uniformly continuous, {f (xnk )} is Cauchy. But then by Theorem 3.6.2 the sequence
{f (xnk )} is bounded, which is a contradiction. 12. (a) By taking ǫ = 1 show that
there exists ro such that |f (x)| < |L| + 1 for all x ∈ [ro , ∞). Now use the continuity
of f on [a, ro ] to conclude that f is bounded on [a, ∞). 13. (a) Let x1 ∈ E be
arbitrary. For n ≥ 1 set xn+1 = f (xn ). Show that the sequence {xn } is contractive.
Exercises 4.4. page 175
2. (b) lim f (x) = lim f (x) = 0. (d) For 0 < |x| < 1, [x2 − 1] = −1, Hence
x→0+ x→0−
both the right and left limit at 0 exist and are equal to −1. (f) lim f (x) = 1.
x→0+
1 1
Hint: For n+1
< x ≤ n
, [ x1 ]
= n. 4. Use the fact that [x] is bounded near
1
xo = 2. 6. (b) b = −10. 7. (b) For n ∈ N set xn = n and yn = n + nπ , and
show that |f (xn ) − f (yn )| = sin 2. 10. (a) Define F on [a, b] by F (x) = f (x)
542 Hints and Solutions
for x ∈ (a, b] and F (a) = f (a+), which is assumed to exist. Then F is continuous
on [a, b], and thus uniformly continuous by Theorem 4.3.4. Hence f is uniformly
continuous on (a, b]. 12. If n ∈ N, g(x) = xn is continuous and strictly increasing
on (0, ∞) with Range g = (0, ∞). Therefore by Theorem 4.4.12, its inverse function
g −1 (x) = x1/n is also continuous on (0, ∞). From this it now follows that f (x) is
continuous on (0, ∞). 14. (a) Suppose first that U = (a, b) ⊂ I. Then since f is
strictly increasing and continuous on I, fS((a, b)) = (f (a), f (b)), which is open. For
an arbitrary open set U ⊂ I, write U = n In , where {In } is a finite or countable
collection of open intervals and use Theorems 1.7.14 and 2.2.9. 16. (b) f is strictly
increasing on [0, 21 ) and strictly decreasing on [ 21 , 1], and thus one-to-one on each of
the intervals. (c) f ([0, 21 )) = [0, 1) and f ([ 12 , 1]) = [1, 2]. Therefore f ([0, 1]) = [0, 2].
(d) For y ∈ [0, 1], f −1 (y) = 21 y, and for y ∈ [1, 2], f −1 (y) = 21 (3 − y). Therefore
f −1 (1−) = 21 and f −1 (1+) = 1. Thus f −1 is not continuous at yo = 1.
Chapter 5
Exercises 5.1. page 190
1. (a) For f (x) = x3 ,
f (x + h) − f (x) (x + h)3 − x3
f ′ (x) = lim = lim = lim (3x2 + 3xh + h2 ) = 3x2 .
h→0 h h→0 h h→0
(c) For h(x) = 1/x, x 6= 0,
1
− x1 −1 1
h′ (x) = lim x+h = lim = − 2.
h→0 h h→0 x(x + h) x
x f (x + h) − f (x) 1
(e) For f (x) = , x 6= −1, = .
x+1 h (x + 1)(x + h + 1)
1 1
Thus f ′ (x) = lim = .
h→0 (x + 1)(x + h + 1) (x + 1)2
2. If n ∈ N, by the binomial theorem
n
n
n n n k n−k n−1 n
hk−1 xn−k .
P P
(x + h) − x = h x = nhx +h
k=1 k k=2 k
Dividing by h and taking the limit as h → 0 proves the result for n ∈ N. If n ∈ Z
is negative, write xn = 1 xm , m ∈ N, and use Theorem 5.1.5(c). 3. (a) Since
cos x = sin(x + π2 ), by the chain rule dx d
cos x = cos(x + π2 ) = − sin x. Alternately
use the definition of the derivative. 5. (a) Yes. (c) No. (e) Yes.
7. (a) f ′ (x) exists for all x ∈ R \ Z. For x ∈ (k, k + 1), k ∈ Z , f (x) = x[x] = kx.
Thus f ′ (x) = k. (c) The function h is differentiable at all x where sin x 6= 0.
For x ∈ (2kπ, (2k + 1)π), k ∈ Z, h′ (x) = cos x. For x ∈ ((2k − 1)π, 2kπ), k ∈ Z,
h′ (x) = − cos x. 9. (b) For x 6= 0, g ′ (x) = 2x sin x1 − cos x1 . Since lim cos x1 does
x→0
not exist, lim g ′ (x) does not exist. 10. (a) 2a + b = 6. (b) a = 4, b = −2.
x→0
Justify why b = −2. 12. (a) f (x) = 2 (2x + 1). (c) h′ (x) = 3[L(x)]2 x.
′
′
14. (b) f+ (0) = lim h(b−1) sin h1 which exists and equals 0 if and only if (b−1) > 0;
h→0+
i.e., b > 1. 15. (b) No. Consider f (x) = |x| at xo = 0.
Exercises 5.2. page 203
3. (a) Increasing on R. (c) Decreasing on (−∞, 0) and increasing on (0, ∞),
with an absolute minimum at x = 0. (e) Increasing on (−∞, 2) ∪ (2, ∞), decreas-
ing on (0, 2). The function has a local minimum at x = 2. 5. (a) Let f (x) =
Hints and Solutions 543
1
(1 + x) 2 , x > −1. By the mean value theorem f (x) − f (0) = f ′ (ζ) where ζ is be-
tween 0 and x. If x > 0, then f ′ (ζ) < 21 . On the other hand, if x < 0 and x < ζ < 0,
then f ′ (ζ) > 12 . But then f ′ (ζ) < 12 x. (c) Take f (x) = xα , 0 < α < 1. Then
f (x) − f (a) = f ′ (ζ)(x − a) where a < ζ < x. But then f ′ (ζ) < αaα−1 .
6. (a) Show that the function f (x) = x1/n − (x − 1)1/n is decreasing on the interval
1 ≤ x ≤ a/b. (b) Set f (x) = αx − xα , x ≥ 0. Prove that f (x) ≥ f (1) to obtain
xα ≤ αx + (1 − α), x ≥ 0. Now take x = a/b. 7. (a) If f ′′ (c) > 0, then there exists
a δ > 0 such that f ′ (x) (x − c) > 0 for all x, |x − c| < δ. Therefore f ′ (x) < 0 on
(c − δ, c) and f ′ (x) > 0 on (c, c + δ). Thus f has a local minimum at c.
8. Show that f ′ (x) = 0 for all x ∈ (a, b). 9. Since P (2) = 0 we can assume that
P (x) = a(x − 2)2 + b(x − 2). Now use the fact that P must satisfy P (1) = 1 and
P ′ (1) = 2 to determine a and b. 10. a = −2, b = 2, c = 1. 12. (a) See Example
4.1.10 (d). (b) By the result of (a), f ′ (x)< 0 for all x ∈ (0, π2 ]. 14. (a) Let
tn → c. Since f ′ (c) exists, lim (f (tn ) − f (c)) (tn − c) = f ′ (c). Now apply the mean
n→∞
′
value theorem. 17. Since f+ (a) = lim (f (x) − f (a)) (x − a) > 0, there exists a
x→a +
δ > 0 such that (f (x) − f (a)) (x − a) > 0 for all x, a < x < a + δ.
19. Hint: Consider f ′ (x). 20. (b) Check the values of f ′ (x) at pn = 1/(nπ), n ∈ N.
24. (a) For fixed a > 0 consider f (x) = L(ax), x ∈ (0, ∞). (c) By (a) and (b),
L(bn ) = nL(b) for all n ∈ Z and b ∈ (0, ∞). But then L(b) = L((b1/n )n ) = nL(b1/n ).
From this it now follows the L(br ) = rL(b) for all r ∈ Q. Now use the continuity
of L to prove that L(bx ) = xL(b) for all x ∈ R where bx = sup{br : r ∈ Q, r ≤
x}. 25. (b) Since tan(arctan x) = x, by Theorem 5.2.14 and the chain rule,
d
dx
tan(arctan x) = (sec2 (arctan x))( dxd
arctan x) = 1. The result now follows from
2 2
the identity sec (arctan x) = 1 + x . To prove this, consider the right triangle with
sides of length 1, |x|, 1 + x2 respectively.
Exercises 5.3. page 212
f (x) f (x) − f (xo ) g(x) − g(xo )
2. = . Now use Theorem 4.1.6(c) and the definition
g(x) x − xo x − xo
of the derivative. 4. Use the fact that since lim f (x) exists, f (x) is bounded on
x→a+
x5 + 2x − 3 5x4 + 2 7
(a, a + δ) for some δ > 0. 6. (a) lim = lim = . (c) By
x→1 2x3 − x2 − 1 x→1 6x2 − 2x 4
ln x 1
l’Hopital’s rule, lim = lim = 0. (e) Make the substitution x = 1/t.
x→∞ x x→∞ x
(g) 0. Use repeated applications of l’Hospitals rule until the exponent of ln x is less
than or equal to zero. (i) 0. Use l’Hospitals rule twice on (sin x−x)/(x sin x), x 6= 0.
9. (a) f ′ (0) = 0. (b) f ′′ (0) = − 31 .
Exercises 5.4. page 219
1. Let c1 > 0 be arbitrary. By Newton’s method cn+1 = (2c3n + α) 3c2n .
2. (a) f (0) = 1 and f (1) = −1. Therefore f has a zero on the interval [0, 1].
With c1 = 0.5, c2 = 0.33333333, f (c2 ) = .037037037, c3 = 0.34722222, f (c3 ) =
0.000195587, c4 = .34729635, f (c4 ) = 0.000000015.
Chapter 6
Exercises 6.1. page 238
1.(a) Since f is increasing on [−1, 0] and decreasing on [0, 2], m1 = m2 = 0, m3 = −3
and M1 = M2 = 1, M3 = 0. Since ∆xi = 1, L(P, f ) = −3 and U(P, f ) = 2.
544 Hints and Solutions
(
−1, 0 ≤ x < 1,
2. (a) f x) = Let P = {x0 , x1 , ..., xn } be any partition of [0, 2]
2, 1 ≤ x ≤ 2.
and let k ∈ {1, ..., n} be such that xk−1 < 1 ≤ xk . Then L(P, f ) = 4 − 3xk and
U (P, f ) = 4−3xk−1 . Thus U(P, f )−L(P, f ) = 3(xk −xk−1 ). By Theorem 6.1.7 it now
follows that f is Riemann integrable on [0, R 2 2]. Also, since xk−1 < 1 ≤ xk , L(P, f ) ≤
1 < U (P, f ) for any partition P. Hence 0 f = 1. Alternatively, consider the parti-
tion P = {0, c, 1, 2} where 0 < c < 1 is arbitrary. For this partition, L(P, f ) = 1 and
U(P, f ) = 4 − 3c. The results now follow as above. 3. (a) If P = {xo , x1 , . . . , xn }
is a partition of [a, b], then inf{f (t) : t ∈ [xi−1 , xi ]} = sup{f (t) : t ∈ [xi−1 , xi ]} = c.
Pn
Therefore L(P, f ) = U(P, f ) = c(xi −xi−1 ) = c(b−a). 4. (a) Since f (x) = [3x]
i=1
is monotone increasing on [0, 1], f is Riemann integrable on [0, 1]. For n ≥ 4 consider
the partition Pn = {0, 13 − n1 , 31 , 23 − n1 , 23 , 1 − n1 , 1}. For this partition L(Pn , f ) = 1
R1
and U(Pn , f ) = 1 + n3 . From this it now follows that [3x] dx = 1.
0
(c) Since f is increasing on [0, 1] it is Riemann integrable. Take Pn =
n
n n
1 2
P i 1 3 P 1 P
{0, n , n , . . . , 1}. Then U (Pn , f ) = 3 +1 = 2
i + 1 =
i=1 n n n n
1
i=1 i=1
3 n(n + 1) R 3 1 5
+ 1. Therefore (3x + 1)dx = lim 1+ +1 = . 6. Let
2 n2 0 n→∞ 2 n 2
P = {x0 , x1 , ..., xn } be a partition of [a, b]. Since f (x) ≤ g(x) for all x ∈ [a, b],
sup{f (t) : t ∈ [xi−1 , xi ]} ≤ sup{g(t) : t ∈ [xi−1 , xi ]} for all i = 1, ..., n. As a
consequence U (P, f ) ≤ U (P, g) for all partitions P of [a, b]. Taking the infimum
Rb Rb
over P gives f ≤ g. The result now follows from the fact that f, g ∈ R[a, b].
a a
R1 R1 1
8. f = 0, f = 2
. 10. Since a ≥ 0, f is increasing on [a, b]. Thus if
0 0
P = {x0 , x1 , ..., xn } is a partition of [a, b], mi = x2i−1 and Mi = x2i . Therefore
n n
x2i−1 ∆xi and U(P, f ) = x2i ∆xi . Now show that x2i−1 ∆xi ≤
P P
L(P, f ) =
i=1 i=1
1 3
1
3
(x3i − x3i−1 ) ≤ x2i ∆xi . From this it will now follow that L(P, f ) ≤ (b − a3 ) ≤
3
Rb 1 3
U (P, f ). Since f is continuous, f ∈ R[a, b] with f = (b − a3 ). 12. (a) With
a 3
Pn = { nk : k = 0, 1,
. . . , n} and f (x) = x,
n k 1 P n n
k = 21 n(n + 1).
P P
U (Pn , f ) = = 2 k. By Exercise 1(a), Section 1.3,
k=1 n n k=1 k=1
R1
Therefore xdx = lim n(n + 1)/(2n2 ) = 1/2. (c) Take Pn = {0, n1 , n2 , . . . , 1}.
0 n→∞
Then 2
n i3
n
1 1 P 1 2 1 1
i3 = 4 12 n(n + 1) =
P
U (Pn , f ) = = 4 1+ . Therefore,
i=1 n3 n n i=1 n 4 n
R1 3
x dx = lim U(Pn , f ) = 1/4. 13. Use Theorem 6.1.7. 14. (a) Let Pn =
0 n→∞
{xo , x1 , x2 , . . . , xn } be a partition of [a, b], and for each i = 1, 2, . . . , n, let Mi and
Mi∗ denote the supremum of f and |f | respectively on [xi−1 , xi ]. Likewise, let mi
and m∗i denote the infimum of f and |f | respectively on the same intervals. Use the
Hints and Solutions 545
inequality ||f (s) − |f (t)|| ≤ |f (s) − f (t)| to prove that Mi∗ − m∗i ≤ Mi − mi from
which the result follows. 15. (a) Since f is bounded on [a, b], |f (x)| ≤ M for all
x ∈ [a, b]. Let P = {x0 , x1 , ..., xn } be any partition of [a, b]. For each i let Mi (f 2 ) and
mi (f 2 ) denote the supremum and infimum of f 2 respectively over [xi−1 , xi ], with an
analogous definition for Mi (f ) and mi (f ). Let ǫ > 0 be given. Then for each i there
exists si , ti ∈ [xi−1 , xi ] such that Mi (f 2 ) < f 2 (si ) + 21 ǫ and mi (f 2 ) > f 2 (ti ) − 12 ǫ.
Therefore
0 ≤ Mi (f 2 ) − mi (f 2 ) < f 2 (si ) − f 2 (ti ) + ǫ ≤ |f (si ) + f (ti )||f (si ) − f (ti )| + ǫ ≤
2M [Mi (f ) − mi (f )] + ǫ.
Since this holds for any ǫ > 0, Mi (f 2 ) − mi (f 2 ) ≤ 2M [Mi (f ) − mi (f )]. Now use
Theorem 6.1.7. 16. Assume that f is continuous on [a, b] except perhaps at a or
b, or both. Since f is bounded there exists M > 0 such that |f (x)| ≤ M for all
x ∈ [a, b]. Let ǫ> 0 be given. Choose y1 , y2 , a < y1 < y2 < b so that y1 − a < ǫ 8M
and b − y2 < ǫ 8M . Then
[sup{f (t) : t ∈ [a, y1 ]} − inf{f (t) : t ∈ [a, y1 ]}](y1 − a) ≤ 2M (y1 − a) < 41 ǫ.
Similarly for the interval [y2 , b]. Since f is continuous on [y1 , y2 ] there exists a par-
tition P of [y1 , y2 ] such that U(P, f ) − L(P, f ) < 21 ǫ. Let P ∗ = P ∪ {a, b}. Then P ∗
is a partition of [a, b], and by the above U(P ∗ , f ) − L(P ∗ , f ) < ǫ. Thus by Theorem
6.1.7, f ∈ R[a, b]. Suppose f is continuous on [a, b] except at a finite number of
points c1 , c2 , ..., cn with a ≤ c1 < c2 < · · · < cn ≤ b. Apply the above to each of
the intervals [a, c1 ], [c1 , c2 ], ..., [cn , b] to obtain a partition of [a, b] for which Theorem
6.1.7 holds.
Exercises 6.2. page 247
2. (a) Take ti = 31 (x2i + xi xi−1 + x2i−1 ). 3. (a) If c1 6= a, set c0 = a. Similarly,
Rc set
cn+1 = b if cn 6= b. First prove that f ∈ R[ci−1 , ci ], i = 1, 2, ..., n+1 with c i f = 0.
i−1
Now use Theorem 6.2.3. (b) Set h = f − g. 5. Since f is bounded, |f (x)| ≤ M
for all x ∈ [a, b] for some M > 0. Use equations (7), (8), and the fact that f ∈ R[c, b]
for every c ∈ (a, b) to prove that
Rb Rb Rc Rc
0 ≤ f − f = f − f ≤ 2M (c − a).
a a a a
1 Pn R1 1
Use this to show that f ∈ R[a, b]. 7. (a) lim 3
k2 = x2 dx = .
n→∞ n k=1 0 3
n
P n R1 1 π
(c) lim 2 2
= 2
dx = . 8. For each fixed c, 0 < c < 1, the series
n→∞ k=1 n + k 0 1+x 4 Rc
defining f becomes a finite sum on [0, c]. Evaluate 0 f and use Exercise 5.
Exercises 6.3. page 255
2. Since f is bounded, |f (x)| ≤ M for all x ∈ [a, b]. If x, y ∈ [a, b] with x < y, then
Ry Ry
|F (y) − F (x)| = f ≤ |f | ≤ M |y − x|.
x x
Thus F satisfies a Lipschitz condition on [a, b] and hence is uniformly continuous.
3. (b) F (x) = x for 0 ≤ x ≤ 21 ; F (x) = 23 − 2x for 21 < x ≤ 1. (d) F (x) = 0
for 0 ≤ x ≤ 31 ; F (x) = 21 (x2 − 19 ) for 31 < x ≤ 23 ; F (x) = x2 − 18
5
for 23 < x ≤ 1.
6. (b) F ′ (x) = cos x2 . (d) F ′ (x) = 2xf (x2 ). 7. By the chain rule, dx d
L( x1 ) =
− x1 = dxd
[−L(x)]. Therefore L( x1 ) = −L(x) + C for some constant C. Taking x = 1
shows that C = 0. 10. Let m and M denote the minimum and maximumR of f on
b
[a, b] respectively. Since g(x) ≥ 0 for all x, mg(x) ≤ f (x)g(x) ≤ M g(x). If a g > 0,
then from the previous inequality one obtains
546 Hints and Solutions
Rb Rb
m≤ fg g ≤ M.
a a
Rb
Now apply the intermediate value theorem to f . If a g = 0, use Theorem 6.2.2 to
Rb
prove that a f g = 0 and thus the conclusion holds for any c ∈ [a, b]. 12. (a) With
R2 ln
R2
ϕ(x) = ln x, by Theorem 6.3.8 lnxx dx = x dx = 12 (ln 2)2 . (b) Use integration
1 0 √
by parts. (e) 2 ln 2 − 1. Use Exercise 9 with ϕ(x) = x and√f (t) = t/(1 + t).
13. (b) With the given change of variable, dx = a sec2 t dt and a2 + x2 = a sec t.
Z a Z π
1 4
Therefore √ dx = sec t dt. To evaluate this last integral first establish
2
x +a 2
0 0
b 1/n
sin t cos t
|f (x)|n dx
R
that sec t = + . 14. That ≤ M is straight forward.
cos t 1 + sin t a
For the reverse inequality, given ǫ, 0 < ǫ < M , using continuity of f show that there
exists an interval I ⊂ [a, b] such that |f (x)| ≥ M − ǫ for all x ∈ I. Using this show
b 1/n
|f (x)|n dx ≥ (M − ǫ)ℓ(I)1/n . Explain why the result now follows.
R
that
a
c+h
15. First show that F+′ (c) = lim 1
R
f (x) dx. Since f is monotone increasing
h→0+ h c
f (c+ ) exists. Thus given ǫ > 0, there exists δ > 0 such that f (c+ ) ≤ f (x) < f (c+ )+ǫ
for all x, c < x < c + δ. Explain how the conclusion now follows.
Exercises 6.4 page 263
R1 R1
1. (a) Since 0 < p < 1, 0 x−p dx = lim c x−p dx = lim 1
1−p
[1 − c1−p ] = 1
1−p
.
c→0+ c→0+
Note: If p ≤ 0, then x−p is continuous Ron [0, 1], and if p ≥ 1 then the improper in-
1
tegral diverges. (d) Converges, with 0 x ln x dx = − 14 . (f ) In this problem the
Rc
integrand is undefined at x = 1. For 0 < c < 1, tan π2 x dx = − π2 ln(cos π2 c). Since
0
lim ln(cos π2 c) does not exist, the improper integral diverges. 2. (a) Converges,
c→1+
R∞ Rc
with 0
e−x dx = lim e−x dx = lim 1 − e−c = 1. (c) The improper inte-
c→∞ 0 c→∞
c
gral converges with lim 1 x−p dx = lim p−1 1
1 − c1−p = p−11
R
. (e) Diverges.
c→∞ c→∞
Rc 1
dx = ln(ln c) − ln(ln 2), which diverges to ∞ as c → ∞.
2 x ln x
c
x
dx = 12 ln(c2 + 1). The improper integral diverges since lim ln(c2 + 1)
R
(g) 2 +1
0 x c→∞
does not exist. 3. (a) For p > −1, the improper integral converges for all q ∈ R,
and for p < −1, the improper integral diverges for all q ∈ R. When p = −1,
the improper integral
R1 converges for all q < −1, and diverges for all q ≥ −1. 4.
Since limc→0+ c f (x)dx = sin 1 the improper integral converges. To show that the
1
improper integral of |f | diverges, first show that |f (x)| ≥ − 2x on each of the
x
intervals √ 1 1 , √ 1 1 . Use the above to find a partition PN such that
(2n+ 3 )π (2n− 3 )π
N
P
L(PN , |f |) ≥ C 1/k, for some positive constant C independent of N . From this
k=1
you can conclude that the improper integral of |f | diverges. 5. Hint: Use the fact
Hints and Solutions 547
Rc cos x Rc 1 1 1
that 0 ≤ |f (x)| − f (x) ≤ 2|f (x)|. 7. (a) 2
dx ≤ 2
dx = − . Thus
π x π x π c
Rc Rc sin x 1 cos c Rc cos x
lim |f | < ∞. (b) By integration by parts, dx = − − − dx.
c→∞ π π x π c π x2
c
R cos x cos c
By (a) and Exercise 5, lim dx exists. Also, lim = 0. Therefore,
c→∞ π x2 c→∞ c
c
R sin x
lim dx exists. 9. (a) To show convergence of the improper integral con-
c→∞ π x
sider the integrals of tx−1 e−t over the two intervals (0, 1] and [1, ∞). If x ≥ 1, then
tx−1 e−t is continuous on [0, 1], and thus the integral over [0, 1] clearly exists. If
0 < x < 1, then since e−t ≤ 1 for t ∈ (0, 1],
R1 R1 1 R1
0 ≤ tx−1 e−t dt ≤ tx−1 dt = (1−cx ). Thus lim tx−1 e−t dt < ∞. For t ≥ 1, use
c c x c→0+ c
1
l’Hospital’s rule to show that there exists a to ≥ 1 such that tx−1 ≤ e 2 t for all t ≥ to .
R
Thus tx−1 e−t ≤ e−t/2 for all t ≥ to , and as a consequence lim tx−1 e−t dt < ∞.
R
R→∞ 1
Thus the improper integral defining Γ(x) converges for all x > 0.
Exercises 6.5. page 278
1. 2f (0). 3. (a) See Exercise 2, Section 6.3. 5. (a) π2 − 1. Use Theorem 6.5.10.
(c) 12 + 22 + 32 . For x ∈ [0, 3], [x] = I(x − 1) + I(x − 2) + I(x − 3). Now use formula
R4 R4 R4 R2 R3 R4
(12). (f ) (x−[x])dx3 = (x−[x])3x2 dx = 3x3 dx+ 3x2 dx+ 6x2 dx+ 9x2 dx
1 1 1 1 2 3
∞
P 1
etc. 8. (a) n
. 12. (a) As in the solution of Exercise 15(a) of Section 6.1,
n=1 n2
if P = {x0 , x1 , ..., xn } is a partition of [a, b], Mi (f 2 ) − mi (f 2 ) ≤ 2M [Mi (f ) − mi (f )]
where M > 0 is such that |f (x)| ≤ M for all x ∈ [a, b]. From this it now follows that
0 ≤ U (P, f 2 , α) − L(P, f 2 , α) ≤ 2M [U(P, f, α) − L(P, f, α)]. Now apply Theorem
6.5.5.
Exercises 6.6. page 290
2. (a) With n = 4, h = .25. Set xi = .25 i, yi = f (xi ), i = 0, 1, 2, 3, 4. Then
y0 = 1.00000, y1 = 0.94118, y2 = 0.80000, y3 = 0.64000, y4 = 0.50000. Therefore,
.25
T4 (f ) = [y0 + 2y1 + 2y2 + 2y3 + y4 ] = 0.782795.
2
.25
S4 (f ) = [y0 + 4y1 + 2y2 + 4y3 + y4 ] = 0.785393.
3
By computation f ′′ (t) = 2(3t2 − 1) (1 + t2 )3 . Using the first derivative test, f ′′ (t)
has a local maximum of 12 at t = 1. Therefore |f ′′ (t)| ≤ 12 for all t ∈ [0, 1]. Thus by
equation (23) with n =4,
π 1 1 1
| − T4 (f )| ≤ = 0.0026042. Since π/4 = 0.7853982 (to seven dec-
4 12 2 42
imal places), |π/4 − T4 (f )| = 0.0026032. 3. (b) By computation, f (4) (x) =
7
3(4x − 1)(1 + x ) . By the first derivative testqthe function f (4) has a maxi-
2 2 −2
√
mum on [0, 2] at x = 3/2. Thus |f (4) (x)| ≤ 6 (1 + 34 )7 < 1. If we choose n
R2
(even) so that | 0 f − Sn (f )| < 10−5 , then we will be guaranteed accuracy to
four
decimal places. By inequality (26) with M = 1, we need to choose n so that
25 180n4 < 10−5 , or n4 > 17, 778. The value n = 12 will work. This value of n will
548 Hints and Solutions
guarantee that√E12 (f ) < 0.0000086. Compare your answer with the exact answer of
√
5 + 12 ln(2 + 5).
Exercises 6.7. page 296
2. Hint: Consider f (x) = 1 − χp (x), 0 ≤ x ≤ 1, and use Exercise 21(b) of Section
6.1. 4. No. Consider g = χQ on [0, 1] and let f be the zero function.
Chapter 7
Exercises 7.1. page 313
2. (a) Diverges. (c) Converges. (e) Diverges. (g) √ Converges by the ratio
test. P(i) Converges. (k) Diverges. First P show that k ≥ ln k for k ≥ 4. Thus
since 1/k diverges, by the comparison test 1/(ln k)2 diverges. (m) Converges
for p > 1; diverges for 0 < p ≤ 1. (o) Converges. Hint, rewrite k1 ln(1 + k1 ) as
1
ln(1 + k1 )k and use the comparison text. 3. (a) Converges to 1 (1 − sin p) for
k2
all p ∈ R for which
P | sin p| < 1; that is, for all p 6= (2k + 1) π2 , k ∈ Z.
4. (b) Since ak converges, lim ak = 0. Thus there exists ko ∈ NPsuch that 0 ≤
ak ≤ 1 for all k ≥ ko . But then 0 ≤ a2k ≤ ak for all k ≥ ko , and a2k converges
2
by the comparison test. (d) Take ak = 1/k . (f ) Converges. Use the inequality
ab ≤ 12 (a2 + b2 ), a, b ≥ 0. 5. The series diverges for all q < 1, p ∈ R, and converges
for all q > 1, p ∈ R. If q = 1, the series diverges for p ≤ 1 and converges for p > 1.
6. Suppose lim(an /bn ) = L, where 0 < L < ∞. Take ǫ = 12 L. For this ǫ, there exist
ko ∈ N such that 12 L ≤ ak bk ≤ 23 L for all k ≥ ko . The result now follows by the
comparison test. 8.PSince an > 0 for all n, we have bk ≥ a1 /k for all k. Hence by
the comparison test bk diverges. 11. For a simple √ example take ak = (−1)k .
k+1 n k
12. The proof uses the fact that lim k
= lim kn = 1 for all n ∈ Z.
k→∞ k→∞
P∞ 1 P∞ 1
13. The given series is the sum of the two series 2
and 3
, each
k=1 (2k − 1) k=1 (2k)
of which converges. 16. Let sn = a1 + a2 + · · · + an and tk = a1 + 2a1 + · · · + 2k a2k .
By writing sn = a1 + (a2 + a3 ) + (a4 + a5 + a6 + a7 ) + · · · , show that if n < 2k , then
sn ≤ tk , and if n > 2k , then sn ≥ 21 tk . From these two inequalities it now follows that
P P k
ak < ∞ if and only if 2 a2k < ∞. 18. (a) Diverges. If ak = 1 (k ln k), then
2k a2k = 1 (k ln 2). 19. Use Example 5.2.7 to show that ck − ck+1 ≥ 0 for all k.
Thus {ck } is monotone decreasing. Use the definition of ln k and the method of proof
of the integral test to show that ck ≥ 0 for all k. 21. Write ak+1 /ak = 1 − xk /k
where xk = (q − p)(k/(q + k + 1)).
2 k 2
1 2k
22. (c) When p = 2, ak = 1·3···(2k−1) 1
Q
2·4···(2k)
= 1 − 2j ≥ 1 − 2k . Now
j=1
n
1/h P
use the fact that lim (1 + h) = e. 23. (a) Set sn = ak , and let s = lim sn .
h→0 k=1
P √ √ √
Consider the series bk where b1 = ( s− s − s1 ) and for k ≥ 2, bk = ( s − sk−1 −
√
s − sk ).
Exercises 7.2. page 320
∞
P
1. If {bn } is monotone increasing to b, consider (b − bk )ak . 2. Take bk = 1/k
k=1
n
for k odd, and bk = 1/k2 for k even.
P
4. If Bn = cos kt, then
k=1
Hints and Solutions 549
n
sin 21 t Bn = 1
[sin(k + 21 )t − sin(k − 12 )t] = 12 [sin(n + 21 )t − sin 12 t].
P
2
k=1
kk
5. (a) Converges. (c) Converges. (d) Diverges; lim= 1/e 6= 0.
k→∞ (1 + k)k
(f ) Converges. (h) Converges for all t 6= 2nπ, n ∈ Z. If t = 2nπ, then the series
converges for p > 1 and diverges for 0 < p ≤ 1. 8. Use the partial summation
formula to prove that
n
P n−1
P n
P
kak = nAn − Ak where Ak = ak .
k=1 k=1 k=1
Now use Exercise 14 of Section 3.2.
Exercises 7.3. page 327
1 2 2
Pn |ab| ≤ 2 (a + b ), a, b ∈ R. 4. Use the hypothesis on |an | to
2. Use the inequality
show that sn = k=1 |ak | ≤ b1 − bn+1 . 6. (a) Converges conditionally.
(c) Converges absolutely for p > 1 and conditionally for 0 < p ≤ 1. (e) Converges
kk
absolutely for p > 1, and conditionally for 0 < p ≤ 1. (g) Rewrite ak =
(k + 1)k
as ak = (1 + k1 )−k . Since limk→∞ −1 k+1
P
a
P k 2 = e , the series (−1) a k diverges. (i)
By the comparison test (with 1/k ) the series converges absolutely. 9. First
n−1
P 1
1 1
note that S3n = 3k+1
+ 3k+2 − 3k+3 . Now show that S3n → ∞ as n → ∞.
k=0 P
11. By Theorem, 7.2.6 the series converges. To show that | sin k|/k = ∞, show
that for any three consecutive integers, at least one satisfies | sin k| ≥ 21 .
Exercises 7.4. page 333
∞
1. (a) k{1/(ln k)}k22 = 1/(ln k)2 , which diverges (Exercise 5, Section 7.1).
P
k=2
√ ∞
(c) k{ln k/ k}k22 = (ln k)2 /k which diverges by the Comparison test.
P
k=2
2. (a) |p| < 1. (c) p ≥ 21 . 3. Since {1/k} ∈ ℓ2 , the result follows by the
Cauchy-Schwarz inequality. 9. If we interpret the vectors a and b as forming
two sides of a triangle, with the third side given by b − a, then by the law of cosines
kb − ak22 = kbk22 + kak22 − 2kak2 kbk2 cos θ. Now apply Exercise 8(e). 11. (b)
Suppose {ak } ∈ ℓ1 . Since limk→∞ ak = 0 there exists ko ∈ N such that |ak | ≤ 1 for
all k ≥ ko . But then |ak |2 ≤ |ak | for all k ≥ ko . Hence {ak } ∈ ℓ2 .
Chapter 8
Exercises 8.1. page 343
(
nx 0, x = 0,
2. (a) lim =
n→∞ 1 + nx 1, x > 0.
(
0, x 6= ±(2k − 1) π2 , k ∈ N,
(c) lim (cos x)2n = 3. (a) By the root test the
n→∞ 1, x = ±(2k − 1) π2 , k ∈ N.
series converges for all |x| < 2; diverges for |x| ≥ 2. (c) converges for all x > 0;
Z 1 Z 1/n Z 2/n
diverges for x ≤ 0. 5. (c) fn = n2 x dx+ (2n−n2 x) dx = 1/2+1/2 =
0 0 1/n
1. 7. (a) If x = 0, fn (0) = 0 for all n ∈ N. If x > 0, then 0 < fn (x) < x/n, from
which the result follows. (b) For each n ∈ N, fn (x) has a maximum of e−1 at
550 Hints and Solutions
x = n. 8. Use
! the M !M ∈
fact that for N, N,
N M N M ∞
! ∞ ∞
!
X X X X X X X X
an,m = an,m ≤ an,m ≤ an,m .
n=1 m=1 m=1 n=1 m=1 n=1 m=1 n=1
The above inequalities hold since an,m ≥ 0 for all n, m ∈ N. Now first let M → ∞,
and then N →! ∞, to obtain
∞ ∞ ∞ ∞
!
X X X X
an,m ≤ an,m .
n=1 m=1 m=1 n=1
The same argument also proves the reverse inequality.
Exercises 8.2. page 350
2. (b) Suppose {fn } and {gn } converge uniformly to f and g respectively on E. Then
|fn (x)gn (x) − f (x)g(x)| ≤ |gn (x)||fn (x) − f (x)| + |f (x)||gn (x) − g(x)|. By hypothesis
|gn (x)| ≤ N for all x ∈ E, n ∈ N. Also, since |fn (x)| ≤ M for all x ∈ E, n ∈ N,
|f (x)| ≤ M for all x ∈ E. Therefore
|fn (x)gn (x) − f (x)g(x)| ≤ N |fn (x) − f (x)| + M |gn (x) − g(x)|.
Now use the definition of uniform convergence of {fn } and {gn } to show that given
ǫ > 0, there exists no ∈ N such that |fn (x)gn (x) − f (x)g(x)| < ǫ for all x ∈ E
and n ≥ no . 4. Find Mn = max{fn (x) : x ∈ [0, 1]}, and show that Mn → ∞.
5. (a) For x ∈ [0, a], |fn (x)| ≤ an . If 0 < a < 1, then lim an = 0. Thus given
n→∞
ǫ > 0 there exists no ∈ N so that an < ǫ for all n ≥ no ; that is |fn (x)| < ǫ for
all x ∈ [0, a], n ≥ no . Therefore {fn } converges uniformly to 0 on [0, a] whenever
1 1
a < 1. (b) No. Obtain a contradiction to Theorem 8.2.5. 8. (a) 2 ≤ 2
k + x2 k
1
1/k2 < ∞, the series
P P
for all x ∈ R. Since converges uniformly by
k 2 + x2
2 −kx
the Weierstrass M-test. (c) For x ≥ 1, k e ≤ k (1/e)k . Since 1/e < 1, the
2
P 2 sin 2kx 1 1
series k (1/e)k converges. 9. (a) ≤ ≤ C 3/2 for all
(2k + 1)3/2 (2k + 1)3/2 k
P 1
x ∈ R. Since < ∞, the given series converges uniformly for all x ∈ R by the
k3/2
Weierstrass M-test. (d) Since | sin h| ≤ |h| we have | sin(x/kp )| ≤ 2/kp for |x| ≤ 2.
Since p > 1, by the Weierstrass M-test theseries converges uniformly and absolutely
Pn 1 1
for |x| ≤ 2. (e) Hint: Let Sn (x) = − . 10. (a) For
k=0 kx + 2 (k + 1)x + 2
2 2 P 2
x ≥ a > 0, 1 + k x ≥ ak . Thus since 1/k < ∞, by the Weierstrass M-test the
given series converges uniformly on [a, ∞) for every a > 0. To show that it does not
converge uniformly on (0, ∞), consider (S2n −Sn )(1/n2 ), where Sn P is the nth partial
sum of the series. 12. Since |ak xk | ≤ |ak | for all x ∈ [−1, 1] and |ak | converges,
the given series converges absolutely and uniformly by the Weierstrass M-test.
n
sin kt. By (1) of the proof of Theorem 7.2.6, |An | ≤ 1/| sin 21 t|.
P
16. Set An =
k=1
Thus {An } is uniformly bounded on any closed interval that does not contain an
integer multiple of 2π. The conclusion now follows by the Abel partial summation
xn
formula. 18. Suppose |F0 (x)| ≤ M for all x ∈ [0, 1]. Show that |Fn (x)| ≤ M
n!
for all x ∈ [0, 1], n ∈ N. Now use the Weierstrass M-test.
Exercises 8.3. page 359
(
∞ 0, x = 0, 1,
P k
1. Show that x(1 − x) = Thus by Corollary 8.3.2 the conver-
k=0 1, 0 < x < 1.
Hints and Solutions 551
lim Rn (x) = 0, −1 < x ≤ 0. If 0 < x < 1, use Corollary 8.7.19 to show that
n→∞
n
1 · 3 · · · (n + 21 )
x x−ζ
|Rn (x)| ≤
n! (1 − x)3/2 1 − ζ
for some ζ, 0 < ζ < x. Now use the method of Example 8.7.20(c) to show that
1
lim Rn (x) = 0 for all x, 0 < x < 1. Thus the series converges to (1 − x)− 2 for all
n→∞
Rx 1
x, |x| < 1. (f ) Use the fact that arcsin x = √ dt, |x| < 1. (h) For p real,
0 1 − t2
p(p − 1) 2 p(p − 1)(p − 2) 3
(1 + x)p = 1 + px + x + x + · · · . If p is a positive integer,
2! 3!
then the expansion is finite.
Exercises 8.8. page 402
√
1. (a) Γ( 23 ) = Γ( 12 +1) = 21 Γ( 12 ) = 21 π. 2. Make the change of variable t = − ln s.
π/2 1 1
R∞ √ 1 Γ(n + 2 )Γ( 2 )
3. (a) 0 e−t t3/2 dt = Γ( 25 ) = 34 π. 5. (a) (sin x)2n dx =
R
.
0 2 Γ(n + 1)
Chapter 9
Exercises 9.1. page 417
R1 2 R1 R1 2 R1 2 1
R1
1. φ1 = 1 = 2 and φ2 = x = 2/3. Therefore c1 = 2
sin πx dx = 0 and
−1 −1 −1 −1 −1
3
R1
c2 = 2
x sin πx dx = 3/π. Thus by Theorem 9.1.4, S2 (x) = (3x)/π gives the best
−1
approximation in the mean to sin πx on [−1, 1]. 3. (a) a1 = 1/2, a2 = 1, a3 =
−1/6. (b) S2 (x) = π2 + π603 (π 2 − 12)(x2 − x + 61 ).
Rπ
5. (c) bn = π2 x sin nx dx = − n2 cos nπ = n2 (−1)n+1 . Therefore
0
P∞ (−1)k+1 π ∞
4 P 1
x∼2 sin kx. 6. (c) x ∼ − cos(2k + 1)x.
k=1 k 2 π k=0 (2k + 1)2
12. (a) As in the proof of Theorem 7.4.3,for λ ∈ R, 0 ≤ kx − λyk2 = kxk2 −
2λhx, yi + λ2 kyk2 . If y 6= 0, take λ = hx, yi kyk2 to derive the inequality.
Exercises 9.2. page 422
Rπ
4. (a) For theRπorthogonal system {sin nx}∞ 2 1
n=1 on [0, π], 0 sin nx dx = 2 π. There-
2
fore bn = π 0 f (x) sin nx dx. Thus Parseval’s equality for the orthogonal system
∞ Rπ π2 π2
b2n = π2 f 2 (x) dx. (b) (i)
P
{sin nx} becomes . (ii) . 5. Use Parse-
n=1 0 8 6
1 2 2 2
val’s equality and the fact that f g = 2 [(f + g) − f − g ]. 6. Any function that is
Rb
identically zero except at a finite number of points will satisfy a f (x)φn (x) dx = 0.
Exercises 9.3. page 431
1. (a) If f is even on [−π, π], then f (x) sin nx is odd and f (x) cos nx is even. Thus
Rπ
bn = 0 for all n = 1, 2, ... and an = π2 f (x) cos nx dx, n = 0, 1, 2....
0
2 P∞ (1 − (−1)k ) 4 P∞ 1
3. (a) f (x) ∼ sin kx = sin(2k + 1)x.
π k=1 k π k=0 2k + 1
554 Hints and Solutions
π 4 P∞ 1 P∞ (−1)k+1
(c) |x| ∼ − cos(2k + 1)x. (e) 1 + x ∼ 1 + 2 sin kx.
2 π k=0 (2k + 1)2 k=1 k
∞ ∞ k
4 P 1 1 2 P (−1)
5. (a) 1, sin(2k + 1)x. (f ) − cos(2k + 1)x,
π k=0 2k + 1 2 π k=0 2k + 1
2 ∞ 1
(−1)n − cos 21 nπ sin nx.
P
−
π n=0 n
6. (c) Since he is even, the Fourier series of he is the cosine series of h. Therefore
2c π/2
R 2c Rπ πc
a0 = x dx + (π − x) dx = , and
π 0 π π/2 2
2c π/2 2c Rπ 2c
2 cos nπ + (−1)n+1 − 1 .
R
an = x cos nx dx + (π − x) cos nx dx = 2
π 0 π π/2 πn2
cπ c P∞ 1 − (−1)k 2
Thus he (x) ∼ − cos 2kx. 7 (b) f (x) ∼ −
4 π k=1 k2 π
∞ π
4 P 1 1 R ′′
cos 2kx. 8. By integration by parts, bk = − 2 f (x) sin kx dx.
π k=1 (4k2 − 1) πk −π
′′ ′′
Since f ∈ R[−π, π], it is bounded on [−π, π]; i.e., |f (x)| ≤ M . Therefore
|bk | ≤ 2M k2 . Similarly for ak . Thus by the Weierstrass M-test, the Fourier se-
ries of f converges uniformly on [−π, π].
Exercises 9.4. page 442
π4
2. . 3. To show that {sin nx}∞ n=1 is complete on [0, π] it suffices to show that
90
∞ 2 Rπ 2
b2n =
P
Parseval’s equality holds for every f ∈ R[0, π]; i.e., f (x) dx. To accom-
n=1 π0
plish this, let fo denote the odd extension
Rπ of f to [−π, π]. Since fo is odd, an = 0
for all n = 0, 1, 2..., and bn = π2 0 f (x) sin nx dx. Since the orthogonal system
∞ 1 Rπ 2 2 Rπ 2
{1, cos nx, sin nx}∞ b2n =
P
n=1 is complete on [−π, π], fo (x) dx = f (x) dx.
n=1 π −π π 0
n
6. Let Sn (t) = 12 a0 =
P
ak cos kt + bk sin kt. If x ∈ [−π, π], then
k=1
Rx Rx Rx
f (t)dt − Sn (t) dt ≤ |f (t) − Sn (t)| dt.
−π −π −π
Rx Rx
Thus by Exercise 5(a) and Theorem 9.4.7, lim Sn (t) dt = f (t) dt, with
n→∞ −π −π
Rx
the convergence being uniform on [−π, π]. But Sn (t) dt = 12 a0 (x + π) +
−π
n
P ak bk
sin kx − (cos kx − cos kπ) , from which the result follows. 10. Use
k=1 k k
Lemma 9.4.8 and the Weierstrass approximation theorem.
Exercises 9.5. page 453
π2 π2 1 1 π (eaπ − 1) 2a P ∞ ((−1)n eaπ − 1)
2. , . 3. (a) . ( b) − . 6. (a) + cos kx.
12 6 2 2 4 aπ π k=1 a2 + k 2
(b) On [−π, π], the series converges to e|ax| , and thus to the 2π-periodic extension
of e|ax| on all of R.
Hints and Solutions 555
Chapter 10
Exercises 10.2. page 474
2. Since U is open and non-empty, there exists x ∈ U and r > 0 such that
(x − r, x + r) ⊂ U . Thus by Theorem 10.2.4, m(U ) ≥ m((x − r, x + r)) = 2r > 0. 3.
Let Vǫ = (−ǫ, 1 + ǫ). Then Vǫ is an open set containing P . Show that m(Vǫ \ P ) ≥ 1
for all ǫ > 0 and thus m(P ) < 2ǫ for every ǫ > 0. Alternately, show that
m(P c ∩ [0, 1]) = 1 and use Theorem 10.2.15. 6. First show that there exist dis-
joint bounded open sets U1 , U2 with U1 ⊃ K1 and U2 ⊃ K2 . Then m(K1 ∪ K2 ) =
m(U1 ∪U2 )−m((U1 ∪U2 )\(K1 ∪K2 )). But (U1 ∪U2 )\(K1 ∪K2 ) = (U1 \K1 )∪(U2 \K2 ).
Now use Theorem 10.2.9.
Exercises 10.3. page 480
1. (b) First show that if U is any open set, then U +x is open and m(U +x) = m(U ).
Use this and the definition to prove that λ∗ (E + x) = λ∗ (E). If K is compact and U
is a bounded open set containing K, show that (U + x) \ (K + x) = (U \ K) + x. Use
this to show that m(K +x) = m(K) and λ∗ (E +x) = λ∗ (E). 3. Since E1 ∩E2 ⊂ E1
and λ∗ (E1 ) = 0, λ∗ (E1 ∩ E2 ) = 0. Thus by Theorem 10.3.5, E1 ∩ E2 is measurable.
For E1 ∪ E2 apply Theorem 10.3.9. 6. If λ∗ (E) < ∞, then for each k ∈ N there
∗ 1
T set Uk with Uk ⊃ E such that m(Uk ) < λ (E) + k . Now use the fact
exists an open
that E ⊂ Un ⊂ Uk for all k ∈ N. 8. Set Ek = E ∩ [−k, k], k ∈ N. Then {λ∗ (Ek )}
is monotone increasing with λ∗ (Ek ) ≤ λ∗ (E) for all k ∈ N. Let α = lim λ∗ (Ek ).
k→∞
Suppose α < λ∗ (E). Choose β ∈ R such that α < β < λ∗ (E). By definition there
exists a compact set K with K ⊂ E such that m(K) > β. Use this to show that
there exists ko ∈ N such that λ∗ (Ek ) > β for all k ≥ ko , which is a contradiction.
Exercises 10.4. page 488
2. If E is bounded, the result follows from the definition of λ∗ (E) and λ∗ (E), and
Theorem 10.4.5(b) (for a finite union). If E is unbounded, let En = E ∩ (−n, S n).
Given ǫ > 0, chooseS Un open such that E ⊂ Un and λ(Un \E) < ǫ 2n . Let U = Un .
Show that U \ E ⊂ (Un \ En ). Now use Theorem 10.3.5 to show that λ(U \ E) < ǫ.
To obtain a closed set F ⊂ E satisfying λ(E \F ) < ǫ, apply the result for open sets to
E c . 4. First show that λ(E1 ∪E2 ) = 1; then use Theorem 10.4.1. 6. If E satisfies
λ∗ (E ∪T )+λ∗ (E c ∪T ) = λ∗ (T ) for every T ⊂ R, then E satisfies Theorem 10.4.2 and
thus is measurable. Conversely, suppose E is measurable and T ⊂ R. If λ∗ (T ) = ∞,
the result is true. Assume λ∗ (T ) < ∞. Let ǫ > 0 be arbitrary. Then there exists an
open set U ⊃ T such that λ(U ) < λ∗ (T ) + ǫ. Since E and U are measurable, E ∩ U
and E c ∩ U are disjoint measurable sets with (E ∩ U ) ∪ (E c ∩ U ) = U . Furthermore,
E ∩ U ⊃ E ∩ T and E c ∩ U ⊃ E c ∩ T . Thus by Theorem 10.3.9
λ∗ (T ) ≤ λ∗ (E ∩ T ) + λ∗ (E c ∩ T ) ≤ λ(E ∩ U ) + λ(E c ∩ U ) = λ(U ) < λ∗ (T ) + ǫ.
Since the above holds for every ǫ > 0, we have λ∗ (E ∩ T ) + λ∗ (E c ∩ T ) = λ∗ (T ).
Exercises 10.5. page 494
[0, 1],
if c < 0,
(0, 1], if 0 ≤ c < 1,
1. {x : f (x) > c} = 1
(0, c
) ∪ {1}, if 1 ≤ c < 2,
1
(0, c ), if 2 ≤ c.
T
5. If c > 0, then {x : 1/g(x) > c} = {x : g(x) > 0} {x : g(x) < 1/c}. Since g
is measurable, each of the sets {g(x) > 0} and {g(x) < 1/c} is measurable. Thus
556 Hints and Solutions
R1 n−1/p R1
1 1 1
x−p dx =
R
fn dλ = n dx + + 1 − =
0 0 n−1/p
n(1−p)/p 1−p n(1−p)/p
1 h p i
1 − (1−p)/p .
1−p n
R1 PN N R
P
Since (1 − p) > 0, lim fn dλ = 1/(1 − p). 7. 0 ≤ nλ(En ) ≤ f dλ
n→∞ 0 n=1 n=1 En
Z Zb
= f dλ ≤ f dλ < ∞. 9. Justify first why f can be assumed to be non-
∪N a
n=1 En
negative
R R use Definition 10.7.1. 10. Let An = A ∩ [−n, n]. By definition,
and then
A
f dλ = lim An f dλ. Since f is integrable, given ǫ > 0, there exists n ∈ N such
R n→∞ R R R R
that 0 ≤ f dλ − f dλ < ǫ. By Theorem 10.7.4(b), f dλ − f dλ = f dλ.
A An A An A\An
Thus E = An is the desired set. 13. Let fn = min{|f |, n}. Then 0 ≤ Rfn ≤ |f |.
Since f is integrable, by Lebesgue’s dominated convergence theorem lim fn dλ =
n→∞ A
R R
|f | dλ. Therefore, given ǫ > 0, there exists n ∈ N such that (|f | − fn ) dλ < ǫ/2.
A A
In
R particular,
R if E is any measurable subset of A,
|f | dλ < fn dλ + 21 ǫ.
E E R
But if λ(E) < ∞, E fndλ ≤ nλ(E). Hence choose δ> 0 so that nδ < ǫ/2. 15. For
n ∈ N set hn (t, x) = n sin((t + n1 )f (x) − sin(tf (x)) . Show that |hn (t, x)| ≤ 2|f (x)|
and apply Lebesgue’s dominated convergence theorem. 16. First prove the result
for the characteristic function of an interval. Then use Exercise 9 above and Exercise
15 of Section 10.6. 19. Since {fn } is monotone increasing on A, f (x) = lim fn (x)
n→∞
exists,
R either Ras a finite number or as ∞, for every x ∈ A. By Fatou’s R lemma,
RA f dλ ≤ lim A fn dλ. On the other hand, since fn ≤ f for all n ∈ N, lim A fn dλ ≤
A
f dλ. Combining the two inequalities proves the result. 21. (a) Use the mono-
tone convergence theorem. 23. Hint: |f (x)| ≥ (1/x)| cos 1/x2 | − 2x ≥ x−1 − 2x on
1 1
each of the intervals ((2n + 31 )π)− 2 ≤ x ≤ ((2n − 13 )π)− 2 .
Exercises 10.8. page 526
1. 0 < p < 4. Since |f + g|2 ≤ 2(|f |2 + |g|2 ), the function f + g ∈ L2 (A).
1
2
.
Assume kf +R gk2 6= 0. Then
kf + gk22 = |f + g|2 ≤ |f + g||f | + |f + g||g|
R R
A A A
which by the Cauchy Schwarz inequality ≤ kf + gk2 kf k2 + kf + gk2 kgk2 . The result
follows upon simplification. 7. Use the Cauchy-Schwarz inequality. 9. (a) By
Bessel’s inequality it is the Fourier series of an L2 ([0, π]) function. 13. Let f (x) =
1 ln x, x ∈ (1, ∞). Since f ′′ (x) > 0 for all x ∈ (1, ∞), f is convex on (1, ∞) (see
A = B, 3 S(P, f ), 243
A ∩ B, 3 S(P, f, α), 277
A ∪ B, 3 U(P, f, α) (L(P, f, α)), 265
A ∼ B, 37 U(P, f ), L(P, f ), 225
A ⊂ B, 3 E, 64
Rb
A $ B, 3
Rb
a
f dα f dα, 266
a
A × B, 5 Rb Rb
Ac , 3 a
f , a f , 226
B(x, y), 402 σ–algebra, 487
B \ A, 3 sup E, 23
E ′ , 64 lim sn , lim sn , 107
En (f ), 283 d(p, A), 158
Jf (p), 177 d2 (p, q), d1 (p, q), 54
Lf (Uf ), 293 d∞ (p, q), 55
Mn (f ), 281 f (E), f −1 (H), 10
Rn (f, c)(x), 388 f (p+), f (p−), 163, 445
Tn (f, c)(x), 386 f : A → B, 8
′ ′
[x], 166 f+ (p), f− (p), 184
+ −
Γ(x), 398 f , f , 512
m(U ), 463
S T
Eα , Eα , 41
χE , 293, 467 nth derivative, 184
√ 1
ℓ2 , 328 nth root, n x, x n , 31
inf
R E, 24 nth term, 39, 84, 120
RA f dλ, 509 x ∈ A, x 6∈ A, 2
f dλ, 497 y = f (x), 7
RAb
f dα, 267
Rab Abel partial summation formula, 316
a
f , 228
Abel’s test, 320
Int(E), 59
Abel’s theorem, 380
λ(E), 477
Abel, Neils, 316
λ∗ (E), λ∗ (E), 475
absolute maximum (minimum), 192
h , i, 80, 330, 408
absolute value, 52
C(K), 356
absolutely convergent, 321
L(A), L1 (A), 513
absolutely integrable, 262
L2 (A), 518
algebraic number, 46
L2 ([a, b]), 520
almost everywhere (a.e.), 491
LL (P, f ), UL (P, f ), 496
alternating series, 317
M, 486
alternating series test, 317
P(A), 4
antiderivative, 249
R(α), 270
approaches infinity, 100
R[a, b], 228
559
560 Index
Heine, Eduard, 70, 73, 161, 456 least squares approximation, 412
Heine-Borel theorem, 74 least upper bound, 23
Heine-Borel-Bolzano-Weierstrass Least Upper Bound Property, 25
theorem, 74 Lebesgue integrable, 509, 513
Lebesgue integral, 497, 509, 513
identity function, 9 Lebesgue measurable, 477
image of, 10 Lebesgue sums, 506
improper Riemann integral, 258, 260 Lebesgue’s dominated convergence
increasing, 167 theorem, 515
index set, 40 Lebesgue’s theorem, 237, 293
indexed family, 40 Lebesgue, Henri, 74, 237, 297, 454, 459,
infimum, 24 527
Infimum Property, 25 left continuous, 163
infinite expansion, 35 left derivative, 184
infinite intervals, 27 left limit, 163, 445
infinite limits, 100 Legendre polynomials, 418
infinite product, 124 Leibniz’s rule, 192
infinite series, 120 Leibniz, Gottfried, 129, 181, 223, 296
infinite set, 38 limit, 84, 130
infinitely differentiable, 383 limit at ∞, 141
inner measure, 475 limit comparison test, 304
inner product, 80, 330, 408 limit inferior, 107
integers, 2 limit of the sequence {fn }, 340
integrable function, 227, 267 limit point, 63
integral form of the remainder, 390 limit superior, 107
integral test, 306 limit superior (inferior) of f , 178
integration by parts formula, 254, 272 linear approximation, 386
interior, 59 linear function, 337
interior point, 59 Lipschitz condition, 159
intermediate value theorem, 152 Lipschitz function, 159
intermediate value theorem for local maximum (minimum), 192
derivatives, 201 lower (upper) Lebesgue sums, 496
intersection, 3, 40 lower bound, 23
interval, 27 lower function, 293
inverse function, 12, 173 lower integral, 226
inverse function theorem, 201 lower Riemann-Stieltjes integral, 266
inverse image, 10 lower Riemann-Stieltjes sum, 265
irrational numbers, 2 lower sum, 225
isolated point, 63
Müntz-Szasz theorem, 376
jump discontinuity, 165 Maclaurin series, 386
jump of f at p, 177 Maclaurin, Colin, 340, 386
mapping, 8
l’Hospital’s rule, 208, 211 maps, 8
l’Hospital, Marquis de, 206 mathematical induction, 16
Lagrange form of the remainder, 389 maximum element, 23
Lagrange, Joseph, 195, 220, 336, 389 mean value theorem, 195
Laplace transform, 298 mean value theorem for integrals, 253,
largest element, 23 272
Index 563