MATH101 Algebra and Differential Calculus Lecture Notes Part 1
MATH101 Algebra and Differential Calculus Lecture Notes Part 1
MATH101
ALGEBRA AND
DIFFERENTIAL CALCULUS
Trimester 1, 2015
c
University of New England
CRICOS Provider No: 00003G
CONTENTS i
Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Preface
The work of Euclid has certainly been extended many times. Euclid, however,
has not been corrected – his theorems are valid today and for all time! The other
remarkable thing about mathematics is its extraordinary utility in describing and
quantifying the world around us. Mathematics is the language of the sciences, both
natural and social. This forces mathematics to be abstract, since it must embrace
theories from physics, economics, chemistry, psychology, etc. Mathematics is so
widely applicable precisely because of — not despite — its intrinsic abstractness.
MATH101 is the first half of the MATH101/102 sequence, which lays the founda-
tion for all further study and application of mathematics and statistics, presenting
an introduction to differential calculus, integral calculus, algebra, differential equa-
tions and statistics, providing sound mathematical foundations for further studies
not only in mathematics and statistics, but also in the natural and social sciences.
Achieving this, requires a brief, preliminary foray into the basics of mathematics,
because much of the material requires a high degree of abstract reasoning, rather
than rote learning of computational techniques.
This is the first of three parts of the lecture notes which together constitute the
unit material for MATH101. These notes were originally prepared by Chris Radford
and have been revised by Shusen Yan and others.
iv Preface
The reader is invited and encouraged to point out any mistakes (s)he finds. I
hope you enjoy the challenges the unit offers and that you experience a sense of
achievement at the end.
Bea Bleile
UNE
Lecture 1.1 Mathematical Language and Proof 1
Introduction
Basic Terminology
Example “I will have a BBQ and it will rain” is a compound statement consisting
of two simple statements. “I will have a BBQ” and “It will rain”.
A theorem will give a deeper insight into the structure of a piece of mathematics.
Some theorems can have proofs that are long and intricate. It is useful in such
cases to break the proof into intermediate steps which are separated out as lemmas
which lead into the main result.
Implication
Consider the following compound statement: “If I pass MATH101 then I will do
MATH 102”. Under what circumstances is this statement true or false?
Let’s have a look at the two simple statements making up this compound state-
ment.
A. I pass MATH101.
B. I will do MATH102.
A B
1. True True
2. True False
3. False True
4. False False
Case 1: I pass MATH101 and enrol in MATH 102. The implication is True.
Case 3: I do not pass 101 but still enrol in 102. My original statement is not a
lie, it is not a falsehood. Implication is True.
Case 4: I do not pass 101 and do not enrol in 102. Again, my original statement
is not a lie or a falsehood. Implication is True.
A B If A then B
1. True True True
2. True False False
3. False True True
4. False False True
There is only one case when the implication is false, ie. when A is true and B is
false.
Example Prove that the sum of two prime numbers larger than 2 is an even
number.
The hypothesis “p and q are prime numbers larger than 2” assumes that we know
what prime numbers are. A prime number is a natural or counting number divisible
only by itself and one – 2, 3, 5, 7, 11, 13, 17, 19, 23 are prime numbers.
So we are to assume p and q are prime numbers bigger than 2. We must then
construct a series of arguments which lead directly to the conclusion that p + q is
divisible by 2 (i.e. p + q is even).
Firstly, we see that p and q must be odd. We must be careful about this point
because 2 is prime but even! However, for primes greater than 2 we note that a
prime cannot be divisible by 2; so it must have remainder 1 after division by 2. So
there must be counting numbers n and m such that
p = 2n + 1
q = 2m + 1
This really just says that p and q are odd. We can now move to our conclusion,
p + q = (2n + 1) + (2m + 1)
= 2n + 2m + 2
= 2(n + m + 1).
This is a direct proof as we moved directly from the (assumed true) hypothesis,
“p and q are prime numbers larger than 2” directly to the conclusion “p + q is an
even number”.
Lecture 1.1 Mathematical Language and Proof 5
Proof by Contradiction
Recall that the implication “If A then B” can only be false if we have A true
and B false. So if we were to assume B to be false and prove, as a consequence,
that A is necessarily false, then we have proved the implication is true. We would
have shown that the only possible case with B false is A false which contradicts the
assumed truth of the hypothesis. This is proof by contradiction.
Example Prove that if p is a prime number larger than 2 then p + 1 is not prime.
We are now free to use any means at our disposal to find a contradiction based on
these two assumptions. If we can find such a contradiction then we have established
the truth of the original implication.
We have a contradiction. Our proof is complete, our assumption that the con-
clusion is false cannot hold, the conclusion must be true. 2
6 Lecture 1.1 Mathematical Language and Proof
♠ Exercises 1
2. Use a direct proof technique to prove that if (a + b)2 = a2 + b2 for all real
numbers b, then a = 0.
In your mathematical reading you will find that there are certain types of theorem
whose statement and proof follow a standard pattern. Our aim here is to look at a
couple of the more important examples.
The statement of theorems of this type takes one of the following possible forms
Notice that these forms are all basically equivalent – the truth (or falsity) of A
automatically implies the truth (or falsity) of B and vice versa. So to prove such a
theorem we have to produce two proofs:
The first proof says A is sufficient for B (or B is necessary for A). The second
proof says B is sufficient for A (or A is necessary for B). In fact another com-
mon statement of equivalence is a theorem which takes the following form, “A is a
necessary and sufficient condition for B”.
Solution Let A be the statement “n is odd” and let B be the statement “n2 is
odd”.
1. A implies B
2. B implies A
n2 = (2r + 1)2 = 4r 2 + 4r + 1
= 4(r 2 + r) + 1,
which is again an odd number. The first implication has been proved.
We must now tackle “B implies A”. This is not easy to do using a direct proof.
Try it, you need to assume n2 is odd and then prove n is odd. Here we will use a
proof by contradiction. The conclusion of the implication is that n is odd, so we will
assume that n is even in order to produce a contradiction to our assumed hypothesis
(that n2 is odd). That is, we assume n2 is odd (hypothesis) and that n is even and
show that this leads to a contradiction.
Well, if n is even there is a counting number m such that n = 2m. In which case
we have
n2 = (2m)2
= 4m2 .
But his means n2 is even. We have a contradiction with the hypothesis that n2 was
odd. We have a proof by contradiction.
The way we have written the above proof contains much more verbiage than we
would normally expect in a mathematical proof. We needed it to explain what was
going on! So let’s rewrite the above theorem and proof in a more economical and
“cleaner” style.
We have
n2 = (2r + 1)2
= 4r 2 + 4r + 1
= 4(r 2 + r) + 1,
Thus
n2 = (2m)2
= 4m2 ,
This “if and only if” type of theorem need not be restricted to the equivalence
of just two statements. A theorem might state the equivalence between three (or
more) statements. Something like
1. A
2. B
3. C.
Mathematical Induction
L.H.S. = 1 + 2 + 3 + 4 + 5 + 6 = 21
and R.H.S. = 21 6(6 + 1) = 3 × 7 = 21.
Here, L.H.S. is an abbreviation for left hand side of the equation and R.H.S. an
abbreviation for right hand side of the equation.
Our statement says the equation is true for all counting numbers n. Just because
we have checked a few examples does not mean we can conclude that the general
case is true. We cannot go from the particular to the general.
The usual way in which statements like this are proved is via the principle of
mathematical induction. This technique involves three steps
10 Lecture 1.2 Important Types of Theorems and Proof
1. Verify that the statement is true for the smallest number that can be used in
the statement.
2. Assume that the statement is true for an arbitrary counting number k. This
is called the inductive hypothesis.
3. Using 1 and 2 prove that the statement is true for the next counting number
k + 1. This is called the deductive step.
Suppose that we wish to prove that the statement P (n) is true for all counting
numbers n. For example P (n) might be the statement
1
1 + 2 + 3 + . . . + n = n(n + 1),
2
of our example above. Our three step program would now read
Mathematical Induction.
3. Use the assumed truth of P (k) and the verified truth of P (1)
to prove P (k + 1) is true.
If we can complete these three steps then we have proved P (n) for all counting
numbers n. To see why this is true notice that steps 2 and 3 show the following: If
P (k) is true then P (k + 1) is also true. Well, we know that P (1) is true (step 1),
so P (2) must be true. But if P (2) is true then P (3) must be true. And so on ‘ad
infinitum’ !
We can now finally prove the statement of our working example, that the sum
of the first n counting numbers is 21 n(1 + n).
1
L.H.S. P (1) = 1 and R.H.S. P (1) = 2
· 1 · (1 + 1) = 1 so P (1) is true.
L.H.S. P (k + 1) = 1 + 2 + 3 + . . . + k + (k + 1)
1
= k(1 + k) + (k + 1), using P (k).
2
And,
1
R.H.S. P (k + 1) = (k + 1)(1 + (k + 1))
2
1
= (k + 1)(k + 2)
2
1 1
= (k + 1)k + (k + 1)2
2 2
1
= k(k + 1) + (k + 1).
2
Thus we conclude
L.H.S. P (k + 1) = R.H.S. P (k + 1),
so P (k + 1) is true.
1 + 2 + 3 + . . . + k + (k + 1)
1
= k(1 + k) + (k + 1) using P (k)
2
1 1 1
= k(k + 1) + (2(k + 1)) as × 2 = 1
2 2 2
1
= (k(k + 1) + 2(k + 1)) by factorising
2
1
= ((k + 2)(k + 1)) again by factorising.
2
P (k), 2k > k 3 .
L.H.S. P (k + 1) = 2k+1
= 2k · 2 > 2k 3 , usingP (k).
And,
R.H.S. P (k + 1) = (k + 1)3
= k 3 + 3k 2 + 3k + 1.
So P (k + 1) is true.
♠ Exercises 2
1. Let m and n be counting numbers. Prove that the following statements are
equivalent:
(a) m > n.
(b) am > an , for all real numbers a > 1.
(c) am < an , for all positive real numbers a < 1.
1 + 2 + 22 + 23 + . . . + 2n−1 = 2n − 1.
4. Let a be a real number with a > 0. Prove for any counting number n > 1 that
(1 + a)n > 1 + na.
Basic Terminology
Capital letters will be used to denote sets and lower case letters will denote
elements of the sets. The symbol ∈ is used to denote membership of a set. So
x∈S
Note that we do not exclude the possibility that x is a set in its own right, except
that x cannot be S: We explicitly exclude S ∈ S.
The statement that a set is completely determined by its elements can be thought
of in terms of the following equivalence.
A set may be given by a property which defines the elements of the set. We
write
S = {x : “statement involving x”},
which reads S is the set of all x such that “statement involving x” holds. For
example,
N = {x : x is a natural number}
= {0, 1, 2, 3, 4, 5, . . .}
is the set of natural numbers. The set of even natural numbers could be written as
X ⊂ Y, or Y ⊃ X.
The empty set is the set with no elements. It is denoted by the symbol φ. Note
that φ ⊂ X, for any set X.
Operations on Sets
X ∩ Y = {x : x ∈ X and x ∈ Y }.
The union of sets X and Y is the set of all objects which are elements of X or
Y . Note that the “or” is inclusive, that is, x is contained in the union of X and Y
if and only if it is in X, in Y or both in X and in Y . Symbolically,
X ∪ Y = {x : x ∈ X or x ∈ Y }.
S S
A A B
x∈A A ∩ B 6= φ
S S
A B A B
A∩B = φ A∪B
S S
A A B
Ā (in S) A ∪ B̄
Such diagrams provide a simple way of demonstrating the equality of sets. For
example, draw the diagrams for A ∪ B and for A ∩ B to discover that A ∪ B =
A ∩ B (one of the De Morgan laws).
X × Y = {(x, y) : x ∈ X, y ∈ Y }.
The usual pictorial example of the Cartesian product is that of rectangular co-
ordinates in the plane. The whole plane is identified as the Cartesian product of the
two axes.
Lecture 1.3 Sets and Functions 17
Example Find, explicitly, the Cartesian product of the two sets X = {1, 7, 9, 21, 66}
and Y = {2, 3}.
Solution The Cartesian product, X ×Y , consists of all possible ordered pairs (x, y),
where x ∈ X and y ∈ Y . Hence
X × Y = {(1, 2), (1, 3), (7, 2), (7, 3), (9, 2), (9, 3), (21, 2), (21, 3), (66, 2), (66, 3)}
Functions
The function “rule” can be specified in many ways, but the important thing
is that to any given element of X we associate, somehow, a unique element of Y .
We indicate that we have a function or map between sets X and Y by the arrow
notation,
f : X −→ Y.
For any x ∈ X the element y of Y associated with x by the function f is denoted
f (x). Then f (x) ∈ Y and we write
y = f (x) or x 7−→ y.
This is a more general concept of a function than you have probably met before. The
familiar functions of calculus are functions from a subset, D, of the real numbers to
the real numbers. Denoting the set of real numbers by R, we obtain
f : D −→ R.
f : R −→ R; x 7−→ x2 + x + 1.
The graph of such a real valued function on D is the set of points (x, y) in the
plane such that y = f (x) for x ∈ D. In fact, this idea of graph is easy to generalise
to our more abstract setting.
18 Lecture 1.3 Sets and Functions
Another class of functions with which you should be familiar are the sequences.
A sequence {an } where n ∈ N can be thought of as a function
f : N −→ R.
This means for each n ∈ N the function f associates a real number f (n), which
by tradition we denote by a symbol such as an . In fact all your familiar functions
from calculus (quadratics, exponentials and so on) can be made into sequences if we
restrict ourselves to the subset N of R. For example, consider the quadratic
f : R −→ R, with f (x) = x2 .
We may restrict this function to the subset of natural numbers, that is, we think of
f as acting only on the subset N of R. This is usually written as f |N , and read as “f
restricted to N”. The function f |N has exactly the same function rule as f , it simply
acts only on the subset N of R. This all sounds a bit abstract, but in practice it’s
very simple, we have
f |N : N −→ R, with f |N (n) = n2 .
The set X is called the domain and the set Y is called the codomain
of the function f . The set f (X) = {f (x) : x ∈ X} is called the
range or image of the function.
g : {x ∈ R : x ≥ 0} → {y ∈ R : y ≥ 0}, x 7−→ x2 ,
f : R → R, x 7−→ x2 .
Lecture 1.3 Sets and Functions 19
They are given by the same formula, that is, f (x) = x2 and g(x) = x2 , but they
are quite different functions – they have different domains. They also have different
properties, for example, g is one-to-one (see lecture 8), whereas f is not.
is a function?
√
Solution As u is defined if and only if u ≥ 0, we must have 1 − x2 ≥ 0, that is,
−1 ≤ x ≤ 1. Thus D = {x : −1 ≤ x ≤ 1}.
g ◦ f : X −→ Z
Notice the order of f and g in g ◦ f – we use the function f first and then we use g.
The expression g(f (x)) simply means g(y) where y = f (x).
Solution
Furthermore,
f ◦ g (x) = f (g(x))
= f (x2 + 1)
= (x2 + 1)2
= x4 + 2x2 + 1
1
Example For functions h : N −→ R and g : R −→ R, where h(n) = n
and
g(x) = x21+1 , find the equation for g ◦ h(n).
Solution
g ◦ h(n) = g(h(n))
1
= g( )
n
1
= 1 2
(n) + 1
n2
=
1 + n2
2
Lecture 1.3 Sets and Functions 21
♠ Exercises 3
1. If A ⊂ S show that
(a) (A) = A
(b) A ∪ A = A ∩ A = A ∪ φ = A
2. Use a Venn diagram to demonstrate the truth of the second De Morgan law
A ∩ B = A ∪ B.
(a) What are the largest subsets Df and Dg of R, such that f and g are
functions?
(b) What are the ranges (images) of f and g?
(c) Find the equation defining g ◦ f . What are the domain, codomain and
and range of g ◦ f ?
Number theory, even today, offers some of the deepest and most challenging
problems in mathematics. Our aim over the next couple of lectures will be quite
modest. We want to give a brief introduction to those ideas from the theory of
number which are required for a rigorous look at the differential calculus.
Integers
We will accept as given the counting numbers {1, 2, 3, 4, . . .} and the natural
numbers N = {1, 2, 3, 4, . . .}, stressing only the facts needed for extensions of this
number system.
m + x = n; m, n ∈ N
we will need to introduce more numbers. Notice this equation can only be solved
in N (i.e. with x ∈ N) if n > m. We widen our number system to include the
“negatives” of the counting numbers.
Notice that the properties of closure under addition and multiplication still hold
for Z, as does the ordering property.
Lecture 1.4 Numbers 23
ax = b.
Rational Numbers
Theorem Between any two rationals there are infinitely many other rationals.
Irrational Numbers
It was realised by the Greeks more than 2000 years ago that there is an incom-
pleteness about the system of rational numbers. The diagonal of a square with
sides of unit length has a length which is not a rational number – it is an irrational
number. In algebraic language the equation for x,
x2 = a, a ∈ Q,
has a rational solution for x only for exceptional values of the rational number a.
For our unit square Pythagoras’ theorem says that the length of the diagonal, x, is
given by
x2 = 12 + 12 = 2.
√
We will now show that the formal solution to this equation, x = 2, cannot be a
√
rational number. We call 2 an irrational number.
Proof This is a very old theorem having been proved first by Pythagoras or one of
his school. It is a very simple and economical proof by contradiction.
Assume, on the contrary, that the rational p/q has square 2. We can also assume
that p and q are integers having no common factors – if they did have common factors
we would simply cancel them from the “fraction”. We have
p2
= 2, or
q2
p2 = 2q 2 .
p2 = 4r 2 = 2q 2 ,
from which we see that q 2 = 2r 2 so that q must be even. This means that p and q
have the common factor 2, contradicting our hypothesis. The theorem is proved.
The rationals and irrationals together give us the real numbers – all possible
points on the number line.
Lecture 1.4 Numbers 25
The irrationals fill in the “gaps” in the rationals. Given any irrational we can
find rational numbers arbitrarily close to it. This process is just the decimal approx-
imation to the infinite decimal expression for the irrational number. This is most
√
easily seen by looking at the irrational 2 = 1.4142 . . . (an infinite decimal). We
√
then simply look at successive decimal approximations to 2,
This sequence is found by adding a one to the last decimal place of our approximation
from below. Notice that this latter sequence is a decreasing sequence of rationals.
√
The sequence approximating 2 from below has no greatest member – we simply
go to the next decimal place to get a bigger member of the sequence. The sequence
√
approximating 2 from above has no smallest member.
√
So the irrational number 2 can be defined by cutting the rationals into two
classes L and R. Where L has no greatest member and R which has no smallest
member.
L R
z }| {z }| {
√
0 1 2 2 3
The rationals and the reals satisfy the axioms for what is known in algebra as an
ordered field. The reals are distinguished from the rationals by one further axiom,
Such a division of the real numbers into two classes by means of some rule is
called a Dedekind cut.
♠ Exercises 4
223 22
1. (a) Is 71
greater than 7
?
265 1351
(b) Is 153
greater than 780
?
4*. The density property of the rationals amounts to saying that there is no ra-
tional which is next to another. The following plan for arranging the rationals
(not in order of magnitude) does assign a definite place to each
1 2 1 3 2 1 4 3 2 1 5
, , , , , , , , , , ,... .
1 1 2 1 2 3 1 2 3 4 1
6
What place does 1
occupy? (Counting from the left, of course!)
Show that p/q occupies the { 12 (p + q − 1)(p + q − 2) + q}th place. (Notice
that each rational occurs infinitely often in this scheme; e.g. 2 appears as
2 4 6
, , , . . . .)
1 2 3
Lecture 1.5 Some Properties of Real Numbers 27
Basic Arithmetic
• a + b = b + a (commutativity of addition);
• (a + b) + c = a + (b + c) (associativity of addition);
• ab = ba (commutativity of multiplication);
• (a + b)c = ac + bc (distributivity).
These properties are essentially the axioms for an ordered field. If we add the
Dedekind axiom of the last lecture then we have the axioms defining the real num-
bers.
From these basic rules or axioms all the well-known and well-used properties of
real numbers follow.
28 Lecture 1.5 Some Properties of Real Numbers
am · an = am+n
(am )n = amn
(ab)n = an bn ,
Proof The first three properties are trivial consequences of the definition and are
left as exercises.
|a + b| = −(a + b)
= −a − b
= −a + (−b)
≤ |a| + |b|,
|a + b| = a + b
≤ |a| + |b|.
Lecture 1.5 Some Properties of Real Numbers 29
so that
|a − b| ≥ |a| − |b|.
Similarly,
|b| = |(b − a) + a| ≤ |b − a| + |a|,
so that
|b − a| = |a − b| ≥ |b| − |a|.
Combine the two inequalities for |a − b|,
|a − b| ≥ ±(|a| − |b|).
So we have (5),
|a − b| ≥ ||a| − |b||.
We first observe that all these sets are infinite. The examples 1 and 2 might give
the impression that an infinite set must contain elements which are unboundedly
large. However, examples 3, 4 and 5 correct that false impression!
The sets of examples 4 and 5 are of an important type, each is called an interval.
Example 4 in which the end points, x = 1 and 2, are members of the set is called a
closed interval and denoted by [1, 2]. In general, the closed interval from a to b with
a ≤ b is the set
[a, b] = {x ∈ R : a ≤ x ≤ b}.
30 Lecture 1.5 Some Properties of Real Numbers
The set in example 5, in which the end points are excluded from the set, is called an
open interval and denoted by (1, 2). In general, the open interval from a to b with
a ≤ b is the set
(a, b) = {x ∈ R : a < x < b}.
For example, in the set S = [−1, 1] the number 1 is greater than any other
member of S. The set S also has a least member, −1.
On the other hand the set S = (−1, 1) does not have a largest member. Given
any element of S we can always find another member of S which is larger. Suppose
k ∈ S = (−1, 1), then −1 < 21 (k + 1) < 1 so 21 (k + 1) ∈ S. But, 12 (k + 1) > k.
What both sets do have in common is the fact that they are bounded above (and
below). That is, there is a number K (not necessarily in the set S) such that x ≤ K
for all x ∈ S.
Solution The set is clearly bounded. The number 12 serves as a lower bound – or
any number less than 12 . The number 1 (or any number greater than 1) serves as an
upper bound. Note that the set does have a least member, 12 . However, the set does
not have a greatest member. The elements of the set form an increasing sequence
which is bounded above by 1. 2
K ≥ x for all x ∈ S and for any ε > 0 there is an s ∈ S such that s > K − ε.
Lecture 1.5 Some Properties of Real Numbers 31
In a similar fashion we may seek a greatest lower bound k with the properties
k ≤ x for all x ∈ S and for any ε > 0 there is an s ∈ S such that s < k + ε.
The least upper bound is usually called the supremum and we write
K = sup S, read as K is the supremum of the set S.
The greatest lower bound is usually called the infimum, and we write
k = inf S.
Using the Dedekind axiom we can prove that a (non-empty) set of real num-
bers which is bounded above must have a supremum. And in a similar way that a
nonempty set of real numbers which is bounded below must have an infimum. We
will not give the proof here – although we now certainly have the background to
tackle it. As illustrations consider the sets
1. S1 = {x ∈ Q : 0 ≤ x ≤ 12 }
2. S2 = {x ∈ Q : x2 < 2}
3. S3 = { 21 , 32 , 43 , . . . , n+1
n
, . . .}.
♠ Exercises 5
1. If a, b ∈ R and
a < b + ε, for any ε > 0,
prove a ≤ b.
[Hint: try a proof by contradiction. The negative of the statement a ≤ b is
a > b.]
a + b + |a − b|
maximum {a, b} =
2
minimum {a, b} = − maximum {−a, −b}
a + b − |a − b|
=
2
3. Find (if they exist) the supremum and infimum of the following sets
(a) {1, 12 , 31 , 41 , . . . , n1 , . . .}
(b) {1.1, (1.1)2, (1.1)3 , . . . , (1.1)n , . . .}
qp
√ p√ √
(c) { 2, 2, 2, . . .}
Our system of real numbers R would appear to be perfectly adequate for mathe-
matics and indeed most applications. We feel comfortable with the fact that R can
be represented as the real number line every point of which is in R. We think of
rulers and other measuring devices as segments or intervals of the real number line.
Why would we want to extend further our system of numbers?
Well, curiosity would be one reason. If it is possible let’s try it! On the other
hand there are sound mathematical reasons why we might want to extend our num-
ber system. The extension to the complex numbers was first carried out by Gauss
in about 1795. Complex numbers are indispensable in modern mathematics and
physics. In Quantum theory in particular complex numbers are used in a funda-
mental way.
One motivation for the extension of the number system is to note that only
non-negative numbers have square roots. In all other respects there is complete
symmetry between positive and negative numbers. Another way of stating this is
to note that the equation
x2 = a, with a ∈ R,
x2 = −1
We extend the field of real numbers to the field of complex numbers by adjoining
a new element, i, to the set. This element i is defined as a solution to the equation
i2 = −1.
√
It is traditional to write i = −1 as the solution; note that x2 = −1 then has two
possible solutions x = ±i.
We also include in our set of complex numbers all numbers which can be formed
from i and R by use of the operations + and −. The elements of this field of complex
numbers then take the form a + ib, where a, b ∈ R.
All the axioms for multiplication and addition of the real number system listed
in the last lecture apply to C. However, the order axioms involving < and > no
longer apply. The real numbers can be included in C as a subset, namely R can be
identified with the complex numbers of the form a + i0. For this reason we write
a + i0 = a.
So we may think of complex numbers as numbers of the form a + ib, with a and
b real. We refer to the a as the real part and b as the imaginary part.
z = a + ib, with a, b ∈ R.
Notice that
Example
Solution
You should not bother memorising this formula, instead you simply multiply
out the brackets as per our example. In the general case we have
So the policy is expand everything in sight, use i2 = −1 and then collect terms into
two groups those with an i and those without an i. The terms without an i give the
real part of the product and the coefficient of the i gives the imaginary part. So,
Re (z1 z2 ) = a1 a2 − b1 b2 and
Im (z1 z2 ) = a1 b2 + b1 a2 .
Example
(a) (3 + 4i)(6 + i) = 3 · (6 + i) + 4i · 6 + i)
= 18 + 3i + 24i + 4i2
= 18 + 3i + 24i − 4
= 14 + 27i.
√ √ √ √
(c) ( 2 + i 3)(1 − i) = 2(1 − i) + i 3(1 − i)
√ √ √ √
= 2−i 2+i 3+ 3
√ √ √ √
= ( 2 + 3) + i( 3 − 2).
√ √ √
(d) ( 2 − i)2 = ( 2)2 − 2 2i + i2
√
= 2 − 2 2i − 1
√
= 1 − 2 2i.
To specify a complex number we must give two real numbers, the real and imagi-
nary parts. So two complex numbers are equal if and only if their real and imaginary
parts are equal (respectively).
z 2 = −3 + 4i.
Solution We write z = x + iy, with x and y real. Substituting into the equation
we have
z 2 = (x + iy)2 = −3 + 4i
i.e. x2 − y 2 + i2xy = −3 + 4i.
Lecture 1.6 Complex Numbers 37
Now equate real and imaginary parts – remember the complex number on the
left is equal to that on the right if and only if their real and imaginary parts are
(respectively) equal. We get
x2 − y 2 = −3 and 2xy = 4.
2
y= ,
x
which we substitute into the first equation. This gives
2
2 2
x − = −3.
x
x4 − 4 = −3x2
i.e. x4 + 3x2 − 4 = 0.
(x2 + 4)(x2 − 1) = 0,
z = 1 + 2i or z = −1 − 2i.
1+i
Example Write 3−2i
in the form a + ib.
1+i 1 + i 3 + 2i (1 + i)(3 + 2i)
Solution = · =
3 − 2i 3 − 2i 3 + 2i 32 + 22
3 + 2i + 3i − 2
=
9+4
1 + 5i
=
13
1+i 1 5
i.e. = +i
3 − 2i 13 13
2
We can now solve all quadratic equations with real coefficients. For example we
solve
z 2 + z + 1 = 0.
Using the usual quadratic formula
√
−1 ±12 − 4.1.1
z =
√2
−1 ± −3
= .
2
Clearly the solutions are complex, we write them in the standard a + ib format.
We note that √ √ √ √ √
−3 = −1 × 3 = −1 · 3 = i 3.
So the solutions to the quadratic are
√ √
−1 ± −3 1 3
z= =− ±i .
2 2 2
Lecture 1.6 Complex Numbers 39
♠ Exercises 6
1 + sin θ + i cos θ
2. Show that = sin θ + i cos θ.
1 + sin θ − i cos θ
3. Solve the following equations for z, writing your solution in the form a + ib
z 2 = 6 − 8i.
40 Lecture 1.7 Complex Numbers (continued)
Complex Conjungation
Notice that all we have to do to get the complex conjugate of a complex number
is to replace the imaginary part by its negative.
Example
(c) If z = 5 then z = 5.
Theorem If z1 , z2 ∈ C then z1 z2 = z 1 z 2 .
Proof Let z1 = x1 +iy1 , and z2 = x2 +iy2 . Then z1 z2 = (x1 x2 −y1 y2 )+i(y1x2 +y2 x1 ).
Now,
(a) z1 + z2 = z 1 + z 2
z1 z
(b) = 1 , for z2 6= 0.
z2 z2
Lecture 1.7 Complex Numbers (continued) 41
Proof Exercise!
Taken together these two theorems tell us that if we wish to take the complex
conjugate of an expression involving complex numbers all we have to do is write
down the expression with each complex number replaced by its conjugate.
Example
For z1 , z2 , z3 ∈ C we have
z12 + 2z1 z2 + z2 + 1 − i z 21 + 2z 1 z 2 + z 2 + 1 + i
= ,
z3 z3
for z3 6= 0.
In the previous lecture we mentioned the method for writing a complex number
z1
(with z1 , z2 ∈ C) in the form x + iy. If z2 = x2 + iy2 we had to multiply the
z2
x2 − iy2
expression by . We now see that the method is simply multiplying the
x2 − iy2
z2
expression by . The method works because z2 z 2 is always a real number. This is
z2
easily seen z2 z 2 = (x2 + iy2 )(x2 − iy2 ) = (x2 )2 + (y2 )2 .
zz = (Re(z))2 + (Im(z))2 .
Example
(b) If z = 2 then zz = 22 = 4
Argand Diagrams
A complex number has two parts, its real part and its imaginary part. We can
think of a complex number z = x + iy (x, y ∈ R) as an ordered pair (x, y). In this
way we see that a complex number can be thought of as a point in the plane –
we simply plot the point (x, y) in the xy plane. The resulting picture is called an
Argand diagram. We represent z by the point (x, y) in the plane.
42 Lecture 1.7 Complex Numbers (continued)
3 (1, 3) or 1 + 3i
(−1, 1) or −1 + i 1 (1, 1) or 1 + i
−2 −1 1 2 3 x
The real numbers (numbers with no imaginary part) simply lie on the x-axis.
The pure imaginary numbers (those with no real part) lie along the y-axis.
2 (3, 2) or 3 + 2i
(−1, 1) or −1 + i 1
−2 −1 1 2 3 x
(−1, −1) or −1 − i −1
−2 (3, −2) or 3 − 2i
y z or (x, y)
|z|
x x
Example
p √
(a) If z = 2 − 3i then |z| = 22 + (−3)2 = 13.
p
(b) If z = 3 − 4i then |z| = 32 + (−4)2 = 5.
Theorem
If z1 , z2 ∈ C then
Proof Both follow from the earlier theorem on the complex conjugate. We will
prove (ii) and leave (i) as an exercise. For (ii) we have
s r r
z1 z1 z1 z1 z 1 z1 z 1
= = . =
z2 z2 z2 z2 z 2 z2 z 2
√
z1 z 1
= √
z2 z 2
|z1 |
=
|z2 |.
44 Lecture 1.7 Complex Numbers (continued)
|z1 | + |z2 | = |1 + i| + |1 − i|
√ p
= 12 + 12 + 12 + (−1)2
√ √
= 2+ 2
√
= 2 2,
Theorem If z1 , z2 ∈ C then
Proof Notice that |z1 + z2 | and |z1 | + |z2 | are both non-negative so we can prove
the inequality by proving
Hence,
√ √
(|z1 | + |z2 |)2 − |z1 + z2 |2 = 2 z1 z 1 z2 z 2 − z1 z 2 − z 1 z2
p
= 2 (z1 z 2 )(z 1 z2 ) − z1 z 2 − z 1 z2 .
Thus
(|z1 | + |z2 |)2 − |z1 + z2 |2 ≥ 0
and our result follows.
This result is also known as the triangle in equality. It can be thought of geo-
metrically as stating the old fact that the length of one side of a triangle is less than
the sum of the lengths of the other two sides.
De Moivre’s Theorem
y z = x + iy
θ
x x
What can we say about (cos θ + i sin θ)n ? It is here that the remarkable de
Moivre’s theorem enters.
Theorem If θ ∈ R then
(cos θ + i sin θ)n = cos(nθ) + i sin(nθ),
for n any rational number.
Example
√ 1 1 √ π π
(a) 1 + i = 2 √ + i√ = 2 cos + i sin .
2 2 4 4
(b) i = cos π2 + i sin π2 .
♠ Exercises 7
1. For each of the following complex numbers write down the complex conjugate
and modulus
(a) 6 + 2i
(b) 1 − 3i
1+i
(c) √
2
1
(d) 1+i
2−3i
(e) 1−i
(f) i.
for n ∈ N.
6.* Extend de Moivre’s theorem taking n to be (i) a negative integer and then (ii)
n = p/q, a rational number.
48 Lecture 1.8 Functions on R
f : A −→ B
and
f : a 7−→ b or f(a) = b.
In lecture 3 we also met the idea of composition of two functions f and g. For
f : A −→ B, a 7−→ b and
g : B −→ C, b 7−→ c
we define
g ◦ f : A −→ C, a 7−→ c
by
g ◦ f (a) = g(f (a)).
Another way of saying that f is injective is to say that, for every b ∈ B there is
at most one a ∈ A with f (a) = b. Or that, whenever a1 6= a2 , with a1 , a2 ∈ A, then
f (a1 ) 6= f (a2 ).
Example
(a) f : N −→ R
1
f : n 7−→ f (n) = .
1+n
This function is one-one (injective) but not surjective.
(b) f : R −→ R
f (x) = x, x ∈ R.
(c) f : R −→ R
f (x) = x2 .
g : R −→ [0, ∞)
f (x) = x2 .
(d) f : R −→ {0, 1}
0, when x is irrational,
f (x) =
1, when x is rational.
g : [0, 1] −→ [0, 1]
√
f (x) = 1 − x2
The concepts of injective, surjective and bijective are now particularly easy to
deal with. They are basically about how many ”x-values” correspond to a given ”y-
value” – one or none for injective; one or more for surjective; and one and only one
for bijective. The easiest way to visualise this is to draw the graph of the function,
the question now becomes how many times does a horizontal line drawn at the given
y-value hit the graph? Try it on the following examples.
Example Decide whether the following functions are injective or surjective justi-
fying your statements.
(a) f : N −→ R, n 7−→ n2 − 9
√
(b) f : [−1, 1] = {x ∈ R : −1 ≤ x ≤ 1} −→ R, x 7−→ 1 − x4
1
(c) f : R \ {1} −→ R \ {0}, x 7−→
x−1
Solution
(a) For n1 , n2 ∈ N with f (n1 ) = f (n2 ) we obtain n21 − 9 = n22 − 9. Thus n21 = n22 ,
so that n1 = n2 as n1 , n2 ≥ 0. Hence f is injective. As there is no natural
number n with f (n) = 2, the function is not surjective.
(b) As f (−x) = f (x) for x ∈ [−1, 1], the function is not injective. Note that the
function is well–defined as 1 − x4 ≥ 0 for all x ∈ [−1, 1]. Since the square root
is always greater than or equal to 0, the function is not surjective.
1 1
= ⇐⇒ x2 − 1 = x1 − 1
x1 − 1 x2 − 1
⇐⇒ x2 = x1
1
y= ⇐⇒ y(x − 1) = 1
x−1
1
⇐⇒ x − 1 =
y
1 1+y
⇐⇒ x = + 1 =
y y
Your past experience with graph drawing will have taught you that the behaviour
of the function as x −→ ±∞, or as x approaches a singular point of the function, is
a most important feature of your sketch. To discuss such behaviour rigourously we
need to understand the notion of a limit.
The idea we want to make precise is the following. Suppose we have a function
f : D −→ R with D ⊆ R and c ∈ R. Our question is does f (x) approach some
specific number, L, as x approaches c. If it does then we say f (x) has a limit at
x = c. We write this symbolically as
f (x) −→ L as x −→ c,
lim f (x) = L,
x−→c
read as the “limit of f (x) as x approaches c is L”. The idea here is that we can
make f (x) as close to L as we like simply by taking x sufficiently near to c.
You might like to think of this idea in terms of approximation. We can make the
difference f (x) − L as small as we like by simply taking x close enough to c. Well,
we can do this provided f (x) has limit L as x approaches c. We are saying that I
can always make |f (x) − L| smaller than any number you give me simply by taking
x close enough to c.
Give me any number, as small as you like, say 0.001. Then we can make |f (x)−1|
smaller than 0.001 by taking x close enough to 1.
We want |x2 − 1| < 0.001. We can achieve this by making x just less than 1 or
just greater than 1.
52 Lecture 1.8 Functions on R
Let’s have a look at the value of x, x0 say for which x20 − 1 = 0.001, so
√
x0 = + 1.001. Because of the absolute value signs |x2 − 1| we also need the value
√
x1 , x21 − 1 = −0.001 i.e. x1 = 0.999. We conclude |x2 − 1| < 0.001 provided
√ √
x1 = 0.999 < x < x0 = 1.001.
Now x0 + 1.0005 and x1 + 0.9095 so we can ensure |x2 −1| < 0.001 by (for example)
No matter what number you give me, lets say it is ε, I can always find x0 , x1 such
that
where x0 and x1 are near x = 1. Such a statement is just about right for the abstract
definition of a limit, we’ll say more about this in the next lecture.
Example
lim 2x = 1.
x−→0
From this table of calculated values its clear that it would suffice to take −0.01 <
x < 0.01 to ensure that
|2x − 1| < 0.01.
Lecture 1.8 Functions on R 53
1
Example Sketch the graph of the function f : R \ {−1} −→ R, x 7−→ .
1+x
Solution. The first thing you notice is that f (x) is “badly behaved” near x = −1,
it’s undefined there. What happens as we approach x = −1? Suppose we approach
from below, i.e. consider x = −1 − δ where δ is small and positive. Then
1 1
f (−1 − δ) = =− ,
1 + (−1 − δ) δ
so if δ is small 1δ is large. The function is large in magnitude and negative just below
x = −1. As we approach x = −1 from below, f (x) −→ −∞.
Next let’s see what happens when we approach from above, let x = −1 + δ, δ
small and positive. Then
1 1
f (−1 + δ) = = ,
1 + (−1 + δ) δ
−1
x
The behaviour of the function for x > −1 or x < −1 is easy to determine once
1
we know what happens as x −→ ±∞. For x large and positive 1+x will be small
and positive. That is f (x) −→ 0 as x −→ +∞. We use the notation f (x) −→ 0+
+
to indicate that f (x) is approaching zero from above, i.e. from the positive side.
1
For x large and negative 1+x will be small and negative. That is f (x) −→ 0−
as x −→ −∞, f (x) approaches 0 from below (i.e. from the negative side). One
important point on the graph is where it cuts the y-axis, this is at x = 0 and
f (0) = 1; so the graph cuts the y-axis at (0, 1). This is enough information to give
us a reasonably accurate sketch.
54 Lecture 1.8 Functions on R
−1
x
1
y = f (x) = 1+x
2
Lecture 1.8 Functions on R 55
♠ Exercises 8
2. For each of the functions of question 1 state whether or not the function is
injective or surjective.
1
3. Find a δ > 0 such that |x3 − 1| < 10
whenever |x − 1| < δ.
[Hint: |x − 1| < δ means 1 − δ < x < 1 + δ. You have to find a δ > 0 which
ensures that
1
|x3 − 1| < .]
10
1
4. Sketch the graph of f (x) = x2 −4
.
5*. Show that given any ε > 0 we can always find δ such that
1
x + 1 − 1 <ε
lim f (x) = L
x−→c
means that given any number ε, no matter how small, we can ensure that the
distance, |f (x) − L|, between f (x) and L, is smaller than ε by taking x close enough
to c, that is, by making the distance |x − x| small enough. Let’s state this formally.
Note that 0 < |x − c| < δ excludes x = c. For the limit we are only interested
in the behaviour of the function near c, not in f (c). In fact, we don’t even require
that c is in the domain of f .
To show that f (x) has limit L we have to demonstrate the existence of a δ > 0 for
any given ε. The idea is exactly that of the last lecture, you give me any number,
no matter how small, call it ε. Then I have to come up with a δ > 0 such that
|f (x) − L| <∈ whenever 0 < |x − c| < δ.
(a) limx−→a c = c
(b) limx−→a x = a
Proof
(a) As |c − c| = 0 < ε no matter how far x is from a, we may choose any δ > 0,
for example, put δ = 231. Then |c − c| = 0 < ε whenever |x − a| < 231. Thus
limx−→a c = c.
1
Example Prove that −→ 1 as x −→ 0.
1+x
Our strategy has to be to find a δ > 0, depending on ε, such that if 0 < |x−0| < δ
1
then − 1 < ε.
1+x
1 −x
Now we note that − 1 =
1+x 1 + x
|x|
=
|1 + x|
|x|
≤ , for |x| < 1.
1 − |x|
[Note we have increased the numerator |x| to δ and decreased the denominator
|x| δ
from 1 − |x| to 1 − δ. So the fraction has been increased to . ]
1 − |x| 1−δ
δ ε
So we have to choose δ so that < ε ie. δ < .
1−δ 1+ε
ε
|x − 0| < δ < .
1+ε
We are finished. 2
As you can see there is quite an art in getting one of these proofs to work.
Remember you are given ε > 0 and you have to find an appropriate δ such that
|x − c| < δ implies |f (x) − L| < ε. Here is another example:
1+x
Example Prove that lim = 1.
x−→0 1−x
1+x 1 + |x|
≤ .
1−x 1 − |x|
a
[Remember: You increase the value of a fraction b
by increasing the numerator
a or decreasing the denominator b (or both).]
Now, for |x| < δ < 1 we have |1 − x| ≥ 1 − |x| > 0 and hence
1 1
≥ .
1 − |x| |1 − x|
We obtain
1 + x x
1 − x − 1 = 1 − x
|x|
=
|1 − x|
|x|
≤
1 − |x|
δ
<
1−δ
δ ε 1 + x
To obtain = ε we must have δ = . Then δ < 1, so that
− 1 < ε
1−δ 1+ε 1−x
whenever |x| < δ = ε/(1 + ε).
1+x
This is the formal definition of lim = 1. 2
x−→0 1−x
One-Sided Limits
Implicit in our definition of limits, so far, is the notion that we approach the
point x = c from either side. That is f (x) −→ L whether we approach c from
above, ie. x −→ c+ , or from below, x −→ c− . But there are cases where we can
√
approach from only one side. For example, the limit of x as x −→ 0. This limit,
for real valued functions, must be from above, ie. we should really write x −→ 0+ .
This is a one-side limit.
Lecture 1.9 Limits 59
Solution
Notice the jump at x = 0. We need to discuss separately the limits from above and
below.
Two important one-sides limits are the limits x −→ ±∞, if they exist for the
given function.
If a function is defined for large values of x it may or may not approach a finite
value as x −→ ±∞.
60 Lecture 1.9 Limits
All this says is that if you give me any ε > 0 I can find a number X > 0 such
that |f (x) − L| is smaller than ε whenever |x| > X.
Example
1
Prove that √ −→ 0 as x −→ ∞.
x
1
Solution Let ε > 0 be given. We have to find an X > 0 such that √ < ε
x
whenever x > X. Well this is pretty easy, in this case.
1 1
Let x > X, with X to be determined. Then √ < √ , as decreasing the
x X
1 1 1
denominator increases the fraction. So √ = √ < √ = ε, for X = ε12 . We are
x x X
1 1
finished, we have √ < ε whenever x > ε2 . 2
x
√ √
x+1− x−1
Example Prove that −→ 0 as x −→ ∞.
x
Solution The trick here is to make the troublesome numerator “look better”. We
rationalize the numerator (i.e. eliminate the square roots),
√ √ √ √
x+1− x−1 x + 1 + x − 1 = (x + 1) − (x − 1), “difference of squares”.
= 2
Therefore
√ √ √ √ √ √
x+1− x−1 x+1− x−1 x+1+ x−1
= ·√ √
x x x−1+ x−1
2
= √ √ .
x x+1+ x−1
It’s now clear that this expression tends to zero as x −→ ∞. We need to prove
it formally.
Lecture 1.9 Limits 61
so
1
√ √ < 1.
x+1+ x−1
Thus we have
√ √
x+1− x−1 2 2
= √ √ < .
x x x+1+ x−1 x
♠ Exercises 9
1
3. Discuss the limit of 1−x2
as x −→ 1.
4. Prove that r
x+1
lim = 1.
x−→∞ x−1
Lecture 1.10 Limits and Continuous Functions 63
In manipulating and calculating limits it is useful to have a set of rules for such
things as the limit of a sum or product. The following theorem provides just those
rules.
Then
Proof
(a) This just says that the “limit of a sum is the sum of the limits”.
Let ε > 0 be given. Assume one of α, β is non-zero. Now both f and g have
well defined limits as x −→ c so we know there exist a number δ such that
ε ε
|f (x) − L1 | < and |g(x) − L2 | <
|α| + |β| |α| + |β|
Whenever |x − c| < δ.
Which completes the proof of (a).
64 Lecture 1.10 Limits and Continuous Functions
Now f and g both have well-defined limits as x −→ c, so we have for any ε > 0
and some δ > 0
whenever |x − c| < δ.
From the second of these inequalities we also have
i.e.,
L2 − ε < g(x) < L2 + ε, whenever |x − c| < δ.
So, whenever |x − c| < δ, we have
This is true for any ε > 0, in particular for ε2 + ε (|L1 | + |L2 |) = ε, i.e. for
q
1 2
ε= − (|L1 | + |L2 |) + (|L1 | + |L2 |) + 4ε
2
|f (x)g(x) − L1 L2 | < ε
whenever |x − c| < δ.
3 − 2x 3 − 2x
(i) lim (ii) lim
x−→0 x + 6 x−→∞ x+6
x+1 x+4
(iii) lim cos x (iv) lim .
x−→0 x − 1 x−→∞ x
Solution
Recall that limx−→a c = c and limx−→a x = a for a, c ∈ R and use the Theorem
above.
(i)
3 − 2x lim (3 − 2x)
x−→0
lim =
x−→0 x + 6 lim (x + 6)
x−→0
lim (3) − lim (2x)
x−→0 x−→0
=
lim (x) + lim (6)
x−→0 x−→0
3 − lim (2) lim (x)
x−→0 x−→0
=
0+6
3−0 1
= = .
6 2
x+1 lim (x + 1)
x−→0
lim cos x = lim cos x
x−→0 x − 1 lim (x − 1) x−→0
x−→0
1
= ·1
−1
= −1.
66 Lecture 1.10 Limits and Continuous Functions
(iv) Again we cannot use (c) directly. We rewrite the expression so that we
can apply (c):
x+4 4
lim = lim 1+ = 1.
x−→∞ x x−→∞ x
Continuity
You will have gained the impression that the common functions are what could
reasonably be called continuous, that is if you trace along the function by changing
x the graph of the function it does not have any breaks. We think, intuitively, of
a function being continuous if we can draw its graph without removing the pencil
from the paper.
We now refine these ideas and formalise them mathematically. In fact, a little
reflection should convince you that for a function to be continuous at x = c we
require two things:
A function f : D −→ R, D ⊆ R, is continuous at c ∈ D if
1
Example Prove that f : R \ {0} −→ R, x 7−→ x
is continuous at x for all
x ∈ (0, ∞).
Solution We take x > 0 and let ε > 0 be given. Then for any c ∈ (0, ∞),
1 1 |c − x|
− = .
x c cx
Armed with this theorem we can considerably expand our list of continuous
functions. All polynomial functions f : R −→ R given by
f (x) = a0 + a1 x + a2 x2 + . . . + an xn
f (x) = p(x)/q(x)
For continuity at x = c it is necessary and sufficient that lim+ f (x) = f (c) = lim− f (x).
x−→c x−→c
This typically fails when there is a jump in f (x) from one side of x = c to the other.
(a) (
x2 , x<0
f : R −→ R, x 7−→ .
1 + x, x > 0
Here continuity fails at x = 0.
(b)
f : R −→ R, x 7−→ [x],
where [x] is the greatest integer less than or equal to x. This function has
discontinuities at x = n for all n ∈ N.
♠ Exercises 10
1. Sketch the two functions of the last example, indicating the discontinuities.
x2 + x + 1
(c) lim
x−→∞ 2x2 − x − 1
x2 − 3x + 2
.
x3 − 3x2 + 2
1
4. Prove formally that f : (−1, ∞] → R, x 7−→ is continuous.
x+1
Lecture 1.11 Continuous Functions 69
Graphs of continuous functions have all the “nice” properties one would expect
of a curve that is drawn on paper without removing your pencil from the page. A
good example of such properties is the intermediate value theorem.
Theorem Suppose f is continuous on [a, b] and that f (a) 6= f (b). Then f (x), x ∈
(a, b), takes every value between f (a) and f (b).
Remark: As your pencil moves from f (a) to f (b) you must pass over every value
(every real number) between f (a) and f (b).
Proof We restrict our function f to [a, b] and suppose that there is a y ∈ ]f (a), f (b)[ \Im(f).
Put A := {x ∈ [a, b] | f (t) < y for all t ∈ [a, x]}.
As f (s) 6= y, either f (s) < y, or f (s) > y. Put ε := |f (s) − y|, and take δ > 0.
Then
δ
Choose x ∈ A with s − 2
< x.
We know from the previous lecture that all polynomial functions and all rational
functions are continuous.
We would also guess that the trigonometric functions, sine, cosine, tangent,
cotangent, secant and cosecant are continuous at all points of their respective natural
domains. Just sketch them! Let’s formalise this.
Proof We will only prove continuity of sin x, the other functions can be dealt with
in a similar manner. We measure x in radians, 360◦ is 2π radians, 180◦ is π radians
and 90◦ is π2 radians. In general, x = 360
2π
◦ θ, where θ is the angle measured in degrees.
Lecture 1.11 Continuous Functions 71
In the diagram below we have a unit circle, you need to know that if the angle
made by two radii (each one unit) is x radians then the length of the arc of the circle
between the ends of the radii is x. The circumference of the circle (the angle in this
case is 2π) is just 2π — remember the radius is 1.
sin c c−x
sin x
c
x
1
c−x
sin c − sin x
The dotted lines form a right-angle triangle. The hypotenuse of this triangle
must be greater in length than either of the other two sides. In particular, it is
larger than sin c − sin x. However, this hypotenuse is smaller in length than the
circular arc of length c − x.
π
So for 2
≥ x ≥ 0 we have
sin c − sin x ≤ c − x.
This proves lim− sin x = sin c. In a similar way we can prove lim+ sin x = sin c.
x→c x→c
Which then shows that sin x is continuous on [0, π2 ]. The result is easy to extend to
all values of x.
Using the continuity results established thus far with the results on continuity of
sums and products of the last lecture we can extend our list of continuous functions
72 Lecture 1.11 Continuous Functions
We are also now in a position to establish many limits involving these functions.
A particularly useful way to establish these limits is the squeezing principle. It says
that, if two functions have limit L as x approaches c ∈ R, then any function in
between must also have limit L as x approaches c.
Let f, g and h be functions whose domains contain the open interval (a, b), such
that
g(x) ≤ f (x) ≤ h(x)
for all x ∈ (a, c) and for all x ∈ (c, b) with a < c < b.
lim f (x) = L.
x→c
Proof Let x ∈ (a, b). From the definition of the limit we know that for any ε > 0
there exists a δ > 0 such that
Theorem
sin x 1 − cos x
(a) lim =1 and (b) lim = 0.
x→0 x x→0 x
sin x
Proof (a) We first establish “squeezing” inequalities for x
. We simply adapt the
techniques used in the proof of the continuity of sin x.
x tan x
1
sin x
x
1
x
1
sin x
x
1
On the other hand comparing the area of the large triangle (tan x as one side)
with the area of the sector of the unit circle subtended by the angle x we have
1 x
Area of triangle = 2
· 1 · tan x) ≥ Area of sector = 2
,
tan x ≥ x ≥ sin x.
sin x
Now x ≥ sin x so x
≥ 1 (for x > 0). Also tan x ≥ x is just
sin x
≥x ,
cos x
which can be written as
sin x
≥ cos x for x > 0, x near 0.
x
Therefore
sin x
cos x ≤ ≤ 1.
x
74 Lecture 1.11 Continuous Functions
sin x sin(−x)
= ≥ cos(−x) = cos x, as sin(−x) = − sin x.
x (−x)
Now,
lim cos x = 1 and lim 1 = 1.
x→0 x→0
Now take limits, remember we have proved the limit of product is the product of
the limits, etc.,
1 − cos x sin x sin x
lim = lim lim
x→0 x x→0 x x→0 1 + cos x
0
= (1) = 0.
1+1
Lecture 1.11 Continuous Functions 75
♠ Exercises 11
sin h x + sin x
(a) lim (b) lim
h→0 2h x→0 x
1 − cos2 x sin(2y)
(c) lim (d) lim
x→0 x y→0 y
1 − cos x h
(e) lim (f) lim
x→0 sin x h→0 tan h
sin x sin x
(g) lim+ (h) lim− .
x→0 |x| x→0 |x|
a continuous function at x = 0.
In this lecture we want to confirm the intuition that continuous functions on a closed
interval are “nice” functions.
Remarks It is essential to the truth of the theorem that the interval is closed. The
function x1 is continuous in (0, 1], but is certainly not bounded as x −→ 0. We can
make x1 arbitrarily large by taking x near enough to 0.
Since f is continuous on [a, b], it follows by the Lemma above, that there is an
r > 0 such that f is bounded on the interval (s − r, s + r) ∩ [a, b], and so, in
particular, on [a, s + r).
This is again quite a subtle proof. Read it carefully to get the essential idea of
the argument.
Example If they exist, find supf and inff for the following:
Solution
(a) Here supf = 1 and inff = 0. Note that there is no x ∈ (0, 1) such that
f (x) = sup or f (x) = inf. We say that f (x) does not attain its bounds.
(b) Here inff (x) = 1 and supf does not exist as the function is unbounded above –
1
x
→ ∞ as x −→ 0. Again we see there is no x ∈ (0, 1) such that f (x) = inff .
The function does not attain its lower bound.
(c) Here supf = 1 and inff = 0. In this case we see that f does attain its bounds
– note the interval is closed.
Proof We prove there exists an x1 ∈ [a, b] such that f (x1 ) = M. The proof for the
infinum is similar.
The proof we give is by contradiction – there is another proof using the so-called
method of bisection.
Since M − f (x) 6= 0 for all x ∈ [a, b] we see, from Lecture 10, that
1
M − f (x)
is continuous on [a, b].
1
So, by our previous theorem, M −f (x)
is bounded on [a, b]. We have
1
< K, for some finite K ∈ R, and all x ∈ [a, b].
M − f (x)
Then, as M − f (x) > 0,
1
< M − f (x)
K
or
1
f (x) < M − for x ∈ [a, b].
K
But this contradicts M = supf (x), M is the least upper bound, there cannot be a
smaller bound M − K1 ! We have our proof by contradiction.
We have had two reasonably difficult proofs for what would seem to be two
intuitively obvious facts!
Example State whether or not the following functions meet the conditions of the
previous two theorems. If the function does, what are its supremum and infinum?
1
(c) f : [−1, 1] \ {0} −→ R, x 7−→ x2
;
Solution The hypothesis for both theorems is that f is defined on a closed interval
[a, b] and continuous.
(a) Does not satisfy the hypothesis, the interval is not closed.
(c) The function does not satisfy the hypothesis as it is not defined on a closed
interval.
(d) The function cos is continuous and restricting a continuous function to a closed
interval we obtain another continuous function. Thus the hypothesis is satis-
fied.
sup f = 1 and inff = −1.
Monotone Functions
It is easy to see that every strictly monotone function must be injective (one-one)
but it is by no means true that every injective function has to be strictly monotone.
x, x<0
(c) f : R −→ R, x 7−→ 0, 0 ≤ x < 1;
x − 1, x ≥ 1
Note that the functions in (a), (b) and (d) are strictly monotone, continuous
and injective. The function in example (c) is not strictly monotone, though it is
monotone. The example (c) is continuous but not injective. You should draw the
graphs of these functions to convince yourself of these facts.
80 Lecture 1.12 More on Continuity
(b) f : R −→ R, x 7−→ x2 ;
(
x, x<0
(c) f : R −→ R, x 7−→ ;
x − 1, x ≥ 0
The functions in (a), (b) and (d) are all continuous, but they are not injective.
Hence they cannot be monotone. The function in (c) is not continuous at x = 0
and not monotone. It is, however, injective. In fact, the function in (c) is monotone
on each of the intervals (−∞, 0) and [0, ∞), separately. The same is also true for
the other functions – there exist subintervals on which the (restricted) function is
monotone.
f (c − δ) = f (c + δ).
However, each of these possibilities contradicts the fact that f is injective. We can
thus have only one subinterval – f is either strictly increasing on [a, b] or strictly
decreasing on [a, b]. It’s monotone. 2
♠ Exercises 12
3. State whether or not the following functions are monotone. Also decide if
the functions are are injective, surjective or bijective. If they exist, find the
supremum and infimum in each case and state whether or not the function
attains its bounds.
4*. Let f : [a, b] −→ R be bounded, such that, for every pair of values x1 , x2 with
a ≤ x1 ≤ x2 ≤ b
x1 + x2 1
f ≤ [f (x1 ) + f (x2 )] .
2 2
Prove f is continuous on (a, b).
82 Lecture 1.13 Sequences
f : N −→ R
f : n 7−→ f (n).
Of course this is not the notation you would be familiar with. The usual way of
specifying a sequence is to give the nth term, usually denoted by an . In function
notation the nth term is just f (n). Clearly any function f : R −→ R can be
restricted to the natural numbers to give a sequence.
Example
1 1
(a) f (x) = x2
, when restricted to N gives the sequence an = n2
.
Another common way of specifying a sequence is to write down some of the early
terms. But, unless the nth term is given, this does not uniquely specify a sequence.
(a) 1, 21 , 13 , 41 , . . . , n1 , . . .
∞
1 1 1 1
(b) = 1, , , . . . , , . . .
n n=1 2 3 n
(c) f (n) = n1 .
For an infinite sequence perhaps the most interesting question is “does the se-
quence tend to a finite limit as n −→ ∞?”.
Example
∞
1
(a) tends to the limiting value 0.
n n=1
1 2 3 n
(b) , , ,..., , . . . approaches the value 1.
2 3 4 n+1
Lecture 1.13 Sequences 83
We can define a limit for a sequence in much the same way as we defined the
one-sided limits for f (x) as x −→ ±∞.
A sequence with nth term an is said to have the limit a if, given
any ε > 0, there is an N such that
We write lim an = a or an −→ a as n −→ ∞.
n−→∞
Note that if a sequence an does have a limit then it must be bounded. For n > N
we have
−ε < an − a < ε
or a − ε < an < a + ε.
Our definition of the limit can be abbreviated using some mathematical notation.
We introduce two symbols which replace commonly used mathematical phrases.
an −→ a if
If a sequence does not tend to a finite limit it does not necessarily diverge to
±∞ – there are other types of behaviour.
Example
1
(d) The sequence an = sin n
approaches 0.
an < −A ∀ n > N.
From our earlier work on limits we would expect to have a theorem on the limit
sums and products of sequences. There is such a theorem.
(b) an bn −→ ab
an a
(c) −→ , provided b 6= 0.
bn b
Proof The technique of proof is the same as for limits of functions. We will prove
(c).
Lecture 1.13 Sequences 85
Note that
an a ban − abn
− =
bn b bbn
b(an − a) + a(b − bn )
=
bbn
|b(an − a)| + |a(b − bn )|
≤ ,
|bbn |
ie.
an a |b| · |an − a| + |a| · |bn − b|
(∗) − ≤ .
bn b |bbn |
Now an −→ a and bn −→ b so for any given ε > 0 we can find an N such that
|b| |b|2
|an − a| < ε, |bn − b| < ε
4 4|a|
1
and |bn | >|b|
2
for n > N.
If |a| = 0 then we do not need the inequality for |bn − b|. Putting these last three
inequalities into (*) we have
|b| |b|2
an a |b| · 4 ε + |a| · 4|a| ε
− < = ε.
bn b |b| 21 |b|
for n > N.
an a
We have formally established −→ as n −→ ∞.
bn b
3 3
n−3 3n + 1
(a) (b)
3n + 1 n−3
(−1)n
(c) cos n (d)
n
Solution
(a) We need to use the trick of dividing out the n so as to apply our theorem on
an
.
bn
3 3
n−3 1 − n3
=
3n + 1 3 + n1
3
1 − n3
= 3 .
3 + n1
86 Lecture 1.13 Sequences
(c) The sequence oscillates through all numbers between −1 and +1.
Monotone Sequences
Monotonic sequences have the very important property that they must tend to
a limit or to +∞ or to −∞. Monotonic sequences cannot oscillate.
We will prove this property for an increasing sequence. This proof is easily
adapted to the decreasing case.
an ≤ K ∀n
or
Deal with possibility (2) first. As an is increasing an > A not just for n = N but
for all n ≥ N. So from our definition an −→ ∞ as n −→ ∞.
Now for (1). The number K is an upper bound for the set {a1 , a2 , . . . , an , . . .} =
{an }, so the set must have supremum a which satisfies
an ≤ a ∀n and
an > a − ε for some particular value n = N.
As an is increasing the second inequality must hold for all n ≥ N. So we have
an < a(< a + ε) and
a − ε < an
for all n > N. We can write this as
|an − a| < ε ∀n > N.
Which just says an has limit a.
♠ Exercises 13
As a direct consequence of our last theorem (Lecture 13) we have the following useful
result.
2. a > 1 let a = 1 + k, k > 0 then (1 + k)n > 1 + nk (Exercise 4 Lecture 2). Now
as n −→ ∞ 1 + nk −→ ∞ and so an = (1 + k)n −→ ∞.
1 1
3. 0 < a < 1. Let a = , k > 0. Then n = (1 + k)n > 1 + nk. So
1+k a
1 1
an = <
(1 + k)n 1 + nk
1 1
Now −→ 0 as n −→ ∞ so an = −→ 0 as n −→ ∞.
1 + nk (1 + k)n
4. a < 0. Firstly, if −1 < a < 0 we use a similar argument to that in 3 to find
an −→ 0. If a = −1 then an = (−1)n = ±1 and the sequence oscillates finitely.
If a < −1 then (using a similar argument to 2) we see that the sequence
oscillates infinitely.
Summing up
an = an , a constant
a > 1, an −→ ∞
a=1 an −→ 1
−1 < a < 1 an −→ 0
a = −1 an oscillates finitely
a < −1 an oscillates infinitely.
Lecture 1.14 Sequences and Series 89
Infinite Series
P
this is the sigma or summation notation. We just think of the as standing for
sum over all counting number values between the given limits.
We have still not said what we mean by “sum to infinity”. This term is fraught
with danger and we need to say precisely what we mean by it. We will give the
appropriate definition below, but first an illustration of the dangers.
∞
X
Consider (−1)n = −1 + 1 − 1 + 1 − 1 + 1 . . . . We might want to “sum” this
n=1
by grouping terms as follows (−1 + 1) + (−1 + 1) + (−1 + 1) + . . . to conclude that
the sum is zero. This would be wrong. To see this we could also group the terms
as −1 + (1 − 1) + (1 − 1) + . . . , to get a sum of −1! There is clearly something
wrong with the process of “grouping terms” for such a series.
We will define our infinite sums in terms of a sequence, the sequence of partial
sums.
Let {an }∞
n=1 be an infinite sequence. We define the nth partial sum
Sn of the sequence as
Sn = a1 + 12 + a3 + . . . + an , or
Xn
Sn = ar .
r=1
We can now talk about the possible limit of the sequence {Sn }∞
n=1 .
We think of the words “infinite sum” as applying only in the sense of a convergent
limit of the sequence of partial sums.
The geometric series is the “sum to infinity” of the geometric sequence. When
this sum exists!
1 + a + a2 + . . . + an . . .
Proof The series has first term 1 so the nth term is an = an−1 . We note that
Sn = 1 + a + a2 + . . . + an−1
and aSn = a + a2 + . . . + an .
So aSn − Sn = an − 1.
n
Therefore Sn = 1−a
1−a
, for a 6= 1, if a = 1 Sn = n. We see that if a = 1 then
Sn −→ ∞ as n −→ ∞. For a 6= 1 we have
1 − an 1 1 n
Sn = = − a .
1−a 1−a 1−a
We see that the sequence of partial sums Sn , converges if and only if the sequence
a converges. But from our section on the geometric sequence we know an tends to
n
a finite limit (which is 0) if and only if −1 < a < 1. This proves the theorem. Note
that we also have the sum to infinity in the case −1 < a < 1, it is
1
Sn −→ as n −→ ∞.
1−a
P 1
The Harmonic Series n
diverges.
Lecture 1.14 Sequences and Series 91
(i) un ≥ 0, vn ≥ 0
then
P P
(a) if vn converges un converges
or
P P
(b) if un diverges vn diverges.
Proof
n
X n
X
An exercise. Use the fact that the partial sums Sn = un and Tn = vn
r=1 r=1
form increasing sequences for n > N. Then use the results on monotone sequences.
Example Discuss the convergence (or divergence) of the following infinite series.
∞ ∞
X 1 X n
(a) √ (b)
n=1
n n=1
3n
∞
X n ∞
X n2n + 1
(c) (d)
n=1
n2 + 1 n=0
5n + 1
92 Lecture 1.14 Sequences and Series
Solution
1 1
(a) For n ≥ 1 √ ≥
n n
P
The comparison test then shows that √1 diverges.
n
♠ Exercises 14
1. A ball is dropped from a height of 20m. On each bounce the ball returns to a
height of 45 of that of the previous bounce.
What is height of the 3rd bounce?
Give a formula for the height of the nth bounce.
Give a formula for the distance travelled by the ball up to and including the
nth bounce.
What is the total distance travelled by the ball from the time it is first dropped
till the time it comes to rest?
1 2 3 n
(a) + + + ...+ + ...
3 5 7 2n + 1
1 1 1 1
(b) 1+ + + + ...+ + ...
1 2 · 1 3 · ·1 n!
X 2n + 1
(c)
3n + 1
X 1
(d) √
n2 + 1
X∞
(e) cos(kπ)
k=0
X n2
(f) .
2n
∞
X
3. Prove that if 0 ≤ an ≤ 1, then the series (an xn ) converges for 0 ≤ x < 1.
n=0
4. If n −n
1 1
an = 1 + and bn = 1 −
n n
prove that an is an increasing and bn a decreasing sequence. Prove further
that as n −→ ∞, bn − an −→ 0, and that an and bn lead to the same limit.
(This limit is e the base of the natural logarithms.)
94 Lecture 1.15 Series
Proof We know the series diverges for k = 1 so for k ≤ 1 we can use the comparison
test. We have
1 1
k
≥ , for k ≤ 1.
n n
For the case k > 1 we use the same device we used in proving the divergence of
P 1
n
.
Group terms of the series in blocks of length 2, 4, 8, . . . , to get
1 1 2 1
k
+ k < k = k−1
2 3 2 2
1 1 1 1 4 1 1
k
+ k + k + k < k = k−1 = k−1 2
4 5 6 7 4 4 (2 )
1 1 1 8 1 1
k
+ k + . . . + k < k = k−1 = k−1 3 .
8 9 15 8 8 (2 )
1 1 1
The series 1 + + + + . . . is a convergent geometric series,
2k−1 (2k−1 )2 (2k−1 )3
n
1 X 1
(as a 0 < < 1, for k > 1) . The sequence of partial sums Sn = is then
2k−1 r=1
r k
X n2 + 1
(a)
n4 + 1
Lecture 1.15 Series 95
X 1
(b) √3
n
Solution
n2 + 1 n2 + n2 2
(a) Note < = .
n4 + 1 n4 n2
X 1
So, as , converges (see last theorem) we have from the comparison test
n2
X n2 + 1
that converges.
n4 + 1
1
(b) Straightforward application of last theorem with k = .
3
X 1
√ diverges.
3
n
Proof
N
X
1. Changing a finite number of terms, say we replace an by some other num-
n=1
∞
X N
X ∞
X
ber A, say. Then an = an + an and the new sum to infinity is
n=1 n=1 n=N +1
96 Lecture 1.15 Series
∞
X ∞
X ∞
X
A+ an . However an converges (or diverges) if and only if an
n=N +1 n=1 n=N +1
XN
converges (or diverges) as an is finite. Hence the result.
n=1
n
X
2. Note that un = Sn − Sn−1 where Sn = ur is the nth partial sum.
r=1
X
However, if un converges to s then Sn −→ s and Sn−1 −→ s as n −→ ∞.
The result follows.
X n Xn
3. Let Sn = ur and Tn = vn . Then lim Sn = s and lim Tn = t. However
n→∞ n→∞
r=1 r=1
from lecture 13 we know that
lim (Sn + Tn ) = s + t.
n→∞
n
X X
But, Sn + Tn = Pn = (un + vn ) the nth partial sum of (un + vn ). The
r=1
result is proved.
4. Exercise.
n
X
5. With un ≥ 0 Sn = ur is an increasing sequence. The result then follows
r=1
from lecture 13.
X (3n + 2n )
(a)
6n
∞
X
(b) 210−n
n=1
Solution
X 1 X 1
(a) The series and both converge so the series
2n 3n
X 1 1
X 3n + 2n
+ =
2n 3n 2n · 3n
X 2n + 3n
=
6n
must also converge by our last theorem.
Lecture 1.15 Series 97
∞ 9 ∞
(b) X X X
210−n = 210−n + 210−n
n=1 n=1 n=10
∞
X 1 1 1
210−n = 1 + + 2 + 3 + ...
n=10
2 2 2
∞
X 1
= , where n′ = n − 10.
2n′
′n =0
This last series is a convergent geometric series so the original series converges,
property 1 of the last theorem. 2
Proof
un+1
(a) Now ≥ 0 ∀n ∈ N so we have 0 ≤ ρ < 1 in this case.
un
un+1 un+1
As −→ ρ < 1 we must have for n ≥ N, for some N, < 1. So there
un un
∞
un+1 X
is an r < 1 such that for n ≥ N, < r. Consider the series un . Now,
un n=N
un+1
using < r, for n ≥ N, we have
un
uN +1 < ruN
uN +2 < ruN +1 < r 2 uN
uN +3 < ruN +2 < r 3 uN
..
.
∞
X
In general un < r n−N
uN for n ≥ N. The series r n−N uN is a convergent
n=N P
geometric series (as 0 < r < 1) so by the comparison test un converges.
98 Lecture 1.15 Series
uN +1
(b) In this case we have −→ ρ > 1 as n −→ ∞. So for some N we have
un
un+1
> 1 for all n ≥ N.
un
From which we have
uN +1 > uN
uN +2 > uN +1 > uN
..
.
In general un > uN for all n ≥ N. However, the series
X∞ ∞
X
uN = uN 1 = uN (1 + 1 + 1 + · · · ) clearly diverges. Hence, by the com-
n=N n=N
P
parison test un must diverge.
1
(c) Consider the ratio test for un = where 0 < k < ∞. We have
nk
,
k
nk n 1
un+1 un = k
= = k .
(n + 1) n+1 1 + n1
1
But −→ 1 as n −→ ∞.
1 k
1+ n
P
So for un = n1k , ρ = 1. However, un diverges if 0 < k ≤ 1 and converges for
k > 1.
∞
X nn X 3n X 5n
(a) (b) (c)
n=1
n! n! n2
Solution
nn nn
(a) un = = , so
n! n(n − 1) . . . 2.1
un+1 (n + 1)n+1 n!
= ·
un (n + 1)! nn
(n + 1)n+1
=
(n + 1)nn
n
(n + 1)n 1
= = 1+
nn n
Lecture 1.15 Series 99
n
1
We know from an earlier exercise that 1+ −→ e > 1 as n −→ ∞. So
n
X nn
the series diverges.
n!
3n un+1 3n+1 n! 3 un+1 3
(b) un = , = n
= . So = −→ 0 as n −→ ∞.
n! un (n + 1)! 3 n+1 un n+1
The series converges.
2
5n un+1 5n2 1
(c) un = 2 , = =5 .
n un (n + 1)2 1 + n1
un+1 5
So, = 2 −→ 5 as n −→ ∞. The series diverges. 2
un 1 + n1
♠ Exercises 15
X 1 X 2
1. 2
2.
5n − n n2 +n
∞ X n10
X k
3. 4.
k=1
k − 21
3/2 3k
X k2 X sin2 k
5. 6.
3k 3 + 1 k2
X k2 X (n!)2
7. 8.
5k (2n)!