Book C: Essential Mathematics 1
Book C: Essential Mathematics 1
Essential mathematics 1
Book C
This publication forms part of an Open University module. Details of this and other
Open University modules can be obtained from the Student Registration and Enquiry Service, The
Open University, PO Box 197, Milton Keynes MK7 6BJ, United Kingdom (tel. +44 (0)845 300 6090;
email [email protected]).
Alternatively, you may visit the Open University website at www.open.ac.uk where you can learn
more about the wide range of modules and packs offered at all levels by The Open University.
To purchase a selection of Open University materials visit www.ouw.co.uk, or contact Open
University Worldwide, Walton Hall, Milton Keynes MK7 6AA, United Kingdom for a brochure
(tel. +44 (0)1908 858779; fax +44 (0)1908 858787; email [email protected]).
Contents
Unit 7 Differentiation methods and integration 1
Introduction 3
1 Derivatives of further standard functions 3
1.1 Derivatives of sin, cos and tan 4
1.2 Derivatives of exp and ln 10
2 Finding derivatives of more complicated functions 14
2.1 Product rule 14
2.2 Quotient rule 19
2.3 Chain rule 23
2.4 Differentiating functions of linear expressions 31
2.5 Using the differentiation rules together 33
3 More differentiation 38
3.1 Optimisation problems 38
3.2 Differentiation using a computer 44
3.3 Derivatives of inverse functions 46
3.4 Table of standard derivatives 54
4 Reversing differentiation 55
4.1 Antiderivatives and indefinite integrals 56
4.2 Antiderivatives of power functions 61
4.3 Constant multiple rule and sum rule for
antiderivatives 64
5 Particular antiderivatives 67
5.1 Finding particular antiderivatives 67
5.2 Using integration to work with rates of change 68
5.3 Changes in the values of antiderivatives 74
6 Antiderivatives of further standard functions 78
6.1 An antiderivative of the reciprocal function 78
6.2 Table of standard indefinite integrals 81
Learning outcomes 83
Solutions to activities 84
Acknowledgements 101
Contents
Introduction 105
Acknowledgements 216
Contents
Introduction 219
Index 279
Unit 7
Introduction
The first half of this unit continues with the topic of Unit 6, differential
calculus. You’ll meet the formulas for the derivatives of five standard
functions, namely sin, cos, tan, exp and ln. You’ll also learn some further
ways of differentiating complicated functions by combining the formulas
for the derivatives of standard functions.
By the end of this part of the unit you should have acquired skills in
differentiating a wide variety of functions. You’ll need these skills in the
later parts of the module, and in any further modules that you study that
involve calculus. So it’s important that you become proficient in them.
The only way to do that is to practise, so you should aim to do all the
activities in this unit, and as many further activities in the practice quizzes
and exercise booklets as you have time for, until you feel confident with
your skills.
In the first half of the unit you’ll also learn how to use your computer to
find and work with derivatives, and you’ll see some further applications of
differentiation.
In the second half of the unit you’ll start to learn about integral calculus,
which can be thought of as the reverse of differential calculus. You’ve seen
that in differential calculus you start off knowing the values taken by a
continuously changing quantity throughout a period of change, and you
use this information to find the values taken by the rate of change of the
quantity throughout the same period. In integral calculus, you do the
opposite: you start off knowing the values taken by the rate of change of a
quantity throughout a period of change, and you use this information to
find the values taken by the quantity throughout the same period. This
process is called integration. The topic of integral calculus continues in
Unit 8.
3
Unit 7 Differentiation methods and integration
Derivative of sin
Our first aim in this subsection is to obtain a formula for the derivative of
the sine function, f (x) = sin x.
As you know, the derivative of a function is itself a function, and therefore
the derivative has a graph. So, to get a rough idea of what the derivative
of the sine function might be like, let’s try to roughly sketch its graph, by
estimating the gradient of the graph of the sine function at various points.
Figure 1 shows the graph of the sine function, drawn on axes with equal
scales, and some tangents to the graph.
4
1 Derivatives of further standard functions
5
Unit 7 Differentiation methods and integration
In Activity 1 you should have worked out that the graph of the derivative
of the sine function for x between 0 and 2π must look something like the
sketch in Figure 4.
6
1 Derivatives of further standard functions
Derivative of cos
Let’s now look at obtaining a formula for the derivative of the cosine
function, f (x) = cos x. You could get a rough idea of what this derivative
must be like by using a similar procedure to the one that you’ve just seen
for the sine function. Figures 6 and 7 show the graph of f (x) = cos x, and
the sketch of its derivative that’s obtained from this procedure.
7
Unit 7 Differentiation methods and integration
The derivative of the sine function is the cosine function, whose graph is
shown in Figure 8. Translating this graph by π/2 to the left gives the
graph in Figure 9. Because of the symmetries of the graphs of sine and
cosine, this graph is the graph of y = − sin x. So the derivative of
f (x) = cos x is indeed f " (x) = − sin x.
Derivative of tan
The final derivative of a trigonometric function that you’ll meet in this
subsection is the derivative of the tangent function. In fact,
if f (x) = tan x, then f " (x) = sec2 x.
Remember that sec2 x means (sec x)2 , and also that sec x means 1/(cos x).
So another way to write the derivative of f (x) = tan x is
1
f " (x) = .
(cos x)2
However, we usually use the form f " (x) = sec2 x, for conciseness.
This formula for the derivative of the tangent function can be worked out
by using the fact that tan x = (sin x)/(cos x), together with the formulas
for the derivatives of f (x) = sin x and f (x) = cos x. The process involves a
rule for combining derivatives that you’ll meet later in the unit, and you’ll
see the justification of the formula there.
For now, you might like to compare the graphs of y = tan x and y = sec2 x,
shown in Figure 10, and convince yourself that the graph of y = sec2 x
does seem, at least roughly, to give the gradients of the graph of y = tan x.
8
1 Derivatives of further standard functions
In particular, notice that the gradients of the graph of y = tan x are always
positive and that the graph of y = tan x is the least steep whenever x is a
multiple of π.
In the next activity you’re asked to use these formulas, together with the
sum and constant multiple rules, to find the derivatives of some more
functions. In this activity, and in general, remember that if a function is
specified using function notation, then you should usually use Lagrange
notation for its derivative, whereas if it’s specified by an equation that
expresses one variable in terms of another, then you should usually use
Leibniz notation.
9
Unit 7 Differentiation methods and integration
Derivative of exp
Let’s look first at the exponential function, f (x) = ex . Its graph is shown
in Figure 11.
10
1 Derivatives of further standard functions
You can find the general formula for the derivative f " (x) of the function
f (x) = ex by using differentiation from first principles. The difference
quotient for f (x) = ex is
f (x + h) − f (x) ex+h − ex
= .
h h
As usual, to find a formula for f " (x), you have to work out what happens
to the value of the difference quotient as h gets closer and closer to zero.
It’s not immediately obvious how to do this, but surprisingly you can do it
by using the fact that you already know what happens in the particular
case when x = 0, since you know that f " (0) = 1.
When x = 0, the difference quotient is
e0+h − e0 eh − 1
= .
h h
So you already know that, as h gets closer and closer to zero, the value of
this expression gets closer and closer to 1.
Now look again at the difference quotient for a general value of x. You can
rearrange it as follows:
) h -
ex+h − ex ex eh − ex x e −1
= =e .
h h h
The expression in the brackets is the difference quotient in the case when
x = 0. So as h gets closer and closer to zero, the expression in the brackets
gets closer and closer to 1, and hence the value of the whole expression
gets closer and closer to ex . That is, the formula for the derivative is
f " (x) = ex .
So the derivative of the exponential function is the exponential function!
To see that this makes sense, look at the graph of the exponential function
in Figure 11. As x increases, the gradient of the graph increases, and it
increases more and more rapidly. So as x increases, the value of the
derivative of the exponential function increases, and it increases more and
more rapidly. The exponential function itself has the property that as x
increases, the value of the function increases, and it increases more and
more rapidly.
The exponential function, and its constant multiples such as the function
f (x) = 3ex , are the only functions that are equal to their own derivatives
(except, of course, for functions obtained by restricting the domains of
these functions). It’s this property that makes the constant e such a
special number.
11
Unit 7 Differentiation methods and integration
Derivative of ln
Now let’s look at the natural logarithm function, f (x) = ln x. Since this
function is the inverse of the exponential function f (x) = ex , its graph is
the reflection of the graph of f (x) = ex in the line y = x, as shown in
Figure 13.
12
1 Derivatives of further standard functions
In the next activity you’re asked to use these formulas, together with the
sum and constant multiple rules and the formulas for the derivatives of
trigonometric functions, to find the derivatives of some more functions.
The formulas for the derivatives of standard functions that you’ve met in
this section are all included in the Handbook. However, it’s worth
memorising the formulas for the derivatives of sin, cos, exp and ln, at least,
as they occur frequently.
13
Unit 7 Differentiation methods and integration
The product rule can be proved by using the idea of differentiation from
first principles, and you’ll see this done at the end of this subsection.
Your main task in this subsection is to learn how to use the product rule.
The next example demonstrates how it can be used to differentiate the
14
2 Finding derivatives of more complicated functions
Check that if you find the derivative of the function k(x) = x5 by writing
x5 = x2 × x3 and using the product rule, then you obtain the same answer
as you obtain by differentiating it directly.
15
Unit 7 Differentiation methods and integration
For most people, the best way to remember and use it is as the
abbreviated informal version below.
When you use this informal version of the product rule, you can usually
skip the steps of writing down the first and second functions and their
derivatives, and just go straight to applying the rule. To do this, you recite
the informal version of the rule in your head: as you think ‘first’, you write
down the formula for the first function, then as you think ‘derivative of
second’, you differentiate the second function in your head and write down
the result, and so on. This is illustrated in the next example.
16
2 Finding derivatives of more complicated functions
Solution
Identify the first and second functions: here first = 2x + 1 and
second = ln x. Use the rule
) - ) - ) -
derivative derivative derivative
= (first) × + (second) × .
of product of second of first
By the product rule,
1
k " (x) = (2x + 1) × + (ln x) × 2
x
Simplify the answer.
2x + 1
= + 2 ln x
x
2x + 1 2x ln x
= +
x x
2x + 1 + 2x ln x
= .
x
There are various ways to write the final answer in Example 2. For
example, the following are all acceptable:
1 1 2x(1 + ln x) + 1
2+ + 2 ln x, + 2(1 + ln x) and .
x x x
You can practise using the informal version of the product rule in the next
activity. Alternatively, you might prefer to continue using the formal
version for a little longer, and move to the informal version when you feel
more confident with it. You can use the exercises on the product rule in
the Unit 7 practice quiz and exercise booklet for further practice.
Whichever version of the product rule you use, remember to simplify your
final answers, where possible. You should always do this when you
differentiate functions.
17
Unit 7 Differentiation methods and integration
18
2 Finding derivatives of more complicated functions
Hence the whole expression gets closer and closer to f " (x)g(x) + f (x)g " (x);
that is, f (x)g " (x) + g(x)f " (x). So
k " (x) = f (x)g " (x) + g(x)f " (x),
which is the product rule.
19
Unit 7 Differentiation methods and integration
There are various ways to write the final answer in Example 3. For
example, the following are all acceptable:
x(2 sin x − x cos x) 2x x2 cos x 2x − x2 cot x
, − , .
sin2 x sin x sin2 x sin x
For most people the best way to remember and use the quotient rule is as
the informal version below.
20
2 Finding derivatives of more complicated functions
You can use this informal version of the rule in a similar way to the
informal version of the product rule. To differentiate a quotient of two
functions by using the quotient rule, you recite the informal version of the
rule in your head: as you think ‘bottom’, you write down the formula for
the bottom function, then as you think ‘derivative of top’ you differentiate
the top function in your head and write down the result, and so on. (Make
sure that you remember that the quotient rule contains a minus sign rather
than a plus sign.) This is illustrated in the next example.
21
Unit 7 Differentiation methods and integration
You can practise using the informal version of the quotient rule in the next
activity. Alternatively, as with the product rule, you might prefer to
continue using the formal version for a little longer, and move to the
informal version when you feel more confident with it.
In Subsection 1.1 you met the fact that the derivative of tan x is sec2 x, but
this result wasn’t justified there. It can be confirmed by using the quotient
rule, as shown in the next example. (Note, however, that this justification
is incomplete, as you haven’t yet seen a proof of the quotient rule!)
22
2 Finding derivatives of more complicated functions
You can also use the quotient rule to find the derivatives of the
trigonometric functions cosec, sec and cot. These derivatives are given in
the box below, and you’re asked to confirm them in the next activity.
The derivatives of cosec, sec and cot can now be included in the list of
derivatives of standard functions that you have available to work with.
The Handbook includes a table of all the standard derivatives given in this
module.
23
Unit 7 Differentiation methods and integration
24
2 Finding derivatives of more complicated functions
25
Unit 7 Differentiation methods and integration
As you can see, in Leibniz notation the chain rule looks like an obvious
statement about fractions, which makes it easy to remember. You should
keep in mind, though, that this is not what it is! The pieces of notation
dy/dx, dy/du and du/dx represent derivatives, not fractions.
As with the other rules for combining derivatives, the chain rule can be
proved by using the idea of differentiation from first principles.
Unfortunately, this is quite tricky to do, and the proof isn’t included in
this module.
However, here’s an informal way to see that the chain rule makes sense.
Suppose that y is a function of u, where u is a function of x, as in the
statement of the chain rule. Suppose that for a particular value of x, the
value of du/dx is 2, and for the value of u that corresponds to this
particular value of x, the value of dy/du is 3. Since y is increasing at the
rate of 3 units for every unit that u increases, and u is increasing at the
rate of 2 units for every unit that x increases, you’d expect y to be
increasing at the rate of 3 × 2 = 6 units for every unit that x increases.
This is what the chain rule tells you.
The next example shows you how to use the chain rule.
= (cos(x2 )) × 2x
26
2 Finding derivatives of more complicated functions
= 2x cos(x2 ).
Notice that when you apply the chain rule as illustrated in Example 7, you
first obtain an answer that’s expressed in terms of both u and x. You then
have to substitute for u, using the equation for u in terms of x that you
wrote down when you decomposed the function, to obtain a final answer in
terms of x.
You’ve now seen how to apply the chain rule by first introducing an extra
variable, usually u, which you then remove again to obtain the final
answer. Once you’re familiar with using the chain rule, you can often
apply it without introducing this extra variable at all.
To see how to do this, consider again the equation y = sin(x2 ). Think of
the expression that you would have written as u, that is, the expression
that was referred to as ‘something’ earlier, as one ‘big variable’:
✎☞
x2 )
y = sin( ✍✌
To find dy/dx, first differentiate with respect to this big variable; that is,
with respect to the ‘something’. This gives
✎☞
x2 )
cos( ✍✌
By the chain rule, to obtain the final answer, you now need to multiply by
the derivative of the something with respect to x. This gives
dy
= cos(x2 ) × 2x.
dx
This is the final answer, but, as before, you should simplify it slightly to
give
dy
= 2x cos(x2 ).
dx
27
Unit 7 Differentiation methods and integration
Here are two more examples. Notice that in the second of these examples
the function is expressed using function notation rather than using two
variables. This makes no difference to how you apply the ideas above.
Try this quicker way of using the chain rule in the following activity.
28
2 Finding derivatives of more complicated functions
You should try to get used to applying the chain rule in the quick way
demonstrated in Example 8, as you’ll find this skill useful in calculus.
However, sometimes, if a function that you want to differentiate using the
chain rule isn’t straightforward, then you might find it easier to revert to
introducing an extra variable. It’s fine to do that if you find it helpful.
Sometimes you can make it easier to apply the chain rule without
introducing an extra variable if you start by rearranging the formula for
the function. This applies in particular to composites that involve power
functions. Here’s an example.
With practice, you may be able to do the sort of initial rearranging needed
in Example 9 in your head.
29
Unit 7 Differentiation methods and integration
Now here’s an explanation of where the name ‘chain rule’ comes from.
Consider a function that’s a composite of three functions. Suppose that
y is a function of u,
u is a function of v,
v is a function of x.
You can find the derivative of y with respect to x by applying the chain
rule twice:
dy dy du du du dv
= and =
dx du dx dx dv dx
Gerald was never very good so, using the second equation to substitute in the first equation, you obtain
at using the chain rule. dy dy du dv
= .
dx du dv dx
As you can see, this gives a ‘chain’ of derivatives. You can keep adding
‘links in the chain’ indefinitely. One of the examples in Subsection 2.5,
Example 12, illustrates how you can apply the chain rule twice in this way.
Finally in this subsection, let’s translate the chain rule into Lagrange
notation. You can do this by thinking about the way of applying it
without introducing an extra variable. Suppose that f and g are functions,
and you want to differentiate the expression g(f (x)) with respect to x.
The chain rule says that you can do this by differentiating it with respect
to the ‘something’ f (x), then multiplying the result by the derivative of
f (x) with respect to x. This gives the version of the chain rule below.
30
2 Finding derivatives of more complicated functions
You can use this rule to quickly differentiate any function of the type
discussed above. For example, it tells you that
if k(x) = sin(5x), then k " (x) = 5 cos(5x).
Try using the rule above in the next activity.
31
Unit 7 Differentiation methods and integration
In fact you can slightly extend the special case of the chain rule above.
Notice that, by the chain rule,
if k(x) = sin(3x − 2), then k " (x) = cos(3x − 2) × 3 = 3 cos(3x − 2),
if h(x) = e−2x+5 , then h" (x) = e−2x+5 × (−2) = −2e−2x+5 .
32
2 Finding derivatives of more complicated functions
33
Unit 7 Differentiation methods and integration
34
2 Finding derivatives of more complicated functions
You can practise combining the differentiation rules in the next activity. If
you find it tricky to work out which rules you need to apply, then the
checklist below might help.
Note also that when you need to apply more than one rule to differentiate
a function, the first rule that you should apply is the one that applies to
the overall structure of the rule of the function. For example, as you saw in
Example 10, the overall structure of the function f (x) = x2 sin(x3 ) is that
it’s a product, since x2 sin(x3 ) is the product of x2 and sin(x3 ). So, to
differentiate this function, you start by applying the product rule, ignoring
for the moment the fact that one of the two functions in the product is
itself a composite. After you’ve done that, you can deal with the
composite.
35
Unit 7 Differentiation methods and integration
36
2 Finding derivatives of more complicated functions
In fact, the quotient rule is really just a combination of the product rule
and a special case of the chain rule. Here’s a proof of it, as promised in
Subsection 2.2.
So, for each value of x at which both f and g are differentiable and
g(x) )= 0,
d % $
k " (x) = (g(x))−1 f " (x) + f (x) (g(x))−1 (by the product rule)
dx
% $
= (g(x))−1 f " (x) + f (x) −(g(x))−2 g " (x) (by the chain rule)
f " (x) f (x)g " (x)
= −
g(x) (g(x))2
g(x)f " (x) − f (x)g " (x)
= ,
(g(x))2
which is the quotient rule.
37
Unit 7 Differentiation methods and integration
To prove the formula for x > 0, you can use the fact that x = eln x for such
values of x to write the expression xn in the following form:
xn = (eln x )n = en ln x .
This gives
d d n ln x
(xn ) = (e )
dx dx
d
= (en ln x ) (n ln x) (by the chain rule)
dx
d
= (en ln x ) × n × (ln x) (by the constant multiple rule)
dx
1
= (en ln x ) × n ×
x
n 1
=x ×n×
x
n−1
= nx ,
which is the formula stated above.
Note that the two proofs that you’ve just seen, namely the proof of the
quotient rule and the proof of the formula for the derivative of a power
function, both depend on the chain rule. Since this module doesn’t include
a proof of the chain rule, strictly the proofs of these two results are
incomplete.
3 More differentiation
In this section you’ll see how to use differentiation to solve a type of
problem that occurs frequently in applications of mathematics, how to use
differentiation on a computer, and how to find the derivative of the inverse
of a function if you know the derivative of the original function.
38
3 More differentiation
Strategy:
To find the greatest or least value of a function on an
interval of the form [a, b]
(This strategy is valid when the function is continuous on the interval,
and differentiable at all values in the interval except possibly the
endpoints.)
1. Find the stationary points of the function.
2. Find the values of the function at any stationary points inside the
interval, and at the endpoints of the interval.
3. Find the greatest or least of the function values found.
The next example illustrates how you can use this strategy to solve an
optimisation problem. You might like to try guessing the answer to the
problem posed in the example, and then read on to see whether you’re
right.
39
Unit 7 Differentiation methods and integration
Solution
Simplify the problem if possible. Draw a diagram.
The volume of the channel is the area of its cross-section times its
(fixed) length. So to achieve a channel of maximum volume, we have
to achieve the maximum area of the cross-section.
The diagrams below show the cross-sections of the metal sheet and the
channel. We have to determine how far from the side the metal sheet
should be bent to achieve the maximum area of the cross-section.
40
3 More differentiation
Use calculus to solve the problem. You can use the strategy in the
box given before this example.
The variable A is a polynomial function of x, so it is continuous on its
whole domain and differentiable at every value of x. So the maximum
value of A must occur either at an endpoint of the interval [0, 12 ], or at
a stationary point.
Find the stationary points.
The formula for A is
A = x(1 − 2x)
= x − 2x2 ,
so
dA
= 1 − 4x.
dx
At a stationary point, dA/dx = 0, which gives
1 − 4x = 0;
that is,
x = 41 .
So the only stationary point occurs when x = 14 .
Consider the values of the function at any stationary points inside
the interval, and at the endpoints of the interval.
The endpoints of the interval are 0 and 21 , so the maximum value of A
occurs when x = 0, x = 14 or x = 12 .
When x = 0, A = 0, and similarly when x = 21 , A = 0.
When x = 14 ,
1
A= 4 × (1 − 2 × 14 ) = 1
4 × 1
2 = 1
8 = 0.125.
Find the greatest of the function values found.
The greatest of the three values of A found is 0.125, which is achieved
when x = 14 .
So the maximum value of A is achieved when x = 14 .
State a conclusion that answers the original problem.
To achieve a channel of maximum volume, the metal should be bent
at a distance of 25 cm from each side.
41
Unit 7 Differentiation methods and integration
1
Figure 16 The graph of A = x(1 − 2x) for 0 ≤ x ≤ 2
Notice that in fact you could have solved the problem in Example 13
without using calculus, because the function involved turned out to be a
quadratic function. You could have found the value of x that gives the
maximum value of A by using a method other than calculus to find the
vertex of the parabola that’s the graph of the function. You saw some
methods for doing this in Unit 2. However, once you’re familiar with
calculus, it’s usually easier to use it.
Each of the next two activities poses an optimisation problem for you to
solve by using calculus. The first activity involves maximising the volume
of a box, and the second one involves maximising the area of the
cross-section of an open channel, in a similar way to Example 13. In both
activities, you might like to try guessing the answers to the problems posed
before you start work on the solutions.
42
3 More differentiation
In the next activity you should use the fact that the area of a triangle with
sides of lengths a and b and included angle θ (as illustrated in Figure 17) is
given by the formula A = 21 ab sin θ. You met this formula in Unit 4.
Consider again the problem of the open channel in Example 13. Suppose Figure 17 A triangle with
sides of lengths a and b and
that instead of bending the metal along two straight lines, you intend to
included angle θ
bend it along one straight line in the middle, to form an open channel with
a v-shaped cross-section, as shown below. What should be the angle
between the two sides of the v-shape, to give a channel with maximum
volume?
43
Unit 7 Differentiation methods and integration
Here’s a summary of the main steps that you can use to solve an
optimisation problem. Of course, you should combine these steps with
other problem-solving techniques as appropriate, such as simplifying the
problem if possible, drawing a diagram and drawing a graph.
Strategy:
To solve an optimisation problem
1. Identify the quantity that you can change, and represent it by a
variable, noting the possible values that it can take. Identify the
quantity to be maximised or minimised, and represent it by a
variable. These variables are the independent and dependent
variables, respectively.
2. Find a formula for the dependent variable in terms of the
independent variable.
3. Use the techniques of differential calculus to find the value of the
independent variable that gives the maximum/minimum value of
the dependent variable.
4. Interpret your answer in the context of the problem.
44
3 More differentiation
1 km
2 km
1 km
45
Unit 7 Differentiation methods and integration
Solve the problem discussed above, by following the steps below. Carry out
parts (a), (b), (d) and (e) using the module computer algebra system.
(a) Define a function f as follows:
√
1 + x2 2 − x
f (x) = + .
8 16
(b) Plot the graph of f , for values of x in the interval [0, 2] at least.
(c) Use your plot to estimate roughly the value of x in [0, 2] that gives the
minimum value of f (x).
(d) Find the derivative f " of f .
(e) Solve the equation f " (x) = 0.
(f) Hence state, to the nearest 10 metres, how far along the road the man
should join it.
You can see that, as with the chain rule, the Leibniz form of the inverse
function rule looks like an obvious fact about fractions. As with the chain
rule, you should keep in mind that that’s not what it is.
Here’s an informal way to see that the inverse function rule makes sense.
Suppose that y is an invertible function of x, which means that x is also a
function of y. Suppose that, for a particular value of x and its
corresponding value of y, the value of x is increasing at the rate of 2 units
for every unit that y increases, as illustrated in Figure 21. Then, at these
values of x and y, you’d expect the value of y to be increasing at the rate
of 21 unit for every unit that x increases. This is what the inverse function
rule tells you.
47
Unit 7 Differentiation methods and integration
You’ll see the inverse function rule in Lagrange notation later in this
subsection. First let’s look at some examples of how you can use it.
As illustrated in Example 14, when you use the inverse function rule, you
obtain an expression for the derivative that’s expressed in terms of y
rather than x. You then have to use the relationship between x and y to
express the derivative in terms of x.
In the next example, the inverse function rule is used to find the derivative
of the function f (x) = sin−1 (x). You saw in Unit 4 that this function has
domain [−1, 1]. Its graph is shown in Figure 22.
48
3 More differentiation
49
Unit 7 Differentiation methods and integration
Notice that the formula for the derivative of sin−1 x found in Example 15
doesn’t apply at the endpoints of the domain of this function, that is,
when x = −1 and x = 1. This is because the function f (x) = sin−1 x isn’t
right-differentiable at x = −1 and isn’t left-differentiable at x = 1, as you
can see, roughly, from the shape of its graph, in Figure 22. The ‘one-sided
tangents’ at these values of x are vertical.
Here’s another point to note about Example 15. Strictly, it involves the
sine function with its domain restricted to the interval [−π/2, π/2], rather
than the sine function with its usual domain, R. As you saw in Unit 4, the
sine function with its usual domain doesn’t have an inverse function.
In the next activity you’re asked to find the derivatives of the functions
f (x) = cos−1 x and f (x) = tan−1 x. You saw in Unit 4 that these functions
have domains [−1, 1] and R, respectively. Their graphs are shown in
Figures 23 and 24, respectively.
Use the inverse function rule, and the derivatives of the cosine and tangent
Figure 23 The graph of functions, to differentiate the following functions.
y = cos−1 x
(a) y = cos−1 x (b) y = tan−1 x
Hint: To express the derivatives in terms of x, use the identity
sin2 y + cos2 y = 1 in part (a), and the identity tan2 y + 1 = sec2 y in
part (b).
d 1
(sin−1 x) = √
dx 1 − x2
d 1
(cos−1 x) = − √
dx 1 − x2
d 1
(tan−1 x) =
dx 1 + x2
50
3 More differentiation
51
Unit 7 Differentiation methods and integration
First, it’s important to realise the following fact. If two straight lines
(drawn on axes with equal scales) are reflections of each other in the line
y = x, then their gradients are reciprocals of each other. This is because if
you consider any pair of points on one line, and their reflections on the
other line, then the values of the run and the rise are interchanged for the
two pairs of points, as illustrated in Figure 25(a). The lines in this
diagram have gradients 2 and 12 .
Another fact, which you met in Unit 3 and will be useful here, is that if
two points (plotted on axes with equal scales) are reflections of each other
in the line y = x, then they’re obtained from each other by interchanging
x- and y-coordinates. For example, Figure 25(b) shows the point (3, 1) and
its reflection (1, 3) in the line y = x.
Figure 25 (a) Two straight lines that are reflections of each other in the
line y = x (b) Two points that are reflections of each other in the line
y=x
Now remember that the graphs of a function and its inverse function are
reflections of each other in the line y = x, when they’re drawn on axes with
equal scales. For example, Figure 26 shows the graphs of a function f and
its inverse function f −1 . (The function
√ is in fact f (x) = x2 + 2 (x ≥ 0),
−1
with inverse function f (x) = x − 2.) The two tangents shown are
reflections of each other in the line y = x, so their gradients are reciprocals
of each other. The gradient of the red tangent is the gradient of the graph
of f −1 at the point (3, 1), which is (f −1 )" (3). The gradient of the blue
tangent is the gradient of the graph of f at the point (1, 3), which is f " (1).
Hence
1
(f −1 )" (3) = " .
f (1)
This is a particular instance of the inverse function rule, with x = 3 and
f −1 (x) = f −1 (3) = 1.
52
3 More differentiation
53
Unit 7 Differentiation methods and integration
Standard derivatives
Function f (x) Derivative f " (x)
a (constant) 0
xn nxn−1
ex ex
1
ln x
x
sin x cos x
cos x − sin x
tan x sec2 x
cosec x − cosec x cot x
sec x sec x tan x
cot x − cosec2 x
1
sin−1 x √
1 − x2
1
cos−1 x −√
1 − x2
1
tan−1 x
1 + x2
54
4 Reversing differentiation
4 Reversing differentiation
Johann Bernoulli (1667–1748)
In Unit 6 you learned that if you know the values taken by a continuously
changing quantity throughout a period of change, then you can use
differentiation to find the values taken by its rate of change throughout the
same period. For example, you saw that if you have a formula for the
displacement of a moving object in terms of time, then you can use
differentiation to find a formula for its velocity in terms of time. You saw
that the area of calculus that deals with this type of process is called
differential calculus.
In the rest of this unit, and in Unit 8, we’ll consider the opposite process.
We’ll consider situations where you know the values taken by the rate of
change of a continuously changing quantity throughout a period of change,
and you want to find the values taken by the quantity throughout the
same period. For example, you might have a formula or a graph that tells
you the values taken by the velocity of an object (such as the walking man
discussed in Unit 6) over some time period, and you want to use this
information to work out the values taken by the displacement of the object
over the same time period. As mentioned earlier, the area of calculus that
deals with this type of process is called integral calculus. The title page of Analyse des
Infiniment Petits (1696)
55
Unit 7 Differentiation methods and integration
As you’ll see, when you’re working with integral calculus, it can be useful
to consider whether a function is continuous. A continuous function is
one whose graph has no discontinuities (that is, ‘breaks’) in it. In other
words, informally, it’s a function whose graph you can draw without taking
your pen off the paper (though you might need an infinitely large piece of
paper, as the graph might be infinitely long!). For example, the function
f (x) = 2x is a continuous function, as shown in Figure 28(a), but the
function f (x) = 1/x2 isn’t, as its graph has a break at x = 0, as shown in
Figure 28(b).
Figure 28 The graphs of Activity 26 Differentiating functions whose formulas are the same
(a) f (x) = 2x apart from a constant term
(b) f (x) = 1/x2
Differentiate the following functions.
5
(a) F (x) = x2 (b) F (x) = x2 + 3 (c) F (x) = x2 − 7
You should have found that all three functions in Activity 26 have
derivative F " (x) = 2x. In fact, you can see that any function of the form
F (x) = x2 + c,
where c is a constant, has derivative F " (x) = 2x. This is because the
derivative of a constant term is zero. So each function of the
form F (x) = x2 + c is an antiderivative of the function f (x) = 2x.
Another way to think about the fact that all the functions of the form
F (x) = x2 + c have the same derivative is to remember that adding a
constant to the formula of a function has the effect of translating its graph
vertically. So the graphs of all the functions of the form F (x) = x2 + c are
vertical translations of each other, as illustrated in Figure 29(b).
Translating a graph vertically doesn’t change its gradient at each x-value,
of course. In other words, it doesn’t change the derivative of the function
that the graph represents. So each of the functions in Figure 29(b) has the
same derivative, namely f (x) = 2x, whose graph is shown in Figure 29(a).
56
4 Reversing differentiation
57
Unit 7 Differentiation methods and integration
You don’t need to be concerned about what happens when the function f
isn’t continuous, because normally when we’re solving problems in integral
calculus we work only with continuous functions. These are usually the
only functions that we need for the sorts of calculations that we want to
perform. However, if you’d like to know what the problem is with
functions that aren’t continuous, then read the explanation at the end of
this subsection, when you reach it.
Before you go on, it’s important to make sure that you fully understand
the difference between an antiderivative and an indefinite integral of a
function. The definitions that you’ve seen are summarised below.
58
4 Reversing differentiation
59
Unit 7 Differentiation methods and integration
60
4 Reversing differentiation
61
Unit 7 Differentiation methods and integration
62
4 Reversing differentiation
Notice that the formula for the indefinite integral of a power function tells
you that the indefinite integral of the function
f (x) = 1 (which is the same as f (x) = x0 )
is
F (x) = x + c.
(As in earlier units, we assume that 00 = 1, so the formula works when
x = 0.) Another way to obtain the indefinite integral above is, of course, to
use the fact that the derivative of x is 1.
In Subsection 6.1 later in the unit, you’ll see a formula for the indefinite
integral of the only power function that isn’t covered by the general
formula that you’ve met in this subsection. This is the power function
f (x) = x−1 , that is, the reciprocal function f (x) = 1/x.
63
Unit 7 Differentiation methods and integration
(You might have noticed that the letter used for the constant in the
statement above of the constant multiple rule for antiderivatives, namely k,
is different from the letter used in Unit 6 for the constant in the constant
multiple rule for derivatives, which was a. It doesn’t matter which letter is
used, of course, but in integral calculus it’s sometimes convenient to
reserve the letter a to denote something else, as you’ll see in Unit 8.)
These rules follow directly from the similar rules for derivatives. To see
why, first suppose that F (x) is an antiderivative of f (x). Then
d
(F (x)) = f (x).
dx
It follows from the constant multiple rule for derivatives that, for any
constant k,
d
(kF (x)) = kf (x).
dx
This equation tells you that kF (x) is an antiderivative of kf (x), which is
what the constant multiple rule for antiderivatives says.
64
4 Reversing differentiation
Now suppose that F (x) and G(x) are antiderivatives of f (x) and g(x),
respectively. Then
d d
(F (x)) = f (x) and (G(x)) = g(x).
dx dx
It follows from the sum rule for derivatives that
d
(F (x) + G(x)) = f (x) + g(x).
dx
This equation tells you that F (x) + G(x) is an antiderivative of
f (x) + g(x), which is what the sum rule for antiderivatives says.
Remember that, as always when a constant multiple rule and a sum rule
hold, the sum rule also holds if you replace the plus signs by minus signs.
That is, if F (x) and G(x) are antiderivatives of f (x) and g(x), respectively,
then F (x) − G(x) is an antiderivative of f (x) − g(x).
Here’s an example of how you can use the two rules above.
Example 18 Using the constant multiple rule and the sum rule for
antiderivatives
Find the indefinite integrals of the following functions.
6 √
(a) f (x) = 10x4 (b) g(t) = 3 − 5 t + 1
t
Solution
(a) Find an antiderivative by using the constant multiple rule,
together with the rule for finding an antiderivative of a power
function. To obtain the indefinite integral, add the arbitrary
constant c.
The indefinite integral is
F (x) = 10 × 15 x5 + c
= 2x5 + c.
(b) First express the function as a sum of constant multiples of
power functions.
The function is
g(t) = 6t−3 − 5t1/2 + 1.
By the sum rule, you can find an antiderivative by working
term by term. Use the method of part (a) to find an
antiderivative of each term. To obtain the indefinite integral, add
the arbitrary constant c.
65
Unit 7 Differentiation methods and integration
Notice in particular that it follows from the constant multiple rule for
antiderivatives that, for any constant a, the indefinite integral of the
function
f (x) = a (which is the same as f (x) = a × 1)
is
F (x) = ax + c.
Another way to see this is, of course, to use the fact that the derivative of
ax is a.
This useful fact is summarised below.
You can practise using the constant multiple rule and the sum rule for
antiderivatives in the activity below. In parts (d)–(g), you have to begin
by using an algebraic technique, such as multiplying out brackets or
expanding a fraction, to express the function as a sum of constant
multiples of power functions.
Remember that you must include an arbitrary constant c in every
indefinite integral that you find.
Activity 30 Using the constant multiple rule and the sum rule for
antiderivatives
Find the indefinite integrals of the following functions.
√
(a) f (x) = 8x3 (b) f (x) = 7 (c) f (x) = x − x2
(d) f (x) = (x + 2)(x − 5) (e) g(t) = (t + 1)2
) -
x2 + 1 √
3 2 1
(f) g(x) = (g) f (r) = r 10r − 2
x4 r
5 Particular antiderivatives
Sometimes when you’re working with a function, you need to find a
particular antiderivative that it has, rather than any antiderivative, or its
indefinite integral. For example, this is often what you need to do when
the function is a mathematical model of a real-life situation. In this
section, you’ll learn how to find such particular antiderivatives, and see
some examples of how you can use them.
67
Unit 7 Differentiation methods and integration
68
5 Particular antiderivatives
Solution
(a) Write down the given equation for the man’s velocity.
The man’s velocity v at time t is given by
v = 6 − 45 t.
Use integration to find an equation for his displacement. It
will contain an arbitrary constant, because at this stage we don’t
know which antiderivative is the right one.
Hence his displacement s at time t is given by
s = 6t − 5
4 × 21 t2 + c,
that is,
s = 6t − 58 t2 + c,
where c is a constant.
Use extra information to find the value of the constant c.
When the man begins his walk, his displacement from his starting
point is 0 km. That is, when t = 0, s = 0. Substituting these
values into the equation above gives
0=6×0− 5
8 × 02 + c,
that is,
c = 0.
Hence the required equation for s in terms of t is
s = 6t − 85 t2 .
(b) When t = 2,
s=6×2− 5
8 × 22 = 12 − 5
2 = 19
2 = 9.5.
So the man’s displacement two hours after he began his walk is
9.5 km.
69
Unit 7 Differentiation methods and integration
choose the correct function from a family of functions whose graphs are
vertical translations of each other), you need to use further information,
which is usually a fact about the displacement of the object at some
moment in time.
This is just what you’d expect: if all you know is an equation for the
velocity of the object in terms of time, then you don’t have enough
information to determine its displacement in terms of time, because it
could have started anywhere!
In this sort of situation you do, however, have enough information to find
the change in the displacement of the object between any two points in
time. You’ll see some examples of this later in this subsection.
Here’s an example similar to Example 20 for you to try.
Suppose that a marble is rolled in a straight line down a long slope, and its
velocity v (in metres per second) at time t (in seconds) after it begins
rolling is given by the equation
1
v= 10 t.
The marble starts rolling from a position that is 0.3 metres down the slope.
Let s be its displacement in metres from the top of the slope.
(a) Find an equation that expresses s in terms of t.
(Hint: First find an equation for s in terms of t that contains an
arbitrary constant, then use the value of s when t = 0.)
(b) Hence find the displacement of the marble from the top of the slope
4 seconds after it begins rolling.
70
5 Particular antiderivatives
In the next example, we’ll carry out this process in reverse. We’ll start by
assuming that the effect of gravity on any object falling from rest is to
produce a constant acceleration of −9.8 m s−2 . We’ll use this fact to write
down an equation for the object’s acceleration in terms of time, integrate it
to obtain an equation for the object’s velocity in terms of time, and then
integrate again to obtain an equation for the object’s displacement in
terms of time. You’ll see that the result is the equation stated above.
71
Unit 7 Differentiation methods and integration
The next activity is about the motion of a ball thrown vertically into the
air. The ball is an example of a vertical projectile. A projectile is an
object that’s launched into the air by a force that ceases after launch, and
a vertical projectile is a projectile that’s launched vertically upwards.
Provided that the effects of air resistance are negligible, a vertical
projectile has a constant downwards acceleration due to gravity of about
9.8 m s−2 at all times after its launch, just as an object falling from rest
has. The only thing that makes its motion different from that of an object
falling from rest is that at the start of its motion its velocity is not zero. In
the next activity you’re asked to use a process similar to that in
Example 21 to find a formula for the displacement of a ball thrown
vertically into the air, in terms of time.
72
5 Particular antiderivatives
Let the acceleration, velocity and displacement of the ball at time t (in
seconds) after it was thrown be a (in m s−2 ), v (in m s−1 ) and s (in m),
respectively, where displacement is measured from the point from which
the ball was thrown.
(a) Find an equation that expresses v in terms of t.
(b) Hence find an equation that expresses s in terms of t.
(c) Use the formulas that you found in parts (a) and (b) to find the
velocity and the displacement of the ball 1 second after it was thrown.
(d) Use the formula that you found in part (b) to determine how long it
takes for the ball to fall back to the point from which it was thrown.
Hint: At the time when the ball has fallen back to this point, what is
the value of s?
The next activity involves an economic model of a type that you met in
Unit 6. You saw there that marginal cost is the derivative of total cost.
73
Unit 7 Differentiation methods and integration
74
5 Particular antiderivatives
Figure 34 (a) The numbers 1 and 2 in the domain of f (x) = 2x (b) The
amounts that two different antiderivatives of f (x) = 2x change as x
changes from 1 to 2
You can see that no matter what the value of c is, the two occurrences of it
will cancel out when you calculate F (2) − F (1). Hence you’ll always get
the same answer for F (2) − F (1), namely 3.
In general, we have the following important fact.
75
Unit 7 Differentiation methods and integration
Figure 36 The graph of Suppose that the rate of change of a quantity is given by the function
f (x) = ex/8 (x ≥ 0) f (x) = ex/8 (x ≥ 0), as shown in Figure 36. Find the amount by which the
quantity changes from x = 2 to x = 5, to three significant figures.
76
5 Particular antiderivatives
Solution
Find any antiderivative F of f and calculate F (5) − F (4).
An antiderivative of f is
F (t) = 23 t2 .
So the change in the displacement of the car from t = 4 to t = 5 is
F (5) − F (4) = ( 23 × 52 ) − ( 23 × 42 )
= 37.5 − 24
= 13.5.
That is, the car travels 13.5 m (in the positive direction) during the
fifth second.
The ideas that you’ve met in this subsection will be important in Unit 8.
77
Unit 7 Differentiation methods and integration
Figure 40 The graphs of (a) g(x) = 1/x (x > 0) and (b) G(x) = ln x
78
6 Antiderivatives of further standard functions
To deal with the ‘other half’ of the function f (x) = 1/x, you can use the
fact that equation (1), together with the chain rule, gives
d 1 1
(ln(−x)) = × (−1) = .
dx −x x
This formula applies for negative values of x, because ln(−x) is defined
when −x is positive, that is, when x is negative. So it tells you that an
antiderivative of the function
h(x) = 1/x (x < 0)
is
H(x) = ln(−x).
This fact is illustrated in Figure 41. The graph of h(x) = 1/x (x < 0), in
Figure 41(a), gives the gradients of the graph of H(x) = ln(−x), in
Figure 41(b).
Figure 41 The graphs of (a) h(x) = 1/x (x < 0) and (b) H(x) = ln(−x)
You can obtain an antiderivative for the function f (x) = 1/x, with its
whole domain (−∞, 0) ∪ (0, ∞), by putting together the antiderivatives
found for the two ‘halves’ of f . The graph of this antiderivative is shown in
Figure 42(b). The graph of f (x) = 1/x is shown in Figure 42(a): it gives
the gradients of the graph in Figure 42(b).
Figure 42 The graphs of (a) f (x) = 1/x and (b) F (x) = ln x and
F (x) = ln(−x)
79
Unit 7 Differentiation methods and integration
The formula for the antiderivative of f (x) = 1/x found above can be
written as
/
ln(−x) (x < 0)
F (x) = (2)
ln x (x > 0).
However, there’s a more concise way to write it, using the notation | | for
the modulus of a number. Notice that if x is a negative number, then
−x = |x|
(for example, −(−3) = 3 = | − 3|). Also, if x is a positive number, then
x = |x|
(for example, 3 = |3|). So formula (2) can be written concisely as
F (x) = ln |x|.
Hence the indefinite integral of f (x) = 1/x is
F (x) = ln |x| + c.
This formula is sometimes useful, but often when you’re working with the
function f (x) = 1/x, the variable x takes only positive values. If this is the
case, then you can just use the antiderivative of the function f (x) = 1/x
(x > 0), which has the simpler formula
F (x) = ln x + c.
The facts that you’ve seen so far in this section are summarised below.
In the next activity you’re asked to use this formula, together with the
constant multiple rule and the sum rule for antiderivatives, to find the
indefinite integrals of some further functions.
80
6 Antiderivatives of further standard functions
81
Unit 7 Differentiation methods and integration
ex ex + c
sin x − cos x + c
cos x sin x + c
sec2 x tan x + c
cosec2 x − cot x + c
sec x tan x sec x + c
cosec x cot x − cosec x + c
1
√ sin−1 x + c
1 − x2 or − cos−1 x + c
1
tan−1 x + c
1 + x2
In the next activity you’re asked to use these formulas, together with the
constant multiple rule and the sum rule for antiderivatives, to find the
indefinite integrals of some further functions. In some of the parts you
have to start by using an algebraic technique to express the function in a
form that you can integrate.
82
Learning outcomes
Learning outcomes
After studying this unit, you should be able to:
• remember the formulas for the derivatives of some standard functions
• differentiate a wide variety of functions, including products, quotients
and composites
• use differentiation to solve simple optimisation problems
• find and use derivatives using a computer
• understand and use the relationship between the derivative of an
invertible function and the derivative of its inverse function
• understand that integration reverses differentiation
• understand what’s meant by an antiderivative and the indefinite
integral of a function
• remember the formulas for the indefinite integrals of some standard
functions
• use the constant multiple rule and the sum rule for antiderivatives
• use integration to work with the relationship between a changing
quantity and its rate of change
• in particular, use integration to work with the relationships between
displacement, velocity and acceleration.
83
Unit 7 Differentiation methods and integration
Solutions to activities
Solution to Activity 1 Solution to Activity 2
The graph of the derivative of the sine function (a) f (x) = sin x + cos x, so
for x between 0 and 2π must look something like the f " (x) = cos x − sin x.
sketch below.
(b) g(u) = u2 − cos u, so
g " (u) = 2u − (− sin u)
= 2u + sin u.
(c) P = 6 tan θ, so
dP
= 6 sec2 θ.
dθ
(d) r = −2(1 + sin φ), so
dr
(You can work this out as follows. = −2(0 + cos φ)
dφ
When x = π, the gradient of the graph of the sine = −2 cos φ.
function seems to be about the same as the gradient
of the line y = −x; that is, it seems to be about −1. Solution to Activity 3
As x increases, the graph gradually gets less steep – (a) f (x) = ex + ln x, so
that is, its gradient takes negative values of smaller
1
and smaller magnitude, until x = 3π/2, when the f " (x) = ex + .
gradient seems to be zero. The gradient changes x
slowly when x is only a little larger than π, but (b) h(r) = r − cos r − 3 ln r, so
changes more rapidly as x gets closer to 3π/2. 1
h" (r) = 1 − (− sin r) − 3 ×
In other words, for x between π and 3π/2 the value r
3
of the derivative seems to increase from −1 to zero, = 1 + sin r − .
slowly at first, but then more rapidly. r
1
When x increases from 3π/2, the gradient starts at (c) v = + ln t, so
zero, then it becomes positive. At first the graph is t
dv 1 1
not very steep – that is, the gradient has small =− 2 +
positive values – but it becomes steeper and steeper dt t t
1 t
– that is, the gradient takes larger and larger =− 2 + 2
positive values. Eventually, when x = 2π, the t t
t−1
gradient seems to be about the same as the gradient = 2 .
of the line y = x; that is, it seems to be about 1. t
The gradient increases fairly rapidly when x is only (d) w = 5 − 3eu , so
a little larger than 3π/2, but increases more slowly dw
= −3eu .
as x gets closer to 2π. du
So, for x between 3π/2 and 2π, the value of the (e) k = 4(ln v − tan v), so
) -
derivative seems to increase from 0 to 1, fairly dk 1 2
rapidly at first, but then more slowly.) =4 − sec v .
dv v
Solution to Activity 4
(a) The function is k(x) = x3 tan x.
Let f (x) = x3 and g(x) = tan x.
Then f " (x) = 3x2 and g " (x) = sec2 x.
84
Solutions to activities
85
Unit 7 Differentiation methods and integration
86
Solutions to activities
87
Unit 7 Differentiation methods and integration
√
(c) f (x) = cos( x) = cos(x1/2 ), so, by the chain (f) w(x) = ln(7x), so
rule, 7 1
w" (x) = = .
f " (x) = − sin(x1/2 ) × 12 x−1/2 7x x
√ (Here’s an alternative way to differentiate this
sin( x)
=− √ . function:
2 x
1 w(x) = ln(7x) = ln 7 + ln x, so
(d) f (x) = √ x = (ex )−1/2 = e−x/2 , so, by the 1 1
e w" (x) = 0 + = .)
chain rule, x x
) -
f " (x) = e−x/2 × (− 12 ) −2t
(g) f (t) = sin , so
3
= − 12 e−x/2 ) -
2 −2t
1 f " (t) = − cos .
= − √ x. 3 3
2 e
(This solution√uses the index laws (am )n = amn (h) k(φ) = tan(3φ), so
and am/n = n am .) k " (φ) = 3 sec2 (3φ).
1
(e) a(t) = = (2 − 3t − 3t2 )−1/3 , Solution to Activity 16
(2 − 3t − 3t2 )1/3
so, by the chain rule, (a) k(x) = cos(7x + 4), so
a" (t) = − 13 (2 − 3t − 3t2 )−4/3 × (−3 − 6t) k " (x) = −7 sin(7x + 4).
1 + 2t (b) h(x) = e−x−3 , so
= .
(2 − 3t − 3t2 )4/3 h" (x) = −e−x−3 .
2 −2 ) - ) -
(f) C(p) = e1/p = e(p )
, so, by the chain rule, θ−1 θ 1
(c) r(θ) = sin = sin − , so
C " (p) = e(p
−2
)
× (−2p−3 ) 2 2 2
) - ) -
2 1 θ 1 1 θ−1
= −2p−3 e1/p r" (θ) = cos − = cos .
2 2 2 2 2 2
2e1/p ) - ) -
=− . 2−θ 2 θ
p3 (d) s(θ) = sin = sin − , so
3 3 3
) - ) -
Solution to Activity 15 " 1 2 θ 1 2−θ
s (θ) = − cos − = − cos .
(a) k(x) = cos(7x), so 3 3 3 3 3
k " (x) = −7 sin(7x).
) - Solution to Activity 17
θ
(b) r(θ) = sin , so (a) g(x) = (x2 + 1)(x3 + 1)
2
) - = x5 + x3 + x2 + 1,
" 1 θ
r (θ) = 2 cos . so
2
g " (x) = 5x4 + 3x2 + 2x
3u/2
(c) g(u) = e , so
= x(5x3 + 3x + 2).
"
g (u) = 23 e3u/2 .
(An alternative method is to use the product
−x
(d) h(x) = e , so rule.)
h" (x) = −e−x .
(e) p(α) = cos(−8α), so
p" (α) = −8(− sin(−8α))
= 8 sin(−8α).
89
Unit 7 Differentiation methods and integration
x cos(4u)
(b) f (x) = , so (e) q(u) = , so
(x − 2)3 e3u
d % $ d d % 3u $
(x − 2)3 × 1 − x (x − 2)3 e3u (cos(4u)) − cos(4u) e
f " (x) = dx q " (u) = dx dx
(x − 2)6 (e3u )2
(by the quotient rule) (by the quotient rule)
(x − 2)3 × 1 − x × 3(x − 2)2 × 1 e3u × (−4 sin(4u)) − cos(4u) × 3e3u
= =
(x − 2)6 (e3u )2
(by the chain rule) (by the chain rule)
(x − 2)3 − 3x(x − 2)2 −4e3u sin(4u) − 3e3u cos(4u)
= =
(x − 2)6 e6u
3u
(x − 2) − 3x −e (4 sin(4u) + 3 cos(4u))
= =
(x − 2)4 e6u
−2x − 2 4 sin(4u) + 3 cos(4u)
= =−
(x − 2)4 e3u
2(x + 1) (f) f (x) = esin(5x) , so
=− .
(x − 2)4 f " (x) = esin(5x) × 5 cos(5x)
(c) g(x) = cos(x ln x), so (by the chain rule)
d = 5esin(5x) cos(5x).
g " (x) = − sin(x ln x) (x ln x)
dx
(by the chain rule) (g) h(x) = cos3 (x2 ), so
) - d % $
1 h" (x) = 3 cos2 (x2 ) cos(x2 )
= − sin(x ln x) x × + (ln x) × 1 dx
x (by the chain rule)
(by the product rule)
= 3 cos2 (x2 ) × (− sin(x2 )) × 2x
= − sin(x ln x)(1 + ln x). (by the chain rule again)
(d) h(x) = ex/2 sin(3x), so
= −6x cos2 (x2 ) sin(x2 ).
d d , x/2 +
h" (x) = ex/2 (sin(3x)) + sin(3x) e e−2x
dx dx (h) z(x) = = 51 e−2x , so
(by the product rule) 5
z " (x) = 15 × (−2e−2x )
= ex/2 × 3 cos(3x) + sin(3x) × 12 ex/2 2
(by the chain rule) = − e−2x .
5
= 3ex/2 cos(3x) + 12 ex/2 sin(3x) (i) G(x) = 3 sin x cos x + 2 sin(3x), so
= 12 ex/2 (6 cos(3x) + sin(3x)). G" (x) = 3((sin x)(− sin x) + (cos x)(cos x))
+ 2(cos(3x)) × 3
(by the product rule)
= −3 sin2 x + 3 cos2 x + 6 cos(3x).
(You can simplify this answer further by using
the trigonometric identity
cos(2θ) = cos2 θ − sin2 θ:
− 3 sin2 x + 3 cos2 x + 6 cos(3x)
= 3(− sin2 x + cos2 x) + 6 cos(3x)
= 3 cos(2x) + 6 cos(3x).)
90
Solutions to activities
91
Unit 7 Differentiation methods and integration
Let the volume of the box be V (in m3 ). The endpoints of the interval are 0 and 21 , so the
maximum value of V occurs when x = 0, x = 16 or
x = 12 .
When x = 0 the height of the box is zero, and when
x = 12 its length and width are zero, so in both these
cases the volume V will be zero. When x = 16 , the
length, width and height of the box are all positive,
so the volume V will also be positive.
So the maximum value of V is achieved when x = 16 .
Hence the sides of the squares should be 16 m; that
is, about 16.7 cm.
(A graph of V against x, for x between 0 and 12 , is
shown below. You can see that the maximum value
of V does indeed seem to occur when x is
approximately 61 = 0.166 . . ..)
92
Solutions to activities
Let the angle between the two sides of the v-shape (A graph of A against θ, for θ between 0 and π, is
be θ, as shown below. The value of θ must be shown below. You can see that the maximum value
between 0 and π. of A does indeed seem to occur when θ is
Let the area of the cross-section of the channel be A approximately π/2.)
(in m2 ).
93
Unit 7 Differentiation methods and integration
(e) The solution of the equation f " (x) = 0 is At a stationary point, f " (x) = 0, which gives
x = 0.577 (to 3 s.f.). x 1
√ − =0
(f) The man should join the road 0.58 kilometres 8 1+x 2 16
(to the nearest 10 metres) along from the point x 1
√ =
that is closest to his initial position. 8 1+x 2 16
(
(Because the graph of f indicates that the minimum 16x = 8 1 + x2
value of f (x) occurs at a local minimum rather than (
at one of the endpoints of the interval [0, 2], in the 2x = 1 + x2
solution above we don’t consider the values of f (0) (2x) = 1 + x2
2
94
Solutions to activities
√
1 + 22 2 − 2 Solution to Activity 25
f (2) = +
√ 8 √ 16 (a) f (x) = sin−1 (3x), so, by the chain rule,
5 5 1
= +0= ≈ 0.28. f " (x) = ( ×3
8 8 1 − (3x)2
So the √
minimum value of f (x) is achieved when
3
x = 1/ 3 = 0.577 (to 3 s.f.). Hence, as was found =√ .
above using a computer, the man should join the 1 − 9x2
road 0.58 kilometres (to the nearest 10 metres) along x −1
from the point that is closest to his initial position. (b) f (x) = e cos) x, so, by the-
product rule,
1
f " (x) = ex − √ + (cos−1 x)ex
Solution to Activity 24 1 − x2
) -
(a) We have y = cos−1 x, so 1
= cos−1 x − √ ex .
x = cos y. 1−x 2
95
Unit 7 Differentiation methods and integration
√
4 x = x1/4 is
(d) The indefinite integral of f (x) = (f) We have
F (x) = 4 5/4
5x + c. x2 + 1 1 1
g(x) = = 2 + 4 = x−2 + x−4 ,
x 4 x x
(e) The indefinite integral of g(t) = t−2/3 is
which has indefinite integral
G(t) = 3t1/3 + c. G(x) = −x−1 − 31 x−3 + c
1 1 1
(f) The indefinite integral of b(v) = √ = v −1/2 is = − − 3 + c.
v x 3x
1/2
√
B(v) = 2v + c = 2 v + c. (g) We have
) -
√ 1
Solution to Activity 29 f (r) = 3 r 10r2 − 2
r
(a) The indefinite integral of f (x) = x5 is
= r1/3 (10r2 − r−2 )
F (x) = 16 x6 + c.
= 10r7/3 − r−5/3
(b) Three different antiderivatives of f are which has indefinite integral
F (x) = 61 x6 , 3 10/3
F (r) = 10( 10 r ) − (− 32 r−2/3 ) + c
F (x) = 16 x6 + 1,
= 3r10/3 + 23 r−2/3 + c
1 6
F (x) = 6x
− 2.
= 23 r−2/3 (1 + 2r4 ) + c.
(You probably chose different examples.)
Solution to Activity 31
Solution to Activity 30
(a) The function is f (x) = x + 2. Its indefinite
(a) The indefinite integral of f (x) = 8x3 is integral is
F (x) = 8 × 14 x4 + c F (x) = 12 x2 + 2x + c.
= 2x4 + c. Using the fact that F (1) = 9
gives
2
(b) The indefinite integral of f (x) = 7 is 1 2
×1 +2×1+c= 9
2 2
F (x) = 7x + c. 5
+c= 9
2 2
(c) The indefinite integral of c = 24 = 2.
√
f (x) = x − x2 = x1/2 − x2 is
Hence the required antiderivative is
F (x) = 23 x3/2 − 13 x3 + c F (x) = 21 x2 + 2x + 2.
= 13 (2x3/2 − x3 ) + c. (b) The function is f (x) = 1/x3 = x−3 with domain
(d) We have (0, ∞). Its indefinite integral is
f (x) = (x + 2)(x − 5) 1 −2 1
F (x) = x + c = − 2 + c.
= x2 − 3x − 10, −2 2x
which has indefinite integral Using the fact that F (3) = − 91 gives
1 1
F (x) = 13 x3 − 32 x2 − 10x + c. − +c=−
2×3 2 9
(e) We have 1 1
− +c=−
g(t) = (t + 1)2 18 9
= t2 + 2t + 1, 1 1 1
c=− + =− .
which has indefinite integral 9 18 18
Hence the required antiderivative is
G(t) = 31 t3 + t2 + t + c. 1 1
F (x) = − 2 −
2x 18
9 + x2
=− (x ∈ (0, ∞)).
18x2
96
Solutions to activities
97
Unit 7 Differentiation methods and integration
Solution to Activity 34
(a) The given equation for m in terms of q is
m = 250 − 0.5q.
Since marginal cost is the derivative of total
cost, the equation for t in terms of q is
t = 250q − 0.5 × 21 q 2 + c,
that is,
t = 250q − 0.25q 2 + c,
where c is a constant.
According to the information given in the
question, when q = 80, t = 30 400.
Solution to Activity 35
Substituting these values into the equation
An antiderivative of f is
found for t above gives
F (x) = 8ex/8 .
30 400 = 250 × 80 − 0.25 × 802 + c
So the change in the quantity from x = 2 to x = 5 is
30 400 = 18 400 + c
F (5) − F (2) = 8e5/8 − 8e2/8
c = 12 000.
= 4.67 (to 3 s.f.).
Hence the equation for t in terms of q is
t = 250q − 0.25q 2 + 12 000. Solution to Activity 36
(b) The total weekly cost of producing 160 tonnes An antiderivative of f is
of fertiliser each week is F (t) = 3t − 23 × 12 t2 = 3t − 34 t2 .
£(250 × 160 − 0.25 × 1602 + 12 000) (a) The change in the displacement (in metres) of
= £45 600. the object from t = 0 to t = 1 is
(c) The question states that the total weekly cost of F (1) − F (0) = (3 × 1 − 43 × 12 )
producing 80 tonnes of fertiliser each week is − (3 × 0 − 34 × 02 )
£30 400. Hence the cost per tonne of producing
= 2.25.
80 tonnes of fertiliser each week is
£30400 That is, the object travels 2.25 m in that time
= £380. (in the positive direction).
80
98
Solutions to activities
(b) The change in the displacement (in metres) of Hence its indefinite integral is
the object from t = 4 to t = 5 is 1
G(t) = ln |t| + c.
F (5) − F (4) = (3 × 5 − 34 × 52 ) 3
− (3 × 4 − 3
× 42 ) (e) The function is
4
1
= (15 − 75
4 ) − (12 − 12) f (x) = (x > 0),
πx
= −3.75. which can be written as
That is, the object travels 3.75 m in that time 1 1
f (x) = × (x > 0).
(in the negative direction). π x
Hence its indefinite integral is
Solution to Activity 37 1
F (x) = ln x + c.
4 π
(a) The function f (x) = has indefinite integral
x 3
F (x) = 4 ln |x| + c. (f) The indefinite integral of f (x) = is
1 + x2
(b) The function F (x) = 3 tan−1 x + c.
x+1 1 1 1 1
f (x) = = + 2 = + x−2 (g) The indefinite integral of h(t) = √ is
x2 x x x 4 1 − t2
has indefinite integral
H(t) = 14 sin−1 t + c.
1
F (x) = ln |x| − x−1 + c = ln |x| − + c. (Or, alternatively, H(t) = − 41 cos−1 t + c.)
x
(c) The function (h) The function is
1 1
f (x) = (x > 0) p(x) = ,
2x 5 + 5x2
has indefinite integral which can be written as
1 1
F (x) = 21 ln x + c. p(x) = × .
5 1 + x2
(The answer F (x) = 12 ln |x| + c is also correct, Hence its indefinite integral is
but can be simplified, as x takes only positive
values.) P (x) = 51 tan−1 x + c.
(i) The function is
Solution to Activity 38 1
g(x) = √ ,
(a) The indefinite integral of h(θ) = 3 cos θ + 4 sin θ 4 − 4x2
is which can be written as
H(θ) = 3 sin θ + 4(− cos θ) + c 1
g(x) = √ √ ;
= 3 sin θ − 4 cos θ + c. 4 1 − x2
that is,
(b) The indefinite integral of g(φ) = 6 − 5 cosec2 φ is
1
G(φ) = 6φ − 5(− cot φ) + c g(x) = √ .
2 1 − x2
= 6φ + 5 cot φ + c. Hence its indefinite integral is
7 G(x) = 21 sin−1 x + c.
(c) The indefinite integral of f (x) = is
x
(Or, alternatively, G(x) = − 21 cos−1 x + c.)
F (x) = 7 ln |x| + c.
(d) The function is
1
g(t) = ,
3t
which can be written as
1 1
g(t) = × .
3 t
99
Unit 7 Differentiation methods and integration
100
Acknowledgements
Acknowledgements
Grateful acknowledgement is made to the following sources:
Page 30: Mark Hobbs, for the idea for the cartoon
c gallica.bnf.fr / Bibliothèque nationale de France
Page 55: '
Every effort has been made to contact copyright holders. If any have been
inadvertently overlooked the publishers will be pleased to make the
necessary arrangements at the first opportunity.
101
Unit 8
Integration methods
Introduction
Introduction
In this unit you’ll continue your study of integration. In Unit 7 you saw
that integration is the reverse of differentiation. You saw that you can use
it to solve problems in which you know the values taken by the rate of
change of a continuously changing quantity, and you want to work out the
values taken by the quantity, or how much the quantity changes over some
period. For example, you saw that if you have a formula for the velocity of
an object in terms of time, then you can use integration to find a formula
for the displacement of the object in terms of time, or work out the change
in the displacement of the object over some period of time.
In Sections 1 and 2 of this unit, you’ll meet a different way to think about
integration. This way of thinking about it involves areas of regions of the
plane, rather than rates of change. You’ll see that the link between the two
ways of thinking about integration helps you to solve problems both about
areas and about rates of change.
In Sections 3 to 5 of the unit, you’ll learn some more methods for
integrating functions, which you can use in addition to the methods that
you learned in Unit 7. These further methods will allow you to integrate a
much wider variety of functions than you’ve learned to integrate so far.
Many students find some of the material in this unit quite challenging
when they first meet it, so don’t worry if that’s the case for you too. As
with most mathematics, it will seem more straightforward once you’re
more familiar with it. If you find a particular technique difficult, then
make sure that you watch the associated tutorial clips, and use the
practice quiz and exercise booklet for this unit for more practice.
This unit is likely to take you more time to study than most of the other
units, so the study planner allows extra time for it.
105
Unit 8 Integration methods
Figure 1 (a) The shape of a building (b) the graph that gives the curve
of the roof
106
1 Areas, signed areas and definite integrals
In the next example this method is used, with four subintervals, to find an
approximate value for the area discussed above. Notice that, in both the
example and in Figure 2 above, the rectangle corresponding to the first
subinterval has zero height and therefore zero area – this happens because
the value of the function at the left endpoint of this subinterval is zero.
107
Unit 8 Integration methods
Solution
We divide the interval [−18, 18] into four subintervals of equal width.
The whole interval has width 36, so each subinterval has width
36/4 = 9.
The left endpoints of the four subintervals are
−18, −18 + 9, −18 + 2 × 9, −18 + 3 × 9,
that is
−18, −9, 0, 9.
So the heights of the rectangles are
f (−18), f (−9), f (0), f (9),
that is,
9− 1
36 × (−18)2 , 9− 1
36 × (−9)2 , 9− 1
36 × 02 , 9− 1
36 × 92 ,
which evaluate to
27 27
0, 4 , 9, 4 .
So we obtain the following approximate value for the area:
405
(0 × 9) + ( 27 27
4 × 9) + (9 × 9) + ( 4 × 9) = 2 = 202.5.
108
1 Areas, signed areas and definite integrals
Note that the units for the area of a region on a graph are, as you’d
expect, the units on the vertical axis times the units on the horizontal axis.
For example, if the units on both axes are metres, then the units for area
are square metres (m2 ). If a graph has no specific units on the axes, then
the units for area are simply ‘square units’, and we usually omit them
when we state an area.
In the next activity you’re asked to find another approximation for the
area of the cross-section of the roof discussed earlier, by using the
subinterval method with six subintervals instead of four.
Use the method described above, with six subintervals as shown below, to
find an approximate value for the area between the graph of the function
1 2
f (x) = 9 − 36 x and the x-axis.
Table 1 shows the approximate values for the area between the graph of
1 2
the function f (x) = 9 − 36 x and the x-axis that were found in Example 1
and in the solution to Activity 1. It also shows, to three decimal places,
some further approximate values that were found in the same way, but
using larger numbers of subintervals. The calculations were carried out
using a computer.
You can see that as the number of subintervals gets larger and larger, the
approximation obtained seems to be getting closer and closer to a
particular number. We say that this number is the limit of the
approximations as the number of subintervals tends to infinity. It’s the
1 2
exact value of the area between the graph of the function f (x) = 9 − 36 x
and the x-axis. From Table 1, it looks as if the exact value of this area is
216, or perhaps a number very close to 216.
109
Unit 8 Integration methods
Figure 3 (a) The area between a graph and the x-axis, from x = a to
x = b (b) A collection of rectangles whose total area is approximately
this area
Here’s a summary of the method that you’ve met for finding an
approximate value for an area of this type. You start by dividing the
interval [a, b] into a number, say n, of subintervals of equal width. The
width of each subinterval is then (b − a)/n. For each subinterval you
calculate the product
, -
left endpoint b−a
f × , (1)
of subinterval n
and you add up all these products. In general, the larger the number n of
subintervals, the closer your answer will be to the area between the graph
of f and the x-axis, from x = a to x = b.
Now suppose that you have a continuous function f whose graph lies on or
below the x-axis throughout an interval [a, b], and you want to calculate
the area between the graph of f and the x-axis, from x = a to x = b, as
illustrated in Figure 4(a).
You can use the method above, with a small adjustment at the end, to
calculate an approximate value for an area like this. Consider what
happens when you apply the method to a graph like the one in
Figure 4(a). You start by dividing the interval [a, b] into n subintervals of
equal width, then you calculate all the products of form (1) and add them
all up. For a graph like this, because the value of f at the left endpoint of
each subinterval is negative (or possibly zero), each product of form (1)
will also be negative (or zero). In fact, each product of form (1) will be the
negative of the area between the line segment that approximates the curve
and the x-axis, as illustrated in Figure 4(b). So when you add up all the
products of form (1), you’ll obtain an approximate value for the negative of
the area between the curve and the x-axis, from x = a to x = b.
110
1 Areas, signed areas and definite integrals
Figure 4 (a) The area between another graph and the x-axis, from x = a
to x = b (b) A collection of rectangles whose total area is approximately
this area
That’s not a problem, because you can simply remove the minus sign to
obtain the approximate value for the area that you want. However, to help
us deal with situations like this, it’s useful to make the following
definitions. These definitions will be important throughout the unit.
Consider any region on a graph that lies either entirely above or entirely
below the x-axis. The signed area of the region is its area with a plus or
minus sign according to whether it lies above or below the x-axis,
respectively. For example, in Figure 5 the two shaded regions above the
x-axis have signed areas +4 and +6, respectively, which you can write
simply as 4 and 6, and the shaded region below the x-axis has signed
area −3.
111
Unit 8 Integration methods
Of course, the signed area value found in Example 2(c) doesn’t correspond
to any actual area on the graph, as it’s the sum of a positive signed area
and a negative signed area.
112
1 Areas, signed areas and definite integrals
Consider again the graph in Example 2. In each of parts (a)–(d) below, use
the given areas to find the signed area between the graph and the x-axis,
from the first value of x to the second value of x.
(a) From x = 1 to x = 5. (b) From x = 5 to x = 7.
(c) From x = 1 to x = 7. (d) From x = 1 to x = 9.
With the definition of signed area that you’ve now seen, the method that
you’ve met in this subsection can be described concisely as follows.
Strategy:
To find an approximate value for the signed area between
the graph of a continuous function f and the x-axis, from
x = a to x = b
Divide the interval between a and b into n subintervals, each of
width (b − a)/n. For each subinterval, calculate the product
, -
endpoint of subinterval b−a
f × ,
nearest a n
and add up all these products.
In general, the larger the number n of subintervals, the closer your
answer will be to the required signed area.
Notice that the box above uses the phrase ‘endpoint of subinterval
nearest a’, rather than ‘left endpoint of subinterval’, which means the same
thing. This will be convenient later in this subsection. (Note that, in this
phrase, it’s the endpoint of the subinterval, not the subinterval itself,
that’s nearest a!)
Figures 6 and 7 illustrate the strategy above. If you add up the signed
areas of all the rectangles in Figure 6(a), then you’ll obtain an
approximate value for the signed area in Figure 6(b).
Similarly, if you add up the signed areas of all the rectangles in Figure 7(a),
then you’ll obtain an approximate value for the signed area in Figure 7(b).
Use the method described in the box above, with six subintervals as shown
on the left below, to find an approximate value for the signed area between
the graph of the function f (x) = 3 − x2 and the x-axis from x = −3 to
x = 3, as shown on the right below.
114
1 Areas, signed areas and definite integrals
Open the applet Approximations for signed areas. Check that the function
is set to f (x) = 3 − x2 , the values of a and b are set to −3 and 3
respectively, and the number of subintervals is set to 6. The resulting
approximation for the signed area between the graph of f and the x-axis,
from x = a to x = b, should then be the value found in the solution to
Activity 3.
Now increase the number of subintervals, and observe the effect on the
approximation. What do you think is the exact value of the signed area?
Experiment by changing the function, the values of a and b, and the
number of subintervals, to find approximate values for some other signed
areas.
115
Unit 8 Integration methods
116
1 Areas, signed areas and definite integrals
Solution
(a) From 3 to 3 is from a number to the same number.
The signed area from x = 3 to x = 3 is 0.
(b) From 6 to 10 is forward.
The signed area from x = 6 to x = 10 is −5.4.
(c) From 10 to 6 is backward.
The signed area from x = 6 to x = 10 is −5.4,
so the signed area from x = 10 to x = 6 is 5.4.
(d) From 6 to −1 is backward.
The signed area from x = −1 to x = 6 is −2.4 + 4.0 = 1.6,
so the signed area from x = 6 to x = −1 is −1.6.
In each part of the next activity, start by thinking about whether x moves
forward or backward (or stays fixed) as it moves from the first value of x
to the second value of x.
117
Unit 8 Integration methods
The subinterval method for finding approximate values for signed areas,
which is summarised in the box on page 113, applies no matter whether
the value of b is greater than, less than, or equal to the value of a. The
reason why it works in cases where b is less than a is that in these cases
the ‘width’ (b − a)/n of each subinterval is negative.
You might like to use the applet Approximations for signed areas to see
examples of the method giving approximate values for signed areas in cases
where b is less than a.
Of course, if you want to work out an area between a curve and the x-axis
over an interval [a, b] by using the method that you’ve seen in this
subsection, then normally you would work out the signed area from a to b,
rather than the signed area from b to a. However, it’s important to
understand what’s meant by the signed area from b to a, for reasons that
you’ll see later in the unit.
118
1 Areas, signed areas and definite integrals
119
Unit 8 Integration methods
As with Leibniz notation for derivatives, the ‘d’ in the notation for definite
integrals has no independent meaning. In many texts, including this one,
it’s printed in upright type, rather than italic type, to emphasise that it’s
not a variable. You don’t need to do anything special when you
handwrite it.
/b
When you use the notation a f (x) dx, remember that you must include
/b
not only the a at the beginning, but also the dx at the end. Try not to
forget the dx!
You’ve now seen that a definite integral is a type of signed area, and you
saw in Unit 7 that an indefinite integral is a type of general antiderivative.
These two seemingly unrelated concepts are, as you probably suspect,
closely related. You’ll see how in Section 2.
The box below lists some standard properties of definite integrals, which
come from their definition as signed areas. These properties hold for all
values of a, b and c in the domain of the continuous function f . The
second property comes from the extended definition of signed area, to
cases where b is less than a. The third property expresses the fact that the
signed area from a to c is equal to the signed area from a to b plus the
signed area from b to c. Figure 12 illustrates this property in a case where
a < b < c, but the extended definition of signed area ensures that the
property holds even when a, b and c aren’t in this order. This is one reason
why the definition of signed area is extended in the way that you’ve seen.
120
1 Areas, signed areas and definite integrals
Consider the function f whose graph is shown below. The areas of some
regions are marked. By using these areas, write down the values of the
following definite integrals.
Hint: notice that in some of these definite integrals the upper limit of
integration is less than the lower limit of integration.
! −3 ! 2 ! 7
(a) f (x) dx (b) f (x) dx (c) f (x) dx
−5 −3 2
! 2 ! 7 ! 7
(d) f (x) dx (e) f (x) dx (f) f (x) dx
−5 −3 −5
! 5 ! −3 ! 2
(g) f (x) dx (h) f (x) dx (i) f (x) dx
5 2 7
! −5 ! −3 ! −5
(j) f (x) dx (k) f (x) dx (l) f (x) dx
−3 7 7
121
Unit 8 Integration methods
As you’d expect, you can replace the expression f (x) in the notation for a
definite integral by the formula for a particular function. For example, the
signed areas in Figure 13 are denoted by
! 1 ! 1
x2 dx and (x3 − 1) dx,
−1 0
respectively.
122
1 Areas, signed areas and definite integrals
/b
Suppose that you want to find a f (x) dx, as illustrated in Figure 14(a).
You divide the interval between a and b into n subintervals, each of
width (b − a)/n, as illustrated in Figure 14(b). We’ll denote (b − a)/n
by w here, for conciseness.
/b
Figure 14 (a) A definite integral a f (x) dx (b) A collection of n
rectangles whose total signed area is approximately this definite integral
The endpoints nearest a of the subintervals are
a + 0w, a + 1w, a + 2w, ..., a + (n − 1)w.
123
Unit 8 Integration methods
where w = (b − a)/n.
124
1 Areas, signed areas and definite integrals
The name and principal symbol for integral calculus were discussed
by Gottfried Wilhelm Leibniz and the Swiss mathematician
Johann Bernoulli.
/ Leibniz preferred ‘calculus summatorius’ as the
name, and , the elongated S, as the symbol. Bernoulli preferred
‘calculus integralis’ as the name, and the capital letter I as the
symbol. Eventually they agreed to compromise, / adopting Bernoulli’s
name ‘integral calculus’ and Leibniz’s symbol . The first appearance
in print of the word ‘integral’ was in a work by Johann’s brother,
Jacob Bernoulli, in 1690, though Johann insisted that he had been
the one to introduce the term.
Jacob Bernoulli (1654–1705)
125
Unit 8 Integration methods
The modern notation for a definite integral, with limits at the bottom
and top of the integral sign, is due to the French mathematician
Joseph Fourier, who introduced it in his pioneering book on heat
diffusion Théorie analytique de la chaleur (The analytic theory of
heat) of 1822. He first published the notation four years previously in
an announcement for the book. Fourier accompanied
Napoleon Bonaparte on his Egyptian expedition of 1798, and later
supported Jean-François Champollion, the decipherer of the Rosetta
Stone. He is remembered today for initiating the investigation of
Fourier series. These are infinite trigonometric series, which are now
named after him, and which arose in the context of his work on heat
diffusion. You’ll meet the idea of a series in Unit 10.
Definite integrals
Suppose that f is a continuous function, and a and b are numbers in
its domain. The signed area between the graph of f and the x-axis
from x = a to x = b is called the definite integral of f from a to b,
and is denoted by
! b
f (x) dx.
a
126
2 The fundamental theorem of calculus
true. The two subsections that follow, Subsections 2.2 and 2.3, show you
how to use the fundamental theorem of calculus. You might like to try
reading Subsection 2.1 again later, as some of the ideas that you found
difficult the first time might seem easier once you’re more familiar with the
ways in which you can use the fundamental theorem of calculus.
127
Unit 8 Integration methods
(Here the variable t has been used in the definite integral instead of the
usual variable x, to avoid confusion with the input variable x of the
function A.)
Note that if a and b are any two numbers in the domain of f , then the
signed area between the graph of f and the x-axis, from x = a to x = b, is
given by
A(b) − A(a).
In other words,
! b
f (x) dx = A(b) − A(a).
a
This fact is illustrated in Figure 18, in a case where s < a < b. The third
property of definite integrals in the box on page 120 tells you that the fact
holds even when s, a and b aren’t in this order.
/b
Figure 18 An illustration of the fact that a
f (x) dx = A(b) − A(a)
Now let’s consider how the value of a signed-area-so-far function changes
as the value of the input variable x changes. Let’s start by looking at what
happens when the function f is a constant function.
128
2 The fundamental theorem of calculus
You can see that the signed area A(x) changes at the rate of 3 square units
for every unit by which x changes. For example,
if x increases by 1, then A(x) increases by 3
if x increases by 0.5, then A(x) increases by 0.5 × 3 = 1.5
if x increases by −1, then A(x) increases by −1 × 3 = −3.
(Remember that a negative increase is a decrease.)
In other words, the rate of change of A(x) with respect to x is 3.
129
Unit 8 Integration methods
More generally, you can see that for any constant function f (x) = k, and
any signed-area-so-far function A for f , the value of A(x) changes at the
rate of k square units for every unit by which x changes. In other words,
the rate of change of A(x) is k.
There are two more examples of this in Figure 21. In Figure 21(a), the
function is f (x) = 1.5, so the signed area A(x) changes at the rate of
1.5 square units for each unit by which x changes. In Figure 21(b), the
function is f (x) = −2, so the signed area A(x) changes at the rate of
−2 square units for each unit by which x changes.
130
2 The fundamental theorem of calculus
Figure 23(b), when x is close to 7 the value of A(x) changes at the rate of
about 4 square units for every unit by which x increases.
Theorem
Suppose that f is a continuous function, and s is any number in its
domain. Let A be the signed-area-so-far function for f with starting
point s. Then A is an antiderivative of f .
It’s this fact that leads to the fundamental theorem of calculus. Consider
any continuous function f . Let s be any number in its domain, and let A
be the signed-area-so-far function for f with starting point s.
131
Unit 8 Integration methods
Suppose that a and b are any numbers in the domain of f . Then, as you
/b
saw earlier in this subsection, the value of a f (x) dx, the signed area
between the graph of f and the x-axis from x = a to x = b, is given by
! b
f (x) dx = A(b) − A(a). (3)
a
132
2 The fundamental theorem of calculus
133
Unit 8 Integration methods
differ by a constant and hence they all lead to the same final answer. For
example, using the antiderivative F (x) = sin x + 7 gives
! π/2 . .π# #
cos x dx = sin + 7 − (sin 0 + 7) = 1 + 7 − 0 − 7 = 1.
0 2
It’s usually best to use the simplest antiderivative in calculations of this
kind.
You’ll see another example of the fundamental theorem of calculus used to
find a signed area shortly, and you’ll then have the opportunity to find
some signed areas in this way yourself. First, however, it’s helpful for you
to learn a type of notation that’s convenient in calculations of this kind.
For any function F , the expression
F (b) − F (a)
can be denoted by
) 'b
F (x) a .
For example,
) 'π/2 .π#
sin x 0 = sin − sin 0 = 1 − 0 = 1.
2
You can practise evaluating expressions of this type in the next activity.
Evaluate the following expressions. In part (c), give your answer to three
significant figures.
) '5 ) '2π ) '1
(a) 12 x2 3 (b) cos x 0 (c) ex −1
134
2 The fundamental theorem of calculus
Solution
Use the fundamental theorem of calculus. By the general formula
for the indefinite integral of a power function, an antiderivative of x3
is 14 x4 .
! 2
) '2
x3 dx = 41 x4 1
1
You’ve now seen that, when you want to find a signed area, it’s much
quicker and simpler to use the fundamental theorem of calculus than to
use the method involving subintervals that you saw in Subsection 1.1.
Another advantage of using the fundamental theorem of calculus is that it
gives you an exact answer, rather than an approximate one. However,
remember that you can use the fundamental theorem of calculus only when
you can find a formula for the antiderivative that you need. Sometimes it’s
difficult or impossible to find such a formula.
In the next activity, you can try evaluating some definite integrals by using
the fundamental theorem of calculus. Use the square bracket notation, as
in Example 4, since you need to get used to this notation. You can find the
antiderivatives that you need by using the table of standard indefinite
integrals given near the end of Unit 7. This table is also included in the
Handbook.
135
Unit 8 Integration methods
Evaluate the following definite integrals (which are shown in Figure 27).
Give exact answers, and (except in parts (a) and (d), where the answer is a
simple fraction or an integer) also give answers to three significant figures.
! 1 ! 2 ! 4
1
(a) x2 dx (b) et dt (c) du
−1 0 1 u
! π/4 ! 1
2 1
(d) sec θ dθ (e) 2
du
−π/4 −1 1 + u
136
2 The fundamental theorem of calculus
There are other acceptable ways to leave the final answer in Example 5.
For example, you could write it as −2(1 + 2 ln 2).
137
Unit 8 Integration methods
Evaluate the following definite integrals (which are shown in Figure 29).
Give exact answers.
! 4 ! 0 ! 4
√ r+5
(a) 3 x dx (b) x(1 + x) dx (c) dr
1 −1 2 r
Constant multiple rule and sum rule for the square bracket
notation
) 'b ) 'b
kF (x) a = k F (x) a , where k is a constant
) 'b ) 'b ) 'b
F (x) + G(x) a = F (x) a + G(x) a
For example, the constant multiple rule for the square bracket notation
tells you that
) '2 ) '2
5 sin x 1 = 5 sin x 1
and the sum rule for the square bracket notation tells you that
) '2 ) '2 ) '2
sin x + cos x 1 = sin x 1 + cos x 1 .
To see why these rules hold, you just need to use the definition of the
square bracket notation. For example,
) '2
5 sin x 1 = 5 sin 2 − 5 sin 1
= 5(sin 2 − sin 1)
) '2
= 5 sin x 1 .
138
2 The fundamental theorem of calculus
Similarly,
) '2
sin x + cos x 1 = (sin 2 + cos 2) − (sin 1 + cos 1)
= (sin 2 − sin 1) + (cos 2 − cos 1)
) '2 ) '2
= sin x 1 + cos x 1 .
As always when a constant multiple rule and a sum rule hold, the sum rule
for the square bracket notation also holds if you replace the plus sign on
each side with a minus sign. To prove this, you combine the sum rule with
the case k = −1 of the constant multiple rule. You saw this done for the
constant multiple and sum rules for derivatives in Unit 6, Subsection 2.3.
The next example illustrates how you can use the constant multiple rule
and the sum rule for the square bracket notation when you’re evaluating a
definite integral.
Example 6 Using the constant multiple rule and sum rule for the
square bracket notation
Evaluate the definite integral
! 2
(1 + x3 ) dx.
1
(It is shown in Figure 30.)
Solution
Use the fundamental theorem of calculus.
! 2
) '2
(1 + x3 ) dx = x + 14 x4 1
1
= (2 − 1) + 14 (24 − 14 )
1
=1+ 4 × 15
19
= 4 .
You can practise using the constant multiple rule and sum rule for the
square bracket notation in the next activity.
139
Unit 8 Integration methods
Activity 11 Using the constant multiple rule and sum rule for the
square bracket notation
These rules follow from the fundamental theorem of calculus and the
constant multiple rule and sum rule for antiderivatives, which you met
in Unit 7. To see this for the constant multiple rule, suppose that f is a
continuous function whose domain contains the numbers a and b, and
that k is a constant. Let F be an antiderivative of f . Then, by the constant
multiple rule for antiderivatives, kF (x) is an antiderivative of kf (x).
140
2 The fundamental theorem of calculus
Figure 33 The area in (c) is the sum of the areas in (a) and (b)
The next example illustrates how you can use the constant multiple rule
and the sum rule for definite integrals when you’re evaluating a definite
integral.
141
Unit 8 Integration methods
Example 7 Using the constant multiple rule and sum rule for
definite integrals
Evaluate the definite integral
! π
(x + 2 sin x) dx.
0
(It is shown in Figure 34.)
Solution
Use the fundamental theorem of calculus. Use the constant
multiple rule and sum rule, for definite integrals and for the square
bracket notation, as convenient.
! π ! π ! π
(x + 2 sin x) dx = x dx + 2 sin x dx
0 0 0
! π ! π
= x dx + 2 sin x dx
Figure 34 The definite 2 0 1π 0
) 'π
integral in Example 7 = 21 x2 + 2 − cos x 0
0
1
) 2 'π ) 'π
= 2 x 0 − 2 cos x 0
= 21 (π2 − 02 ) − 2(cos π − cos 0)
= 21 π2 − 2(−1 − 1)
= 21 π2 + 4.
You can practise using the constant multiple rule and sum rule for definite
integrals in the next activity.
Activity 12 Using the constant multiple rule and sum rule for
definite integrals
Evaluate the following definite integrals. (They are shown in Figure 35.)
Give exact answers.
! √7 ! π/4
1 3
(a) √ 2 x dx (b) (sin x + cos x) dx
3 −π/4
142
2 The fundamental theorem of calculus
The curve of the roof of a building is given by the graph of the function
1 2
f (x) = 9 − 36 x ,
where x and f (x) are measured in metres. Use the ideas that you’ve met
in this section to find the area between this graph and the x-axis from
x = −18 to x = 18, as shown below. Hence state the area of the
cross-section of the roof.
144
2 The fundamental theorem of calculus
145
Unit 8 Integration methods
146
2 The fundamental theorem of calculus
In each of parts (a) to (c), use the given areas to find the amount by
which the displacement of the object changes from the first time to
the second time.
(a) From t = 0 to t = 4. (b) From t = 4 to t = 10.
(c) From t = 0 to t = 10.
Solution
(a) The change in displacement from t = 0 to t = 4 is 9 m.
(b) The change in displacement from t = 4 to t = 10 is −7 m.
(c) The change in displacement from t = 0 to t = 10 is
9 m + (−7) m = 2 m.
Notice that the velocity of the object in Example 8 changes from positive
to negative at time t = 4. That is, at this time the object changes from
moving in the positive direction to moving in the negative direction, as
illustrated in Figure 36. The negative displacement that occurs after this
time cancels out most of the positive displacement that occurred initially.
Since the object was displaced by 9 m in the positive direction, and then
by 7 m in the negative direction, its final displacement from its starting
position is 2 m, as found in Example 8(c). The object travels a total
distance of 9 m + 7 m = 16 m during the 10 seconds of its motion.
147
Unit 8 Integration methods
(a) In each of parts (i) to (iv), find the amount by which the displacement
of the object changes from the first time to the second time.
(i) From t = 0 to t = 1. (ii) From t = 3 to t = 4.
(iii) From t = 0 to t = 3. (iv) From t = 3 to t = 6.
(b) What is the displacement of the object from its starting position at
each of the following times?
(i) t = 2 (ii) t = 6
(c) What is the total distance travelled by the object during the 6 seconds
of its motion?
If you want to solve a problem that involves finding a change in the value
of an antiderivative, and you can find a formula for an antiderivative, then
it’s usually convenient to use the notation for a definite integral, and the
square bracket notation, in your working. Here’s Example 22 from
Subsection 5.3 of Unit 7 again, but with the working carried out using this
notation.
148
2 The fundamental theorem of calculus
In the next activity, you’re asked to repeat Activity 36 from Subsection 5.3
of Unit 7, but using the notation for a definite integral.
Suppose that an object moves along a straight line, and its velocity v
(in m s−1 ) at time t (in seconds) is given by v = 3 − 23 t (0 ≤ t ≤ 6)
(as shown in Figure 38).
(a) Write down a definite integral that gives the change in the
displacement of the object from time t = 0 to time t = 1. Evaluate it,
and hence state how far the object travels in that time.
(b) Repeat part (a), but from time t = 4 to time t = 5.
Try using the notation for a definite integral to solve the problem in the
next activity.
149
Unit 8 Integration methods
The driver of a car travelling at 28 m s−1 along a straight road applies the
brakes. It takes four seconds for the car to stop, and its decreasing
velocity v (in m s−1 ) during that time is given by the equation
v = 28 − 7t,
where t is the time in seconds since the brakes were applied (as shown in
Figure 39). How far does the car travel during those four seconds?
In this subsection you’ve seen examples of the following two useful facts.
For the velocity-time graph of any object, and for any two points in
Figure 39 The graph of time t = a and t = b in the time period covered by the graph, the
v = 28 − 7t (0 ≤ t ≤ 4) following hold.
• The change in the displacement of the object from t = a to t = b
is the total signed area between the graph and the time axis from
t = a to t = b.
• The total distance travelled by the object from t = a to t = b is
the total area between the graph and the time axis from t = a to
t = b (this applies when a ≤ b).
Suppose that a car begins to move along a straight road, and its velocity v
(in m s−1 ) during the first five seconds of its journey is given by the
equation
v = 3t,
where t is the time in seconds since it began moving, as shown in
Figure 40. How far does the car travel from the end of the fourth second to
the end of the fifth second?
Figure 40 The graph of
v = 3t (0 ≤ t ≤ 5)
150
2 The fundamental theorem of calculus
and similarly
!
sin x dx = − cos x + c.
(You saw in Unit 7 that the indefinite integral of sin x is − cos x + c.)
Note that this notation is used only for indefinite integrals – it’s never used
for antiderivatives. So it’s important to remember the following fact.
/
If an equation has the notation . . . dx on one side only, then there
must be an arbitrary constant on the other side.
151
Unit 8 Integration methods
The notation for indefinite integrals introduced above, and the similar
notation for definite integrals, are together known as integral notation.
We refer to either an indefinite integral or a definite integral as an
integral. In the indefinite integral
!
f (x) dx,
(b) Manipulate the integrand to get it into a form that allows you
to use the constant multiple rule and sum rule for antiderivatives,
together with standard indefinite integrals.
! ! , -
x−5 5
dx = 1− dx
x x
! , -
1
= 1−5× dx
x
= x − 5 ln |x| + c
152
2 The fundamental theorem of calculus
You can practise using integral notation for indefinite integrals in the
following activity. Don’t forget to add the ‘ + c’ in each part.
As with the other constant multiple rules and sum rules, you can choose
whether or not to use these rules in your working, as seems convenient.
The next example repeats Example 10(b), but uses the constant multiple
rule and sum rule for indefinite integrals in the forms stated above.
153
Unit 8 Integration methods
Example 11 Using the constant multiple rule and sum rule for
indefinite integrals
Find the indefinite integral
!
x−5
dx.
x
Solution
Manipulate the integrand to get it into a form that allows you to
use the constant multiple rule and sum rule for indefinite integrals,
together with standard indefinite integrals.
! ! , -
x−5 5
dx = 1− dx
x x
! !
1
= 1 dx − 5 dx
x
= x − 5 ln |x| + c
Here’s something to notice about Example 11. You might have thought
that each of the two indefinite integrals in the second line of the working
should have been replaced by an expression containing an arbitrary
constant. That would lead to a solution something like this:
! ! , -
x−5 5
dx = 1− dx
x x
! !
1
= 1 dx − 5 dx
x
= (x + c) − 5(ln |x| + d)
= x − 5 ln |x| + c − 5d,
where c and d are arbitrary constants.
However, since the arbitrary constants c and d can take any value, the
expression c − 5d can also take any value, so we may as well replace it by a
single letter that can take any value. It’s convenient to use the usual
letter, c. In general, if an expression consists of a sum of constant multiples
of indefinite integrals, then it needs only a single arbitrary constant.
You can practise using the constant multiple rule and sum rule for
indefinite integrals in the next activity.
154
3 Integration by substitution
Activity 20 Using the constant multiple rule and sum rule for
indefinite integrals
You’ve now reached the end of the material in this module that tells you
what integration is all about. In the rest of this unit you’ll concentrate on
actually finding indefinite and definite integrals.
Before you go on to that, you might like to watch the 25-minute video
The birth of calculus, which is available on the module website. It tells you
more about the history of the development of calculus.
3 Integration by substitution
To allow you to fully exploit all the ideas about integration that you’ve
met, it’s important that you’re able to find formulas for the indefinite
integrals of as wide a range of functions as possible. In the rest of this
unit, you’ll learn to integrate a much wider variety of functions than you
learned to integrate in Unit 7. In the final subsection of the unit, you’ll
learn how to use the module computer algebra system to find indefinite
and definite integrals.
Unfortunately, as mentioned in Unit 7, it’s generally more tricky to
integrate functions than it is to differentiate them. That’s because there
are no rules for antiderivatives that are similar to the product, quotient
and chain rules for derivatives, so you can’t usually integrate functions in
the systematic way that you can differentiate them.
However, there are still many useful techniques that you can use to
integrate functions. Each of these techniques applies to functions with
particular characteristics. You’ll meet some of these techniques in the rest
of this unit. You’ll learn how to recognise functions to which each
technique can be applied, and how to use the techniques.
The technique that you’ll learn in this section is integration by substitution.
This technique might seem quite complicated when you first meet it, but
you should find it more straightforward once you’ve had some practice
with it. If you find it difficult, then make sure that you watch the tutorial
clips, and try some extra practice. Remember that there are more practice
exercises in the practice quiz and exercise booklet for this unit.
155
Unit 8 Integration methods
if you hadn’t first seen the integrand obtained as an ‘output’ of the chain
rule.
The first step is to recognise that the integrand has the form
f (something) × the derivative of the something,
where f is a function that you can integrate. You can see that it does,
since it is
cos(something) × the derivative of the something,
where the ‘something’ is x2 .
Then, since the ‘something’ is x2 , you put u = x2 . This gives du/dx = 2x.
Hence, in the integral you can replace cos(x2 ) by cos u, and 2x by du/dx,
to give
!
du
cos(u) dx.
dx
156
3 Integration by substitution
Now here’s the crucial step that you need to apply at this stage. Any
integral of the form
!
du
f (u) dx
dx
is equal to the simpler integral
!
f (u) du.
This is the step that uses the chain rule, and you’ll see an explanation of
why it’s correct later in this subsection. Notice that it’s easy to remember,
because it looks like we’ve simply cancelled a ‘dx’ in a denominator with a
‘dx’ in a numerator. (Of course that’s not what we’ve done, since du/dx
isn’t a fraction.)
So the integral that we’re trying to find here can be expressed in terms
of u as the simpler integral
!
cos u du,
as expected.
Here’s a summary of the method used above, which is the method known
as integration by substitution.
Integration by substitution
1. Recognise that the integrand is of the form
f (something) × the derivative of the something,
where f is a function that you can integrate.
2. Set the something equal to u, and find du/dx.
3. Hence write the integral in the form
!
f (u) du,
! !
du
by using the fact that f (u) dx = f (u) du.
dx
4. Do the integration.
5. Substitute back for u in terms of x.
157
Unit 8 Integration methods
Remember that we put u = x2 , which gave du/dx = 2x, and this allowed
us to write the integral in terms of u as
!
cos u du.
It’s convenient to think of this new form of the integral as being obtained
from the original form by making two replacements, as shown below:
!
& +
cos(x2 ) (2x) dx .
* %$ 3 * %$ 3
cos u du
It’s straightforward to work out that you can replace cos(x2 ) by cos u, as
that just comes from the equation u = x2 .
A helpful way to work out that you can replace (2x) dx by du is to imagine
‘cross-multiplying’ in the equation
du
= 2x,
dx
to obtain
du = (2x) dx.
This isn’t a real equation, of course, since du and dx don’t have
independent meanings outside the notation for a derivative or integral.
But it tells you immediately that you can replace (2x) dx by du. (In fact,
it’s possible to attach a meaning to du and dx, and hence to the equation
above, but this is outside the scope of this module.)
Here’s another example of integration by substitution, which illustrates all
the ideas above.
158
3 Integration by substitution
Solution
Since the derivative of tan x is sec2 x, the integrand is of the form
esomething × the derivative of the something.
So set the something equal to u.
du
Let u = tan x; then = sec2 x.
dx
Imagine ‘cross-multiplying’ in the second equation to obtain
du = (sec2 x) dx.
So
! !
tan x 2
e sec x dx = eu du
Do the integration.
= eu + c
Substitute back for u in terms of x.
= etan x + c.
Here are some examples for you to try. Make sure that you have your
Handbook to hand, so you can consult the tables of standard derivatives
and standard integrals.
In the examples that you’ve seen so far, it’s been fairly straightforward to
decide what the ‘something’ should be, so that the integrand is of the form
f (something) × the derivative of the something.
However, sometimes you need to think about this more carefully, as
illustrated in the next example.
159
Unit 8 Integration methods
In the particular case here you can remove the modulus signs,
since 2 + sin x is always positive.
= ln(2 + sin x) + c.
In each part of the next activity it’s useful to take the ‘something’ to be an
expression that includes a constant term, in a similar way to Example 13.
160
3 Integration by substitution
Solution
The derivative of ex is ex , so the integrand is in the required form,
except that the two factors cos(ex ) and ex are in the ‘wrong’ order. If
you swap them, then the integrand will be exactly in the required
form.
du
Let u = ex ; then = ex .
dx
Imagine ‘cross-multiplying’ to obtain du = ex dx. Then start by
swapping the two factors in the integrand.
So
! !
x x
e cos(e ) dx = (cos(ex )) ex dx
!
= cos u du
= sin(u) + c
= sin(ex ) + c.
162
3 Integration by substitution
Solution
The derivative of 1 + x3 is 3x2 , which is ‘nearly’ x2 (it just has the
wrong coefficient). So if you swap the order of the two factors x2 and
(1 + x3 )8 , then the integrand will be nearly in the required form.
du
Let u = 1 + x3 ; then = 3x2 .
dx
Imagine ‘cross-multiplying’ to obtain du = 3x2 dx. Then start by
swapping the two factors in the integrand.
So
! !
& +8 & +8
x 1 + x3 dx =
2
1 + x3 x2 dx
1 1 9
= 3 × 9u + c
1 9
= 27 u + c
1 3 9
= 27 (1 + x ) + c.
& +
= 13 91 u9 + c
1 9 1
= 27 u + 3 c
1 3 9
= 27 (1 + x ) + 31 c.
163
Unit 8 Integration methods
However, since the arbitrary constant c can take any value, the expression
1
3 c can also take any value, so we may as well replace it by a single letter
that can take any value. It’s convenient to use the usual letter, c.
This sort of thing occurs frequently when you integrate. You never need to
multiply or divide a term that is an arbitrary constant by a number – you
just replace it by another arbitrary constant, and you usually use the
standard letter, c.
Another point to remember from the working in Example 15 is the
strategy of multiplying by a constant inside an integral, and dividing by
the same constant outside to compensate. It’s often helpful to do this
when you use integration by substitution. It’s okay to do it because the
constant multiple rule for indefinite integrals tells you that for any
function f that has an antiderivative, and any non-zero constant k,
! ! !
1 1
f (x) dx = kf (x) dx = kf (x) dx.
k k
Remember that the constant k can be either positive or negative. In
particular, it can be −1.
Remember too that k must be a constant. You can’t do the same thing
with an expression in x.
Here are some examples for you to try.
The next example illustrates another way in which you can sometimes
rearrange an integrand to get it into the right form for integration by
substitution.
164
3 Integration by substitution
Solution
The derivative of the denominator 1 − x2 is −2x, which is ‘nearly’
the numerator x (it just has the wrong coefficient). So if you split the
numerator from the rest of the fraction, then the integrand will be
nearly in the required form.
du
Let u = 1 − x2 ; then = −2x.
dx
Imagine ‘cross-multiplying’ to obtain du = −2x dx. Then start by
splitting the numerator from the rest of the fraction in the
integrand.
So
! ! , -
x 1
dx = x dx
1 − x2 1 − x2
The only problem is that in place of x you need −2x. To achieve
this, multiply by −2 inside the integral, and divide by −2 outside to
compensate.
! , -
1 1
= −2 (−2x) dx
1 − x2
Now do the substitution.
!
1 1
= −2 du
u
= − 21 ln |u| + c
= − 12 ln |1 − x2 | + c.
165
Unit 8 Integration methods
Here are three more integrals for you to find by using substitution.
They’re a mixture of the types in Activities 24 and 25.
By writing
sin x
tan x = ,
cos x
and using integration by substitution, find
!
tan x dx.
Remember that when you’ve found an indefinite integral, you can always
check your answer by differentiating it. This can be particularly helpful
when you found the integral by using integration by substitution.
166
3 Integration by substitution
Solution
The derivative of 5x − 2 is 5, so the integral will be in the required
form if you rearrange it to obtain 5 inside.
du
Let u = 5x − 2; then = 5. So
dx
! !
sin(5x − 2) dx = 15 sin(5x − 2) × 5 dx
!
= 15 sin(5x − 2) × 5 dx
!
1
= 5 sin u du
= − 15 cos u + c
= − 15 cos(5x − 2) + c.
167
Unit 8 Integration methods
The next example uses exactly the same ideas as Example 17 and
Activity 28, but is a little more complicated.
168
3 Integration by substitution
The answers to Activities 28 and 29 all follow the same general pattern, as
follows. Suppose that you have an integral of the form
!
f (ax + b) dx,
It’s worth remembering this fact, and using it to deal with integrals where
the integrand is a simple function of a linear expression, rather than using
integration by substitution every time. For example, the fact above tells
you immediately that
!
cos(3x − 2) dx = 31 sin(3x − 2) + c.
169
Unit 8 Integration methods
The following special case of the fact in the box above is useful especially
often.
For example,
!
e−2x dx = − 12 e−2x + c.
Try using the facts in the boxes above in the following activity.
The next example is of a type that you met in Unit 7, but it involves a
function that you’ve learned how to integrate in this section.
170
3 Integration by substitution
171
Unit 8 Integration methods
172
3 Integration by substitution
= esin(3π/2) − esin(π/2)
= e−1 − e1
1
= −e
e
= −2.35 (to 3 s.f.).
173
Unit 8 Integration methods
174
3 Integration by substitution
Solution
If the integral were just
!
(3x + 1)9 dx,
175
Unit 8 Integration methods
4 Integration by parts
In this section you’ll meet another integration technique, integration by
parts. Like integration by substitution, this technique is useful for
integrating functions that have certain characteristics. It’s based on the
product rule for differentiation.
176
4 Integration by parts
If you now integrate both sides of this equation with respect to x, then you
obtain
! !
f (x)g(x) dx = f (x)G(x) − f ! (x)G(x) dx.
Here G is an antiderivative of g.
177
Unit 8 Integration methods
Solution
The integrand is a product of two expressions, x and sin x. You
can integrate the second expression, sin x, and differentiating the first
expression, x, makes it simpler. So try integration by parts.
Let f (x) = x and g(x) = sin x.
Then f ! (x) = 1, and an antiderivative of g(x) is G(x) = − cos x.
So the integration by parts formula gives
! !
x sin x dx = f (x)g(x) dx
!
= f (x)G(x) − f ! (x)G(x) dx
!
= x × (− cos x) − 1 × (− cos x) dx
!
= −x cos x + cos x dx
= −x cos x + sin x + c.
You can see that in Example 23 the use of the integration by parts formula
changed the problem of integrating x sin x into the problem of integrating
cos x, which is easier. Here are two similar examples for you to try.
178
4 Integration by parts
by parts in fairly simple cases, like the ones that you’ve seen so far, which
is usually all that you need to do.
Solution
The integrand is a product of two expressions, x and sin x. You
can integrate the second expression, sin x, and differentiating the first
expression, x, makes it simpler. So try integration by parts.
Recite the informal version of the formula in your head. As you think
‘first’, write down the first expression, then as you think
‘antiderivative of second’, write down an antiderivative of the second
expression, and so on.
Integrating by parts gives
! !
x sin x dx = x × (− cos x) − 1 × (− cos x) dx
!
= −x cos x + cos x dx
= −x cos x + sin x + c.
You can practise using the informal version of the integration by parts
formula in the next activity. Alternatively, you might prefer to continue
using the formal version for a little longer, and move to the informal
version when you feel more confident with it. You can use the exercises on
integration by parts in the practice quiz and exercise booklet for this unit
for further practice.
179
Unit 8 Integration methods
Solution
The integral is of the form described above, so try integration by
parts.
180
4 Integration by parts
/
The integral in this expression, xe4x dx, looks easier to find than
the original integral, but it’s still not straightforward to find.
However, it is itself of the form described above, so try integration by
parts again.
, ! -
= 14 x2 e4x − 12 x × 14 e4x − 1 × 14 e4x dx
, ! -
1 2 4x 1 1 4x 1 4x
= 4 x e − 2 4 xe − 4 e dx
Multiply out the brackets. Take care with the minus sign in front
of the brackets – remember that it turns the minus sign inside the
brackets into a plus sign.
!
= 14 x2 e4x − 18 xe4x + 18 e4x dx
Now find the integral in this expression, and simplify the resulting
expression.
= 14 x2 e4x − 18 xe4x + 1
8 × 41 e4x + c
= 14 x2 e4x − 18 xe4x + 1 4x
32 e +c
1 4x 2
= 32 e (8x − 4x + 1) + c.
The method introduced above for dealing with integrands of the form
xn g(x) works just as well if the expression xn is replaced by any
polynomial
/ expression in x of degree n. For example, you can find the
integral (3x2 − 1)e4x dx by integrating by parts twice, and you can find
the integral in the next activity by integrating by parts once.
181
Unit 8 Integration methods
Solution
The integral is of the form discussed above, so swap the
expressions x2 and ln x, then integrate by parts.
! !
x ln x dx = (ln x) x2 dx
2
!
1 3 1 1 3
= (ln x) × 3 x − × x dx
x 3
!
= 31 x3 ln x − 31 x2 dx
& +
= 13 x3 ln x − 31 13 x3 + c
= 31 x3 ln x − 19 x3 + c
= 91 x3 (3 ln x − 1) + c
182
4 Integration by parts
You can use the ideas that you used in Activity 38 to deal with other,
similar integrals that involve the natural logarithm function. Try the
following activity.
183
Unit 8 Integration methods
Solution
The integral is of the form discussed above, so use integration by
parts twice. Start by swapping the expressions ex and cos(2x), as this
makes the working slightly easier (it avoids introducing fractions in
the early stages).
We use integration by parts twice.
! !
e cos(2x) dx = cos(2x) ex dx.
x
Integrate by parts.
! !
cos(2x) e dx = cos(2x)e − (−2 sin(2x))ex dx
x x
!
= cos(2x)ex + 2 sin(2x)ex dx
The final term here contains the original integral. Move this term
to the other side of the equation. When you do this, the right-hand
side no longer contains an indefinite integral, so you have to add an
arbitrary constant to that side.
So
! !
x
cos(2x)e dx + 4 cos(2x)ex dx = cos(2x)ex + 2 sin(2x)ex + c;
that is,
!
5 cos(2x)ex dx = cos(2x)ex + 2 sin(2x)ex + c.
184
4 Integration by parts
This gives
!
ex cos(2x) dx = 51 ex cos(2x) + 25 ex sin(2x) + c
= 15 ex (cos(2x) + 2 sin(2x)) + c.
!
= 7 xe − 7 e7x dx
1 7x 1
= 17 xe7x − 1
7 × 17 e7x + c
1 7x
= 49 e (7x − 1) + c.
185
Unit 8 Integration methods
1 7
= 49 (6e − (−1))
1 7
= 49 (6e + 1).
186
4 Integration by parts
! 1 2 11 ! 1
xe7x dx = x × 17 e7x − 1 × 71 e7x dx
0 0 0
! 1
) '1
= 71 xe7x 0 − 17 e7x dx
0
Use the integration by parts formula for definite integrals (the method of
Example 29) to evaluate the definite integral
! 1
xe3x dx.
0
Give your answer to four decimal places.
In the next two activities you’re asked to use integration by parts to find
areas.
187
Unit 8 Integration methods
188
5 More integration
5 More integration
You’ve now met several integration techniques, and practised using them
on a variety of integrals.
In this section, you’ll start by meeting one further approach, which can be
helpful for integrals that contain trigonometric expressions. Then you’ll
have an opportunity to practise identifying which of the integration
techniques that you’ve met are appropriate for which integrals, for a
variety of different types of integrals.
Finally, you’ll learn how to use a computer for integration.
189
Unit 8 Integration methods
Solution
By the half-angle identity for sine,
! !
sin2 x dx = 1
2 (1 − cos(2x)) dx
!
1
= 2 (1 − cos(2x)) dx
= 12 (x − 1
2 sin(2x)) + c
1 1
= 2 x − 4 sin(2x) + c
1
= 4 (2x − sin(2x)) + c.
In the next activity you’re asked to use two of the trigonometric identities
stated near the beginning of this subsection to integrate another two
expressions that contain trigonometric functions.
(b) Use
! the double-angle identity in the box above to find the integral
sin x cos x dx.
190
5 More integration
In fact there are even two different ways to find the integral in
Activity 44(b), namely
!
sin x cos x dx,
and make the substitution u = cos x. The working for each of these
methods is given in the solution to Activity 45.
So you’ve now seen three different ways to integrate the expression
sin x cos x: one way by using a trigonometric identity, and two ways by
using substitution. These three different ways lead to three answers that
look different, as you can see in the solutions to Activities 44(b) and 45.
This illustrates that there can be more than one way to integrate an
expression, and different ways of integrating it can lead to answers that
look different, though of course these answers will be equivalent.
If an answer that you obtain for an integral is different from the answer
given in a book or by a computer, but you think that you haven’t made a
mistake, then you may be able to check that the two answers are equivalent
by using algebraic manipulation. You may need to use trigonometric
identities – there are examples of this in the solution to Activity 45.
A rougher, but sometimes quicker, check is to plot the graphs of the two
different answers on a computer (taking the arbitrary constants to be zero,
for example), and check that they seem to be vertical translations of each
other. This shows that the two different answers seem to be equivalent
once the arbitrary constants are taken into account. Again, there’s an
example of this in the solution to Activity 45.
In this subsection you’ve seen just a few examples of how you can use
trigonometric identities and integration by substitution to help you
integrate expressions that contain trigonometric functions. There are many
more possibilities, some of which are covered in the module Essential
mathematics 2 (MST125).
191
Unit 8 Integration methods
192
5 More integration
For each integral below, suggest a method of integration to use. Where you
suggest substitution, also suggest a suitable equation for u in terms of x.
You are not asked to actually find the integrals, but solutions are included
so you can do so if you wish.
! ! !
(a) x3 cos(9x4 ) dx (b) x cos(5x) dx (c) (x + cos(5x)) dx
! ! !
1 x2
(d) x(x2 + 3) dx (e) dx (f) dx
1 + 3x2 (7 − x3 )7
! ! !
−x/3
&1 + &1 +
(g) xe dx (h) sin 2 x cos 2 x dx (i) (x − 1)4 dx
! ! !
x x 2
(j) e (1 − 2e ) dx (k) x(1 + sin(x )) dx (l) x(3x2 − 2)8 dx
You’ll learn further useful strategies for integration in the module Essential
mathematics 2 (MST125), if it’s in your study programme. In particular,
you’ll learn more about integration by substitution and integration by
using trigonometric identities, and you’ll study partial fractions, which are
useful when you want to integrate rational functions.
193
Unit 8 Integration methods
You can use the skills that you learned in Activity 47 in the next activity.
The next two activities are the final ones of the unit. In them, you’re
asked to use your knowledge of integration to solve practical problems.
The definite integrals involved in the problems can’t easily be evaluated by
using the fundamental
√ theorem of calculus, as it’s difficult to find
3 2
antiderivatives of x + 1 and sin(t /150). However, you can use a
computer to find approximate values.
194
5 More integration
Consider the plastic edging shown below. The shape of its cross-section is
given by the equation
"
y = 13 x3 + 1 (−1 ≤ x ≤ 1.5),
where x and y are measured in centimetres.
Calculate the volume of plastic that is required to make one metre of the
edging. Give your answer in cubic centimetres to three significant figures.
195
Unit 8 Integration methods
You’ve now come to the end of the introduction to basic calculus in this
module. You can use the skills that you’ve acquired to solve many different
types of problems in mathematics, engineering and science, for example.
These skills also equip you to study many further topics in these areas.
You’ll make a start on that in Unit 11, in which you’ll use calculus in your
study of Taylor polynomials. If your study programme includes further
mathematics modules, then in some of these you may study more
advanced calculus, with even more powerful applications.
Learning outcomes
After studying this unit, you should be able to:
• understand what is meant by a definite integral
• understand how to use subintervals to find an approximate value for a
definite integral
• use integral notation
• understand the fundamental theorem of calculus
• evaluate some definite integrals by using the fundamental theorem of
calculus
• integrate by substitution
• integrate by parts
• appreciate how to use trigonometric identities and integration by
substitution to help you integrate some trigonometric expressions
• identify a suitable method of integration for some types of integrals
• use a computer for integration.
196
Solutions to activities
Solutions to activities
Solution to Activity 1 So the heights of the rectangles are:
If we divide the interval [−18, 18] into six f (−3) = 3 − (−3)2 = −6
subintervals of equal width, then each subinterval f (−2) = 3 − (−2)2 = −1
has width 36/6 = 6.
f (−1) = 3 − (−1)2 = 2
The left endpoints of the six subintervals are
f (0) = 3 − 02 = 3
− 18, −18 + 6, −18 + 2 × 6,
f (1) = 3 − 12 = 2
− 18 + 3 × 6, −18 + 4 × 6, −18 + 5 × 6
f (2) = 3 − 22 = −1.
that is
So we obtain the following approximate value for
−18, −12, −6, 0, 6, 12. the signed area:
So the heights of the rectangles are: (−6 × 1) + (−1 × 1) + (2 × 1)
1
f (−18) = 9 − 36 × (−18)2 = 0 + (3 × 1) + (2 × 1) + (−1 × 1) = −1.
1 2
f (−12) = 9 − 36 × (−12) =5
Solution to Activity 4
f (−6) = 9 − 36 1
× (−6)2 = 8
The exact value of the signed area between the
1
f (0) = 9 − 36 × 02 = 9 graph of the function f (x) = 3 − x2 and the x-axis
1
f (6) = 9 − 36 × 62 = 8 from x = −3 to x = 3 is 0.
f (12) = 9 − 361
× 122 = 5.
Solution to Activity 5
So we obtain the following approximate value for
the area: (a) The signed area from x = 2 to x = 6 is 4.0.
(0 × 6) + (5 × 6) + (8 × 6) (b) The signed area from x = 2 to x = 6 is 4.0,
+ (9 × 6) + (8 × 6) + (5 × 6) = 210. so the signed area from x = 6 to x = 2 is −4.0.
(c) The signed area from x = 2 to x = 10 is
Solution to Activity 2 4.0 − 5.4 = −1.4.
(a) The signed area from x = 1 to x = 5 is −6.3. (d) From part (c), the signed area from x = 2 to
(b) The signed area from x = 5 to x = 7 is 1.3. x = 10 is −1.4, so the signed area from x = 10
to x = 2 is 1.4.
(c) The signed area from x = 1 to x = 7 is
−6.3 + 1.3 = −5. (e) The signed area from x = 8 to x = 8 is 0.
(d) The signed area from x = 1 to x = 9 is (f) The signed area from x = −3 to x = 2 is
−6.3 + 1.3 − 5.9 = −10.9. 4.1 − 2.4 = 1.7, so the signed area from x = 2 to
x = −3 is −1.7.
Solution to Activity 3 (g) The signed area from x = −1 to x = 10 is
If we divide the interval [−3, 3] into six subintervals −2.4 + 4.0 − 5.4 = −3.8, so the signed area
of equal width, then each subinterval has width from x = 10 to x = −1 is 3.8.
6/6 = 1.
The left endpoints of the six subintervals are
−3, −2, −1, 0, 1, 2.
197
Unit 8 Integration methods
198
Solutions to activities
! 1
Solution to Activity 10
! ! (b) (u − 2u2 ) du
4 √ 4
−1/2
(a) 3 x dx = 3x1/2 dx 2 11
1 2
1
(
1
04 = − 2 × 13 u3
2u
−1/2
1 3/2 2 11
= 3× x
3/2 1 = 21 u2 − 32 u3
2 14 −1/2
= 2x3/2 2 11 2 11
1 = 12 u2 − 32 u3
−1/2 −1/2
= 2 × 4 − 2 × 13/2
3/2
2 11 2 11
=2×8−2 = 12 u2 − 32 u3
−1/2 −1/2
= 14. & 2 + 2& 3 +
! = 2 1 − (− 2 ) − 3 1 − (− 21 )3
1 1 2
0
(b) x(1 + x) dx = 12 (1 − 14 ) − 23 (1 + 18 )
−1 1 3 2 9
! 0 = 2 × 4 − 3 × 8
= (x + x2 ) dx = 3
8 − 3
4
−1
)1 2 1 3
'0 = − 83 .
= 2x + 3x −1
= ( 12 ×0 + 2 1
× 03 ) − ( 12 (−1)2 + 13 (−1)3 ) Solution to Activity 12
3 √ √
! 7 ! 7
= −( 21 − 31 ) 1 3
(a) √ 2x dx = 1
2 √ x3 dx
= − 61 . 3 3
2 1√7
! 4 ! 4, - 1 1 4
r+5 5 = 2 4x √
(c) dr = dr 1+ 3
2 r 2 r ) '√
) '4 1 4 √ 7
= 8 x
= r + 5 ln r 2 .√ 3
√ #
= (4 + 5 ln 4) − (2 + 5 ln 2) = 1
8 ( 7)4 − ( 3)4
= 2 + 5(ln 4 − ln 2) = 18 (72 − 32 )
= 2 + 5 ln(4/2) = 18 (49 − 9)
= 2 + 5 ln 2. = 5.
! π/4
Solution to Activity 11 (b) (sin x + cos x) dx
! 5 ( 05 −π/4
1 5/2 ! !
(a) x3/2 dx = x π/4 π/4
3 5/2 3 = sin x dx + cos x dx
2 15 −π/4 −π/4
2 5/2
= 5x ) 'π/4 ) 'π/4
3 = − cos x −π/4 + sin x −π/4
2 15
= 25 x5/2 ) 'π/4 ) 'π/4
3 = − cos x −π/4 + sin x −π/4
= 2 5/2
− 35/2 ) . .π# . π ##
5 (5
= − cos − cos −
= 16.1 (to 3 s.f.). . π #4 . π #4
+ sin − sin −
, 4 - 4 , -
1 1 1 1
=− √ −√ + √ − −√
2 2 2 2
2
=√
2
√
= 2.
199
Unit 8 Integration methods
200
Solutions to activities
Solution to Activity 21 du
(f) Let u = sin x; then = cos x. So
du ! , - dx ! , -
(a) Let u = cos x; then = − sin x. So 1 1
! dx ! cos x dx = du
1 + sin2 x 1 + u2
ecos x (− sin x) dx = eu du
= tan−1 u + c
= eu + c = tan−1 (sin x) + c.
= ecos x + c.
du Solution to Activity 22
3
(b) Let u = x ; then = 3x2 . So du
! dx ! (a) Let u = 4 + cos x; then = − sin x. So
& 3
+ 2 ! dx !
sin(x ) (3x ) dx = sin(u) du
(4 + cos x) (− sin x) dx = u7 du
7
= − cos u + c
= − cos(x3 ) + c. = 81 u8 + c
du = 18 (4 + cos x)8 + c.
(c) Let u = sin x; then = cos x. So
! , - dx ! du
1 1 (b) Let u = 1 + x2 ; then = 2x. So
cos x dx = du ! " dx !
sin x u √
1 + x2 (2x) dx = u du
= ln |u| + c
!
= ln | sin x| + c. = u1/2 du
du
(d) Let u = cos x; then = − sin x. So 1 3/2
! , - dx ! = u +c
1 1 3/2
(− sin x) dx = du
cos x
2 u2 = 23 (1 + x2 )3/2 + c.
!
du
= u−2 du (c) Let u = x5 − 8; then = 5x4 . So
! dx !
1 −1
=u +c (x − 8) (5x ) dx = u10 du
5 10 4
−1
1 = 1 11
=− +c 11 u +c
u 5
1 = 1
11 (x − 8)11 + c.
=− +c du
cos x (d) Let u = ex + 5; then = ex . So
= − sec x + c. ! , - dx!
1 1
(A quicker way to find this indefinite integral, x
e dx = du
x
e +5 u
without using substitution, is to write the
integrand as − sec x tan x and use the table of = ln |u| + c
standard indefinite integrals.) = ln |ex + 5| + c
du = ln(ex + 5) + c.
(e) Let u = sin x; then = cos x. So
! dx! du
(e) Let u = 5 + 2x3 ; then = 6x2 . So
sin4 x cos x dx = u4 du ! dx !
& 3
+ 2
sin(5 + 2x ) (6x ) dx = sin u du
= 15 u5 + c
= 1
sin5 x + c. = − cos u + c
5
= − cos(5 + 2x3 ) + c.
202
Solutions to activities
Solution to Activity 23 du
(b) Let u = 1 + x2 ; then = 2x. So
du ! " !dx." #
(a) Let u = ex ; then = ex . So
! dx ! x 1 + x2 dx = 1 + x2 x dx
x x
(cos(e )) e dx = cos u du ! ." #
= 21 1 + x2 × 2x dx
= sin u + c !
√
= sin(ex ) + c. = 12 u du
du !
(b) Let u = 1 + sin x; then = cos x. So
! dx
! = 12 u1/2 du
e1+sin x (cos x) dx = eu du = 1 2 3/2
2 × 3u +c
= eu + c = 1 3/2
3u +c
= e1+sin x + c. = 1 2 3/2
3 (1 + x ) + c.
du du
(c) Let u = x10 + 6; then = 10x9 . So (c) Let u = cos x; then = − sin x. So
! , - dx ! dx !
1 1 !
9
10
(10x ) dx = du cos4 x sin x dx = − (cos4 x)(− sin x) dx
x +6 u
= ln |u| + c !
= − u4 du
= ln |x10 + 6| + c
= ln(x10 + 6) + c. = − 15 u5 + c
du = − 51 cos5 x + c.
(d) Let u = cos x; then = − sin x. So
! dx !
Solution to Activity 25
(cos3 x)(− sin x) dx = u3 du
du
1 4 (a) Let u = 1 + 2x2 ; then = 4x. So
= 4u + c ! ! ,dx -
= 1 4 x 1
4 cos x + c. dx = x dx
1 + 2x2 1 + 2x2
! , -
Solution to Activity 24 1 1
=4 (4x) dx
du 1 + 2x2
(a) Let u = x3 ; then = 3x2 . So !
! dx ! 1
& + = 41 du
2 3
x cos(x ) dx = cos(x3 ) x2 dx u
! = 14 ln |u| + c
= 3 (cos(x3 ))(3x2 ) dx
1
= 1
4 ln |1 + 2x2 | + c
! = 1
ln(1 + 2x2 ) + c
1 4
= 3 cos u du (since 1 + 2x2 is always positive).
1
= 3 sin u + c
= 1
3 sin(x3 ) + c.
203
Unit 8 Integration methods
du du
(b) Let u = 2 + ex ; then = ex . So (c) Let u = 1 + 12 sin x; then = 21 cos x. So
! dx
! , - ! dx
ex 1
dx = ex dx (1 + 21 sin x)2 cos x dx
(2 + ex )2 (2 + ex )2 !
!
1 = 2 (1 + 21 sin x)2 ( 12 cos x) dx
= du
u2 !
!
= u−2 du = 2 u2 du
= −u−1 + c = 2 × 31 u3 + c
1 = 23 u3 + c
=− +c
u
1 = 23 (1 + 12 sin x)3 + c.
=− + c.
2 + ex Solution to Activity 27
du du
(c) Let u = 3 − sin x; then = − cos x. So Let u = cos x; then = − sin x. So
! dx
! , - dx
cos x 1 ! !
dx = − (− cos x) dx sin x
3 − sin x 3 − sin x tan x dx = dx
! cos x
! , -
1 1
=− du = (sin x) dx
u cos x
= − ln |u| + c ! , -
1
= − ln |3 − sin x| + c =− (− sin x) dx
cos x
!
= − ln(3 − sin x) + c 1
=− du
(since 3 − sin x is always positive). u
= − ln |u| + c
Solution to Activity 26 = − ln | cos x| + c.
du
(a) Let u = cos x; then = − sin x. So
! dx ! Solution to Activity 28
ecos x sin x dx = − ecos x (− sin x) dx du
(a) Let u = 6x − 1; then = 6. So
! ! ! dx
=− eu du e6x−1 dx = 1
6 e6x−1 × 6 dx
!
= −eu + c
= 1
eu du
= −ecos x + c. 6
du = 16 eu + c
(b) Let u = 3 − ex ; then = −ex . So
! dx
! , - = 61 e6x−1 + c.
ex 1
dx = − (−ex ) dx du
3 − ex 3 − ex (b) Let u = 4x; then = 4. So
! ! dx!
1
=− du sin(4x) dx = 41 (sin(4x)) × 4 dx
u
= − ln |u| + c !
1
= − ln |3 − ex | + c. = 4 sin u du
= 14 (− cos u) + c
= − 41 cos u + c
= − 41 cos(4x) + c.
204
Solutions to activities
du du
(c) Let u = −9x; then = −9. So (g) Let u = 2x + 1; then = 2. So
! dx
! ! dx
! , -
1 1
e−9x dx = − 19 e−9x × (−9) dx dx = 1
2 × 2 dx
! (2x + 1)2 (2x + 1)2
!
= − 19 eu du 1 1
=2 du
u2
!
= − 91 eu + c
=2 1
u−2 du
= − 19 e−9x + c.
du = − 12 u−1 + c
(d) Let u = −x/3; then = − 31 . So 1 1
! !dx =− × +c
2 u
e−x/3 dx = −3 e−x/3 × (− 31 ) dx 1
! =− + c.
2(2x + 1)
u
= −3 e du
du
(h) Let u = 1 − x; then = −1. So
= −3eu + c ! dx
! , -
= −3e−x/3 + c. 1 1
dx = − × (−1) dx
du (1 − x)3 (1 − x)3
(e) Let u = 3 − 7x; then = −7. So !
dx 1
! ! =− 3
du
u
1
cos(3 − 7x) dx = − 7 (cos(3 − 7x)) × (−7) dx !
! = − u−3 du
= − 17 cos u du
= −(− 21 u−2 ) + c
= − 71 sin u + c 1
= 2 +c
1
= − 7 sin(3 − 7x) + c. 2u
1
du = + c.
(f) Let u = 4 − x; then = −1. So 2(1 − x)2
! ! dx
, - du
1 1 (i) Let u = 6x; then = 6. So
dx = − × (−1) dx dx !
4−x 4−x !
!
1 sec2 (6x) dx = 16 (sec2 (6x)) × 6 dx
=− du !
u
= − ln |u| + c = 6 sec2 u du
1
= − ln |4 − x| + c. = 1
tan u + c
6
1
= 6 tan(6x) + c.
205
Unit 8 Integration methods
!
Solution to Activity 29 (c) e3t+4 dt = 13 e3t+4 + c.
du !
(a) Let u = 2x; then = 2. So
! dx ! (d) cos( 12 θ) dθ = 2 sin( 21 θ) + c.
1 1
√ dx = " dx !
1 − 4x2 1 − (2x)2
! (e) e−4p dp = − 41 e−4p + c.
1 1
=2 " × 2 dx !
1 − (2x)2 1
! (f) dx = 17 ln |7x − 4| + c.
1 7x − 4
= 12 √ du !
1 − u2 1
(g) dr = − ln |1 − r| + c.
= 21 sin−1 u + c 1−r
!
= sin−1 (2x) + c.
1
2 (h) sec2 (5x − 1) dx = 15 tan(5x − 1) + c.
√ du √ !
(b) Let u = 2 x; then = 2. So
! dx ! (i) sin( 15 x) dx = −5 cos( 15 x) + c.
1 1 1 √
dx = √ &√ + × 2 dx !
1 + 2x2 2
2 1+ 2x (j) e−x dx = −e−x + c.
!
1 1 !
=√ du
2 1 + u2 (k) ex/2 dx = 2ex/2 + c.
1 !
= √ tan−1 u + c
2 (l) sec2 (2 − 3p) dp = − 13 tan(2 − 3p) + c
1 .√ #
= √ tan−1 2 x + c. = 31 tan(3p − 2) + c.
2
du (The final expression was obtained by using the
(c) Let u = 32 x; then = 23 . So trigonometric identity tan(−θ) = − tan θ.)
! dx ! ! , - !
1 1 1 2−x & +
dx = 4 dx (m) sin dx = sin 32 − 13 x dx
4 + 9x2 1 + 94 x2 3
! & & ++
1 = −3 − cos 32 − 13 x + c
= 14 & 3 +2 dx , -
1 + 2x 2−x
! = 3 cos + c.
1 2 1 3 3
=4×3 & 3 +2 × 2 dx
1 + 2x Solution to Activity 31
!
1
= 61 3
& 3 +2 × 2 dx (a) The velocity v (in m s−1 ) of the object at time t
1 + 2x (in seconds) is given by
! & +
1 1 v = 2 cos 21 t + 5 .
=6 du
1 + u2 Hence the displacement s (in m) of the object
= 61 tan−1 u + c at time t (in seconds) is given by
& + !
= 61 tan−1 23 x + c. & +
s = 2 cos 21 t + 5 dt
Solution to Activity 30 & +
! = 2 × 2 sin 21 t + 5 + c
& +
(a) 1
cos(10x) dx = 10 sin(10x) + c. = 4 sin 12 t + 5 + c,
! where c is an arbitrary constant.
(b) sin(2 − 5x) dx = − 15 (− cos(2 − 5x)) + c
1
= 5 cos(2 − 5x) + c.
206
Solutions to activities
When t = 0, s = 4. Hence du
& + (b) Let u = cos x; then = − sin x. Putting
4 = 4 sin 12 × 0 + 5 + c dx
x = π/2 gives u = 0 and putting x = π gives
4 = 4 sin(5) + c u = −1. So
! π ! π
c = 4 − 4 sin(5) 3
cos x sin x dx = − cos3 x(− sin x) dx
c = 7.835 . . . π/2 π/2
! −1
c = 7.84 (to 2 d.p.).
=− u3 du
So the equation for the displacement of the 0
object in terms of time is 2 1−1
& + = − 14 u4
s = 4 sin 12 t + 5 + 7.84. 0
1
) 4 '−1
(b) When t = 10, = −4 u 0
& + & +
s = 4 sin 12 × 10 + 5 + 7.835 . . . = − 14 (−1)4 − 04
= 4 sin(10) + 7.835 . . . = − 41
= 5.7 (to 2 s.f.). = −0.25.
So the displacement of the object at time (Notice that in the second line of the
10 seconds is 5.7 m (to 2 s.f.). manipulation above we obtain an integral,
/ −1 3
0
u , in which the lower limit of integration is
Solution to Activity 32 greater than the upper limit of integration. An
du alternative way to proceed from this line is to
(a) Let u = 1 + 2x2 ; then = 4x.
dx use the fact that, in general,
/a /b
Putting x = 0 gives u = 1 and putting x = 1 b
f (x) dx = − a f (x) dx. Then the
gives u = 3. So manipulation continues like this:
! 1 ! 1, - ! −1 ! 0
x 1
2
1
dx = 4 (4x) dx u3 du = u3 du
0 1 + 2x 0 1 + 2x2 0 −1
! 3 2 10
1 = 1 4
4u
1
=4 du
1 u
−1
) '3 1
) '
4 0
= 41 ln |u| 1 = 4 u −1
& 4 +
= 14 (ln 3 − ln 1) = 1
4 0 − (−1)4
= 14 ln 3 = − 41
= 0.275 (to 3 s.f.). = −0.25.)
du
(c) Let u = −3x2 ; then = −6x.
dx
Putting x = −1 gives u = −3 and putting x = 0
gives u = 0. So
! 0 ! 0. #
−3x2 2
xe dx = − 61
e−3x (−6x) dx
−1 −1
!0
= − 16 eu du
−3
) '0
= − 61 eu −3
& +
= − 61 e0 − e−3
& +
= 61 e−3 − 1
= −0.158 (to 3 s.f.).
207
Unit 8 Integration methods
= − 51 x cos(5x) + 51 × 51 sin(5x) + c
= −u−1 + 25 u−2 + c
1 5 = − 15 x cos(5x) + 25 1
sin(5x) + c
=− + 2 +c = 251
sin(5x) − 15 x cos(5x) + c.
u 2u
1 5 ! !
=− + + c. 2x
& 1 2x + & +
x + 3 2(x + 3)2 (b) xe dx = x 2 e − 1 × 12 e2x dx
!
1 2x
= 2 xe − 2 e2x dx
1
Solution to Activity 34
(a) Let f (x) = x and g(x) = cos x. Then f " (x) = 1 = 12 xe2x − 1 1 2x
2 × 2e +c
and an antiderivative of g(x) is G(x) = sin x. = 1
2 xe
2x
− 1 2x
4e +c
Hence
! ! 1 2x
= 4 e (2x −1) + c.
x cos x dx = f (x)g(x) dx ! !
& + & +
! (c) xe−x dx = x −e−x − 1 × −e−x dx
= f (x)G(x) − f " (x)G(x) dx !
! = −xe−x + e−x dx
= x sin x − sin x dx
= −xe−x + (−e−x ) + c
= x sin x − (− cos x) + c = −xe−x − e−x + c
= x sin x + cos x + c. = −e−x (x + 1) + c.
(b) Let f (x) = x and g(x) = ex . Then f " (x) = 1
and an antiderivative of g(x) is G(x) = ex .
208
Solutions to activities
209
Unit 8 Integration methods
210
Solutions to activities
(c) Integrating by parts gives The function y = e−x is decreasing on its whole
! π/6 domain, so the function y = e−x − 1 is also
x sin(3x) dx decreasing on its whole domain. The x-intercept
π/12
! of its graph is given by e−x − 1 = 0, which gives
2 1π/6 π/6
1
= x(− 3 cos(3x)) − (− 13 cos(3x)) dx e−x = 1
π/12 π/12 − x = ln 1
! π/6
) 'π/6 −x=0
= − 31 x cos(3x) π/12 + 13 cos(3x) dx
π/12 x = 0.
. .π# . π ##
1 π π So its x-intercept is x = 0.
= −3 cos − cos
6 2 12 4
2 1π/6 So we have the following table of signs, for
+ 31 13 sin(3x) x > 0.
π/12
.π π 1 # ) ' x (0, 2) 2 (2, ∞)
π/6
= − 31 ×0− × √ + 19 sin(3x) π/12
6 12 2 x−2 − 0 +
π . .π# . π ## e−x − 1 − − −
= √ + 19 sin − sin
36 2 2 4 (x − 2)(e−x − 1) + 0 −
π . 1 #
= √ + 91 1 − √ The table shows that the graph of the function
36 2 2
= 0.0943 (to 4 d.p.). f (x) = (x − 2)(e−x − 1) lies above the x-axis for
0 < x < 2 and below the x-axis for x > 2.
(The area found in this solution is shown below.)
(b) Because the graph of f lies above the x-axis for
0 < x < 2 and below the x-axis for x > 2, to
find the total area between the graph of f and
the x-axis from x = 0 to x = 4 we have to work
out the signed area from x = 0 to x = 2 and the
signed area from x = 2 to x = 4 separately.
The signed area between the graph of f and the
x-axis from x = 0 to x = 2 is given by
! 2
(x − 2)(e−x − 1) dx
0
2 12 ! 2
−x
= (x − 2)(−e − x) − (−e−x − x) dx
0 0
! 2
= 0 − (−2)(−e−0 ) + (e−x + x) dx
0
2 12
−x 1 2
= 2(−1) + −e + 2 x
0
−2 −0
= −2 + (−e + 2) − (−e + 0)
Solution to Activity 43 = −2 − e−2 + 2 + 1
(a) To help us construct a table of signs for the = 1 − e−2 .
function f (x) = (x − 2)(e−x − 1), first we Since the graph of f lies above the x-axis for
consider the two factors of the expression 0 < x < 2, it follows that the area between the
(x − 2)(e−x − 1) separately. graph of f and the x-axis from x = 0 to x = 2 is
The function y = x − 2 is increasing on its 1 − e−2 .
whole domain, and the x-intercept of its graph
is x = 2.
211
Unit 8 Integration methods
! !
Similarly, the signed area between the graph (b) sin x cos x dx = 1
sin(2x) dx
2
of f and the x-axis from x = 2 to x = 4 is given
by = 1
× 21 (− cos(2x)) + c
! 4 2
(x − 2)(e−x − 1) dx = − 14 cos(2x) + c.
2
2 14 ! 4 Solution to Activity 45
= (x − 2)(−e−x − x) − (−e−x − x) dx du
2 2 Let u = sin x; then = cos x. So
! 4 ! dx
!
= 2(−e−4 − 4) − 0 + (e−x + x) dx sin x cos x dx = u du
2
2 14
= −2e −4
− 8 + −e−x + 12 x2 = 12 u2 + c
2
= −2e−4 − 8 + (−e−4 + 8) − (−e−2 + 2) = 12 sin2 x + c.
= −3e−4 + e−2 − 2. (There’s an alternative approach to finding this
integral, as follows. Let u = cos x; then
Since the graph of f lies below the x-axis for du
2 < x < 4, it follows that the area between the = − sin x. So
dx! !
graph of f and the x-axis from x = 2 to x = 4 is
−(−3e−4 + e−2 − 2) = 3e−4 − e−2 + 2. sin x cos x dx = cos x sin x dx
!
So the total area between the graph of f and = − (cos x)(− sin x) dx
the x-axis from x = 0 to x = 4 is !
(1 − e−2 ) + (3e−4 − e−2 + 2) = − u du
= 3 − 2e−2 + 3e−4
= − 21 u2 + c
= 2.784 (to 4 s.f.).
= − 21 cos2 x + c.
(The area found in this solution is shown below.)
This approach gives a different answer, but of
course it’s equivalent to the first answer. You can
confirm this by using the identity
sin2 θ + cos2 θ = 1, as follows:
− 21 cos2 x + c = − 12 (1 − sin2 x) + c
= 1
2 sin2 x − 1
2 +c
1 2
= 2 sin x + d,
where d = − 21 + c is an arbitrary constant.
Notice also that the solution to Activity 44(b) gives
a third
! different answer:
sin x cos x dx = − 14 cos(2x) + c.
212
Solutions to activities
(The diagram below shows the graphs of the three second expression is a simple function of a
different antiderivatives of f (x) = sin x cos x that linear expression.
!
were found in the solutions to Activity 44(b) and
Activity 45. The arbitrary constant has been taken (x + cos(5x)) dx = 12 x2 + 51 sin(5x) + c.
to be zero in each case. You can see that, as you’d
(d) You could use integration by substitution
expect, the three graphs appear to be vertical
(taking u = x2 + 3) or integration by parts, but
translations of each other.)
the simplest method is to start by multiplying
out! the brackets. !
x(x2 + 3) dx = (x3 + 3x) dx
= 14 x4 + 32 x2 + c
= 41 x2 (x2 + 6) + c.
1
(e) The integrand can be written as &√ +2 ,
1+ 3x
√
so it’s a function of the linear expression
√ 3 x.
So you can use the substitution u = 3 x,
1
together with the indefinite integral of ,
1 + x2
which is a standard integral.
√ √
Let u = 3 x; then du/dx = 3. So
! ! √
1 1 1
dx = √ &√ + × 3 dx
Solution to Activity 46 1 + 3x2 3 1+ 3x
2
!
(a) Since the derivative of 9x4 is 36x3 , which is 1 1
‘nearly’ x3 , you can use the substitution =√ du
3 1 + u2
u = 9x4 . 1
du = √ tan−1 u + c
Let u = 9x4 ; then = 36x3 . So 3
! dx ! 1 .√ #
= √ tan−1 3 x + c.
x3 cos(9x4 ) dx = 1
36 cos(9x4 ) × 36x3 dx 3
! (f) Since the derivative of 7 − x3 is −3x2 , which is
1
= 36 cos u du ‘nearly’ the numerator x2 , you can use the
1 substitution u = 7 − x3 .
= 36 sin u + c
du
= 1
sin(9x4 ) + c. Let u = 7 − x3 ; then = −3x2 . So
36
! dx !
(b) The integrand is of the form xg(x), where g is a x2 1
function that you can integrate, which suggests dx = − 1
3 (−3x2 ) dx
(7 − x3 )7 (7 − x3 )7
that you should try integration by parts. To !
1 1
integrate the ‘second’ expression, you can use = −3 du
u7
the fact that it is a simple function of a linear !
expression. = − 13 u−7 du
! !
1 1
x cos(5x) dx = 5 x sin(5x) − 5 sin(5x) dx 1 1 −6
=− × u +c
& + 3 −6
= 51 x sin(5x) − 15 − 15 cos(5x) + c 1
= +c
= 15 x sin(5x) + 1
25 cos(5x) + c. 18u6
(c) The integrand is the sum of two expressions 1
= + c.
that are straightforward to integrate. The 18(7 − x3 )6
213
Unit 8 Integration methods
!
Let u = x2 ; then du/dx = 2x. The expression
= (ex − 2e2x ) dx
above becomes
!
= ex − 2 × 12 e2x + c 1 2
(sin(x2 )) × 2x dx
1
2x + 2
= ex − e2x + c !
1 2 1
= ex − (ex )2 + c = 2 x + 2 sin u du
= ex (1 − ex ) + c. = 12 x2 + 21 (− cos u) + c
(Here’s how the integration goes if you use
= 12 x2 − 1
2 cos(x2 ) + c
integration by substitution.
= 21 (x2 − cos(x2 )) + c.
214
Solutions to activities
2 8
x(3x − 2) dx = 1 2 8
(3x − 2) × 6x dx A computer gives the value of this definite integral
6
! as 0.939 (to 3 s.f.). So the volume of plastic
= 1
u8 du required to make one metre of the edging is
6
approximately 93.9 cm3 .
1 1 9
= 6 × 9u (Details of how to use the CAS to find the indefinite
= 1 2
2)9 + c. integral are given in the Computer algebra guide, in
54 (3x −
the section ‘Computer methods for CAS activities
Solution to Activity 48 in Books A–D’.)
(a) The domain of f is the interval
√ [−1, ∞) Solution to Activity 50
(because the expression x + 1 is defined only
when x is in this interval). The change in the displacement of the object from
time t = 0 to time t = 10 is
(b) The graph of f is shown below. ! 10 , 2 -
t
sin dt.
0 150
A computer gives the value of this definite integral
as 2.15 (to 2 d.p.).
So the displacement of the object at time t = 10 is
(6 + 2.15) m = 8.15 m, to the nearest centimetre.
(Details of how to use the CAS to find the indefinite
integral are given in the Computer algebra guide, in
the section ‘Computer methods for CAS activities
in Books A–D’.)
215
Acknowledgements
Acknowledgements
Grateful acknowledgement is made to the following source:
Page 193: Mark Hobbs, for the idea for the cartoon
216
Unit 9
Matrices
Introduction
Introduction
This unit introduces mathematical objects called matrices (the singular
form is matrix ). At their simplest, matrices are arrays of numbers used to
hold information in an orderly way. In particular, they provide a useful
way of storing data that can be organised in rows and columns. Everyday
examples of such data include transport timetables and tables holding
survey results.
In this unit you will discover that arrays of numbers can be thought of as
‘multi-dimensional numbers’. For example, you can add, subtract and
multiply them, just as you can add, subtract and multiply numbers.
You’ve met a similar idea before: in Unit 5, you saw that column vectors
are numerical entities made up of more than one number, and they can be
added, subtracted and multiplied by scalars, for example.
One advantage of arranging large amounts of numerical data in matrix
form is precisely that a block of data can be treated as a single entity.
Many problems that involve large amounts of numerical data are first
reduced to matrix problems before being solved with a computer. So
matrices are an important topic in mathematics, and find widespread use
in mathematical modelling, as well as in computer graphics (where they
are used to move and rotate objects), medical imaging, economic models,
electrical networks, data encryption and many other applications. A magic square (an example
of an array of numbers) on
As a simple practical example, consider a publisher that supplies its books the facade of the Sagrada
to a number of bookshops. The publisher’s sales department might collect Familia basilica in Barcelona
the weekly order quantities for three of its bestselling books in a grid such
as the one in Table 1.
Table 1 A publisher’s order quantities for three books in a certain week
Bestseller 1 Bestseller 2 Bestseller 3
Bookshop 1 10 25 12
Bookshop 2 5 10 5
Bookshop 3 7 5 0
Bookshop 4 10 8 10
219
For example, the information for two consecutive weeks can be recorded in
two separate matrices:
10 25 12 15 30 20
5 10 5 10 20 10
.
7 5 0 , 20 8 5
10 8 10 15 0 10
The order quantities for a fortnight can now be calculated by adding
corresponding numbers in the two arrays, which gives
10 + 15 25 + 30 12 + 20 25 55 32
5 + 10 10 + 20 5 + 10
, that is, 15 30 15 .
7 + 20 5+8 0+5 27 13 5
10 + 15 8 + 0 10 + 10 25 8 20
This calculation amounts to adding the two matrices, in a sense that will
be made precise in Section 1.
You might wonder what the advantage is in carrying out calculations in
this way. After all, the total number of orders for each bookshop can be
worked out in a straightforward way without resorting to matrices. In a
real-life situation, however, a large publisher will supply hundreds of
bookshops with hundreds of books. Calculating order quantities and total
charges requires careful accounting, and it is precisely in this sort of
situation that matrices become a useful way to store data (especially
electronically) and carry out calculations with them.
There are other useful operations that can be carried out when data are
stored in matrix form, such as the multiplication of matrices by single
numbers, and the multiplication of two matrices. You’ll learn about these
operations in Section 1, and see an application of matrix multiplication in
Section 2. In Section 3 you’ll learn about matrix inverses, and in Section 4
Pages from The nine chapters you’ll meet a connection between matrices and systems of linear equations.
on the mathematical art
220
1 Matrices and matrix operations
221
Unit 9 Matrices
The column notation for vectors that you met in Unit 5 is a particular
instance of matrix notation: two-dimensional column vectors are 2 × 1
matrices, and three-dimensional column vectors are 3 × 1 matrices. In
general, the word vector is used to mean any matrix with a single column,
even if it has more than 3 elements and hence has no geometric
interpretation in the plane or three-dimensional space. For instance, the
following matrices are vectors:
√
2
1 −3 w
3 1 x
, and , where w, x, y, z are variables.
5 2 y
0
7 z
7
The elements of a vector are often called its components, as in Unit 5.
A vector with n components is called an n-dimensional vector.
There is a useful notation for the elements of a matrix in terms of their
row and column positions. The element in row i and column j of a
matrix A is denoted by aij . Similarly, the element in row i and column j
of a matrix B is denoted by bij , and so on.
For instance, if
0 1
1
3 4
A= ,
7 −6
Write down the elements a33 , a34 , a43 , a14 , a44 and a42 of the matrix
2 8 0 −1
4 7 5 −2
A= 9 −1
.
3 −4
6 −5 −7 −9
What is the size of A?
Two matrices are equal if they have the same numbers of rows and
columns, and all corresponding elements are equal. For example,
. / . /
1 2 1 2
= ,
3 4 3 4
222
1 Matrices and matrix operations
but
. / 1 2 . / . /
1 2 1 2 1 3
'= 3 4 and '= .
3 4 3 4 2 4
5 6
You can carry out operations on matrices in a similar way to numbers. For
example, you can add, subtract and multiply them, as you’ll see in this
section. There’s also a way to manipulate matrices that in certain
circumstances plays the role of division, which you’ll learn about in
Section 3. While matrix addition is similar to addition of vectors, which
you met in Unit 5, matrix multiplication is more surprising. We’ll start by
looking at matrix addition.
The rule for adding two matrices can be expressed formally as follows.
223
Unit 9 Matrices
Matrix addition
If A and B are m × n matrices, then A + B is the m × n matrix
whose element in row i and column j is aij + bij .
Addition of two matrices of different sizes is not defined.
For each of the pairs of matrices below, work out the matrix sum A + B.
. / . /
2 1 4 6
(a) A = , B=
3 0 2 −1
. / . /
2 0 −4 1 0 1
(b) A = , B=
−1 6 0 0 1 1
. / . /
a b 3 −1
(c) A = , B=
c d −1 2
In Unit 5, you saw that vector addition shares many properties with the
addition of numbers. This happens for matrix addition, too. For example,
you can see from the definition of matrix addition that
A + B = B + A,
for all matrices A and B of the same size. As you saw with vectors, this
property is expressed by saying that matrix addition is commutative.
You can also see from the definition of matrix addition that
(A + B) + C = A + (B + C),
for all matrices A, B and C of the same size. In other words, when you
add three matrices A, B and C of the same size, it doesn’t matter whether
you add A and B and then add C to the result, or add B and C and then
add A to the result, as you get the same overall result either way. This
property is expressed by saying that matrix addition is associative. It
tells you that you can write the expression
A+B+C
without ambiguity: it doesn’t matter whether it means (A + B) + C or
A + (B + C), as both are equal.
There are other respects in which matrix operations behave like number
operations. You will discover more similarities between matrices and
numbers as you meet other matrix operations.
224
1 Matrices and matrix operations
You can also subtract matrices of the same size by subtracting their
corresponding elements. For example,
. / . / . /
4 5 3 1 3 4 4−1 5−3 3−4
− =
7 −6 0 3 9 1 7 − 3 −6 − 9 0 − 1
. /
3 2 −1
= .
4 −15 −1
Matrix subtraction
If A and B are m × n matrices, then A − B is the m × n matrix
whose element in row i and column j is aij − bij .
Subtraction of two matrices of different sizes is not defined.
Since vectors are single-column matrices, they are added and subtracted
by adding or subtracting corresponding components, as in the examples
below.
1 7 1+7 8
−3 −2 −3 + (−2) −5
4 + 1 = 4 + 1 = 5
1 3 1 3
2
2 2 2 + 2
0 −1 0 + (−1) −1
10 −2 10 − (−2) 12
3 3 3 − 3 0
− =
1 2 1 − 2 = −1
−1 1 −1 − 1 −2
In other words, the ideas of addition and subtraction of column vectors
that you met in Unit 5 extend in the obvious way to vectors with more
than three components.
225
Unit 9 Matrices
The matrix kA is called the scalar multiple of the matrix A by the real
number k, and the operation of forming kA is called scalar
multiplication.
Scalar multiplication
If A is an m × n matrix and k is any real number, then kA is the
matrix whose element in row i and column j is kaij .
226
1 Matrices and matrix operations
Notice that it follows from the definition of scalar multiplication that, for
any matrix A,
(−1)A = −A,
as you’d expect.
In the next activity you can practise the matrix operations that you’ve
learned so far.
227
Unit 9 Matrices
228
1 Matrices and matrix operations
10 25 12 4
5 10 5 7 gives 10 × 4 + 25 × 7 + 12 × 9
7 5 0
10 8 10 9
229
Unit 9 Matrices
Now suppose that as well as calculating the amount owed by each of the
four bookshops, the publisher also wants to calculate the profit made from
sales to each individual bookshop. Suppose that the profits made on each
copy of Bestsellers 1, 2 and 3 are £1, £2 and £2, respectively. You can use
exactly the same procedure as above to calculate the profit made from the
sales to each bookshop. You write the profits (in £) made on the three
books as a three-dimensional vector, and form
10 25 12
5 10 5 1
the product of 7 5 0
and 2 ,
2
10 8 10
which is
10 × 1 + 25 × 2 + 12 × 2 84
5 × 1 + 10 × 2 + 5 × 2 35
7 × 1 + 5 × 2 + 0 × 2 = 17 .
10 × 1 + 8 × 2 + 10 × 2 46
So the profits made from sales to the four bookshops are £84, £35, £17
and £46, respectively.
In fact you can view the two matrix multiplications above – the one for the
amounts owed, and the one for the profits – as a single matrix
multiplication. To do this, you put the two vectors containing the prices
and the profits together as a single matrix with two columns, and you put
the two vectors that contain the results of the calculations together as a
single matrix with two columns, and say that
10 25 12 323 84
5 10 5 4 1
the product of and 7 2 is 135 35 . (1)
7 5 0 63 17
9 2
10 8 10 186 46
230
1 Matrices and matrix operations
4× 3 3 ×2 gives 4 × 2.
Now let’s look at the procedure for multiplying two matrices step by step.
It’s demonstrated in the next example.
231
Unit 9 Matrices
To obtain the element in the first row and first column of AB,
multiply each element in the first row of A by the corresponding
element in the first column of B, and add the results.
−2 1 3 1 0 (−2) × 1 + 1 × 21 + 3 × (−1)
2 1
4 −2 2 2 =
0 −1 0 −1 4
9
−2
=
To obtain the element in the first row and second column of AB,
apply the same procedure to the first row of A and the second column
of B.
9
−2 1 3 1 0 − 2 (−2) × 0 + 1 × 2 + 3 × 4
2 4 −2 12 2 =
0 −1 0 −1 4
9
− 2 14
=
To obtain the element in the second row and first column of AB,
apply the same procedure to the second row of A and the first column
of B.
−2 1 3 1 0 − 29 14
2 4 −2 12 2 = 2 × 1 + 4 × 21 + (−2) × (−1)
0 −1 0 −1 4
9
− 2 14
= 6
In general, to obtain the element in the ith row and jth column
of AB, multiply each element in the ith row of A by the
corresponding element in the jth row of B, and add the results.
−2 1 3 1 0
2 4 −2 12 2
0 −1 0 −1 4
− 29 14
= 6 2 × 0 + 4 × 2 + (−2) × 4
1
0 × 1 + (−1) × 2 + 0 × (−1) 0 × 0 + (−1) × 2 + 0 × 4
9
− 2 14
= 6 0
− 21 −2
232
1 Matrices and matrix operations
Matrix multiplication
Let A and B be matrices. Then the product matrix AB can be
formed only if the number of columns of A is equal to the number of
rows of B.
If A has size m × n and B has size n × p, then the product AB has
size m × p.
The element in row i and column j of the product matrix AB is
obtained by multiplying each element in the ith row of A by the
corresponding element in the jth column of B and adding the results.
In element notation, if cij denotes the element in the ith row and
jth column of AB, then
cij = ai1 b1j + ai2 b2j + · · · + ain bnj .
bnj
233
Unit 9 Matrices
In each of
parts (a)–(e) below, calculate the matrix product if it exists.
29 . / 9
−2 3 1 $ -
(a) 71 (b) 1 2 3 8
4 0 5
−6 8 7
2
9 0 10 9 $ -
(c) 7
1 4 1 (d) 8 1 2 3
−68 −2 7 7
. / 1 0 1
1 2 3
(e) 1 0 −1
7 6 1
2 1 3
(a) Determine whether the products AB and BA are defined for the
following pairs of matrices.
. / . /
−1 4 3
(i) A = and B =
2 1 5
. / . /
1 1 0 1
(ii) A = and B =
−1 0 2 3
(b) For each of parts (a)(i) and (a)(ii), if the products AB and BA are
both defined, evaluate them. What do you notice?
234
1 Matrices and matrix operations
In Activity 8 you should have found that, for the particular three matrices
A, B and C in the activity, (AB)C = A(BC). This does not of course tell
you that this property holds for matrices in general. However, this
property does hold in general: whenever A, B and C are matrices such
that the products AB and BC are defined, we have
(AB)C = A(BC).
In other words, matrix multiplication is associative. In this respect matrix
multiplication behaves like multiplication of numbers.
In the next activity you can investigate how matrix multiplication
interacts with multiplication by a scalar.
235
Unit 9 Matrices
236
1 Matrices and matrix operations
Matrix powers
Just as the square a2 of a real number a is defined to be the product a × a,
so we define the square A2 of a matrix A to be the matrix product AA,
if it exists.
For example, if
. /
2 1
A= ,
3 0
then
. /. / . /
2 1 2 1 7 2
A2 = AA = = .
3 0 3 0 6 3
237
Unit 9 Matrices
238
2 Matrices and networks
2.1 Networks
Figure 3 shows a simple example of a network of water pipes. Water can
be poured into the system at position A or B, and it then flows down the
pipes to come out at positions U , V and W . The pipes have different
capacities: the three pipes that come out of position A are labelled with
numbers that indicate the proportions of water that flow through these
pipes when water is poured in at A, and the three pipes that come out of
position B are labelled in the same way. For example, if a litre of water is
poured in at position A, then 0.4 litres flows down the first pipe from A,
0.2 litres flows down the second pipe from A, and 0.4 litres flows down the
third pipe from A.
A B
0:2 0:1
0:4 0:6
0:3 0:4
U V W
A B
0:2 0:1
0:4 0:6
0:3 0:4
U V W
239
Unit 9 Matrices
A B
0:2 0:1
0:4 0:6
0:3 0:4
U V W
Now consider what happens if water is input at both node A and node B
in the network in Example 2. The amount of water reaching an output
node is the sum of the water reaching the node from node A and from
node B. For example, if 1 litre of water is input at node A and another
1 litre is input at node B, then node U will collect 0.4 litres of water from
node A and 0.3 litres from node B. So, in total, 0.7 litres of water will be
output at node U . Similarly, 0.2 + 0.1 = 0.3 litres of water will be output
at node V , and 0.4 + 0.6 = 1 litre of water will be output at node W .
In general, if x litres of water is input at node A and y litres of water is
input at node B, then
0.4x + 0.3y litres of water is output at node U,
0.2x + 0.1y litres of water is output at node V,
0.4x + 0.6y litres of water is output at node W.
In the next activity you can practise calculating network outputs yourself.
240
2 Matrices and networks
U V W
0:6 0:4
0:8 0:3
0:2 0:7
C D
Network diagrams like those that you’ve seen in this subsection can be
used to model many other kinds of flows, such as the movement of money
in and out of companies. For example, consider three corporate donors, D,
E and F , say, that every month make donations to two charities, J and K.
Donor D donates 20% of its monthly designated sum to charity J and 80%
to charity K, donor E donates 30% of its sum to charity J and 70% to
charity K, and donor F donates 50% of its sum to charity J and 50% to
charity K.
The flow of money from the three donors to the two charities can be
represented by the network in Figure 5. For example, a total donation of
£1000 from donor D in a given month is modelled by an input of £1000 at
node D. The label of the arc connecting D to J tells you that 0.2, or 20%,
of that sum, which is £200, reaches the output node J .
D E F
0:3 0:7
0:8 0:5
0:2 0:5
J K
241
Unit 9 Matrices
D E F
0:3 0:7
0:8 0:5
0:2 0:5
J K
A B
0.2 0.1
0.4 0.6
0.3 0.4
U V W
242
2 Matrices and networks
243
Unit 9 Matrices
244
2 Matrices and networks
column are the labels on an arc from a particular node. You can see that
the matrix representing the network in Example 3 has this property.
U V W
0:6 0:4
0:8 0:3
0:2 0:7
C D
A B U V W
0:6 0:4
0:2 0:1 0:8 0:3
0:4 0:6
0:2 0:7
0:3 0:4
U V W C D
(a) (b)
245
Unit 9 Matrices
The first network has three output nodes, and the second network has
three input nodes. We can join these nodes so that, for instance, the water
output at U in the first network flows down the pipes that leave U in the
second network. The combined network is shown in Figure 8.
A B
0:2 0:1
0:4 0:6
0:3 0:4
U V W
0:6 0:4
0:8 0:3
0:2 0:7
C D
A B
C D
through nodes U , V and W are 0.4, 0.2 and 0.4 litres, respectively. So,
from the arc labels in the bottom half of the diagram, you can work out
the following.
• Of the 0.4 litres of water that passes through node U , the amount that
reaches node C is 0.2 × 0.4 = 0.08 litres.
• Of the 0.2 litres of water that passes through node V , the amount that
reaches node C is 0.6 × 0.2 = 0.12 litres.
• Of the 0.4 litres of water that passes through node W , the amount
that reaches node C is 0.3 × 0.4 = 0.12 litres.
To find the total amount of water output at node C, we add the
contributions from the three different routes, which gives
0.08 + 0.12 + 0.12 = 0.32 litres. So the proportion of the water input at
node A that’s output at node C is 0.32, and hence the arc from A to C
should be labelled with the number 0.32.
You can find the labels for the remaining arcs in a similar way. However,
these calculations are quite tedious, and it turns out that they can be
simplified by using the matrices that represent the two original networks,
which are
0.4 0.3 . /
0.2 0.1 and 0.2 0.6 0.3
,
0.8 0.4 0.7
0.4 0.6
respectively. These matrices were found in Subsection 2.2.
Suppose that the input and output vectors (in litres) for the combined
network in Figure 8 are
. / . /
a c
and ,
b d
respectively. Suppose also that the output vector (in litres) of the top part
of the network, which is also the input vector for the bottom part, is
u
v .
w
Then
u 0.4 0.3 . / . / . / u
v = 0.2 0.1 a and
c
=
0.2 0.6 0.3
v .
b d 0.8 0.4 0.7
w 0.4 0.6 w
Using the first of these equations to substitute in the second gives
. / . / 0.4 0.3 . /
c 0.2 0.6 0.3 a
= 0.2 0.1 .
d 0.8 0.4 0.7 b
0.4 0.6
You know from Section 1 that matrix multiplication is associative, so
. / . / 0.4 0.3 . /
c 0.2 0.6 0.3 a
= 0.2 0.1 ;
d 0.8 0.4 0.7 b
0.4 0.6
247
Unit 9 Matrices
that is,
. / . /. /
c 0.32 0.3 a
= . (2)
d 0.68 0.7 b
The conclusion of this discussion is that the matrix that represents the
combined network in Figure 9 is the product of the matrices that represent
the original networks. So matrix multiplication provides a neat way to
calculate outputs of the combined network.
Notice that the matrix representing the top network is the second matrix
in the product that represents the combined network.
The labels of the simpler network that is equivalent to the combined
network can be read off the product matrix. This is easier if we label the
rows and the columns with the nodes as below:
A B
. /
C 0.32 0.3
.
D 0.68 0.7
Notice that each column of this matrix has sum 1, as expected in view of
the comment after Example 3.
The original combined network and the equivalent simpler network, with
the arc labels, are shown in Figure 10.
A B A B
0:2 0:1
0:4 0:6
0:68 0:3
0:3 0:4
U V W 0:32 0:7
0:6 0:4
0:8 0:3
0:2 0:7
C D C D
248
2 Matrices and networks
The network below is the combination of two smaller networks, one with
input nodes U and V and output nodes P and Q, and the other with input
nodes P and Q and output nodes A, B, and C.
U V
0.5 0.8
0.2 0.5
P Q
0.3 0.6
0.3 0.4
0.3 0.1
A B C
. /
p
(a) Write down the matrix equation that gives the output vector at
. / q
u
nodes P and Q when the input vector is at nodes U and V .
v
a
(b) Write down the matrix equation that gives the output vector b at
c
. /
p
nodes A, B and C when the input vector is at nodes P and Q.
q
(c) Use your answers to parts (a) and (b) to find the
. matrix
/ equation for
u
the combined network when the input vector is at nodes U
v
and V .
(d) Draw a network with input nodes U and V and output nodes A, B
and C that is equivalent to the given network.
(e) Suppose that the given network represents a network of water pipes.
Determine the outputs at nodes A, B and C when 1 litre of water is
input at U and 2 litres of water is input at V .
249
Unit 9 Matrices
D E F
0:3 0:7
0:8 0:5
0:2 0:5
J K
J K
R S
250
3 The inverse of a matrix
251
Unit 9 Matrices
In fact, for every natural number n, the n × n matrix that has ones down
the leading diagonal (the diagonal that starts at the top left element and
ends at the bottom right element), and zeros everywhere else, is an
identity matrix.
Furthermore, for every natural number n, this matrix is the only n × n
identity matrix. To see this, let I be the n × n identity matrix described
above, and let E be an n × n identity matrix. Then IE = I and IE = E,
so I = E.
When we’re working with matrices, we usually reserve the letter I to
denote the n × n identity matrix.
The facts that you’ve seen about identity matrices are summarised in the
following box.
Identity matrices
An identity matrix is a square matrix I such that
• for any matrix A for which the product AI is defined, AI = A
• for any matrix A for which the product IA is defined, IA = A.
Each identity matrix has the form
1 0 0 ··· 0
0 1 0 · · · 0
0 0 1 · · · 0
.
.. .. .. . . ..
. . . . .
0 0 0 ··· 1
That is, it has ones down the leading diagonal and zeros elsewhere.
252
3 The inverse of a matrix
253
Unit 9 Matrices
Inverse of a matrix
If A is an invertible matrix, then
AA−1 = I and A−1 A = I,
where I is the identity matrix of the same size as A.
Inverses of 2 × 2 matrices
For 2 × 2 matrices, there is a useful formula for finding inverses.
Inverse of a 2 × 2 matrix
. /
a b
The inverse of the 2 × 2 matrix is given by
c d
. /
1 d −b
, provided that ad − bc '= 0.
ad − bc −c a
In other words, to obtain the inverse when ad − bc '= 0, swap the two
elements on the leading diagonal and multiply the other two elements
by −1, then multiply the resulting matrix by the scalar 1/(ad − bc).
You’ll be asked to verify this general formula later in this subsection, but
for now let’s check that it works for the matrix
. /
8 5
A=
3 2
in Activity 21(a). Here ad − bc = 8 × 2 − 5 × 3 = 1 '= 0, so the scalar
1/(ad − bc) is 1. You can see that the matrix
. /
2 −5
B=
−3 8
in Activity 21(a) is indeed obtained from A by swapping 8 and 2,
multiplying the other two elements by −1, and multiplying the resulting
matrix by the scalar 1.
Notice that the box above includes the condition ad − bc '= 0. This is
because the formula involves dividing by ad − bc, and division by 0 is not
defined. If the elements of a 2 × 2 matrix do not satisfy this condition,
254
3 The inverse of a matrix
then the matrix does not have an inverse. You’ll see a proof of this fact in
the next section.
In fact the quantity ad − bc that appears in the formula is so important
that it deserves a name, as follows.
Determinant of a 2 × 2 matrix
. /
a b
Let A = . Then the number ad − bc is called the
c d
determinant of A, and written as det A.
If the determinant of A is not zero, then A is invertible.
If the determinant of A is zero, then A is not invertible.
Some texts denote det A by |A|, which looks like the notation for the
modulus of a real number, but has a different meaning in this context.
This notation is also used when a matrix is written out in full; for example,
* *
* 2 1 *
* *
* −3 0 * = 2 × 0 − 1 × (−3).
When seeking the inverse of a 2 × 2 matrix, you should first check that the
matrix is invertible by calculating the determinant. If the determinant is
not zero, then you can use the rule for finding the inverse.
255
Unit 9 Matrices
det B = 3 × 8 − 4 × 6 = 24 − 24 = 0.
You can practise finding the inverses of 2 × 2 matrices in the next activity.
Determine which of the following matrices are invertible. For each matrix
that is invertible, write down its inverse.
. / . /
13 5 3 0
(a) A = (b) B =
5 2 −5 2
0 1 . /
1
−3 5 1.5 2.5
(c) C = (d) D =
5 − 13 0.5 1.5
In the next activity you’re asked to verify that the formula for the inverse
of a 2 × 2 matrix works in general.
256
4 Simultaneous linear equations and matrices
257
Unit 9 Matrices
Solution
Rearrange each equation so that the unknowns are on the
left-hand side and in the same order in each equation, and the
constant terms are on the right-hand sides.
The equations can be rewritten as follows:
2x + (−1)y = 1
0x + 3y = 2.
Now read off the matrix form from the equations.
Therefore the equations can be written in matrix form as
. /. / . /
2 −1 x 1
= .
0 3 y 2
In the next activity you can practise finding matrix forms of pairs of
simultaneous linear equations.
259
Unit 9 Matrices
In other words,
x=5 and y = 10.
260
4 Simultaneous linear equations and matrices
Strategy:
To solve a pair of simultaneous linear equations in two
unknowns using matrices
Write the simultaneous linear equations in matrix form Ax = b,
where A is the coefficient matrix, x is the corresponding vector of
unknowns, and b is the vector whose components are the
corresponding right-hand sides of the equations.
If the matrix A is invertible, then the solution is given by
x = A−1 b.
261
Unit 9 Matrices
As usual, you can check the answer to Example 6 by substituting back into
the original equations.
You can practise the method in Example 6 in the next activity.
262
4 Simultaneous linear equations and matrices
263
Unit 9 Matrices
Suppose that in the first equation a '= 0; a similar argument holds if b '= 0.
Since ad − bc = 0, we have d = bc/a. Hence c '= 0, since if c = 0 then d = 0
also. Multiplying the first equation by c/a gives
cx + (bc/a)y = ce/a
cx + dy = f.
Since d = bc/a, these two equations can be rewritten as
cx + dy = ce/a
cx + dy = f.
These equations have identical left-hand sides. So they (and hence the
original equations) have either no solution (if ce/a =
' f ) or infinitely many
solutions (if ce/a = f ).
So, in conclusion, if the coefficient matrix has determinant zero, then the
equations have either no solution or infinitely many solutions.
We can deduce from this fact that if a 2 × 2 matrix has determinant zero,
then it is not invertible. This is a fact that was mentioned in
Subsection 3.2. To see how, consider any 2 × 2 matrix with determinant
zero. If the matrix were invertible, then any pair of simultaneous equations
with this matrix as its coefficient matrix would have a unique solution,
which cannot be the case, since you saw above that if the determinant of
the coefficient matrix is zero, then the equations have no solution or
infinitely many solutions.
So you have learned the following important payoffs from the matrix
method for simultaneous linear equations.
264
4 Simultaneous linear equations and matrices
265
Unit 9 Matrices
First of all, the amounts invested in each product must add up to £9000.
So, if x, y and z represent the amounts invested (in £) in products A, B
and C, respectively, then
x + y + z = 9000.
Moreover, the investor wants to invest a total of £4000 in products B
and C, so
y + z = 4000.
Since the interest on his investment in product A is 2.5%, the investor’s
expected interest payment from his investment in product A is 0.025x.
Likewise, his expected interest payments from his investments in products
B and C are 0.05y and 0.06z, respectively. The investor is seeking an
overall return of 4%, that is, an overall interest payment of
0.04 × £9000 = £360. Thus the amounts invested in the three products
must also satisfy the equation
0.025x + 0.05y + 0.06z = 360.
Therefore, the amounts x, y and z must simultaneously satisfy the three
linear equations
x+ y+ z = 9000
y+ z = 4000 (7)
0.025x + 0.05y + 0.06z = 360.
Such systems of linear equations can be solved efficiently by methods
similar to elimination and substitution – you can probably see that
equations (7) can be solved fairly easily for x, y and z by
• first subtracting the second equation from the first to give x = 5000
• then substituting this value for x into the third equation and solving
the resulting two simultaneous equations for y and z.
Methods of this sort are covered in detail in other modules. Systems of
linear equations like those above can also be solved by using matrices, and
the rest of this subsection covers this approach.
Any system of n linear equations in n unknowns can be written as a single
matrix equation
Ax = b,
where A is an n × n coefficient matrix, x is an n-dimensional vector whose
components are the unknowns, and b is an n-dimensional vector, as you
saw for the case n = 2 in Subsection 4.2. If the coefficient matrix A is
invertible, then the matrix equation can be solved by multiplying both
sides by the inverse of A, giving x = A−1 b, just as in the 2 × 2 case.
In Section 3 you learned how to use a computer to find the inverses of
matrices of size 3 × 3 and larger. This enables you to use the matrix
method to solve systems of three or more linear equations. In the next
example, this method is used to solve equations (7).
266
4 Simultaneous linear equations and matrices
Note that you could use the computer algebra system to solve the
simultaneous equations (7) directly, that is, without first finding the
inverse of the coefficient matrix, and so check the answer to Example 7.
The solution to equations (7) found in this example tells our investor that
the best way to spread his investment in order to achieve a 4% return is as
follows:
£5000 on investment A
£500 on investment B
£3500 on investment C.
You can practise finding inverses of matrices and using them to solve
systems of more than two linear equations in the next activity.
267
Unit 9 Matrices
In fact using matrix inverses is not usually the most efficient way to solve
systems of linear equations, but the method has theoretical importance.
For example, it can be shown that, for all n ≥ 2,
• if the determinant of the coefficient matrix of a system of n equations
is non-zero, then this matrix is invertible and the equations have a
unique solution
• if the determinant of the coefficient matrix of a system of n equations
is zero, then this matrix is not invertible and the equations have no
solution or infinitely many solutions.
You saw the case n = 2 of this result in Subsection 4.2.
268
Learning outcomes
Learning outcomes
After studying this unit, you should be able to:
• use matrix notation
• evaluate sums, differences and scalar multiples of matrices
• establish whether two matrices can be multiplied together
• evaluate a matrix product both by hand and by using the computer
algebra system
• determine whether a given n × n matrix has an inverse
• work out the determinant and inverse (if it exists) of a 2 × 2 matrix by
hand
• use the computer algebra system to work out the determinant and
inverse (if it exists) of a square matrix
• write a system of linear equations in matrix form and solve it using
matrix methods where possible.
269
Unit 9 Matrices
Solutions to activities
Solution to Activity 1 Solution to Activity 4
. ./ /
We have a33 = 3, a34 = −4, a43 = −7, 4 3 12 9
a14 = −1, a44 = −9, a42 = −5. (a) (i) 3 =
2 1 6 3
The matrix A has four rows and four columns, so . / . /
1 4 6 2 3
its size is 4 × 4. (ii) =
2 2 −1 1 − 21
Solution to Activity 2 . / 0 1
2 6 x 4 23 x
. / . / . / (iii) =
2 1 4 6 6 7 3 3 y 2 23 y
(a) A + B = + =
3 0 2 −1 5 −1 . / . /
. / . / 4.5 3 9 6
2 0 −4 1 0 1 (b) (i) = 0.5
(b) A + B = + 2 1.5 4 3
−1 6 0 0 1 1 0 1 . /
. / 3
1 9
3 0 −3 (ii) 2
=
= 2
6 4
−1 7 1 3
. / . /
. / . / 2x 3x 2 3
a b 3 −1 (iii) =x
(c) A + B = + 0 5x 0 5
c d −1 2
. / In each of the answers in this part, there is
a+3 b−1
= more than one way to choose the scalar
c−1 d+2
multiple. For example, another correct
solution to part (i) is
Solution to Activity 3 . / . /
4.5 3 18 12
. / . 1 / .3 / = 0.25 .
2 −2 2 4 2 −6
2 1.5 8 6
(a) A − B = − =
1 0 −1 2 2 −2
. 1 / . / . 3 / Solution to Activity 5
2 4 2 −2 −2 6 We have
(b) B − A = − = .1 / . 3 /
−1 2 1 0 −2 2 1 2
2 2 0 −1
. 1 / . / A−C= −
2 4
0 −x 4 −2 0 −1 3 5
(c) B − C = − . /
−1 2 x 0 −1 1 3
. / = .
1
4+x 5 −5 −5
2
= Next,
−1 − x 2 . /
1 2 4
2A = .
8 −4 0
The matrix 2A has size 2 × 3 and B has size 2 × 2;
hence the sum B + 2A is not defined.
Finally,
. 3 / . /
2 0 −1
1 2 4
C − 2A = −
−1 3 5 8 −4 0
. 1 /
2 −2 −5
= .
−9 7 5
270
Solutions to activities
Solution to Activity 6 . / 1 0 1
1 2 3
For each pair of matrices in this question, the sizes (e) 1 0 −1
7 6 1
of the matrices are written underneath the matrices, 2 1 3
and the middle two numbers are boxed in order to 2× 3 3 ×3
use the method described in the text.
The middle numbers are the same, so these
2 9 . /
−2 3 1 matrices can be multiplied together. Their
(a) 7 1
4 0 5 product is
−6 8
. / 1 0 1 . /
1 2 3 9 3 8
3× 2 2 ×3 1 0 −1 = .
7 6 1 15 1 4
2 1 3
The middle numbers are the same, so these
matrices can be multiplied together. Their Solution to Activity 7
product is (a) (i) Here A has size 2 × 2 and B has size 2 × 1.
2 9 . / 32 6 47 For AB, the size check is
7 1 −2 3 1 = −10 21 12 .
4 0 5 2× 2 2 × 1.
−6 8 44 −18 34
The middle numbers are the same, so the
product AB is defined.
$ - 9
(b) 1 2 3 8 For BA, the size check is
7
2× 1 2 × 2.
1× 3 3 ×1
The middle numbers are different, so the
The middle numbers are the same, so these product BA is undefined.
matrices can be multiplied together. Their
(ii) Here both A and B have size 2 × 2, so the
product is
products AB and BA are both defined.
$ - 9 (b) For the matrices in part (a)(i), the product BA
1 2 3 8 = (46).
is not defined.
7
For the matrices in part (a)(ii),
2 9 0 10 . /. / . /
1 1 0 1 2 4
(c) 7 1 4 1 AB = =
−1 0 2 3 0 −1
−6 8 −2 7 and . /. / . /
3× 2 3 ×2 0 1 1 1 −1 0
BA = = .
2 3 −1 0 −1 2
The middle numbers are different, so these
Therefore AB '= BA.
matrices cannot be multiplied together.
9 $ - Solution to Activity 8
(d) 8 1 2 3
(a) The size of A is 1 × 2 and the size of B is 2 × 2.
7 The size check for the product AB is
3× 1 1 ×3 1× 2 2 × 2.
The middle numbers are the same, so these Thus AB has size 1 × 2.
matrices can be multiplied together. Their
product is The size of C is 2 × 3. The size check for the
product BC is
9 $ - 9 18 27
8 1 2 3 = 8 16 24 . 2× 2 2 × 3.
7 7 14 21 Thus BC has size 2 × 3.
271
Unit 9 Matrices
272
Solutions to activities
273
Unit 9 Matrices
274
Solutions to activities
275
Unit 9 Matrices
Solution to Activity 23 is 0 1. / . /
3
Using the property of matrix products given in the 5 − 45 x 18
= .
hint, we have 4 3 y −1
. /. . // 5 5
a b 1 d −b
AB = Solution to Activity 26
c d ad − bc −c a
. /. / (a) Activity 25(a) gives the matrix form of the
1 a b d −b
= equations as
ad − bc c d −c a . /. / . /
. / 2 3 x 3
1 ad − bc 0 = .
= 1 4 y −1
ad − bc 0 ad − bc
. / The determinant of the coefficient matrix is
1 1 0 2 × 4 − 3 × 1 = 5,
= × (ad − bc)
ad − bc 0 1
so the inverse of the coefficient
1 matrix is
. / . / 0 4 3
1 0 1 4 −3 5 −5
= = I. = .
0 1 5 −1 2 −51 2
5
Moreover, Hence
. . // . / . / . /. /
1 d −b a b x 1 4 −3 3
BA = =
ad − bc −c a c d y 5 −1 2 −1
. / . /
1 ad − bc 0 3
= = I. = .
ad − bc 0 ad − bc −1
Hence AB = BA = I, and it follows that B = A−1 . The solution is x = 3, y = −1.
Solution to Activity 25 (b) Activity 25(b) gives the matrix form of the
equations as
(a) The coefficient matrix of the equations . /. / . /
2 0 x 6
2x + 3y = 3 = .
3 6 y 15
x + 4y = −1 The determinant of the coefficient matrix is
. /
2 3 2 × 6 − 0 × 3 = 12,
is . Thus, the matrix form of the
1 4
so the inverse of the coefficient
1 matrix is
equations is
. /. / . / . / 0 1
1 6 0 0
2 3 x 3 = 2
.
= . 12 −3 2 − 14 16
1 4 y −1
(b) First rearrange the first equation as Hence
. / . /. / . /
x 1 6 0 6 3
2x = 6. = = .
y 12 −3 2 15 1
The matrix form of the equations
The solution is x = 3, y = 1.
2x =6
(c) Activity 25(c) gives the matrix form of the
3x + 6y = 15
equations as
is . 0 1. / . /
/. / . / 3 4
2 0 x 6 5 −5 x 18
= . = .
3 6 y 15
4 3 y −1
5 5
(c) The matrix form of the equations The determinant of the coefficient matrix is
3 4 $ 4- 4
5 x − 5 y = 18
3 3
5 × 5 − − 5 × 5 = 1,
4 3
5 x + 5 y = −1 so the
0 inverse1of the
0 coefficient
1 matrix is
3 4 3 4
1 5 5 5 5
= .
1 − 54 53 − 54 35
276
Solutions to activities
Hence We have
. / 0 3 41 . / −1
x 5 5 18 3 2 1 2 −3 1
= 4 3 1 = −3 4 −1 .
y − 54 35 −1
0 1 . / 7 5 1 1 1 −1
50
5 10 Therefore
= = .
− 75 −15 x 2 −3 1 1 −3
5
The solution is x = 10, y = −15. y = −3 4 −1 2 = 4 .
z 1 1 −1 1 2
Solution to Activity 27 The solution is x = −3, y = 4, z = 2.
(a) The matrix form of the equations is (Details of how to use the CAS to find the
. /. / . /
5 −3 x 5 inverses of the matrices are given in the
= .
−1 4 y 7 Computer algebra guide, in the section
The determinant of the coefficient matrix is ‘Computer methods for CAS activities in
5 × 4 − (−3) × (−1) = 17. Books A–D’.)
This is non-zero, so the coefficient matrix is Solution to Activity 29
invertible and the equations have a unique
solution. (a) The matrix form of the equations is
1 2 3 x 5
(b) The matrix form of the equations is 2 3 1 y = 6 .
. /. / . /
3 −1 x −6 4 7 7 z 1
= .
−9 3 y 18
The determinant of the coefficient matrix is
The determinant of the coefficient matrix is zero, so the coefficient matrix is non-invertible
3 × 3 − (−1) × (−9) = 0. and the equations do not have a unique
This is zero, so the coefficient matrix is solution.
non-invertible and the equations do not have a (b) The matrix form of the equations is
unique solution.
2 2 3 x 5
2 3 1 y = 6 .
Solution to Activity 28
4 7 7 z 1
(a) The matrix form of the equations is
The determinant of the coefficient matrix is 14.
2 2 3 x 2 This is non-zero, so the coefficient matrix is
2 3 4 y = −1 .
invertible and the equations have a unique
4 7 10 z 3 solution.
We have (c) The matrix form of the equations is
−1 1 1
2 2 3 1 2 −2 1 1 −1 1 x 0
2 3 4 = −2 4 −1 . 1 −1 1 1 y 2
= .
4 7 10 1 −3 1 2 0 3 −1 w 1
6 2 4 0 z 0
Therefore
1 1 The determinant of the coefficient matrix is
x 1 2 −2 2 0
y = −2 zero, so the coefficient matrix is non-invertible
4 −1 −1 = −11 .
z 3 8 and the equations do not have a unique
1 −3 1
solution.
The solution is x = 0, y = −11, z = 8.
(Details of how to use the CAS to find the
(b) The matrix form of the equations is determinants are given in the Computer algebra
3 2 1 x 1 guide, in the section ‘Computer methods for
4 3 1 y = 2 . CAS activities in Books A–D’.)
7 5 1 z 1
277
Acknowledgements
Acknowledgements
Grateful acknowledgement is made to the following sources:
Every effort has been made to contact copyright holders. If any have been
inadvertently overlooked the publishers will be pleased to make the
necessary arrangements at the first opportunity.
278
Index
Index
acceleration due to gravity 70 determinant of a matrix 255
addition of matrices 223 differentiation
properties of 228 of a composite function 25, 27
antiderivative 56 of a function of a linear expression 31–32
change in the value of 75, 146 of a product 14
antidifferentiation 56 of a quotient 19
arbitrary constant 57, 151, 154, 164 of an inverse function 51
arc of a network 239 using a computer 44
dimension of a vector 222
Barrow, Isaac 145 displacement 146
Bernoulli, Jacob 125 dummy variable 122
Bernoulli, Johann 125
element (entry) of a matrix 221
Cauchy, Augustin Louis 145 equivalent networks 246
Cayley, Arthur 223 exp, derivation of derivative of 10–11
chain rule
in Lagrange notation 30 Fourier, Joseph 126
in Leibniz notation 25 function of a linear expression
without introducing an extra variable 27–30 differentiating 31–32
coefficient matrix of simultaneous equations 258 integrating 169–170
column of a matrix 221 function, continuous 56, 106
component of a vector 222 fundamental theorem of calculus 132, 134
composite rule see chain rule history 145
constant function, indefinite integral 66
constant multiple rule greatest value of a function on an interval 39
for antiderivatives 64
for definite integrals 140 identity matrix 251
for indefinite integrals 153 identity, trigonometric 189
for square bracket notation 138 indefinite integral 57, 151
constant of integration 57, 151, 154, 164 notation 151
continuous function 56, 106 of a constant function 66
cos, derivation of derivative of 7–8 of a function of a linear expression 169–170
cube of a matrix 237 of a power function 61
of ln 183
decomposing a composite function 24–25 of tan 166
definite integral 119, 124, 126 of the reciprocal function 80
evaluating 124, 135 standard 82
properties of 120 integral (see also indefinite integral) 152
derivative definite 119, 124, 126
of a composite function 25, 27 indefinite 151
of a function of a linear expression 31–32 standard 82
of a product 14 integral notation 119, 152
of a quotient 19 integral sign 119
of an inverse function 51 history 125
of exp and ln 10–13 integrand 136, 152
of sin, cos and tan 4–9
standard 54
279
Index
281