Alevel Support Calculus1
Alevel Support Calculus1
org/step
General comments
Stationary Points . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Optimisation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Reversing Differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
There are some suggested NRICH and Underground Mathematics problems at various places in
this document. You can find many more by visiting https://undergroundmathematics.org/ and
https://nrich.maths.org/. You might find the search facilities useful to find problems and articles
on specific topics.
Some definitions
The above sketch shows a curve y = f(x). At point A on the curve the line that touches the curve
at this point has been drawn. This means that the curve and the line meet at this point, and they
are heading in the same direction — this line is called the tangent to the curve at the point A. The
gradient of the curve at point A is equal to the gradient of the tangent at this point.
Another way of thinking of the point where a line touches a curve is that it is the limiting case
between the line intersecting the curve twice near the point and not intersecting it at all. If you
moved the tangent line in the above example a little bit it would either intersect the curve twice
near A or it would not meet the curve near A at all. If you equate the equations of the curve and
line then the point where they touch corresponds to a repeated root of the equation.
The line that passes through point A on the curve and is perpendicular to the tangent is called the
normal to the curve at point A. A line joining two distinct points A and B on the curve is called a
chord. As the point B gets closer to A then the gradient of the chord tends to the gradient of the
tangent, as shown in the picture below:
You can explore how the tangent to the curve changes as the point A moves around the curve, and
how the gradient of the chord tends to the gradient of the tangent as the end points of the chord
get closer by using this GeoGebra diagram.
The gradient of a curve at a point can be found by considering what happens to the gradient of
the chord as the end points get closer together. Consider two points on a curve y = f(x). The
horizontal distance between then is δx and the vertical distance between them is δy.1
δy
The gradient of the chord is given by and as δx gets smaller, δy gets smaller. The gradient of
δx
the curve at a point is given by:
dy δy
= lim
dx δx→0 δx
δy
You cannot evaluate when δx = 0, as then the fraction would be undefined, but you can look at
δx
the behaviour of the fraction as δx gets smaller and smaller. This is called a limit, and the notation
lim means “take the limit as δx tends to 0”.
δx→0
δy dy
Note that is a fraction whereas is not a fraction.
δx dx
Alternatively, we can consider the two points on the curve to be at x, f(x) and x + h, f(x + h) .
f(x + h) − f(x)
Then the gradient of the chord between them is and we have:
(x + h) − x
f(x + h) − f(x)
f 0 (x) = lim (*)
h→0 h
dy
There are two different notations used here for the derivative function. The notation is called
0
dx
Leibniz’s notation, and the notation f (x) was first used by Joseph Louis Lagrange in some of his
unpublished works. Other notations include the dot notation used by Isaac Newton.
1
The Greek letter δ is often used to represent a small change in a quantity, so δx is a small change in x. Note
that δ is a prefix rather than a parameter in it’s own right, so δx is a single parameter.
δy
You cannot cancel the δs in !
δx
d d
If a is a constant then af(x) = a × f(x)
dx dx
d d d
f(x) + g(x) = f(x) + g(x)
dx dx dx
Stationary Points
dy
Stationary points on a curve are where the gradient is equal to 0, i.e. points where = 0.
dx
Example: find the stationary points of y = 3x5 − 5x3 .
Differentiating gives:
dy
= 15x4 − 15x2
dx
0 = 15x2 (x2 − 1)
0 = x2 (x + 1)(x − 1)
=⇒ x = −1, x = 0, x = 1
Substituting these values for x into y = 3x5 − 5x3 gives the stationary points as being at (−1, 2),
(0, 0) and (1, −2).
Sketching the graph (which you could do using Desmos or GeoGebra) shows these three stationary
points.
The point (−1, 2) is a local maximum, the point (0, 0) is a stationary point of inflection3 and (1, −2)
is a local minimum. Local maxima and minima are sometimes called turning points.
This GeoGebra page shows how the gradient of the curve changes as you move along it. Note that
the gradient of the curve is equal to zero at each of the stationary points, and for the point (0, 0)
there is a local maximum of the gradient. There are also two local minima of the gradient, these
correspond to non-stationary points of inflection on the original curve.
There is more about points of inflection in this STEP Support Programme assignment.
3
You can also get non-stationary points of inflection!
At local maxima, the gradient of the curve is changing from being positive to being negative, and
at a local minimum the gradient is changing from being negative to being positive. This means
that the gradient at a maximum is decreasing, and so “gradient of the gradient” is negative. The
gradient at a minimum is increasing and the “gradient of the gradient” is positive.
d2 y
The “gradient of the gradient” is the second derivative, , and is found by differentiating the
dx2
gradient of a curve twice. We can sometimes use the second derivative to help us determine the
nature of stationary points.
d2 y
If < 0 then the gradient is decreasing and the curve has a maximum stationary point
dx2
d2 y
If > 0 then the gradient is increasing and the curve has a minimum stationary point
dx2
d2 y
If = 0 then the stationary point could be any of the three types (maximum/minimum/point
dx2
of inflection) and you need a different approach to classify the stationary point. A couple of ways
of doing this are:
Look at the sign of the gradient either side of the stationary point. If the gradient goes from
positive to negative then it is a maximum, if it goes from negative to positive then it is a
minimum and if it stays the same sign either side of the stationary point then it is a point of
inflection.
Look at how the y-coordinate changes either side of the stationary point by looking at
points slightly to each side. For example, if y = 3x5 − 5x3 then considering the points
with x = 0.9, x = 1 and x = 1.1 gives points with coordinates (0.9, −1.87353), (1, −2) and
(1.1, −1.82347). This shows that if you travel a little bit either side of (1, −2) the y-coordinate
increases slightly and so (1, −2) is a minimum.
Try looking at the points on y = 3x5 − 5x3 with x-coordinates equal to −0.1, 0, and 0.1 and see if
you can use these to show that (0, 0) is a point of inflection.
You might like to try Underground Mathematics’ Gradient Match and Floppy Hair.
Optimisation
You can (sometimes!) use the derivative to find maximum or minimum values. Care is needed
as the maximum/minimum value of a function in a given range might not be at a local min-
imum/maximum. Sketching a graph can often be a good way of making sure your answer is
sensible.
Example: A piece of paper measure 20 cm by 30 cm. Equal sizes squares are cut out of each
corner and the sides turned up to make an open box. What is the maximum volume of this box?
Let the cut out squares have side length x. This means that the width of the box is 30 − 2x and
then depth of the box is 20 − 2x. The height of the box is x. The volume (V ) of the box is given
by:
V = x(30 − 2x)(20 − 2x)
Expanding and differentiating gives:
Since the second derivative is negative then the stationary point must be a local maximum.
If the second derivative was equal to zero then you would need to use another method to verify
that the stationary point is a maximum. A couple of methods have been mentioned in the previous
section, but you could also sketch a graph of the function. The volume V = 4x3 − 100x2 + 600x
is a cubic function with roots at x = 0, x = 10 and x = 15 (which are fairly obvious roots when
you think about the problem!). These roots, and the fact that it is a “happy” cubic, are enough
to sketch the curve and then you can see that the stationary point between x = 0 and x = 10 is a
maximum.
You can use the gradient of a curve and equations of straight lines to find the equations of tangents
(and normals - see the diagram on page 2) to the curve.
(a) Find the equation of the tangent at the point (−1, 0). Find the coordinates of the point where
this tangent meets the curve again.
(b) Find the equation of the normal at the point (−1, 0). Show that this normal does not intersect
the curve again.
It is often a good idea to sketch the curve first. This curve crosses the x axis at x = −1, 0, 1 and
it is a “happy” cubic.
(a)
dy
Expanding the equation for the curve gives y = x3 − x and differentiating this gives = 3x2 − 1.
dx
Therefore when x = −1 the gradient of the curve is equal to 3 × (−1)2 − 1 = 2.
Using y − y1 = m(x − x1 ) the equation of the tangent is y − 0 = 2(x − −1) =⇒ y = 2x + 2.
To find where the tangent meets the curve again we can equate y expressions and solve 2x + 2 =
x3 − x. Rearranging gives:
x3 − 3x − 2 = 0
(x + 1)(x2 − x − 2) = 0
(x + 1)(x + 1)(x − 2) = 0
(x + 1)2 (x − 2) = 0
A couple of points to note here. I could have used the factor theorem to show that f(−1) = 0, but
since the straight line is a tangent to the curve when x = −1 I already know that x = −1 must be a
root and hence (x + 1) is a factor (which I can divide out by inspection). Note that I get a repeated
root at x = −1, this is because at this point the line is a tangent4 . Sometimes it is possible for a
tangent to meet a curve again — a common misconception is that a tangent touches a curve once
and does not meet it again at all.
The point where the tangent intersects with the curve again is when x = 2 and so y = 23 − 2 = 6.
The coordinates are therefore (2, 6).
(b)
The gradient of the normal to the curve when x = −1 will be − 12 and so the equation is y − 0 =
− 21 (x + 1) =⇒ y = − 21 x − 12 . To find any other points of intersection solve − 12 x − 12 = x3 − x.
1 1
x3 − x + = 0
2 2
3
2x − x + 1 = 0
(x + 1)(2x2 − 2x + 1) = 0
4
Sometimes a helpful way of finding where tangents are is to search for repeated roots
√
2± 4−8
The roots of the quadratic are x = and hence there are no more real roots and the
4
normal does not meet the curve again.
This GeoGebra page shows the cubic with a tangent and normal at the point (−1, 0). You can
move the point along the curve and see when the tangent intersects the curve again and when it
doesn’t.
If you consider the equation of the tangent at a general point on the curve, (t, t3 − t) you can
show that the tangent always intersects with the curve at one other point. You can also show that
normal will intersect the curve again when − √13 < t < √13 , but this is a little trickier.
You might like to explore the Underground Mathematics resource Tangent or normal.
Reversing Differentiation
This Geogebra page shows a cubic of the form y = x3 − x + c and the gradient function (derivative)
of that cubic. You can change c to see how the curve and gradient function are related.
In general we have:
dy 1
= xn =⇒ y = xn+1 + c for n 6= −1
dx n+1
where c is a constant (it is often called the constant of integration).
Note that we need to state that n 6= −1. There is a result that holds when n = −1, but it does not
follow this rule. The case when n = −1 will be covered in another set of notes.
The sketch below shows a continuous curve y = f(x). Consider the area between the curve y = f(x)
and the x axis, and the vertical lines x = a and x = b. We can consider the area to be the sum of
a lot of small5 strips.
If we look at the shaded strip which has a width of δx and an area of δA and approximate it with
a rectangle we have:
δA ≈ y δx (*)
δA
≈y
δx
Then, taking the limit as δx tends to 0 we have:
δA
lim =y
δx→0 δx
dA
=y
dx
This then becomes a “reverse differentiation” problem like those in the previous section.
=⇒ y = x3 − x + c
The notation “ 3x2 − 1 dx” means “the integral of 3x2 − 1 with respect to x”. Sometimes the
R
function you are trying to integrate is not written in terms of the variable you want to integrate
with respect to — these sorts of situations will be discussed later in another set of notes.
To represent the area under a curve between x = a and x = b we can introduce limits on the
integral to show where we start and stop measuring the area from. We can write:
Z b
A= y dx
a
To evaluate the above integral we must think a little more about what it means.
Consider the area under the curve y = f(x), and let A be the area under the graph from a to x
(where x can vary!). Let the integral of f(x) be F(x) + c, so we have A = F(x) + c.
When x = a the area between a and x is equal to 0. Hence when x = a we have A = 0. This gives:
0 = F(a) + c
c = −F(a)
Hence we have A = F(x) − F(a), and in particular the area under the curve between a and b is
F(b) − F(a).
Note that this means that when we are considering an integral with limits (also called a definite
integral) we do not need to include the constant of integration.
√
Example: Find the area enclosed by the curve y = x, the x axis and the lines x = 1 and x = 9.
Z 9
1
We need to find x 2 dx:
1
9 3 9
Z
1
2 2
x2 dx = 3x
1 1
2 2
= 3 × 27 − 3 ×1
= 17 31
Example: Find the area enclosed by the curve y = x(x − 2)(x − 4), the x axis and the lines x = 1
and x = 3.
We have:
Z 3 Z 3
x(x − 2)(x − 4) dx = x3 − 6x2 + 8x dx
1 1
4 3 2 3
1
= x − 2x + 4x
4 1
81 1
= − 2 × 27 + 4 × 9 − −2+4
4 4
= 2 41 − 2 14 = 0
It seems a bit surprising to have zero area, and a quick sketch suggests that the area isn’t equal to
0.
When using integration to find an area which lies below the x axis then this is identified with a
negative sign. This means that if a curve passes below the x axis then you need to consider the
regions above and and below the x axis separately6 . In this case the area is given by:
Z 2 Z 3
x(x − 2)(x − 4) dx + x(x − 2)(x − 4) dx = 4 − ( 14 − 2 + 4) + −1 43
1 2
= 1 34 + 1 34 = 3 12
You might not have met modulus “|x|” signs before — these find the “size” or magnitude of a
number, and when dealing with real numbers they remove any negative signs. Another term for
“the modulus of x” is “the absolute value of x”, and some calculators have an “ABS” function.
In this section we have shown that the area enclosed by the curve y = f(x), the x axis and the
Z b
lines x = a and x = b is given by y dx. If instead we wanted to find the area enclosed between
a
y = f(x), the y axis and the horizontal lines y = c and y = d then we would to evaluate the integral
Z y=d
x dy. You would need to rearrange y = f(x) to get x in terms of y in order to do this (i.e.
y=c
find g such that x = g(y)).
6
Yet another reason why sketching a graph is often a good idea!
Consider the area between the curves y = f(x), y = g(x) and the lines x = a and x = b.
Z b
This area is given by f(x) − g(x) dx. This formula holds as long as f(x) > g(x) for all values
a
of x in the integration range, even if the curves fall below the x axis. However if the curves cross
at any point then the integral needs to be split up into the different regions for f(x) > g(x) and
g(x) > f(x).
You might like to try Area between a parabola and a line and Area between a parabola and a
tangent both from Underground Maths.
The Fundamental Theorem of Calculus says that for a continuous function f(x)7 :
Z x
dF
= f(x) ⇐⇒ F(x) − F(a) = f(t) dt ,
dx a
Note the use of a different letter for the variable in the first integral. Here t is called a ‘dummy
variable’: it doesn’t matter what letter you use because it will disappear when you do the integral
and evaluate the result at t = x and t = a. The only letters you should not use as a dummy
variable here are x and a, because they have been used as limits.
Z b Z c
f(x) dx + f(x) dx = F(b) − F(a) + F(c) − F(b)
a b
= F(c) − F(a)
Z c
= f(x) dx
a
Z b Z a
Property 2: f(x) dx = − f(x) dx
a b
Z b
f(x) dx = F(b) − F(a)
a
= − F(a) − F(b)
Z a
=− f(x) dx
b
Z s
1
1. dx where s > 1
1 x2
Z 1
1
2. dx where 0 < r < 1
r x2
Z s
1
3. √ dx where s > 1
1 x
Z 1
1
4. √ dx where 0 < r < 1
r x
Z
x−2 dx = −x−1 (+c) and for the second two we have
For the first two we have
Z
1 1
−2 −2
x dx = 2x (+c). This gives the four definite integrals:
Z s
1 1 1
1. 2
dx = − − (−1) = 1 −
1 x s s
Z 1
1 1 1
2. 2
dx = −1 − − = −1
r x r r
Z s
1 √
3. √ dx = 2 s − 2
1 x
Z 1
1 √
4. √ dx = 2 − 2 r
r x
Now consider what happens as s → ∞ and r → 0. For the first two integrals we have:
Z s
1 1
lim dx = lim 1 − =1
s→∞ 1 x2 s→∞ s
and Z 1
1 1
lim dx = lim − 1 =??
r→0 r x2 r→0 r
For this second integral we can see that as r → 0, the value gets larger and larger, and when r = 0
1
it is undefined. This Geogebra file shows the areas under y = 2 with sliders for r and s, and you
x
can use this to help you visualise the limits as s → ∞ and r → 0. If you right click on the sliders
you can change the settings so that you can investigate certain ranges of r and/or s in more detail.
and Z 1 √
1
lim √ dx = lim 2 − 2 r = 2
r→0 r x r→0
In this case it is the first integral which is undefined and the second interval which converges to
a limit. This Geogebra file illustrates the areas in these cases. Like in the previous one, you can
right click on the sliders and change the settings.
Z ∞
1
An integral of the form dx = 1 is called an infinite integral.
1 x2
Z ∞
1 1
Often people will write something along the lines of 2
dx = 1 − = 1 − 0 = 1 which makes
1 x ∞
some mathematicians wince, but is basically shorthand for letting s → ∞. Probably best not to
write this during an interview to study maths at university!
Z 1
1
An integral of the form √ dx = 2 is called an improper integral, as at a value in the range
0 x
of integration the integrand is undefined. You do need to be careful and check for undefined
values
Z 1 within the range of integration, otherwise you might end up with nonsensical results like
1
2
dx = 0.
−1 x