From Graphs To Calculus
From Graphs To Calculus
Mathematics
Curriculum HILARY SHUARD
HUGHNEILL
From Graphs
to Calculus
Blackie
The Mathematics Curriculum
FROM GRAPHS
TO CALCULUS
£V- :.;
V
N22321
The Mathematics Curriculum: A Critical Review
was a project set up by the Schools Council at the University of Nottingham 1973-77
Geometry
From Graphs to Calculus
Mathematics across the Curriculum
Number
Algebra
Counting and Configurations
Mathematics in the World
Eleven to Thirteen
The Mathematics Curriculum
FROM GRAPHS
TO CALCULUS
Written for the Project by
HILARY SHUARD and HUGH NEILL
It may seem strange that a book which includes the word Further, every school leaver should be able to extract and
"calculus" in its title should appear in a series written about the understand the information contained in the simpler forms of
mathematics curriculum for 11 to 16 year olds. However, although graph. More and more facts are being presented in this way by
the beginnings of formal calculus are the end-point of this book, industry and government, and it is vital that people should have
the authors believe that ideas such as measurement of area and the formed the habit of looking critically at suggested interpretations
concept of a limit should be encountered by every pupil, whether or of the data. This is an important prerequisite for a responsible
not he goes on to use these in differentiation and integration. society.
J. V. Armitage
Acknowledgements
The authors wish to thank the many people who have in various Lastly, the authors would like to acknowledge with gratitude
ways contributed to the production of this book. Its design and permission to reproduce material from:
outlook were greatly influenced by the planning team, whose The School Mathematics Project, Books A to H, and X to Z,
meetings were both thought-provoking and fun. The authors hope Cambridge University Press, 1968 to 1974
that the team's fresh approach to what at first seemed well-worn The School Mathematics Project, Books 1 to 5. Cambridge
mathematical paths is reflected in the book. The members of the University Press, 1965 to 1969, and Metric Edition, 1969 to
team were: 1971
Dr. J. A. Anderson, Department of Mathematics, University of The Scottish Mathematics Group, Modern Mathematics for
Nottingham Schools, Books 1 to 9, Second Edition, Blackie and Chambers,
Mr. T. Farrell, Sarah Metcalfe Secondary School, Mid- 1971 to 1975
dlesbrough C. V. Durell, Certificate Mathematics, Volumes 1 to 4, Bell, 1974
Mr. A. Gorringe, Chosen Hill School, Churchdown, Nr. Glou- to 1975
cester Nuffield Mathematics Project: Pictorial Representation, Graphs
Mr. D. S. Hale, H.M.I. Leading to Algebra, Chambers and Murray, 1967
Mr. M. W. Pedelty, South Hackney Secondary School, London
Science 5/13: Structures anil Forces, Macdonald Educational,
Professor R. L. E. Schwarzenberger, Department of Mathe-
1972-73
matics, University of Warwick
Mr. E. B. C. Thornton, Claremont Teachers' College, Perth, SMP, Revised Advanced Mathematics, Book Cambridge
Australia University Press, 1973
They received perceptive comments on, and constructive criti- Reference has also been made to:
cism of, the planning material from Mr. J. K. Backhouse, Dr. T. J. J. B. Channon, A. McLeish Smith, H. C. Head, New General
Fletcher and from working groups of teachers in North and South Mathematics. Books 1 to 4. Longman, 1970 to 1971
London. D. E. Mansfield, D. Thompson, M. Bruckheimer, Mathematics: A
They would also like to thank Mrs. J. A. Gadsden, who has New Approach, Chatto & Windus, 1965
prepared an endless succession of typescripts, together with all the Nuffield Physics Teachers' Guide I, Longmans/Penguin, 1966
diagrams, and has helped the authors in countless other ways. B. Holland, P. Rees, Maths Today, Books I to 4, Harrap, 1975
vn
Contents
Introduction CHAPTER 5 Further Developments of the Function Idea 32
5.1 Introduction 32
5.2 Doing without the "rule" 32
CHAPTER The Early Stages of Graphical Work 1
5.3 Review of texts 33
1.1 Graphical literacy 1
5.4 Recommendations 36
1.2 Beginnings 1
1.3 The introduction of Cartesian graphs 6
1.4 The expectation of science teachers 8 CHAPTER 6 Some Special Graphs 39
6.1 Straight-line graphs and proportionality 39
6.2 Other examples of proportionality 40
CHAPTER 2 Graphs as Mathematical Models 10 6.3 The reciprocal function 41
2.1 Graphical models 10 6.4 Transformations of some common graphs 42
2.2 Discontinuous graphs as models 12 6.5 Graphs of the sine and cosine functions 43
2.3 Aim of graphical work 13 6.6 The exponential function 46
2.4 An example 13 6.7 Logarithms 49
2.5 Looking forward 15
2.6 A recommendation 15
CHAPTER 7 What Is Area? 50
7.1 An approach 50
CHAPTER 3 The Idea of a Function 17 7.2 Sandwich arguments 52
3.1 Dependence 17 7.3 Two remarks 52
3.2 Historical development of the idea of a 7.4 An example of a sandwich inequality 54
function 17 7.5 Volume 54
3.3 Elementary notation and wording 19
3.4 Use of the word /'unction in "traditional"
CHAPTER 8 First Ideas of Speed and Gradient 55
texts ' 19
8.1 First experiences 55
8.2 Average speed 55
CHAPTER 4 Functions in the Early Secondary Years 21 8.3 Gradient of a straight-line distance-time
4.1 The introduction of functions 21 graph 56
4.2 Graphical representation of functions 21 8.4 The gradient of a straight-line graph as a
4.3 The labelling of Cartesian graphs 27 rate 57
IX
CONTENTS
CHAPTER 16 Some Applications of Integration and Differen Suggestions for Further Reading 156
tiation 129
16.1 Displacement, velocity and acceleration 129 Index 157
Introduction
This book is about graphs, their drawing, their interpretation, "O" Level syllabuses which do not include area under a graph or
their development and their use. It discusses the teaching of graphs gradient of a curve. CSE syllabuses contain just the same range of
from their early introduction in secondary schools as far as the views about the extent of graphical work as GCE syllabuses do.
beginnings of integral and differential calculus. The introduction of a common system of examinations at sixteen
Graphs are used in order to convey in a simple pictorial and would, no doubt, produce syllabuses which reflect a similar range
immediate way ideas which otherwise would require many words, of opinion and practice.
figures or symbols to portray. "Every picture tells a story" is a Some teachers who argue against teaching integration and
particularly apt saying in this context; in fact, to those who differentiation to the more-able pupils before sixteen, do so on the
understand the message, the graphical picture avoids the need for grounds that an introduction to an important topic shortly before
the story by saying it all. an examination may lead to a hurried approach in which a proper
In order to extract as much as possible from graphical messages, understanding takes second place to the learning of rules to answer
pupils must learn to read the messages and to become fluent in the examination questions. It is unfortunate that the types of question
vocabulary of graphical language. Some pupils will learn to read set in a number of GCE and CSE examinations indicate clearly
these messages more quickly than others, and these more-able that an expectation of rote learning is usual in this area of work.
students will progress to the stage where they appreciate the Other teachers argue that it is very important that able pupils
significance of area and gradient; they can then go on to study whose major interests lie outside mathematics, and whose formal
integration and differentiation. Other pupils who are not so adept mathematical education will end at age sixteen, should see
at the language will not progress much further than reading and something of the power of calculus. Some of these pupils will, in the
understanding block graphs. All pupils, whatever their ability, sixth form, study quantitative subjects such as economics or
should learn to read as many of the messages contained in graphs biology or geography, where some knowledge of calculus is very
as they can, for otherwise they will be deprived of a means of useful. Others will eventually take up positions of responsibility in
communication of proven use. government, industry or commerce, and will need to make
It is interesting to observe the differences of opinion between decisions on the basis of numerical and graphical information.
teachers of mathematics about the depth of study of graphs which The authors of this book take the second view. They believe that
is appropriate for pupils of various abilities. Some GCE "O" Level a well-developed understanding of mathematics is a national
syllabuses recognize that the concepts of area and gradient are resource. The more numerate the decision makers, the more likely
closely associated with graphs, and require that integration and the decisions are to be well informed. It is because government,
differentiation should be taken as far as obtaining algebraic industry and business often communicate quantifiable ideas by
formulae and their application. Other syllabuses require only graphs, and because a knowledge of calculus helps citizens to read
approximate calculation of area and gradient. There are still other and understand these graphs, that calculus assumes its position of
xn
INTRODUCTION Xlll
importance. We cannot, as a nation, afford not to develop this reference has been made to computers, as the principles underlying
mathematical skill in our pupils. the use of calculators and computers are the same.
However, while holding the second view, the authors try to meet The discussion of the teaching of graphs, area and gradient form
the criticisms expressed by those who hold the first view by keeping the major part of this book. There follows an essay by Professor
integration and differentiation firmly embedded in the context of R. L. E. Schwarzenberger of the Mathematics Department,
graphs, and by deploring the mere mechanical "learning of rules" University of Warwick. This essay traces the historical develop-
which is not based on understanding of the central concepts of area ment of integration and differentiation, and relates history to
and gradient. The authors hope that those teachers who disagree teaching. In addition to contributing to the book. Professor
with their view, and first teach calculus as part of a sixth-form Schwarzenberger has enthusiastically given the authors most
course, will nevertheless find material which is of interest to them. valuable help and guidance throughout its preparation.
New developments in the mathematics curriculum have affected The authors believe that pupils should build up their knowledge
the teaching of graphs in various ways. First, there have been of important ideas over a number of years, treating the ideas in
considerable changes in the teaching of functions. In many greater depth and using more powerful techniques as their own
syllabuses, functions are taught much earlier than before, some- thinking develops. The two key ideas of the calculus area and
times appearing as special kinds of relations, and often gradient are treated in this way in this book, and the order of the
illustrated by types of diagrams other than the traditional chapters reflects this development. In the early secondary years the
Cartesian graph. Another development has been the increased use treatment is graphical and pictorial. Pupils' increasing arithmetical
of approximate methods to calculate areas under graphs and skill enables many areas under curves to be evaluated numericallv
gradients of graphs. This development is a most valuable stepping and, as the concept of ratio develops, pupils are able to find
stone before the introduction of integration and differentiation, gradients numerically. The final stage of abstraction is the
thereby reducing the number of new ideas with which the beginner generalization to algebraic formulae for integrals and derivatives,
in calculus has to cope at the same time. and the link between integrals and derivatives. By the age of
Both these developments have meant that textbook writers have sixteen, the most-able pupils are reaching a level of thinking when
incorporated new ideas into their books. The new material on they can handle these abstract ideas. For the rather less able,
relations and functions has demanded decisions about their concentration on the details of algebraic manipulation often
relative importance, while the material on area and gradient has prevents appreciation of the underlying ideas, and such pupils
required the writers to handle analytical ideas of some delicacy. should postpone algebraic work on area and gradient until later.
A third development is very new indeed. Modern electronic However, there is a powerful argument in favour of their meeting
calculators are now so cheap and so easily available that they have the ideas of the calculus in graphical and numerical settings which
opened up the possibility of a greatly increased emphasis on enable them to have a conceptual background for the later
numerical work, particularly in the teaching of limits and in- introduction of formal calculus, which is now needed in so many
tegration. These ideas are discussed in some detail in this book. sciences and other subjects. Thus, many teachers will not expect
The greater availability of computing facilities also increases the their pupils to cover the content of this book by the age of sixteen,
possibility of numerical exploration of these ideas, but no specific but the book goes as far as the fundamental theorem in calculus, a
XIV INTRODUCTION
LINKING THE
DEFINITE INTEGRAL
WITH DIFFERENTIATION
15
1 The Early Stages of Graphical Work
1.1. Graphical Literacy some literacy in graphical work, for the citizen who cannot read a
graph is a handicapped member of society.
Few pupils nowadays reach the age of 11 without having some
experience of drawing block graphs; at this stage, however, most
1.2. Beginnings
children's appreciation of graphical representation is vivid but
imprecise. During the years from 11 to 16 they should develop a We start by looking at some typical examples of graphs drawn
much greater understanding of how information can be conveyed by children before the age of 11. These examples draw our
by Cartesian graphs, and a knowledge of the fact that some attention to several conceptual developments which children need
information cannot be represented in this way. Their early to make in their graphical work. The examples are drawn from the
experience should also help pupils to acquire, through informal Teacher's Guides (Pictorial Representation and Graphs Leading to
examples, an idea of the essential characteristics of functions. Algebra) to the Nuffield Mathematics Project', among the children
Pupils should also come to link together the Cartesian graph and who enter secondary schools will be found some who are capable
other pictures of a function with algebraic representations such as of drawing each type of graph illustrated. However, the repre-
formulae. Graphs are often used to convey information about sentation most commonly used in primary schools is that of block
functions in a vigorous immediate way, but a graph will only get its graphs, and a number of children will have had no other experience
message across to a person who has learnt to read that message. of drawing graphs.
The growth of graphical literacy, both in the reading of graphs and Seven-year-old Richard has drawn his graph (figure 1.1) so that
in the interpretation of their messages, should be one of the main each square of the graph represents a child. Whole squares are
aims of graphical work for all pupils. coloured, and it is natural for Richard to write the numbers of
The more-able pupils will be able to draw graphs and to make children represented in the centres of the intervals on the axes. The
deductions from them, and also to interpret graphs and to make idea that a square of the graph paper represents a unit of a quantity
conjectures about the formulae associated with them. Graphs are leads naturally to the drawing of histograms. We shall also want a
used to convey information and to assist in the analysing of square of graph paper, which is a unit of area, to represent a unit of
information. Graphical representation is one of the most valuable some quantity such as distance at a later stage (see chapter 11).
mathematical tools used by scientists, economists, government, However, in order to draw a Cartesian graph, Richard needs to
industry and commerce. Indeed, many pupils will find a variety of supplement his idea of representing numbers by squares with the
uses for graphs in their activities outside mathematics. For less- idea that an axis of a graph is a number-line, so that a point on the
able pupils the aim of graphical literacy will inevitably be restricted axis represents a number, instead of a square of the paper'
to more limited circumstances, but all pupils can and should have representing a number. Carol, who drew the graph in figure 1.2,
FROM GRAPHS TO CALCULUS
J=> b w b w S
0 i 2 3 456
VfMO te
u
UJ
cr
uJ
Of
Jl J5 J.3 J<r
"ttvo-t~<4i .cawi a*
-
artip acr ucn*u4 citnwt u-iTKe.sa>*w-crvcAtr'lwiudo' wcl^
was in the process of learning this idea, and used points to of using an ordered pair such as (1, 3^} to satisfy D +2A = 8. Do
represent numbers on the vertical axis, but not on the horizontal the points intermediate to the grid points yet represent number-
axis. Julia (figure 1.3) was able to draw a line graph instead of a pairs to her? What meaning (if any) does she attach to the lines she
block graph. This stage leads naturally to the Cartesian graphs has drawn to join up the points on her graphs? We cannot tell from
drawn by Sandra (figure 1.4). She can represent an ordered pair of these graphs, but these questions are most important in under-
numbers such as (2,3) by a point. Many children of 11 are not as standing children's interpretation of graphical representations.
advanced as this. We notice, however, that Sandra has not thought They commonly join up points they have drawn on the graph
THE EARLY STAGES OF GRAPHICAL WORK
SontirCL ft.
n +2 A *$ » D + 2A
o, v
£ , 3
U , 2
b ( (
\, % , O
^
^ '^
-— ^ "-i
i 3 X 1. V » » ^ Ci 1
A na ^< 1 A, D -t 2 A = 5
5? , 0
^^
^ ^ ^^ 1
—1 ——«! r "<
D
u 1 ^ Jl t
IKooc OK boiK o
S-frmohl
Cjmnruo and aK bo^K at the. Sarrtt
I'igurc 1.4 Front </></;>/is l^'tuliiiti In Ahjchru.
FROM GRAPHS TO CALCULUS
merely because they have seen graphs which look like this, How la come to school.
although they do not yet understand that all the points of the
graph satisfy its equation. We notice, too, that Sandra is develop-
ing an intuitive idea of gradient, as she says that both graphs are at
the same angle. It is not clear whether she is judging entirely by eye,
or whether she knows that a right-angled triangle with sides in the Q.
Q.
ratio 8:4 is of exactly the same shape as one with sides in the ratio "S
5:2^. However, spontaneous expressions such as her statement <D
that the lines are at the same angle are to be encouraged among .0
E
secondary-school pupils as much as they are amongjunior-school
pupils.
A graph is a means of recording and conveying information. Walk Cycle Bus Train
While pupils are learning the techniques of graph drawing, it helps The natural representation of this information, the frequency of
their understanding considerably if the information is vivid and a particular occurrence, is by a block graph. This starts a line of
full of meaning to them. Often, this implies that information should development towards the drawing of a histogram, in which the
be collected or measured by the pupils themselves. frequency of a property is represented by an area.
Much of the information easily gathered by pupils is of the form In order for pupils to progress towards drawing Cartesian
shown in the quotations below from SMP Book B (p. 107). graphs of functions, we need to look for situations in which
numbers in one set depend on numbers in another set. The following
are suitable situations in which pupils can collect information and
record it by drawing graphs:
Here is the result of a survey made on a class of first-form boys and girls to (i) The height of a growing plant or animal depends on the
find out how many are using each method:
time after planting or birth.
walk, 13; cycle, 8; bus, 4; train, 9; car, 2.
(ii) The length of a spring depends on the mass hung from it.
We can just write the results out like this, or we could try to find a way to (iii) The time of a railway journey depends on the length of the
display them so that they were easier to follow, or even to see at a glance.
The first method suggested is a simple table: journey.
Walk Cycle Bus Train Car
(iv) People's European clothing size depends on their British
clothing size.
13 8 4 9 2 Total 36 (v) The charge for a trunk telephone call depends on the
The total has been included and this acts as a useful check. distance.
THE EARLY STAGES OF GRAPHICAL WORK 7
age in
weeks
Figure 1.6 (i) The axes of the graph are parallel: the arrow shows that 5 maps to 6.
(iii) The mapping arrow is invisible, but the dot marks where it "turns the corner".
height in
centimetres
age in
weeks
(ii) The axes are perpendicular: the mapping arrow is shown with a right-angled
bend in it.
THE EARLY STAGES OF GRAPHICAL WORK
Thermometer n-
Plastic bag
enclosing
plenty of air
12
information, to ask ourselves questions, and to draw conclusions.
For instance, we can measure the height of the classroom broad 1.0 -
bean plant each Friday afternoon, and draw a dot graph such as
that of figure 2.1. This draws attention to one aspect only of the 8 '
plant's growth: its height. We have ignored its girth, its mass,
6 -
whether it has grown a new leaf this week, and any other aspects of
•
its behaviour apart from its height. 4
We have made a mathematical model which simulates the
2
particular aspect of the real world in which we are interested; it
helps us to focus our attention on the height of the plant to the I i < I i I I I y
exclusion of all else, in a way similar to that in which a small boy 0 12345678 time m
weeks
can concentrate completely on a model car without the noise and
Figure 2.1 First model.
danger and uncontrollability of the street scene in real life. He can
act out his thoughts in a model world which is completely under (iv) Do other plants grow faster or slower?
his control. Similarly, a mathematical model enables us to concen- (v) Do side shoots grow at the same rate as the main stem?
trate on particular features of the situation, and to act out changes These questions are questions about plants, but they can be
and study their consequences in a way which would be impossible handled within a model, and the answers and predictions made
in the real world. The graphical model at this stage is purely de- within the model can then be applied back to the real world.
scriptive. But the graphical description also suggests questions: A next step in our thinking may be to improve the model. A
(i) Does the bean grow steadily? straight-line graph is both a simple model and one which displays
(ii) How fast is it growing? the continuity of plant growth (which the dot-graph of figure 2.1
(iii) How tall should we expect it to be next Friday? does not). It also contains answers to some of our questions.
10
GRAPHS AS MATHEMATICAL MODELS 11
height in
centimetres
123456
For pupils aged eleven, parabolic graphs can model the areas of
different-sized squares. Older pupils will find that parabolas are
also reasonably good models of the paths of balls when thrown,
and of the shapes of arches of such bridges as the Royal Tweed
Bridge at Berwick (figure 2.3).
Thus, a number of simple graphical models are worth studying
in their own right because they fit so many different situations. The
process of moving from the situation to the graphical model and
back is also an important subject for study in the secondary
school we need to discuss with pupils the limitations of the
models used. The parabolic model of the path of a ball ignores air
resistance, and is also a poor model of the path of the tip of a javelin
in flight.
40 50 mass in grams
2.2. Discontinuous Graphs as Models
cost in
pence
The graphical models which are used to represent real-life
situations are often, but not always, continuous graphs. For
example, consider the problem of devising a scale of postal charges
for letters of different masses. The principle of fair play might
40
suggest that the postage should be proportional to the mass of the
letter. But this model, the straight-line graph (figure 2.4(i)), is not a
suitable model for practical use for, although mass is a continuous 30
quantity, money is not and it is not much use choosing a model
which would direct the Post Office to charge 10-23 pence for a 20 -
letter. The only suitable model is that of a step-function such as
that shown in figure 2.4(ii). Step-functions turn out to be
fundamental in later work on areas, and so it is helpful to introduce 10
them early in concrete situations such as this.
A situation in which neither variable is continuous is shown in
10 20 30 40 50 mass in grams
figure 2.5. The straight-line model shown by a dotted line is
illuminating, in spite of the fact that interpolation is useless, there Figure 2.4 Scales of postal charges: (i) an impractical straight-line model;
being no shoe sizes between 4^ and 5, for instance. (ii) the step-function used for air-mail letters to America during 1976.
GRAPHS AS MATHEMATICAL MODELS 13
2 34567 8
16 16
3
3
cr cr
1234 1234
number of spots on side number of spots on side
Figure 2.6 The squares of positive whole numbers: (i) line graph; (ii) point graph.
However, squares turn up in other contexts as well as in ing chapters. However, we should note here that, when directed
arrangements of spots. We can draw a square of side 2-5 numbers and their multiplication make an appearance, we shall
centimetres, and we can also use the arithmetic process of squaring extend the quadratic or parabolic model which maps the real
the number 2-5. number x to x2, and which is usually written either y = x 2 or
A graph showing the squares of all the real numbers models the x i >x2, to allow x to be a directed number (figure 2.8). Pupils will
squaring of fractional, as well as of whole, numbers. Hence it is also get used to seeing the parabola in a number of different
more useful, and leads pupils towards a greater knowledge of the positions, and will eventually see that this model applies to a
real number system. Thus teachers may think it sensible to problem such as:
encourage pupils to join the points in figure 2.6(ii) by drawing a 100 metres of fencing is used to make three sides of a rectangular enclosure
smooth curve through them (figure 2.7), thus setting the graph against a wall. What is the greatest area which can be enclosed?
which was originally concerned with spots within a wider math-
ematical context. Thus we see that a major aim of the work is to give the parabolic
Later developments of graphical work are discussed in succeed- model, and the formula x i >x 2 for the function, a life of their own
GRAPHS AS MATHEMATICAL MODELS 15
square *
y
16
-4 -3
number
Figure 2.7 The squares of positive real numbers. Figure 2.8 Graph of .vi ».v 2 .
within the pupil's mind. By the age of 14 or 15, the more-able pupils development towards, and study of, these two general categories of
will be considering problems about the mathematical model and, graphical model. The unifying concept of a function is of particular
for instance, the gradient of the graph of XH->x2 can be studied importance in this study, and the next chapter discusses the
without immediate reference back to the real-life situations being concept.
modelled.
Among the many graphical models available, there are two The early treatment of graphical representation in secondary
types which are particularly useful. These are loosely described as schools is often disappointing. The location and the plotting of
continuous graphs and graphs of step functions (figure 2.9). In this points are usually introduced in the first year, but the subsequent
book, therefore, we shall concentrate particularly on pupils' development of graphical work seems to take two forms:
16 FROM GRAPHS TO CALCULUS
xi—» y
(i) Real situations are dealt with in chapters entitled statistics. graphs, their equations and the idea of functional dependence
(ii) The work which leads towards functions and the idea of could grow from children's primary-school experience of collect-
dependence is conducted in an abstract way, not relating the ing, recording and interpreting information in a systematic way,
ideas to graphs which may occur outside mathematics. so as to incorporate the graphs of measured data which pupils
If these two lines of thought could be fused so that Cartesian draw in other subjects, this would be a great improvement.
3 The Idea of a Function
3.1. Dependence should meet activities which will enable them to develop the idea of
functional dependence. From these examples will emerge the
The idea of a function is one of the basic ideas of mathematics generality and power of functional dependence, and of the uses
and is particularly important in the uses of mathematics in the real which are made of functions in mathematics and other subjects.
world. It is derived in the first place from the fact that a quantity Like the man who discovered that he had spoken prose all his life,
may depend on one or more other quantities. As such, it is a very the pupil needs to discover that he is surrounded by functions,
familiar aspect of everyday life, which does not have to be whose behaviour he can begin to analyse mathematically.
artificially introduced into children's thinking. They know that the For some pupils, their ideas of functional dependence will
price they pay for sweets depends on how many packets they buy, remain simple, pictorial, and mostly qualitative:
that the BBC 1 television programme which is showing depends on The bigger the block of chocolate, the more it costs.
the time of day, that whether it is school or holiday time depends The time it gets dark in the evening depends on the time of the year.
on the time of year, and so on. In the secondary school, teachers
introduce their pupils to many functions: the science teacher shows The majority of pupils will be able to understand that different
them that the length of a spring depends on the mass hung from it, types of functional dependence are expressed by graphs of different
and that the pressure exerted by a given volume of gas depends on shapes, and to realize the significance of a linear function:
its temperature; the geography teacher discusses the fact that the Doubling the weight of chocolate doubles the price.
local time depends on the time zone in which a place is situated; the
physical education teacher demonstrates how the distance travel- They will by age sixteen be able to attach a good deal of
led by a cricket ball depends on a number of aspects of the significance to the steepness of straight-line and other graphs. For
thrower's action, such as the angle at which the ball is thrown. In some students, the building-up of a vocabulary of different types of
mathematics we expect our pupils to notice that the area of a functional dependence will be a necessary preparation for future
rectangle depends on its length and its breadth, that the distance a work. They will need to recognize quadratic as well as linear
ladder reaches up a wall depends on its angle to the floor, that the functions, and to realize the significance of inverse as well as direct
area of a circle depends on its radius; and we find many other proportion.
examples of dependence throughout mathematics.
The concept of function grew in order to express mathematically
this notion of dependence. For users of mathematics, as well as for 3.2. Historical Development of the Idea of a Function
mathematicians, it is one of the key ideas, for so much of both
mathematics and science is concerned with studying how one thing Like many other important ideas, the idea of a function has
depends on another. Hence all pupils in the 11-16 age group gradually been refined and clarified over the years, and has only
17
18 FROM GRAPHS TO CALCULUS
fairly recently been seen in its full generality. It is important that later we read in Hardy's great textbook Pur? M<if/icm«n'cs (1908):
teachers should be aware of this change, for even some recent texts All that is essential (to a function) is that there should be some relation between
which they may use or consult may be based on outmoded x and y such that to some values of x at any rate correspond values of y.
definitions of a function. Moreover, in the last few years some
school texts have adopted a point of view about functions which Hardy had abandoned the idea that a unique y should depend
does not seem to help pupils to realize their importance. We on each x. This admitted "many-valued functions" to mathe
therefore now discuss the historical development of the function matics, so that to Hardy y^ = 1 — x^ was a function, but had the
idea, and how this is reflected in school texts. property that if x = 0, then y = + 1 or — 1. This generalization has
The following early definitions show how mathematicians been rejected by later writers, who have returned to Dirichlet's
groped to make their ideas of a function more precise. statement that a function is obtained by giving a #•»&' whereby
exact/y o;?e y is obtained from ear/? admissible x.
A quantity composed in any manner of a variable and any constants. (Jean Modern writers have also realized the importance of the scf from
Bernoulli/I 71X)
Any analytic expression whatsoever made up from that variable quantity and which the admissible values of x are drawn and the stV from which
from numbers or constant quantities. (Euler, I74KJ the values of)' are drawn, so that in elementary modern treatments
Quantities dependent on others, such that as the second change, so do the first. the concept of a function is developed so as to have three
are said to be functions. (Euler)
If a variable y is related to a variable .\, so that whenever a numerical value is constituent parts:
assigned to .\- there is a rule according to which a miiVytic value of y is determined. (i) a starting set, called the (/«mm;i of the function, whose
then v is said to be a function of the independent variable x. (Dirichlet, 1837) members are the admissible values of x;
These early definitions reached out to express in words the idea (ii) a target set. called the co&jmam of the function, a single
of the dependence of one quantity on another, together with the member of which is attached to each x;
idea that the second quantity is imi'que/y determined from the first (iii) a set of arrows or a ni/c, to show which member of the
by some ni/e. codomain depends on each member of the domain.
At first, this rule had to be expressed in a single formula, but by This idea of a function is shown diagrammatically in figure 3.1,
the time of Dirichlet the single algebraic formula had been found to and is very simple and suitable for development with pupils
be too restrictive. For example, if x is a counting number greater rule
than I, the rule
v is the highest prime factor of x
or For Durell, as for Hardy and all his predecessors, the formula
was the function. They talked about "the function 7.v". So do many
In this book the second of these notations is used. teachers at the present time. However, most modern authors
20 FROM GRAPHS TO CALCULUS
regard 7.v as the member of the codomain of the function which Thus, traditional authors, by emphasizing the formula for a
corresponds to the member .v of the domain (figure 3.4). Thus, in function, emphasized the idea of functional dependence. Unfor-
modern usage, the function which Durell spoke about as "the tunately, the words "the function 7.x" lose the idea of mapping. It is
function 7.x" consists of: .x which maps to 7.x, and this vitally important link between .x and
(i) a domain, which is usually assumed to be the set of real its image 7.x needs emphasis for young pupils. We need to say
numbers, if there is no statement to the contrary;
the function: .x maps to 7.x
(ii) a codomain, which again is usually assumed to be the set of
and to write
real numbers; A'i > 7.x.
(iii) the rule: x\—>7.x.
»7x Moreover, the suppression in traditional texts of the domain or
the codomain of the function sometimes leads to difficulties over
the use of functions as models of real situations.
For example, a function with rule .v i > .v 2 models the areas of
squares which can be built with the rods of structural apparatus.
But this model only applies to structural apparatus when .x is a
natural number. The domain of the function is the set N of natural
numbers.
Similarly, a function with formula s = 5t 2 models the distance s
Figure 3.4 metres fallen in time t seconds by a stone dropped from the hand,
but the model only applies to the situation when t is positive. The
Many teachers have been brought up to speak of "the function domain of the function is the set of positive real numbers.
7.x" and it requires considerable concentration in the classroom to Thus, the modern view of a function as having three parts, a
replace this in everyday speech by domain, a codomain and a rule, is most helpful to pupils in
the function: .v maps to 7.v. understanding the power of the function concept, and how
functions are used in modelling real situations. This development
The view that the formula is the function persisted in school
of the meaning of the word function in school mathematics started
textbooks for many years, so that in 1959 Channon and McLeish
with the introduction of modern mathematics texts in the early
Smith wrote, in General Mathematics Book 4, p. 152:
1960s.
Any algebraic expression which involves the variable .v (and no other However, the treatment of function in most modern texts has
variable) is called a function of x, and its value depends on the value of x. The been greatly influenced by another mathematical point of view
symbol used is/(.v), which is re'ad as "function of .\-";/{2) means "the same
expression with 2 written instead of .x", / ( 1) means "the same expression with which seems to be unhelpful for young pupils, and so to be
- 1 written instead of x", and so on. regrettable. This is described in chapter 5.
4 Functions in the Early Secondary Years
4.1. The Introduction of Functions cost in
pence
of showing functions visually are available. Each of these pictures accommodates domains and codomains which have no particular
brings out particular features of the concept of function, and has its structure, and so which need not be sets of numbers. Many pupils
own strengths and weaknesses. Pupils should be able to choose will have used it in the primary school.
from a variety of different representations, according to the
particular function or aspect of a function in which they are
interested at that moment. They therefore need to be introduced to
these representations at appropriate times in the early secondary
years. It is also illuminating for pupils to use more than one
representation of the same function.
Some of the strengths and weaknesses of the various forms of
diagrammatic representation are summarized below, so that the
teacher can help pupils to choose the most appropriate one for a
particular situation. The teacher will realize that some of the points
listed do not apply to 11 and 12-year-old pupils, but he needs to
help pupils to build up a vocabulary which they will use later.
rule
Figure 4.3
X \—» X + 1
/-1 0
Figure 4.4 Arrow diagrams and Cartesian graphs for some important functions.
(iii) It brings out properties of the function, such as one-one, Its main weaknesses are:
onto and the existence of an inverse function. (i) If the domain has a large number of members, the diagram
(iv) It shows composite functions very easily. is messy,
(v) It gives a diagrammatic representation of the most general (ii) If the domain has an infinity of members, the diagram can
function without any special features. only be symbolic of the function.
FROM GRAPHS TO CALCULUS
(ii) The arrow diagram for functions with numerical domain and
codomain
This is an adaptation of (i) to the case when the domain and
codomain are sets of real numbers. It shows the order of the
numbers and contains the important idea of scale. It is also the
most basic representation in which different functions produce
different-shaped diagrams. Arrow diagrams for a few of the most
important functions are shown in figure 4.4, where their Cartesian
graphs are also shown for comparison. The teacher will realize that
for each function, the two representations stress very different
aspects of the function. In all cases, the domain and codomain of
the function drawn is the set of real numbers.
More experience is needed to associate these simple functions as
easily with their Cartesian graphs as with their arrow diagrams.
xi->2x+1
Able and average pupils should certainly be expected to recognize
the shapes of the Cartesian graphs of the functions shown, and
others, but the arrow diagram, which is often more meaningful to
beginners, gives a useful lead towards realizing that particular
types of functions produce particular patterns of diagram.
Some strengths of the arrow diagram are:
(i) It is very simple and is comprehensible to almost all pupils.
(ii) Good pictures for simple functions. Linear functions
connect easily with transformation geometry.
(iii) The order of the numbers is emphasized. Functions which
(iii) If the domain and codomain are subsets of the real preserve order are distinguished from those which do not.
numbers, the diagram does not show the order of the (iv) The diagram for a composite function is easily constructed,
numbers. (v) The diagram for the inverse function (if any) is easily
(iv) It does not attach a shape of picture to any particular constructed.
function. (vi) It brings out whether the function is one-one or many-
FUNCTIONS IN THE EARLY SECONDARY YEARS 25
(v) The graph of the inverse function (if any) is not immediately We often speak about the processes of arithmetic. Addition is a
obvious, although it can easily be found, function which needs two inputs. If we feed in 2 and 3, the addition
(vi) It is not good for composite functions, for which new function produces 5. The function machine illustration is the only
graphs always hnvp
aranlr: alwnvc have tn
to hp
be Hrau/n
drawn. elementary diagram for this situation. It can also be used to focus
25
(iv) The "function machine"
Figure 4.8
60
40
20
0123 t
'(1. 2)
0 1
.a
the practice may seem to pupils who do not appreciate the finer
points of logic to be at variance with the usual style of labelling
when physical quantities are graphed, as in the following extract,
which is taken from SMP Book C, Teachers'" Guide (p. T97).
If we let x stand for the number of hours and y for the height of the candle in
cm, then the relation between x and y is
x+y = 8 or .x = 8 y or y= 8 x
_c
0>
If the height of the candle after n hours is h cm, the axes of the
Cartesian graph could certainly be labelled h = 0 and ;i = 0, but
no such labelling is possible for the arrow graph, although the
labelling shown in figure 4.14 would still be possible.
1 OT >1 C C -7 Q
An additional point which arises from the extract is the conflict
number of hours of conventions between teachers of science and mathematics over
(b) the labelling of graphs. This controversy will no doubt continue to
FUNCTIONS IN THE EARLY SECONDARY YEARS 31
be bitterly fought out between science teachers, who teach that /? when elsewhere in mathematics this use of a bracket denotes
stands for a height, and that the associated number is /i/cm, and multiplication. Our more logical science colleagues write either
mathematics teachers, who usually insist that a letter stands for a
number, and so write that the height of the candle is h cm. height of candle in cm
However, from either point of view, there seems little merit in or
height of candle (cm), height of candle/cm.
5 Further Developments of the Function Idea
5.1. Introduction
or
We have seen that in "traditional" texts, a formula was regarded (a2,b2 ), (a3 ,b2 )
as a function. More "modern" texts, published since about 1960,
is quite sufficient to specify the function shown, without a verbal
regard a function as having three parts:
description of the rule. Indeed, no convenient verbal description of
domain, codomain and rule. this rule exists. Hence, all that is needed to describe the function is a
In previous chapters, we have emphasized the importance of the
idea offunctional dependence. However, a number of modern texts
treat functions very differently, regarding them as particular
examples of relations.
This approach is unhelpful to pupils' understanding of
functions, and should not be used for the reasons given in the next
section.
(i) The idea of a set is first studied. development that working mathematicians discard the idea of a rule
(ii) The idea that two elements a and b can be put together to in favour of a subset of A x B.
form an ordered pair (a, b) is introduced. Children of 12 and 13 years of age are not yet concerned with the
(iii) This enables the Cartesian product A x B of two sets A and logic of axiomatic development. They are still learning the tools
B to be defined as the set of all ordered pairs (a, b) such that of the mathematician's trade. When teaching beginners about
aeA,beB. functions, it is very unwise to suppress the idea that functional
(iv) A relation from A to B is defined to be a subset of A x B; dependence is expressed by a rule for mapping. Functions are so
that is, a relation is any set of ordered pairs (a,b) whose important in their own right that they should not be taught
members are drawn from A and B respectively; all mention as particular (and perhaps unimportant) types of relations.
of the word "rule" is avoided by simply giving a list of pairs Functions are used as mathematical models in situations where
which would have been linked together by the rule, dependence needs to be expressed. In many "modern" texts this
(v) A function f with domain A and codomain B is now a idea is not clearly brought out.
relation from A to B with the additional property that each The introduction of "modern" work, as at present seen, has not
«eA is the first member of exactly one ordered pair improved pupils' understanding of the key concept of a function.
belonging to the function. In traditional texts, functional ideas were introduced late, and the
Thus a function is a set of ordered pairs (a, b) such that point of view used was not in accordance with the present
(a) the first member a of each pair belongs to A, mainstream of mathematical thinking. By contrast, the "modern"
(b) the second member b of each pair belongs to B, orthodoxy is to introduce functions early, but to minimize their
(c) each member of A belongs to exactly one ordered pair of the importance. This seems very odd.
function and so is uniquely attached to a single member of B.
In other words a function can be considered as a particular type of
relation. 5.3. Review of Texts
This definition of a function is logically satisfactory in an
axiomatic development of mathematics in which the aim is to
present each concept in terms of those ideas which have been The changes described above have only become part of the
previously defined. common currency of mathematics within the last twenty years, so
Unfortunately, the idea of functional dependence has been that different texts in use in schools today show different stages of
totally eliminated from this formal definition of a function. In the historical growth. Moreover, different authors have adopted
process of generalization, the rule which was the essential idea of different definitions for the same words, leading to a most
the function has vanished. unfortunate state of confusion.
Inquiries among mathematicians who use functions as basic The date of publication is no guide to the view of functions taken
tools of their thinking show that working mathematicians usually by an author. The latest editions of Durel! still firmly hold that the
think of functions as rules for mapping, rather than as sets of formula is the function (C. V. Durell. Certificate Mathematics,
ordered pairs. It is onlv when they need the logic of an axiomatic Volume 2 (p. 150), second edition, 1971).
34 FROM GRAPHS TO CALCULUS
Graph of y = x2 SMP Books I to 5 were among the first "modern" texts. For their
25 1 1 | 1 1 1 1 1 ! n | ! 1 1 1| | | .1 1 1 1 1 1 1 1
y ::—:::_. .. :::„!. :...—
authors, a function was a particular relation (SMP Book 2 (1966)
EEEEEEEEEEEEEEEEEEE|EE|EEEEEEEj p. 158).
.__ _U__- __________ -^--- --J----L--
20 ... — —.- — — ..-- — A relation is a connection between members of two sets or members of the
same set. Figure 8 represents the relation "has the prime factor" between the sets
{4,6,7,15,16,20} and {2,3,5,7}.
15 ...................... ....|— -——-/— -^-—-ii-i--— We can call this relation a mapping, saying that each number is mapped onto
7 its prime factors. The set on the left in figure 8 is called the domain.
::::::TZ:::::::::::::::::::::::: A mapping is a function if each member of the domain has only one image, for
. ^ -. . . J example, the mapping which maps each number onto its smallest prime factor is
:-:p:-i:"::ii:----:|^-j a function.
1o;EEEEEEEE!:EElp::i:::" Figure 9 shows this mapping for the two sets above. Only one arrow starts
' ^4r-
' i i x— from each member of the domain. The image set [2,3, 7} is called the range.
::::::::::::::::::::::::::- ^- — ----Lr— — ----- — The mapping represented in Figure 8 is not a function. The ranee in this case
i 1 is [2,3, 5, 7}. "
5 ::::i::::::::rr:::::^::::
---L— — - -4— -j — -44+ — ~ Another word appeared in these texts: SMP, at that time, used
a**
( "'
mapping as another word for relation. They said that a mapping
Q ----.=•"---.-——.--—- ::::±:::::::±:::::::::::::::::: was a function if each member of the domain had a unique image.
0 1 2 3 4 5 x
Writing at about the same time, Mansfield and Bruckheimer in
> Fig. 181 Book4 of Mathematics: A New Approach (1965), which is also in
the "particular relation" school, avoided both words relation
The graph in figure 181 represents the relation between a number .v and its and function, using the word mapping for what is usually called a
square x 2 : it is called the graph of the function x2 . Figure 181 also represents the
graph of y, where y = x 2 , for values of x from 0 to 5. relation, and described functions as one-one or manv-one map
pings (Mathematics: A New Approach, Pupil's Book 4 (1965), pp.
Channon, McLeish Smith and Head (New General Mathe 10-11).
matics 4 (1971) p. 102) use both the "particular relation" and the
"function is the formula" views. Any system by which every member of one set called the domain sat (in the
above example, the set of numbers of therms) has associated with it a member of
Functional notation another set called the range set (in the above example, the set of gas bills) is
A many-to-one relation is called a function. called a mapping.
As a one-to-one relation, often called a one-to-one correspondence, is a special The only necessity in a mapping is that every member of the first set should
have an image and that the mapping should provide a means of finding that
case of a many-to-one relation, it is also a function.
Any algebraic expression which involves the variable x (and no other image. We can, indeed, have mappings where a member of the domain has
variable) is a function of x, and its value depends on the value of x. The symbol several different images. These mappings are called one-many.
used is/(x), which is read as "function of x" ;/(2) means "the same expression
with 2 written instead of x",/( 1) means "the same expression with 1 written In the Teacher's Book 4, they explained their policy. They were
instead of x", and so on. aware of the changing use of the word function, and attempted to
FURTHER DEVELOPMENTS OF THE FUNCTION IDEA 35
were appearing, the following discussion of relations, mappings
and functions appeared in SMP Book D. Teachers Guide (1970),
p.T231:
The following definitions of the different types of mappings are included for
the benefit of the teacher.
A relation is many to one if the image of each element of the domain is a single
element of the range. A relation is one to many if the inverse image of each
element of its range is a single element of the domain. A relation is one to one if it
is both many to one and one to many. A relation is many to many if it is neither
one to many nor many to one.
(iv) The relation 'is the brother of defined on the set of brothers and sisters
[Jane, Janet, June, Jack] is an example of a one to many correspondence.
Many to one and one to one relations are mappings.
It might be worth noting here that, in this course, unlike the SMP O-level
course, we have chosen to use the more commonly recognized definition of a
mapping: that is, a relation in which each member of the domain is related to
one and only one member of the codomain. This statement corresponds to the
definition of a function given in Book 2, p. 158, and the authors have decided
that in Books A-U, in order to keep the mathematical language as simple as
possible, the use of the word 'function' is therefore unnecessary.
first section of Modern Mathematics for Schools, Book 5, chapter 3 Notation: If a function/maps an element x of set A to an element y of set B, we
(second edition, 1973) is headed "Mappings, or functions". The write/ :.x -»3', which may be read '/maps .x to /'.
v is called the image of .x under/ and the set of images form the range of the
authors are still members of the "particular relation" school, function.
although they emphasize the importance of functions. This Thus for a function we require:
approach to functions also necessitates their very late (i) a set A, called the domain of the function;
(ii) a relation which assigns each element of A to exactly one element of B.
introduction in this case not until Book 5 . The following extract The set of images in B is called the range of the function.
is the key to what follows (MMS Teacher's Book 5 (1973), pp.
50-51).
1 Mappings, or functions
5.4. Recommendations
In Book 3, we looked at some relations and mappings from one set to another The nomenclature used about functions in school texts has
set. In the present chapter, we develop these ideas further and study the
important concept ofjunction.
become very varied since about 1960, so that a word which is used
with one meaning in one set of texts may be used with another
Example 1. Let A = {1,4,9} and B = {1,2,3,4}. Show in an arrow diagram
the following relations from A to B: (different) meaning in another set of texts. This causes considerable
(i) is greater than (ii) is the square of
confusion for pupils and teachers alike. Pupils frequently change
schools and texts, and good teachers consult more than one text,
and often draw exercises from a number of places. Moreover, many
A B students who proceed into higher education may have to modify
sthesquareof/ their vocabulary at the beginning of their course. There seems no
good reason for the present confusion of vocabulary to be
perpetuated. When choosing a vocabulary for school use, the
vividness with which it conveys the ideas to beginners, its
memorability and its clarity are important considerations. There
seems very little to choose between different versions on these
grounds, so that this criterion gives little guidance, except that
phrases such as
3 maps to 21
In (ii), the relation is a mapping since each element of A is related to exactly one
element of B as is shown in figure 1 (ii) where one arrow leaves each element of A, give a vivid dynamic picture of what is happening, and should be
Relations which are mappings are of prime importance in mathematics and
are often referred to as functional relations, or simply functions. A function is preserved.
therefore another name for a mapping. Both terms are useful; the idea of a In higher mathematics there is a considerable measure of
mapping as a kind of operation helps to give a picture of a function. agreement about the vocabulary of functions, and confusion
A function, or mapping, from a set A to a set B is a relation in which each
element of A is related to exactly one element of B. We write A -> B (A maps to would be diminished if this vocabulary were universally adopted.
It is illustrated diagrammatically in figure 5.2.
FURTHER DEVELOPMENTS OF THE FUNCTION IDEA 37
A relation associates some elements of a first set the domain with some
elements of a second (or the same) set the coilomuin.
/^function associates with each element of the domain one and only one
element of the codomain. The elements of the codomain which are "images" of
elements of the domain form a subset of the codomain the range of the
function. Thus we may have a "many-valued relation" but not a "many-valued
function".
We recommend that mapping be synonymous with function.
The variety of uses of the noun mapping and the verb to map
in school mathematics presents particular difficulty as they have
both been used in connection with both functions and relations.
In advanced work, the noun mapping (sometimes abbreviated
to map) is only used to refer to functions (except in texts of the
Open University) and its use in schools should also be confined to
Figure 5.3
functions. Similarly the verb to map should be confined to
functions or mappings, and used as a vivid way of talking about the formula is the function. We should no longer speak of "the function
mapping arrow in sentences such as "Each person maps to the 7.x". In advanced work, a new notation has grown up to describe
number of hairs on his head". such a function. Many mathematicians write
The use of the verb to map in connection with relations should /:R->R given by /(.x) = Ix
be avoided. The words is related to can always replace the words
maps to in relations, and so reduce the danger of ambiguity. For to express the function with domain the set of real numbers R,
example, consider the relation "is joined to" illustrated in figure codomain R and rule x i > 7.x. This is unwieldy for everyday use by
5.3. It is better to say "B is related to D" than "B maps to D". beginners, who are often not interested in the domain and
Thus we concur with the following recommendations of the codomain as much as they are in the rule. In this case it is better to
Teaching Committee of the Mathematical Association (Mathe use the abbreviated form of words
matical Gazette 4\8, December 1977): the function ,xi > Ix (the function x maps to 7.x)
There are considerable difficulties in the definition of some of the common
terms; our aim is to suggest a scheme which is usable in the sixth form, with or if the function needs a name:
obvious consequences at earlier stages.
The idea of a relation is a very general one in mathematics. For school we the function/:xH->7.x
make the following recommendations. (the function/under which .x maps to 7.x).
38 FROM GRAPHS TO CALCULUS
This notation has already been adopted in the majority of modern cube plus one. This can be described as x i-+ x3 +1, but the
school texts, and presents no difficulty to the learner. temptation to say
However, the Mathematical Association recommendations
continue: the function x 3 + 1
A function may be denoted by a single letter/ Generally by/(x) we mean that is strong, and certainly this "abuse of notation" is convenient.
clement of the range which is the image of the element x of the domain under the However, if notation is abused before the learner grasps the
function/ but it is sometimes convenient to use/(.v) in an "old-fashioned" way
to describe the function itself (usually where a particular function is involved, concept he is being taught, it may prevent him from fully
e.g. "the function ,\ 2 3.x+ 2"). appreciating the concept. It is better to talk about
The arrow »is used to mean both "is mapped onto" and "tends to". Usually
there is little chance of confusion, but if distinction is necessary the "barred the function x maps to x 3 + 1
arrow" t-> may be used to indicate that an element of the domain is mapped to
an element of the codomain.
until the learner is thoroughly familiar with the constituent parts of
The problem referred to in the second sentence is that a general a function: domain, codomain, and rule. Thus, the "abuse of
function can be described by a single letter/, but there is no equally notation" which talks about "the function 7x" should be avoided
brief way of describing the function which maps each number to its before the age of 16. '
6 Some Special Graphs
6.1. Straight-Line Graphs and Proportionality example, x might represent the number of kilometres travelled by a
car on a motorway and y the number of litres of petrol used; or x
Functions whose graphs are straight lines through the origin might be the number of people in a household and £y the cost of
turn up so often as models of situations in the real world that a their food per day.
particular jargon has developed to describe them, and is used as In the first case the car may travel about 10 kilometres for each
everyday language by scientists and mathematicians. This jargon litre of petrol, and y = -^x is then the equation of the graph used as
a model. However, in this situation x can only take positive values,
yi\ and so we are concerned with the function .XH-^.X with domain
the non-negative numbers.
In the second case, if it costs about 50p per day to feed a person,
then y = jx. It is obvious that this statement is only approximately
true, for not everyone eats the same amount of food, and it is also
clear that .x must be a positive integer or zero. However, y = %x,
although only an approximation, acts as a mathematical model
which says something useful about the cost of running a house-
hold.
These graphs and functions are often described in the language
of proportionality
y is directly proportional to .x
or
y is proportional to .x.
Figure 6.1
The two examples above are often described as "the petrol used
by a car on the motorway is proportional to the distance travelled"
of proportionality needs to be known by pupils, who often do not and "the cost of the food used in a household is proportional to the
realize that they are dealing with linear functions under a different number of people in the household".
guise. More precisely, the statement that "y is directly proportional to
A straight line through the origin is a graphical model which .x" means that y = kx for some fixed value of k. In the cases in
describes either exactly or approximately many everyday pheno- which the statement "y is directly proportional to .x" is used, the
mena, and can be easily used to make predictions about them. For domain is the set of values of x which are appropriate to the
39
40 FROM GRAPHS TO CALCULUS
Figure 6.3
of the graphs of these functions, and should be aware that common responding to the function ,xt > 1/x, should be known and
sense should be used in the interpretation of the graph, both recognized by pupils.
regarding the domain of the function and the suitability of the The calculator makes it easy to explore reciprocals numerically,
function as a model of the situation. so that x ^ \jx gives, for example,
2i->0-5
20h->0-05
6.3. The Reciprocal Function 200 1-> 0-005
0-2 )-> 5 ., . .
It is particularly important that the graph of y= l./x, cor- 0-02^50 . . .. .
42 FROM GRAPHS TO CALCULUS
(i)
y = mx
O/
and so on. Discussion of the idea of an asymptote to a graph may y = mx + c, and of translating it a units in the x-direction, to give
well arise from this. the graph whose equation is y = m(x — a). While the first of these
The calculator's error message also reinforces pupils' knowledge translations is usually taught, the second is equally important
that 0 does not have a reciprocal, so that the domain of xt-> 1/x for a full understanding of the relation between the graph and its
cannot include 0. equation. Of course, most pupils below the age of sixteen will deal
with these transformations largely in numerical terms.
Similarly, the general shape of the graph of the parabola of
6.4. Transformations of Some Common Graphs y = x2 should be known, together with its translations y = x2 + c
and y = (x — a)2. These are shown in figure 6.5.
Pupils who recognize the association between a straight-line An enlargement in the y-direction is also an important transfor-
graph through the origin, its gradient m, and its equation y = mx, mation. If the scale-factor of the enlargement is k, then the
should also see the effect on the equation of translating the graph c parabola y = x2 becomes y = /ex2, and the straight line y = x
units in the y-direction to give the graph whose equation is becomes y = kx (figure 6.6). Knowledge of these transformations
SOME SPECIAL GRAPHS 43
f>5
Then its periodic nature will be appreciated. It is this property of cosine graphs in figure 6.9 and should recognize the type of
periodicity which makes the sine function such a useful model for physical situation involving periodic motion for which they are
so many regularly recurring physical situations. good models.
Pupils should know and be able quickly to sketch the sine and Translations and enlargements along the x and y-axis are often
SOME SPECIAL GRAPHS 45
(x,y)P
needed. Pupils will notice that the graph of x H-> sin x° is the graph Figure 6.10(iii) shows the graph ofy = sin (x — 30)° in which the
of x i—> cos x° translated through 90° along the x-axis, so that whole wave has been shifted in the x-direction, or undergone a
phase change, while figure 6JO(iv) shows y = 2 + sinx° in which
sinx° = cos(x-90)°.
the whole wave has been translated by two units in the y-direction.
Enlargements are particularly important, for enlargement in the Some examples of the uses of sine and cosine graphs in
direction of the x-axis changes the wave-length, and enlargement modelling are shown below.
in the direction of the y-axis changes the amplitude of the wave. The height of a bicycle pedal above the ground as the cycle is
For in stance, the graph of figure 6.10(i) is obtained from y = sinx° being pedalled can be modelled by a function with formula of the
by an enlargement scale-factor j in the x-direction. Its equation is type
y = sin 2x°.
Figure 6.10(ii) is obtained from y = sinx° by an enlargement 15sin(360rf
scale-factor 2 in the y-direction. The equation is y = 2 sin x°, and
the waves are twice as high, or have an amplitude twice that of where r is the time in seconds from the time of starting, and h is the
y = sin x°. height in centimetres. The graph of this is shown in figure 6.1 1.
46 FROM GRAPHS TO CALCULUS
y*.
y = sin 2x°
y= cos x
y = sin x°
The distance of the bob of a pendulum from the vertical position and the slower movement up and down of tides. All these are
after time t seconds may be given by a formula such as modelled by sine or cosine graphs which have been transformed
from the basic graphs of figure 6.9.
x = 5 cos (2nt)°
and the graph is that of figure 6.12. 6.6. The Exponential Function
Other examples are the variation in the length of daylight
through the year, the bobbing up and down of a cork on a wave, The exponential function arises from the attempt by mathema-
SOME SPECIAL GRAPHS 47
y., ,, height of pedal
(iii) above ground in
centimetres
1-
40 —
= sin (x-30)°
10--
time in seconds
displacement in
y = 2 + sinx° centimetres
0 360
This is shown in figure 6.13(i). Figure 6.12 The displacement of a pendulum hob.
48 FROM GRAPHS TO CALCULUS
(i)
= 2x :x is a non-negative integer
4 x
In this case, because the weed is growing continuously, it is negative numbers; but this is not a good model. The step-function
appropriate to extend the model to values of x other than integers; shows the actual amount of money accumulated after x years,
this amounts to joining the points by a smooth curve. where x need not be an integer.
Another example of growth, but one in which it is not In general, a function of the form x i—>• a* or x H-> a~ x (a > 1) is
appropriate to join points by a smooth curve, is compound called an exponential function, the first modelling growth and the
interest. Suppose £100 is invested at 8% interest, the interest to be second modelling decay. The study of exponential functions is one
paid at the end of each year and re-invested. Then after n of the major themes in calculus at the stage beyond sixteen. In the
completed years the amount £A of money invested would be given meantime, the calculator makes it easy for pupils to investigate
by the formula A = 100 x 1-08". The dotted line in figure 6.14 is the numerically exponential functions such as those associated with
graph of the function xi—> 100 x 1-08*, with domain the non- continued doubling and with compound interest, which occur in a
SOME SPECIAL GRAPHS 49
A i <• number of examples suitable at this stage. For example, the effect of
the compounding of interest at more frequent intervals than once
each year can be investigated.
220
180 The advent of the calculator has meant that the place of
logarithms in the secondary mathematics curriculum is changing
160 rapidly. Teachers who argue that their pupils will need logarithms
for calculation in real life must think again; no one who has used a
140 calculator will return willingly to logarithms. Some argue that
calculators are too expensive for all pupils to have access to them,
120 and that there are economic if not educational reasons for
continuing to use logarithms; the authors believe that this
100 argument will be short-lived, and that logarithms and slide rules
will take their place alongside Napier's bones as interesting
80 historical curiosities which aided calculation before better me
thods were developed.
60 In the sixth-form mathematics course, the natural logarithm will
be needed and will be introduced either as the inverse of the
40 exponential function or in terms of the area under the graph of
v = 1/x. Teachers of pupils between eleven and sixteen need to
20 decide whether there are reasons other than computational ones
for teaching their pupils logarithms below the age of sixteen. The
authors think the arguments for the retention of logarithms are
0 10
weak, and that as sufficient calculators become available in schools
for those pupils who do not have their own to have access to them,
'. ' the teaching of logarithms before age sixteen should be discon-
Fiaure6.14 Investment growth with compound interest. •• tinued.
7 What is Area?
7.1. An Approach we attempt to ask what the other words in these definitions mean,
we see that they are not definitions at all.
What is area? This is one of those deceptively simple questions A more helpful approach than attempting to define area, which
which is much harder to answer than it looks. Some textbooks give is handled very well in SMP Book B, chapter 4, p. 36, is to consider
a definition of the form "Area is the amount of surface covered" or the problem of how to compare two shapes in order to say which is
"Area is the amount of space covered by a surface" but, as soon as the larger. It is necessary to remind pupils (because, of course, they
will previously have met area in primary schools) precisely what it
is that is supposed to be larger, for pupils are often unclear
whether the perimeter or surface is meant. It may be useful to think
in terms of paint required to cover the surface in conversations
with pupils. Once it is established what is being investigated, it is
clear that if the shapes are like A and B in figure 7.1 there is no
problem, because one will fit inside the other; but if the shapes are
like X and Y, a third shape Z is needed so that X and Z can first be
compared, then Y and Z, and so X and Y can be compared with
each other.
A convenient shape is then picked for Z, the only proviso being
that copies of the shape Z must fit together without leaving holes
(see figure 7.2).
In general the most convenient shape for Z will be the square. A
grid of squares is then used to make an estimate of the area. This is
illustrated in the following extract which is taken from SMP
Book B, p. 43.
Measurement of area
50
WHAT IS AREA / 51
13 squares
No comment need be made on the relative accuracy of these information about A which can be used in further calculations, and
methods, but there is one very important idea which can be put exact limits of accuracy for those calculations are then known.
forward at this stage. Suppose the accuracy of a statement such as A sr 13 is known and
that, for instance, the estimate of 13 units is known to be in error by
less than 3 units. This is precisely equivalent to saying that
10<A< 16
7.2. Sandwich Arguments
that is, it is equivalent to a sandwich inequality.
While the approximations of the previous section Another advantage of the original sandwich inequality
7 < A < 24 is that it can be refined. If a grid with smaller squares
area » 13 squares
and were to be used, we would expect to enclose A within closer
bounds. If each square is \ of the area of the original one, then the
area s; 15-j squares
result
may be reasonably accurate statements, it is impossible to say how
9j < A < 16f
accurate they are. Neither is there any way of knowing whether or
not 13 is a better approximation than 15|. However, there is is obtained, a considerable improvement.
another statement which can be guaranteed to be correct. This is This idea should not be developed very far at this stage, but the
the statement that the area of the figure is sandwiched between an sandwich notion will become important later, especially in in
area of 7 whole squares and 24( = 7 + 17) whole squares— tegration. It therefore seems a pity not to plant the seed of the idea
in symbols early. The sandwich idea is taken up again in chapter 9.
7 < A < 24
7.3. Two Remarks
where A is the number of units of area covered by the figure. The
essential difference between each of the statements (I) In the example of the previous section, the approximation
A ^ 13 and A
and the statement was obtained by counting the squares inside the figure and by
allowing each part square inside the figure to count as one half-
7 < A < 24
square. The inequality
is that while the first statements are vague, the second is precise.
7 < A < 24
Without additional information there is no way of knowing how
near A is to either 13 or to 15^. On the other hand the second was also obtained, and it will be noted that 1 5^ is the average of 7
statement, 7 < A < 24, is not an approximation at all. It gives and 24, that is 1 5^ = |(7 + 24 ).
WHAT IS AREA? 53
That this is no coincidence may be seen by the following
argument.
Let s be the number of complete squares inside the shape (see
figure 7.3) and 5 the number of squares required to enclose the
shape. Then the sandwich inequality states that
s<A<S
But the number of squares cut by the perimeter of the shape is S —s
and part of each of these squares lies inside the shape. Each of these
S — s cut-squares is taken to count one half-square towards the
area.
Hence
which simplifies to
s+S
It would be good practice to write down the sandwich inequality Figure 7.3 The sandwich inequality.
first and then to use it to write down an approximation for A. This
would prepare the way for the development of the sandwich idea at
a later stage.
The discussion of such graphs in many texts does not bring out
this modelling aspect of the graph. These books tend to give the
impression that if the stop at Darlington on the Durham train is 500
taken into account as in figure 8.1 (ii), the train actually travels like
that. Although the straight-line model with the stops shown is a
400
good one, and is actually used by British Rail in their day-to-day distance in
timetabling, teachers should emphasize to pupils that the straight- kilometres
line graph is only a simple approximate model which shows 300
average speed very clearly.
Detailed discussion of such straight-line graphical models helps 200
to bring out this fact that the interval of time over which an average
speed is found may be very short. It is quite reasonable to say that
100
the average speed of a train is 150 km/h for a period of 5 minutes; it
means that in the 5 minutes in question the train travelled 12^ km.
To have an average speed of 150 km/h a train does not have to
0 1234
travel for 1 hour.
time in hours
o
There are many ether examples, as well as distance and time,
where a real-life situation is usefully modelled by a straight-line
graph, in such a way that the gradient of the graph is meaningful. In
all these situations, the gradient of the straight-line graph repre
300 sents a rate. Some examples are given below.
(i) Figure 8.3(i) shows the petrol consumption of a car. A
straight line has been used as a model, and its gradient represents
200
the rate of consumption of petrol, which is measured in miles per
gallon.
100 (ii) Cloth is sold by the metre; a graph can be drawn of cost
against length (figure 8.3(ii)). The marked price represents a rate
of, for instance, £1-25 per metre. It is the gradient of this straight-
line graph which represents the price rate of £1-25 per metre.
time in hours (iii) The exchange rate of francs against pounds is the gradient
of the graph (figure 8.3(iii)) which shows how many francs the
tourist will obtain for his pounds.
(iv) When cricketers measure the rate of scoring needed to win,
400 they give it as, for instance, 7 runs per over. This is very clearly an
average rate required over the remaining overs of the match, and
fluctuates from over to over. In examples (ii) and (iii), however,
300
although an average rate is used, it is a constant average rate.
These four examples call attention to the fact that in everyday
200 life, a rate or (more properly) an average rate is not always
measured with respect to time. Of course there are many examples
gradient = 80
100
of rates with respect to time, such as the rate of growth of a plant, or
the rate of inflation, as well as our first example of speed.
Speed is not always clearly seen as a rate, but it is in fact the rate
0 123456* at which a moving body changes its distance from a fixed point
with respect to time, and so is a rate of change. The units used for
speed, such as kilometres per hour, are entirely comparable with
Figure 8.2 The gradient of a distance-time graph: (i) the steeper graph represents other units for rates, such as runs per over, or miles per gallon.
the greater speed; (ii) calculation of speed from gradient. The links between speed and other rates, and between rate and
58 FROM GRAPHS TO CALCULUS
Figure 8.3 The gradient of a straight-line graph: (i) rate of consumption of petrol; (ii) price per unit of length; (iii) rate of exchange for foreign currency.
the gradient of a graph, need to be carefully made. The following Rale. The rate giving the number of pence per book is the same for any pair of
numbers in a row of the table.
extracts from Modern Mat hematics for Schools (second edition) From the first row we see that the rate is 30 pence per book.
form a train of thought which needs careful development by the The rate is also given by -'f, or ™, or -Lf2, or j%, or J-.
teacher if the pupils are to make the connections expected. The (MMS Book 3, p. 167)
development from rate oj change is particularly important
Exercise 2
The costs of different numbers of a certain book are shown below. 1 Find the rate in each of the following, in the units stated:
Number of pence a 8 oranges cost 24p: pence per orange
Number of hooks in the coat b 20 bars of chocolate cost 90p; pence per bar
1 30 c 273 units of electricity per week ; units per day
2 60 d 20 km in 4 hours; km per hour
3 90 e 20 apples weigh 2 kg; apples per kg
4 120 / 140 km take 2\-hours; km per hour
10 x (MMS Book 3. p. 168)
v
One-to-one correspondence. There is one. and only one, cost corresponding to
a given number of books: and one, and only one, number of books Speed is rate of change of distance. When the speed is constant, the distance is
corresponding to a given cost. So we say that there is a one-to-one cor proportional to the time and. as we saw in Chapter 2, the graph of a direct
respondence between the number of books and the cost, as indicated by the proportion relationship is a straight line.
arrows. {MMS Book 3. p. 206)
FIRST IDEAS OF SPEED AND GRADIENT 59
8.5. Gradient of a Road: a Red Herring
Pupils will also have drawn graphs such as that of the function
x i—> — 2x (figure 8.6(ii)). They will see that to an increase in x there
Figure 8.4 Gradient of a road. corresponds a decrease, or a negative increase, in y. Thus the
definition of the gradient of a straight line,
road, rather than horizontally, so the gradient described by the increase in y
road sign is sin 0, not tan 0. Moreover, a railway gradient post corresponding increase in x
shows
inevitably attaches a negative gradient to such a straight line.
-=—ry or cosec 0. \ Similarly, the graph of the function x i—> mx + c has a gradient of
sin u
m. However, this last statement represents a considerable step in
abstraction from the previous one, for now a general statement
8.6. Gradients of Straight-Line Graphs about the whole class of straight-line functions is made, and the
teacher should not take this last step until he is sure his pupils are
It is now an easy step to discuss the gradients of the graphs of ready for it.
straight-line functions, such as xt->2.\. Pupils who have used
straight-line graphs to model the situations above will realize that
the gradient of the graph shows the rate of change, and so will 8.7. A Change of Wording
realize that the study of the gradient of a straight-line graph such as
that of the function xi->2x gives important information about One further point needs to be made; it concerns the nature of the
rates which can be applied in a variety of contexts. Pupils easily words distance and speed. It is unfortunate that these words, which
verify that the graph of the function ,x H-» 2x has a gradient of 2. are used so much in everyday speech and thought, are not
60 FROM GRAPHS TO CALCULUS
y' •
-2x
^corresponding
^increase in )'
considered by mathematicians to be sufficiently precise. Gradients way the distance between two points is always positive, while
of straight-line graphs may be either positive or negative, but when mathematicians wish to allow for the possibility of negative
average speeds are always thought of as positive. The word velocity distance they use the word displacement.
is used when the direction of a speed is taken into account, so These "official" words will be used in those remaining parts of
velocity is allowed to be negative. Hence it is average velocity which the book which use the ideas of motion.
corresponds to the gradient of the straight-line graph. In a similar The ideas of this chapter are developed in chapter 12.
9 Numbers
9.1. The Need for Real Numbers in Graphical Work and the pupil is usually (and rightly) expected to accept this
assumption in good faith without even realizing that it is an
When a pupil is asked to draw the graph of a function such as assumption.
xi—>x2 and to join the small number of points he has plotted to If, instead, a pupil attempts to find ^/2Q using a calculator, the
form a curve, there is an implicit assumption that each point of the calculator may show that
x-axis represents a number and that each point of the y-axis also
represents a number. v/20 = 4-47213595
Pupils might then be asked to read off from the graph of x t->x2 but when the number on the right-hand side is squared it is found
an approximation to the positive number whose square is 20. This not to be exactly equal to 20.* Indeed, none of the successive
task carries the implicit assumption that there is such a number, approximations which a calculator might produce,
44, 4.47, 4-472, 4-4721 and so on,
has a square which is exactly 20, although the sequence has a
limit whose square is exactly 20. This concept of a limit, which will
be necessary in the later study of integration and differentiation,
will involve pupils in an intuitive appreciation of the complete
system of real numbers.
This chapter is devoted to a discussion of a way in which this
intuitive appreciation of real numbers may develop in children
before the age of sixteen by using the interplay between numbers
and their representation on the real number-line; that is, by using
the interplay between arithmetical and geometrical thinking about
numbers.
Many of the ideas are subtle and delicate. The teacher will be
introducing them implicitly in discussion and by assumption,
rather than by formal exposition, over a period of some years. The
development of pupils' concepts of number is one of the most The interaction of three ideas, the number itself, its repre
important tasks in mathematics, and this chapter is therefore sentation on the number-line, and its written representation in the
addressed to the teacher, so that he may examine the development denary system for the purposes of calculation and comparison,
of the number concept as it is needed for later work in calculus. The forms the theme of this chapter.
level of exposition is often more sophisticated than that which
would be suitable for pupils, who may take for granted much of
what is explicitly stated here. 9.3. Fractions and Decimals on the Number-Line
Most pupils know before the age of eleven how some simple
9.2. Early Stages fractions such as f or 1^ and some decimals can be represented on
the number-line, using different scales as convenient (see figure
By the age of eleven, most children have absorbed the whole 9.3).
numbers into their thinking. They have met the number-line and
H 11
they see the whole numbers as attached to equally spaced points on 1——————————————1
0 1 2 3
Finure9.5
9.4. Recurring Decimals on the Number-Line
A correspondence has now been set up between some arithmeti
cal ideas and some geometrical ideas. These ideas are the When 3 is converted by division into a decimal, there is at every
development from whole numbers to fractions, and the develop- stage a remainder. It is usual practice, as in the quotation below
64 FROM GRAPHS TO CALCULUS
Figure 9.6
0-1
so that although the decimal graduation poihts on the number-line When directed numbers are introduced, the number-line needs
become indefinitely close together, none of them ever falls exactly to be extended indefinitely in the negative as well as the positive
on the point which corresponds to f The explicit discussion of this direction, so that it can represent positive and negative whole
point may help pupils to extend their ideas about number. numbers and fractions. At this time, official mathematical voca
But if a number-line which is graduated in thirds, ninths, etc., of bulary can be introduced: the positive and negative whole
a unit is taken instead of one which is graduated in tenths, numbers are the integers, and the positive and negative whole
66 FROM GRAPHS TO CALCULUS
numbers and fractions together make up the rational numbers. The procedures for exact calculation. This might be considered as the
correspondence now contains three sets of ideas: price paid for the beauty of the decimal system in which the relative
sizes of any two numbers may be distinguished by inspection.
Positive and
Positive and negative Pupils who have begun to explore fractions on the number-line
fractions, or positive and probably think that to every point of the number-line there
Arithmetical ideas negative —
negative terminating and
whole numbers
recurring decimals corresponds a number of a type they know: in other words, a
f rational number. At this stage, it is difficult not to reinforce the
belief that every point on the number-line corresponds to a
Line with equally Line with
Geometrical ideas spaced points —— • fractional points rational number, because of the very necessary emphasis which
marked marked must be placed on interpolation from graphs. From the graph of
x i—>x2, a pupil finds that (as nearly as he can tell) ^20 = 4-5. He
should also be encouraged to verify that (4-5)2 = 20-25, so that his
Rational result, although near, is too big: -y/20 < 4-5. With a sharper
Integers
numbers
pencil, a larger scale and a good deal of luck he may
obtain the approximation 4-47 for ^20. But (4-47) 2 = 19-9809.
9.5. Calculating with Recurring Decimals so 4-47 < ^/20 < 4-5. It is likely that pupils will draw from this the
The procedures which children have learned for calculating with belief that if only they could be accurate enough, or if they had a
terminating decimals break down for decimals which do not calculator with enough decimal places, they could find a decimal
terminate. For example, it is not possible to calculate directly whose square is exactly 20.
It is one of the most remarkable discoveries of Greek mathe
0-33333 ...... x 0-142857142857...... matics, and still comes as a great shock to many pupils today, that
because it is not possible to find "the right-hand end of the there is no rational number whose square is exactly 20. But much
recurring decimal". All that can be done is to cut short the further preparation is needed before pupils will appreciate the
recurring decimals, to calculate with the resulting terminating significance of this fact.
decimal approximations, and hope that the result of the calcu
lation is a reasonable approximation to the answer.
Exact calculation of the product of the two recurring decimals 9.6. The Sandwiching Process and Measurement
above is possible by converting them to fractions, carrying out the
multiplication and then reconverting to a decimal. This gives A further step in the argument comes from the combination of
sandwich inequalities with ideas derived from practical measure
i x i = ^L = 0-047619047619047619......
ment. No measurement made with a ruler or other measuring
The change from a fractional representation of a rational instrument is ever exact; measurements are made "to the near
number to a decimal representation has resulted in the loss of est ...". The statement that the lenath of the room is 4-27 m must
NUMBERS 67
be taken to mean that it is in the range from 4-265 m to 4-275 m. It of enclosing a number in a sequence of sandwich inequalities
has been (presumably) measured to the nearest centimetre, so that whose width can be indefinitely decreased (figure 9.8) is of utmost
the actual measurement is sandwiched in an interval of length 1 cm importance when moving from the rational numbers to the
(or 0-01 m). irrational numbers.
4-265 ^ / ^ 4-275
0-01
important that the most-able pupils should realize that irrational Sandwich inequality Length oj interval
numbers are part of the theoretical construction of the number 2 /5 3 1
2-2 5 ^ 2-3 0-1
system.
2-23 = 5 ^ 2-24 0-01
2-236 = S =5 2'237 0-001
9.10. The Introduction of Irrational Numbers in Texts
In most texts, irrational numbers are first encountered in At this stage, pupils may begin to realize intuitively why the
connection with Pythagoras' Theorem and the need for square sandwiching process can never end, for to obtain a terminating
roots. A typical treatment is that of the Scottish Mathematics decimal whose square is exactly 5-000....0, the last digit would
Group shown below (Modern Mathematics for Schools, Book 4, have to be 0, and so would the digit before, and so on. Thus,
p. 19): discussion of the sandwiching process helps to build up the
u
realization that non-terminating non-recurring decimals, or ir
rational numbers, do actually turn up in mathematics. Apart from
\A the square roots, other irrational numbers which pupils are likely
to meet before the age of sixteen are n and the trigonometric
functions.
-2-101;l*k 3 However, the rational numbers are very few among the totality
24
of the real numbers, and the vast majority of points of the number-
line do in fact correspond to irrational numbers. This fact is on the
From figure 24, using Pythagoras' theorem.
face of it most surprising, when the rational numbers are so
,- 2 = 2 2 + l 2 = 5
familiar. An informal explanation may be given as follows. A real
number between 0 and 1 could be generated in a random way by
Hence r = ^/5 = 2-236 ..., a number which cannot be expressed as a ratio of using a ten-sided die labelled 0,1,... 9, and taking the digit
two integers or as a decimal. Therefore vo does not belong to Q.
obtained as the first decimal place of a number. The die could
^/5 is called an irrational number. We can find its approximate position on
the number line by drawing an arc of a circle with centre O and radius OA as similarly be used to give a digit for the second decimal place, and
shown. any number of further decimal places could be similarly generated.
It is very highly unlikely that such a random process would
This is a good opportunity for raising several points in class generate one of the two types of non-terminating decimals which
discussion. First, on the real number-line which the pupils correspond to rational numbers: those such as
visualize, the exact position of the point representing ^/5 can be 0-342134213421... in which there is always a cyclic repetition, and
constructed by the method described. Secondly, the statement those such as 0-250000000 ... which correspond to terminating
r = v/5 = 2-236 ... can usefully be examined by the sandwich decimals. Thus, it appears probable that comparatively very few
method described above real numbers are rational.
NUMBERS 71
9.11. Proofs of Irrationality that an irrational number cannot be written as a fraction in any
number base, a point about which pupils sometimes have doubts.
Able pupils will be interested to see how the fact that a number By contrast, the expansion of a rational number always terminates
such as ^/5 is irrational can be established. All such arguments are in some base, as was shown in section 9.4.
proofs by contradiction, a method which able pupils should meet
before they are sixteen.
The proof that ^/5 is irrational begins by supposing, on the 9.12. Recommendations
contrary, that ^/5 is a rational number. Hence there are integers p
It is suggested that the following ideas should form the goal of
and q such that
work on the real numbers with able pupils.
1. Every point on the number-line corresponds to a real number
and, conversely, every real number corresponds to a point on
and so the number-line.
P2 = V- 2. To every fraction there corresponds a terminating or re
curring decimal, and conversely, to every terminating or
Because q2 is a perfect square, it has an even number of prime recurring decimal there corresponds a fraction.
factors. Thus 5q2 has an odd number of prime factors. But 3. Every real number, whether rational or irrational, can be
5q2 = p2 and therefore p2 must have an odd number of prime approximated to within any required degree of accuracy
factors. But this contradicts the fact that p2 is a perfect square. using terminating decimals.
Hence ^/5 is not a rational number. 4. Calculations can only be carried out by the procedures for
This proof clearly extends to a proof that the square root of terminating decimals.
every prime number is irrational. Finally, the concept of real numbers is very closely bound up
It is worth noticing that the above proof is independent of the with the related idea of a limit, which is discussed in the next
number base chosen to represent p and q. This emphasizes the fact chapter.
10 The Idea of a Limit
10.1. Strategy worksheets. Discussion between pupils will make an important
contribution to the work.
Although the idea of a limit, and the notation for limits, lie at the
heart of the differential calculus, they receive little attention in the
introduction to differentiation in any of the texts reviewed. This is 10.2. First Ideas
not because ideas about limits have been growing steadily
throughout the secondary course—far from it. In fact, it is usual for In the primary school, children meet many situations in which a
the idea of a limit to be met for the first time in the most difficult of process is repeated again and again, and the same process could be
all possible circumstances: the limits commonly used in differen repeated indefinitely. Many of these repetitive processes are
tiation, which are limits of fractions where both the numerator and concerned with sequences.
the denominator tend to zero. This is conceptually the most (i) Peter (aged 9) said "Tie a knot in your handkerchief to
baffling situation in which to grasp the idea of a limit. It is remind you. And another to remind you what that's for. And
suggested in this chapter that there are many fairly simple another. And another. And...."
mathematical situations in which pupils can gain informal ex (ii) A familiar exercise is "Give the next three terms of the
perience of limits from very early in the secondary school. If they sequences
do this, they will find the limits used in differentiation easier to 1,3,7,13,....
understand, and progress will be correspondingly greater.
It is not suggested that a formal treatment of limits should form 2,3,5,8,12,...."
part of the work done by secondary pupils; that must properly be (iii) Squares are often made with Cuisenaire rods. Children
deferred for higher education. However, some reference to limits is explore how many units are needed to turn each square into the
essential to any work on differentiation and is also needed for the next one (fiaure 10.1).
study of infinite geometric and other series at the next stage of
education.
It is extremely difficult to convey in print the delicacy of touch
which is necessary. The work described subsequently in this
chapter is not suitable for conventional teaching and testing, and
there may be little or no written output by the pupil. The aim is to
encourage ideas to grow gradually, and there need be no attempt in
the early years to draw things together. Many of the ideas
suggested are suitable either for group working or for individual Figure 10.1 Construction of squares with Cuisenaire rods.
72
THE IDEA OF A LIMIT 73
(iv) Repeated doubling produces a sequence whose terms grow We return to this example to discuss its mathematical impli
very rapidly: 1, 2, 4, 8, 16, 32, 64, 128, 256, 512, 1024,. .. cations in section 10.5.
From these experiences, children gain the idea that many (ii) A variation on this situation is obtained by rearranging the
mathematical processes could be repeated indefinitely, or as they squares and rectangles into a spiral, as shown in figure 10.3. Pupils
say, "for ever". Peter's enjoyment came from his vision of knotted can investigate to which point the spiral converges.
handkerchiefs receding into the dim distance. But, at this stage,
most of the work is done with whole numbers, so that the terms of
the sequences are likely to grow beyond all bounds. Ideas about
limits begin to develop when situations can be explored in which a
sequence "settles down" or converges. This may initially be more
vivid in geometrical rather than arithmetical contexts.
1 dm 2 1 dm 2
area 1 dm 2 area 1t dm 2
i dm 2 i dm 2
1 dm 2 1 dm 2
J- dm 2
idm 2
\ dm 2
2 3 4 5 Gal 10
Figure 10.2 Geometrical introduction to convergent sequences. Figure 10.4 Curve stitching.
74 FROM GRAPHS TO CALCULUS
points on each line are numbered as shown in figure 10.4, the point Positions of the rabbit are drawn at R0, R 1; ..., R 10, each 1 unit
1 is joined to 10,2 to 9 and so on. The lines appear to form a curve. apart. Each part of the dog's path is 1-5 units long and D0 D, is
This is illusory, but better and better approximations may be directed towards R0, D,D 2 towards R l5 D2 D3 towards R 2 and so
obtained by inserting more and more lines, using the rule on. The dog catches the rabbit after about 12 seconds. By taking
a + b = 11 for deciding which points should be joined; a and b short enough time intervals, the path drawn can be made as close
may be fractional or negative. to the actual path of the dog as we please.
(iv) Pursuit curves give another method of approximating to (v) Another example of a similar type concerns four can
curves by using small line segments. A dog D is chasing a rabbit R. nibalistic beetles. These beetles, A, B, C and D start at the corners
The rabbit runs along a straight line at 1 m/s. The dog can run at of a square. A wants to eat B, B wants to eat C, C wants to eat D,
1-5 m/s and always aims directly towards the rabbit. and D wants to eat A. All the beetles walk at the same speed, and
R0 R, R2 R3 R4 R5 R6 R7 R8 R9 R 10
the path of each beetle is aimed directly towards the beetle it is
chasing.
path of rabbit
path of dog
Figure 10.6 Pursuit curve: four cannibalistic beetles.
spirals, but once again intuition suggests that the use of shorter
steps would have given a closer approximation to the actual paths.
It is interesting that in the last two examples we are, in effect,
solving differential equations by step-by-step methods. This idea is
introduced in section 16.4
(vi) The last geometrical example is of the snowllake curve, and
returns to the sandwiching theme. To construct the curve, an
equilateral triangle is drawn and each side is divided into three
equal parts. An equilateral triangle is constructed on the middle
third, as shown in figure 10.7, and the process of constructing
equilateral triangles on the middle third of each line is repeated
indefinitely.
It is clear that both the area and the perimeter of the snowllake
curve increase at each step. Construction of the first few approxi-
76 FROM GRAPHS TO CALCULUS
mations both to the snowflake curve, and to the "anti-snowflake" 10.4. Further Simple Convergent Sequences of Numbers
curve which is obtained by inscribing the original equilateral
triangle in a larger one, and then at each step drawing inward- The advent of cheap electronic calculators makes possible an
facing equilateral triangles on the two outer thirds of each line investigation of convergent sequences in a way which was
(figure 10.8) will convince pupils that the area and perimeter of the previously not possible.
snowflake curve behave differently. The area is sandwiched (i ) Suppose we wish to find ,/6 by successive approximation or
between the areas of the snowflake and anti-snowflake curves by trial and error.
(figure 10.8). The perimeter of the snowflake, however, is multi We know that
plied by j at each step, as the effect of the construction at each step
is to increase the length of each line by j- of itself. Thus the sequence 2 < v'6 < 3.
of perimeters is
By squaring 2-4 and 2-5 we find that
3, 3x(f), 3x(f) 2, 3x(-j)3 ,..., 3 x &"-',...
and this increases bevond all bounds. 2-4 < v/6< 2-5.
78 FROM GRAPHS TO CALCULUS
The next step is Teachers should be aware that pupils may think that the
2-44 < 76 < 2-45 calculator has fixed on 10 as the exact value but this is not so. A
and the next is calculator which displays ten digits may store three more digits in
2-449 < v/6 < 2-450. the calculator and the number displayed will be a thirteen-digit
number corrected to ten significant figures. When this corrected
Pupils find it easy to believe that they can get as close as they version is 10-00000000, the calculator simply shows 10.
please to ,/6 by going on long enough, although they do not know
and cannot find the exact value of %/6.
(ii) The classical Newton method for square root converges 10.5. Sequences given by Recurrence Relations
much more rapidly than that discussed above. It uses the idea that
if x is an approximation to ^/N, then When a young child is asked how to get the next term of the
sequence
\( NN
2, 5, 10, 17, 26, 37,...
which is the average of x and N/x, will be a better approximation. he is much more likely to say
Unlike the previous method, Newton's method improves more "you add 3, then 5, then 7, then 9, . . ."
than one decimal place at each step. In fact it roughly doubles the
number of decimal places which are correct at each step, and so is a than he is to say
very rewarding exercise, even if a poor first approximation is taken. "it is the next perfect square, plus 1".
For example, for N = 2, using 1 as a first approximation,
He prefers the recurrence relation, which tells how to calculate the
successive approximations are
next term of the sequence from the previous ones, to the formula
1, 1-5, 1-42, 1-4142, 1-414213562 for the nth term. In official mathematical language, he prefers the
additive approach of the recurrence relation
and although the calculator then shows no further change, pupils
should realize that (1-414213562)2 is not exactly equal to 2. xi=2, xn+l =xn
With N = 100 and 1 as the first approximation, the figures
shown by the calculator are to the formula for the nth term,
x andj —
N to get the
, next approximation
• • „ This is extremely quick and easy, giving for N = 10 and x i = 2,
the sequence
rather than 2, 2-1147, 2-1444, 2-1519, 2-1538,....
N
= i A',, + Again, if the (assumed) limit is x, (x ^ 0),
(i) If a reciprocal key is available on the calculator, the routine
=> x4 = NX
. N-l
=>x 3 = N.
(in) By a natural method this can be extended to other integral
is easy to use for
roots. For example
This method does not need any intermediate writing down, but
it does not converge so quickly as Newton's method. Pupils will
easily see that the method does converge, but may doubt whether it
converges to ^/N. If, for example, they take N = 10, and start with
x 1 = 3, they obtain successively (to 4 decimal places) and
3, 3-25, 3-1176, 3-1857, 3-1502, 3-1686,... all give sequences of approximations to N l!S .
so that successive values oscillate and become nearer together, (iv) An interesting method of finding the reciprocal of N
sandwiching a number which lies between 3-1502 and 3-1686. An without using division is the relation
algebraic calculation then gives, if the (assumed) limit is denoted
by .x,
Pupils might be given this relation and asked to explore what it
x = 1, + -——
N-\ does, for different values of ,V, and for different starting values. This
1 +x sequence only converges to I/TV from starting values fairly near
OX(1+X) = 1 + X + /V-1 l/N, and serves as a useful antidote for pupils who are beginning to
o x + x2 = x + N think that all sequences converge.
80 FROM GRAPHS TO CALCULUS
(v) The linear recurrence relation For pupils who are familiar with matrix notation it may be
convenient to use the notation
ax,, + b
Pn+l N\fpn
is well worth exploring for various values of a, b and c. \<J»+\
An interesting exercise is to find the temperature which Then pn+ l /q,,+ j is a better approximation to ^/N than pjqn.
is the same in °C as in °F. This is given by x = |(x — 32) and can be We give an example for N = 3 taking p l = q l = 1. The calcu
solved iteratively using the recurrence relation lator is used only to verify how the sequence is converging.
Successive terms are
x,, + 1 =f(x,,-32).
4 10 76 208 1151
6 i
Conversely, the temperature which is the same as °F in °C, has the 2> 44' 120' 896 '
recurrence relation We note that (^^f)2 = 3-0003 ... The reader should consider why
this method works.
(viii) In two articles in Mathematics Teaching*, Dr. T. J.
These two recurrence relations have different convergence proper Fletcher shows an ingenious and effective method of approximat
ties. ing using vectors. The article would give excellent material for
(vi) Another interesting recurrence relation is workcards, and once again shows sandwich methods at work.
(ix) The calculator is particularly useful for solving equations of
S l = a, Sn + i = rSn + a, where r < 1. the type x =/(x) by using the recurrence relation x,, + , =/(x,,)
with a suitable starting point. This does not always work, as some
This gives the sum to (n + 1 ) terms of the geometric series of the above examples illustrate, but it works often enough to be
a + ar+ ... + ar" worth trying. A good elementary discussion, though not suitable
for pupils before the sixth form, is found in Elementary Calculus
and can be used for series such as and Co-ordinate Geometry II by C. G. Nobbs, OUP, p. 339.
(x) The matrix
0-5 0-2
0-5 0-8
(vii) The following method of obtaining a sequence of rational
number approximations to ^/N gives the approximations in
"fractional" form. maps the vectors
Suppose pn and qn are positive integers such that pjq,, is 0-5 '0-2
an approximation to JN. Then let Pn +i=pn + Nqn and and to and
0-5 0-8
THE IDEA OF A LIMIT 81
respectively. Then the matrix maps
0-2
0-8
35)
to
0-5 0-2 0-5 '0-5 0-2 '0-2 '0-26
0-5 and 0-8 0-74
0-5 0-8 0-65 / 0-5 0-8
Writing
and
(iii) Gary is clear, however, in his assertion of superiority "I can draw more More-able pupils will be able to see that at the nth step, the
squares than you", that however many terms of the sequence Angela has number of dm 2 filled is
drawn, more terms are always possible, and that the area always
increases. 1 1
Between them they realize that 1 +- + - + ...+
24 /•• - r '
A! < A 2 < A 3 < ... < An < A,, + 1 < ... < 2
and that the difference between An and 2 can be made as small as we The unfilled area of 1/2" ~ l dm2 can be made as small as we please
please by taking n large enough. by taking enough steps, but for every n there always is an unfilled
area. At this stage, the following wording should be used, without
In the class discussion which followed this experiment, the mention of limits:
pupils were equally divided between two points of view. The area
had now been represented arithmetically on the blackboard as We can make the filled area An as close as we please to 2 dm 2 by taking a large
enough number of steps. The filled area converges to 2 dm 2 .
Some pupils thought that the sum must always be less than 2 ('"cos The pupils' attention can also be drawn to the fact that we have a
you can't fill the corner in."). Others maintained that eventually sequence of numbers
the sum would exceed 2 ("You keep on adding more—it must get
bigger".)
In order that pupils gain the maximum of understanding about in which the terms are always increasing, but all the terms remain
limits from this and similar examples involving summation, it is less than 2. This sequence, which is bounded above, is of a
important to use a notation which hides nothing; the usual fundamentally different nature from the many sequences already
notation familiar to the pupils, such as
2, 3, 5, 12,
hides a great deal, and should not be used at this stage. which increase beyond all bounds.
The pupils' attention should be focused on two points: clarity "Achilles and the tortoise" provides another situation which can
about successive steps of the process, and the size of the difference usefully be linked to the last one. It also has the advantage that
between each member of the sequence (AJ of areas and 2. pupils are likely to start off not knowing exactly where or when
Area filled/dm 2 Area unfilled/din 2 Achilles catches the tortoise. A step-by-step analysis with a
1st step 1 distance-time graph is helpful.
2nd step i To simplify the numbers, we use a rather speedy tortoise.
3rd step "Achilles can run twice as fast as the tortoise, so he gives the
4th step
5th step tortoise a kilometre start. While Achilles runs the kilometre, the
tortoise runs \ km; then while Achilles runs that ^-krn, the tortoise
nth step
runs 5 km; and so on."
THE IDEA OF A LIMIT 83
————————————————————————»*a' S
W ' 1 1
."• 4. ' A km to go
X 2 > 2 km to go
km to go
1 1
0 1 km 2 km
Figure 1 0.1 1
We can think of this in several ways. A number which often baffles pupils is 0-999 ... or 0-9. They may
First, the sequence of rational numbers 0-3, 0-33, 0-333..... is say that the difference between 0-9 and 1 is "point nought recurring,
increasing, and it is also bounded above by j. We can, however, get one", indicating that they do not yet understand the significance of
as close as we please to 5 by going on far enough in this sequence. It the recurring figure. The sequence of rational approximations to
is instructive for pupils to calculate the difference between the 0-9 is
successive terms of the sequence and ^:
0-9, 0-99, 0-999,...
•TO = 301
33 1 and the sequence of differences between these numbers and 1 is
100 300
333 1 0-1, 0-01, 0-001,....
' 1000 3000
and so on. Thus we can make the rational number 0-999... 9 as close as we
We say that the sequence please to 1 by going on far enough in the sequence. It is perhaps
unfortunate for young pupils that this fact is written
0-3, 0-33, 0-333,..., converges to i
0-9 = 1
Similarly, the second sequence
in the same way that we write 0-3 = ^. What is meant is that the
0-4, 0-34, 0-334,....
sequence 0-9, 0-99, 0-999,... converges to 1.
is decreasing, all its terms are greater than ^, and it also converges Appropriate forms of words to discuss these phenomena with
to | because the terms can be made as close as we please to 3 by 'pupils are:
going on far enough in the sequence. (i) 0-3 is the number whose approximations are
Alternatively, we may look at the successive sandwich in
equalities, note that the thickness of the sandwiches can be made as 0-3, 0-33, 0-333,....
Sandwich Thickness of sandwich
0-3 =? i sc 0-4 0-1 The approximation can be made as close as we please to 5, so
0-33 s: 1 s: 0-34 0-01 we say
0-333 < 1 ^ 0-334 0-001
0-3 =i
small as we please by going on far enough and that there is one
number, y, which is common to all the sandwich inequalities. This (ii) 0-9 is the number whose approximations are
is sufficient to show that both the sequences
0-9, 0-99, 0-999,....
0-3, 0-33, 0-333...
and The approximation can be made as close as we please to 1,
0-4, 0-34, 0-334 ... so we say
converge to j. 0-9 = 1.
THE IDEA OF A LIMIT 85
10.8. The Key Idea approximations to ,y6 must also decrease, so an approximation
can be found which is as close as we please to v/6.
Suppose we have a sequence of numbers. We shall say that this
Lower Upper Difference between
sequence converges to a number or approaches a limit if we can approximation approximation v'6 and each approximation
convince ourselves that we can get as close as we please to this 2 3 Less than 1
number or limit by going on far enough in the sequence. 2-4 2-5 Less than 0-1
2-44 2-45 Less than 0-01
We shall use this key idea
2-449 2-450 Less than 0-001
We can get as close as we please to...
Hence we say that the sequence of rational approximations 2,
again and again, for it is the essential first step in developing the 24, 244, 2449,... converges to ^6.
limit concept.
What we cannot do, and it would not be appropriate to try in
any systematic way with sixteen-year-old pupils, is to prove in all
the examples we have given that the sequences have limits. At this
age, an intuitive grasp of the idea of a limit should be the objective. 10.10. Fibonacci Sequences
the sequence into two sequences, one of which is increasing and the useful preparation for the limits "at a point" used in differentiation,
other decreasing: and this preliminary work is important not only in enabling pupils
1-5, to tackle differentiation with understanding, but also in keeping
1, 1-6, 1-615384,...
1-6, mathematics alive as an investigating exploratory subject in the
1-625, 1-619047,...
years preceding the 16+ examinations. Nearly all this work can
(ii) It can be seen that if this pattern continues, and if the terms and should be done in play-like activities in which the limit idea
of the upper sequence can be made as close as we please to those of can develop over several years before being applied in the context
the lower sequence, there will be a number sandwiched in between of differentiation. The key concept in all work on limits is that we
them. It is intuitively clear that the sequence of ratios converges to can get as close as we please by... In different settings, we get "as
a number whose value is 1-61..., although we cannot say precisely close as we please" in different ways. The first two examples are
what number this is. In the case of this sequence, we are sure that it drawn from this chapter, while the third and fourth look forward
converges, although we cannot write down an expression for its to chapter 12.
limit. This is in contrast to the example of section 10.9, when we (i) We get as close as we please to the limit of a sequence by
knew the limit was 6. going far enough in the sequence.
(ii) We get as close as we please in drawing a curve of pursuit by
taking short enough steps.
(iii) We get as close as we please to the tangent to a curve at P by
10.11. Conclusion taking a chord through points P and Q which are close enough
together.
In each activity in this chapter, pupils will have the specific (iv) We get as close as we please to the actual velocity of a
objective of investigating a particular situation, but the teacher will particle by taking the average velocity over a short enough
also have in mind the growth of the concept of a limit. It is helpful if interval.
he uses forms of words such as "Can you get as close as you please Formal work on differentiation and other limit ideas at the next
to the number? (or the curve, or the point?)" stage will be much more intelligible to pupils if the limit idea is
The limits discussed in this chapter are largely limits "at allowed to grow slowly and informally before the official notation
infinity", although we have not used the phrase. They form a very for limits is introduced.
11 Area under a Graph: Velocity and Displacement
11.1. Introduction made of this aspect of area in the early years in the secondary
school. For instance, in the block graph in figure 11.1 the shaded
The idea that area under a graph can convey useful information area under the graph tells us the total number of pupils in the class.
should not be new to pupils, because it can be brought out in If a new pupil were to arrive, and information about him were
graphical work in the late primary school or early secondary added to the graph, the height of one of the columns would be
school in the context of block graphs or histograms. More could be increased by one unit and hence an additional area of one unit
square would be shaded. Areas of histograms will become more
significant and important in later studies of statistics, but an early
introduction to the interpretation of areas under graphs would be
most profitable.
frequency
In this chapter we concentrate on velocity-time graphs and on
the problem of convincing pupils that the area under a velocity-
time graph gives a measure of displacement. Many traditional
books and courses do not include this work, but a number of
modern courses for average and more-able pupils do.
E
_^;
T3
CD
CD
Q.
co
123456 12345
Time (h)
at a constant speed of 100 km 'h for 1 hour and then at a constant speed of 120 maximum. The waste is then produced at this constant rate for 4 hours. After
km.'h for 2 hours. this it decreases. The graph in Figure 16 shows the amount of waste per hour
Do you agree that the red square represents a distance of 20 km? How many flowing into the river at various times during the day.
of the red squares can be fitted into the shaded area? What is the total distance The shaded rectangle represents a quantity of 5000 litres of industrial waste.
travelled by the car? The area of triangle .4 is i x2 x 15 = 15. Thus the area of region ,4 represents a
{ti) The graphs in sections (h) and k'| represent idealized situations. Lorries quantity of 15 000 litres of waste.
and cars cannot, in fact, change speeds instantaneously as the graphs suggest. A Calculate (i) the area of region B: (ii) the area of region C. Hence write down
more realistic description of the lorry's journey is represented by Figure 15. the total amount of waste flowing from the factory per day.
Use the graph to describe the lorry's journey in words. Calculate the area
under the graph and hence write down the distance the lorry travelled.
Now sketch a more realistic version of the car's journey than that represented The technique of leading the pupils gently into a new situation,
by Fiaure 14. Do not forget that the car has to travel a total distance of 340 km.
(t-I The rate at which industrial waste flows into a river from a factory
which is so ably used in this exercise, is one which is familiar to and
increases during the first two hours of a working day until it reaches a used by all teachers. The pupils are first convinced that the area
AREA UNDER A GRAPH: VELOCITY AND DISPLACEMENT 89
t 15
<u velocity
0) Q-
w w in km/h
CD g
.I'S 10
If
II
1 5
6 a.m. 7 1112 1
time (h)
Fig. 16.
what information the area under the graph conveys, provided that
velocity
the graph is a step-function, so that the area under it is a rectangle in m/s
or a sum of rectangles.
The next stage in the teaching strategy in the extract is shown in
(d), where it is suggested that although the velocity-time graph is
no longer a step-function, the area under it still represents the
displacement. By (e) the use of the word "Thus" in the sentence
"Thus the area of region A represents a quantity of 15 000 litres of
waste." makes the pupil believe that the fact that the area under the
graph gives useful information has been thoroughly established for
all graphs.
This technique, which is extremely useful in many places in the
teaching of mathematics, depends on the fact that it is common for
mathematical ideas to generalize from simple cases into a more
widely applicable setting. However, able pupils may recognize and
be worried when jumps are made without explanation, and the
teacher needs to make conscious decisions whether to discuss these time in seconds
jumps with particular pupils or groups of pupils. The decisions
depend on balancing the development of pupils' critical awareness Figure 11.3
against the degree of difficulty that a fuller treatment may involve.
Unfortunately, when an attempt is made to justify the repre
shown in figure 11.3. The argument runs as follows. As the particle
sentation of displacement by area under a velocity-time graph, a
starts with velocity 2 m/s and finishes with velocity 4 m/s, the
fallacious argument is sometimes used, and this is described in
average of its initial and final velocities is 3 m/s. Thus it might be
section 11.3.
regarded as travelling for 2 seconds at an average velocity of 3 m/s,
There is available, however, a straightforward argument which
so that it covers a distance of 6 metres. Also, the number of units of
forms an important step towards the understanding of integration,
area below the graph is . .
and which is intelligible and convincing to pupils. This argument is
discussed in section 11.4. 2+4
x 2 = 6.
11.3. An Incorrect Argument Hence this example might be thought to illustrate the result that
the area under a velocity-time graph represents the displacement,
Consider a particle which accelerates uniformly from a velocity even when the velocity is not constant.
of 2 m/s to a velocity of 4 m/s in 2 seconds. Its velocity-time graph is However, in general, the average of the initial and final velocities
AREA UNDER A GRAPH: VELOCITY AND DISPLACEMENT 91
is not equal to the average velocity. It will be recalled that
velocity
total displacement in m/s
average velocity = -
time taken
velocity velocity
in m/s in m/s
1 -
Figure 11.5 A sandwiching representation connecting displacement with the area velocity
under a velocity-time graph. i n m/s
It should be made clear by examples that these ideas do not only regarded as positive or negative according as the values of the
apply to velocity-time graphs, but also to other graphs such as that function are positive or negative. The reason for this is shown by
given as example (e) in the SMP extract. The reason that velocity- the following example. The parallel between displacement and
time graphs are dealt with first is that pupils are more familiar with area having been established, it needs to be maintained in all
velocity than they are with other rates. situations.
Consider the velocity-time graph of the particle X shown in
figure 11.6. Between the times t = 2 and t = 4, X is travelling with a
11.5. Negative Velocity and Negative Area positive but decreasing velocity. At 4 seconds it instantaneously
stops and starts off again in the opposite direction. Hence at the
A distinction needs to be made between the use of the word area end of 10 seconds it has travelled a distance represented by the area
in chapter 7, and some implications of its use in this chapter. The A in figure 11.6 forwards, and a distance represented by the area B
area of a plane figure, discussed in chapter 7, is always regarded as backwards. In order that the total area under the graph should
positive, whereas the area under the graph of a function needs to be represent the displacement at the end of the journey, area B must
velocity '
in m/s
velocity
in m/s
10 time in seconds
-1
-2
1 9 time in seconds
Figure 11.6 Negative velocities and areas. Figure 11.7 The trapezium rule.
AREA UNDER A GRAPH I VELOCITY AND DISPLACEMENT 95
be regarded as negative. Pupils should be aware that it is not an It is not clear in speech whether the trapezium rule refers to the
arbitrary decision that areas below the x-axis are regarded as idea of dividing the region under the graph into trapezia and then
negative. saying that the area under the graph is approximately equal to the
sum of the areas of the trapezia, or whether the trapezium rule
refers to the formula to which the idea leads.
11.6. The Trapezium Rule The idea is much more important than the formula. Formulae
are never easy to remember, and the trapezium-rule formula is no
In numerical work, pupils will often obtain sandwich in exception. In more advanced work other formulae are nearly
equalities for the area under a graph, as suggested above. If an always used when approximating to areas under graphs, and in
approximate numerical value is needed for an area under a graph, elementary work it always suffices to calculate the area of each
instead of a pair of inequalities, the trapezium rule is the best trapezium in turn. The calculation of areas under graphs is
method for pupils at this level to use. discussed further in chapter 13.
12 Velocity and Gradient
12.1. The Tangent to a Curve
displacement
in metres time in seconds
displacement
in metres
(a)
(c)
time in seconds
Figure 12.3 Approximations to instantaneous velocity through the link with the
gradient of a displacement-time curve. time in seconds
VELOCITY AND GRADIENT 99
close to P are very difficult to distinguish from the curve itself in the displacement-time graph using a ruler and pencil, and by calculat
neighbourhood of P. This leads easily to the conviction that the ing the gradient of their tangent line from measurements made
"actual velocity" at P is given by the gradient of the tangent to the from their graph. They can also learn through examples that the
curve drawn at P. gradient of a curve is useful in other circumstances.
It is true that mathematical difficulties can arise in trying to be For instance, a domestic gas meter measures the volume of gas
precise in explaining what is meant by "velocity at an instant" or used by the householder, and it is possible to draw a graph of the
"gradient of a curve", and mathematical purists may say that volume of gas used against time. This might have the general
neither concept is yet properly defined. At this stage this is characteristics of figure 12.4.
unimportant, for the concepts of "velocity at an instant" and In this example, the gradients of chords and tangents give
"gradient of a curve" are intuitively clear to pupils. It is much more information about the average rate of flow of gas and the actual
important for these concepts to be linked in the minds of the pupils rate of flow of gas. The gradient of the tangent at P corresponds to
than it is for pupils to be able to articulate clearly and concisely the rate of change of the volume of gas at P.
what the concepts mean. In fact, the concepts of velocity and Another example in which gradient is useful is provided by
gradient are strengthened by the link between them, and precise acceleration. In the same way that the average velocity of a moving
definitions are inappropriate until after that link has been made. body is calculated by measuring changes of displacement in a given
time, so also its acceleration can be calculated by measuring
—*
lime in seconds
time in seconds
Fieure 12.4 I-ieurc 12.5
100 FROM GRAPHS TO CALCULUS
changes of velocity in a given time. A sprinter who starts from rest 12.4. Remarks
and reaches a velocity of 10 metres per second in half a second has
an average acceleration over that time of 20 metres per second in This is a convenient stopping point for a school course and the
each second, or 20 m/s 2 . The average acceleration corresponds to SMP course and some other courses go no further than this
the gradient of a chord of the velocity-time graph. The acceleration towards calculating either areas or gradients without approxi
of a moving body at a particular instant corresponds to the mations being made en route. Pupils who have reached this stage
gradient of the tangent to the velocity-time graph at that instant. are well prepared for integration and differentiation, and in
addition they will understand clearly why integration and differen
tiation are important processes.
y /
While pupils are drawing tangents by eye, brief informal
discussion of drawing methods can help to form ideas which will be
useful in calculating gradients of curves without drawing. Pupils
use two methods.
(i) The ruler is slid up to the curve, its direction being adjusted
by eye until it approximates to the direction of the curve at
the required point P.
(ii) The pencil is placed at P and the ruler slid round from a
position such as PQ to a position where the second
Figure 12.6
y/
xi—> y xi—» y
intersection of ruler and curve, Q, appears to the eye to In order to calculate velocity from a displacement-time graph
move from one side of P to the other. without using drawing and measurement, pupils must return to the
The first of these drawing methods leads ultimately in the study idea that chords PQ for which Q is very close to P are difficult to
of differentiation to the method of defining the derivative of a distinguish from the tangent to the curve at P. When this statement
function at a point as the gradient of the best linear approximation is put into the context of displacement-time graphs, it becomes
to its graph (section 14.5). The second method leads to the idea very difficult to distinguish "the average velocity over a short time
that the derivative of a function at a point is the limit of the interval a to a + /i" from the actual velocity at P (figure 12.8).
gradient of a chord (section 14.3). Both ideas are valuable, and This suggests a method of calculation of velocity at the instant a
both can be built up from pupils" actions in drawing. corresponding to point P on the curve, for it is now a small step to
the conviction that we can get as close as we please to the velocity at
time a by taking a short enough time interval from a to a + h anil
12.5. Towards Definitions of Velocity and Gradient calculating the average velocity over this interval.
In the corresponding language about gradients of tangents to
In section 1 2.2 the intuitive concepts of velocity at an instant and graphs, the words in italics above become (figure 12.9).
gradient of a curve were linked, and in section 12.3 it was suggested we can get as close as we please to the gradient of the tangent at P by
that pupils should do plenty of numerical examples in which they taking
PQ.' 0 ~ close enough to P and calculating the gradient oj the chord
draw a tangent line to a curve and then calculate the gradient of the
tangent line using measurements made from the graph. This idea, differentiation, is developed in chapter 14.
13 The Definite Integral
Y = f(x)
y = f(x)
upper step-function
Figure 13.1 (i) Area under the graph of y=f(x); (ii) lower step-function;
(iii) upper step-function. For simple functions, numerical calculations are very quickly
made, as the following examples show.
We first find a sandwich inequality for the area under the graph
of y = x2 between x = 0 and x = 1, using lower and upper step-
functions with ten steps each of width 0-1. Figure 13.2 shows the
lower step-function.
The sandwich inequality is
0-285 sc area ^ 0-385
and is obtained very quickly indeed. (A calculator is more
hindrance than help!)
A calculator is more help if the inequality is refined by taking a
step-function with twenty steps each of width 0-05. This time the
sandwich inequality is
0-30875 sc area s£ 0-35875
If an approximation is made to the area under the graph by
averaging the upper and lower estimates (see section 7.3) the result
lower step-function is 0-33375. Hence, to 2 decimal places, an estimate of the area under
THE DEFINITE INTEGRAL 105
y = x 2 between x = 0 and x = 1 is 0-33, but a clear distinction
must be drawn between the inequality which is precise and the
average result which is approximate. 1 -
This work can be followed up by using similar approximate
methods to calculate areas under the graphs of y = x 3 and y = x4
between x = 0 and x = 1.
The table below gives approximate results, using ten-step step-
functions, for the areas under the graphs of y = xp for
p = 0,1, 2,..., 9. The step-function areas are omitted for p = 0 and
p = 1 because the required areas are those of a rectangle and a
triangle and can be calculated directly. In these cases the number in
the right-hand column is the actual area under the graph.
Average of
the lower
step-function
area and
upper
step-function
Area under Area under area (to
Value Equation lower upper two decimal
of;> of graph step-function step-function places)
0 0-5 1-0 x
0 v= 1 _ _ 1 Figure 13.2 A lower step-function for the graph of y = .v 2 .
1 V =x — — 0-5
2 V = X2 0-285 0-385 0-33
3 V = X3 0-2025 0-3025 0-25 This is not a proof, but it is a very convincing demonstration to
4 y = x4 0-1533 0-2533 0-20 pupils at this stage that the area under r = x'' between x = 0 and
5 y = x' 0-1208 0-2208 0-17
y = x (> 0-15 x = 1 is l/(p+l).
6 0-0978 0-1978
7 y = A" 7 0-0808 0-1808 0-13 It is suggested that the numerical verification of this result, given
8 v = x8 0-0677 0-1677 0-12 above, is sufficient for almost all pupils, and a formal theoretical
9 y = x" 0-0574 0-1574 0-1 1 proof should be postponed to the next stage of work.
The sequence of approximations in the right-hand column is
1,0-5,0-33, 0-25, 0-20, 0-17....
13.3. Definition of the Definite Integral
and suggests very strongly that the actual areas are
The definite integral of a function /between a and / can now be
J 1 1 i L
i it 3' 4> 5' 6> introduced as the area under the graph of y = f(x) between x = a
106 FROM GRAPHS TO CALCULUS
x"dx = ——-
13.4. Integrals of Increasing Functions From the sandwich assumption of section 9.7, there is just one
number which is sandwiched between every possible lower step-
By this stage, pupils should have realized intuitively that if they function area and every possible upper step-function area, and we
make the steps of the step-functions narrower, they make the can, by making the steps of our step-function sufficiently narrow,
sandwich inequality thinner. Indeed for all the graphs so far identify this number.
considered the thickness of the sandwich inequality, or the This argument applies not only to the integral of y = xp between
difference between the upper step-function area and the lower step- 0 and 1, but to every increasing function because the shaded areas
function area, is of the form shown by the shaded area in figure can always be stacked. Hence every increasing function has a
13.4. All these areas can be slid over to the left and stacked into the definite integral which can be evaluated by a sandwiching process;
column shown against the y-axis. If the steps all have equal width so also do decreasing functions.
vv, the shaded area is w x 1 = w. Hence the sandwich can be made Pupils under sixteen will not concern themselves with the details
as thin as we please by choosing w small enough. of this argument, but the intuition that smaller step-lengths give a
thinner sandwich should be encouraged. Later, more-advanced
students will prove that not only every increasing function, but
also every continuous function (and some other functions as well)
has a definite integral which can be evaluated by a sandwiching
process.
P+ 1
and
is demonstrated similarly. , ,
Combining inequalities (1) and (2) gives
area under graph
/lower step-function of.V=/(x) /upper step-function
\area for/ area under graph \area for f 13.7. Comments
of >• = //(*)
In the above treatment, no reference is made to differentiation,
Now for y = /"(.v), the sandwich inequality can be made as thin so that the definite integral can be studied as an extension of the
as we please by taking a step-function with thin enough steps, so it pupil's knowledge of area and sandwiching, independently of work
follows that the quantities outlined in the blocks can be made as on gradient and differentiation. In chapter 15 the link between the
close together as we please. We conclude that the middle terms of definite integral and differentiation is discussed.
the two inequalities are equal. It is suggested that this treatment establishes patterns of thought
Hence which are helpful to the future user of mathematics, whether he is a
pure mathematician, applied mathematician, scientist, engineer, or
f(x)dx= lf(x)dx. economist.
14 Differentiation
L'M'
14.1. Introduction
14.2(ii), the same ratio measures the scale-fact or of the
TjvT
The gradient of the graph of a function at a point is the gradient enlargement from LM to L'M'. The calculations are the same as in
of the tangent to the graph at that point. Traditionally, the method the chord approach, but because the geometrical interpretation is
of calculating this has been to find the gradient of a chord PQ essentially different, we shall call this approach to differentiation
through point P, and then to find the limit of the gradient of PQ as the scale-factor approach.
Q approaches P (figure 14.1). We shall call this the chord approach In 1970, Montgomery and Jones, in Calculus and Elementary
Functions, gave the first elementary treatment of a third approach
to differentiation, that of the best linear approximation. This has, so
far, received little attention in schools, although the treatment was
intended for sixteen-year olds. The picture is again that of the
Cartesian graph, but the idea used is that among all the possible
straight lines drawn through the point P of the graph, there is one
which approximates to the curve more closely than any other. This
best straight-line approximation is the tangent at P.
In the next sections we discuss these three approaches and also
mention briefly one or two other possibilities. Whatever approach
is used, there are considerable notational difficulties, and these are
also considered.
Figure 14.1
14.2. Chords: a Numerical Approach
to differentiation. It was Newton's original method based on
velocity and was used in almost all school textbooks published It is suggested that, before the age of sixteen, the major emphasis
before 1960 which included any calculus. should be on the idea of the gradient of a graph, with careful
The SMP "A" Level course pioneered another approach, using attention to clarity of concept and wording, and many simple
the geometrical picture of a function given by the arrow diagram. numerical examples. Calculators make it possible for pupils easily
L'M' and rapidly to calculate gradients of many chords PQ (figure 14.3)
The ratio ——, in the Cartesian araph of figure 14.2(i), measures through a given point P, and to acquire a feeling for the
LM
the gradient of the chord PQ, but in the arrow diagram of figure relationship between the position of the second point Q (for fixed
112 FROM GRAPHS TO CALCULUS
Figure 14.3
4 16 7 1 7
3-5 12-25 3-25 0-5 6-5
Figure 14.2 (i) Chord approach to differentiation; (ii) scale-factor approach, to 3-25 10-5625 1-5625 0-25 6-25
differentiation; (iii) best-linear-approximation approach to differentiation.
3-125 9-765625 0-765625 0-125 6-125
3-0625 9-37890625 0-37890625 0-0625 6-0625
P) and the gradient of PQ. When the time comes for a more formal
treatment, this numerical feeling for the gradient will help the pupil The chords in this particular sequence have been chosen so that the
to understand what is going on behind the notation of limits. x-distance of Q from P is continually halved. The sequence of
For example, the gradient of the tangent to the graph of the gradients is
function x i—> x2 at the point (3,9) can be approximated by finding 7, 6-5, 6-25, 6-125, 6-0625,...
DIFFERENTIATION 113
and it would seem that the gradient of the chord can be made as beyond P and Q, so that when P and Q are close together the chord
close as we please to 6 by taking Q close enough to P. is visible on the diagram, and can easily be related to the tangent
Another example of a sequence of chords PQ is the following, in at P.
which Q is on the left of P. Pupils should be encouraged to
experiment with a variety of different positions of Q.
14.3. Differentiation
x-coordinate v-coordinate Gradient of
ofQ ofQ QR PR PQ
2-9
In the numerical work, the same type of calculation has been
841 -0-59 5-9
2-99 8-9401 -0-0599 5-99 repeated many times. The time has now come to generalize, and it
2-999 8-994001 -0-005999 5-999 will seem natural to find the gradient of the chord of xi—>.x 2 joining
P(3,9) and Q(3 + /1, (3 +h) 2 ). Because P and Q are distinct points, h
Here again, it can be seen that we can make the gradient of the cannot be zero (see diagram opposite).
chord as close as we please to 6 by going far enough along the Now
sequence of gradients QR = (3
and
5-9, 5-99, 5-999,..., PR = h,
or by taking Q near enough to P. so the gradient of chord PQ is
It is the calculator which makes it easy for pupils to have
personal experience of the process of finding gradients of sequences QR _ (3 -h)2 -9
~l~'""~
l^R""
of chords through P. They can thus approximate to the gradient of
the tangent through P by using chords whose intersections P and
Q with the graph are very close together. Time spent in this
numerical experimentation is well spent, for soon pupils will try to
find positions of Q such that the .gradient of PQ is within for = 6 + /;.
instance 0-000001 of 6, and will realize that they cannot ever make
the gradient of the chord PQ e.\actly,6. At this stage, the wording As P and Q must be different points, we can never have h = 0, but
the gradient ofthe chord PQ can be made as close as we please to 6
the limit of the gradient of the chord as Q tends to P is 6 by taking /? close enough to 0. It is important that pupils realize
may be introduced to express the idea that although there is no chord whose gradient is exactly 6, there is
a line through P whose gradient is exactly 6, and it is this line which
the gradient ofthe chord can he made us <•/<>.«' us we pli'iise to 6 by taking 0 close is the tangent at P.
enough to P.
After this method of finding the gradient ofthe tangent at a point
When drawing diagrams such as that in figure 14.3, it is helpful with given numerical coordinates, the next step is to generalize to
to pupils' imagination to be sure that the chord is produced find a rule for the gradient ofthe tangent to a graph of a function at
114 FROM GRAPHS TO CALCULUS
dv
The notations dx, <5v and -f- are better avoided until later, for a
ax
number of reasons:
(i) The symbols d.\ and 6y are the pupils' first experience of
using two letters to stand for a single number.
The temptation is strong to think of 8x as a multiple of x,
and 5y as the same multiple of y, so that pupils may want to Figure 14.6
(5y y fty y
write —^ = — , in the same way that — = — . This notational
dx x nx x
problem has to be faced at some time, but a time when function /', which is given by /'(x) = 2x for all x. The traditional
pupils are meeting several other new and subtle ideas is not notation
the best time for extra notational difficulties.
(ii) In the traditional notation shown in figure 14.6, the symbol for all
x is used with two quite distinct meanings, as can be seen dx
from the two occurrences of x on the horizontal axis. It is can then be introduced, but this notation has no advantages for
first the x-coordinate of an arbitrary point on the graph pupils before sixteen.
y = x 2 of the function /:xi->x 2 . Secondly, it is the In many "O" Level texts in which differentiation is treated, the
x-coordinate of a particular point P on the graph, and so aim seems to be the achievement of rote learning of the formulae
is a fixed number. It can only be used for both by an abuse of
notation which is very unsuitable until a later stage. which enable pupils to differentiate expressions such as 3x 2 — —.
This is not a suitable aim if it is done at the expense of the
This confusion is easily avoided by using a letter other than x for understanding of how gradients of graphs are found. In the first
the x-coordinate of P. Then we may say that, since/'(«) = 2a for treatment of differentiation, attention should be focused on
each point a, we have established a new function, the derived methods rather than on formulae.
116 FROM GRAPHS TO CALCULUS
14.4. The Scale-Factor Approach scale-factor over the interval PQ. The idea of local scale-factor at P
is introduced, so that
This approach is developed in SMP Revised Advanced Mathe
matics, Book 1. The calculations are exactly as in the chord lim'- (a + h)-f(a)
approach, but the arrow diagram gives a different geometric
motivation from the gradient of a Cartesian graph. The method is is interpreted as the local scale-factor of the function at P.
now discussed briefly in language more sophisticated than would One or two difficulties occur in this interpretation. The first
be appropriate for most 16-year-old pupils. arises when a point between P and Q maps to a point which is not
4 Q'
f(a + h) Q
3
0 a+h
Q 2 2
1 1 P'
P a f(a) P
0 0
P -1
The closed interval PQ[ti,a + /i] of the domain is considered. between P' and Q'. This happens in the case of the function x \-> x..2
This interval is mapped so that P maps to P' and Q maps to Q'. The when we consider the interval [—1,2]. Because 0 maps to 0, the
signed lengths P'Q' and PQ are then calculated and found to be whole interval [—1,2] does not map to the interval [ 1,4]; it is P
f(ii + h)—f(a) and h respectively. The quotient which maps to P' and Q which maps to Q', not PQ which maps to
P'Q'. The idea of scale-factor is geometrically more obscure in this
P'Q'
« + /« )-/(«) example.
1S
The other pictorial difficulty is that the local scale-factor can
never actually be seen because it cannot be drawn. It literally
with attention paid to signs, is calculated and called the average almost vanishes as the interval PQ becomes small.
DIFFERENTIATION 117
These difficulties are greater than those which occur in the
h)-f(a)
gradient approach, in which the expression - always
represents the gradient of a chord. Even when h is very close to
zero, the gradient of the extended chord is visible in the diagram,
and the extended chord is almost indistinguishable from the
tangent.
The authors of SMP Revised Advanced Mathematics must have
been aware of these difficulties, for they introduce the gradient
picture very soon after differentiation has been defined using scale-
Fie. 7
factor ideas, and with one exception subsequent work in Revised
Advanced Mathematics I is all based on the gradient-of-chord
picture rather than the scale-factor picture. It is interesting to note that the authors do not attempt to prove
This exception is the rule for differentiating composite functions the result but make it clear that assumptions are made. This
(or the chain rule), which has a very natural interpretation using contrasts favourably with most texts, which claim to "prove" the
scale-factors but no equivalent easy picture using gradients. The chain rule, but take no account of the difficulties mentioned at the
quotation below is taken from Revised Advanced Mathematics I, end of the extract.
and is given to show the use of the scale-factor approach at the next It will be seen that the scale-factor picture forms a useful
stage of work. By this time, this text has introduced the traditional supplement to the gradient-of-chord picture, but cannot replace it.
ox and ov notation (SMP Revised Advanced Mathematics, Book 1,
p. 305). '
14.5. The Best Linear Approximation
2.3 Alternative statement of the chain rule. Using the notation shown in Figure
7, the average scale factor of the function f:x -> rover the interval of length fix
is This approach is not well known, and so is described in some
Sy fiy dti
detail. The interested reader should consult Montgomery and
Ax fin fix
Jones, Calculus and Elementary Functions (CUP) for a teaching
method, and Fraleigh, Calculus. A Linear Approach, Vol. I
To find the value of the derived function, ily ilx. we have to consider the limit as
r5x-»0; 6y/Sx-*d\-/dx, Sv/fin ~> dv du and fiu'fix -» du.dx so it would be
(Addison Wesley) for a more advanced treatment.
reasonable to conclude thai The essential idea is that of trying to find a straight line which fits
the graph of a function at a point better than any other straight
line. This line (if it exists) is defined to be the tangent to the graph at
the point (figure 14.9).
To prove the result formally, which we shall not attempt here, we have to cope
with the limit of a product and also the possibility that fiu might be zero even il
The equation of the tangent is also the equation of the best linear
fix ^ 0. (What difficulty would this create?) approximation to the function at that point. For example, when
118 FROM GRAPHS TO CALCULUS
not a tangent
tangent
Figure 14.9
the values of the function .xi—>2 + 3.x + x 2 are calculated near
x = 0, it is found that the x 2 -term makes little difference, com
pared with 2 + 3.x.
X 2 + 3.T 2 + 3.X + .V
1 -1 0
0-1 1-7 1-71
0-001 1-997 1-997001
0 2 2
0-001 2-003 2-003001 Figure 14.10
0-1 2-3 2-31
1 5 6
It is essential that pupils should understand from numerical
Thus, for .x near 0, the error in replacing 2 + 3.x + ,x2 by 2 + 3x is
experience that near 0, x2 is small in comparison with x, so that the
small. Discussion of the interval of values of x for which the error is
terms 2 + 3x dominate. Figure 14.10 shows the geometrical
"acceptably" small is helpful. Moreover, the graph of x i-> 2 + 3.x is
significance of the three terms at the point x = /?.
a straight line, where that of x i—> 2 + 3x + x 2 is a curve. The x 2-term
The derivative of the function when x = 0 can now be calculated
can helpfully be regarded as the error term in replacing 2 + 3.x + x2
as the gradient of the best linear approximation near x = 0. Thus,
by 2 + 3x. When finding a best linear approximation near a
the derivative of x i—> 2 + 3x + x2 at x = 0 is 3.
particular point we always try to replace a non-linear function by a This approach is easily adapted when the derivative at a point
linear function such that the error is small in comparison with the other than x = 0 is required, as the following numerical example
linear function near the point. Graph drawing should convince shows. We try to find the derivative of the function
pupils that the graph of x i—> 2 + 3.x is the tangent to x i—> 2 + 3x + x 2
at (0, 2). x H-> 3 - 3.x + x 2 - x 3 at x = 2.
DIFFERENTIATION 119
a 2 + 2ah
Figure 14.11
This technique for teaching differentiation admits that differen For some pupils who have learnt to differentiate combinations
tiation is difficult; therefore the teacher explains to the pupils that of functions such as xi—>x2 and xi—»x3, the teacher may wish to
in the seventeenth century, Newton and Leibniz discovered that proceed as far as finding the derivative of the function with formula
for a graph of the form y = x", the gradient is given by the formula f(x) = x", where n is a positive integer.
MX"' 1 .
Care needs to be taken in deciding whether to make this
While no self-respecting teacher admits to using this method, the extension, for it is easy for pupils to lose the thread of the argument
effects of other methods presented with insufficient time and in a maze of algebraic symbols. Most texts find the derivatives of
preparation are equivalent to a black box plus hang-ups on the x i-> x 2, x i—> x3 and x i-> x4, and then draw out a general rule from
part of the student, while the black-box method as described leaves the pattern of the results. At this level this is a satisfactory
a mystery but no hang-up about it! procedure which many teachers will wish to follow.
(a + h) n
(a + hy
/ a a+h a+h
Figure 14.13 The derivative of AH-.X": (i) using Pascal's triangle: (ii) using the
best-linear-approximation method.
122 FROM GRAPHS TO CALCULUS
Although it is not possible to give a rigorous proof that the This can be made as close as we please to na" ' by taking h close
gradient of x\—>x" at x = a is na"" 1 without using proof by enough to 0. The values of A, B,... do not matter.
induction somewhere in the chain of results, it is possible to use It follows that the gradient of x i-> x" at x = a is na" ~ 1 .ln other
Pascal's Triangle to show a reason for the form of the result. words,
Pupils with skill at algebra can calculate if /(x) = x", then f'(a) = ncf~ i .
(a + h)2 = a 2 + 2 If the best-linear-approximation method is used, we consider
(a + h) 2 = (a the point (a + h), at which (a + h)" = a" + na"~ 1 h + Aa"~ 2 lr+ ....
+ ah 2 The best linear approximation is a" + na"~ l h,and this has gradient
15.2. The Link between Integration and Differentiation fiszure 15.2(ii) with sign taken into account. It is clear that 3.x 2 d\
To develop the link between integration and differentiation we and (x 2 —3x + 2}dx depend on the position of the right-hand
need to appreciate that differentiation is a process applied to
126 FROM GRAPHS TO CALCULUS
boundary and hence on t, not x. This can be reinforced by working 15.3. Further Discussion of the Link between Integration and
out the integrals Differentiation
The reader may be worried by the fact that the work of the
previous section relies on a statement which was assumed earlier
and on the basis of numerical evidence only. This statement is
3x.2 (P/-1)
A(t)= f(x)dx
Ja
then
A'(t) =f(t).
where s'(T) is the derivative of s at time T. Hence v and s' are the
same function of time.
Now it is also known (section 11.4) that the area under the
velocity-time graph between f = a and f = b represents the
displacement during that time; in symbols gradient f (X)
= p S'(t)dt
Ja
s(h)-s(a)= s'(t)dt
Ja
along a straight line in such a way that its displacement from the This argument is suitable for able pupils who are sufficiently
origin at time x is given by f(x). Its velocity at time x will be/'(.x), familiar with the relation between displacement-time and velocity-
and by an exact repetition of the previous argument we obtain time graphs to argue from them. Proofs which do not depend on
this fact are suitably left for the sixth form.
f(b)-f(a) = \"f'(x)dx. For the majority of pupils who meet the definite integral before
Ja the age of 16, the numerical verification of
The final step in the argument is made by considering a function
/such that x"dx =
f'(x) = x", where p ^ -I.
Experiment easily convinces pupils that the formula for f(x) must carries complete conviction. If they regard the two facts
be of the form
x"dx=\ '-
/(-x) = +c
(i)
p+l
P+l
Axp+1 \. r
where c is any real number. This immediately yields (ii) the derivative of —r ls x (P^-ll
VP+V
1 «" +1
x"clx =
p+1 p+1 as a curious coincidence, no harm will be done, for the first was
b studied for more than a thousand years before its connection with
the second was discovered, as is shown in the final historical
chapter of the book. More careful discussion of the connection
Thus the link between the definite integral, defined as an area between the two can appropriately wait for the next stage of the
under a graph, and antidifferentiation is established. student's education.
16 Some Applications of Integration and Differentiation
The preceding two chapters were concerned with the calculation
of areas and gradients, and neglected the applications which arise
from integration and differentiation. In this chapter we discuss
some examples of applications, while recognizing that teachers
would want to introduce some of them during the teaching of
velocity in
integration and differentiation. metres/second
r
Figure 16.2
1. Sketch the graph of the function f: x i—> x2 — 6.x+ 4. Figure 16.4 Sketch of the graph of/: xi-> x 2 -6x +4.
(0,0)
(3,0)
(2-4)
(v) (vi)
(iii) or (v); numerically large negative values of x confirm this.
Substitution also shows that when x = 0,/(x) = 0, so (0,0) lies on
the graph. The gradient at x = a is/'(a) = 3a2 —6a and it can be
Figure 16.5 Shapes of the graphs of cubic functions. seen that this is zero when a = 0 or 2; thus, the graph has two
points of zero gradient, at (0,0) and (2, —4), and so is of type (i). In
2. Sketch the graph of the function f: x i—> x 3 — 3x 2
this case, it is easy to see what happens when /(x) = 0, and so
where the graph meets the x-axis. At these points,
Previous experience of plotting graphs of cubic functions should x 3 -3x2 =0
lead pupils to expect a graph of one of the shapes shown in figure
16.5. The only problem is to decide which of these it is and to place and so x = 0 or x = 3. Thus the graph meets the x-axis at (0,0) and
it correctly with respect to the coordinate axes. Numerical (3,0). The sketch in figure 16.6 shows all the main features of the
experiment quickly satisfies pupils that when x is large (positive or graph.
negative), the term x3 is more important than the term —3.x2 , so The teacher may like to introduce vocabulary to describe the
that the sign of/(x) is the same as the sign of x3 . When x is large behaviour of the graph. The function is increasing when x < 0 and
and positive x 3 is large and positive. This restricts the shape to (i), when x > 2; it is decreasing when 0 < x < 2. Pupils should
SOME APPLICATIONS OF INTEGRATION AND DIFFERENTIATION 133
recognize that increasing functions have positive gradients, and At this stage, it is quite inappropriate for pupils to know rules,
decreasing functions have negative gradients. They know that the involving the second derivative, to distinguish between maxima,
gradient when x = a is minima and points of inflexion. A sketch will nearly always suffice,
and if any doubt remains, working out the gradient when x is just
to the right or just to the left of the point of zero gradient will dispel
It is easily verified that, for instance, all doubts.
when a = — \,f'(a) = 9;
when a = 1, f'(a) = -3;
when a = 3, f'(a) = 9. 3. Sketch the graph of the function f: x \—> 3.x 5 — 20x3 .
The point of zero gradient at (0, 0) is a local maximum ; the point of This example is probably too advanced for all but the most-able
zero gradient at (2, —4) is a local minimum. pupils, but illustrates how much information about a graph points
Points of zero gradient are always either local maxima or of zero gradient can give.
minima, or horizontal points of inflexion, as shown in figure 16.7.
(i) 3x 5 is the more important term, so when x is large and
positive, /(x) is large and positive; when x is large and
negative,/(x) is large and negative.
(ii) The gradient at x = a is given by
/»= 15a4 -60a2.
Thus the points of zero gradient are at
15a4 -60a 2 = 0
or
15a 2 (a 2 -4) =
(-2,64)
(0,0)
(2,-64)
(0.-1)
form of the equation and in the form of the derivative (if they can these problems by careful graph-drawing earlier. Now. instead,
calculate it). they can sketch the graph and find the important points, which are
usually points of zero gradient, by exact calculation.
1. How should a loop of string of length 24cm be arranged on a
16.3. Design Problems peg-board to enclose a rectangle of maximum area? This problem
will have been familiar to many pupils since their primary-school
Some of the most compelling and dramatic glimpses of the days. They can now solve it by calculation, by finding the point of
power of calculus come, for 16-year-old pupils, when they see how zero gradient on the graph of
differentiation can solve problems of design. A=x(\2-x)
There are many simple beautiful examples, some of which are
discussed below. Pupils may well have met and solved some of = 12.x-A- 2.
136 FROM GRAPHS TO CALCULUS
A/1 an existing stone wall for the fourth side. How does he arrange the
pen to get the maximum area?
4. A famous problem is that of the open cardboard box which is
made from a square sheet of cardboard by cutting out the shaded
squares at the corners, bending upwards along the dotted lines and
then gluing at the edges. How should the squares be cut to obtain a
t box of maximum volume?
A cm 2 (12-x) cm
This problem brings out further teaching points. Suppose the
i cardboard sheet is a square of side 100 cm and that a square of side
x cm is cut from each corner. The box is shown in figure 16.12 and it
Figure 16.10 xcm xcm
Kcm
They will now expect the graph of this function to be a parabola,
with its vertex upwards, and should note that the function
A = 12.x-x2 (100-2x)cm
50 50
Figure 16.13
zero when a = " and a = 50. Thus the graph is that shown in and it is required to find the general form of the graph of the
figure 16.13(ii), where the parts of the graph which do not model function. One example of this, which has already been examined
the physical situation are shown by dotted lines. from a different viewpoint in section 6.6, is growth; a situation in
The box has a maximum volume of about 0-07 m 3 ; the base of which at each instant the amount of material present increases at a
this box is a square of side about 67cm, and its height is about rate proportional to the total amount of material there at that
17 cm: a surprising shape to give the maximum volume. instant, that is, the rate of change of material is proportional to the
This kind of argument based on curve sketching and common total amount of material. This gives rise to the equation
sense is all that is necessary for design problems at this level. Rules f'(t) = //(f).
to distinguish maxima from minima are not yet needed, and should
not yet be introduced. For instance, in the problems just given, it is For example, the birthrate is given as 13 live births per 1000 of
surely unnecessary to verify that x = 4p gives a local maximum and population, and the interest rate for investment of money is given
x = 50 gives a local minimum for it is quite clear that when x = 50, as a percentage of the amount invested.
the volume of the box is zero. It is useful and constructive, both for future work and for a
proper understanding of gradient, for pupils to study a step-by-
step approach for finding such approximations to graphs whose
16.4. Step-by-Step Integration; Orienteering gradients are known at each point.
The method is illustrated by the following example which is not
Many situations exist in real life which lead to mathematical related to growth. A graph passes through the point (0,0) and has
models in which the gradient of a function is known at each point, the property that its gradient at each point is equal to the x-
138 FROM GRAPHS TO CALCULUS
gradient 0-1
C(0-3,0-03)
A^jy___^6(0-2,0-01)
0-1 0-1 (0,0) 0-1 A 0-1 0-1
0-5
0-4
0-3
02
0-1
0-1 0-2 0-3 0-4 0-5 0-6 0-7 0-8 0-9 1-0
0-5
0-4
03
02
0-1
coordinate at that point. An approximation to the shape of this reached. What instruction should it be? Since the gradient at that
graph can be stepped out by using an idea from orienteering. point is known to be the same as its x-coordinate, the instruction is
Imagine you are orienteering, that you are at (0,0), and that you to proceed with a gradient of 0-1 until further orders. These will be
must walk in the direction given by a first set of instructions, and given on reaching the point with x-coordinate 0-2; that is. at
keep walking in that direction until you receive another in B. The step in the y-direction from A to B is 0-01, so B has
struction. At (0,0) the x-coordinate is 0, so the gradient is 0, and the coordinate (0-2,0-01).
instructions are to walk along a line with gradient 0 in the direction The new instruction will be to travel with gradient 0-2 until
of the positive x-axis. reaching a point C with x-coordinate 0-3. The step in the y-
The next instruction may come after walking a distance 0-1 km direction from B to C is 0-02, so C has coordinates (0-3,0-03).
or on reaching the line x — 0-1 or according to some other rule. Pupils can proceed in this way using a mixture of drawing on graph
Suppose the next instruction is given when the point (0-1,0) is paper with a large scale and calculation until they see that they
140 / FROM GRAPHS TO CALCULUS
Many pupils are told, on the basis of very little evidence, that the
volume of a circular cone is jnr 2h, or one-third of the volume of a
cylinder of the same height and radius.
For pupils who have studied the definite integral, the volume of
a cone may be taken as the first example of the volume of a solid of
revolution. The line through the origin with gradient - is rotated
round this x-axis, and sweeps out a circular cone of radius r and
height/; (figure 16.17(ii)).
This section concentrates on establishing that when the graph of
y =f(x) shown in figure 16.18 is rotated around the x-axis the
volume of revolution generated is given by
Y = f (x)
Figure 16.18
N y = f(x) y = f(x)
/ \
M
• y = f(x) H
m
= F(x)=n{f(x)} 2
\
view along the x-axis
outer cylindrical discs is shown in figure 16.20. The thickness of But h = Tim 2 and H = nM 2, since F(x) = 7t(/(x)} 2 . Thus
each disc is w, while the radii of the inner and outer cylindrical discs
hw ^ volume of revolution of y =/(x) ^ Hw.
are m and M respectively. Hence
nnrw ^ volume of revolution of y = f(x) ^ nM 2 \v. Now we can compare two sandwich inequalities
But this can be interpreted differently. We consider the curve hw = f- \olume of revolution of Hw =
lower step-function y =/l.v) upper step-function
y = F(x) where F(x) = n[f(x)} 2 and calculate the area under area for F(x) < ;irea under y = /-"(.v| area for F(x)
V = F(x) between x = a and x = b. Dividing the interval between
x = a and x = b into n equal parts as before, we can set up a and add the similar inequalities for all the discs between x = a and
correspondence between a strip of the area under v = F(x) (figure x = b. When this is done we reach the inequality
16.21) and the disc of the volume of revolution of v = /(x) shown in
figure 16.20. If the greatest and the least heights are H and h \olumc of solid of revolution of
lower step-function v = /(.v) between x = a and .v = /) upper step-function
respectively, the sandwich inequality for area is area for F(x) • <: area under y = /-'(.v) between < | area for F(x)
hw < area under v = F(.v) ^ Hw. __ __ __I x — a and x = /'
144 FROM GRAPHS TO CALCULUS
This is a familiar situation. We know that the sandwich inequality We can now return to finding the volume of a circular cone. The
can be made as thin as we please by choosing w small enough. . <-,,.• , , • , • r , <. r
_ .. . . , -11, • • i equation ol the function whose araph is rotated is v = /(x) = -x.
From this we deduce that the two middle expressions in the h
sandwich are equal. Thus Hence the volume of the cone is
volume of solid of revolution of area under v = F(.v) between _ I 1' /"'' fr } 2 jj,- 2 P/i
Y = /'(x) between .x = a and .x = h ~ x = a and .x = b ~ X' 7I<-X> C/X = ~rr \ -x' 2 dx
Jo (h j /1 2 J 0
ButF(x) = 7t{/(x)} 2 , so = !^!r^
145
146 FROM GRAPHS TO CALCULUS
17.2. Pythagoras
•tof
T- <&>if-m9-fSt ttzfft The study of area under the Pythagorean school followed the
<**&, vfixzt-' StffSxii t&ttt utn
ufig -ft» etu'j&iZ'- e^.
method familiar in primary schools of dissecting a region into parts
and reassembling them to form a known area. The rules for doing
Figure 17.2
so were expressed in "axioms" such as "if equals be added to equals, The procedure, of deducing non-trivial results about area from
the wholes are equal" and "the whole is greater than the parts". very elementary assumptions, is precisely that used in "measure
Once the area of rectangles is taken for granted, other areas can be theory" today. If these methods are combined with the notion of
determined without further assumptions. For example, the dissec similar triangles, there are further consequences. For example, if r
tion shown in figure 17.2 gives the area of a triangle as ^base is the radius (distance from centre to any vertex) of a regular
x height, and in fact yields three equivalent expressions for this polygon with N sides then the length a of each edge, and the
area (taking different sides as base). The consequences of this distance f from the centre to the mid-point of each edge, are each
equivalence are non-trivial. For example, it is a short step to the proportional to r by the properties of similar triangles. It follows
fact (equivalent to Pythagoras' theorem) that, if ABC is a right- that the area of a regular polygon with N sides is proportional to
angled triangle, then the two areas shaded in figure 17.3 (where the square of its radius.
AB = AD and AC = AE) are equal.
Figure 17.4
17.3. Eudoxus
17.4. Archimedes
Figure 17.6
circle, there are inscribed regular polygons with area larger than B.
It is this theorem which can now be used to prove that the area of a
circle is proportional to the square of its radius.
The argument is as follows. Recall that we already know the
result for a regular polygon with N sides, so that if two circles have
areas A, A' and radii r, ?•', then the areas of the corresponding
polygons satisfy
where k is the ratio r/r'. Now compare k 2 A' with A. If k z A' < A,
then by the theorem established above there is an integer N such
that k2 A' < A N ; but this implies k 2 A' < k 2 A'N in contradiction to
the fact that A'N < A'. Again, if A < k 2 A' there is an integer N such
that A < k 2 A'N because the same theorem can be applied to the Figure 17.7
circle of radius ?•'; but this implies A < A N in contradiction to the
fact that A N < A. Since both k 2 A' < A and A < k 2 A' are false, the with tangent parallel to RS. The problem is to relate the area A of
only possibility remainingis theequality k2 A' = A which was to be the segment to the area A 0 of the triangle RST. Archimedes proves
proved. that A = |/4 0 in the following four stages.
Notice the completely rigorous nature of this proof, and also the (i) A centre-of-gravity argument applied to strips parallel to the
absence of spurious appeals to "infinites" or "approximations". axis of the parabola suggests f A 0 as a likely guess for the value of A.
150 FROM GRAPHS TO CALCULUS
Archimedes' comment is "the fact is not demonstrated by the (iii) Comparing the area A with that of the parallelogram with
argument used ; but that argument has given a sort of indication side RS we see that A 0 < A < 2A 0. The same argument applied to
that the conclusion is true. Seeing then that the theorem is not each stage of the preceding construction shows that
demonstrated, but at the same time suspecting that the conclusion
is true, we shall have recourse to geometrical demonstration . . .".
(ii) A geometrical argument shows that the two triangles
(iv) Since An = —A 0 is obtained by successive quartering, it is
formed by adding points T, T" with tangents parallel to RT, ST
respectively have a combined area A l = %A 0 . The same con true that if fy4 0 < A there is a value of n for which f/! 0 + f/l n < A,
struction performed n times yields a polygon with area while if A < %A 0 there is a value of n for which A < f/4 0 — p4,,-
Either case contradicts the inequality obtained above; the only
— possibility remaining is the equality f/4 0 = A which was to be
proved.
This argument is that of Archimedes subject only to the use of
But I 1 +- + ...+— 1(1-71=1-7X7- and therefore the
\ 4 4"/\ 4/ 44" modern notation. The four stages are typical of the method used in
polygon has area fA 0 — -j/4n. • •.. . • many other problems:
THE EARLY HISTORICAL DEVELOPMENT OF CALCULUS 151
(i) to guess the answer, often by the use of physical argument; particular definitions, and moreover the methods of Eudoxus and
(ii) to establish a geometrical result which ensures that a Archimedes were available and could easily be adapted to the new
known area or volume can be built up from blocks of coordinate geometry.
diminishing size; Three examples must suffice; all are not merely of historical
(iii) to prove that the required area or volume is bounded above interest but could also provide teaching material for pupils at the
and below by known areas or volume; corresponding stage of development. The first is the work of
(iv) to show that the area or volume is neither more nor less Cavalieri" on the area under the curve v = xp. In modern notation
than that guessed initially. his result would be written
The great merit of the method is the absence of "infinites" and
"approximations". Its great defect is that each problem must be x"dx =
attacked separately and may require ingenious geometrical argu
ments. When Greek mathematics began once again to be studied but in fact he appears first to have noted that the integral is
and understood in the fourteenth, fifteenth and sixteenth centuries, proportional to ap+1 . Today we would prove this fact using the
the stumbling block to further progress was the lack of any general method of substitution: if a' = ta then
method which would be applicable to a wide range of problems
and which would contain as special cases the results of Eudoxus x"dx = x"dx = (ty)"tdy
o Jo Jo
and Archimedes. The discovery of such a method had to await the
systematic use of coordinates in graphical work due to Descartes where x = ty. Thus
and, when it came, the method lacked the straightforward
y>dy
precision characteristic of Eudoxus and Archimedes.
8c, = ! x 2 dx = x 2 dx = x 2 dx + X x = C^ (x + 1 )2 dx
(4w)f>
length of curves and the calculus of variations and to the solution definite. Inevitably "the Calculus" appeared impossibly abstruse
of differential equations. and the harm described at the beginning of this essay was done. It
With hindsight we can see that the achievement of Newton and was only by a gradual return to more geometrical methods, and by
Leibniz was built upon a vast body of results established by the a rejection of infinitesimals in favour of a more sensible under
mathematicians of the early seventeenth century: not merely those standing of real numbers and of limits, that the balance was
mentioned above by way of example but also such men as Kepler, gradually restored and the methods of differentiation and in
Roberval, Wallis, Barrow and Gregory. On the other hand, the tegration could take their place among the ordinary concepts of
dramatic power of the technique of differentiation, and the extreme mathematics taught at an elementary level.
difficulty which was experienced in justifying and in explaining the At a more advanced level the return to more geometric methods
concepts and techniques involved in its creation, combined to set has provided a proper basis for the calculus of functions of several
differentiation (and hence also its opposite, integration) apart as variables. Integration, partial differentiation and calculus of
"the Calculus". The result was a tension between two extremes. On variations all now arise from a fusion of the analytic methods of
the one hand were the geometric problems from which calculus calculus and the geometric methods of linear algebra. The right
arose, especially those concerning area and volume; the concepts balance must be achieved between algebraic manipulations and
were clear, and could be applied to prove satisfying general results geometric concepts, for both are necessary if the subject is to
or to justify results guessed by other methods, but any particular progress healthily.
numerical determination of area or volume required either an
ingenious special argument or the restatement of the problem in 17.7. Conclusions
more algebraic terms followed by recourse to results from
differentiation. What conclusions does the historical development of calculus
On the other hand, there was the supremely efficient algebraic suggest for the teacher of graphical work today? Those who
tool of differentiation which was very difficult to understand and believe that historical development is a good guide to psychologi
which appeared mystical or even magical when attempts were cal development in individuals might find it useful to chart the
made to explain how or why it worked. former in terms of the dichotomy between "relational" and
So powerful was the spell cast by this algebraic tool that the "instrumental" understanding popularized by Skemp 13 . The sum
origins of calculus in the calculation of areas were almost total of man's knowledge of calculus might be displayed as a trail
forgotten. Differentiation took over from integration, and mathe traced on a diagram, in which fast progress was made in the three
maticians, instead of viewing differentiation as a method for periods 400-200 BC, AD 1600-1700, AD 1850-1950 and very slow
finding gradients, began to view it as the subject matter of calculus progress in the periods between. Perhaps there is a case for
par excellence. building up the knowledge of the individual pupil by much the
They even began to introduce a so-called "indefinite in same route? If so, three conclusions suggest themselves.
tegration" as the opposite of differentiation, so that the pupil was Firstly, calculus has its roots in geometrical problems so that it is
faced first with the concept of differentiation whose definition was
most obscure, and then with two sorts of integration, indefinite and 13 The Psycholayv of Learning Mathematics, Pelican 1971.
THE EARLY HISTORICAL DEVELOPMENT OF CALCULUS 155
relational Secondly, the concepts associated with calculus are an essential
understanding
part of graphical work and may be expected to enter at an early
-500 stage. The first such concepts are likely to arise from the estimation
\ of area and volume, and these will be accepted naturally by quite
\
-400 + 1950
young pupils. Area is important both for its own sake and to
/
geometric calculate other quantities.
/ \
/
Thirdly, understanding of gradients and derivatives is likely to
area, / 1 slope, rate
be achieved only by pupils at age fifteen or sixteen. Perhaps this is
volume I i of change because ratio is a much more sophisticated concept than area. It
-200 + 1850 suggests that differentiation should be deferred until after in
\ tegration. It should arise in the first instance as a set of rules for
\
algebraic calculation of gradients which turns out, surprisingly, to be a
\
\ successful aid for evaluating definite integrals. One advantage of
+ 1600 + 1700 this approach is that it stresses that differentiation is an exact
\ process rather than some kind of approximation.
- +1650- - If these conclusions suggested by the history of the development
instrumental of calculus were to be adopted, pupils would see calculus as a
understanding natural development of the ideas of area and gradient. They would
Figure 17.13 Historical development of man's knowledge of calculus. have seen successive stages of the function of the concepts of area
and gradient, through quantitative pictorial ideas to exact calcu
essential for good understanding to use geometric diagrams ana lations, and would be led by this route to a greater understanding
not merely to rely on algebraic symbols. of calculus.
SUGGESTIONS FOR FURTHER READING
156
Index
'Titles of books and journals are given in italics
abuse of notation, 38 chord approach to differentiation, 111-5 direct proportion, 39
acceleration, 91, 99, 129 chord, gradient of, 97, 111-2 directed numbers, 65
Achilles and the tortoise, 82-3 circular cone, volume of, 144 displacement, 60, 87-95, 129
analysis, non-standard, 120 codomain of a function, 18 represented by an area, 146
area, 50-4, 87-95 composite function, differentiation of, 117 displacement-time graph, 55-6, 126
negative, 94 compound interest, 48 as a model, 56
of circle, 54 cone, circular, volume of, 144 distance, 59. Also see displacement
of irregular figure, 51-4 continuous graph, 15, 16 domain of a function, 18
of regular polygon, 147 convergence of a sequence, 73, 84 Durell, C. V., 19, 33-4
under a graph, 87-95, 103 coordinates, 7
under velocity-time graph, 87-90 cosine function, 43
under v = x", 105, 151 cubic function, 132 equation of a graph, 28
Archimedes, 124, 149-51 curve, gradient of, 98-9 Eudoxus, 147-9
arrow diagram, 7, 8, 22, 24 stitching, 73 exponential functions, 46-8
average rate, 57 tangent to, 96
average speed, 55
average velocity, 60, 90-1, 97 Fermat, 145, 152
axes of graph, labelling of, 27, 29 decimal, 62-8, 83-4 Fibonacci sequence, 85
non-terminating, 83 fractions, 62
recurring, 63-6, 68, 84 on the number-line, 63
ball thrown upwards, 130 terminating, 63, 65 function, 1, 15, 17-38
Berkeley. Bishop, 148 decimal number, 62-6, 67-8 as a relation, 32
best-fitting straight line, tangent as, 96 decreasing function, 132 codomain of, 18
best linear approximation to a graph, 117-8, 120 decreasing sequence, 84 composite, 117
black-box technique, 121 definite integral, 103-10 cosine, 43
block graph, 1, 6 denary system, 62 cubic, 132
dependence, functional, 17, 21, 33 decreasing, 132
derivative, 115, 121 derived, 115
calculator, use of, 48, 49, 54,61,69, 78-80, 104-5, 112-3 derived function, 115 domain of. 18
rounding by, 78 Descartes, R., 145 exponential, 46
Calculus and Elementary Functions, 111, 117 differential equation, step-by-step solution of. 75, 140 formula for, 19
calculus, fundamental theorem of, 125-8, 154 differentiation, 111-23, 124-8, 129-36, 145, 153 4 "function machine", 26
Calculus Made Easy, 120 chord approach to, 111-5 growth function, 140
Cartesian graph, 1, 4, 6, 23-7 limits and. 72 increasing, 132
Cavalieri, 151-3 linear approximation approach to. 117 20, integral, 125
change, rate of, 58, 99, 100 link with integration. 124 8 linear, 10, 17,39. 131
Channon. McLeish Smith and Head. See New General problems of notation in. 122 notation for, 37
Mathematics scale-factor approach to. 116-7 quadratic, 42-3
157
158 INDEX
real number, 61, 68, 69 School Mathematics Project, Revised Advanced straight-line graph, 10. 39, 131
reciprocal, 79 Mathematics, 116-7 gradient of. 56-9
reciprocal function, 41 Scottish Mathematics Group. See Modern
recurrence relation, 78-80 Mathematics for Schools tangent to a curve, 96-99, 117-8, 120
recurring decimal. 63-6, 68, 84 sequence, 72, 77 as best linear approximation. 117-8, 120
relation, 32-4 convergence of, 73, 84 gradient of, 97,99, 102, 113
function as 32 decreasing. 84 terminating decimal, 63, 65
recurrence, 78-80 Fibonacci, 85 Thompson, Sylvanus, P., 120
root of a number, 79 given by recurrence relation, 78 transformations of graphs, 42, 45
rule for a function, 18, 19, 32 increasing, 84 trapezium rule, 95
limit of. 81-2
series, geometric, sum of, 80
sine function, 43 units, 89
sandwich inequality, 51-2, 54, 65, 66-8, 70, 77-8, 85, Skemp. R. R., 154
91-2, 103, 106 snowflake curve, 75 velocity, 60. 87-102. 129. 146
scale-factor. 111, 116-7 speed, 55-60, 91 actual, 98-9
approach to differentiation. 111, 116-7 average, 55. See also velocity and speed, 60, 91
average, 116 square root. 61, 80 at an instant, 97, 99
local, 116 Newton's method for, 78 average. 60, 90-1
Science, 5/13, 8-9 squares, graph of, 14 negative, 94
School Mathematics Project, Books A-H and A'-Z. step-by-step method, of integration, 137 velocity-time graph, area under, 87-90
6,13,29,30,35,50-1,87-8 of solution of differential equation, 75, 140 volume, 54, 141-4
School Mathematics Project, Books 1-5, 34 step-function, 12. 15, 16, 104 of revolution, 141-4
•<*:
The Mathematics Curriculum
A Schools Council Project