Statics P1
Statics P1
STATISTICS 4040/01
Paper 1 October/November 2009
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Mathematical tables
Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (SC/SLM) 11521/8
© UCLES 2009 [Turn over
www.xtremepapers.net
2
1 A small boatyard makes canoes. The pictogram below shows the number of canoes made in the
years 2002 and 2005.
2002
2005
represents 10 canoes.
............................................................[1]
............................................................[1]
(ii) Draw a pictogram to represent the 33 canoes which the boatyard made in the year 2008.
[2]
www.xtremepapers.net
3
2 A large-scale survey is to be carried out, with the required information being collected by means of
a questionnaire.
(i) Give two purposes of firstly conducting a small-scale survey using a pilot questionnaire.
Purpose 1 .................................................................................................................................
...................................................................................................................................................
Purpose 2 .................................................................................................................................
...............................................................................................................................................[2]
(ii) Give one advantage and one disadvantage of sending the questionnaire to respondents
through the post, rather than using interviewers to ask the questions on it.
Advantage ................................................................................................................................
...................................................................................................................................................
Disadvantage ...........................................................................................................................
...............................................................................................................................................[2]
3 In a large school a daily record was kept, from the start of each month, of the total number of
absences up to and including that day.
The totals recorded for the first 11 school days of one month were as follows.
(i) Write the data in a form showing the number of absences on each individual day.
...............................................................................................................................................[2]
(ii) Obtain the mean and the median of the data you have written in (i).
Mean ...............................................................
Median ............................................................[3]
(iii) Explain why there is a problem with stating a value for the mode of the data you have written
in (i).
...................................................................................................................................................
...............................................................................................................................................[1]
www.xtremepapers.net
4
4 Five competitors in a quiz are isolated from each other. They are asked, in a random order, the
same question, until one of them gives the correct answer.
At this point the questioning stops. Two of them know the correct answer, and three do not.
(i) State the maximum number of times the question would have to be asked.
............................................................[1]
........................................................... [1]
(b) calculate the probability of each value, presenting your results in a suitable table.
[4]
www.xtremepapers.net
5
5 In a large factory the number of repair jobs reported during each of the three daily shifts (Early,
Late and Night) was recorded over a three week period. The following table shows some of the
results.
Shift
Week Early Late Night Total
1 34
2 34 18
3 32 84
Total 44 250
(i) The total number of repair jobs reported during the Early and Late shifts was the same. Use
this information to insert two values into the table above. [1]
(ii) In week 1 and week 3 there were equal numbers of repair jobs reported during the Night shift.
Use this information to insert a further two values into the table above. [1]
(iii) Showing all your working, calculate the remaining five values and insert them into the table
above.
[5]
www.xtremepapers.net
6
6 The table below gives, by category of employment in the year 2007, the number of employees,
and the number of accidents they suffered, at a particular factory. It also shows the standard
population for the whole industry of which the factory is part.
Technical 9 75 17
Skilled 18 230 41
Unskilled 25 115 38
(i) Calculate the crude accident rate per thousand for the factory.
............................................................[3]
(ii) Showing your full working for at least one of the categories, calculate the standardised
accident rate per thousand for the factory.
............................................................[5]
(iii) Explain briefly why the answers you have obtained to (i) and (ii) are not equal.
...................................................................................................................................................
...............................................................................................................................................[1]
www.xtremepapers.net
7
BLANK PAGE
www.xtremepapers.net
8
7 The lengths of a certain engineering component coming off a production line are measured.
............................................................[1]
............................................................[1]
The length of each component is intended to be exactly 5.008 cm. Accurate measurements of a
sample of 80 of these components from the production line produced the following cumulative
frequency curve.
80
70
60
50
Cumulative
frequency 40
30
20
10
0
5.000 5.002 5.004 5.006 5.008 5.010 5.012 5.014 5.016
Length of component (cm)
www.xtremepapers.net
9
(ii) The graph is much steeper close to the intended length than it is at either end.
What does this tell you about the precision of the components produced?
...................................................................................................................................................
...............................................................................................................................................[2]
...................................................... cm [1]
...................................................... cm [1]
...................................................... cm [2]
(iv) Estimate the number of components which have a length of more than 5.0084 cm.
............................................................[2]
Components shorter than 5.004 cm, or with length 5.012 cm or more, are rejected.
(v) Estimate from the graph how many components in this sample are accepted.
............................................................[3]
(vi) Use the graph to estimate, to 4 decimal places, for the accepted components only,
...................................................... cm [2]
...................................................... cm [1]
www.xtremepapers.net
10
8 (a) A turn in a game consists of rolling an unbiased six-sided dice, with faces numbered 1 to 6.
If the face landing uppermost is 1, 2, 3, 4 or 5, then that number is the score for the turn.
If the 6-face lands uppermost, the dice is rolled again, and the score for the turn is the sum of
6 and the number which lands uppermost on the second roll.
.........................................................................................[1]
(ii) Calculate, as a fraction, the probability of obtaining the lowest possible score in a turn.
............................................................[1]
(iii) Calculate, as a fraction, the probability of obtaining the highest possible score in a turn.
............................................................[1]
(iv) Calculate the probability that, in two consecutive turns, a player will obtain the lowest
possible score and the highest possible score, in either order.
............................................................[2]
www.xtremepapers.net
11
(b) You are one of a class of 10 students. A bag contains one blue disc and 9 red discs.
The students take it in turn to draw a disc at random from the bag without replacement.
The student who draws the blue disc will win a prize.
You have been given the choice of drawing a disc first, fourth or last.
Calculate the probability of winning the prize for each of these three alternatives, and hence
state which one you would choose.
...............................................................................................................................................[5]
(c) Box A contains 7 green balls and 3 white balls. Box B contains 5 green balls and 9 white
balls.
A ball is selected at random from Box A and placed in Box B. A ball is then selected at
random from Box B and placed in Box A.
Calculate the probability that, after these two operations, the numbers of green and white
balls in Box A and in Box B are the same as at the start.
............................................................[6]
www.xtremepapers.net
12
9 A new pressure gauge has been fitted onto a piece of equipment in a chemical plant.
Past experience has shown that such gauges may need to have their calibration (scale of
measurement) adjusted.
To determine whether any adjustments are necessary on the new gauge, several pressure
readings are taken using both it and a standard gauge which is known to be accurate. All readings
are in units of megapascals (MPa).
Pressure (MPa)
Reading
Standard gauge (x) New gauge (y)
A 4 7
B 38 41
C 7 13
D 11 16
E 15 21
F 27 33
G 31 36
H 35 41
(i) Briefly explain why the standard gauge readings, rather than the new gauge readings, are
plotted on the x-axis.
...................................................................................................................................................
...............................................................................................................................................[1]
45
40
35
30
New 25
Gauge
Pressure
20
(MPa)
15
10
0
0 5 10 15 20 25 30 35 40 45
Standard Gauge Pressure (MPa)
[3]
www.xtremepapers.net
13
(iii) Explain why you should use readings A, C, D and E to calculate one semi-average, and B, F,
G and H to calculate the other semi-average.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
(iv) Calculate the overall mean and the two semi-averages, and plot them on your graph.
[5]
(v) Draw a line of best fit through your plotted averages. [1]
(vi) Either by calculation, or by using your graph, find the equation of the line of best fit, and write
it in the form y = mx + c.
y = ...........................................................[3]
(vii) Explain what the values of m and c in your equation tell you about any necessary adjustments
to the new gauge.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
www.xtremepapers.net
14
10 A random sample of 200 customers leaving a supermarket during one day was asked how much
money (in $) and how much time (in minutes) they had spent during their visit.
(a) Complete the histogram, which illustrates the amounts spent during each visit, on the
grid below. The rectangle representing the 30 – under 50 class has been drawn.
45
40
35
30
25
Number of
customers
per $10
20
15
10
0
0 20 40 60 80 100 120 140 160
Amount spent ($)
[5]
© UCLES 2009 4040/01/O/N/09
www.xtremepapers.net
15
.......................................................................................................................................[1]
(c) If the 80 – under 100 and 100 – under 150 classes were merged to form a single class,
80 – under 150, calculate the value on the vertical axis of the height of the rectangle
which would be drawn to represent the merged class.
............................................................[2]
(ii) (a) The following table summarises the times spent by the customers in the supermarket.
Number of
Time (minutes)
customers
0 – under 20 10
20 – under 40 49
40 – under 50 54
50 – under 60 45
60 – under 90 31
90 – under 120 11
Total 200
Estimate, to 3 significant figures, the mean and the standard deviation of the times spent
in the supermarket by these customers.
(b) If the 60 – under 90 and 90 – under 120 classes are merged, the mean will increase.
Explain why this is so.
...........................................................................................................................................
.......................................................................................................................................[2]
© UCLES 2009 4040/01/O/N/09 [Turn over
www.xtremepapers.net
16
11 (a) At the start of the school year, all 125 girls at a school had to choose to play at least one
of the sports hockey, netball and tennis. The following diagram illustrates the numbers who
chose to play the different sports.
Hockey Netball
8 30 36
7
9 x
20
Tennis
............................................................[2]
...........................................................................................................................................
.......................................................................................................................................[1]
(iii) Showing all your working, determine which sport was chosen by the greatest number of
girls.
............................................................[2]
www.xtremepapers.net
17
One month after the start of the year, girls were allowed to change their choice, as long as they
still played at least one of the sports.
(iv) Insert, on the diagram below, the number of girls who have chosen to play the different
sports after these changes have been made.
Hockey Netball
Tennis
. [3]
www.xtremepapers.net
18
(b) In this part of the question, you are NOT required to DRAW any pie charts.
The following table gives the number of different models of car sold by a large company
in the year 2005.
Hatchback 542
Estate 521
(i) If these figures were to be illustrated by a pie chart, calculate, to the nearest degree, the
angle of the sector representing sales of Estate cars.
°
......................................................... [2]
(ii) In the year 2008, the company sold, in total, 26% more cars than it had sold in 2005.
If the sales in 2005 were illustrated by a pie chart of radius 5 cm, calculate the radius of a
pie chart illustrating the sales in 2008.
...................................................... cm [3]
(iii) In a pie chart illustrating the sales in 2008, the angles of the sectors representing sales of
two-door saloons, four-door saloons and hatchbacks are 72°, 103° and 85° respectively.
Calculate the number of estate cars which were sold in 2008.
............................................................[3]
www.xtremepapers.net
19
BLANK PAGE
4040/01/O/N/09
www.xtremepapers.net
20
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
4040/01/O/N/09
www.xtremepapers.net
UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS
GCE Ordinary Level
4040 STATISTICS
4040/01 Paper 1, maximum raw mark 100
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of
the examination. It shows the basis on which Examiners were instructed to award marks. It does not
indicate the details of the discussions that took place at an Examiners’ meeting before marking began,
which would have considered the acceptability of alternative answers.
Mark schemes must be read in conjunction with the question papers and the report on the
examination.
• CIE will not enter into discussions or correspondence in connection with these mark schemes.
CIE is publishing the mark schemes for the October/November 2009 question papers for most IGCSE,
GCE Advanced Level and Advanced Subsidiary Level syllabuses and some Ordinary Level
syllabuses.
www.xtremepapers.net
Page 2 Mark Scheme: Teachers’ version Syllabus Paper
GCE O LEVEL – October/November 2009 4040 01
1 (i) (a) 20 B1
4 (i) 4 B1
N P(N)
1 0.4
2 0.3
3 0.2
4 0.1
© UCLES 2009
www.xtremepapers.net
Page 3 Mark Scheme: Teachers’ version Syllabus Paper
GCE O LEVEL – October/November 2009 4040 01
(iii) The category of employment structure in the factory is not the same as in the standard
population. B1
[9]
(b) Continuous B1
(b) LQ = 5.0055 B1
© UCLES 2009
www.xtremepapers.net
Page 4 Mark Scheme: Teachers’ version Syllabus Paper
GCE O LEVEL – October/November 2009 4040 01
(vi) (a) Correct method for either measure seen, or implied by a correct result M1
Median item is [11 + (66/2)] =44th on graph
5.0077 A1
8 (a) (i) 1 2 3 4 5 7 8 9 10 11 12 B1
(ii) 1/6 B1
(iii) 1/36 B1
(iv) Correct probability expression using their answers to (ii) and (iii) M1
1/108 A1
(c) Any appreciation of the fact that the two operations must involve discs of the same
colour B1
P(green discs moved) = 7/10 × 6/15 = 7/25
P(white discs moved) = 3/10 × 10/15 = 1/5
Correct method for either, including seeing 15 (or 14) as the second denominator M1
P(G) = 7/25 A1
P(W) = 1/5 A1
Attempt to sum their probabilities for the two sequences M1
7/25 + 1/5 = 12/25 (= 0.48) A1
[16]
9 (i) Any comment which states or implies that the standard gauge is the independent
variable. B1
(iii) Because A, C, D and E have the four lowest x-values (or equivalent) B1
(v) Straight line through at least two of their points plotted in (iv) B1
© UCLES 2009
www.xtremepapers.net
Page 5 Mark Scheme: Teachers’ version Syllabus Paper
GCE O LEVEL – October/November 2009 4040 01
(vii) The equation shows that the new gauge will give a reading 5MPa greater than the
accurate reading given by the standard gauge.
For any correct comment about their equation in context M1
The new gauge calibration therefore needs to be adjusted 5 MPa downwards.
For correct description of required change to calibration cao A1
[16]
(b) Allow either 60 – under 70 (tallest rectangle) or 80 – under 100 (highest frequency)
B1
(ii) x represents the number who chose to play netball and tennis, but not hockey. B1
© UCLES 2009
www.xtremepapers.net
Page 6 Mark Scheme: Teachers’ version Syllabus Paper
GCE O LEVEL – October/November 2009 4040 01
(iii) Attempt to obtain total sales for 2008 (may be seen as part of (ii)) M1
Attempt to multiply 2008 total sales by (360 – sum of angles)/360 M1
2772 × (100/360) = 770 A1
[16]
© UCLES 2009
www.xtremepapers.net
UNIVERSITY OF CAMBRIDGE INTERNATIONAL EXAMINATIONS
General Certificate of Education Ordinary Level
* 4 7 9 6 8 2 4 0 5 0 *
STATISTICS 4040/13
Paper 1 October/November 2010
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Mathematical tables
Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (NF/SW) 24563/4
© UCLES 2010 [Turn over
2
simple random;
stratified random;
quota;
systematic.
State
(i) the method which does not require the use of any form of random numbers,
...................................................................................................................................... [1]
...................................................................................................................................... [1]
(iii) the method in which the choice of which individual items are selected is left to the
interviewer/researcher,
...................................................................................................................................... [1]
(iv) the method which is most prone to bias if there is a pattern in the sampling frame which
repeats at regular intervals,
...................................................................................................................................... [1]
(v) the two methods which require the use of random numbers to select every item in a
sample,
..........................................................................................................................................
...................................................................................................................................... [1]
(vi) the two methods which require the population to be subdivided into appropriate
categories.
..........................................................................................................................................
...................................................................................................................................... [1]
2 Twenty men were asked in which sports they participated regularly. The results are given in For
the diagram below. Examiner’s
Use
Athletics
Badminton
Cricket
Football
Jogging
Rugby
Swimming
Tennis
represents 1 man.
...................................................................................................................................... [1]
..........................................................................................................................................
...................................................................................................................................... [1]
...................................................................................................................................... [1]
(iv) Explain why the number of symbols in the diagram is greater than 20.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [1]
(v) Calculate the percentage of these men who played rugby regularly.
................................................... [2]
3 In a dancing competition, couples were judged on their performance in three dances. Each For
couple chose their dances from a list. Three of the dances on the list were the Foxtrot (F ), the Examiner’s
Paso Doble (P) and the Tango (T ). The following diagram gives information on the choices of Use
the couples entered in the competition.
F P
7 6 5
2
4 3
(i) Write, in the appropriate place in the diagram, the number of couples who did not choose
any of these three dances.
[2]
..........................................................................................................................................
...................................................................................................................................... [1]
(iii) Find which of these three dances was chosen by the most couples, and state the number
of couples who chose it.
Dance ......................................................
4 The date in the table below relates to the inhabitants of two streets, S and T. For
Examiner’s
Use
Interquartile range of
Street Median age (years)
ages (years)
S 27.5 18.0
T 58.3 4.2
(a) street S,
.............................................................................................................................. [1]
(b) street T.
.............................................................................................................................. [1]
(ii) State the percentage of the population of street S which is between 27.5 and 34.3 years
old.
................................................... [1]
(iv) For each of the following, state whether it is definitely true, possibly true, or definitely
false.
(a) The lower and upper quartile ages of the population of street T are 56.2 and 60.4
years respectively.
.............................................................................................................................. [1]
(b) The oldest inhabitant of street T is older than the oldest inhabitant of street S.
.............................................................................................................................. [1]
4 7
7 10
10 6
13 3
The arithmetic mean and the standard deviation of this distribution are 7.58 and 2.88
respectively, each correct to 3 significant figures.
(i) Using these results, and showing your method, find the mean and the variance of the
frequency distribution in the following table.
Variable Frequency
8 7
14 10
20 6
26 3
Mean = ......................................................
(ii) The table below gives the grouped frequency distribution of a discrete variable.
Variable Frequency
4, 5, 6 7
7, 8, 9 10
10, 11, 12 6
13, 14, 15 3
(b) Estimate, to 2 significant figures, the mean and the standard deviation of this For
distribution. Examiner’s
Use
Mean = ......................................................
6 The following table summarises the age and marital status of the female members of a sports For
club. Examiner’s
Use
Marital Status
Age TOTAL
Single Married Widowed Divorced
Under 20 18 2 0 0 20
20–24 9 13 4 7 33
25–29 7 89 7 11 114
30–39 5 4 4 2 15
40 and over 1 2 1 6 10
.............................................................................................................................. [1]
(b) Given that she is in the 30–39 age group, find the probability that she is single.
................................................... [2]
(ii) By considering divorced women aged 25–29, show that age and marital status are not
independent for female members of this club.
[3]
BLANK PAGE
7 The table below gives the percentages in different age groups of anglers (people whose
hobby is fishing) in the United Kingdom.
15–under 25 28
25–under 30 20
30–under 35 22
35–under 45 16
45–under 65 14
On the grid below a histogram representing these ages is to be drawn. The rectangle for the
30–under 35 group is given.
Frequency
density 3
0
0 10 20 30 40 50 60 70
Age (years)
(i) By considering the height of the given rectangle, state, on the answer line below, the For
units of the vertical axis. Examiner’s
Use
................................................... [2]
(ii) Draw the rectangles representing the other four age groups.
[4]
(iii) Briefly explain why it is not possible to obtain an accurate value for the mode from this
histogram.
..........................................................................................................................................
...................................................................................................................................... [1]
...................................................................................................................................... [1]
(v) Estimate, to 3 significant figures, the mean and the standard deviation of the anglers’ ages.
(vi) State, giving a reason, which of the mean and median ages you would expect to be
higher for this distribution.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
© UCLES 2010 4040/13/O/N/10 [Turn over
12
8 The lengths, in mm, of a sample of 160 rods produced on a machine are represented by the For
following cumulative frequency curve. Examiner’s
Use
160
140
120
100
Cumulative
frequency
80
60
40
20
0
0 40 50 60 70 80 90
Length (mm)
(i) State which feature of the graph shows that the majority of rods had lengths in the
middle of this range.
..........................................................................................................................................
...................................................................................................................................... [1]
.......................................... mm [2]
.......................................... mm [3]
Rods can only be used for their intended purpose if their lengths are in the range 55 mm to
75 mm inclusive.
Rods which are longer than 75 mm are shortened to be within the accepted limits, at a cost
of $3 each.
Rods which are shorter than 55 mm cannot be used, and are sold for recycling at $2 each.
................................................... [1]
................................................... [2]
................................................... [2]
(iv) By considering the cost of manufacturing all 160 rods in the sample, estimate the mean For
cost per rod of those rods which can be used. Examiner’s
Use
BLANK PAGE
9 (a) The information below relates to a city in the year 2008. For
Examiner’s
The total population at the start of the year was 420 000. Use
The crude death rate for the city was 9.6 per 1000.
(i) Calculate the number of deaths in the city in the year 2008.
................................................... [2]
(ii) State, giving a reason, whether subtracting your answer to (i) from 5250 would give
the increase in the city’s population in 2008.
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [2]
(b) The data in the table below relates to a town in the year 2009.
25–49 80 10 3000
(i) Fill in the four values which are missing from the table.
[5]
(ii) Calculate, to 1 decimal place, and stating the units, the crude death rate and the For
standardised death rate of the town. Examiner’s
Use
It was subsequently discovered that the overall population of the region in which the town
was situated was younger than had been thought, and that a more appropriate standard
population for the three age groups would be 4000, 3000 and 1000 respectively.
(iii) Without carrying out any further calculations, state what effect using this new
standard population would have on the crude death rate and on the standardised
death rate of the town.
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [2]
10 Zaheer only buys petrol when the tank of his car is almost empty, except that if he passes a For
garage selling cheap petrol he will buy some, however much is still in the tank. Examiner’s
Use
Whenever he buys petrol he always fills the tank and records the details in his log book, and
usually re-sets the trip recorder to 0.
The following table is an extract from Zaheer’s log book. The odometer records the total
distance in km travelled by the car since its manufacture. The trip recorder records the
distance in km travelled by the car since the trip recorder was last re-set to 0.
(i) Estimate the capacity of the car’s petrol tank to the nearest 5 litres.
(ii) State the four dates on which Zaheer bought cheap petrol.
...................................................................................................................................... [1]
(iii) On one occasion when Zaheer bought petrol he forgot to re-set the trip recorder.
................................................... [1]
(b) Find the actual distance travelled between this purchase and the next one.
........................................... km [1]
The grid below gives petrol bought, in litres, on the x-axis, and distance travelled since For
previous purchase of petrol, in km, on the y-axis. Examiner’s
Use
700
600
500
Distance travelled (km)
400
300
200
100
0
0 10 20 30 40
Petrol bought (litres)
(iv) Plot on the grid the eight points corresponding to the occasions on which Zaheer
purchased petrol. [2]
(v) Calculate the overall mean and the two semi-averages, and plot them on the grid. For
Examiner’s
Use
[6]
(vi) Draw a line of best fit through your plotted averages. [1]
(vii) Use your line of best fit to estimate
(a) the distance, to the nearest 10 km, which Zaheer could travel using 30 litres of
petrol,
........................................... km [1]
................................................... [2]
BLANK PAGE
11 (a) Research organisations in the United Kingdom often divide the population into six For
categories, called social classes, denoted by A, B, C1, C2, D and E. The percentages of Examiner’s
the population in the different social classes in the year 1987 are given in the following Use
table.
[4]
(ii) By the year 2001, the population of the U.K. was 7% larger than it had been in 1987.
Calculate, to 2 decimal places, the radius of a corresponding pie chart representing
the 2001 population.
........................................... cm [4]
(b) The following table gives, for each gender separately, the percentage of the U.K. For
population of working age in different occupational groups in the year 2002. Examiner’s
Use
Professional 8 3
Skilled (non-manual) 11 31
Skilled (manual) 27 7
Others 10 14
‘Others’ included those in the Armed Forces, those who did not state their current or last
occupation, and those who had not worked in the previous eight years.
(i) Draw, on the grid below, fully-labelled percentage component bar charts for males
and females to illustrate the data in the table.
[4]
(ii) Describe two differences between the occupations of males and females illustrated For
by the charts. Examiner’s
Use
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [2]
(iii) State, with a reason, whether you would regard pie charts or bar charts as more
appropriate to compare these two sets of percentages.
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [2]
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2010
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Mathematical tables
Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (NF/SW) 34298
© UCLES 2010 [Turn over
2
4 25 7 8 4 4 8 9 6 5 19
............................................................[1]
............................................................[1]
............................................................[1]
(ii) State which one of these three quantities you would choose as the most appropriate measure
of central tendency (average) to represent these figures, and give a reason for rejecting one
of the other two.
...................................................................................................................................................
...............................................................................................................................................[2]
(iii) State (but do not calculate) which measure of dispersion (spread) you would use as the most
appropriate to represent these marks.
............................................................[1]
2 There are 34 pupils in a school class. Their method of travel to school is given in the following
table.
(i) Calculate, to the nearest degree, the angles of the sectors representing Car and Bus.
Car ....................................................... °
[2]
(iii) In another class there are 27 pupils. Calculate, correct to 1 decimal place, the radius of a
comparable pie chart to illustrate this class. You are NOT required to draw this pie chart.
.................................................... cm [2]
© UCLES 2010 4040/12/O/N/10 [Turn over
4
3 The diagram below shows how many pets (cat, dog, rabbit) are owned by each of 100 households.
No household owns more than one of each different animal.
Cat Dog
33
z 31
y
z z
x
2
Rabbit
...............................................................................................................................................[1]
x = ...........................................................[1]
(iii) Three times as many households own only a cat as own all three types of pet. Find the value
of y.
y = ...........................................................[1]
z = ...........................................................[2]
............................................................[1]
4 The heights of the girls in a village were measured and are represented in the histogram below.
15
10
Frequency
density
5
0
0 145 150 155 160 165 170 175 180
Height (cm)
[4]
(ii) Use the histogram to estimate the modal height of the girls.
..................................................... cm [2]
5 Before an examination, teachers were required to predict the grades which their pupils would
obtain, from A (highest) to D (lowest). The following table shows the numbers of pupils for whom
the predicted grade was accurate.
A 132
B 284
C 203
D 68
(a) Among pupils for whom grade C was predicted, 33 obtained grade B and 4 obtained
grade D.
[1]
(b) There were no pupils for whom the predicted grade and the grade obtained were more
than one grade apart.
[1]
(c) Of the candidates for whom grade D was predicted, one-third obtained grade C.
[1]
(d) Of the candidates for whom grade A was predicted, eleven times as many obtained
grade A as obtained grade B.
[1]
(e) There were 50 candidates for whom grade B was predicted but who obtained another
grade. Nine times as many obtained grade A as obtained grade C.
[1]
(ii) For all those pupils whose predicted grade was not accurate, state, with a reason, whether
the grade obtained was likely to be higher or lower than the predicted grade.
...................................................................................................................................................
...............................................................................................................................................[1]
Frequency
Variable
[2]
(ii) a distribution in which the mean, median and mode are all equal.
Frequency
Variable
[2]
Frequency
1 2 3 4 5 6
Variable
Cumulative
frequency
1 2 3 4 5 6
Variable
[2]
7 (a) A man has three unbiased coins, one gold, one silver and one bronze.
In an experiment, the gold coin is tossed. If a head lands uppermost the experiment stops,
otherwise the silver coin is tossed. If a head on the silver coin lands uppermost the experiment
stops, otherwise the bronze coin is tossed.
On each toss of a coin, if a head lands uppermost, H is recorded, and if a tail lands uppermost,
T is recorded.
(i) State each of the possible sequences of outcome in the experiment together with its
probability.
[4]
(ii) The experiment is carried out twice. Calculate the probability that both experiments stop
with H being recorded.
............................................................[2]
(b) John and Zaheer play a game in which the player who starts the game has an advantage.
When John starts, the probability that he will win is 0.7. When Zaheer starts, the probability
that he will win is 0.8. All games end in a win for one player or the other.
They agree to play a number of games until one of them has won 3 games. Except for the first
game, each game is started by the loser of the previous game.
(i) State the maximum number of games which have to be played for one player to win three
games.
............................................................[1]
(ii) Calculate the probability that John will win by 3 games to 0 when
............................................................[2]
............................................................[2]
(a) write down the three different sequences of four results which lead to Zaheer winning
by 3 games to 1,
[2]
(b) calculate the probability that Zaheer wins by 3 games to 1, if John starts the first
game.
[3]
8 The table below gives certain information about the population and deaths in town X for the year
2008, together with the standard population of the area in which town X is situated.
20 – under 40 d 4500 30
40 – under 60 11 4400 25
(i) The death rate for the 20 – under 40 age group was 2 per thousand. Show that d = 9.
[1]
(ii) Calculate the death rates per thousand for the other three age groups.
................................................................
................................................................
............................................................[2]
(iii) Calculate the crude death rate per thousand for town X.
............................................................[4]
(iv) Using the given rate for the 20 – under 40 age group, and the rates you have calculated
in (ii), calculate, correct to 2 decimal places, the standardised death rate per thousand for
town X.
............................................................[4]
© UCLES 2010 4040/12/O/N/10
13
The table below gives information about town Y, another town in the same area as town X, for the
year 2008. The crude death rate for town Y in 2008 was 7.97 per thousand.
20 – under 40 3 24
40 – under 60 5 22
60 and over 25 19
(v) Calculate the standardised death rate per thousand for town Y in the year 2008, using the
same standard population as for town X.
............................................................[2]
(vi) State, with a reason, which of the two towns would appear to have a healthier environment.
...................................................................................................................................................
...............................................................................................................................................[2]
For both town X and town Y, the standardised death rate is higher than the crude death rate.
(vii) State what this tells you about how their populations compare with the standard population.
...................................................................................................................................................
...............................................................................................................................................[1]
9 A man travels to and from work by car. The following table shows the cost (in $) over a period of
160 working days.
Cost ($) (y) 230 290 340 390 445 500 560 610
(i) Plot a scatter diagram of cost against number of working days on the grid below.
y
800
700
600
500
Cost
($) 400
300
200
100
0
0 20 40 60 80 100 120 140 160 x
Number of working days
[2]
(ii) Calculate three appropriate averages using the data, plot them on the grid, and use them to
draw the line of best fit.
................................................................
................................................................
............................................................[6]
(iii) Obtain the equation of your line of best fit, giving the equation in the form y = mx + c.
............................................................[4]
(iv) Estimate the cost of travelling to and from work for 110 days.
$ ............................................................[1]
(v) If the man used public transport, the journey to and from work would cost him $5 per day.
On the grid, draw a line representing public transport cost against the number of working
days.
[1]
(vi) State the number of days after which the man would have spent the same amount whether
travelling by car or public transport.
............................................................[1]
(vii) Explain briefly why one of your lines passes through the origin, but the other does not.
...................................................................................................................................................
...............................................................................................................................................[1]
© UCLES 2010 4040/12/O/N/10 [Turn over
16
10 The owner of a travelling shop noted how many kilometres he travelled on each of the 300 days on
which he worked during one particular year. He grouped his results as given in the following table.
under 10 6
10 – under 20 13
20 – under 30 62
30 – under 40 96
40 – under 60 65
60 and over 58
The lowest and highest classes in the above table are ‘open-ended’.
(i) State, in each case with a reason, what you would regard as a suitable value to use for
...........................................................................................................................................
.......................................................................................................................................[2]
...........................................................................................................................................
.......................................................................................................................................[2]
(ii) Using your answers to (i), write in the above table the class mid-points (m) of the six classes.
[2]
(iii) The values of m are scaled, by subtracting 25 and dividing by 5, to give values of x.
(b) Estimate, to 3 significant figures, the mean and the standard deviation of the values of x.
Mean = ...............................................................
(c) Use your results in (iii)(b) to estimate, to 3 significant figures, the mean and the standard
deviation of the number of kilometres travelled per day by the owner of the shop.
Mean = .................................................... km
11 A company designed an aptitude test to gauge the suitability of people applying to join its workforce.
The following table summarises the time taken (in minutes) to complete the test by 88 applicants.
30 – under 40 12
40 – under 45 14
45 – under 50 13
50 – under 60 16
60 – under 80 16
80 – under 110 9
(i) Complete the cumulative frequency column in the above table. [2]
(ii) Plot the cumulative frequencies on the grid below, joining the points by a smooth curve.
100
80
60
Cumulative
frequency
40
20
0
0 20 40 60 80 100 120
Time (minutes)
[3]
(iii) Use your graph to estimate the median of the times taken to complete the test.
Applicants who complete the test in under 37 minutes are invited to attend an interview.
(iv) Use your graph to estimate the percentage of these 88 applicants who were not invited for an
interview.
...................................................... % [4]
In addition to the 88 applicants referred to in the table, 11 others had failed to complete the test
after 110 minutes and were stopped from continuing with it.
(v) (a) Explain why, if these 11 applicants are included in the data, it is still possible to estimate
the interquartile range of the times taken to complete the test by all 99 applicants.
...........................................................................................................................................
...........................................................................................................................................
...........................................................................................................................................
.......................................................................................................................................[2]
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/11
Paper 1 October/November 2010
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Mathematical tables
Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (LEO/SW) 15799/6
© UCLES 2010 [Turn over
2
4 25 7 8 4 4 8 9 6 5 19
............................................................[1]
............................................................[1]
............................................................[1]
(ii) State which one of these three quantities you would choose as the most appropriate measure
of central tendency (average) to represent these figures, and give a reason for rejecting one
of the other two.
...................................................................................................................................................
...............................................................................................................................................[2]
(iii) State (but do not calculate) which measure of dispersion (spread) you would use as the most
appropriate to represent these marks.
............................................................[1]
2 There are 34 pupils in a school class. Their method of travel to school is given in the following
table.
(i) Calculate, to the nearest degree, the angles of the sectors representing Car and Bus.
Car ....................................................... °
[2]
(iii) In another class there are 27 pupils. Calculate, correct to 1 decimal place, the radius of a
comparable pie chart to illustrate this class. You are NOT required to draw this pie chart.
.................................................... cm [2]
© UCLES 2010 4040/11/O/N/10 [Turn over
4
3 The diagram below shows how many pets (cat, dog, rabbit) are owned by each of 100 households.
No household owns more than one of each different animal.
Cat Dog
33
z 31
y
z z
x
2
Rabbit
...............................................................................................................................................[1]
x = ...........................................................[1]
(iii) Three times as many households own only a cat as own all three types of pet. Find the value
of y.
y = ...........................................................[1]
z = ...........................................................[2]
............................................................[1]
4 The heights of the girls in a village were measured and are represented in the histogram below.
15
10
Frequency
density
5
0
0 145 150 155 160 165 170 175 180
Height (cm)
[4]
(ii) Use the histogram to estimate the modal height of the girls.
..................................................... cm [2]
5 Before an examination, teachers were required to predict the grades which their pupils would
obtain, from A (highest) to D (lowest). The following table shows the numbers of pupils for whom
the predicted grade was accurate.
A 132
B 284
C 203
D 68
(a) Among pupils for whom grade C was predicted, 33 obtained grade B and 4 obtained
grade D.
[1]
(b) There were no pupils for whom the predicted grade and the grade obtained were more
than one grade apart.
[1]
(c) Of the candidates for whom grade D was predicted, one-third obtained grade C.
[1]
(d) Of the candidates for whom grade A was predicted, eleven times as many obtained
grade A as obtained grade B.
[1]
(e) There were 50 candidates for whom grade B was predicted but who obtained another
grade. Nine times as many obtained grade A as obtained grade C.
[1]
(ii) For all those pupils whose predicted grade was not accurate, state, with a reason, whether
the grade obtained was likely to be higher or lower than the predicted grade.
...................................................................................................................................................
...............................................................................................................................................[1]
Frequency
Variable
[2]
(ii) a distribution in which the mean, median and mode are all equal.
Frequency
Variable
[2]
Frequency
1 2 3 4 5 6
Variable
Cumulative
frequency
1 2 3 4 5 6
Variable
[2]
7 (a) A man has three unbiased coins, one gold, one silver and one bronze.
In an experiment, the gold coin is tossed. If a head lands uppermost the experiment stops,
otherwise the silver coin is tossed. If a head on the silver coin lands uppermost the experiment
stops, otherwise the bronze coin is tossed.
On each toss of a coin, if a head lands uppermost, H is recorded, and if a tail lands uppermost,
T is recorded.
(i) State each of the possible sequences of outcome in the experiment together with its
probability.
[4]
(ii) The experiment is carried out twice. Calculate the probability that both experiments stop
with H being recorded.
............................................................[2]
(b) John and Zaheer play a game in which the player who starts the game has an advantage.
When John starts, the probability that he will win is 0.7. When Zaheer starts, the probability
that he will win is 0.8. All games end in a win for one player or the other.
They agree to play a number of games until one of them has won 3 games. Except for the first
game, each game is started by the loser of the previous game.
(i) State the maximum number of games which have to be played for one player to win three
games.
............................................................[1]
(ii) Calculate the probability that John will win by 3 games to 0 when
............................................................[2]
............................................................[2]
(a) write down the three different sequences of four results which lead to Zaheer winning
by 3 games to 1,
[2]
(b) calculate the probability that Zaheer wins by 3 games to 1, if John starts the first
game.
[3]
8 The table below gives certain information about the population and deaths in town X for the year
2008, together with the standard population of the area in which town X is situated.
20 – under 40 d 4500 30
40 – under 60 11 4400 25
(i) The death rate for the 20 – under 40 age group was 2 per thousand. Show that d = 9.
[1]
(ii) Calculate the death rates per thousand for the other three age groups.
................................................................
................................................................
............................................................[2]
(iii) Calculate the crude death rate per thousand for town X.
............................................................[4]
(iv) Using the given rate for the 20 – under 40 age group, and the rates you have calculated
in (ii), calculate, correct to 2 decimal places, the standardised death rate per thousand for
town X.
............................................................[4]
© UCLES 2010 4040/11/O/N/10
13
The table below gives information about town Y, another town in the same area as town X, for the
year 2008. The crude death rate for town Y in 2008 was 7.97 per thousand.
20 – under 40 3 24
40 – under 60 5 22
60 and over 25 19
(v) Calculate the standardised death rate per thousand for town Y in the year 2008, using the
same standard population as for town X.
............................................................[2]
(vi) State, with a reason, which of the two towns would appear to have a healthier environment.
...................................................................................................................................................
...............................................................................................................................................[2]
For both town X and town Y, the standardised death rate is higher than the crude death rate.
(vii) State what this tells you about how their populations compare with the standard population.
...................................................................................................................................................
...............................................................................................................................................[1]
9 A man travels to and from work by car. The following table shows the cost (in $) over a period of
160 working days.
Cost ($) (y) 230 290 340 390 445 500 560 610
(i) Plot a scatter diagram of cost against number of working days on the grid below.
y
800
700
600
500
Cost
($) 400
300
200
100
0
0 20 40 60 80 100 120 140 160 x
Number of working days
[2]
(ii) Calculate three appropriate averages using the data, plot them on the grid, and use them to
draw the line of best fit.
................................................................
................................................................
............................................................[6]
(iii) Obtain the equation of your line of best fit, giving the equation in the form y = mx + c.
............................................................[4]
(iv) Estimate the cost of travelling to and from work for 110 days.
$ ............................................................[1]
(v) If the man used public transport, the journey to and from work would cost him $5 per day.
On the grid, draw a line representing public transport cost against the number of working
days.
[1]
(vi) State the number of days after which the man would have spent the same amount whether
travelling by car or public transport.
............................................................[1]
(vii) Explain briefly why one of your lines passes through the origin, but the other does not.
...................................................................................................................................................
...............................................................................................................................................[1]
© UCLES 2010 4040/11/O/N/10 [Turn over
16
10 The owner of a travelling shop noted how many kilometres he travelled on each of the 300 days on
which he worked during one particular year. He grouped his results as given in the following table.
under 10 6
10 – under 20 13
20 – under 30 62
30 – under 40 96
40 – under 60 65
60 and over 58
The lowest and highest classes in the above table are ‘open-ended’.
(i) State, in each case with a reason, what you would regard as a suitable value to use for
...........................................................................................................................................
.......................................................................................................................................[2]
...........................................................................................................................................
.......................................................................................................................................[2]
(ii) Using your answers to (i), write in the above table the class mid-points (m) of the six classes.
[2]
(iii) The values of m are scaled, by subtracting 25 and dividing by 5, to give values of x.
(b) Estimate, to 3 significant figures, the mean and the standard deviation of the values of x.
Mean = ...............................................................
(c) Use your results in (iii)(b) to estimate, to 3 significant figures, the mean and the standard
deviation of the number of kilometres travelled per day by the owner of the shop.
Mean = .................................................... km
11 A company designed an aptitude test to gauge the suitability of people applying to join its workforce.
The following table summarises the time taken (in minutes) to complete the test by 88 applicants.
30 – under 40 12
40 – under 45 14
45 – under 50 13
50 – under 60 16
60 – under 80 16
80 – under 110 9
(i) Complete the cumulative frequency column in the above table. [2]
(ii) Plot the cumulative frequencies on the grid below, joining the points by a smooth curve.
100
80
60
Cumulative
frequency
40
20
0
0 20 40 60 80 100 120
Time (minutes)
[3]
(iii) Use your graph to estimate the median of the times taken to complete the test.
Applicants who complete the test in under 37 minutes are invited to attend an interview.
(iv) Use your graph to estimate the percentage of these 88 applicants who were not invited for an
interview.
...................................................... % [4]
In addition to the 88 applicants referred to in the table, 11 others had failed to complete the test
after 110 minutes and were stopped from continuing with it.
(v) (a) Explain why, if these 11 applicants are included in the data, it is still possible to estimate
the interquartile range of the times taken to complete the test by all 99 applicants.
...........................................................................................................................................
...........................................................................................................................................
...........................................................................................................................................
.......................................................................................................................................[2]
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/13
Paper 1 October/November 2013
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (CW/CGW) 66919/2
© UCLES 2013 [Turn over
2
1 A survey was carried out to discover whether the quantity of traffic on a busy road was
sufficient to justify the installation of a pedestrian crossing. At intervals throughout one day
an investigator recorded the number of vehicles passing the proposed location in periods of
30 seconds duration.
12 51 64 55 51 61 31 22 20 15 34 14 69 35
When his record sheet was examined the number shown here as was illegible, but it was
certainly a single-digit number.
(i) two measures of central tendency (average) which can still be found,
.......................................................
................................................... [2]
................................................... [1]
................................................... [1]
.......................................................
................................................... [2]
2 The pie chart below illustrates the distribution by location of the total net profit of $787 million For
earned by an international company in the year 2011. Examiner’s
Use
Asia
North
America
Rest
of the
Europe World
(i) Measure, to the nearest degree, the sector angles of the pie chart, and insert them in
the appropriate places on the chart.
[2]
(ii) Calculate, to the nearest $million, the net profit of the company in Asia.
(iii) Measure and state the radius, in centimetres, of the above pie chart.
............................................. cm [1]
The total net profit of the same company in the year 2005 was $523 million.
(iv) Calculate, correct to 2 significant figures, the radius, in centimetres, of the comparable
pie chart for 2005.
............................................ cm [2]
3 A factory employs both male and female staff in each of the three categories managerial, For
inspection and production. Examiner’s
Use
There are altogether 3500 employees, of whom 2150 are male. There are a total of 660
managerial staff, 540 male inspection staff and 785 female production staff.
(i) Insert these values in the appropriate places in the following table.
[5]
4 There are 50 girls in their final year at a school. The diagram below illustrates the number of For
the girls who play each of the sports badminton, volleyball and handball. Examiner’s
Use
Badminton Volleyball
5 6 11
9
7 8
Handball
x
x = ......................................................
..........................................................................................................................................
...................................................................................................................................... [2]
(ii) Find
(a) how many more girls play volleyball than play handball,
................................................... [1]
(b) how many more girls play exactly two sports than play exactly one sport.
................................................... [1]
Half of the girls who play volleyball and two thirds of the girls who play only handball say they
intend to continue playing sport after they have left school.
(iii) Find the number of girls who intend to continue playing sport after they have left school.
................................................... [2]
5 In answering this question you are not required to draw a histogram. For
Examiner’s
The times taken, in minutes, by 174 people to complete an aptitude test are summarised in Use
The times are to be illustrated by a histogram, in which the 30 – under 40 class is represented
by a rectangle of height 18 units.
(i) Calculate the height of the rectangle representing the 40 – under 45 class, and insert
the value in the table.
[1]
(ii) Calculate the heights of the rectangles representing the remaining four classes, and
insert the values in the table.
[3]
(iii) If the final two classes were combined into a single 50 – under 120 class, calculate, to
2 decimal places, the height of the rectangle which would represent the combined class.
................................................... [2]
6 (a) (i) Describe the situation which can lead to the method of systematic sampling For
producing a biased sample. Examiner’s
Use
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [1]
(ii) There are 380 students at a college. It is proposed to take a systematic sample of
20 of the students. Explain briefly how this could be achieved.
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [3]
(b) Briefly explain how a population could be stratified, prior to taking a stratified sample, in
order to ascertain the views of members of the public on
..................................................................................................................................
.............................................................................................................................. [1]
..................................................................................................................................
.............................................................................................................................. [1]
7 (a) A test for a particular disease has a 95% chance of correctly giving a positive result for a
person who has the disease, but a 10% chance of incorrectly giving a positive result for
a person who does not have the disease.
(i) Find the chance that the test gives a negative result for a person who has the
disease, and insert it in the following table.
[1]
15% of the people who are tested are believed to have the disease.
(iii) Calculate the probability that the test gives a correct result for this person.
................................................... [4]
(b) Give all probabilities in this part of the question as fractions. For
Examiner’s
The following diagram classifies the members of a tennis club as to whether they are Use
male or female, left-handed or right-handed, and whether or not they have represented
the club in matches.
Left-handed Right-handed
1 5
3 4
Female
0 8
(i) Calculate the probability that this member has represented the club in matches.
................................................... [1]
................................................... [2]
................................................... [2]
(c) Laura walks to school. On her route she passes two shops, A and B. The probability that For
she will go into shop A on any morning is 0.2, and into shop B is 0.7. Examiner’s
Her decision of whether to go into one of the shops is independent of whether she goes Use
into the other shop. If she goes into either or both shops the probability that she will be
late for school is 0.09.
(i) Calculate the probability that on any morning she will go into exactly one shop and
be late for school.
................................................... [3]
Laura has been told that she must aim to be late on no more than 5% of the schooldays
on which she goes into exactly one shop.
(ii) State, with a reason, whether she is likely to achieve this target.
..................................................................................................................................
.............................................................................................................................. [2]
8 (a) The table below summarises information about the number of GCE O Level subjects For
passed by different numbers of pupils at a school in the year 2011. Examiner’s
Use
(i) On the grid below, draw and label two axes, the horizontal axis representing the
number of subjects passed and the vertical axis representing cumulative frequency.
[2]
[4]
(b) The cumulative frequency graph below illustrates the lengths of journey times, in For
minutes, to their homes of a number of students at a college at the end of one particular Examiner’s
day. Use
200
160
120
Cumulative
frequency
80
40
0
0 10 20 30 40 50 60
Journey time (minutes)
(iv) the number of students whose journey time was longer than 23 minutes,
................................................... [3]
................................................... [2]
On the next day, due to bad weather, the journey time of all students was 5 minutes
longer than the original times illustrated in the graph.
Compared with the original times, state, without further calculation, the effect which the
bad weather had on
.............................................................................................................................. [1]
.............................................................................................................................. [1]
9 The following table gives information about the populations and deaths in two towns, A and For
B, during the course of one year, together with the standard population of the area in which Examiner’s
both towns are situated. Use
Town A Town B
Standard
Age Death rate
Population Deaths Population Deaths population
(per thousand)
0 – under 15 5000 45 p= 6000 66 400
15 – under 45 3750 15 4 27000 54 300
45 – under 65 2500 25 10 15000 60 200
65 and over 1250 q= 32 2000 30 100
(i) For town A, calculate the values of p and of q and insert them in the table.
[2]
................................................... [4]
................................................... [4]
(iv) Use the population figures given in the table to state why the crude death rate and the
standardised death rate of town A are equal.
..........................................................................................................................................
...................................................................................................................................... [2]
The table shows that far more deaths occurred in town B than in town A during the year, For
and yet the standardised death rate for town B is much lower than that for town A. Examiner’s
Use
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [2]
(vi) State, with a reason, the effect, if any, which correcting this error would have on the
crude death rate of town B.
..................................................................................................................................
.............................................................................................................................. [2]
10 The time, in minutes, taken by each of 6 children to walk 1 kilometre, is given in the following For
table. Examiner’s
Use
Child A B C D E F
Age in years (x) 13 8 7 15 12 9
Time in minutes (y) 12 23 25 11 18 23
y
30
20
Time
(minutes)
10
0
0 2 4 6 8 10 12 14 16 x
Age (years)
[2]
(ii) Calculate the overall mean and the two semi-averages of the data, and plot them on
your graph.
[5]
(iii) Use your plotted averages to draw a line of best fit. [1] For
Examiner’s
(iv) Using any valid method, obtain the equation of your line of best fit, and write it in the Use
form y = mx + c.
................................................... [3]
(v) Use your equation to estimate, to the nearest minute, the time taken to walk 1 kilometre
by a child aged 14 years.
................................................... [1]
(vi) (a) Comment on how well your line of best fit matches the data points.
..................................................................................................................................
.............................................................................................................................. [1]
(b) From the graph, identify the child for whom your line of best fit most overestimates
the time taken.
................................................... [1]
(vii) State, with a reason, whether it would be valid to use your line of best fit to estimate the
time taken to walk 1 kilometre by a person whose age is outside the range of values
given in the table.
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
.............................................................................................................................. [2]
11 The following table summarises the increase, in dollars, of the annual income of a sample of For
200 people between the years 2006 and 2011 (a negative value indicates a decrease). Examiner’s
Use
(i) Obtain the mid-point, m, for each of the five classes and insert the values in the table.
[1]
(ii) For each class, obtain the value of the scaled variable, y, where
y = m – 750 ,
250
and insert the values of y in the table.
[2]
(iii) Obtain the values of Σfy and Σfy 2 and use them to estimate the values of the mean of y For
and the variance of y. Examiner’s
Use
Mean = ......................................................
................................................... [2]
................................................... [3]
................................................... [1]
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2013
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (NH/CGW) 66916/2
© UCLES 2013 [Turn over
2
mean,
median,
mode,
range,
interquartile range,
variance
and standard deviation.
In each of the following situations, one of these measures is to be found by the person
described. State the appropriate measure in each case.
................................................... [1]
(ii) An athlete who competes in the 100 metres sprint finds the difference between his
slowest and quickest practice times.
................................................... [1]
(iii) A graduate who seeks employment with a company finds a measure of central tendency
for the salaries of the company’s employees. The company has twenty employees, of
whom three are managers earning salaries very much higher than the other employees.
................................................... [1]
(iv) A teacher finds a measure of dispersion for the scores of her pupils in a test, in which no
pupil scored an exceptionally high mark, and no pupil scored an exceptionally low mark.
................................................... [1]
(v) A biologist finds a measure of dispersion for the growth of twelve plants over a period of
three months. Two plants have been attacked by insects and have grown very much less
than the others.
................................................... [1]
(vi) A sociologist finds a measure of central tendency for the first names given to the male
babies born in a hospital over a period of six months.
................................................... [1]
2 A large keep fit class for women is held at a sports club once every week. The manager of For
the club asks the class instructor to select a sample of size 10 from the class. Examiner’s
Use
(i) State the method of sampling used if the instructor decides to select
................................................... [1]
................................................... [1]
The sample is required to obtain responses to a proposal to change the time of the class from
Monday evening to Monday afternoon. For class members the only items of data presently
available to the instructor are name and age.
(ii) State, and justify, two other items of data relating to class members which the instructor
needs to know when selecting the sample in order to avoid bias in responses. You are
not required to describe how the sample is selected.
..........................................................................................................................................
..........................................................................................................................................
..........................................................................................................................................
..........................................................................................................................................
..........................................................................................................................................
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [4]
3 In a photographic equipment store a record was kept of the number of cameras sold each For
day. The values, for eleven consecutive days, were as follows. Examiner’s
Use
6 0 8 2 1 6 0 9 6 4 1
................................................... [1]
................................................... [2]
................................................... [2]
The values recorded for the next three days were x, x + 1 and x + 2.
(iv) If the median for the entire fourteen-day period was the same as the median for the first
eleven days, find x.
x = .................................................. [1]
4 The diagram below shows the number of actors at a film festival who have worked in one or For
more of the cities Mumbai, Los Angeles and Rome. Examiner’s
Use
13 5 9
4
3 6
Rome
................................................... [1]
..........................................................................................................................................
...................................................................................................................................... [1]
................................................... [2]
................................................... [1]
(v) Rome, given that the actor has worked in Mumbai and Los Angeles.
................................................... [1]
5 In this question you are not required to draw any charts. For
Examiner’s
A charity, Camfam, classifies the income it receives under the headings Special Events, Use
Donations, Grants, and Other Sources. In Camfam’s report for 2010, the following percentage
bar chart was given, which represents a total income of $80 million.
2010
0 10 20 30 40 50 60 70 80 90 100
Percentage
(i) Find the income which Camfam received in 2010 from Grants.
$ ................................................... [2]
(ii) If a pie chart were to be drawn to represent this information, find the angle which would
represent the sector for Special Events.
................................................... [2]
(iii) Find, in its simplest terms, the ratio of the area of the chart representing 2010 to the
area of the chart representing 2011.
................................................... [2]
6 The following table is to show the distance, in kilometres, between any two of the five towns For
A, B, C, D and E. Examiner’s
Use
A
42 B
20 39 D
36 18 25 E
(a) The distance between B and C is 10 km more than the distance between D and E.
[1]
(b) The distance between A and C is two thirds of the distance between A and E.
[1]
(c) The distance between A and B is twice the distance between C and E.
[1]
[1]
Dimitri lives in town A, but has one friend in each of the towns D and E. He makes a journey
in which he leaves his home, visits each of these friends once, and then returns home.
(ii) Find the distance which Dimitri travels to complete the journey.
............................................. km [2]
7 In this question the fertility rate of a population is defined as the number of births per 1000
females.
The table below gives information about the female population and age group fertility rates in
a particular city for the year 2012, together with the standard population of the area in which
the city is situated.
(i) Calculate, to 1 decimal place, the standardised fertility rate for the city.
................................................... [4]
(ii) Calculate the number of births for each age group and insert the values in the table
above.
[2]
(iii) Calculate, to 1 decimal place, the crude fertility rate for the city. For
Examiner’s
Use
................................................... [4]
There are equal numbers of males and females in the city and in the standard population.
The standardised and crude death rates for the city in 2012 were 8.5 and 7.8 per thousand
of the population respectively.
(iv) Using one of these values, and any other appropriate values from parts (i), (ii) and (iii),
find the increase in the population of the city in 2012 due to births and deaths.
................................................... [5]
..........................................................................................................................................
...................................................................................................................................... [1]
8 In a large residential building there are 120 apartments, of which 50 are private apartments For
(owned by the residents) and 70 are company apartments (owned by the company which Examiner’s
constructed the building). Use
................................................... [2]
................................................... [2]
The weekly rents, in dollars, charged on the company apartments are represented in the
histogram below, from which one rectangle, representing the $400 to under $500 class, has
been omitted.
25
20
15
Number
of
apartments
per $50
10
0
200 250 300 350 400 450 500
Weekly rent ($)
Use the histogram to find the number of company apartments for which the weekly rent was For
Examiner’s
(iii) from $250 to under $400, Use
................................................... [2]
................................................... [2]
There were 10 company apartments for which the weekly rent was from $400 to under $500.
(v) Complete the histogram by drawing on the grid the rectangle representing
the $400 to under $500 class.
[1]
(vi) Write down the term used to describe the $300 to under $350 class.
................................................... [1]
The private apartments are of three different sizes. There are 24 apartments with three
rooms, 14 with four rooms, and 12 with five rooms.
A safety expert, conducting a survey on the use of smoke detectors, chooses three private
apartments at random.
(vii) If the apartments chosen have 12 rooms in total, find the probability that the apartments
are all of the same size.
................................................... [6]
© UCLES 2013 4040/12/O/N/13 [Turn over
12
9 The mid-day temperature at a particular location in a city was measured every day throughout For
the year 2010. The following table summarises the results obtained. Examiner’s
Use
(i) Complete the cumulative frequency column in the above table. [2]
(ii) Plot the cumulative frequencies on the grid opposite, joining the points by a smooth
curve. [3]
.............................................. °C [1]
.............................................. °C [4]
For
Examiner’s
400 Use
350
300
Cumulative frequency (days)
250
200
150
100
50
0
0 5 10 15 20 25 30 35 40
Temperature (°C)
When the results were obtained, a scientist predicted that, because of climate change, For
temperatures in the city would increase at the rate of 0.5 °C every ten years. Examiner’s
Assume that this prediction is accurate. Use
(iv) use your answers to part (iii) to estimate, for the year 2050,
........................................°C [2]
........................................°C [1]
(v) use your graph to estimate, for the period 2010 to 2050, the increase in the number of
days with a mid-day temperature of more than 36 °C.
................................................... [3]
BLANK PAGE
10 Emilie, a student teacher, conducted research on the number of pupils and the number For
of teachers in the schools in the town of Astra, where she lives. The schools supplied the Examiner’s
following data. Use
School A B C D E F G H
Number of pupils, x 760 1219 927 470 1361 628 381 1085
Number of teachers, y 29 44 33 34 52 24 16 40
y
60
50
40
Number
of 30
teachers
20
10
0
0 200 400 600 800 1000 1200 1400 x
Number of pupils
[2]
The data have an overall mean of (853.875, 34) and an upper semi-average of (1148, 42.25). For
Examiner’s
(ii) Show how the value 1148 is calculated. Use
[2]
................................................... [2]
(iv) Without plotting the averages, and without drawing the line, find the equation of the
line of best fit in the form y = mx + c.
................................................... [3]
(v) Explain briefly why the value of c which you have found in part (iv) might give you
cause for concern.
..........................................................................................................................................
...................................................................................................................................... [1]
Emilie discovered later that the data supplied by one of the schools gave, incorrectly, the
total number of people employed by the school, and not the number of teachers.
(vi) Ignoring the point representing the school which supplied incorrect data, draw, by eye,
on the grid in part (i), a line of best fit through the remaining seven points. [1]
(vii) Use the line you have drawn in part (vi) to find its equation in the form y = mx + c.
................................................... [3]
Emilie repeated the research for schools in the nearby town of Belport, for which she found For
the equation of the line of best fit to be y = 0.0431x + 1.72 . Examiner’s
Use
(viii) Using this equation, and your answer to part (vii), state in which of the two towns a pupil
might choose to be educated, if free to choose. Explain your answer briefly.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
11 (i) Give one advantage and one disadvantage of forming a large set of data into a grouped For
frequency distribution. Examiner’s
Use
Advantage.........................................................................................................................
..........................................................................................................................................
Disadvantage ....................................................................................................................
...................................................................................................................................... [2]
The presenter of a radio programme, in which recordings of popular songs are played, plans
his programme. For each song chosen he writes down the song length, in terms of time, in
minutes, taken to play the song. The following table summarises the song lengths.
(ii) Estimate, in minutes, the mean and standard deviation of these song lengths. Give your
answers to 3 significant figures.
Mean = ......................................................
Information about five of the presenter’s earlier programmes is shown below. For
Examiner’s
Use
Number of songs Mean of song lengths Standard deviation of
Programme
played (minutes) song lengths (minutes)
P 38 3.70 0.339
Q 39 3.52 0.328
R 42 3.69 0.294
S 37 3.83 0.305
T 38 3.74 0.291
................................................... [1]
................................................... [1]
All the presenter’s programmes are three hours in duration. Songs are not played continuously
throughout each programme; for some of the time the presenter talks about the songs and
the singers.
(iv) Find the probability that a song was not being played at that moment.
................................................... [4]
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/13
Paper 1 October/November 2012
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (RW/JG) 51082/2
© UCLES 2012 [Turn over
2
1 State, in each case, the name of a method of sampling which fits the given description.
................................................... [1]
(ii) A method which only requires the use of random numbers to select the first item in a
sample.
................................................... [1]
(iii) A method which guarantees that the proportions of different categories in a sample will
exactly match those in the population being sampled.
................................................... [1]
(iv) A method which may produce a biased sample if there is some pattern in the population
which repeats at regular intervals.
................................................... [1]
(v) A method in which, at the start of sampling, all items in the population have an equal
chance of being selected for the sample.
................................................... [1]
................................................... [1]
2 A group of children are given a general knowledge test, and the following table gives, for ten For
of them, their age in months and their score in the test. Each child is identified by a letter in Examiner’s
the range A to J. Use
Child A B C D E F G H I J
Age (months) 148 174 126 180 160 108 138 192 158 189
Test score 60 72 36 82 61 20 40 86 50 81
The line of best fit method is to be used to estimate a child’s test score from his/her age.
(i) State, with a reason, which of age and test score should be used as the independent
variable (x) and which as the dependent variable (y).
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
(ii) State, with a reason, the letters corresponding to the children whose ages and scores
you would use to calculate the lower semi-average.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
[2]
3 It is required to obtain the equation of the line of best fit of two variables, x and y, in the form For
y = mx + c. Examiner’s
Use
Observations are taken on the two variables and the two semi-averages are calculated.
m = .................................................. [2]
(ii) Use your value of m from part (i) and the overall mean to obtain the value of c, and
hence state the equation of the line of best fit.
................................................... [3]
(iii) State the reason why the same equation would have been obtained if the overall mean
and a semi-average had been used to obtain the value of m.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [1]
4 The table below summarises the number of unsuccessful attempts which a sample of 100 For
people made before passing their driving test. Examiner’s
Use
Number of unsuccessful
Number of people Cumulative frequency
attempts
0 51
1 34
2 10
3 4
4 1
(i) Calculate the cumulative frequencies and insert them in the table. [1]
(ii) On the grid below, draw and fully annotate, using suitable scales, an appropriate
cumulative frequency graph to illustrate these data.
[5]
5 Twelve families live in a certain street. The table below summarises the number of girls and For
boys in each of these families. Examiner’s
Use
For example, there is one family with two boys and three girls.
(i) Construct the frequency distribution of the number of children per family, and present it
in an appropriate table.
[3]
(ii) Complete the frequency distribution table below to show the number of brothers that the
girls have.
6 (i) Name one measure of central tendency and one measure of dispersion which would be For
distorted by the presence of one extreme observation in a set of data. Examiner’s
Use
(ii) Name one measure of central tendency and one measure of dispersion which could still
be calculated for a set of data even if the largest value in it were unknown.
(iii) The units in which a particular variable is measured are cm. State the units of the
variance of this variable.
................................................... [1]
(iv) Draw a sketch of a distribution in which the mean and median are equal, but in which
the mode is not equal to them.
Frequency
Variable
[1]
7 (a) A boy has two unbiased coins, one silver and one bronze. In an experiment he tosses the
silver coin, and if a tail lands uppermost tail (T) is recorded. If a head lands uppermost
he tosses the bronze coin, and whichever face lands uppermost is recorded, head (H)
or tail (T).
(i) State the possible recorded outcomes and the probability associated with each.
[4]
(ii) If the experiment is carried out twice, calculate the probability that on both occasions
a tail is recorded.
................................................... [2]
1 1
(b) Of a large bird population near a garden 3 are thrushes, 4 are starlings and the rest
are robins. Assuming that the birds behave independently of each other, calculate the
probability that three birds landing in the garden will
................................................... [1]
................................................... [4]
© UCLES 2012 4040/13/O/N/12
9
On another day, two robins, one starling and one thrush are in the garden. For
Examiner’s
(iii) Under the same assumption, calculate the probability that they will fly away in Use
alphabetical order.
................................................... [2]
(c) Four people, including the car owner and his wife, are going on a journey in a four-
seater car. In how many different ways can they be seated in the car if
(i) the car owner is to drive and his wife is to sit next to him,
................................................... [1]
(ii) the car owner is to drive and the other three people may be seated anywhere,
................................................... [1]
................................................... [1]
8 In answering all parts of this question you are required to show your working, either as For
the calculations leading to your result, or by drawing appropriate lines on the graph. Examiner’s
Use
The cumulative frequency graph below illustrates the marks obtained by 100 candidates on
each of two examination papers.
100
Paper 2
Paper 1
80
60
Cumulative frequency
40
20
Paper 1 Paper 2
0
0 10 20 30 40 50 60 70 80 90
Marks
................................................... [2]
(ii) the lower quartile, the upper quartile and the interquartile range of the marks scored on
Paper 1.
(iii) on which paper the strongest candidates generally scored more marks,
..........................................................................................................................................
...................................................................................................................................... [2]
(iv) on which paper the weakest candidates generally scored more marks,
..........................................................................................................................................
...................................................................................................................................... [2]
(v) which paper gave the greater discrimination between the strongest and the weakest
candidates.
..........................................................................................................................................
...................................................................................................................................... [2]
For each paper, a distinction is to be awarded to the 7% of candidates who achieve the
highest marks on that paper.
(vi) Estimate the mark on Paper 1 and the mark on Paper 2 which would lead to distinctions
being awarded.
(vii) Interpret the point at which the two lines on the graph intersect.
..........................................................................................................................................
...................................................................................................................................... [1]
9 In this question give all death rates per thousand, and those that are not exact to For
2 decimal places. Examiner’s
Use
The following table gives the population of a city in the year 2010, together with various
statistics relating to deaths in the city during that year, and the standard population of the
area in which the city is situated.
(i) Calculate the values of p, q and r and insert them in the table.
[3]
................................................... [4]
................................................... [4]
(iv) Calculate the percentage of the city’s population in each age group, and insert your For
results in the table. Examiner’s
Use
[2]
(v) Using your results in part (iv), explain why the city’s crude death rate is higher than its
standardised death rate.
..........................................................................................................................................
...................................................................................................................................... [1]
(vi) If the last two classes in the table were combined into a 45 and over class, state, with a
reason, which of the rates in parts (ii) and (iii) would remain unaltered.
..........................................................................................................................................
...................................................................................................................................... [2]
10 (a) In this part of the question you are NOT required to draw a histogram. For
Examiner’s
The table below shows the age distribution of the residents of a housing estate. Use
There are four times as many residents in the 21 – under 30 group as there are in the
5 – under 11 group.
(iv) Give a reason why the rectangle representing the 21 – under 30 group would not
be four times as high as that representing the 5 – under 11 group.
..................................................................................................................................
.............................................................................................................................. [1]
(b) A manufacturer of car tyres presented the following data summarising the life (in terms For
of distance travelled) of a sample of 100 tyres. Examiner’s
Use
Distance 18 19 20 21 22
Frequency 8 14 26 35 17
A statistician presented with this data assumed that the distances had been measured
in thousands of kilometres, correct to the nearest thousand.
(i) Estimate, to 3 significant figures, the mean and the standard deviation of the
distances, under the statistician’s assumption.
The statistician subsequently discovered that the distances in the table actually
represented lower class limits (for example, the 18 represents any distance in the
18 – under 19 range), and also that they were in units of thousands of miles.
(ii) Taking 1 mile to be equal to 1.6 km, estimate the correct mean and standard
deviation in metric units.
11 (a) At the end of the year 2009, 100 students were asked which of the three countries France For
(F ), Germany (G) and Italy (I ) they had ever visited. The diagram below summarises Examiner’s
their replies. Ten of the students had not visited any of the three countries. Use
F G
32 7 24
2
5 x
17
10
x = .................................................. [2]
..................................................................................................................................
.............................................................................................................................. [1]
During the year 2010 the following visits took place. For
Examiner’s
Of the students who had previously visited only France, 2 visited Germany, and a Use
Of the students who had previously visited only France and Germany, 3 visited Italy.
Of the 10 students who had previously not visited any of the three countries, 1 visited
France, a different 1 visited Italy, and a further 2 visited Germany.
(iii) Complete the following diagram to illustrate the situation at the end of the year
2010.
F G
[6]
(b) In this part of the question you are NOT required to draw any pie charts. For
Examiner’s
The treasurer of a tennis club classifies all club expenditure under four headings: Use
maintenance of the club house and courts; tennis balls; wages paid to the club cleaner;
taxes & insurance. In the year 2010 expenditure on these four items was in the ratio
4 : 3 : 2 : 1 respectively.
(i) Calculate the angle of the sector representing expenditure on tennis balls.
In the previous year, 2009, total expenditure had been 90% of what it was in 2010.
(ii) Calculate, to 1 decimal place, the radius of the pie chart which would be drawn to
represent expenditure in 2009.
............................................ cm [2]
(iii) Calculate, to the nearest degree, the angle of the sector which would represent the
expenditure on tennis balls in a pie chart for the year 2011.
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2012
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (SJF/CGW) 51081/5
© UCLES 2012 [Turn over
2
1 A supermarket carried out a survey to see if customers were satisfied with the services it
provided. The results are shown in the pictogram below.
= 60 satisfied customers
= 6 dissatisfied customers
(a) satisfied,
................................................... [1]
(b) dissatisfied.
................................................... [1]
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
The survey was carried out by a team of interviewers who spoke to customers in the
supermarket on one particular weekday afternoon.
(iii) State, giving a reason for your answer, whether or not you would expect the results of
the survey to be biased.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
2 A car hire company has 5 small cars, 15 medium-sized cars and 10 large cars. An inspector For
selects a sample of cars from this population to test their mechanical condition. Examiner’s
Use
(i) For the different possible sampling methods, state whether each of the following is true
or false.
(a) A (simple) random sample of size 10 might contain only medium-sized cars.
................................................... [1]
................................................... [1]
(c) If quota sampling is used, the inspector may select as many cars as he wishes, and
as many of each type of car as he wishes.
................................................... [1]
(d) If systematic sampling is used, every car has an equal chance of being selected
after the first car has been selected.
................................................... [1]
(ii) The inspector decides he has time to test only 9 cars. If he wishes to obtain a random
sample stratified by car size, find how many cars of each size he must select.
Small .......................................................
Medium .......................................................
3 The diagram below shows the number of musicians performing at a music festival who play For
one or more of the instruments bass, keyboards and drums. Examiner’s
Use
EDVV NH\ERDUGV
GUXPV
(i) keyboards,
................................................... [1]
................................................... [1]
................................................... [2]
The guitar is also played by all the bass players, but not by any of the other musicians. For
Examiner’s
For the four instruments, bass, keyboards, drums and guitar, find the number of musicians Use
who play
................................................... [1]
................................................... [1]
4 The products leaving a factory are checked every day to ensure that faulty items are not sent For
out to customers. The table below shows values of the variable X, the number of faulty items Examiner’s
found in a day, for a period of 50 days. Use
Number of days, f 19 10 7 6 5 3
For example, there were 7 days when 2 faulty items were found.
(i) State the mode of the distribution, and explain why it is a poor measure of central
tendency in this case.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
................................................... [1]
The mean of the distribution has been calculated to be 1.75 faulty items per day.
(iii) Find the upper class limit of the 5 or more class which has been used in the calculation.
................................................... [3]
5 Each person at a committee meeting receives a plate of seven biscuits for refreshments, of For
which four are ginger, two are chocolate, and one is plain. One committee member chooses Examiner’s
four biscuits at random from her plate. Use
................................................... [1]
(ii) find the probability of each value of X, and present your results in a suitable table.
[5]
6 At a medical centre the dosage for a particular vaccination is 0.500 ml, but small variations For
from this are acceptable. Examiner’s
Use
The exact dosages (in millilitres) in a prepared batch of six were found to be as follows.
................................................... [2]
It is very important that, for a batch of six, the standard deviation is no greater than 0.01250 ml,
and the range is no more than three times the standard deviation.
(ii) Find whether or not the prepared batch satisfies both of these conditions.
...................................................................................................................................... [3]
A trainee technician at the medical centre suggests that, as well as checking that the
standard deviation is not too large, they must also check that it is not too small.
(iii) State, giving a reason, whether or not you agree with the trainee.
..........................................................................................................................................
...................................................................................................................................... [1]
7 Two universities carried out a survey of their graduates one year after graduation. The
graduates were classified as employed, unemployed, or engaged in further study, as shown
in the following pie charts, which are drawn to scale.
University A University B
Employed
Unemployed
Further study
Find the number of graduates who, one year after graduation, were
................................................... [2]
................................................... [4]
The graduates from University A who were employed were asked how long it took them to For
find employment. The times are represented in the histogram below. Examiner’s
Use
250
200
Number of 150
graduates
per
1 month
100
50
0
0 2 4 6 8 10 12
Time (months)
................................................... [1]
Use the histogram to find the number of graduates from University A for whom the time taken
to find employment was
................................................... [2]
(v) from 3 months to 6 months,
................................................... [2]
(vi) from 6 months to 12 months.
................................................... [1]
Of the employed graduates from University A, 163 were science graduates. For
Examiner’s
(vii) Estimate the number of science graduates from University A who found employment within Use
6 months of graduation.
................................................... [3]
(viii) State the assumption which you have made in your calculation in part (vii).
..........................................................................................................................................
...................................................................................................................................... [1]
8 In this question calculate all fertility rates per thousand, and to the nearest whole For
number. Examiner’s
Use
The fertility rate is defined as the number of births per 1000 females.
The table below gives information about the female population and births in the town of
Bluedorf for the year 2010, together with the standard female population of the area in which
Bluedorf is situated.
20 – 30 459 2250 15
31 – 40 488 3050 20
Over 40 76 4000 40
................................................... [4]
(ii) Calculate the fertility rate for each age group and insert the values in the table above.
[2]
................................................... [4]
The table below gives information about Redville, another town in the same area as Bluedorf, For
also for the year 2010. Examiner’s
Use
20 – 30 225 1560
31 – 40 180 1700
Over 40 20 4950
(iv) Calculate the standardised fertility rate for Redville in the year 2010, using the same
standard population as for Bluedorf.
................................................... [2]
(v) Find how many more births there were in Bluedorf than in Redville in the year 2010.
................................................... [2]
The local government of the area in which Bluedorf and Redville are situated wishes to limit
population growth, but only has sufficient funds for a publicity campaign on birth control in
one of these two towns.
(vi) State, with a reason, in which of these two towns the campaign should be conducted.
..........................................................................................................................................
..........................................................................................................................................
...................................................................................................................................... [2]
9 In answering all parts of this question you are required to show your working, either as For
the calculations leading to your result, or by drawing appropriate lines on the graph. Examiner’s
Use
Passengers on an aeroplane are allowed to check-in one bag for carriage in the aeroplane’s
hold. The weights of checked-in bags for one particular flight are illustrated in the cumulative
frequency curve below.
160
140
120
100
Cumulative
frequency 80
(bags)
60
40
20
0
0 5 10 15 20 25 30 35
Weight of bag (kg)
............................................. kg [1]
............................................. kg [4]
k = ................................................... [2]
The weight allowance for a checked-in bag is 20 kg. Bags heavier than 20 kg are classified as
overweight.
................................................... [2]
............................................. kg [2]
For overweight bags passengers are charged $8 per kilogram for the amount by which the
weight of their bag exceeds 20 kg. For example, for a bag weighing 21.5 kg, the passenger
would be charged $12.
(iii) Assuming that the median and mean of the overweight bags are approximately the
same, use your answers to part (ii) to estimate the total money received in charges for
the overweight bags on this particular flight.
$ ................................................... [4]
(iv) Interpret the point at which the cumulative frequency curve meets the horizontal axis.
..........................................................................................................................................
...................................................................................................................................... [1]
10 Soraya is checking her telephone bill. For all the national calls listed she writes down the For
length of the call, to the nearest minute, and the cost of the call, to the nearest cent. The Examiner’s
results are shown in the following table. Use
120
100
80
Cost
(cents)
60
40
20
0 x
0 2 4 6 8 10 12 14
Length of call (minutes)
[2]
The data have an overall mean of (5.875, 57.625) and a lower semi-average of (1.75, 31.5).
[2]
(iii) Find the upper semi-average, and plot this and the two given averages on your graph. For
Examiner’s
Use
................................................... [3]
(iv) Use your plotted averages to draw a line of best fit, and find its equation in the form
y = mx + c.
.................................................. [4]
For national calls, the cost is made up of a connection charge for the call, and a charge per
minute for the length of the call.
For local calls, there is a fixed charge of 10 cents for any call lasting up to 2 minutes. There is
a further charge of 3 cents per minute for any length of time for which the call is longer than
2 minutes.
(vi) Draw, on the grid in part (i), a graph showing the cost of local calls lasting between
0 and 14 minutes.
[2]
Later, Soraya makes a national call which costs 40 cents, and a local call which costs
40 cents.
(vii) Use your graphs to estimate how much longer the local call lasted than the national call.
Give your answer to the nearest minute.
.....................................minutes [2]
11 In this question give your answers either as fractions, or as exact decimals, or as For
decimals correct to 3 significant figures. Examiner’s
Use
(a) In an office there are 15 workers, of whom 3 are supervisors and 12 are assistants. Two
workers are chosen at random, without replacement.
................................................... [2]
................................................... [2]
(b) Kwame likes quizzes, and estimates that he knows the correct answer to 60% of quiz
questions. He enters a television quiz, in which, for each question asked, four alternative
answers are offered to the contestant, only one of which is correct. If Kwame knows the
correct answer he chooses the correct answer, otherwise he makes a random guess.
Assume that his own estimate of his knowledge is accurate.
................................................... [2]
................................................... [2]
(b) he knows none of the correct answers, but answers all the questions correctly.
................................................... [3]
(c) In a game, a turn consists of rolling an unbiased six-sided dice with faces numbered 1, For
2, 3, 4, 5 and 6. Examiner’s
If the number obtained is 3, 4, 5 or 6, then that is the score for the turn. Use
If the number obtained is 2, then the dice is rolled one more time, and the score for the
turn is the sum of 2 and the number obtained on the second roll.
If the number obtained is 1, the dice is rolled two more times, and the score for the turn
is the sum of 1 and the numbers obtained on the second and third rolls.
If, in a turn, a player obtains a score of 5, find the probability that the dice has been
rolled two times altogether.
................................................... [5]
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/13
Paper 1 October/November 2011
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Mathematical tables
Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
1 The number of patients in the waiting room of a medical surgery was recorded at noon on each of
eleven different days. The recorded numbers were
3 5 7 8 9 1■ 7 5 5 6 4
The largest number, shown as 1■, was partially illegible, although it was clearly a two-digit number
with first digit 1.
(i) Name and calculate two measures of average (central tendency) which can still be found.
[4]
(ii) Name and calculate one measure of dispersion which can still be found.
.............................................................................................................................................. [2]
2 (a) Describe briefly the difference between the method of obtaining a simple random sample
and that of obtaining a stratified random sample.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
(b) Describe briefly how you could obtain a systematic sample of the pupils in a school.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
(c) The results of a statistical survey may be subject to either or both of bias and error. Explain
the difference between bias and error in this context.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
3 The pictogram below illustrates the number of houses in each of three streets in a town.
High Street
Middle Street
Low Street
....................................................[1]
(ii) State how many more houses there are in Low Street than in High Street.
....................................................[1]
....................................................[2]
(iv) Calculate, as a fraction in its lowest terms, the probability that both chosen houses are in
Middle Street.
....................................................[2]
4 A survey was conducted in 47 households to discover how many cats and how many dogs each
owned. No household owned more than 3 of either animal. The results are to be shown in the
following table.
Number of dogs
0 1 2 3 TOTAL
0 13 29
1 12
Number
of 2 4
cats
3 2
TOTAL 24 13 7 3 47
The given frequency shows that 13 households owned neither a cat nor a dog.
(i) Insert the appropriate frequencies into the table for each of the following pieces of
information.
(a) 1 household owned two cats and one dog, and 2 households owned one cat and two
dogs. [1]
(b) All other households which owned more than one of either animal owned none of the
other. [1]
[4]
5 During the course of one week a man recorded how much time he spent carrying out various
1 1
tasks in his garden. He spent –3 of the total time weeding and –4 of the total time planting flowers
and vegetables. The remaining time was split equally between mowing grass and all other work.
(i) Draw and label a pie chart of radius 5 cm to illustrate the proportion of the time spent by the
man on each of these four tasks.
[4]
(ii) During the following week the man spent 50% more time working in his garden than in the
first week. Calculate, to 2 decimal places, the radius of a comparative pie chart that would
represent the second week. (You are not required to draw a second pie chart.)
..............................................cm [2]
6 The histogram below represents the age distribution of the workers in a factory.
40
30
Number of
workers
20
per 5-year
interval
10
0
0 10 20 30 40 50 60 70
Age (years)
Find
(i) the number of workers who were aged between 35 and 50,
....................................................[2]
....................................................[2]
There were altogether 191 workers in the factory. All those not yet represented were aged between
50 and 70 years.
[2]
7 (a) Oliver and Pierre are playing a game which requires that initially a fair coin is tossed to decide
which of the two has the first turn. A turn involves rolling an unbiased six-sided dice with faces
numbered 1, 2, 3, 4, 5 and 6. They keep rolling the dice alternately until one of them rolls a 4,
at which point he is declared the winner.
(i) Calculate the probability, before the coin is tossed, that Pierre will win on his first turn.
....................................................[3]
(ii) Given that the coin has been tossed, and Oliver is to roll the dice first, calculate the
probability that he will win on his second turn.
....................................................[2]
(b) A box contains 5 red discs and 7 blue discs. A girl draws a disc, notes its colour, and replaces
it in the box. A boy then draws a disc and notes its colour.
....................................................[1]
....................................................[2]
The experiment is then repeated, using the same original contents of the box, except that the
girl’s disc is not replaced before the boy draws.
....................................................[2]
....................................................[1]
(c) In a group of 100 people, each is classified by gender and according to whether or not he or
she wears glasses. The results are tabulated below.
One person is chosen at random from the group. Denote the event ‘the chosen person is
female’ by A, and the event ‘the chosen person does not wear glasses’ by B.
...........................................................................................................................................
...........................................................................................................................................
...........................................................................................................................................
.......................................................................................................................................[2]
P(B ʹ) = ........................................................
P(Aʹ ∪ B ) = ....................................................[2]
(iii) Given that the chosen person is male, calculate the probability that he does not wear
glasses.
....................................................[1]
BLANK PAGE
8 A school wishes to establish a system of awarding points in each event in the school athletics
championships. As a trial experiment, it has been decided to analyse data from past years for
one particular throwing event. The distances thrown, in metres, have been summarised into two
grouped frequency distributions, one for male athletes and one for female athletes. The cumulative
frequency graphs illustrating these two distributions are shown below.
100
Male
90
Female
80
70
60
Cumulative
50
frequency
40
30
20
10
Female Male
0
0 10 20 30 40 50
Distance (metres)
The further the throw, the more points are to be awarded. Each gender is to have its own points
scale.
(i) For male athletes the shortest 10% of throws are to be awarded the minimum of 1 point.
Estimate the shortest distance which a male athlete must throw to be awarded 2 points.
........................................metres [2]
(ii) For male athletes, the central 40% of throws are to be awarded 3 points. Estimate the
distances between which a male athlete must throw to be awarded 3 points.
(iii) Estimate the median, upper quartile and lower quartile distances thrown by female athletes.
A quantity called the inter-quartile ratio is obtained by expressing the inter-quartile range as a
percentage of the median.
(iv) Using your results from part (iii), estimate, to 1 decimal place, the inter-quartile ratio for female
athletes.
................................................% [3]
(v) Are the positions of the two curves, relative to each other, what you would expect them to be?
Give a reason.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
9 (a) For summarising its census data, a country divides its population into three age groups, 0
to 10 years, 11 to 30 years, and 31 years and older. The marriage rates for the three groups
at the last census were mistakenly published in the wrong order as 12 per thousand, 5 per
thousand and 0 per thousand.
...........................................................................................................................................
.......................................................................................................................................[1]
...........................................................................................................................................
.......................................................................................................................................[1]
...........................................................................................................................................
.......................................................................................................................................[1]
(b) The table below gives information about two neighbouring towns, A and B, together with the
standard population for the area in which the towns are situated.
Town A
Standard Town B
Age group Number of Death rate
population Population population
deaths (per thousand)
00 – 24 3500 3000 45 P 2500
25 – 49 3000 2500 Q 8 1500
50 and over 2500 R 30 20 3000
P = ......................................................
Q = ......................................................
R = ..................................................[3]
....................................................[2]
....................................................[4]
For Town B the crude death rate is 14.0 per thousand, and the standardised death rate is 12.5
per thousand. One of these rates is larger than the corresponding rate for Town A, but the
other is smaller.
(iv) State, with a reason, which rate should be used to compare the chances of survival in
the two towns, and which town gives the better chance.
...........................................................................................................................................
...........................................................................................................................................
...........................................................................................................................................
.......................................................................................................................................[3]
(v) State what causes the other rate to lead to an incorrect conclusion in this case.
...........................................................................................................................................
.......................................................................................................................................[1]
10 The following table gives data for eight geographical areas derived from the 1991 UK Census.
Area A B C D E F G H
X 12.8 7.3 7.6 15.2 3.1 6.6 5.0 7.9
Y 77 52 57 73 33 49 32 51
(i) Plot points representing these eight areas on the grid below. [3]
y
80
60
40
20
0
0 2 4 6 8 10 12 14 16 x
(ii) Calculate the overall mean point and plot it on the grid.
[2]
(iii) Explain why, in calculating the semi-averages, you should use areas B, E, F and G for one
average, and areas A, C, D and H for the other.
...................................................................................................................................................
...............................................................................................................................................[1]
(iv) Obtain the two semi-averages and plot them on the grid.
[3]
(v) Draw the line of best fit using your plotted averages. [1]
(vi) Use your graph to obtain the equation of the line of best fit, giving it in the form
y = mx + c.
y = ....................................................[3]
(vii) (a) Comment on how well the line you have drawn fits the points you have plotted.
...........................................................................................................................................
.......................................................................................................................................[1]
(b) Explain why the variable labelled X has been considered to be the independent variable.
...........................................................................................................................................
.......................................................................................................................................[1]
(c) If the variable labelled Y had been the independent variable, state which four areas would
have been used to obtain the lower semi-average. (You are not required to calculate this
semi-average.)
...........................................................................................................................................
.......................................................................................................................................[1]
11 A company doctor wished to assess whether there was an association between the amount
of sleep employees achieved during nights prior to working days and their productivity. Fifty
employees were asked to record, to the nearest hour, the total number of hours they slept during
such nights in one particular week. The results are tabulated below.
31 36 37 48 38 34 37 36 44 36
38 37 38 29 42 36 40 26 37 34
39 38 37 41 33 38 38 36 37 39
36 37 35 37 36 37 37 43 36 43
36 37 36 32 38 35 37 38 41 40
The total number of hours slept by the fifty employees was 1857.
(i) Calculate, to 2 decimal places, the mean number of hours slept per employee.
....................................................[1]
The table below, from which some values have been omitted, summarises the data in the form of
a grouped frequency distribution.
Number of
Hours of sleep Class
employees
(to nearest hour) mid-points (x )
(frequency)
25 – 29 2
30 – 31
– 35 34 5
36 36
37 12
38
39 – 40 6
– 44 43
45 – 49 1
(ii) Insert in the table all the missing values of class limits, class mid-points and
frequencies. [5]
(iii) Estimate, to 2 decimal places, the mean of the data in your table.
....................................................[2]
(iv) Explain why the values for the mean number of hours slept which you have obtained in parts
(i) and (iii) are different.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
(v) Express the difference between the two means as a percentage of the true value, correct to 3
significant figures.
....................................................[2]
(vi) Using the data in your table, estimate, to 2 decimal places, the standard deviation of the
number of hours slept by these employees.
....................................................[4]
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2011
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Mathematical tables
Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (SLM) 44823
© UCLES 2011 [Turn over
2
1 In a biological experiment, a scientist measured the heights of a large number of plants. She
calculated the following statistical measures for the heights.
Median 12.3 cm
Lower quartile 7.9 cm
Upper quartile 16.5 cm
90th percentile 18.1 cm
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[1]
(ii) Using the above information, state whether you would expect the mean of this distribution
to be about the same as the median, smaller than the median, or larger than the median.
Explain briefly the reason for your answer.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
2 In a school class there are 10 boys and 20 girls. The teacher selects a sample of 6 pupils from the
class.
(i) In each case, write down the one word which best describes the method of sampling used
if she
....................................................[1]
(b) selects 2 of the boys and 4 of the girls at random from the class register,
....................................................[1]
(c) selects the first 3 boys and the first 3 girls who arrive for a lesson.
....................................................[1]
Having selected the sample, the teacher collects data from each pupil by asking three questions.
C How much time did you spend doing your homework yesterday?
(ii) In each case, state whether the data obtained from these questions is either qualitative or
quantitative, and is either discrete or continuous.
3 The diagram below shows the number of people at an international conference who speak one or
more of the languages Arabic, French and Urdu.
Arabic French
13 9 16
3
5 2
10
Urdu
(i) Arabic,
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[2]
4 For a particular type of canned food, the nominal mass of food in the can (the mass written on
the can) is 450 g. A sample of 8 cans was chosen, and the actual mass of food in each can was
measured accurately. The following results (in grams) were obtained.
(i) Using an assumed mean equal to the nominal mass, find the mean and standard deviation
of these values.
Mean = ........................................................
(ii) The empty cans each have a mass of 180.0 g. Write down the mean and standard deviation
of the total mass of these cans of food.
Mean = ........................................................
5 In a game, a turn involves throwing an unbiased six-sided dice with faces numbered 1 to 6, and, if
a 6 is thrown, a disc is chosen from a bag containing 1 black disc and 3 white discs.
Selections from the bag are made with replacement, and the person who chooses the black disc
wins the game.
(i) Carlo does not choose a disc from the bag on his first turn,
....................................................[1]
....................................................[2]
....................................................[3]
6 A team composed of equal numbers of men and women enters an athletics competition. In each
event it is possible to win a gold, silver or bronze medal. The team wins 25 medals, of which 3 are
gold, as shown in the following table.
(i) The team wins equal numbers of silver and bronze medals. Use this information to insert two
numbers into the table.
[1]
(ii) The women win one more medal than the men. Use this information to insert two more
numbers into the table.
[1]
(iii) The women win only one gold medal, but they win twice as many silver medals as bronze
medals. Use this information to complete the table.
[2]
(iv) The head coach has a method of assessing performance overall by counting 4 points for a
gold medal, 2 points for a silver medal, and 1 point for a bronze medal. Using this method,
and showing all your working, find who have performed better, the men or the women.
....................................................[2]
7 In this question give your answers either as fractions, or as decimals correct to 3 significant
figures.
In a hotel there are 25 rooms. The following table shows the number of guests staying in these
rooms on one particular night.
Number of guests 0 1 2 3 4
Number of rooms 3 4 12 5 1
(i) Find the total number of guests staying in the hotel on this night.
....................................................[3]
(ii) The hotel manager chooses a room at random to check the standard of cleaning. Find the
probability that
....................................................[1]
....................................................[2]
(iii) The manager chooses a guest at random to ask if they like their room. Find the probability
that this guest is staying in a room
....................................................[3]
....................................................[3]
(iv) The manager chooses two guests at random. Find the probability that they are staying in the
same room.
....................................................[4]
8 A travel company collected data on the ages, x years, of the people who booked holidays with the
company. The results were formed into the following grouped frequency distribution.
(i) Estimate, to 3 significant figures, the mean and the standard deviation of the ages of the
people who booked holidays with the company.
Mean = ........................................................
(ii) On the following grid draw a histogram to illustrate the data in the table above. The rectangle
representing the 35 – under 40 class has already been drawn for you.
20
16
Number 12
of people
per 5
years 8
0
0 10 20 30 40 50 60 70 80 90
Age (years)
[5]
(iii) On the same grid, draw a frequency polygon to represent the data. [1]
(iv) One year later the company repeated the research and found that the mean and standard
deviation of the ages were now 40.7 years and 16.1 years respectively. Explain briefly what
these values tell you about how the ages of people booking holidays had changed.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
9 A large company recorded the time spent travelling in one particular week by the 40 people in its
sales team. These times are illustrated in the cumulative frequency curve below.
40
30
Cumulative
frequency 20
(salespeople)
10
0
0 5 10 15 20 25 30
Travelling time (hours)
(ii) Use the graph to estimate the number of salespeople who spent more than 12.5 hours
travelling.
....................................................[3]
Because of the expense, the company wanted to reduce the time spent by its salespeople in
travelling. The sales team were given a training course on using the internet to improve sales
techniques. The table below shows the distribution of the team’s travelling times in a subsequent
week after the training.
(iii) Enter the cumulative frequencies for this distribution in the table. [1]
(iv) Draw the cumulative frequency curve for this distribution on the same grid as the given
cumulative frequency curve. [2]
(v) Use your graph to estimate the percentage reduction in the number of salespeople who spent
more than 12.5 hours travelling after the training. Give your answer to 2 significant figures.
....................................................[3]
10 Abdul is considering buying a particular model of new car. To see how well the car will keep its
value, he collects information on the price of old cars of this model. The information is given in the
following table.
Age of car (years), x 1.5 3.0 3.5 4.5 6.25 7.0 8.75 10.0
Price (thousands of $), y 13.2 11.8 10.5 9.4 7.0 6.8 4.4 2.6
16
14
12
10
Price
(thousands 8
of $)
6
0 x
0 2 4 6 8 10 12
Age of car (years)
[3]
(ii) Calculate the overall mean and the two semi-averages of the data, and plot them on your
graph.
[5]
(iii) Use your plotted averages to draw a line of best fit, and find its equation in the form y = mx + c.
....................................................[4]
(iv) Use your equation to estimate the price of a car of this model which is 5 years old.
$ ....................................................[2]
(v) Interpret the point at which the line of best fit meets the y-axis.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
Abdul’s friend Bruno also wants to buy a new car, but of a different model. Bruno does a similar
investigation on his chosen model of car, and obtains a line of best fit with equation y = 0.93x + 16.2.
(vi) Explain briefly why Bruno has probably made a mistake in his calculation.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
11 In this question calculate all death rates per thousand. Where values do not work out exactly
give your answers to two decimal places.
The table below gives information on the population and deaths in the town of Bosco for the year
2010, together with the standard population of the area in which Bosco is situated.
............................................................[4]
(ii) Calculate the death rate for each age group, and insert the values in the table above.
[2]
(iii) Use your results from part (ii) to calculate the standardised death rate for Bosco.
............................................................[4]
Riva and Techno are two other towns in the same area. The table below gives data for these towns,
also for 2010.
(iv) Showing all your working, find which of the three towns experienced the largest number of
total deaths in 2010.
....................................................[3]
(v) State, giving a reason, which of the three towns appears to have the healthiest environment.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
(vi) Considering your answers to parts (iv) and (v) together, explain briefly what has caused this
situation to occur.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/11
Paper 1 October/November 2011
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Mathematical tables
Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use a soft pencil for any diagrams or graphs.
Do not use staples, paper clips, highlighters, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (NH/DJ) 34282/4
© UCLES 2011 [Turn over
2
1 In a biological experiment, a scientist measured the heights of a large number of plants. She
calculated the following statistical measures for the heights.
Median 12.3 cm
Lower quartile 7.9 cm
Upper quartile 16.5 cm
90th percentile 18.1 cm
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[1]
(ii) Using the above information, state whether you would expect the mean of this distribution
to be about the same as the median, smaller than the median, or larger than the median.
Explain briefly the reason for your answer.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
2 In a school class there are 10 boys and 20 girls. The teacher selects a sample of 6 pupils from the
class.
(i) In each case, write down the one word which best describes the method of sampling used
if she
....................................................[1]
(b) selects 2 of the boys and 4 of the girls at random from the class register,
....................................................[1]
(c) selects the first 3 boys and the first 3 girls who arrive for a lesson.
....................................................[1]
Having selected the sample, the teacher collects data from each pupil by asking three questions.
C How much time did you spend doing your homework yesterday?
(ii) In each case, state whether the data obtained from these questions is either qualitative or
quantitative, and is either discrete or continuous.
3 The diagram below shows the number of people at an international conference who speak one or
more of the languages Arabic, French and Urdu.
Arabic French
13 9 16
3
5 2
10
Urdu
(i) Arabic,
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[2]
4 For a particular type of canned food, the nominal mass of food in the can (the mass written on
the can) is 450 g. A sample of 8 cans was chosen, and the actual mass of food in each can was
measured accurately. The following results (in grams) were obtained.
(i) Using an assumed mean equal to the nominal mass, find the mean and standard deviation
of these values.
Mean = ........................................................
(ii) The empty cans each have a mass of 180.0 g. Write down the mean and standard deviation
of the total mass of these cans of food.
Mean = ........................................................
5 In a game, a turn involves throwing an unbiased six-sided dice with faces numbered 1 to 6, and, if
a 6 is thrown, a disc is chosen from a bag containing 1 black disc and 3 white discs.
Selections from the bag are made with replacement, and the person who chooses the black disc
wins the game.
(i) Carlo does not choose a disc from the bag on his first turn,
....................................................[1]
....................................................[2]
....................................................[3]
6 A team composed of equal numbers of men and women enters an athletics competition. In each
event it is possible to win a gold, silver or bronze medal. The team wins 25 medals, of which 3 are
gold, as shown in the following table.
(i) The team wins equal numbers of silver and bronze medals. Use this information to insert two
numbers into the table.
[1]
(ii) The women win one more medal than the men. Use this information to insert two more
numbers into the table.
[1]
(iii) The women win only one gold medal, but they win twice as many silver medals as bronze
medals. Use this information to complete the table.
[2]
(iv) The head coach has a method of assessing performance overall by counting 4 points for a
gold medal, 2 points for a silver medal, and 1 point for a bronze medal. Using this method,
and showing all your working, find who have performed better, the men or the women.
....................................................[2]
7 In this question give your answers either as fractions, or as decimals correct to 3 significant
figures.
In a hotel there are 25 rooms. The following table shows the number of guests staying in these
rooms on one particular night.
Number of guests 0 1 2 3 4
Number of rooms 3 4 12 5 1
(i) Find the total number of guests staying in the hotel on this night.
....................................................[3]
(ii) The hotel manager chooses a room at random to check the standard of cleaning. Find the
probability that
....................................................[1]
....................................................[2]
(iii) The manager chooses a guest at random to ask if they like their room. Find the probability
that this guest is staying in a room
....................................................[3]
....................................................[3]
(iv) The manager chooses two guests at random. Find the probability that they are staying in the
same room.
....................................................[4]
8 A travel company collected data on the ages, x years, of the people who booked holidays with the
company. The results were formed into the following grouped frequency distribution.
(i) Estimate, to 3 significant figures, the mean and the standard deviation of the ages of the
people who booked holidays with the company.
Mean = ........................................................
(ii) On the following grid draw a histogram to illustrate the data in the table above. The rectangle
representing the 35 – under 40 class has already been drawn for you.
20
16
Number 12
of people
per 5
years 8
0
0 10 20 30 40 50 60 70 80 90
Age (years)
[5]
(iii) On the same grid, draw a frequency polygon to represent the data. [1]
(iv) One year later the company repeated the research and found that the mean and standard
deviation of the ages were now 40.7 years and 16.1 years respectively. Explain briefly what
these values tell you about how the ages of people booking holidays had changed.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
9 A large company recorded the time spent travelling in one particular week by the 40 people in its
sales team. These times are illustrated in the cumulative frequency curve below.
40
30
Cumulative
frequency 20
(salespeople)
10
0
0 5 10 15 20 25 30
Travelling time (hours)
(ii) Use the graph to estimate the number of salespeople who spent more than 12.5 hours
travelling.
....................................................[3]
Because of the expense, the company wanted to reduce the time spent by its salespeople in
travelling. The sales team were given a training course on using the internet to improve sales
techniques. The table below shows the distribution of the team’s travelling times in a subsequent
week after the training.
(iii) Enter the cumulative frequencies for this distribution in the table. [1]
(iv) Draw the cumulative frequency curve for this distribution on the same grid as the given
cumulative frequency curve. [2]
(v) Use your graph to estimate the percentage reduction in the number of salespeople who spent
more than 12.5 hours travelling after the training. Give your answer to 2 significant figures.
....................................................[3]
10 Abdul is considering buying a particular model of new car. To see how well the car will keep its
value, he collects information on the price of old cars of this model. The information is given in the
following table.
Age of car (years), x 1.5 3.0 3.5 4.5 6.25 7.0 8.75 10.0
Price (thousands of $), y 13.2 11.8 10.5 9.4 7.0 6.8 4.4 2.6
16
14
12
10
Price
(thousands 8
of $)
6
0 x
0 2 4 6 8 10 12
Age of car (years)
[3]
(ii) Calculate the overall mean and the two semi-averages of the data, and plot them on your
graph.
[5]
(iii) Use your plotted averages to draw a line of best fit, and find its equation in the form y = mx + c.
....................................................[4]
(iv) Use your equation to estimate the price of a car of this model which is 5 years old.
$ ....................................................[2]
(v) Interpret the point at which the line of best fit meets the y-axis.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
Abdul’s friend Bruno also wants to buy a new car, but of a different model. Bruno does a similar
investigation on his chosen model of car, and obtains a line of best fit with equation y = 0.93x + 16.2.
(vi) Explain briefly why Bruno has probably made a mistake in his calculation.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
11 In this question calculate all death rates per thousand. Where values do not work out exactly
give your answers to two decimal places.
The table below gives information on the population and deaths in the town of Bosco for the year
2010, together with the standard population of the area in which Bosco is situated.
............................................................[4]
(ii) Calculate the death rate for each age group, and insert the values in the table above.
[2]
(iii) Use your results from part (ii) to calculate the standardised death rate for Bosco.
............................................................[4]
Riva and Techno are two other towns in the same area. The table below gives data for these towns,
also for 2010.
(iv) Showing all your working, find which of the three towns experienced the largest number of
total deaths in 2010.
....................................................[3]
(v) State, giving a reason, which of the three towns appears to have the healthiest environment.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
(vi) Considering your answers to parts (iv) and (v) together, explain briefly what has caused this
situation to occur.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
University of Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of
Cambridge Local Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2014
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (LK/SLM) 102872/4 R
© UCLES 2014 [Turn over
2
1 In an industrial process, readings, x, of a particular gauge are recorded regularly. For 6 such
recorded readings it is found that Σx = 279 and Σx2 = 13 093.
Mean = .......................................................
It was discovered later that one of the readings had been incorrectly recorded as 43, when in fact
the correct reading was 34.
(ii) State, for each of the mean and standard deviation, whether its correct value will be smaller
than, larger than, or the same as the value found in part (i).
Mean .......................................................
2 A student calculated, correctly, five statistical measures for a set of data. The five values he
obtained were, in ascending order, 6, 36, 43, 48 and 53.
(i) Insert these values in their correct positions in the table below.
(ii) State the value of the 75th percentile for the student’s original set of data.
....................................................[1]
3 A national government plans a survey to obtain the responses of its citizens to its proposal to build
wind farms as sources of renewable energy.
B Face to face interviews will be conducted with a total of 1000 citizens in shopping centres
in different parts of the country.
D Telephone calls will be made to 1000 citizens chosen from the telephone directory.
...........................................................................................................................................
.......................................................................................................................................[1]
...........................................................................................................................................
.......................................................................................................................................[1]
Advantage ..........................................................................................................................
...........................................................................................................................................
Disadvantage .....................................................................................................................
.......................................................................................................................................[2]
Yes
No
...........................................................................................................................................
.......................................................................................................................................[1]
(b) Write down one open question which could be asked in the survey.
...........................................................................................................................................
...........................................................................................................................................
.......................................................................................................................................[1]
© UCLES 2014 4040/12/O/N/14
5
4 The following diagram is to show the number of patients at a medical centre who have received
vaccine against one or more of the diseases polio, cholera and typhoid.
Polio
24
30
17
Cholera Typhoid
(a) The number of patients who have received only cholera vaccine is 5 fewer than the
number of patients who have received only polio vaccine. [1]
(b) The number of patients who have received only typhoid vaccine is two thirds of the
number of patients who have received polio and cholera vaccines but not typhoid vaccine.
[1]
(c) The number of patients who have received polio and typhoid vaccines but not cholera
vaccine is the same as the number of patients who have received all three vaccines. [1]
(d) Twice as many patients have received typhoid and cholera vaccines but not polio vaccine
as have received all three vaccines. [1]
(ii) Find the mode of the number of these vaccines received by these patients.
....................................................[2]
5 The table below gives information on the gender of, and number of books written by, 40 authors
attending a book fair.
(i) a male,
....................................................[1]
....................................................[1]
(iii) an author who has written 6 – 10 books, given that the author is male.
....................................................[1]
(iv) Find the probability that both have written 5 or fewer books.
....................................................[3]
6 A police camera at the side of a road measures the speed, in km/h, of every vehicle travelling on
the road.
The following histogram represents the information it recorded over a certain period of time.
50
40
30
Number of
vehicles
per 10 km/h
20
10
0
0 20 40 60 80 100 120
Vehicle speed (km/h)
Use the histogram to find, for this period of time, the number of vehicles whose speeds were
....................................................[2]
....................................................[2]
....................................................[1]
The speed limit on this road is 100 km/h. Any driver of a vehicle travelling at a speed which is
5 km/h or more greater than the speed limit must pay a fine.
(iv) Estimate the number of drivers represented by this information who had to pay a fine.
....................................................[1]
7 In this question calculate all accident rates per thousand. Where values do not work out
exactly give your answers to one decimal place.
The table below gives information on the number of employees, and the number of accidents
they suffered, at a building construction company, Kwikbuild, in the year 2012. It also shows the
standard population for the building construction industry.
....................................................[4]
(ii) Calculate the accident rate for each job group and insert the values in the table above.
[2]
(iii) Use your results from part (ii) to calculate the standardised accident rate for Kwikbuild.
....................................................[4]
Fastbuild is another building construction company. In 2012 its crude and standardised accident
rates were 109.4 and 98.7 per thousand respectively.
(iv) State, with a reason, which of the two companies most likely operates in the safer environment.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
For each company some of the accidents suffered by employees were classed as ‘serious’, and
they all occurred in the Site Labour job group.
The table below gives information on the serious accidents suffered at the two companies.
(v) Calculate, for each company, for the Site Labour job group only, the serious accident rate, and
hence state the company where an employee is less likely to suffer a serious accident.
....................................................[2]
(vi) State, with a reason, whether the values you have calculated in part (v) are crude or
standardised rates.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
8 A running club holds a cross-country race. Competitors enter in either the junior or senior age
category. When they enter, they also choose to follow one of three routes: easy, moderate or
challenging. Information about the number of competitors and the routes chosen is shown below.
= 10 junior competitors
= 10 senior competitors
10%
25%
Easy Easy
45%
Moderate Moderate
35% 55%
Challenging Challenging
30%
(i) Find the total number of competitors who entered the race.
....................................................[1]
(ii) Show that there were 42 junior competitors who chose the moderate route.
[1]
(iii) Find the number of senior competitors who chose the easy route.
....................................................[2]
Not all the entrants completed the race. The times taken by those who did complete the race are
shown in the table below.
Number of competitors
Completion time
Junior Senior
(minutes)
Easy Moderate Challenging Easy Moderate Challenging
60 – under 90 16 4 0 19 3 2
90 – under 120 29 12 1 32 15 5
120 – under 150 18 15 3 20 17 14
150 – under 180 2 8 6 0 11 16
TOTAL 65 39 10 71 46 37
(iv) Find the number of competitors who entered the race but did not complete it.
....................................................[3]
(v) Estimate, to the nearest minute, the mean time taken by senior competitors who completed
the challenging route.
....................................................[3]
(vi) Of the junior competitors who completed the race in 2 hours or more, find the percentage who
had chosen the challenging route.
....................................................[3]
(vii) Of all the senior competitors who had chosen the moderate route, find the percentage who
completed the race in under 2 hours.
....................................................[3]
9 One way to determine if an adult has a healthy weight, independent of age and gender, is to
measure their body mass index, BMI (a continuous variable).
The BMI values for the adult population of a particular country in the years 1980 and 2010 are
summarised in the cumulative frequency polygons below.
100
90
1980 2010
80
70
Cumulative 60
frequency
(% of adult 50
population)
40
30
20
10
0
0 15 20 25 30 35 40
BMI
Use these graphs to answer the following questions on the adult population of this country.
(i) Estimate
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[1]
An adult’s weight is classified as ‘healthy’ if their BMI value is between 18.5 and 25.
(ii) Estimate the percentage of the adult population whose weights were classified as ‘healthy’
(a) in 1980,
....................................................[2]
(b) in 2010.
....................................................[1]
(iii) Estimate the median BMI value of the ‘overweight’ adult population in 2010.
....................................................[3]
(iv) Estimate the percentage of the adult population in 2010 who were ‘obese’.
....................................................[4]
(v) By referring to any of the values you have estimated in parts (i), (ii), and (iv), comment on
how the health of the adult population of this country, assessed in terms of its weight, changed
between 1980 and 2010.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
10 Baruti’s teacher has suggested that pupils who enjoy studying a subject are likely to perform well
in tests in the subject.
To investigate this, Baruti asked his friends to rate their enjoyment of Statistics on a linear scale
from 0 (dislike very much) to 5 (like very much), then recorded their scores on the next class test.
His results are shown in the following table.
Friend A B C D E F G H
Enjoyment
3 2 5 4 1 4 5 2
rating, x
Test score
57 47 78 59 26 86 53 34
(%), y
y
100
80
60
Test
score
(%)
40
20
0 x
0 1 2 3 4 5
Enjoyment rating
[2]
(ii) Explain briefly why the points (5, 78) and (4, 59) are not used if the lower semi-average is
calculated.
...................................................................................................................................................
...............................................................................................................................................[1]
(iii) Calculate the two semi-averages and the overall mean of the data, and plot them on your
graph.
[5]
(iv) Use your plotted averages to draw a line of best fit, and find its equation in the form y = mx + c.
....................................................[4]
Another friend, who had rated his enjoyment of Statistics at 3, missed the test through illness.
(v) Use the line you have drawn in part (iv) to estimate the score this friend would have obtained
if he had taken the test.
....................................................[1]
(vi) State, with a reason, in which of the subjects Statistics, English and Science a pupil’s test
score is most affected by their enjoyment rating.
...................................................................................................................................................
...............................................................................................................................................[2]
(vii) Explain briefly why Baruti may have experienced difficulty in deciding which of his two
variables should be the independent and which the dependent.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[1]
11 At a restaurant it is known from experience that 10% of the customers order an omelette.
Assume that customers make choices independently of each other and that nobody orders more
than one omelette.
(a) no customers,
....................................................[2]
....................................................[2]
The restaurant serves small omelettes and large omelettes. It is known from experience that 60%
of those ordered are small and 40% are large.
Find the probability that at this table only one customer orders an omelette and it is a large
omelette.
....................................................[4]
Small omelettes are made with 2 eggs and large omelettes with 3 eggs. The chef has a special
store of high quality eggs which are used only for making omelettes.
Find the probability that, in preparing the food for this table, from his special store the chef
uses
....................................................[3]
....................................................[5]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
4040 STATISTICS
4040/12 Paper 1, maximum raw mark 100
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of
the examination. It shows the basis on which Examiners were instructed to award marks. It does not
indicate the details of the discussions that took place at an Examiners’ meeting before marking began,
which would have considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner
Report for Teachers.
Cambridge will not enter into discussions about these mark schemes.
Cambridge is publishing the mark schemes for the October/November 2014 series for
most Cambridge IGCSE®, Cambridge International A and AS Level components and some
Cambridge O Level components.
if zero scored allow SC1 for their LQ, their med, their UQ in ascending order
(ii) their UQ B1
(iv) (17/40) × M1
(their 17 – 1)/(their 40 – 1) M1
272/1560 or 136/780 or 68/390 or 34/195 or 0.174 or 0.17 A1
(iii) 40 A1
(iv) 6 A1
7 (i) 1 + 8 + 3 + 37 (=49) M1
25 + 167 + 40 + 228 (=460) M1
(their 49/their 460) × 1000 M1
106.5 A1
(iii) any one job group rate multiplied by standard population figure M1
sum of four such products M1
(40 × 0.08) + (47.9 × 0.35) + (75 × 0.12) + (162.3 × 0.45) A1
102 or 102.0 A1
8 (i) 280 B1
(v) attempted use of class mid points (75, 105, 135, 165) M1*
correct method for mean (Σfx = 5205) M1dep
141 A1
(b) 26.2–26.5 B1
(c) 21.2–21.5 B1
(d) 29.5–29.8 B1
(ii) (a) attempt to read cf% values for BMI = 18.5 and 25 and subtract
on either graph (65 – 7) (40 – 4) M1
57(%)–59(%) A1
(b) 36(%) A1
support strengthened by
reference to more than one of these changes
or citation of specific values for any change B1
(iv) straight line through at least two of their plotted points in (iii) B1
correct method for gradient, m, of their line M1
correct method for c M1
m = 11.0 – 11.4 and c = 18 – 19 A1
(v) 52 B1
(vii) difficult to know if pupils perform well because they like a subject,
or they like a subject because they perform well in it B1
STATISTICS 4040/13
Paper 1 October/November 2014
2 hours 15 minutes
Candidates answer on the question paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or corrections fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (RW/SLM) 83695/3
© UCLES 2014 [Turn over
2
7 8 16 10 20 5 8 9 8 2 26 9 15 .
Three different measures of central tendency (average) of these numbers are 8, 9 and 11.
Complete the following table by giving, for each of these three measures, its name and a brief
explanation of how its value has been obtained.
..................................................................................................
8 ............................ ..................................................................................................
..................................................................................................
..................................................................................................
9 ............................ ..................................................................................................
..................................................................................................
..................................................................................................
11 ............................ ..................................................................................................
..................................................................................................
[6]
50
40
30
Cumulative
frequency
20
10
0
0 1 2 3 4 5 6 7 8
X
...................................................................................................................................................
...............................................................................................................................................[2]
(ii) State for which of the integer values shown in the graph the frequency of X is 0.
....................................................[2]
x 0 1 2 3 4 5 6 7
Frequency
[2]
3 (a) The population of a town is tabulated in different age groups. A research organisation wishes
to interview, from the population, a sample which represents it in terms of age. It proposes to
do this using either stratified random sampling or quota sampling.
State one way in which the use of these sampling methods would be similar, and one way in
which it would be different.
...................................................................................................................................................
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
(b) It is wished to obtain an estimate of the mean number of words on each page of a book. For
each of the following methods state, with a reason, whether a sample obtained using it would
be likely to be biased or unbiased:
(i) counting the number of words on the last page of each chapter;
...........................................................................................................................................
...........................................................................................................................................
.......................................................................................................................................[2]
...........................................................................................................................................
...........................................................................................................................................
.......................................................................................................................................[2]
4 The table below summarises the lengths, in millimetres, of a random sample of 50 leaves taken
from a bush.
(i) Calculate the cumulative frequencies and insert them in the table. [1]
(ii) Plot the cumulative frequencies on the grid below and draw a smooth curve through the
plotted points. [2]
50
40
30
Cumulative
frequency
20
10
0
0 28 30 32 34 36 38 40 42
Length (mm)
.............................................mm [1]
(b) the percentage of leaves that have a length greater than 37.2 mm.
....................................................[2]
© UCLES 2014 4040/13/O/N/14 [Turn over
6
5 A company which produces different sizes of sawn wood wishes to display information about the
amount of sawn wood it produces in each of two consecutive years.
(i) State one advantage and one disadvantage of using a dual bar chart, as opposed to a
percentage bar chart, to illustrate the amount produced in the two years.
Advantage..................................................................................................................................
...................................................................................................................................................
Disadvantage .............................................................................................................................
...............................................................................................................................................[2]
(ii) Name a quantity which neither a dual bar chart nor a percentage bar chart would show.
...............................................................................................................................................[1]
(iii) State the names of two types of diagram which will give a relative indication of both the
amount of different sizes of sawn wood and the total amount of sawn wood produced in each
year.
........................................................
....................................................[2]
(iv) State the name of the type of diagram which will give a direct indication of the differences in
the total amount of sawn wood produced from one year to the next.
....................................................[1]
6 Three of the official languages of Switzerland are French, German and Romansh. The diagram
below illustrates which of these languages are spoken by a random sample of 70 Swiss citizens.
French German
17 8 23
0
3 2
12
Romansh
(i) Find the value which should be written inside the box but outside the circles in order to
complete the diagram.
....................................................[2]
...................................................................................................................................................
...............................................................................................................................................[1]
(iii) State, with a reason, in each of the following cases, whether the value 0 would be changed if
the person described learned to speak Romansh.
(a) One of the people denoted in the diagram by the value 17.
...........................................................................................................................................
.......................................................................................................................................[1]
...........................................................................................................................................
.......................................................................................................................................[1]
(c) One of the people denoted in the diagram by your answer to part (i).
...........................................................................................................................................
.......................................................................................................................................[1]
© UCLES 2014 4040/13/O/N/14 [Turn over
8
7 (a) Mr Hassan can travel to work by either car or train. The probability that on any day he travels
by train is 47. If he travels by car the probability that he will be late for work is 19, but by train it is 15.
Calculate the probability that on any randomly chosen day he is not late for work.
....................................................[4]
(b) Three children are to be chosen at random from a group of seven, consisting of four boys,
Ian, James, Michael and Nathan, and three girls, Karen, Lucy and Olive.
(i) Calculate the probability that Ian, Lucy and Nathan are the three chosen.
....................................................[2]
(ii) Calculate the probability that the brother and sister will both be among the three chosen.
....................................................[3]
(c) Sammy and Pekos each have a bag containing a number of blue balls and white balls. Each
selects one ball from his bag at random. If the selected balls are of the same colour, Sammy
puts them both in his bag; if they are of different colours, Pekos puts them both in his bag.
Originally, Sammy’s bag contains 2 blue and 6 white balls, while Pekos’ bag contains 3 blue
and 5 white balls.
(i) Calculate the probability that both selected balls are of the same colour.
....................................................[3]
(ii) If, on the first selection, the balls were of the same colour (so both were put in Sammy’s
bag before a second selection), calculate the probability that on the second selection the
balls are of different colours.
....................................................[4]
8 The following table summarises the times, x minutes, which the visitors to an art gallery during
one day spent in the gallery. The first row of the table gives the column numbers.
(i) Insert in column (3) of the table the mid-points, m, of each of the classes.
[1]
m − 45
(ii) Values of a variable, y, are given by y= .
2.5
Calculate the value of y for each class and insert the values in column (4) of the table.
[2]
(iii) For each class, calculate the value of the product fy, and insert the values in column (5) of the
table. [1]
(iv) For each class, calculate the value of fy 2, and insert the values in column (6) of the table.
[1]
(v) Calculate Σfy and Σfy 2 and insert their values in the table. [1]
....................................................[2]
....................................................[2]
....................................................[2]
....................................................[2]
(ix) Comment on whether or not, for these data, the interquartile range would be a more
appropriate measure of dispersion than the standard deviation.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
9 A bakery kept a record of the diameters, d centimetres, of the cakes it produced during one week.
The results are summarised in the histogram below.
50
40
Number 30
of cakes
per cm of
diameter 20
10
0
0 15 16 17 18 19 20 21 22 23
Diameter (cm)
(i) Use the histogram to complete the following grouped frequency table for d.
(ii) Use the frequencies you have obtained to produce a simpler grouped frequency distribution,
having four classes of equal width between 15 cm and 23 cm, and present your distribution in
a table.
[3]
(iii) On the grid below illustrate your simpler grouped frequency distribution by a histogram.
[3]
(iv) Use the histogram you have drawn in part (iii) to estimate the modal diameter.
.............................................. cm [2]
(v) Cakes with a diameter between 16.5 cm and 22 cm can be sold in the bakery’s shop. Find the
percentage of this week’s cakes which can be sold in the shop.
....................................................[4]
© UCLES 2014 4040/13/O/N/14 [Turn over
14
10 In this question calculate all death rates per thousand, and to 2 decimal places.
The first table below gives certain information about the population and deaths in a town, Eastbury,
for the year 2012, together with the standard population of the area in which Eastbury is situated.
Population in Standard
Age group Deaths
age group population (%)
0 – 14 25 4500 20
15 – 34 x 7000 35
35 – 59 47 6000 25
60 and over 83 7000 20
(i) The death rate for the 15 – 34 age group is 3.00 per thousand.
Show that x = 21.
[1]
....................................................[4]
(iii) Calculate the death rates for the other three age groups.
(iv) Using the given rate for the 15 – 34 age group, and the rates you have calculated in part (iii),
calculate the standardised death rate for Eastbury.
....................................................[4]
The table below gives information about Westville, another town in the same area, for the year
2012.
The crude death rate for Westville in 2012 was 6.62 per thousand.
(v) Calculate the standardised death rate for Westville, using the same standard population as
for Eastbury.
....................................................[2]
One of the two towns has a higher crude death rate, but the other has a higher standardised death
rate.
...................................................................................................................................................
...............................................................................................................................................[1]
(vii) State, with a reason, which of the two towns would appear to have the healthier environment.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
© UCLES 2014 4040/13/O/N/14 [Turn over
16
11 Three trainee technicians, A, B and C, carried out laboratory trials to examine the effect of
temperature, x, in °C, on the yield, y, in kg, of an industrial process. The following table shows the
results obtained by each technician.
Technician A B C A B C A B C A B C
Temperature, x (°C) 10 15 20 25 30 35 40 45 50 55 60 65
Yield, y (kg) 80 106 75 90 117 118 97 127 80 109 140 115
(i) Plot the points representing these results on the grid below and label each point A, B or C
according to which technician carried out the trial.
140
130
120
110
100
Yield
(kg)
90
80
70
60
0
0 10 20 30 40 50 60 70
Temperature (°C)
[3]
[3]
(iii) Plot the semi-averages and use the three plotted averages to draw the line of best fit. [3]
It is known that over this range of temperatures the relationship between yield and temperature is
approximately linear.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
An experienced and reliable technician carried out a trial at a temperature of 40°C and obtained a
yield of 125 kg.
(vi) What might this extra information tell you about the performance of the trainees?
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
(vii) Use the extra information to draw, by eye, a revised line of best fit. [1]
(viii) Use this revised line of best fit to estimate the yield for a temperature of 52°C.
............................................... kg [1]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
4040 STATISTICS
4040/13 Paper 1, maximum raw mark 100
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of
the examination. It shows the basis on which Examiners were instructed to award marks. It does not
indicate the details of the discussions that took place at an Examiners’ meeting before marking began,
which would have considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner
Report for Teachers.
Cambridge will not enter into discussions about these mark schemes.
Cambridge is publishing the mark schemes for the October/November 2014 series for
most Cambridge IGCSE®, Cambridge International A and AS Level components and some
Cambridge O Level components.
1 8 is the mode M1
The value which occurs most frequently. A1
9 is the median M1
Obtained by arranging the values in ascending or descending order and
selecting the 'middle' one. A1
(iii)
x 0 1 2 3 4 5 6 7
Frequency 0 5 15 10 0 7 6 7
3 (a) Similar in that both would sample proportionately from the different age groups. B1
In stratified sampling interviewers would be given a list of specific people to
interview, in quota sampling the interviewer selects the individuals. B1
(b) (i) Because the last page of a chapter is less likely than all other others to be
filled with words, B1
the sample is likely to be biased. B1
(iii) (a) Correct reading from graph of a point between cum. freqs. 12 and 13 B1
(b) Clear attempt to use appropriate point on the graph and any valid method
to find the required percentage. M1
14%–16% A1
6 (i) Attempt to sum the values in the diagram and subtract the total from 70. M1
5 A1
(ii) None of the people in the sample speak all three languages. B1
(iii) (a) No, because this person will still only speak two languages. B1
(b) Yes, because the person now speaks all three languages. B1
(c) No, as this person only speaks one of the three languages. B1
(ii) Any appreciation of the fact that it is irrelevant which two are the
brother and sister. B1
(c) (i) Clear attempt at both two blue and two white M1
(2/8 × 3/8) + (6/8 × 5/8) A1
9/16 A1
(ii) Given first balls were the same colour, P(both were blue) = 1/6,
P(both were white) = 5/6 B1
Attempt to add probabilities relating to whether first balls were blue or white M1
8
(1) (2) (3) (4) (5) (6)
Time (x)
Frequency (f) Mid-pts (m) y fy fy²
(minutes)
40 – under 50 40 45 0 0 0
(iii) (Deaths/Population) × 1000 seen for any age group (or can be implied
by one correct result) M1
5.56 7.83 11.86 all correct A1
(iv) Rate × SP% seen for any age group (or can be implied by one correct result) M1
(vi) Any valid comment relating to the towns having different age structures B1
(viii) 135 kg, with clear indication value found from use of the revised line B1
STATISTICS 4040/12
Paper 1 October/November 2015
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (ST/FD) 100433/3
© UCLES 2015 [Turn over
2
1 Four types of sample which may be obtained from a population by a researcher are:
simple random, stratified, quota, and systematic.
(i) the individual items are selected at regular intervals from a sampling frame,
....................................................[1]
(ii) the choice of which individual items are selected is left to the researcher,
....................................................[1]
(iii) the sample is selected so that the proportions of different categories in the sample correspond
with those of the population.
....................................................[1]
x 1 2 3 4 5 6 or more
Frequency, f 3 6 4 5 6 7
....................................................[1]
(ii) name, but do not find, a measure of central tendency which can be found exactly,
....................................................[1]
....................................................[1]
(iv) name, and find, a measure of dispersion which can be found exactly.
Name .......................................................
....................................................[3]
3 Randa’s friend Sonia claims that, of the two drinks tea and coffee, males generally prefer tea,
whilst females generally prefer coffee. To investigate this Randa asks her friends and relatives
their preferences.
She records whether the person asked is male (M) or female (F), and whether they prefer tea (T)
or coffee (C), or express no preference (X).
Her raw data is as follows:
FC MT FC MX MC FC FT MC FC FC
FX MC FT FC MC FT FC MT FX MC
For example, the first person asked was female and preferred coffee.
[4]
(ii) Explain whether or not Randa’s survey has shown Sonia’s claim to be correct.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
4 A man owns three shops in a town. For a trial period of 12 days he offers for sale in each shop a
new type of chocolate bar.
He records the number of these bars sold each day in each shop over this period, and calculates
the measures shown in the table below for the daily sales in each shop.
....................................................[1]
....................................................[1]
....................................................[1]
(ii) Find the total number of these chocolate bars sold in all three shops combined over this
period.
....................................................[3]
5 In a game, a turn consists of throwing three unbiased six-sided dice, each with faces numbered
1, 2, 3, 4, 5 and 6. The score for a turn is the sum of any numbers which appear more than once.
For example, if 5, 2, 5 appear, the score is 10; if 5, 2, 1 appear, the score is zero.
(i) Write down four integers between 0 and 18 which it is impossible to score in one turn.
....................................................[2]
....................................................[5]
6 At a town centre car park the electronic barrier records the length of stay of all vehicles parked
there. When a vehicle leaves the car park, it is not allowed to return the same day.
The lengths of stay for the 120 vehicles using the car park on one particular day are summarised
in the graph below.
120
100
80
Cumulative
frequency 60
(vehicles)
40
20
0
0 2 4 6 8 10
Length of stay (hours)
...............................................................................................................................................[1]
....................................................[1]
....................................................[2]
For the first two hours parking is free. For stays lasting from 2 hours up to 5 hours the charge is $6,
from 5 hours up to 8 hours it is $9, and from 8 hours up to 10 hours it is $12.
(iv) Use the graph to estimate the total amount paid in parking charges on this particular day.
....................................................[4]
7 In a school science experiment, a beaker of hot water is allowed to cool, and its temperature is
measured every 5 minutes. The results are shown in the following table.
Time, x (minutes) 0 5 10 15 20 25
Temperature, y (°C) 96 76 61 49 42 38
(i) Calculate the overall mean and the two semi-averages of these data.
........................................................
........................................................
....................................................[5]
(ii) Use the values obtained in part (i) to find the equation of the line of best fit to these data in the
form y = mx + c.
....................................................[3]
(iii) Use your equation to estimate the temperature of the water after 30 minutes, giving your
answer to the nearest degree.
....................................................[2]
(iv) On the grid below, plot the data given in the table at the start of the question.
\
100
80
60
Temperature
(°C)
40
20
0 [
0 5 10 15 20 25 30
Time (minutes)
[2]
(v) Draw on the grid the line whose equation you found in part (ii) for times between 0 and
30 minutes.
[2]
(vi) By inspecting the points plotted, explain briefly why it can be considered that it was
inappropriate to find a line of best fit in the form y = mx + c in this case.
...................................................................................................................................................
...............................................................................................................................................[1]
(vii) State how the actual water temperature after 30 minutes will compare with the value calculated
in part (iii).
...............................................................................................................................................[1]
8 At a college, students are enrolled into one of four departments: Arts, Languages, Science, or
Technology. The ages of students in these departments, for the year 2014, are shown in the table
below. Students aged 25 – under 30 are classed as ‘mature’ students.
Department
Age (years)
Arts Languages Science Technology
18 – under 19 60 37 125 107
19 – under 20 78 50 153 138
20 – under 22 101 66 112 96
22 – under 25 62 72 84 70
25 – under 30 53 60 51 39
TOTAL 354 285 525 450
(i) Find the number of students at the college who are under 20 years of age.
....................................................[1]
(ii) Of all the students, show that the percentage who are enrolled in Science is 32.5%, correct to
3 significant figures.
[1]
(iii) Of all the students, find the percentage who are mature students.
....................................................[2]
(iv) On the grid below draw a histogram to illustrate the ages of students enrolled in Languages. The
rectangles representing the 18 – under 19 class and the 19 – under 20 class have already
been drawn for you.
70
60
50
40
Number of
students
per 1 year
30
20
10
0
0 18 20 22 24 26 28 30
Age (years)
[4]
At the end of the year, of all the students, 144 obtained distinctions in their examinations. The
departments in which these 144 students were enrolled are represented by the following pie chart,
which is drawn to scale.
$UWV
/DQJXDJHV
7HFKQRORJ\
6FLHQFH
....................................................[2]
(vi) Of all the students enrolled in Science, find the percentage who obtained distinctions.
....................................................[3]
(vii) Of all the mature students, find the percentage who obtained distinctions in Technology.
....................................................[3]
9 At an international sporting event, the team from a particular country includes swimmers and track
athletes.
(i) The diagram below shows the number of swimmers who are specialists in one or more of the
styles breaststroke, freestyle and backstroke.
%UHDVWVWURNH
)UHHVW\OH %DFNVWURNH
Use this information to find the number of these swimmers who are specialists in
(a) backstroke,
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[1]
One of these swimmers is chosen at random to appear on television. Find the probability of
choosing a specialist in
....................................................[1]
....................................................[1]
(ii) The diagram below shows the number of track athletes who enter one or more of the events
100 metres, 200 metres and 400 metres.
Use this information to find the number of these track athletes who enter
....................................................[1]
....................................................[1]
....................................................[1]
....................................................[1]
(iii) One of the swimmers in part (i) and one of the track athletes in part (ii) are chosen at random
to undergo blood tests.
Find the probability that the swimmer is a specialist in freestyle and the track athlete enters
more than one of the given events.
....................................................[4]
(iv) Later, one of the track athletes in part (ii), who currently enters both the 100 metres and the
200 metres, decides to enter also the 400 metres.
Draw and label a new Venn diagram to represent the track athletes in part (ii) after this change
has been made.
[2]
10 A postman wears a pedometer, with which he measures the daily distance he walks when
delivering mail. The following table summarises the data he collected over 50 working days.
3 – under 6 10
6 – under 9 15
9 – under 12 11
12 – under 15 5
15 – under 20 2
....................................................[1]
(ii) Estimate, in kilometres, the mean and standard deviation of the daily distance walked. Give
your answers correct to 3 significant figures.
Mean = .......................................................
(iii) State the units in which the variance of the daily distance walked would be measured.
....................................................[1]
(iv) From the data in the table, possible values for the range, r, are given by a ⬍ r ⬍ b.
Find a and b.
a = .......................................................
b = ...................................................[2]
Later the postman has to deliver mail to a new apartment building. The mail boxes are inside the
building, and to gain access he must enter a four-digit security code on a keypad outside the
building.
1 2 3
4 5 6
7 8 9
The postman has forgotten the exact code, but he remembers, correctly, that the first digit is 4, and
the other digits are odd numbers which are different from each other. He uses this knowledge, but
otherwise randomly guesses.
....................................................[3]
....................................................[2]
11 In this question calculate all mortality rates as deaths per thousand admissions. Where
values do not work out exactly give your answers to two decimal places.
Medical Standard
Medical condition Mortality Admissions condition population of
mortality rate admissions (%)
Non-urgent 6 4000 15
Stable 35 5600 25
....................................................[4]
(ii) Calculate the mortality rate for each medical condition and insert the values in the table
above.
[2]
....................................................[4]
The table below gives mortality rate information about Southshore hospital, which is situated in
the same area as Northshore hospital, also for the year 2014.
Serious 85 7800
(iv) Calculate the standardised mortality rate for Southshore hospital in the year 2014, using the
same standard population as for Northshore hospital.
....................................................[2]
(v) Find how many fewer deaths there were at Southshore hospital than at Northshore hospital in
2014.
....................................................[2]
The local government of the area where Northshore and Southshore hospitals are situated has
sufficient funds available to improve medical care in one of the hospitals only.
(vi) State, with a reason, to which of these two hospitals the funds should be allocated.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every reasonable
effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the publisher will
be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
4040 STATISTICS
4040/12 Paper 1, maximum raw mark 100
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of
the examination. It shows the basis on which Examiners were instructed to award marks. It does not
indicate the details of the discussions that took place at an Examiners’ meeting before marking began,
which would have considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner
Report for Teachers.
Cambridge will not enter into discussions about these mark schemes.
Cambridge is publishing the mark schemes for the October/November 2015 series for most
Cambridge IGCSE®, Cambridge International A and AS Level components and some
Cambridge O Level components.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
1 (i) systematic B1
(ii) quota B1
(iii) stratified B1
(ii) median B1
(ii) from these data, for males no, for females yes B1
but sample too small for general conclusion B1
4 (i) (a) B B1
(b) C B1
(c) C B1
5 (i) any four from 1, 5, 7, 11, 13, 14, 16, 17 for first four numbers written down
allow B1 for three correct B2
correct method for total payment (at least one correct product)
(240 + 216 + 96) M1
$552 A1
8 (i) 748 B1
rectangles
width 2 height 33 A1
width 3 height 24 A1
width 5 height 12 A1
9 (i) (a) 11 B1
(b) 6 B1
(c) 4 B1
(d) 2 B1
(e) 8/30 oe B1
(f) 6/14 oe B1
(ii) (a) 4 B1
(b) 9 B1
(c) 16 B1
(d) 0 B1
(iv) any Venn diagram with a triple intersection of 1 and double intersections of 5, 2, 0 M1
fully correct and annotated diagram A1
10 (i) 6 – under 9 B1
(ii) attempted use of class mid-points (1.5 4.5 7.5 10.5 13.5 17.5) M1*
correct method for mean (Σfx = 386) M1dep
7.72 A1
finding values of f × variable squared M1
correct method for SD or variance (Σfx2 = 3798.5, Σfx2 / Σf = 75.97) M1dep
4.04 to 4.05 A1
4.05 A1
(iii) km2 B1
(iv) 12 B1
20 B1
(iii) any one medical condition rate multiplied by standard population figure M1
sum of four such products M1
(1.5 × 0.15) + (6.25 × 0.25) + (80 × 0.40) + (155 × 0.20) oe A1
64.79 or 64.7875 A1
STATISTICS 4040/13
Paper 1 October/November 2015
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (ST/FD) 100434/3
© UCLES 2015 [Turn over
2
1 A teacher gives her pupils a test consisting of 5 questions, in which each question is worth 1 mark.
The test scores of the pupils are shown in the following chart.
8
6
Number
4
of pupils
2
0
0 1 2 3 4 5
Test score
....................................................[1]
....................................................[2]
....................................................[1]
....................................................[2]
2 A bird protection society is concerned by the declining numbers of certain birds, particularly
blackbirds and sparrows. To monitor the situation, members of the public were asked to count the
number of birds in their gardens during a 1-hour period last Sunday.
Andy took part in the survey and altogether he counted 212 birds. His results are shown in the pie
chart below, which has a radius of 3 cm.
Blackbird
Other
Sparrow
(i) Measure the angle for blackbirds. Give your answer correct to the nearest degree.
....................................................[1]
....................................................[2]
Katrina also took part in the survey and counted 137 birds in total.
She wants to show this information in a comparative pie chart.
(iii) Calculate the radius of Katrina’s pie chart, correct to 2 decimal places.
....................................................[2]
(iv) Explain why the angle for sparrows on Katrina’s pie chart will be larger than the angle for
sparrows on Andy’s pie chart. You are not required to find either of these angles.
...................................................................................................................................................
...............................................................................................................................................[1]
3 Rishi and Rakhi are conducting a survey about the meals at the school canteen. They decide to
interview pupils aged 12, 13 and 14.
The table below shows the number of pupils of each age.
Rather than asking all the pupils they decide to take a sample.
(i) Explain how they should select a systematic sample of size 25.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[3]
(ii) If they choose this method, calculate the number of pupils aged 14 that would be interviewed.
....................................................[2]
After discussion they each conduct their own survey independently. They both take a systematic
sample of size 25. When they compare their results, Rishi notices that he has interviewed one
more student aged 12 than Rakhi has.
...................................................................................................................................................
...............................................................................................................................................[1]
4 Students at a local college have the option of taking a 1-year vocational Computing course. At the
end of the course the students achieve either a PASS or a FAIL.
Information about the numbers taking the course is shown in the table below.
(i) State the number of girls who took the Computing course in 2012.
....................................................[1]
(ii) Of the boys who took the Computing course in 2010, calculate the percentage who achieved
a PASS.
....................................................[2]
(iii) Of all the students at the college in 2013, calculate the percentage who took the Computing
course and achieved a PASS.
....................................................[2]
(iv) Identify one trend in the number of students taking the Computing course from 2010 to 2013.
...............................................................................................................................................[1]
5 A bag contains 10 counters, of which 3 are red. Basil selects counters from the bag one at a time,
at random, without replacement. He stops if he selects a red counter or if he has selected a total of
4 counters.
....................................................[1]
[2]
x 1 2 3 4
Probability 7/30
[3]
6 A child’s toy consists of a box of coloured pieces. Each piece is green or blue, a square or a
triangle, and made of wood or plastic.
(a) qualitative,
....................................................[1]
....................................................[1]
....................................................[1]
The diagram below shows the number of pieces which have one or more of the properties square,
made of wood, and green.
2
7 9
30
Green
....................................................[1]
....................................................[1]
....................................................[1]
7 (a) Gary drives to work every day. On his journey he has to pass through two sets of traffic lights.
The probability that he has to stop at the first set of traffic lights is 0.62 . The probability that
he has to stop at the second set is 0.45 . These events are independent.
Find the probability that on any particular journey to work he has to stop at at least one set of
traffic lights.
....................................................[3]
(b) Alex and Beatrice play a series of games. In each game they throw, alternately, a dart at a
dart board. The first player to hit the bull’s-eye (centre) wins the game. They play a number of
games until one of them wins the series by winning three games.
In any game, if Alex throws first the probability that he wins the game is 0.64 . If Beatrice
throws first the probability that she wins the game is 0.78 .
To decide who starts the first game in the series an unbiased coin is tossed. Subsequent
games in the series are started by the winner of the previous game.
(i) Beatrice wins the toss and wins the first game,
....................................................[2]
(ii) Alex wins the toss and Beatrice wins the first game,
....................................................[2]
....................................................[2]
....................................................[2]
They start a new series. Beatrice wins the first game and Alex wins the second game.
....................................................[5]
8 Konrad is conducting an experiment. He attaches a spring to a stand, then attaches a mass to the
bottom of the spring, and then measures the length of the spring. He repeats this experiment for
eight different masses. His results are shown in the table below.
Mass, x (g) 30 35 40 45 50 55 60 65
Length of spring, y (cm) 26.1 27.7 34.5 38.5 40.8 44.1 48.2 49.7
70
60
50
Length of
40
spring (cm)
30
20
[
30 40 50 60 70 80
Mass (g)
[2]
........................................................
....................................................[3]
(iii) Plot these three averages on your graph and hence draw the line of best fit. [2]
(iv) Calculate the equation of the line of best fit in the form y = mx + c.
....................................................[4]
...............................................................................................................................................[1]
(a) 42 g,
....................................................[1]
(b) 75 g.
....................................................[1]
(vii) Which of your estimates in part (vi) is likely to be more reliable? Give a reason for your
answer.
...................................................................................................................................................
...............................................................................................................................................[2]
9 140 people applied to be contestants in a quiz show. As part of the selection process they were
given 60 seconds to solve a set of simple puzzles. The times taken by those who completed the
puzzles are summarised in the cumulative frequency graph below.
140
120
100
80
Cumulative
frequency
(people)
60
40
20
0
30 40 50 60
Completion time (seconds)
(i) State the number of people who failed to complete the puzzles within the allotted time.
....................................................[1]
....................................................[1]
....................................................[4]
....................................................[1]
The people were graded for speed. Those who took less than 40 seconds were graded A.
Those who took 40 seconds or more, but less than 49 seconds, were graded B. Those who took
49 seconds or more were graded C.
(a) A,
....................................................[2]
(b) B.
....................................................[2]
....................................................[2]
The people who completed the puzzles within the allotted time were also graded A, B or C for
accuracy. The table shows the cumulative percentages of these people graded A, B or C.
(v) Find the maximum number of people that could have been graded B for both speed and
accuracy.
....................................................[3]
© UCLES 2015 4040/13/O/N/15 [Turn over
14
10 Dirota is a keen gardener. She likes to grow tomatoes in her greenhouse. She sows a packet
of seeds in a tray of compost and after five weeks she measures the heights of the plants. The
results are summarised in the histogram below.
20
16
Number of 12
plants per
1 cm of
plant height 8
0
0 1 2 3 4 5 6 7 8 9
Plant height (cm)
....................................................[2]
....................................................[2]
....................................................[2]
Each plant is transferred to its own pot. Dirota puts the plants in order of size, starting with the
shortest, and numbers the pots. The pot which contains the shortest plant is numbered 1; the pot
with the second shortest plant is numbered 2 etc.
....................................................[3]
After 8 weeks she has 60 surviving plants. She measures the height, x cm, of each of the plants
and finds that Σx = 443 and Σx 2 = 3489.
Mean .......................................................
Dirota sees an advertisement for a new plant food, which adds 2 cm to plant growth during the first
8 weeks after sowing.
(v) Write down what the mean and standard deviation of the heights of Dirota’s plants would
have been after 8 weeks if she had used this new plant food.
Mean .......................................................
11 The table below gives information about the population and deaths in the town of Ashville for the
year 2012, together with the standard population of the area in which Ashville is situated.
(i) The death rate for the 0 – under 25 age group was 6 per thousand. Show that a = 21.
[1]
(ii) The death rate for the 25 – under 45 age group was 2.5 per thousand. Find the value of b.
....................................................[2]
(iii) Calculate the death rates per thousand for the other two age groups.
(iv) Calculate the crude death rate per thousand for Ashville, correct to 2 decimal places.
....................................................[4]
....................................................[1]
(vi) Calculate the standardised death rate per thousand for Ashville.
....................................................[4]
Birchville is a town in the same area as Ashville. For Birchville, the crude death rate is 5.21 per
thousand and the standardised death rate is 3.44 per thousand.
(vii) State, with a reason, which of the two towns appears to have the healthier environment.
...................................................................................................................................................
...............................................................................................................................................[2]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every reasonable
effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the publisher will
be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
4040 STATISTICS
4040/13 Paper 1, maximum raw mark 100
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of
the examination. It shows the basis on which Examiners were instructed to award marks. It does not
indicate the details of the discussions that took place at an Examiners’ meeting before marking began,
which would have considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner
Report for Teachers.
Cambridge will not enter into discussions about these mark schemes.
Cambridge is publishing the mark schemes for the October/November 2015 series for most
Cambridge IGCSE®, Cambridge International A and AS Level components and some
Cambridge O Level components.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
1 (i) 5 B1
(ii) 4 + 5+ 6 + 6 + 7 + 3 M1
31 A1
(iii) 4 B1
2 (i) 100° ± 2° B1
(iii) √(137/212) × 3 M1
2.41 cm A1
4 (i) 49 B1
5 (i) 0.3 B1
(ii) 7/10 B1
× 3/9 B1
(ii) (a) 11 B1
(b) 22 B1
(c) 30 B1
(iv) correct ratio using two of the averages or two points on their line M1
m = 0.7 A1
substitution M1
c = 5.45 A1
(vi) (a) 35 cm B1
(b) 58 cm B1
(ft their line or equation, ± 0.5 if using line)
9 (i) 10 B1
(ii) (a) 47 s B1
(c) 33 s–33.5 s B1
(iv) 443/60 M1
7.38 A1
3489/60 – (443/60)2 (= 3.6363888888…) M1*
√ M1dep
1.91 A1
(iii) 5 B1
7.5 B1
(v) 15 B1
STATISTICS 4040/12
Paper 1 October/November 2016
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (KN/CGW) 117163/1
© UCLES 2016 [Turn over
2
1 The main sources of energy in the human diet are carbohydrates, proteins and fats. A nutritionist
recommends the following percentages from each of these sources.
Carbohydrates ..................................................... °
Proteins ..................................................... °
Fats ..................................................... °
[2]
[2]
2 Flights from an airport have either a domestic or an international destination. For each scheduled
departure the flight is categorised as on time, delayed or cancelled.
On one particular day there were 50 scheduled departures, of which 3 were cancelled, and 4 were
delayed domestic flights, as shown in the following table.
Flight departure
Destination TOTAL
On time Delayed Cancelled
Domestic 4
International
TOTAL 3 50
(i) Use this information to insert two numbers into the table. [1]
(ii) Use this information to insert three more numbers into the table. [2]
3 The atmospheric pressure in a town at mid-day was measured every day for one week. The
following results, in millibars (mb), were obtained.
(i) Using an assumed mean of 1000 mb and showing your working, find the mean and standard
deviation of these values.
Mean = .......................................................
It is known that, on a mountain near the town, the atmospheric pressure is usually about 80 mb
lower than it is in the town.
(ii) Write down estimates for the mean and range of the atmospheric pressure on the mountain
at mid-day for this week.
Mean = .......................................................
Range = ...................................................[2]
4 The diagram below shows the number of stores in a shopping mall that sell clothes for one or
more of men, women and children.
Men
11 1
8
7 3
5
Women Children
Use this information to find the number of stores that sell clothes for
(i) children,
....................................................[1]
....................................................[1]
....................................................[2]
Later, two of the stores that sell clothes for men and women but not children start selling clothes
for children also.
Find, after this change, the number of stores that now sell clothes for
....................................................[1]
....................................................[1]
5 On an examination paper there are four questions, numbered 1, 2, 3 and 4. Candidates are
instructed to answer any three questions, but not more than three.
At the examination board the computer print-out shows the following information for the questions
answered by candidates from a particular school.
Questions answered
1, 2 and 3 1, 2 and 4 1, 3 and 4 2, 3 and 4 1, 2, 3 and 4
Number of
18 23 15 28 3
candidates
For checking the marking, a manager at the board selects the answer paper from one of these
candidates at random.
....................................................[1]
....................................................[1]
(iii) had answered Question 2, given that the candidate had followed the examination instructions.
....................................................[1]
If, instead of selecting one, the manager selects two answer papers at random,
(iv) find the probability that one candidate had, and one candidate had not, followed the
examination instructions.
....................................................[3]
6 Rong and Shui survey the passengers on one journey along a particular bus route.
Rong records the number of passengers boarding, and alighting from, the bus at each point along
the route. Her raw data is as follows.
+27 +4 +14 +7 +2
Start A B C D E F G H I J Finish
–1 –5 –5 –4 –9 –30
For example, at the start of the journey 27 passengers boarded the empty bus, and at stop C,
4 passengers boarded the bus and 1 passenger alighted from the bus.
Assuming that each passenger boarded and alighted from the bus once only, find, for this journey,
....................................................[2]
(ii) the least and greatest number of passengers travelling on the bus between stops at any one
time.
Least = .......................................................
Greatest = ...................................................[3]
Shui asks a sample of the passengers to rate their opinions of bus services on the route, on each
of the aspects punctuality, cost, and comfort, on a scale from 0 (very poor) to 4 (very good). From
the ratings he calculates the measures shown in the table below.
(iii) State, for which one of the aspects punctuality, cost, or comfort, passengers are generally
....................................................[1]
....................................................[1]
....................................................[1]
7 In this question calculate all pass rates as percentages, that is, as the number of passes
per 100 enrolments.
At Yarvard University, the academic ability of students enrolled, based on school performance, is
recorded as one of excellent, very good, good or moderate.
The table below gives information on the number of enrolments and number of passes in
Economics at the University, together with the standard population of enrolments for universities in
the area.
Standard
Number of Number of Ability group
Ability group population of
passes enrolments pass rate
enrolments (%)
Excellent 48 48 20
Very good 68 80 35
Good 20 32 30
Moderate 11 20 15
(i) Show that the crude pass rate for this course, correct to 1 decimal place, is 81.7%.
[3]
(ii) Calculate the pass rate for each ability group and insert the values in the table above.
[2]
© UCLES 2016 4040/12/O/N/16
9
(iii) Calculate the standardised pass rate for this course at Yarvard University.
....................................................[4]
The table below gives information on the pass rate, over the same period of time, for students of
Economics at Hale University, which is situated in the same area as Yarvard University.
....................................................[3]
(v) the standardised pass rate, using the same standard population as for Yarvard University.
....................................................[2]
(vi) State, with a reason, which of the two universities appears to provide the higher quality
teaching in Economics.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
BLANK PAGE
8 Tariq lives in a hill village but works in a fuel station on the main road below the village. He walks
down the hill from home to work in the morning, and walks back up the hill from work to home in
the evening.
The following table summarises his daily walking time from home to work over 60 working days.
Daily walking
Number
time from home
of days
to work (minutes)
16 – under 18 5
18 – under 20 14
20 – under 22 19
22 – under 24 15
24 – under 28 7
(i) Estimate, in minutes, the mean and standard deviation of these walking times. Give your
answers to 3 significant figures.
Mean = .......................................................
The following histogram summarises Tariq’s daily walking time from work to home for the same
60 days.
20
15
Number of
days per
10
2 minutes
0
20 25 30 35 40 45
Walking time from work to home
(minutes)
(iii) Estimate the total time Tariq takes, on average, walking to and from work each day. Give your
answer to the nearest minute.
....................................................[4]
9 The following table summarises the daily water consumption of a family over a period of 80 days.
(ii) Plot the cumulative frequencies on the grid opposite, joining the points by a smooth curve. [3]
(iii) Use the graph to estimate, for the daily water consumption,
....................................................[1]
....................................................[4]
....................................................[2]
(iv) Use your answer to part (iii)(c) to find the probability that, on any one day, the water
consumption is more than 375 litres.
....................................................[1]
80
70
60
50
Cumulative
frequency 40
(days)
30
20
10
0
0 200 250 300 350 400 450 500
Daily water consumption (litres)
The water company charges $2.50 per cubic metre for water consumed, plus an additional service
charge of $0.25 per day.
Assuming that the mean and median daily water consumption are approximately the same, and
given that 1000 litres = 1 cubic metre,
(v) estimate the total amount owed by the family to the water company for these 80 days.
....................................................[4]
© UCLES 2016 4040/12/O/N/16 [Turn over
16
10 The numbers of visitors staying in a particular town for leisure and business, in the years 2014 and
2015, are shown in the pictograms below.
Leisure
2014
Business
Leisure
2015
Business
(i) State the number of visitors who stayed in the town for business in 2014.
....................................................[1]
(ii) How many more visitors stayed in the town for leisure than business in 2015?
....................................................[2]
(iii) Calculate the percentage increase, from 2014 to 2015, in the number of visitors who stayed in
the town for leisure.
....................................................[2]
The town’s tourist office provides the following information on hotels in the town.
Hotel Facilities
Royal P
Mountain View P
Palm Beach P
Commercial P
Panorama
Central
Key
P Car park Lift Wheelchair access
Swimming pool Free internet access Regular entertainment
(iv) If a visitor staying in the town chooses one of these hotels at random, find the probability that
the hotel
....................................................[1]
(b) has a car park or free internet access but not both,
....................................................[1]
(c) does not have a swimming pool, given that it does not provide regular entertainment.
....................................................[1]
The tourist office estimates that 30% of all visitors staying in the town for business choose The
Commercial Hotel, and that the remainder are equally likely to choose one of the other five hotels.
(v) Estimate the decrease, from 2014 to 2015, in the number of visitors staying in the town for
business who chose The Palm Beach Hotel.
....................................................[3]
(vi) A saleswoman comes to stay in the town to make new business contacts.
Estimate the probability that she chooses a hotel with wheelchair access.
....................................................[2]
(vii) Three visitors (who were old classmates, but now work for different companies) come to stay
in the town for a business conference.
Assuming they make choices independently of each other, estimate the probability that they
all choose the same hotel, and it has a lift.
....................................................[3]
11 Alfred has smoked cigarettes for many years. He decides to try to stop by reducing his consumption
gradually. His daughter Violet (a Statistics student) helps him by recording the number of cigarettes
he smokes each week. Her results are shown in the following table.
Week number, x 1 2 3 4 5 6 7 8
Number of
108 95 98 83 67 72 57 52
cigarettes smoked, y
120
100
80
Number of
cigarettes 60
smoked
40
20
0
0 1 2 3 4 5 6 7 8 x
Week number
[2]
The data have an overall mean of (4.5, 79) and a lower semi-average of (2.5, 96).
(ii) Find the upper semi-average, and plot this and the two given averages on your graph.
[3]
(iii) Use your plotted averages to draw a line of best fit, and find its equation in the form
y = mx + c.
....................................................[4]
(iv) Use the equation you have found in part (iii) to predict the additional number of weeks after
which Alfred will have stopped smoking.
....................................................[2]
(v) Give a statistical reason why the prediction made in part (iv) might be unreliable.
...................................................................................................................................................
...............................................................................................................................................[1]
Alfred persuaded his friends George and Joseph, also cigarette smokers, to try to stop smoking,
at the same time and using the same method as himself.
The equations Violet found for their lines of best fit were
(vi) Use this information, together with your answer to part (iii), to state, explaining your answers
briefly, which one of Alfred, George and Joseph
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every reasonable
effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the publisher will
be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2016
MARK SCHEME
Maximum Mark: 100
Published
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of the
examination. It shows the basis on which Examiners were instructed to award marks. It does not indicate the
details of the discussions that took place at an Examiners’ meeting before marking began, which would have
considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner Report for
Teachers.
Cambridge will not enter into discussions about these mark schemes.
Cambridge is publishing the mark schemes for the October/November 2016 series for most
Cambridge IGCSE®, Cambridge International A and AS Level components and some Cambridge O Level
components.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
© UCLES 2016
Page 3 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 12
[4]
35 in correct place B1
[6]
mean = 1002 A1
SD = 6.52 or 6.52 … A1
range = 19 B1
[6]
4 (i) 17 B1
(ii) 19 B1
35 A1
(iv) 9 B1
(v) 11 B1
[6]
© UCLES 2016
Page 4 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 12
(ii) 64/87 B1
×2 M1
[6]
54 A1
25 A1
39 A1
(b) cost B1
(c) punctuality B1
[8]
7 (i) 48 + 68 + 20 + 11 (=147) M1
48 + 80 + 32 + 20 (=180) M1
81.7% AG
100 85 62.5 55 A1
© UCLES 2016
Page 5 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 12
76.75% or 76.8% A1
79.5% A1
78.0% A1
Hale A1
[16]
8 (i) attempted use of class mid-points (17, 19, 21, 23, 26) M1*
4, 21, 10 A3
© UCLES 2016
Page 6 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 12
54
ft only on their 1277 A1
[16]
suitable curve A1
use of IQR = Q3 – Q1
with at least one of Q1, Q3 found properly from their curve M1
83 – 88 (litres) A1
58.75 – 61.25 A1
$92 A1
[16]
10 (i) 15 000 B1
25 000 A1
© UCLES 2016
Page 7 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 12
(b) 3/6 B1
(c) 3/5 B1
350 A1
0.86 A1
(vii) (0.3)3 (× 1) B1
+ (0.14)3 × 2 M1
[16]
© UCLES 2016
Page 8 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 12
[16]
© UCLES 2016
Cambridge International Examinations
Cambridge Ordinary Level
* 3 2 7 0 8 2 2 8 9 3 *
STATISTICS 4040/13
Paper 1 October/November 2016
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (CW/CGW) 134135/1
© UCLES 2016 [Turn over
2
1 The main sources of energy in the human diet are carbohydrates, proteins and fats. A nutritionist
recommends the following percentages from each of these sources.
Carbohydrates ..................................................... °
Proteins ..................................................... °
Fats ..................................................... °
[2]
[2]
2 Flights from an airport have either a domestic or an international destination. For each scheduled
departure the flight is categorised as on time, delayed or cancelled.
On one particular day there were 50 scheduled departures, of which 3 were cancelled, and 4 were
delayed domestic flights, as shown in the following table.
Flight departure
Destination TOTAL
On time Delayed Cancelled
Domestic 4
International
TOTAL 3 50
(i) Use this information to insert two numbers into the table. [1]
(ii) Use this information to insert three more numbers into the table. [2]
3 The atmospheric pressure in a town at mid-day was measured every day for one week. The
following results, in millibars (mb), were obtained.
(i) Using an assumed mean of 1000 mb and showing your working, find the mean and standard
deviation of these values.
Mean = .......................................................
It is known that, on a mountain near the town, the atmospheric pressure is usually about 80 mb
lower than it is in the town.
(ii) Write down estimates for the mean and range of the atmospheric pressure on the mountain
at mid-day for this week.
Mean = .......................................................
Range = ...................................................[2]
4 The diagram below shows the number of stores in a shopping mall that sell clothes for one or
more of men, women and children.
Men
11 1
8
7 3
5
Women Children
Use this information to find the number of stores that sell clothes for
(i) children,
....................................................[1]
....................................................[1]
....................................................[2]
Later, two of the stores that sell clothes for men and women but not children start selling clothes
for children also.
Find, after this change, the number of stores that now sell clothes for
....................................................[1]
....................................................[1]
5 On an examination paper there are four questions, numbered 1, 2, 3 and 4. Candidates are
instructed to answer any three questions, but not more than three.
At the examination board the computer print-out shows the following information for the questions
answered by candidates from a particular school.
Questions answered
1, 2 and 3 1, 2 and 4 1, 3 and 4 2, 3 and 4 1, 2, 3 and 4
Number of
18 23 15 28 3
candidates
For checking the marking, a manager at the board selects the answer paper from one of these
candidates at random.
....................................................[1]
....................................................[1]
(iii) had answered Question 2, given that the candidate had followed the examination instructions.
....................................................[1]
If, instead of selecting one, the manager selects two answer papers at random,
(iv) find the probability that one candidate had, and one candidate had not, followed the
examination instructions.
....................................................[3]
6 Rong and Shui survey the passengers on one journey along a particular bus route.
Rong records the number of passengers boarding, and alighting from, the bus at each point along
the route. Her raw data is as follows.
+27 +4 +14 +7 +2
Start A B C D E F G H I J Finish
–1 –5 –5 –4 –9 –30
For example, at the start of the journey 27 passengers boarded the empty bus, and at stop C,
4 passengers boarded the bus and 1 passenger alighted from the bus.
Assuming that each passenger boarded and alighted from the bus once only, find, for this journey,
....................................................[2]
(ii) the least and greatest number of passengers travelling on the bus between stops at any one
time.
Least = .......................................................
Greatest = ...................................................[3]
Shui asks a sample of the passengers to rate their opinions of bus services on the route, on each
of the aspects punctuality, cost, and comfort, on a scale from 0 (very poor) to 4 (very good). From
the ratings he calculates the measures shown in the table below.
(iii) State, for which one of the aspects punctuality, cost, or comfort, passengers are generally
....................................................[1]
....................................................[1]
....................................................[1]
7 In this question calculate all pass rates as percentages, that is, as the number of passes
per 100 enrolments.
At Yarvard University, the academic ability of students enrolled, based on school performance, is
recorded as one of excellent, very good, good or moderate.
The table below gives information on the number of enrolments and number of passes in
Economics at the University, together with the standard population of enrolments for universities in
the area.
Standard
Number of Number of Ability group
Ability group population of
passes enrolments pass rate
enrolments (%)
Excellent 48 48 20
Very good 68 80 35
Good 20 32 30
Moderate 11 20 15
(i) Show that the crude pass rate for this course, correct to 1 decimal place, is 81.7%.
[3]
(ii) Calculate the pass rate for each ability group and insert the values in the table above.
[2]
© UCLES 2016 4040/13/O/N/16
9
(iii) Calculate the standardised pass rate for this course at Yarvard University.
....................................................[4]
The table below gives information on the pass rate, over the same period of time, for students of
Economics at Hale University, which is situated in the same area as Yarvard University.
....................................................[3]
(v) the standardised pass rate, using the same standard population as for Yarvard University.
....................................................[2]
(vi) State, with a reason, which of the two universities appears to provide the higher quality
teaching in Economics.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
BLANK PAGE
8 Tariq lives in a hill village but works in a fuel station on the main road below the village. He walks
down the hill from home to work in the morning, and walks back up the hill from work to home in
the evening.
The following table summarises his daily walking time from home to work over 60 working days.
Daily walking
Number
time from home
of days
to work (minutes)
16 – under 18 5
18 – under 20 14
20 – under 22 19
22 – under 24 15
24 – under 28 7
(i) Estimate, in minutes, the mean and standard deviation of these walking times. Give your
answers to 3 significant figures.
Mean = .......................................................
The following histogram summarises Tariq’s daily walking time from work to home for the same
60 days.
20
15
Number of
days per
10
2 minutes
0
20 25 30 35 40 45
Walking time from work to home
(minutes)
(iii) Estimate the total time Tariq takes, on average, walking to and from work each day. Give your
answer to the nearest minute.
....................................................[4]
9 The following table summarises the daily water consumption of a family over a period of 80 days.
(ii) Plot the cumulative frequencies on the grid opposite, joining the points by a smooth curve. [3]
(iii) Use the graph to estimate, for the daily water consumption,
....................................................[1]
....................................................[4]
....................................................[2]
(iv) Use your answer to part (iii)(c) to find the probability that, on any one day, the water
consumption is more than 375 litres.
....................................................[1]
80
70
60
50
Cumulative
frequency 40
(days)
30
20
10
0
0 200 250 300 350 400 450 500
Daily water consumption (litres)
The water company charges $2.50 per cubic metre for water consumed, plus an additional service
charge of $0.25 per day.
Assuming that the mean and median daily water consumption are approximately the same, and
given that 1000 litres = 1 cubic metre,
(v) estimate the total amount owed by the family to the water company for these 80 days.
....................................................[4]
© UCLES 2016 4040/13/O/N/16 [Turn over
16
10 The numbers of visitors staying in a particular town for leisure and business, in the years 2014 and
2015, are shown in the pictograms below.
Leisure
2014
Business
Leisure
2015
Business
(i) State the number of visitors who stayed in the town for business in 2014.
....................................................[1]
(ii) How many more visitors stayed in the town for leisure than business in 2015?
....................................................[2]
(iii) Calculate the percentage increase, from 2014 to 2015, in the number of visitors who stayed in
the town for leisure.
....................................................[2]
The town’s tourist office provides the following information on hotels in the town.
Hotel Facilities
Royal P
Mountain View P
Palm Beach P
Commercial P
Panorama
Central
Key
P Car park Lift Wheelchair access
Swimming pool Free internet access Regular entertainment
(iv) If a visitor staying in the town chooses one of these hotels at random, find the probability that
the hotel
....................................................[1]
(b) has a car park or free internet access but not both,
....................................................[1]
(c) does not have a swimming pool, given that it does not provide regular entertainment.
....................................................[1]
The tourist office estimates that 30% of all visitors staying in the town for business choose The
Commercial Hotel, and that the remainder are equally likely to choose one of the other five hotels.
(v) Estimate the decrease, from 2014 to 2015, in the number of visitors staying in the town for
business who chose The Palm Beach Hotel.
....................................................[3]
(vi) A saleswoman comes to stay in the town to make new business contacts.
Estimate the probability that she chooses a hotel with wheelchair access.
....................................................[2]
(vii) Three visitors (who were old classmates, but now work for different companies) come to stay
in the town for a business conference.
Assuming they make choices independently of each other, estimate the probability that they
all choose the same hotel, and it has a lift.
....................................................[3]
11 Alfred has smoked cigarettes for many years. He decides to try to stop by reducing his consumption
gradually. His daughter Violet (a Statistics student) helps him by recording the number of cigarettes
he smokes each week. Her results are shown in the following table.
Week number, x 1 2 3 4 5 6 7 8
Number of
108 95 98 83 67 72 57 52
cigarettes smoked, y
120
100
80
Number of
cigarettes 60
smoked
40
20
0
0 1 2 3 4 5 6 7 8 x
Week number
[2]
The data have an overall mean of (4.5, 79) and a lower semi-average of (2.5, 96).
(ii) Find the upper semi-average, and plot this and the two given averages on your graph.
[3]
(iii) Use your plotted averages to draw a line of best fit, and find its equation in the form
y = mx + c.
....................................................[4]
(iv) Use the equation you have found in part (iii) to predict the additional number of weeks after
which Alfred will have stopped smoking.
....................................................[2]
(v) Give a statistical reason why the prediction made in part (iv) might be unreliable.
...................................................................................................................................................
...............................................................................................................................................[1]
Alfred persuaded his friends George and Joseph, also cigarette smokers, to try to stop smoking,
at the same time and using the same method as himself.
The equations Violet found for their lines of best fit were
(vi) Use this information, together with your answer to part (iii), to state, explaining your answers
briefly, which one of Alfred, George and Joseph
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[2]
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/13
Paper 1 October/November 2016
MARK SCHEME
Maximum Mark: 100
Published
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of the
examination. It shows the basis on which Examiners were instructed to award marks. It does not indicate the
details of the discussions that took place at an Examiners’ meeting before marking began, which would have
considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner Report for
Teachers.
Cambridge will not enter into discussions about these mark schemes.
Cambridge is publishing the mark schemes for the October/November 2016 series for most
Cambridge IGCSE®, Cambridge International A and AS Level components and some Cambridge O Level
components.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
© UCLES 2016
Page 3 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 13
[4]
35 in correct place B1
[6]
mean = 1002 A1
SD = 6.52 or 6.52 … A1
range = 19 B1
[6]
4 (i) 17 B1
(ii) 19 B1
35 A1
(iv) 9 B1
(v) 11 B1
[6]
© UCLES 2016
Page 4 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 13
(ii) 64/87 B1
×2 M1
[6]
54 A1
25 A1
39 A1
(b) cost B1
(c) punctuality B1
[8]
7 (i) 48 + 68 + 20 + 11 (=147) M1
48 + 80 + 32 + 20 (=180) M1
81.7% AG
100 85 62.5 55 A1
© UCLES 2016
Page 5 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 13
76.75% or 76.8% A1
79.5% A1
78.0% A1
Hale A1
[16]
8 (i) attempted use of class mid-points (17, 19, 21, 23, 26) M1*
4, 21, 10 A3
© UCLES 2016
Page 6 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 13
54
ft only on their 1277 A1
[16]
suitable curve A1
use of IQR = Q3 – Q1
with at least one of Q1, Q3 found properly from their curve M1
83 – 88 (litres) A1
58.75 – 61.25 A1
$92 A1
[16]
10 (i) 15 000 B1
25 000 A1
© UCLES 2016
Page 7 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 13
(b) 3/6 B1
(c) 3/5 B1
350 A1
0.86 A1
(vii) (0.3)3 (× 1) B1
+ (0.14)3 × 2 M1
[16]
© UCLES 2016
Page 8 Mark Scheme Syllabus Paper
Cambridge O Level – October/November 2016 4040 13
[16]
© UCLES 2016
Cambridge International Examinations
Cambridge Ordinary Level
* 1 6 3 1 8 6 6 1 9 0 *
STATISTICS 4040/12
Paper 1 October/November 2017
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (LK/AR) 135683/2
© UCLES 2017 [Turn over
2
1 (a) State, for each of the following, the name of a method of sampling in which
....................................................[1]
(ii) every item in the population is numbered, and a random number table or generator is
used to select every item in the sample,
....................................................[1]
(iii) every item in the population is numbered, and a random number table or generator is
used to select only the first item in the sample.
....................................................[1]
(b) Consider the following statement, from which two statistical terms have been omitted.
‘When selecting a sample from a population, a researcher should, wherever possible, ensure
that the sampling method is free from ......................................... , and that the sample is
Insert the appropriate terms into the spaces to complete the statement. [1]
2 The maximum temperature each week in a town was recorded over a ten-week period.
(i) The following values, in °C, rounded to the nearest integer, were obtained.
23 24 22 26 28 23 28 32 29 28
....................................................[2]
....................................................[1]
....................................................[2]
The temperatures in the town were originally recorded correct to one decimal place.
(ii) Using the original values, it might have been impossible to find one of the measures named in
part (i).
...................................................................................................................................................
...............................................................................................................................................[1]
3 A holiday company organises safari tours to African countries. The diagram below shows the
number of the company’s guides who have taken tours to one or more of the countries Tanzania,
Botswana and Kenya.
Tanzania
2 5
1
4 3 8
Botswana Kenya
Use this information to find the number of guides who have taken tours to
(i) Botswana,
....................................................[1]
....................................................[1]
....................................................[2]
All the guides who have taken tours to Botswana, but only these guides, have also taken tours to
Zimbabwe.
Of the four countries Tanzania, Botswana, Kenya and Zimbabwe, find the number of guides who
have taken tours to
....................................................[1]
....................................................[1]
4 A dental surgery is open for six days each week, and holds appointments each day in three
sessions: morning, afternoon, and evening.
The table below shows measures for the number of appointments held in each session during one
particular week.
(i) State, for which one of the sessions morning, afternoon or evening, the number of
appointments was generally
(a) largest,
....................................................[1]
....................................................[1]
(ii) Find the total number of appointments held in the surgery during this week.
....................................................[2]
(iii) Assuming that appointments are held continuously throughout the session, estimate the
length of time of an afternoon appointment, on average, during this particular week. Give your
answer in minutes.
....................................................[2]
5 The table below summarises the performance of a hockey team for every match played in one
season. For each match there are two pieces of information: the number of goals scored and the
number of goals conceded.
For example, there were three matches in which the team scored 4 goals and conceded 2 goals.
....................................................[1]
....................................................[1]
....................................................[1]
(ii) Calculate the total number of goals conceded by the team in the season.
....................................................[2]
(iii) Explain why it is not possible to calculate, from the table above, the total number of goals
scored by the team in the season.
...................................................................................................................................................
...............................................................................................................................................[1]
6 Candidates who enter for a proficiency certificate with an examining board must submit a long
essay, with specified minimum and maximum word limits, in their subject of study. From its
records, the board knows that, for any subject, 3% of the essays submitted are underlength, 5%
are overlength, and the remainder are of the specified length.
(i) A manager at the board selects at random an essay submitted in History and an essay
submitted in Sociology.
....................................................[2]
(b) the essay in History is of the specified length, but the essay in Sociology is not,
....................................................[2]
(c) one of the essays is underlength and the other essay is overlength.
....................................................[2]
(ii) In one particular year, 134 candidates submitted an essay that was not of the specified length.
....................................................[2]
7 In this question all pass rates, whether given or to be found, are expressed, or are to be
expressed, as percentages.
At Lernalott School the academic ability of all enrolled A Level pupils, based on O Level
performance, is recorded as one of outstanding, very good, good or modest.
The table below gives information on the number of enrolments and the pass rate in A Level
Biology at the school, together with the standard population of enrolments for all A Level subjects
at the school.
....................................................[4]
....................................................[5]
The table below gives information on pass rates, over the same period of time, for pupils in other
A Level subjects at Lernalott School.
Number of Standardised
Subject Crude pass rate (%)
enrolments pass rate (%)
Chemistry 28 84.7 85.7
Physics 18 75.0 72.2
Chinese 25 73.3 80.0
Japanese 12 88.0 66.7
English 17 65.0 76.5
(iii) State, with a reason, in which of these five subjects the highest quality of teaching appears to
have been provided.
...................................................................................................................................................
...............................................................................................................................................[2]
(iv) State in which of these five subjects the highest number of passes was obtained. Explain how
this can be known without further calculation.
...................................................................................................................................................
...............................................................................................................................................[2]
In publicity material the school gives crude pass rates for Arts, Sciences, Languages etc. (rather
than for individual subjects).
(v) Calculate the crude pass rate for the languages Chinese, Japanese and English combined.
....................................................[3]
8 The following table summarises the heights of the 25 tallest buildings in a city.
....................................................[3]
(ii) On the following grid, draw a histogram to illustrate the data in the table above. The rectangle
representing the 80 – under 90 class has already been drawn for you.
10
Number of 6
buildings
per
10 metres 4
0
70 80 90 100 110 120 130 140 150
Building height (metres)
[4]
In the city a new building is being constructed which will have a height of 165 metres after
completion.
(iii) Estimate the new mean height of the 25 tallest buildings in the city after the completion of this
building.
....................................................[2]
One of the buildings has 20 floors (levels) of three different types: 4 floors have only shops,
9 floors have only offices, and the remainder have only apartments.
Three of the 20 floors are selected at random for routine safety checks.
....................................................[2]
(v) two floors have shops and one floor has offices,
....................................................[3]
....................................................[2]
9 A particular supermarket sells 64 different types of bread. The salt content of these types, in grams
per 100 grams of bread (g/100 g), is illustrated in the cumulative frequency curve below.
70
60
50
40
Cumulative
frequency
(types of
bread) 30
20
10
0
0 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
Salt content (g/100 g)
(c) the value of p, if the pth percentile of the salt content is 1.35 g/100 g.
....................................................[2]
© UCLES 2017 4040/12/O/N/17
13
For a healthy diet, the government recommends a maximum salt content for bread of 1.0 g/100 g.
....................................................[1]
(b) the median salt content of those which do not meet the government’s recommendation.
A government food inspector, visiting this supermarket, selects four different types of bread at
random.
....................................................[3]
....................................................[2]
Later, the supermarket reduces the salt content of all types of bread by 0.05 g/100 g.
(v) State, explaining your answer, which of the measures found in part (i) will be unchanged. You
are not required to recalculate the measures.
...................................................................................................................................................
...............................................................................................................................................[1]
10 Pedro is a market trader who displays no prices on the items he offers for sale. When a potential
customer shows interest in an item, Pedro states his asking price. Usually, following bargaining, if
he sells the item it will be for a different price. His daughter Manuela (a statistics student) observes
eight transactions her father makes, and in each case records the asking price and the actual
selling price. Her results are shown in the following table.
Item A B C D E F G H
Asking price, x ($) 15 35 40 75 10 65 60 25
Actual selling price, y ($) 12 20 40 55 8 45 35 20
60
50
40
Actual
selling
30
price
($)
20
10
0 x
0 10 20 30 40 50 60 70 80
Asking price ($)
[2]
The data have an overall mean of (40.625, 29.375) and an upper semi-average of (60, 43.75).
(ii) Find the lower semi-average, and plot this and the two given averages on your graph.
[3]
(iii) Use your plotted averages to draw a line of best fit, and find its equation in the form
y = mx + c .
....................................................[4]
(iv) Use your line to estimate, to the nearest $5, for other transactions,
(a) the actual selling price of an item with an asking price of $55,
....................................................[2]
(b) the asking price for an item which Pedro wishes to sell for $50.
....................................................[2]
Manuela observes from her line of best fit that, overall, Pedro’s actual selling price is
approximately k% less than the asking price.
....................................................[2]
(vi) State, with a reason, for which of the items A – H Pedro will be most satisfied with the business
transacted.
...................................................................................................................................................
...............................................................................................................................................[1]
11 A restaurant manager surveys a sample of customers to find their opinions on a proposed ban on
the use of mobile phones in her restaurant. Results are shown in the pictogram below.
Males
? ? ? ? ? ? ? ?
Females
? ? ? ? ? ?
(i) state the total number of customers who were in favour of a ban,
....................................................[1]
(ii) state how many fewer females were against a ban than in favour of it,
....................................................[1]
(iii) calculate the percentage of all the customers who had a decided opinion on the ban.
....................................................[3]
The results shown in the pictogram are to be represented in comparative pie charts drawn to
scale; one for males and one for females. The chart for males has already been drawn for you on
the opposite page.
Undecided
In favour
Against
[5]
(v) By comparing the pie charts, write down three conclusions that may be drawn from the survey.
1 ................................................................................................................................................
2 ................................................................................................................................................
3 ............................................................................................................................................[3]
.......................................................................................................................................[1]
.......................................................................................................................................[1]
(vii) State, for the restaurant manager’s survey, which of these types of question she seems to
have used. Explain your answer.
...................................................................................................................................................
...............................................................................................................................................[1]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2017
MARK SCHEME
Maximum Mark: 100
Published
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of the
examination. It shows the basis on which Examiners were instructed to award marks. It does not indicate the
details of the discussions that took place at an Examiners’ meeting before marking began, which would have
considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner Report for
Teachers.
Cambridge International will not enter into discussions about these mark schemes.
Cambridge International is publishing the mark schemes for the October/November 2017 series for most
Cambridge IGCSE®, Cambridge International A and AS Level components and some Cambridge O Level
components.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
Partial
Question Answer Marks
Marks
1(a)(i) quota 1 B1
1(a)(iii) systematic 1 B1
Partial
Question Answer Marks
Marks
27 A1
2(i)(b) 28 1 B1
26.3 A1
Partial
Question Answer Marks
Marks
3(i) 10 1 B1
3(ii) 5 1 B1
26 A1
3(iv) 9 1 B1
3(v) 6 1 B1
Partial
Question Answer Marks
Marks
4(i)(a) morning 1 B1
4(i)(b) afternoon 1 B1
85 A1
4(iii) (4 × 60)/5.33 oe 2 M1
45 (minutes) A1
Partial
Question Answer Marks
Marks
5(i)(a) 10 1 B1
5(i)(b) 9 1 B1
5(i)(c) 6 1 B1
5(ii) (5 × 1) + (4 × 2) + (10 × 3) + (3 × 4) 2 M1
55 A1
Partial
Question Answer Marks
Marks
6(i)(a) (0.92)2 2 M1
0.8464 oe (529/625) A1
0.0736 oe (46/625) A1
0.003 oe (3/1000) A1
6(ii) 134/0.08 2 M1
1675 A1
Partial
Question Answer Marks
Marks
7(i) any one of very good, good or modest group rate multiplied 4 M1
by standard population figure
84(%) A1
82.6(%) A1
7(iii) Japanese 2 B1
SPR largest B1
7(iv) Chemistry 2 B1
75.9(%) A1
Partial
Question Answer Marks
Marks
104.4 (m) A1
Partial
Question Answer Marks
Marks
107.6 (m) A1
7/228 oe (0.0307) A1
×3 M1
9/190 oe (0.0474) A1
21/95 oe (0.221) A1
Partial
Question Answer Marks
Marks
9(i)(a) 1.16 1 B1
use of IQR = Q3 – Q1 M1
0.17 A1
awrt 94 A1
9(ii)(a) 8 1 B1
1.18 A1
Partial
Question Answer Marks
Marks
9(v) IQR 1 B1
dispersion unchanged by same change in all population
elements/Q1, Q3 both decrease by same amount so
difference unchanged.
Partial
Question Answer Marks
Marks
10(iii) straight line through at least two of their plotted points in (ii) 4 B1
$40 A1
$70 A1
25(%) – 30(%) A1
Partial
Question Answer Marks
Marks
Partial
Question Answer Marks
Marks
11(i) 33 1 B1
11(ii) 8 1 B1
11(iii) 17 + 15 + 16 + 8 (= 56) 3 M1
80(%) A1
r = √[(30/40) × 3.52] M1
11(v) in favour: 3 B1
smaller proportion of males/greater proportion of females
against: B1
greater proportion of males/smaller proportion of females
undecided: B1
proportion of males and females same/approx same
11(vii) closed 1 B1
apparently customers were restricted to only three possible
responses
STATISTICS 4040/13
Paper 1 October/November 2017
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Write your Centre number, candidate number and name on all the work you hand in.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
Answer all questions in Section A and not more than four questions from Section B.
If working is needed for any question it must be shown below that question.
The use of an electronic calculator is expected in this paper.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
DC (SC) 151779
© UCLES 2017 [Turn over
2
1 (a) State, for each of the following, the name of a method of sampling in which
....................................................[1]
(ii) every item in the population is numbered, and a random number table or generator is
used to select every item in the sample,
....................................................[1]
(iii) every item in the population is numbered, and a random number table or generator is
used to select only the first item in the sample.
....................................................[1]
(b) Consider the following statement, from which two statistical terms have been omitted.
‘When selecting a sample from a population, a researcher should, wherever possible, ensure
that the sampling method is free from ......................................... , and that the sample is
Insert the appropriate terms into the spaces to complete the statement. [1]
2 The maximum temperature each week in a town was recorded over a ten-week period.
(i) The following values, in °C, rounded to the nearest integer, were obtained.
23 24 22 26 28 23 28 32 29 28
....................................................[2]
....................................................[1]
....................................................[2]
The temperatures in the town were originally recorded correct to one decimal place.
(ii) Using the original values, it might have been impossible to find one of the measures named in
part (i).
...................................................................................................................................................
...............................................................................................................................................[1]
3 A holiday company organises safari tours to African countries. The diagram below shows the
number of the company’s guides who have taken tours to one or more of the countries Tanzania,
Botswana and Kenya.
Tanzania
2 5
1
4 3 8
Botswana Kenya
Use this information to find the number of guides who have taken tours to
(i) Botswana,
....................................................[1]
....................................................[1]
....................................................[2]
All the guides who have taken tours to Botswana, but only these guides, have also taken tours to
Zimbabwe.
Of the four countries Tanzania, Botswana, Kenya and Zimbabwe, find the number of guides who
have taken tours to
....................................................[1]
....................................................[1]
4 A dental surgery is open for six days each week, and holds appointments each day in three
sessions: morning, afternoon, and evening.
The table below shows measures for the number of appointments held in each session during one
particular week.
(i) State, for which one of the sessions morning, afternoon or evening, the number of
appointments was generally
(a) largest,
....................................................[1]
....................................................[1]
(ii) Find the total number of appointments held in the surgery during this week.
....................................................[2]
(iii) Assuming that appointments are held continuously throughout the session, estimate the
length of time of an afternoon appointment, on average, during this particular week. Give your
answer in minutes.
....................................................[2]
5 The table below summarises the performance of a hockey team for every match played in one
season. For each match there are two pieces of information: the number of goals scored and the
number of goals conceded.
For example, there were three matches in which the team scored 4 goals and conceded 2 goals.
....................................................[1]
....................................................[1]
....................................................[1]
(ii) Calculate the total number of goals conceded by the team in the season.
....................................................[2]
(iii) Explain why it is not possible to calculate, from the table above, the total number of goals
scored by the team in the season.
...................................................................................................................................................
...............................................................................................................................................[1]
6 Candidates who enter for a proficiency certificate with an examining board must submit a long
essay, with specified minimum and maximum word limits, in their subject of study. From its
records, the board knows that, for any subject, 3% of the essays submitted are underlength, 5%
are overlength, and the remainder are of the specified length.
(i) A manager at the board selects at random an essay submitted in History and an essay
submitted in Sociology.
....................................................[2]
(b) the essay in History is of the specified length, but the essay in Sociology is not,
....................................................[2]
(c) one of the essays is underlength and the other essay is overlength.
....................................................[2]
(ii) In one particular year, 134 candidates submitted an essay that was not of the specified length.
....................................................[2]
7 In this question all pass rates, whether given or to be found, are expressed, or are to be
expressed, as percentages.
At Lernalott School the academic ability of all enrolled A Level pupils, based on O Level
performance, is recorded as one of outstanding, very good, good or modest.
The table below gives information on the number of enrolments and the pass rate in A Level
Biology at the school, together with the standard population of enrolments for all A Level subjects
at the school.
....................................................[4]
....................................................[5]
The table below gives information on pass rates, over the same period of time, for pupils in other
A Level subjects at Lernalott School.
Number of Standardised
Subject Crude pass rate (%)
enrolments pass rate (%)
Chemistry 28 84.7 85.7
Physics 18 75.0 72.2
Chinese 25 73.3 80.0
Japanese 12 88.0 66.7
English 17 65.0 76.5
(iii) State, with a reason, in which of these five subjects the highest quality of teaching appears to
have been provided.
...................................................................................................................................................
...............................................................................................................................................[2]
(iv) State in which of these five subjects the highest number of passes was obtained. Explain how
this can be known without further calculation.
...................................................................................................................................................
...............................................................................................................................................[2]
In publicity material the school gives crude pass rates for Arts, Sciences, Languages etc. (rather
than for individual subjects).
(v) Calculate the crude pass rate for the languages Chinese, Japanese and English combined.
....................................................[3]
8 The following table summarises the heights of the 25 tallest buildings in a city.
....................................................[3]
(ii) On the following grid, draw a histogram to illustrate the data in the table above. The rectangle
representing the 80 – under 90 class has already been drawn for you.
10
Number of 6
buildings
per
10 metres 4
0
70 80 90 100 110 120 130 140 150
Building height (metres)
[4]
In the city a new building is being constructed which will have a height of 165 metres after
completion.
(iii) Estimate the new mean height of the 25 tallest buildings in the city after the completion of this
building.
....................................................[2]
One of the buildings has 20 floors (levels) of three different types: 4 floors have only shops,
9 floors have only offices, and the remainder have only apartments.
Three of the 20 floors are selected at random for routine safety checks.
....................................................[2]
(v) two floors have shops and one floor has offices,
....................................................[3]
....................................................[2]
9 A particular supermarket sells 64 different types of bread. The salt content of these types, in grams
per 100 grams of bread (g/100 g), is illustrated in the cumulative frequency curve below.
70
60
50
40
Cumulative
frequency
(types of
bread) 30
20
10
0
0 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
Salt content (g/100 g)
(c) the value of p, if the pth percentile of the salt content is 1.35 g/100 g.
....................................................[2]
© UCLES 2017 4040/13/O/N/17
13
For a healthy diet, the government recommends a maximum salt content for bread of 1.0 g/100 g.
....................................................[1]
(b) the median salt content of those which do not meet the government’s recommendation.
A government food inspector, visiting this supermarket, selects four different types of bread at
random.
....................................................[3]
....................................................[2]
Later, the supermarket reduces the salt content of all types of bread by 0.05 g/100 g.
(v) State, explaining your answer, which of the measures found in part (i) will be unchanged. You
are not required to recalculate the measures.
...................................................................................................................................................
...............................................................................................................................................[1]
10 Pedro is a market trader who displays no prices on the items he offers for sale. When a potential
customer shows interest in an item, Pedro states his asking price. Usually, following bargaining, if
he sells the item it will be for a different price. His daughter Manuela (a statistics student) observes
eight transactions her father makes, and in each case records the asking price and the actual
selling price. Her results are shown in the following table.
Item A B C D E F G H
Asking price, x ($) 15 35 40 75 10 65 60 25
Actual selling price, y ($) 12 20 40 55 8 45 35 20
60
50
40
Actual
selling
30
price
($)
20
10
0 x
0 10 20 30 40 50 60 70 80
Asking price ($)
[2]
The data have an overall mean of (40.625, 29.375) and an upper semi-average of (60, 43.75).
(ii) Find the lower semi-average, and plot this and the two given averages on your graph.
[3]
(iii) Use your plotted averages to draw a line of best fit, and find its equation in the form
y = mx + c .
....................................................[4]
(iv) Use your line to estimate, to the nearest $5, for other transactions,
(a) the actual selling price of an item with an asking price of $55,
....................................................[2]
(b) the asking price for an item which Pedro wishes to sell for $50.
....................................................[2]
Manuela observes from her line of best fit that, overall, Pedro’s actual selling price is
approximately k% less than the asking price.
....................................................[2]
(vi) State, with a reason, for which of the items A – H Pedro will be most satisfied with the business
transacted.
...................................................................................................................................................
...............................................................................................................................................[1]
11 A restaurant manager surveys a sample of customers to find their opinions on a proposed ban on
the use of mobile phones in her restaurant. Results are shown in the pictogram below.
Males
? ? ? ? ? ? ? ?
Females
? ? ? ? ? ?
(i) state the total number of customers who were in favour of a ban,
....................................................[1]
(ii) state how many fewer females were against a ban than in favour of it,
....................................................[1]
(iii) calculate the percentage of all the customers who had a decided opinion on the ban.
....................................................[3]
The results shown in the pictogram are to be represented in comparative pie charts drawn to
scale; one for males and one for females. The chart for males has already been drawn for you on
the opposite page.
Undecided
In favour
Against
[5]
(v) By comparing the pie charts, write down three conclusions that may be drawn from the survey.
1 ................................................................................................................................................
2 ................................................................................................................................................
3 ............................................................................................................................................[3]
.......................................................................................................................................[1]
.......................................................................................................................................[1]
(vii) State, for the restaurant manager’s survey, which of these types of question she seems to
have used. Explain your answer.
...................................................................................................................................................
...............................................................................................................................................[1]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/13
Paper 1 October/November 2017
MARK SCHEME
Maximum Mark: 100
Published
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of the
examination. It shows the basis on which Examiners were instructed to award marks. It does not indicate the
details of the discussions that took place at an Examiners’ meeting before marking began, which would have
considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner Report for
Teachers.
Cambridge International will not enter into discussions about these mark schemes.
Cambridge International is publishing the mark schemes for the October/November 2017 series for most
Cambridge IGCSE®, Cambridge International A and AS Level components and some Cambridge O Level
components.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
Partial
Question Answer Marks
Marks
1(a)(i) quota 1 B1
1(a)(iii) systematic 1 B1
Partial
Question Answer Marks
Marks
27 A1
2(i)(b) 28 1 B1
26.3 A1
Partial
Question Answer Marks
Marks
3(i) 10 1 B1
3(ii) 5 1 B1
26 A1
3(iv) 9 1 B1
3(v) 6 1 B1
Partial
Question Answer Marks
Marks
4(i)(a) morning 1 B1
4(i)(b) afternoon 1 B1
85 A1
4(iii) (4 × 60)/5.33 oe 2 M1
45 (minutes) A1
Partial
Question Answer Marks
Marks
5(i)(a) 10 1 B1
5(i)(b) 9 1 B1
5(i)(c) 6 1 B1
5(ii) (5 × 1) + (4 × 2) + (10 × 3) + (3 × 4) 2 M1
55 A1
Partial
Question Answer Marks
Marks
6(i)(a) (0.92)2 2 M1
0.8464 oe (529/625) A1
0.0736 oe (46/625) A1
0.003 oe (3/1000) A1
6(ii) 134/0.08 2 M1
1675 A1
Partial
Question Answer Marks
Marks
7(i) any one of very good, good or modest group rate multiplied 4 M1
by standard population figure
84(%) A1
82.6(%) A1
7(iii) Japanese 2 B1
SPR largest B1
7(iv) Chemistry 2 B1
75.9(%) A1
Partial
Question Answer Marks
Marks
104.4 (m) A1
Partial
Question Answer Marks
Marks
107.6 (m) A1
7/228 oe (0.0307) A1
×3 M1
9/190 oe (0.0474) A1
21/95 oe (0.221) A1
Partial
Question Answer Marks
Marks
9(i)(a) 1.16 1 B1
use of IQR = Q3 – Q1 M1
0.17 A1
awrt 94 A1
9(ii)(a) 8 1 B1
1.18 A1
Partial
Question Answer Marks
Marks
9(v) IQR 1 B1
dispersion unchanged by same change in all population
elements/Q1, Q3 both decrease by same amount so
difference unchanged.
Partial
Question Answer Marks
Marks
10(iii) straight line through at least two of their plotted points in (ii) 4 B1
$40 A1
$70 A1
25(%) – 30(%) A1
Partial
Question Answer Marks
Marks
Partial
Question Answer Marks
Marks
11(i) 33 1 B1
11(ii) 8 1 B1
11(iii) 17 + 15 + 16 + 8 (= 56) 3 M1
80(%) A1
r = √[(30/40) × 3.52] M1
11(v) in favour: 3 B1
smaller proportion of males/greater proportion of females
against: B1
greater proportion of males/smaller proportion of females
undecided: B1
proportion of males and females same/approx same
11(vii) closed 1 B1
apparently customers were restricted to only three possible
responses
STATISTICS 4040/12
Paper 1 October/November 2018
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Electronic calculator
Write your Centre number, candidate number and name in the spaces at the top of this page.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
The total number of marks for this paper is 100.
DC (ST/SG) 151388/1
© UCLES 2018 [Turn over
2
survey,
census,
random,
systematic,
quota,
and stratified.
In each of the following statements one of these terms has been omitted.
Complete each statement by inserting the appropriate term.
(a) If a population is made up of different categories, and a sample is required which will be
(b) When selecting a sample from a population, in order to ensure that the sampling method is
conducted. [1]
(d) When selecting a sample from a population without the use of a sampling frame,
2 In a particular town there are often cuts in the electrical power supply. The following data shows,
for a period of 8 weeks, the number of days in each week on which there were power cuts.
2 5 2 0 2 0 7 4
(a) For this period, find the mean number of days per week on which
...................................................[2]
(ii) there were no power cuts.
...................................................[1]
(b) For this period, find the total number of days on which there were no power cuts.
...................................................[1]
3 The guest rooms in a hotel are of either standard or superior quality, are either non-smoking or
smoking, and have either a city view or a mountain view.
The diagram below shows the number of these rooms which are one or more of standard quality,
non-smoking, and city view.
Standard
9 5
10
6 7 3
(a) Use this information to find the number of guest rooms which are
(i) non-smoking,
...................................................[1]
...................................................[1]
...................................................[1]
...................................................[1]
(b) State precisely the extra information which you would need to find the total number of guest
rooms in the hotel.
...................................................................................................................................................
...............................................................................................................................................[1]
4 At a telephone company, statistical measures are calculated for the lengths of an equally large
number of telephone calls made by each of two people. The following values, in minutes, are
obtained.
...................................................[1]
...................................................[1]
...................................................[1]
...................................................[1]
A trainee statistician at the company inspects the measures and makes the following statement to
her supervisor.
‘Because the median length of a call from Person A is greater than the median length of a call from
Person B, the total length of all the calls from Person A must be greater than the total length of all
the calls from Person B.’
...................................................................................................................................................
...............................................................................................................................................[1]
5 In a city there are 15 tennis clubs. The following table shows the number of indoor and outdoor
courts at each of these clubs.
For example, there are 3 clubs with 4 outdoor courts and 2 indoor courts.
...................................................[1]
(ii) at least 4 outdoor courts.
...................................................[1]
(b) Calculate the total number of outdoor courts at the city’s tennis clubs.
...................................................[2]
(c) Complete the following table to show the frequency distribution of the total number of courts
per club.
Total number of
Number of clubs, f
courts, x
2 1
[2]
(d) A national tennis coach, investigating the facilities available in different cities, requests from
each city information on the number of courts at clubs in the city, summarised in one table.
State, giving a reason, whether you would expect the coach to prefer this to be in the form of
the first or second table in this question.
...................................................................................................................................................
...............................................................................................................................................[1]
6 The main political parties in a country are the Modern Left, the Reform Party, and the Central
Union. In an opinion poll before an election, 1800 voters were questioned on which party they
supported. Results are shown in the following pie chart, of radius 4 cm, which is drawn to scale.
Other or
undecided
Modern
Left
Central
Union
Reform
Party
...................................................[2]
...................................................[2]
(c) If a comparative pie chart were drawn to show the results of this later poll, find, correct to
1 decimal place, its radius.
...................................................[3]
7 Warona buys a pack of 7 toothbrushes, of which 1 is red, 2 are green, and 4 are blue. She opens
the pack, selects a toothbrush at random, and starts using it.
...................................................[1]
Warona’s dentist advises that one toothbrush should be used for 3 months, then thrown away.
Warona follows the advice, selecting a new toothbrush at random from those remaining in the
pack each time she throws away a used toothbrush.
(b) Given that the first toothbrush selected was blue, find the probability that the toothbrushes
being used 8 months and 11 months after opening the pack are the same colour as each
other.
...................................................[7]
8 Bruce is a nutritionist. He collects information on the daily energy intake from food and drink of
volunteer groups of 60 men and 60 women. His results, in Calories (a continuous variable), are
illustrated in the cumulative frequency curves below.
60
50
Women
Men
40
Cumulative
frequency
(volunteers)
30
20
10
0
0 1800 2000 2200 2400 2600 2800 3000 3200
Daily energy intake (Calories)
(a) Use the graph to estimate, for the daily energy intake of the men,
(b) Use the graph to estimate, for the daily energy intake of the women,
...................................................[2]
Bruce observes from the graph that the interquartile range of the daily energy intake for women is
smaller than that for men.
(c) Explain how he is able to make this observation without calculating the interquartile range of
both groups of volunteers.
...................................................................................................................................................
...............................................................................................................................................[1]
Bruce’s rules for a healthy diet are a maximum daily energy intake of 2500 Calories for men, and
2000 Calories for women.
(d) Estimate the number of these men and women whose diet Bruce judged to be not healthy.
The volunteers themselves had recorded energy intake from their own estimates of the amounts
of food and drink they had consumed.
Bruce’s work colleague Sheila claims that, generally, women record values accurately, but men
underestimate values by about 300 Calories.
(e) Estimate the number of these men whose diet Bruce should have judged to be not healthy, if
Sheila’s claim is correct.
...................................................[2]
9 In this question calculate all injury rates per thousand. Where values do not work out
exactly give your answers to 1 decimal place.
The table below gives information on the number of employees, and the number of injuries they
suffered, in the Fire Service of a city in 2016. It also shows the standard population for the Fire
Services of the area in which the city is situated.
(a) Calculate the crude injury rate for this Fire Service.
...................................................[4]
(b) Calculate the injury rate for each job group and insert the values in the table above.
[2]
(c) Use your results from part (b) to calculate the standardised injury rate for this Fire Service.
...................................................[4]
(d) Explain, by reference to the information given in the table and the values calculated in part (b),
why the standardised injury rate is smaller than the crude injury rate in this case.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[3]
Some of the injuries suffered by employees required hospital treatment, and they all occurred in
the firefighter groups. Information on these injuries is shown in the table below.
(e) Find the job group in which an injured employee was most likely to require hospital treatment.
Justify your answer.
...................................................................................................................................................
...............................................................................................................................................[2]
10 The loggerhead sea turtle is an endangered species. To increase the chances of survival of baby
turtles, a nursery (conservation centre) collects them from the beach when the turtle eggs hatch,
and cares for them until they are older.
The following table summarises the masses, in grams, of a sample of baby turtles collected by the
nursery.
Number of
Mass (g)
turtles
15 – under 17 4
17 – under 18 7
18 – under 19 12
19 – under 20 15
20 – under 22 9
22 – under 25 3
(a) Estimate, in grams, the mean and standard deviation of these masses. Give your answers to
3 significant figures.
Mean = ......................................................
(b) On the grid below draw a histogram to illustrate the data in this table. The rectangle
representing the 235 – under 245 class has already been drawn for you.
25
20
15
Number
of turtles
per 10 g
10
0
190 200 210 220 230 240 250 260 270
Mass (g)
[4]
The nursery’s policy is to release a turtle when its mass has become a certain multiple, k, of the
mass when collected.
(c) Explain why k cannot be found accurately from the two tables of data above.
...................................................................................................................................................
...............................................................................................................................................[1]
(d) Use the mid-class values of the modal classes from the two tables of data above to estimate
k to the nearest integer.
...................................................[3]
© UCLES 2018 4040/12/O/N/18 [Turn over
16
11 An English teacher gives the 26 pupils present in her class a spelling test. She reads aloud ten
words of different lengths and the pupils write them down. The following table summarises the
results.
Word length, x
3 3 4 6 6 8 9 9 10 12
(number of letters)
Number of pupils with
24 23 21 19 18 17 15 13 11 9
correct spelling, y
y
30
25
20
Number
of pupils
with 15
correct
spelling
10
0
0 2 4 6 8 10 12 14 x
...............................................................................................................................................[2]
The data have an overall mean of (7, 17) and a lower semi-average of (4.4, 21).
(c) Find the upper semi-average and plot this and the two given averages on your graph.
[3]
(d) Use your plotted averages to draw a line of best fit, and find its equation in the form
y = mx + c .
...................................................[4]
(e) Use your line to estimate the number of pupils who would spell a word with 5 letters correctly.
...................................................[2]
The teacher also reads aloud the following three-word sentence and the 26 pupils write it down.
(f) Estimate the probability that the pupil’s answer contains the correct spelling of all three words.
...................................................[3]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/12
Paper 1 October/November 2018
MARK SCHEME
Maximum Mark: 100
Published
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of the
examination. It shows the basis on which Examiners were instructed to award marks. It does not indicate the
details of the discussions that took place at an Examiners’ meeting before marking began, which would have
considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner Report for
Teachers.
Cambridge International will not enter into discussions about these mark schemes.
Cambridge International is publishing the mark schemes for the October/November 2018 series for most
Cambridge IGCSE™, Cambridge International A and AS Level components and some Cambridge O Level
components.
These general marking principles must be applied by all examiners when marking candidate answers.
They should be applied alongside the specific content of the mark scheme or generic level descriptors
for a question. Each question paper and mark scheme will also comply with these marking principles.
• the specific content of the mark scheme or the generic level descriptors for the question
• the specific skills defined in the mark scheme or in the generic level descriptors for the question
• the standard of response required by a candidate as exemplified by the standardisation scripts.
Marks awarded are always whole marks (not half marks, or other fractions).
• marks are awarded for correct/valid answers, as defined in the mark scheme. However, credit
is given for valid answers which go beyond the scope of the syllabus and mark scheme,
referring to your Team Leader as appropriate
• marks are awarded when candidates clearly demonstrate what they know and can do
• marks are not deducted for errors
• marks are not deducted for omissions
• answers should only be judged on the quality of spelling, punctuation and grammar when these
features are specifically assessed by the question as indicated by the mark scheme. The
meaning, however, should be unambiguous.
Rules must be applied consistently e.g. in situations where candidates have not followed
instructions or in the application of generic level descriptors.
Marks should be awarded using the full range of marks defined in the mark scheme for the question
(however; the use of the full mark range may be limited according to the quality of the candidate
responses seen).
Marks awarded are based solely on the requirements as defined in the mark scheme. Marks should
not be awarded with grade thresholds or grade descriptors in mind.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
Partial
Question Answer Marks
Marks
1(a) stratified 1 B1
1(b) random 1 B1
1(c) census 1 B1
1(d) quota 1 B1
Partial
Question Answer Marks
Marks
2.75 A1
2(b) 34 1 B1
Partial
Question Answer Marks
Marks
3(a)(i) 32 1 B1
3(a)(ii) 19 1 B1
3(a)(iii) 13 1 B1
3(a)(iv) 3 1 B1
Partial
Question Answer Marks
Marks
4(a)(i)(a) 75[%] 1 B1
4(a)(i)(b) 60[%] 1 B1
4(a)(ii)(a) 15[%] 1 B1
4(a)(ii)(b) 75[%] 1 B1
Partial
Question Answer Marks
Marks
5(a)(i) 7 1 B1
5(a)(ii) 13 1 B1
79 A1
corresponding f values B1
[1], 1, 2, [0], 4, [0], 3, 2, 1, [0], 1
Partial
Question Answer Marks
Marks
550 A1
1350 A1
5.2 [cm] A1
Partial
Question Answer Marks
Marks
7(a) 4/7 1 B1
4/15 oe (0.267) A1
Partial
Question Answer Marks
Marks
8(a)(i) 2620 1 B1
use of IQR = Q3 – Q1 M1
400 A1
2560 A1
65 A1
8(c) central part of graph for women is steeper than that for men 1 B1
[indicating Q1, Q3 closer together] oe
Partial
Question Answer Marks
Marks
55 A1
Partial
Question Answer Marks
Marks
9(a) 0 + 3 + 91 + 7 + 8 (= 109) 4 M1
12 + 50 + 260 + 70 + 25 (= 417) M1
261.4 A1
240.7 A1
part-time firefighter A1
Partial
Question Answer Marks
Marks
19.2 or 19.21 A1
1.74 – 1.75 A1
240/19.5 A1
12 A1
Partial
Question Answer Marks
Marks
11(b) strong 2 B1
negative B1
Partial
Question Answer Marks
Marks
20 ft A1
0.155 oe A1
STATISTICS 4040/13
Paper 1 October/November 2018
2 hours 15 minutes
Candidates answer on the Question Paper.
Additional Materials: Pair of compasses
Protractor
Electronic calculator
Write your Centre number, candidate number and name in the spaces at the top of this page.
Write in dark blue or black pen.
You may use an HB pencil for any diagrams or graphs.
Do not use staples, paper clips, glue or correction fluid.
DO NOT WRITE IN ANY BARCODES.
At the end of the examination, fasten all your work securely together.
The number of marks is given in brackets [ ] at the end of each question or part question.
The total number of marks for this paper is 100.
DC (KS) 164028
© UCLES 2018 [Turn over
2
survey,
census,
random,
systematic,
quota,
and stratified.
In each of the following statements one of these terms has been omitted.
Complete each statement by inserting the appropriate term.
(a) If a population is made up of different categories, and a sample is required which will be
(b) When selecting a sample from a population, in order to ensure that the sampling method is
conducted. [1]
(d) When selecting a sample from a population without the use of a sampling frame,
2 In a particular town there are often cuts in the electrical power supply. The following data shows,
for a period of 8 weeks, the number of days in each week on which there were power cuts.
2 5 2 0 2 0 7 4
(a) For this period, find the mean number of days per week on which
...................................................[2]
(ii) there were no power cuts.
...................................................[1]
(b) For this period, find the total number of days on which there were no power cuts.
...................................................[1]
3 The guest rooms in a hotel are of either standard or superior quality, are either non-smoking or
smoking, and have either a city view or a mountain view.
The diagram below shows the number of these rooms which are one or more of standard quality,
non-smoking, and city view.
Standard
9 5
10
6 7 3
(a) Use this information to find the number of guest rooms which are
(i) non-smoking,
...................................................[1]
...................................................[1]
...................................................[1]
...................................................[1]
(b) State precisely the extra information which you would need to find the total number of guest
rooms in the hotel.
...................................................................................................................................................
...............................................................................................................................................[1]
4 At a telephone company, statistical measures are calculated for the lengths of an equally large
number of telephone calls made by each of two people. The following values, in minutes, are
obtained.
...................................................[1]
...................................................[1]
...................................................[1]
...................................................[1]
A trainee statistician at the company inspects the measures and makes the following statement to
her supervisor.
‘Because the median length of a call from Person A is greater than the median length of a call from
Person B, the total length of all the calls from Person A must be greater than the total length of all
the calls from Person B.’
...................................................................................................................................................
...............................................................................................................................................[1]
5 In a city there are 15 tennis clubs. The following table shows the number of indoor and outdoor
courts at each of these clubs.
For example, there are 3 clubs with 4 outdoor courts and 2 indoor courts.
...................................................[1]
(ii) at least 4 outdoor courts.
...................................................[1]
(b) Calculate the total number of outdoor courts at the city’s tennis clubs.
...................................................[2]
(c) Complete the following table to show the frequency distribution of the total number of courts
per club.
Total number of
Number of clubs, f
courts, x
2 1
[2]
(d) A national tennis coach, investigating the facilities available in different cities, requests from
each city information on the number of courts at clubs in the city, summarised in one table.
State, giving a reason, whether you would expect the coach to prefer this to be in the form of
the first or second table in this question.
...................................................................................................................................................
...............................................................................................................................................[1]
6 The main political parties in a country are the Modern Left, the Reform Party, and the Central
Union. In an opinion poll before an election, 1800 voters were questioned on which party they
supported. Results are shown in the following pie chart, of radius 4 cm, which is drawn to scale.
Other or
undecided
Modern
Left
Central
Union
Reform
Party
...................................................[2]
...................................................[2]
(c) If a comparative pie chart were drawn to show the results of this later poll, find, correct to
1 decimal place, its radius.
...................................................[3]
7 Warona buys a pack of 7 toothbrushes, of which 1 is red, 2 are green, and 4 are blue. She opens
the pack, selects a toothbrush at random, and starts using it.
...................................................[1]
Warona’s dentist advises that one toothbrush should be used for 3 months, then thrown away.
Warona follows the advice, selecting a new toothbrush at random from those remaining in the
pack each time she throws away a used toothbrush.
(b) Given that the first toothbrush selected was blue, find the probability that the toothbrushes
being used 8 months and 11 months after opening the pack are the same colour as each
other.
...................................................[7]
8 Bruce is a nutritionist. He collects information on the daily energy intake from food and drink of
volunteer groups of 60 men and 60 women. His results, in Calories (a continuous variable), are
illustrated in the cumulative frequency curves below.
60
50
Women
Men
40
Cumulative
frequency
(volunteers)
30
20
10
0
0 1800 2000 2200 2400 2600 2800 3000 3200
Daily energy intake (Calories)
(a) Use the graph to estimate, for the daily energy intake of the men,
(b) Use the graph to estimate, for the daily energy intake of the women,
...................................................[2]
Bruce observes from the graph that the interquartile range of the daily energy intake for women is
smaller than that for men.
(c) Explain how he is able to make this observation without calculating the interquartile range of
both groups of volunteers.
...................................................................................................................................................
...............................................................................................................................................[1]
Bruce’s rules for a healthy diet are a maximum daily energy intake of 2500 Calories for men, and
2000 Calories for women.
(d) Estimate the number of these men and women whose diet Bruce judged to be not healthy.
The volunteers themselves had recorded energy intake from their own estimates of the amounts
of food and drink they had consumed.
Bruce’s work colleague Sheila claims that, generally, women record values accurately, but men
underestimate values by about 300 Calories.
(e) Estimate the number of these men whose diet Bruce should have judged to be not healthy, if
Sheila’s claim is correct.
...................................................[2]
9 In this question calculate all injury rates per thousand. Where values do not work out
exactly give your answers to 1 decimal place.
The table below gives information on the number of employees, and the number of injuries they
suffered, in the Fire Service of a city in 2016. It also shows the standard population for the Fire
Services of the area in which the city is situated.
(a) Calculate the crude injury rate for this Fire Service.
...................................................[4]
(b) Calculate the injury rate for each job group and insert the values in the table above.
[2]
(c) Use your results from part (b) to calculate the standardised injury rate for this Fire Service.
...................................................[4]
(d) Explain, by reference to the information given in the table and the values calculated in part (b),
why the standardised injury rate is smaller than the crude injury rate in this case.
...................................................................................................................................................
...................................................................................................................................................
...............................................................................................................................................[3]
Some of the injuries suffered by employees required hospital treatment, and they all occurred in
the firefighter groups. Information on these injuries is shown in the table below.
(e) Find the job group in which an injured employee was most likely to require hospital treatment.
Justify your answer.
...................................................................................................................................................
...............................................................................................................................................[2]
10 The loggerhead sea turtle is an endangered species. To increase the chances of survival of baby
turtles, a nursery (conservation centre) collects them from the beach when the turtle eggs hatch,
and cares for them until they are older.
The following table summarises the masses, in grams, of a sample of baby turtles collected by the
nursery.
Number of
Mass (g)
turtles
15 – under 17 4
17 – under 18 7
18 – under 19 12
19 – under 20 15
20 – under 22 9
22 – under 25 3
(a) Estimate, in grams, the mean and standard deviation of these masses. Give your answers to
3 significant figures.
Mean = ......................................................
(b) On the grid below draw a histogram to illustrate the data in this table. The rectangle
representing the 235 – under 245 class has already been drawn for you.
25
20
15
Number
of turtles
per 10 g
10
0
190 200 210 220 230 240 250 260 270
Mass (g)
[4]
The nursery’s policy is to release a turtle when its mass has become a certain multiple, k, of the
mass when collected.
(c) Explain why k cannot be found accurately from the two tables of data above.
...................................................................................................................................................
...............................................................................................................................................[1]
(d) Use the mid-class values of the modal classes from the two tables of data above to estimate
k to the nearest integer.
...................................................[3]
© UCLES 2018 4040/13/O/N/18 [Turn over
16
11 An English teacher gives the 26 pupils present in her class a spelling test. She reads aloud ten
words of different lengths and the pupils write them down. The following table summarises the
results.
Word length, x
3 3 4 6 6 8 9 9 10 12
(number of letters)
Number of pupils with
24 23 21 19 18 17 15 13 11 9
correct spelling, y
y
30
25
20
Number
of pupils
with 15
correct
spelling
10
0
0 2 4 6 8 10 12 14 x
...............................................................................................................................................[2]
The data have an overall mean of (7, 17) and a lower semi-average of (4.4, 21).
(c) Find the upper semi-average and plot this and the two given averages on your graph.
[3]
(d) Use your plotted averages to draw a line of best fit, and find its equation in the form
y = mx + c .
...................................................[4]
(e) Use your line to estimate the number of pupils who would spell a word with 5 letters correctly.
...................................................[2]
The teacher also reads aloud the following three-word sentence and the 26 pupils write it down.
(f) Estimate the probability that the pupil’s answer contains the correct spelling of all three words.
...................................................[3]
BLANK PAGE
BLANK PAGE
BLANK PAGE
Permission to reproduce items where third-party owned material protected by copyright is included has been sought and cleared where possible. Every
reasonable effort has been made by the publisher (UCLES) to trace copyright holders, but if any items requiring clearance have unwittingly been included, the
publisher will be pleased to make amends at the earliest possible opportunity.
To avoid the issue of disclosure of answer-related information to candidates, all copyright acknowledgements are reproduced online in the Cambridge International
Examinations Copyright Acknowledgements Booklet. This is produced for each series of examinations and is freely available to download at www.cie.org.uk after
the live examination series.
Cambridge International Examinations is part of the Cambridge Assessment Group. Cambridge Assessment is the brand name of University of Cambridge Local
Examinations Syndicate (UCLES), which is itself a department of the University of Cambridge.
STATISTICS 4040/13
Paper 1 October/November 2018
MARK SCHEME
Maximum Mark: 100
Published
This mark scheme is published as an aid to teachers and candidates, to indicate the requirements of the
examination. It shows the basis on which Examiners were instructed to award marks. It does not indicate the
details of the discussions that took place at an Examiners’ meeting before marking began, which would have
considered the acceptability of alternative answers.
Mark schemes should be read in conjunction with the question paper and the Principal Examiner Report for
Teachers.
Cambridge International will not enter into discussions about these mark schemes.
Cambridge International is publishing the mark schemes for the October/November 2018 series for most
Cambridge IGCSE™, Cambridge International A and AS Level components and some Cambridge O Level
components.
These general marking principles must be applied by all examiners when marking candidate answers.
They should be applied alongside the specific content of the mark scheme or generic level descriptors
for a question. Each question paper and mark scheme will also comply with these marking principles.
• the specific content of the mark scheme or the generic level descriptors for the question
• the specific skills defined in the mark scheme or in the generic level descriptors for the question
• the standard of response required by a candidate as exemplified by the standardisation scripts.
Marks awarded are always whole marks (not half marks, or other fractions).
• marks are awarded for correct/valid answers, as defined in the mark scheme. However, credit
is given for valid answers which go beyond the scope of the syllabus and mark scheme,
referring to your Team Leader as appropriate
• marks are awarded when candidates clearly demonstrate what they know and can do
• marks are not deducted for errors
• marks are not deducted for omissions
• answers should only be judged on the quality of spelling, punctuation and grammar when these
features are specifically assessed by the question as indicated by the mark scheme. The
meaning, however, should be unambiguous.
Rules must be applied consistently e.g. in situations where candidates have not followed
instructions or in the application of generic level descriptors.
Marks should be awarded using the full range of marks defined in the mark scheme for the question
(however; the use of the full mark range may be limited according to the quality of the candidate
responses seen).
Marks awarded are based solely on the requirements as defined in the mark scheme. Marks should
not be awarded with grade thresholds or grade descriptors in mind.
The following notes are intended to aid interpretation of mark schemes in general, but individual mark
schemes may include marks awarded for specific reasons outside the scope of these notes.
Types of mark
A Accuracy mark, awarded for a correct answer or intermediate step correctly obtained. For
accuracy marks to be given, the associated Method mark must be earned or implied.
When a part of a question has two or more ‘method’ steps, the M marks are in principle independent
unless the scheme specifically says otherwise; and similarly where there are several B marks
allocated. The notation ‘dep’ is used to indicate that a particular M or B mark is dependent on an
earlier, asterisked, mark in the scheme.
The symbol implies that the A or B mark indicated is allowed for work correctly following on from
previously incorrect results. Otherwise, A and B marks are given for correct work only.
Abbreviations
Partial
Question Answer Marks
Marks
1(a) stratified 1 B1
1(b) random 1 B1
1(c) census 1 B1
1(d) quota 1 B1
Partial
Question Answer Marks
Marks
2.75 A1
2(b) 34 1 B1
Partial
Question Answer Marks
Marks
3(a)(i) 32 1 B1
3(a)(ii) 19 1 B1
3(a)(iii) 13 1 B1
3(a)(iv) 3 1 B1
Partial
Question Answer Marks
Marks
4(a)(i)(a) 75[%] 1 B1
4(a)(i)(b) 60[%] 1 B1
4(a)(ii)(a) 15[%] 1 B1
4(a)(ii)(b) 75[%] 1 B1
Partial
Question Answer Marks
Marks
5(a)(i) 7 1 B1
5(a)(ii) 13 1 B1
79 A1
corresponding f values B1
[1], 1, 2, [0], 4, [0], 3, 2, 1, [0], 1
Partial
Question Answer Marks
Marks
550 A1
1350 A1
5.2 [cm] A1
Partial
Question Answer Marks
Marks
7(a) 4/7 1 B1
4/15 oe (0.267) A1
Partial
Question Answer Marks
Marks
8(a)(i) 2620 1 B1
use of IQR = Q3 – Q1 M1
400 A1
2560 A1
65 A1
8(c) central part of graph for women is steeper than that for men 1 B1
[indicating Q1, Q3 closer together] oe
Partial
Question Answer Marks
Marks
55 A1
Partial
Question Answer Marks
Marks
9(a) 0 + 3 + 91 + 7 + 8 (= 109) 4 M1
12 + 50 + 260 + 70 + 25 (= 417) M1
261.4 A1
240.7 A1
part-time firefighter A1
Partial
Question Answer Marks
Marks
19.2 or 19.21 A1
1.74 – 1.75 A1
240/19.5 A1
12 A1
Partial
Question Answer Marks
Marks
11(b) strong 2 B1
negative B1
Partial
Question Answer Marks
Marks
20 ft A1
0.155 oe A1