A Level Math Paper 2 Statistics
A Level Math Paper 2 Statistics
This a branch of mathematics dealing with collection, presentation, analysis and interpretation of
data
Types of data
(a) Discrete data
Its information collected by counting and usually takes integral values that do not lie within a
given range
(b) Continuous data
It is information that takes values within a given range
(viii) Percentiles
A percentile is a value that divides given values into 100 pats.
(ix) Deciles
A decile is a value that divides given values into 10 pats.
2, 1, 3, 4, 5, 6, 7, 8, 9, 10, 3, 4, 6, 8, 9, 6, 3, 2
(viii) Tenth percentile ( ) value where N is the sum of all the variables
( ) ( ) =18th value
From cf, 9
Example 2
The ages of eight students in a class are: 12, 13, 14, 15, 12, 17, 12, 13, 16, 12.
Find the;
x f cf fx x2 fx2
12 4 4 48 144 576
13 2 6 26 169 338
14 1 7 14 196 196
15 1 8 15 225 225
16 1 9 16 256 256
17 1 10 17 289 289
∑f =10 ∑fx = 136 ∑fx2=1,880
(b) Use the table to find
a. mean age
∑
mean = ∑
b. variance
∑ ∑
Var (X) = ∑
(∑ )
= – 13.62
= 3.04
f. Range = 17 – 12 = 5
g. Lower quartile, q1 = ( ) =( ) value, from cf, q1 = 12
( ) ( ) =9th value
From cf, 16
Example 3
The frequency distribution table shows the marks of some student from a certain school
x 45 63 65 66 70 72 75 80 88
f 3 5 6 4 6 2 1 2 1
Calculate standard deviation
Solution
x f fx fx2
45 3 135 6075
63 5 315 19845
65 6 390 25350
66 4 264 17424
70 6 420 29400
72 2 144 10368
75 1 75 5625
80 2 160 12800
88 1 88 7744
∑ 30 1991 ∑ 134631
Using assumed mean to get variance and standard deviation
∑ ∑
Var (X) = ∑
(∑ )
∑ ∑
s.d = √ ∑
(∑ )
∑ ∑
S.d = √ ∑
(∑ )
=√ ( )
=√ ( )
=√
= 9.12
Revision exercise 1
1. The data below represents the length of leaves in cm: 4.5 4.4, 6.2, 9.4, 8.2, 12.6, 10.0, 8.8, 3.8 and
13.6. find the;
(a) Mean
(b) Standard deviation
2. The concentration in m per litre of a trace element in 7 randomly chosen samples of water from
spring wells were: 240.8, 237.3, 236.6, 2333.9 and 232.5. Determine the mean and the variance
of the concentration of the trace elements per litre.
3. The table below shows the length of flowers from a certain plant to the nearest 0.5cm.
Length (cm) 7.5 8.0 8.5 9.0 9.5 10.0 10.5 11.0
Number of flowers 4 9 11 8 10 7 2 3
Find the:
(a) Mean
(b) Mode
(c) The median
(d) Standard deviation
4. The marks scored by 11 students in a test are:52, 61, 78, 49, 47, 79, 54, 58, 62, 73, 72
Find;
(a) Median,
(b) Mean,
(c) Interquartile range
(d) Semi-quartile range
5. The frequency distribution table shows the heights of some students at a certain school
Marks 10 11 12 13 14 15 16 17 18 19 20
Number of students 1 2 2 2 2 4 2 1 2 1 1
Find:
Solution
x x2
4.5 20.25
4.4 19.36
6.2 38.44
9.2 84.64
8.2 67.24
12.0 144
10.0 100
8.8 77.44
3.8 14.44
13.6 184.96
∑ 80.7 ∑ 750.77
∑
(a) Mean, ̅ =
∑
(b) s.d =√ ̅ =√ ( ) =3.155
2. The concentration in m per litre of a trace element in 7 randomly chosen samples of water from
spring wells were: 240.8, 237.3, 236.6, 2333.9 and 232.5. Determine the mean and the variance
of the concentration of the trace elements per litre.
Solution
∑𝑥
x x2 Mean, 𝑥̅ =
𝑛
240.8 57984.64 ∑𝑥
Var(x) = 𝑥̅
237.3 56311.29 𝑛
236.7 56026.89
=
234.2 54849.64
236.6 55979.56 = 55,702.5 – 55,696
233.9 54709.21
= 6.5
232.5 54056.25
∑ 1652 ∑ 389917.5
3. The table below shows the length of flowers from a certain plant to the nearest 0.5cm.
Length (cm) 7.5 8.0 8.5 9.0 9.5 10.0 10.5 11.0
Number of flowers 4 9 11 8 10 7 2 3
Find the:
(a) Mean
(b) Mode
(c) The median
(d) Standard deviation
Solution
x f fx fx2 cf ∑ 𝑓𝑥
(a)Mean = ∑𝑓
7.5 4 30 225 4
8.0 9 72 576 13
8.5 11 93.5 794.75 24
9.0 8 72 648 32 (b) mode = 8.5
9.5 10 95 902.5 42
𝑡
10.0 7 70 700 49 (c) Median = ( ) value
10.5 2 21 220.5 51
11.0 3 33 363 54 = 27th value
∑ 54 ∑ 486.5 ∑ 4429.75 From cf the 27th value = 9.0
∑
(d) s.d = √ ∑ ̅ =√ ̅ =1
4. The marks scored by 11 students in a test are:52, 61, 78, 49, 47, 79, 54, 58, 62, 73, 72
Find;
(a) Median
Arrange values in ascending order
47, 49, 52, 54, 58, 61, 62, 72,73,78,79
(a) Median = = 6th value = 61
(b) Mean,
∑ = 52+ 61+78 + 49 + 47 + 79 + 54 + 58 + 62 + 73 + 72 =685
∑
Mean = = 62.273
(b) Interquartile range
q1 = ( ) value = 2.75th value = 52
5. The frequency distribution table shows the heights of some students at a certain school
6. The table below shows the marks obtained by 20 students in a mathematics test marked out of
20
Marks 10 11 12 13 14 15 16 17 18 19 20
Number of students 1 2 2 2 2 4 2 1 2 1 1
Find:
Solution
x f cf fx fx2
∑ 𝑓𝑥 ∑ 𝑓𝑥
10 1 1 10 100 (a) s.d = √ ∑𝑓
(∑ )
𝑓
11 2 3 22 242
12 2 5 24 288 =√ ( )
13 2 7 26 338 = 2.722
14 2 9 28 392 (b) 60th percentile
15 4 13 60 900 𝑡
16 2 15 32 512 =( 𝑥 ) value
17 1 16 17 289
= 12th value from cf =15
18 2 18 36 648
19 1 19 19 361
20 1 20 20 400
∑ 20 ∑ 294 ∑ 4470
(d) q1 = ( ) value = 5th value from cf = 12
Solution
x f cf fx fx2
∑ 𝑓𝑑 ∑ 𝑓𝑑
4 3 3 12 48 (b) s.d = √ (∑ )
∑𝑓 𝑓
5 1 4 5 25
6 5 9 30 180
=√ ( ) =2.26
7 1 11 7 49
8 5 16 40 320 𝑡
(c) q1 = ( 𝑥 ) value
9 1 17 9 81
10 2 19 20 200 = 5th value from cf = 6
11 1 20 11 121 𝑡
q3 = ( 𝑥 ) value
12 1 21 12 144
∑ 20 ∑ 146 ∑ 1168 = 15th value, from cf = 8
Semi-interquartile range =
Terms used
(a) Class: these are limits of distribution. In the table above, the classes are: (20 – 29),
(30 – 39), (40 – 49), (50 – 59), (60 – 69), (70 – 79), (80 – 89).
(b) Class mark or mark (x)
(c) Frequency (f) = number of items in a class
This is the mid-point value of the class. It is normally denoted by x. in the table above, the
class marks are 24.5, 34.5, 44.5 …..
(d) Class boundary
These are continuous class limits. In the above table the first class boundary is (20-0.5) –
(29 +0.5). In this case, the lower class boundary is 19.5 and upper class boundary is 29.5
For class interval 2.0 – 2.9, the class boundary is (2.0 -0.05) – (2.9 + 0.05) =
1.95-2.95.
(e) Class width or class interval
This is the width of each class boundary.
It is given by;
Class width = upper class boundary – lower class boundary
In the table above, class width = 29.5 – 19.5 = 10
∑
(f) Mean, ̅ = ∑
∑ ∑
(g) Variance (Var(X) = ∑
(∑ )
∑ ∑
(h) Standard deviation = ( ) √ (∑ )
∑
(i) Median of grouped data
Median of grouped data is defined by
∑
Median = Lb + ( )
Median = Lb + ( )
(l) Histogram
This is a graph consisting of vertical bars. It is a graph of frequency against class boundary.
The area of the bar is equal to the frequency. Histogram is used to obtain the mode
(m) Percentile
This a value that divides a given distribution into 100 equal parts
The 60th percentile for instance is defined as
∑
P60 = ( )
Where
Lb = lower class boundary of the 60th class
C= class width
F = frequency of the 60th class
c.fb = cumulative frequency before that one of the 60th class
(n) Quartiles
This a value that divides a given distribution into 4 equal parts
Where
Lb = lower class boundary of the q1 class
C= class width
f = frequency of the q1 class
c.fb = cumulative frequency before that one of the q1 class
The upper quartile denoted q3 for instance is defined as
∑
q3 = ( )
Where
Lb = lower class boundary of the q3 class
C= class width
f = frequency of the q3 class
c.fb = cumulative frequency before that one of the q3 class
Interquartile range = q3 –q1
Semi-interquartile range =
Example 4
The table below shows the weight of 250 students at The Science Foundation College
Weight (kg) 44.0 – 48.0 – 52.0 – 56.0 – 60.0 – 64.0 – 68.0 – 72.0 –
47.9 51.9 55.9 59.9 63.9 67.9 71.9 75.9
Frequency 3 17 50 45 46 57 23 9
Find
(a) Average weight
(b) Standard deviation
(c) Median weight
(d) Modal weight
(e) Draw an Ogive and use it to find
(i) Upper quartile
(ii) And 10th percentile
(f) Construct a histogram and use it to determine the mode
Solution
(c) Median = Lb + ( )
∑
(iii) 10th percentile ( ) 225th value, from the graph, P90 = 69.15
(f) Construct a histogram and use it to determine the mode
The table below shows the number of students and the mark scored in a test.
Example 6
The frequency distribution table below shows the marks of 50 students score in a test
Solution
∑
Mean, ̅ ∑
(ii) Standard deviation. (05 marks)
∑
s.d = √ ∑
̅
=√
= 12.06
(b) (i) Plot a cumulative frequency curve (Ogive) for the given data. (04 marks)
Note that CF is plotted against the upper limit of each class
(ii) Use the Ogive to estimate the median mark. (02 marks)
56.5
Example 7
The table below shows the age in years of mothers at the time they had their first child.
Age in years 15 - 20 - 25 - 30 - 35 - 40 - 45
Number of 2 14 29 43 33 9
mothers
Calculate the modal age of the mothers. (05 marks)
Mode = Li + ( )
∆1 = 43 - 29 = 14
∆2 = 43 – 33 = 10
Li = 30, Cc = 5
Example 8
The table below shows a frequency distribution of marks scored by s5 students in a test.
Marks 10 - 20 - 30 - 40 - 50 - 60 - 70 - 80 -
≤90
Number of students 2 6 12 15 10 6 3 1
(a) Draw a histogram for the data and use it to estimate the modal mark. (05marks)
Marks x f fx fx2
10 - 20 15 2 30 450
20 - 30 25 6 150 3750
30 - 40 35 12 420 14700
40 - 50 45 15 675 30375
50 - 60 55 10 550 30250
60 - 70 65 6 390 25350
70 - 80 75 3 225 16875
80 - 90 85 1 85 7225
∑f = 55 ∑fx=2525 ∑fx2=128975
(i) mean mark
∑
Means, ̅ ∑
∑ ∑
S.D = √ ∑
(∑ ) √ ( ) = 15.4
Example 9
60 81 76 68 84 112 76 102 86 67
65 98 107 110 72 99 87 92 76 77
94 102 87 86 73 118 98 120 62 87
65 92 104 116 91 93 78 122 102 92
80 111 73 120 106 123 94 109 80 96
(a) Form a grouped frequency table for the data with classes of equal intervals, starting with
the class 60 – 69. (06 marks)
Note that CF is plotted against the upper limit value of the class
Lower quartile, q1 = ( ) ( )
(iii) number of cattle which weigh 118kg and above. (05 marks)
Upper quartile, q3 = ( ) ( )
Example 10
Solution
Example 11
The table shows the weights (kg) of 150 patients who visited a certain health centre.
(a) Mean
(b) Mode
(c) Median
(b) Mode = Lb + ( )
Modal class boundary is 29.5 – 39.5, since 32 is the highest frequency and C = 10
and
( )
∑
(c) Median = Lb + ( )
∑
Example 12
Solution
Number of x f fx fx2
crime
5-<10 7.5 10 75 562.5
10-<20 15 15 225 3375
20-<30 25 25 625 15625
30-<50 40 40 1600 64000
50-<100 75 25 1875 140625
∑ 115 ∑ 4400 ∑ 224187.5
∑ ∑
Var(x) =
∑
(∑ ) ( )
s.d = ( ) √
Example 13
The table below shows the weight of 250 students at a certain day school
weight x f d = x - A fd fd2
43.95-47.95 45.95 3 -12 -36 432
47.95-51.95 49.95 17 -8 -136 1088
51.95-55.95 53.95 50 -4 -200 800
55.95-59.95 57.95 45 0 0 0
59.95-63.95 61.95 46 4 184 736
63.95-67.95 65.95 57 8 456 3648
67.95-71.95 69.95 23 12 276 3312
71.95-75.95 73.95 9 16 144 2304
∑ 250 ∑ 688 ∑ 12320
∑
(a) ̅ ∑
∑ ∑
(b) Var(x) = ∑
(∑ )
= ( )
(c) S.d = ( ) √
Example 14
The following table shows the marks obtained by to students in a physics test marked out of 100
Marks (%) 20 -29 30-39 40-49 50-59 60-69 70-79 80-89 90-100
Number of students 4 6 2 5 7 8 5 2
Find
(a) Mean
(b) Standard deviation
(c) Median and mode
(d) Semi-interquartile range
(e) 40th and 85th percentile range
Solution
(iii) Median = Lb + ( )
∑
Example 15
Solution
(c) Mode = Lb +
=34.5 + (( ) (
) = 39.5
)
∑
(d) Median = Lb + ( )
∑
∑
(e) q1 = ( )
34.5 + ( ) = 38.67
∑
Q3 = ( )
64.5 + ( ) = 67.5
∑
(vii) P90 = ( )
( )
= 74.4 + ( ) = 77
Example 17
Marks 5-14 15-24 25-34 35-44 45-54 55-64 65-74 75-85 85-94
Frequency 3 7 12 20 30 15 8 3 2
Draw a histogram and use it to determine the mode
Example 18
(a) Median
(b) Interquartile range
(c) 10th percentile
Solution
Class 19.5 – 29.5 29.5–39.5 39.5–49.5 49.5 – 59.5 59.5 – 69.5 69.5 – 79.5
boundary
cf 4 10 22 30 37 40
(a) The median = ( ) ( ) = 20th value from the graph = 48.5
Orgive
Example 19
The data shows the length in centimetres for different calendars produced by a printing press. A
cumulative frequency distribution was formed
Solution
Marks (%) 25-29 30-34 35-39 40-44 45-49 50-54 55-59 60-64 60-69 70-74
Frequency 9 12 10 17 13 25 18 14 8 8
(a) Draw a histogram and use it to estimate the modal mark. (52.5)
(b) Find the
(i) Mean mark (49.4627)
(ii) Standard deviation (12.424)
9. The table below shows the marks obtained in an examination by 200 candidates
Thank You