Unit 22 Solution
Unit 22 Solution
D
If each datum of a data set is doubled, then the mean, the
1. C inter-quartile range and the standard deviation of the data set
Range 9 2 7 will be doubled.
∴ The answer is D.
Q1 4 , Q3 8
∴ Inter-quartile range 8 4 4 9. C
1 2 3 4 5
Mean 3
2. D 5
Range (64.5 39.5) kg 25 kg Among the 5 data, removing the datum 3 which is equal to
the mean will make the distribution of the data set the least
concentrated about the mean. As a result, the standard
3. B deviation will be increased the most.
From the graph,
range (180 155) cm 25 cm
Exercise
Q1 160 cm , Q3 175cm
∴ Inter-quartile range (175 160) cm Part I Sectional Exercise
15 cm 1. B
Arrange the daily expenditures in ascending order:
4. C $88, $93, $99, $101, $111, $125, $171
02468 Q1 $93 , Q3 $125
Mean 4
5 ∴ Inter-quartile range of the daily expenditures
Standard deviation $(125 93)
(0 4) 2 (2 4) 2 (4 4) 2 (6 4) 2 (8 4) 2 $32
5
8 2. D
Arrange the waiting times (in min) in ascending order:
2 2 2.3, 4.2, 5.1, 6.8, 7.5, 8.1, 8.8, 10.5, 10.9, 12.4
Range of the waiting times (12.4 2.3) min
5. A 10.1min
1 8 3 6 5 5 7 1
Mean 2.9 Q1 5.1 min , Q3 10.5 min
8 6 5 1
Standard deviation ∴ Inter-quartile range of the waiting times
(10.5 5.1) min
8(1 2.9) 2 6(3 2.9) 2 5(5 2.9) 2 (7 2.9) 2
5.4 min
8 6 5 1
1.84 (cor. to 3 sig.fig.)
3. B
6. B
72 55
Standard score 4.25
4
7. B
∵ 20 h (30 2 5) h x 2σ
95
∴ The required percentage 50 %
2
2.5% Q1 Q3
1
22 More about Statistics
2
22 More about Statistics
3
22 More about Statistics
18. D 23. C
Mean ∵ 126 min (150 2 12) min x 2
( x 6) ( x 2) ( x 4) ( x 2) ( x 3) ( x 1) 162 min (150 12) min x
6
95 68
x 1 ∴ The required percentage %
Variance 2 2
81.5%
[( x 6) ( x 1)]2 [( x 2) ( x 1)]2 ... [( x 1) ( x 1)]2
6
32 24. C
Standard deviation of the new data set s (1 2%)
3
1.02s
19. D
From the histograms, the distribution of III is the most 25. B
concentrated and the distribution of I is the least concentrated. New median 12 3 15
∴ The ascending order of the standard deviations of the New inter-quartile range 17
distributions is given by: III, II, I
26. D
20. D Since the finishing times of the members leaving are equal to
Let x be the mean mark. the mean, removing them will have no effects on mean but
78 x make the distribution of data less concentrated about the
0.2
4 mean. As a result, the mean will remain unchanged and the
x 78.8 standard deviation will be increased.
∴ The mean mark is 78.8.
27. C
21. C Median of the new set of numbers (28 5) 2 66
Let be the standard deviation of the marks. Standard deviation of the new set of numbers 6 2 12
61 66
0.625 Range of the new set of numbers 15 2 30
σ
σ 8
76 66
Alex’s standard score 1.25 28. B
8 The numbers 20 – p, 20 – q, 20 – r, 20 – s and 20 – t can be
obtained by multiplying –1 to each datum in the numbers p,
22. A q, r, s and t, and then adding 20 to each of the resulting data.
For I: ∴ The required variance 16 (1)2 16
66 52
Standard score in paper I = 1.1667
12
29. D
∴ Standard score in paper I > standard score in paper II
∴ I is true. 30. A
For II: For I:
Let x be the marks in paper II. Since only three customers are selected, the sample size is
x 65 too small comparing with the population size.
0.75
16 ∴ I is a disadvantage of this sampling method.
x 77 For II:
∴ Marks in paper I < marks in paper II Since only the three male customers at the cashier counter
∴ II is not true. are selected and female customers have no chance of being
For III: selected, the sample obtained is not representative.
∵ Standard score in paper I > standard score in paper II ∴ II is a disadvantage of this sampling method.
∴ Brian performs relatively better in paper I. For III:
Since this is a sampling method, it is not necessary to select
∴ III is not true. all the customers.
∴ The answer is A. ∴ III is not a disadvantage of this sampling method.
∴ The answer is A.
4
22 More about Statistics
31. B 34. B
For I: For I:
From the graphs, we have Since the median marks for both examinations are 40 which
the sales volume of magazine P in 2016 = 50 millions is the passing mark, the number of candidates passing the
the sales volume of magazine Q in 2016 = 5 millions examination A may be the same as that passing the
∴ The sales volume of magazine P is 10 times that of examination B.
magazine Q in 2016. ∴ I may not be true.
∴ I is true. For II:
For II: Since the upper quartile of the marks in examination A is
the decrease of the sales volume of magazine P greater than the highest mark in examination B, this implies
from 2016 to 2017 that at least 25% of candidates get higher marks in
= (50 – 40) millions examination A than examination B.
= 10 millions ∴ II is true.
the decrease of the sales volume of magazine Q For III:
from 2016 to 2017 We cannot obtain the mean from a box-and-whisker diagram.
= (5 – 4) millions ∴ III may not be true.
= 1 million ∴ The answer is B.
∴ II is not true.
For III: 35. C
the percentage decrease of the sales volume of magazine P From the cumulative frequency curves, the distribution of III
from 2016 to 2017 is the least concentrated and the distribution of II is the most
10 millions concentrated.
100%
50 millions ∴ The descending order of the standard deviations of the
20% distributions is given by: III, I, II
the percentage decrease of the sales volume of magazine Q
from 2016 to 2017 36. [The option I in this question is amended to: ‘ mK mH ’]
1 million A
100%
5 millions
20%
∴ III is true.
∴ The answer is B.
32. C
Arrange the data except a and b in ascending order:
17, 25, 30, 41, 47, 52, 58
Since the median is 41 and a > b, b and a should be in the
lower half and upper half of the data set respectively.
Moreover, when b and a are the smallest and the greatest
data respectively, the inter-quartile range attains its
maximum value. mK mH
Arrange all data in ascending order:
b, 17, 25, 30, 41, 47, 52, 58, a
rH
17 25 52 58
Q1 21 , Q3 55
2 2 rK
∴ Maximum possible value of the inter-quartile range For I:
55 21 From the graph,
34 mK mH
∴ I is true.
33. B For II:
From the box-and-whisker diagram, the lower half of the From the graph,
data is more dispersed than the upper half of the data. rH rK
∴ The graph in B may be its corresponding frequency ∴ II is true.
curve.
5
22 More about Statistics
6
22 More about Statistics
46. B
43. C
x1 x2 x3 x4
Note that the group of numbers {3x1 2, 3x2 2, 3x3 2, ∵ m
4
3x4 2, 3x5 2, 3x6 2} can be obtained by multiplying 3 x1 x2 x3 x4 4m
to each datum in {x1 , x2 , x3 , x4 , x5 , x6} , and then adding 2 to
x2 x3 x4 4m x1 4m x1
each of the resulting data. ∴ a ,
3 3 3 3
For I:
x1 x3 x4 4m x2 4m x2
m2 3m1 2 b ,
3 3 3 3
∴ I is true.
x1 x2 x4 4m x3 4m x3
For II: c and
r2 3r1 3 3 3 3
x x x3 4m x4 4m x4
∴ II is not true. d 1 2
For III: 3 3 3 3
v2 v1 32 9v1 4m m
∴ The mean of the four numbers
∴ III is true. 3 3
∴ The answer is C. m
2
1
44. [The question is amended to: ‘Let A be a group of numbers The variance of the four numbers v
{, , , , } and B be another group of numbers { + 10, 3
+ 10, + 10, + 10}, where < < < < . Which of v
the following must be true?’] 9
D
47. D
For I:
For I:
Median of A
The number of copies of the novel sold in March
( 10) ( 10) = 100
Median of B
2 The number of copies of the novel sold in January
20 = 200
= 2 the number of copies of the novel sold in March
2
∴ The claim in I is true.
∵ , and are unknown.
For II:
∴ The median of A may not be greater than that of B.
50% of the number of copies of the novel sold in February
∴ I may not be true.
= 150 50%
For II: = 75
Range of A the number of copies of the novel sold in March
Range of B ( 10) ( 10) ∴ The claim in II is false.
∴ II must be true.
7
22 More about Statistics
5. C
From the graph, From the diagram,
largest datum = 200, smallest datum = 60 upper-quartile of the heights of blueberry bushes 1.6 m
∴ Range 200 60
140 6. A
Q1 96 , Q3 162 For I:
From the diagram,
∴ Inter-quartile range of the numbers of vehicles
the minimum score = 50 marks
162 96 ∴ I is true.
66 For II:
From the diagram,
2. D Q1 = 65 marks, Q3 = 90 marks
∵ Range 2 inter-quartilerange ∴ Inter-quartile range (90 65) marks
∴ 76 (20 a) 2(68 44) 25 marks
∴ II is true.
a 8
For III:
From the diagram,
3. B median 75 marks
Arrange the scores except x and y in ascending order: ∴ At least half of the students in the class obtained not
7, 11, 11, 12, 15, 18, 18, 20, 20, 21
more than 75 marks.
Modes of the above scores are 11, 18 and 20.
∴ III is not true.
∵ The mode and the range of all the scores are 18 and 23
∴ The answer is A.
respectively, and both x and y are positive.
∴ Among x and y, one is equal to 18 and the other is equal
7. B
to 30.
From the stem-and-leaf diagram, the upper half of the data is
Suppose x = 18 and y = 30.
less dispersed than the lower half of the data. Also, the lower
Arrange all the scores in ascending order:
quartile (20) is closer to the minimum (11) than the
7, 11, 11, 12, 15, 18, 18, 18, 20, 20, 21, 30
maximum (49).
11 12 20 20 ∴ The diagram in B may represent the distribution.
Q1 11.5 , Q3 20
2 2
8
22 More about Statistics
8. C 10. A
Range of the heights of apple trees
(219.5 139.5) cm
80 cm
149.5 12
169.5 36
xA xB
189.5 18
209.5 4 yA
9
22 More about Statistics
12. D 16. B
Let x kg and kg be the mean and the standard deviation of For I:
the weights respectively. Range of G1 = x5 – x1
52 x Range of G2 = x5 – x6
1.75 > x5 – x1
σ
∴ I is not true.
52 x 1.75σ ......( 1)
For II:
43 x G2 can be obtained by adding x6 to G1 and
0.5
σ x6 < x1 < x2 < x3 < x4 < x5.
43 x 0.5σ ......( 2) ∴ Mean of G1 is greater than mean of G2.
(1) – (2): 9 2.25σ ∴ II is true.
For III:
σ4
Median of G1 = x3
∴ The standard deviation of the weights is 4 kg.
x x3
Median of G2 2
13. C 2
For I: x3 (∵ x2 < x3)
From the figure, ∴ III is not true.
lower quartile 6 ∴ The answer is B.
∴ I is true.
For II: 17. B
5 4 68 7 4 8 6 9 2 For I:
Mean a1 a2 a3 ... a70
48 46 2 ∵ M1
6.75 70
Standard deviation ∴ a1 a2 a3 ... a70 70M1
4(5 6.75) 8(6 6.75) 4(7 6.75)
2 2 2
a1 a2 a3 ... a70 M1
∵ M2
6(8 6.75) 2 2(9 6.75) 2 71
70M 1 M 1
24 ∴ M2
1.23322 71
M1 M 2
1.5
∴ II is not true. ∴ I is true.
For III: For II:
For the two students who get 9 marks in the quiz, Since M1 may be the upper quartile or the lower quartile of
the group of 71 numbers {a1, a2, a3, … , a70, M1}, I1 may not
9 6.75
the standard score be equal to I2.
1.23322 ∴ II may not be true.
1.82449 For III:
2 Note that the group of 71 numbers {a1, a2, a3, … , a70, M1}
∴ No students in the class whose standard score are can be obtained by adding M1 to the group of 70 numbers
greater than or equal to 2. {a1, a2, a3, … , a70}.
∴ The standard scores of all students in the class are less Adding it will make the distribution of data more
than 2. concentrated about the mean. As a result, the standard
∴ III is true. deviation will be decreased.
∴ The answer is C. ∴ S1 > S2
∴ III is true.
14. B ∴ The answer is B.
∵ 260 g (245 3 5) g x 3σ
18. C
99.7
∴ The required probability 50 % Variance of the new set 36 32 324
2
0.0015
19. D
Note that the group of numbers {2x1 + 4, 2x2 + 4, 2x3+ 4,
15. A 2x4 + 4, 2x5 + 4, 2x6 + 4, 2x7 + 4} can be obtained by
14 12 18 17 16 15 21 15 20 multiplying 2 to each datum in the group of numbers {x1, x2,
Mean 16.4444
9 x3, x4, x5, x6, x7}, and then adding 4 to each of the resulting
Among the 4 options, the deviation of 10 from the mean is data.
the greatest. Adding 10 to the data set will make the
distribution of data the least concentrated about the mean. As
a result, the standard deviation will be increased the most.
10
22 More about Statistics
For I:
m 50 2 4
104
∴ I is true.
For II:
r 12 2
24
∴ II is true.
For III:
v 10 22
40
∴ III is true.
∴ The answer is D.
20. B
For I and III:
∵ We do not know the total numbers of students in class A
and class B.
∴ The claims in I and III may not be true.
For II:
Percentage of students who like orange juice in class A
90
100%
360
25%
Percentage of students who like orange juice in class B
90
100%
360
25%
∴ The claim in II is true.
∴ The answer is B.
11