Exam Questions Chapter2&3
Exam Questions Chapter2&3
The box plot in Figure 1 shows a summary of the weights of the luggage, in kg, for each musician in an
orchestra on an overseas tour.
The airline's recommended weight limit for each musician's luggage was 45 kg. Given that none of the
musicians' luggage weighed exactly 45 kg,
(a) state the proportion of the musicians whose luggage was below the recommended weight limit.
(1)
A quarter of the musicians had to pay a charge for taking heavy luggage.
(b) State the smallest weight for which the charge was made.
(1)
(c) Explain what you understand by the + on the box plot in Figure 1, and suggest an instrument that the
owner of this luggage might play.
(2)
(d) Describe the skewness of this distribution. Give a reason for your answer.
(2)
One musician of the orchestra suggests that the weights of luggage, in kg, can be modelled by a normal
distribution with quartiles as given in Figure 1.
(Total 10 marks)
Page |2
Figure 2 shows a histogram for the variable t which represents the time taken, in minutes, by a group of
people to swim 500m.
(2)
(b) Estimate the number of people who took longer than 20 minutes to swim 500m.
(2)
(c) Find an estimate of the mean time taken.
(4)
(d) Find an estimate for the standard deviation of t.
(3)
(e) Find the median and quartiles for t.
(4)
(f) Evaluate this measure and describe the skewness of these data.
(2)
(Total 17 marks)
Page |3
The histogram in Figure 1 shows the time taken, to the nearest minute, for 140 runners to complete a fun
run.
Use the histogram to calculate the number of runners who took between 78.5 and 90.5 minutes to
complete the fun run.
(5)
(Total 5 marks)
Page |4
Cotinine is a chemical that is made by the body from nicotine which is found in cigarette smoke. A doctor
tested the blood of 12 patients, who claimed to smoke a packet of cigarettes a day, for cotinine. The
results, in appropriate units, are shown below.
(a) Find the mean and standard deviation of the level of cotinine in a patient's blood.
(4)
(b) Find the median, upper and lower quartiles of these data.
(3)
A doctor suspects that some of his patients have been smoking more than a packet of cigarettes per day.
He decides to use Q3+1.5(Q3–Q1) to determine if any of the cotinine results are far enough away from the
upper quartile to be outliers.
(c) Identify which patient(s) may have been smoking more than a packet of cigarettes a day. Show your
working clearly.
(4)
Research suggests that cotinine levels in the blood form a skewed distribution.
(d) Evaluate this measure and describe the skewness of these data.
(3)
(Total 14 marks)
Page |5
The age in years of the residents of two hotels are shown in the back to back stem and leaf diagram
below.
(e) Compare the two age distributions of the residents of each hotel.
(3)
(Total 13 marks)
Page |6
In a study of how students use their mobile telephones, the phone usage of a random sample of 11
students was examined for a particular week.
The values of Sxx and Sxy for these 10 students are Sxx = 3463.6 and Sxy = −18.3.
(e) Calculate the product moment correlation coefficient between the number of text messages sent and
the total length of calls for these 10 students.
(2)
A parent believes that a student who sends a large number of text messages will spend fewer minutes on
calls.
(f) Comment on this belief in the light of your calculation in part (e).
(1)
(Total 14 marks)
Page |7
In a shopping survey a random sample of 104 teenagers were asked how many hours, to the nearest
hour, they spent shopping in the last month. The results are summarised in the table below.
A histogram was drawn and the group (8 − 10) hours was represented by a rectangle that was 1.5 cm
wide and 3 cm high.
(a) Calculate the width and height of the rectangle representing the group (16 − 25) hours.
(3)
(b) Use linear interpolation to estimate the median and interquartile range.
(5)
(c) Estimate the mean and standard deviation of the number of hours spent shopping.
(4)
(d) State, giving a reason, the skewness of these data.
(2)
(e) State, giving a reason, which average and measure of dispersion you would recommend to use to
summarise these data.
(2)
(Total 16 marks)
Page |8
A researcher measured the foot lengths of a random sample of 120 ten-year-old children. The lengths are
summarised in the table below.
Coefficient of skewness =
(c) Evaluate this coefficient and comment on the skewness of these data.
(3)
Greg suggests that a normal distribution is a suitable model for the foot lengths of ten-year-old children.
(d) Using the value found in part (c), comment on Greg's suggestion, giving a reason for your answer.
(2)
(Total 13 marks)
Page |9
The variable x was measured to the nearest whole number. Forty observations are given in the table
below.
A histogram was drawn and the bar representing the 10 - 15 class has a width of 2 cm and a height of 5
cm. For the 16 - 18 class find
(a) the width,
(1)
(b) the height
(2)
of the bar representing this class.
(Total 3 marks)
P a g e | 10
The 19 employees of a company take an aptitude test. The scores out of 40 are illustrated in the stem
and leaf diagram below.
Find
The company director decides that any employees whose scores are so low that they are outliers will
undergo retraining.
An outlier is an observation whose value is less than the lower quartile minus 1.0 times the interquartile
range.
(c) Explain why there is only one employee who will undergo retraining.
(2)
(d) On the graph paper on page 5, draw a box plot to illustrate the employees' scores.
(3)
(Total 9 marks)
P a g e | 11
The birth weights, in kg, of 1500 babies are summarised in the table below.
(Total 11 marks)
P a g e | 12
A teacher selects a random sample of 56 students and records, to the nearest hour, the time spent
watching television in a particular week.
(a) Find the mid-points of the 21-25 hour and 31-40 hour groups.
(2)
A histogram was drawn to represent these data. The 11-20 group was represented by a bar of width 4 cm
and height 6 cm.
The distances travelled to work, D km, by the employees at a large company are normally distributed with
D N(30, 82 ).
(a) Find the probability that a randomly selected employee has a journey to work of more than 20 km.
(3)
(b) Find the upper quartile, Q3, of D.
(3)
(c) Write down the lower quartile, Q1 , of D.
(1)
An outlier is defined as any value of D such that D < h or D > k where
(e) Find the probability that the distance travelled to work by this employee is an outlier.
(3)
(Total 12 marks)
P a g e | 14
Keith records the amount of rainfall, in mm, at his school, each day for a week. The results are given
below.
Jenny then records the amount of rainfall, x mm, at the school each day for the following 21 days. The
results for the 21 days are summarised below.
(a) Calculate the mean amount of rainfall during the whole 28 days.
(2)
Keith realises that he has transposed two of his figures. The number 9.4 should have been 4.9 and the
number 0.5 should have been 5.0
Keith corrects these figures.
(b) State, giving your reason, the effect this will have on the mean.
(2)
(Total 4 marks)
P a g e | 15
On a randomly chosen day, each of the 32 students in a class recorded the time, t minutes to the nearest
minute, they spent on their homework. The data for the class is summarised in the following table.
(b) find the mean and the standard deviation of the times spent by the students on their homework.
(3)
(c) Comment on the skewness of the distribution of the times spent by the students on their homework.
Give a reason for your answer.
(2)
(Total 7 marks)
P a g e | 16
Over a long period of time a small company recorded the amount it received in sales per month. The
results are summarised below.
(a) On the graph paper below, draw a box plot to represent these data, indicating clearly any outliers.
(5)
(b) State the skewness of the distribution of the amount of sales received. Justify your answer.
(2)
(c) The company claims that for 75% of the months, the amount received per month is greater than £10
000. Comment on this claim, giving a reason for your answer.
(2)
(Total 9 marks)
P a g e | 17
A class of students had a sudoku competition. The time taken for each student to complete the sudoku
was recorded to the nearest minute and the results are summarised in the table below.
(d) Give a reason to support the use of a normal distribution in this case.
(1)
On another occasion the teacher calculated the quartiles for the times taken by the students to complete
a different sudoku and found
(e) Describe, giving a reason, the skewness of the times on this occasion.
(2)
(Total 11 marks)
P a g e | 18
The histogram in Figure 1 shows the time, to the nearest minute, that a random sample of 100 motorists
were delayed by roadworks on a stretch of motorway.
Figure 1
(2)
(b) Estimate the number of motorists who were delayed between 8.5 and 13.5 minutes by the roadworks.
(2)
(Total 4 marks)
P a g e | 19
The marks, x, of 45 students randomly selected from those students who sat a mathematics examination
are shown in the stem and leaf diagram below.
(c) Find the mean and the standard deviation of the marks of these students.
(3)
(d) Describe the skewness of the marks of these students, giving a reason for your answer.
(2)
The mean and standard deviation of the marks of all the students who sat the examination were 55 and
10 respectively. The examiners decided that the total mark of each student should be scaled by
subtracting 5 marks and then reducing the mark by a further 10%.
(e) Find the mean and standard deviation of the scaled marks of all the students.
(4)
(Total 13 marks)
P a g e | 20
A policeman records the speed of the traffic on a busy road with a 30 mph speed limit. He records the
speeds of a sample of 450 cars. The histogram in Figure 2 represents the results.
(a) Calculate the number of cars that were exceeding the speed limit by at least 5 mph in the sample.
(4)
(b) Estimate the value of the mean speed of the cars in the sample.
(3)
(c) Estimate, to 1 decimal place, the value of the median speed of the cars in the sample.
(2)
(d) Comment on the shape of the distribution. Give a reason for your answer.
(2)
(e) State, with a reason, whether the estimate of the mean or the median is a better representation of the
average speed of the traffic on the road.
(2)
(Total 13 marks)
P a g e | 21
A survey of 100 households gave the following results for weekly income £y.
A histogram was drawn and the class 200 ≤ y < 240 was represented by a rectangle of width 2 cm and
height 7 cm.
(a) Calculate the width and the height of the rectangle representing the class 320 ≤ y < 400
(3)
(b) Use linear interpolation to estimate the median weekly income to the nearest pound.
(2)
(c) Estimate the mean and the standard deviation of the weekly income for these data.
(4)
(d) Use this measure to calculate the skewness for these data and describe its value.
(2)
Katie suggests using the random variable X which has a normal distribution with mean 320 and standard
deviation 150 to model the weekly income for these data.
The following table summarises the times, t minutes to the nearest minute, recorded for a group of
students to complete an exam.
(f) Without further calculations, explain the effect this would have on each of the estimates found in parts
(a), (b), (c) and (d).
(3)
(Total 14 marks)
P a g e | 23
The marks of a group of female students in a statistics test are summarised in Figure 1
Figure 1
(a) Write down the mark which is exceeded by 75% of the female students.
(1)
The marks of a group of male students in the same statistics test are summarised by the stem and leaf
diagram below.
(b) Find the median and interquartile range of the marks of the male students.
(3)
An outlier is a mark that is
either more than 1.5 × interquartile range above the upper quartile
(c) In the space provided on Figure 1 draw a box plot to represent the marks of the male students,
indicating clearly any outliers.
(5)
(d) Compare and contrast the marks of the male and the female students.
(2)
(Total 11 marks)
P a g e | 24
An agriculturalist is studying the yields, y kg, from tomato plants. The data from a random sample of 70
tomato plants are summarised below.
The bar representing the yield 5 ≤ y < 10 has a width of 1.5 cm and a height of 8 cm.
(a) Calculate the width and the height of the bar representing the yield 15 ≤ y < 25
(3)
(b) Use linear interpolation to estimate the median yield of the tomato plants.
(2)
(c) Estimate the mean and the standard deviation of the yields of the tomato plants.
(4)
(d) Describe, giving a reason, the skewness of the data.
(2)
(e) Estimate the number of tomato plants in the sample that have a yield of more than 1 standard
deviation above the mean.
(2)
(Total 13 marks)
P a g e | 25
The mark, x, scored by each student who sat a statistics examination is coded using
y = 1.4x − 20
The coded marks have mean 60.8 and standard deviation 6.60
(Total 4 marks)
P a g e | 26
A random sample of 35 homeowners was taken from each of the villages Greenslax and Penville and
their ages were recorded. The results are summarised in the back-to-back stem and leaf diagram below.
Some of the quartiles for these two distributions are given in the table below.
(b) On the graph paper opposite draw a box plot to represent the data from Penville. Show clearly any
outliers.
(4)
(c) State the skewness of each distribution. Justify your answers.
(3)
P a g e | 27
(Total 9 marks)
P a g e | 28
The table shows the time, to the nearest minute, spent waiting for a taxi by each of 80 people one Sunday
afternoon.
(a) Write down the upper class boundary for the 2–4 minute interval.
(1)
A histogram is drawn to represent these data. The height of the tallest bar is 6 cm.
(Total 12 marks)
P a g e | 29
The times, in seconds, spent in a queue at a supermarket by 85 randomly selected customers, are
summarised in the table below.
A histogram was drawn to represent these data. The 30 – 60 group was represented by a bar of width 1.5
cm and height 1 cm.
coefficient of skewness =
(d) Evaluate this coefficient and comment on the skewness of these data.
(3)
(Total 11 marks)
P a g e | 30
Each of 60 students was asked to draw a 20° angle without using a protractor. The size of
each angle drawn was measured. The results are summarised in the box plot below.
(c) Use linear interpolation to estimate the size of the median angle drawn. Give your
answer to 1 decimal place.
(2)
(d) Show that the lower quartile is 63°
(2)
For these data, the upper quartile is 75°, the minimum is 55° and the maximum is 84°
(e) (i) Show that there are no outliers for these data.
(ii) Draw a box plot for these data on the grid on page 3.
(5)
(f) State which angle the students were more accurate at drawing. Give reasons for
your answer.
(3)
.............................................................................................................................................
P a g e | 31
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
.............................................................................................................................................
A midwife records the weights, in kg, of a sample of 50 babies born at a hospital. Her results are given in
the table below.
The bar representing the weight 2 ≤ w < 3 has a width of 1 cm and a height of 4 cm.
(a) Calculate the width and height of the bar representing a weight of 3 ≤ w < 3.5
(3)
(b) Use linear interpolation to estimate the median weight of these babies.
(2)
(c) (i) Show that an estimate of the mean weight of these babies is 3.43 kg.
(ii) Find an estimate of the standard deviation of the weights of these babies.
(3)
Shyam decides to model the weights of babies born at the hospital, by the random variable W, where W ~
N(3.43, 0.652)
(f) Without carrying out any further calculations, state, giving a reason, what effect the addition of this
newborn baby to the sample would have on your estimate of the
(i) mean,
(ii) standard deviation.
(3)
An estate agent is studying the cost of office space in London. He takes a random sample of 90 offices
and calculates the cost, £x per square foot. His results are given in the table below.
A histogram is drawn for these data and the bar representing 50 ≤ x < 60 is 2cm wide and 8cm high.
(a) Calculate the width and height of the bar representing 20 ≤ x < 40
(3)
(b) Use linear interpolation to estimate the median cost.
(2)
(c) Estimate the mean cost of office space for these data.
(2)
(d) Estimate the standard deviation for these data.
(2)
(e) Describe, giving a reason, the skewness.
(1)
Rika suggests that the cost of office space in London can be modelled by a normal distribution with mean
£50 and standard deviation £10
(f) With reference to your answer to part (e), comment on Rika's suggestion.
(1)
(g) Use Rika's model to estimate the 80th percentile of the cost of office space in London.
(3)
The following grouped frequency distribution summarises the number of minutes, to the nearest minute,
that a random sample of 100 motorists were delayed by roadworks on a stretch of motorway one Monday.
The bar representing a delay of (3–6) minutes has a width of 2 cm and a height of 9.5 cm.
(a) Calculate the width and the height of the bar representing a delay of (11–15) minutes.
(3)
(b) Use linear interpolation to estimate the median delay.
(2)
(c) Calculate an estimate of the mean delay.
(2)
(d) Calculate an estimate of the standard deviation of the delays.
(2)
(e) Evaluate this coefficient for the above data, giving your answer to 2 significant figures.
(1)
On the following Friday, the coefficient of skewness for the delays on this stretch of motorway was – 0.22
(f) State, giving a reason, how the delays on this stretch of motorway on Friday are different from the
delays on Monday.
(2)
The box plot summarises the weights of luggage for a group of tourists at an airport.
(b) Write down an estimate of the maximum weight that is allowed before having to pay the extra charge.
(1)
Four tourists with luggage weighing 10 kg, 17 kg, 21 kg and 25 kg join the group.
(c) State what, if any, changes this will make to the box plot.
Give a reason for your answer.
(1)
Peta recorded the times, t minutes, a group of children took to swim 300 metres. She summarised these
times in the following histogram.
(a) Calculate the number of children who took part in the swim.
(2)
Adam used the histogram to estimate the mean and standard deviation of the times taken by the children
to complete the swim.
(d) Explain an assumption Adam made about these data that has led him to get different answers to
Peta.
(1)
Adam and Peta each calculate a coefficient of skewness by using their statistics in the formula
(e) (i) Calculate the coefficients of skewness found by Adam and Peta.
(ii) Suggest how Peta could improve her histogram to describe the data more accurately.
(2)
A rugby club coach uses club records to take a random sample of 15 players from 1990 and an
independent random sample of 15 players from 2010. The body weight of each player was recorded to
the nearest kg and the results from 2010 are summarised in the table below.
(a) Find the estimated values in kg of the summary statistics a, b and c in the table below.
(b) Using the table in part (a), comment on the rugby coach's claim.
(2)
Figure 1
(a) Calculate the number of staff who have claimed less than 10 hours of overtime in the month.
(4)
(b) Estimate the median number of hours of overtime claimed by these 40 staff in the month.
(2)
(c) Estimate the mean number of hours of overtime claimed by these 40 staff in the month.
(2)
The manager wants to compare these data with overtime data he collected earlier to find out if the
overtime claimed by staff has decreased.
(d) State, giving a reason, whether the manager should use the median or the mean to compare the
overtime claimed by staff.
(2)
A sports teacher recorded the number of press-ups done by his students in two minutes.
He recorded this information for a Year 7 class and for a Year 11 class.
Key: means 42 press–ups for a Year 7 student and 40 press–ups for a Year 11 student
(b) Find the lower quartile and the upper quartile for the Year 7 class.
(2)
(c) Use the medians and quartiles to describe the skewness of each of the two distributions.
(3)
(d) Give two reasons why the normal distribution should not be used to model the number
of press-ups done by the Year 11 class.
(2)
Yujie is investigating the weights of 10 young rabbits. She records the weight, x grams, of each rabbit and
the results are summarised below.
(a) Calculate the mean and the standard deviation of the weights of these rabbits.
(3)
Two more rabbits weighing 776 grams and 896 grams are added to make a group of 12 rabbits.
(c) State, giving a reason, how the inclusion of these two rabbits would affect the mean.
(2)
(d) By considering the change in (x − )2, state what effect the inclusion of these two rabbits would have
on the standard deviation.
(2)
Ralph records the weights, in grams, of 100 tomatoes. This information is displayed in the histogram
below.
(a) find the number of tomatoes with a weight between 0 and 2 grams.
(2)
One of the tomatoes is selected at random.
(e) describe the skewness of the distribution of the weights of these tomatoes. Give a reason for your
answer.
(1)
Two of these 100 tomatoes are selected at random.
(f) Estimate the probability that both tomatoes weigh within 0.75 grams of the mean.
(4)
Nina weighed a random sample of 50 carrots from her shop and recorded the weight, in grams to the
nearest gram, for each carrot. The results are summarised below.
(a) Use linear interpolation to estimate the median weight of these carrots.
(2)
(b) Find an estimate for the mean weight of these carrots.
(2)
(c) Find an estimate for the standard deviation of the weights of these carrots.
(2)
A carrot is selected at random from Nina's shop.
(d) Estimate the probability that the weight of this carrot is more than 70 grams.
(2)
Each student achieved a score, x, on the test and their scores are summarised in the table below.
The mean score for Class A is 77 and the mean score for Class B is 61
(c) State, with a reason, which class you believe they were from.
(1)
The two classes are combined into one group of 25 students.
(d) (i) Find the mean test score for all 25 students.
(ii) Find the variance of the test scores for all 25 students.
(4)
The teacher of class A later realises that he added up the test scores for his class incorrectly. Each
student's test score in class A should be increased by 3
(e) Without further calculations, state, with a reason, the effect this will have on
(i) the variance of the test scores for class A
(ii) the mean test score for all 25 students
(iii) the variance of the test scores for all 25 students.
(3)
Two youth clubs, Eastyou and Westyou, decided to raise money for charity by running a 5 km race. All
the members of the youth clubs took part and the time, in minutes, taken for each member to run the 5 km
was recorded.
The times for the Westyou members are summarised in Figure 1.
(a) Write down the time that is exceeded by 75% of Westyou members.
(1)
The times for the Eastyou members are summarised by the stem and leaf diagram below.
(b) Find the value of the median and interquartile range for the Eastyou members.
(3)
An outlier is a value that falls either
(c) On the grid on page 7, draw a box plot to represent the times of the Eastyou members.
(4)
(d) State the skewness of each distribution. Give reasons for your answers.
(3)
The weights, in grams, of a random sample of 48 broad beans are summarised in the table.
A histogram was drawn to represent these data. The 2.1 < x ≤ 2.7 class was represented by a bar of
width 1.5 cm and height 1 cm.
(a) Find the width and height of the 0.9 < x ≤ 1.1 class.
(3)
(b) Give a reason to justify the use of a histogram to represent these data.
(1)
(c) Estimate the mean and the standard deviation of the weights of these broad beans.
(4)
(d) Use linear interpolation to estimate the median of the weights of these broad beans.
(2)
One of these broad beans is selected at random.
(e) Estimate the probability that its weight lies between 1.1 grams and 1.6 grams.
(1)
One of these broad beans having a recorded weight of 0.95 grams was incorrectly weighed. The correct
weight is 1.4 grams.
(f) State, giving a reason, the effect this would have on your answers to part (c). Do not carry out any
further calculations.
(2)
A group of 100 adults recorded the amount of time, t minutes, they spent exercising each day. Their
results are summarised in the table below.
The bar representing the time 0 ≤ t < 15 has width 0.5 cm and height 6 cm.
(a) Calculate the width and height of the bar representing a time of 60 ≤ t < 120
(3)
(b) Use linear interpolation to estimate the median time spent exercising by these adults each day.
(2)
(c) Find an estimate of the mean time spent exercising by these adults each day.
(2)
(d) Calculate an estimate for the standard deviation of these times.
(2)
(e) Describe, giving a reason, the skewness of these data.
(1)
Further analysis of the above data revealed that 18 of the 25 adults in the 0 ≤ t < 15 group took no
exercise each day.
(f) State, giving a reason, what effect, if any, this new information would have on your answers to
(i) the estimate of the median in part (b),
(ii) the estimate of the mean in part (c),
(iii) the estimate of the standard deviation in part (d).
(3)
The heights, x metres, of 40 children were recorded by a teacher. The results are summarised as follows
(a) Find the mean and the variance of the heights of these 40 children.
(3)
The teacher decided that these statistics would be more useful in centimetres.
(b) Find
(i) the mean of these heights in centimetres,
(ii) the standard deviation of these heights in centimetres.
(2)
Two more children join the group. Their heights are 130 cm and 160 cm.
(c) (i) State, giving a reason, the mean height of the 42 children.
(ii) Without recalculating the standard deviation, state, giving a reason, whether the standard deviation
of the heights of the 42 children will be greater than, less than or the same as the standard deviation of
the heights of the group of 40 children.
(4)
Chi wanted to summarise the scores of the 39 competitors in a village quiz. He started to produce the
following stem and leaf diagram
He did not complete the stem and leaf diagram but instead produced the following box plot.
(7)
Some of the competitors complained that the questions were biased towards the younger generation. The
product moment correlation coefficient between the age of the competitors and their score in the quiz is –
0.187
(d) State, giving a reason, whether or not the complaint is supported by this statistic.
(2)
(Total for question = 13 marks)
P a g e | 49
A group of students took some tests. A teacher is analysing the average mark for each student. Each
student obtained a different average mark.
For these average marks, the lower quartile is 24, the median is 30 and the interquartile range (IQR) is 10
The three lowest average marks are 8, 10 and 15.5 and the three highest average marks are 45, 52.5 and
56
The median and the upper quartile are the same but the lower quartile is now 26
(e) Give ranges of values within which each of these students' average marks must lie.
(2)
Only use these grids if you need to redraw your answer for part (b) or part (d).
The time taken to complete a puzzle, in minutes, is recorded for each person in a club. The times are
summarised in a grouped frequency distribution and represented by a histogram.
One of the class intervals has a frequency of 20 and is shown by a bar of width 1.5 cm and height 12 cm
on the histogram. The total area under the histogram is 94.5 cm2
A researcher recorded the time, t minutes, spent using a mobile phone during a particular afternoon, for
each child in a club.
The researcher coded the data using and the results are summarised in the table below.
The stem and leaf diagram below shows the ages (in years) of the residents in a care home.
A disc of radius 1 cm is rolled onto a horizontal grid of rectangles so that the disc is equally likely to land
anywhere on the grid. Each rectangle is 5 cm long and 3 cm wide. There are no gaps between the
rectangles and the grid is sufficiently large so that no discs roll off the grid.
If the disc lands inside a rectangle without covering any part of the edges of the rectangle then a prize is
won.
(a) show that the probability of winning a prize on any particular roll is
(3)
A group of 15 students each roll the disc onto the grid twenty times and record the number of times, x,
that each student wins a prize. Their results are summarised as follows
(b) Find the standard deviation of the number of prizes won per student.
(2)
A second group of 12 students each roll the disc onto the grid twenty times and the mean number of
prizes won per student is 3.5 with a standard deviation of 2
(c) Find the mean and standard deviation of the number of prizes won per student for the whole group of
27 students.
(7)
The 27 students also recorded the number of times that the disc covered a corner of a rectangle and
estimated the probability to be 0.2216 (to 4 decimal places).
(d) Explain how this probability could be used to find an estimate for the value of π and state the value of
your estimate.
(3)
A random sample of 100 carrots is taken from a farm and their lengths, L cm, recorded.
The data are summarised in the following table.
(a) Find the width and height of the bar representing the class 15 ≤ L < 20
(3)
(b) Use linear interpolation to estimate the median length of these carrots.
(2)
(c) Estimate
(i) the mean length of these carrots,
(2)
(ii) the standard deviation of the lengths of these carrots.
(3)
A supermarket will only buy carrots with length between 9 cm and 22 cm.
(d) Estimate the proportion of carrots from the farm that the supermarket will buy.
(2)
Any carrots that the supermarket does not buy are sold as animal feed.
The farm makes a profit of 2.2 pence on each carrot sold to the supermarket, a profit of 0.8 pence on
each carrot longer than 22 cm and a loss of 1.2 pence on each carrot shorter than 9 cm.
(e) Find an estimate of the mean profit per carrot made by the farm.
(2)
The stem and leaf diagram shows the ages of the 35 male passengers on a cruise.
(e) Comment on any difference in the distributions of ages of male and female passengers on the cruise.
State the values of any statistics you have used to support your comment.
(1)
Anja, along with her 2 daughters and a granddaughter, now join the cruise.
Anja's granddaughter is younger than both of Anja's daughters.
Anja had her 23rd birthday on the day her eldest daughter was born.
When their 4 ages are included with the other female passengers on the cruise, the box plot does not
change.
The stem and leaf diagram shows the number of deliveries made by Pat each day for 24 days
An outlier is defined as any value greater than 1.5 × interquartile range above the upper quartile.
The data in the stem and leaf diagram are coded using
x = d – 125
The company Seafield requires contractors to record the number of hours they work each week. A
random sample of 38 weeks is taken and the number of hours worked per week by contractor Kiana is
summarised in the stem and leaf diagram below.
The quartiles for this distribution are summarised in the table below.
(b) Showing your working clearly, identify any outliers that Kiana finds.
(2)
(c) Draw a box plot for these data in the space provided on the grid opposite.
(3)
(d) Use the formula
to find the skewness of these data. Give your answer to 2 significant figures.
(2)
Kiana's new employer, Landacre, wishes to know the average number of hours per week she worked
during her employment at Seafield to help calculate the cost of employing her.
(e) Explain why Landacre might prefer to know Kiana's mean, rather than median, number of hours
worked per week.
(1)
Gill buys a bag of logs to use in her stove. The lengths, l cm, of the 88 logs in the bag are summarised in
the table below.
The bar representing logs with length has a width of 1.5 cm and a height of 4 cm.
(a) Calculate the width and height of the bar representing log lengths of
(3)
(b) Use linear interpolation to estimate the median of l
(2)
The maximum length of log Gill can use in her stove is 26 cm.
Gill estimates, using linear interpolation, that x logs from the bag will fit into her stove.
(d) Using x = 62 , find the probability that all 4 logs will fit into her stove.
(2)
The weights, W grams, of the logs in the bag are coded using y = 0.5w - 255 and summarised by
n = 88 ∑ y = 924 ∑ y2 = 12 862
(e) Calculate
(i) the mean of W
(3)
(ii) the variance of W
(3)
The stem lengths of a sample of 120 tulips are recorded in the grouped frequency table below.
(a) Calculate the exact area of the bar representing the class.
(2)
The height of the tallest bar in the histogram is 10 cm.
A random sample of 8 squirrels is taken and the data for each squirrel is coded using
(c) find
(i) the body length of the 9th squirrel,
(2)
(ii) the standard deviation of x for all 9 squirrels.
(2)
The histogram shows the times taken, t minutes, by each of 100 people to swim 500 metres.
(a) Use the histogram to complete the frequency table for the times taken by the 100 people to swim 500
metres.
(1)
(b) Estimate the number of people who took less than 16 minutes to swim 500 metres.
(2)
(c) Find an estimate for the mean time taken to swim 500 metres.
(2)
2
Given that ∑ ft = 41 033
(d) find an estimate for the standard deviation of the times taken to swim 500 metres.
(2)
Given that Q3 = 23
(e) use linear interpolation to estimate the interquartile range of the times taken to swim 500 metres.
(3)
Jim records the length, l mm, of 81 salmon. The data are coded using x = l – 600 and the following
summary statistics are obtained.
(c) Find the maximum number of salmon that have weights in the interval
1.5 × IQR above the upper quartile or 1.5 × IQR below the lower quartile
The histogram shows the distances, in km, that 274 people travel to work.