Unit 9 StatProbRevision
Unit 9 StatProbRevision
Formulae
Mean of a data set
Xn Xn
xi xi fi
i=1 i=1
x̄ = = n
n X
fi
i=1
1
Easier questions
1. A sample of 15 measurements has a mean of 14.2 and a sample of 10 measurements
has a mean of 12.6. Find the mean of the total sample of 25 measurements.
(a) Determine the mean and standard deviation for this data-set.
(b) If each number is increased by 2, determine the new mean and standard
deviation.
3. The mean height of footballers in a league competition is 178 cm. The standard
deviation is 4 cm. Assuming the heights are normally distributed, calculate the
percentage of footballers with a height between 174 cm and 180 cm.
4. Two baseballers compare their batting performances for a ten game stretch. The
number of safe hits per game were recorderded as
Daryl 5 4 1 0 5 4 0 5 4 2
Tom 1 2 3 3 3 4 6 2 3 3
(a) Show that each baseballer has the same mean and range.
(b) Calculate the standard deviation for each distribution.
(c) Hence comment on which baseballer is more likely to have more safe hits per
game.
5. The number of customers received per day in a shop was recorded over a period
of 99 days, as shown in the table below.
Number 0 − 19 20 − 29 30 − 39 40 − 49 50 − 80
Frequency 18 17 25 15 24
6. A six–sided die and coin are tossed. What is the probability that
2
7. Bag X contains 3 black and 2 red marbles. Bag Y contains 4 black and 1 red
marble. A bag is selected at random and then two marbles are selected without
replacement. Determine the probability that:
(a) Construct a fully labelled Venn Diagram illustrating the above information.
(b) Find the probability that a randomly selected student from the class studies:
i. Chemistry, but not Biology
ii. both Chemistry and Biology
Harder questions
1. Consider the data-set x1 , x2 , x3 , . . . , xn .
(a) Write down the formulas for the mean, x̄, and variance, s2 , for this data-set.
(b) If each number is increased by m, determine the new mean, x̄new and variance,
s2new .
3. Each athlete on a running team recorded the distance (M miles) they ran in 30
minutes. The median distance is 4 miles and the interquartile range is 1.1 miles.
The information is shown in the following box-and-whiskers plot.
3
There were 400 athletes who took between 22 and m minutes to complete the 5
km race.
5. Six cards, 3 hearts, 2 diamonds and 1 club, are placed face down and randomly
shuffled. You win 10 dollars if you choose the club and you win 5 dollars if you
4
chose a diamond. Determine the greateet losing amount, should you choose a heart
such that the game is worth your while to play.
6. Suppose P (C) = 0.6 and P (D) = 0.7. Explain why C and D are not mutually
exclusive.
7. Bag X contains 3 black and 2 red marbles. Bag Y contains 4 black and 1 red
marble. A six-sided die has four sides marked X and 2 sides marked Y. The die
is rolled and then a bag is chosen based on the letter the die shows, and then two
marbles are selected from that bag without replacement. Determine the probability
that:
8. Pot A has 4 silver and 7 gold coins, while Pot B has 4 silver and 2 gold coins. A
coin is randomly selected from Pot A and placed in Pot B. Then, a coin is randomly
selected from Pot B and placed in Pot A. Finally, a coin is randomly selected from
Pot A. Find the probability that this coin is gold.
9. Latoya and Michael are world travellers. On any given day the probability that
Latoya is in Paris is 0.99, while the probability that Michael is in Paris is 0.98.
What is the probability that Latoya is in Paris given that only one of them is
there?
2 5
10. Given P (X 0 | Y ) = , P (Y ) = and P (X 0 ∩ Y 0 ) = 0, find P (X).
3 6
11. It is estimated that 35% of deer carry the TPC gene. Of those that carry the TPC
gene, it is estimated that 58% carry the SD gene, while 23% of the deer without
the TPC gene carry the SD gene. If a deer is randomly chosen and is found to
carry the SD gene, what is the probability it does not carry the TPC gene?