0% found this document useful (0 votes)
13 views5 pages

Unit 9 StatProbRevision

This document is a revision booklet for Statistics and Probability, covering essential formulas for mean, variance, standard deviation, and probability laws. It includes a variety of easier and harder questions related to data analysis, probability calculations, and statistical concepts. The booklet serves as a comprehensive guide for students preparing for assessments in this subject area.

Uploaded by

jh seo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views5 pages

Unit 9 StatProbRevision

This document is a revision booklet for Statistics and Probability, covering essential formulas for mean, variance, standard deviation, and probability laws. It includes a variety of easier and harder questions related to data analysis, probability calculations, and statistical concepts. The booklet serves as a comprehensive guide for students preparing for assessments in this subject area.

Uploaded by

jh seo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

UNIT 8

Statistics & Probability


Revision booklet
IM3+

May 27, 2021

Formulae
Mean of a data set
Xn Xn
xi xi fi
i=1 i=1
x̄ = = n
n X
fi
i=1

Variance & standard deviation for a data set


v
Xn u n
uX
2
(xi − x̄) u
u (xi − x̄)2
v = i=1 i=1
t
s=
n n
Probability laws
P (A ∪ B) = P (A) + P (B) − P (A ∩ B)
P (A ∩ B) P (B | A)P (A)
P (A | B) = =
P (B) P (B | A)P (A) + P (B | A0 )P (A0 )
Mutually exclusive events
P (A ∩ B) = 0
Independent events
P (A ∩ B) = P (A)P (B)
Expected value
n
X
E(X) = xi pi
i=1

1
Easier questions
1. A sample of 15 measurements has a mean of 14.2 and a sample of 10 measurements
has a mean of 12.6. Find the mean of the total sample of 25 measurements.

2. Consider the data-set 7, 5, 7, 2, 8, 7.

(a) Determine the mean and standard deviation for this data-set.
(b) If each number is increased by 2, determine the new mean and standard
deviation.

3. The mean height of footballers in a league competition is 178 cm. The standard
deviation is 4 cm. Assuming the heights are normally distributed, calculate the
percentage of footballers with a height between 174 cm and 180 cm.

4. Two baseballers compare their batting performances for a ten game stretch. The
number of safe hits per game were recorderded as

Daryl 5 4 1 0 5 4 0 5 4 2
Tom 1 2 3 3 3 4 6 2 3 3

(a) Show that each baseballer has the same mean and range.
(b) Calculate the standard deviation for each distribution.
(c) Hence comment on which baseballer is more likely to have more safe hits per
game.

5. The number of customers received per day in a shop was recorded over a period
of 99 days, as shown in the table below.

Number 0 − 19 20 − 29 30 − 39 40 − 49 50 − 80
Frequency 18 17 25 15 24

(a) Construct a bar-graph to illustrate the data.


(b) Using 1 mm ≡ 1 customer, construct a cumulative frequency graph.
(c) Using your graph, estimate the median number of customers received by the
shop per day.
(d) Calculate estimates for the mean number of customers received by the shop
per day and the standard deviation.

6. A six–sided die and coin are tossed. What is the probability that

(a) a multiple of 3 and head is obtained


(b) a factor of 15 or tail is obtained.

2
7. Bag X contains 3 black and 2 red marbles. Bag Y contains 4 black and 1 red
marble. A bag is selected at random and then two marbles are selected without
replacement. Determine the probability that:

(a) both marbles are red


(b) two black marbles are picked from Bag Y.

8. In a class of 24 students, 10 study Biology, 12 study Chemistry and 5 study neither


Biology nor Chemistry.

(a) Construct a fully labelled Venn Diagram illustrating the above information.
(b) Find the probability that a randomly selected student from the class studies:
i. Chemistry, but not Biology
ii. both Chemistry and Biology

Harder questions
1. Consider the data-set x1 , x2 , x3 , . . . , xn .

(a) Write down the formulas for the mean, x̄, and variance, s2 , for this data-set.
(b) If each number is increased by m, determine the new mean, x̄new and variance,
s2new .

2. A set of ten data-values, x1 , x2 , . . . , x10 , has mean x̄ and standard deviation s.


Using the formulae for each of these prove that if each data value is increased by
30%, then both the mean and standard deviation increase by 30% as well.

3. Each athlete on a running team recorded the distance (M miles) they ran in 30
minutes. The median distance is 4 miles and the interquartile range is 1.1 miles.
The information is shown in the following box-and-whiskers plot.

The distance in miles, M , can be converted to the distance in kilometres, K, using


the formula K = 58 M . The variance of the distances run by the athletes is 16
9 km .
2

The standard deviation of the distances is b miles.


A total of 600 athletes from teams compete in a 5 km race. The times the 600
athletes took to run the 5 km race are shown in the following cumulative frequency
graph.

3
There were 400 athletes who took between 22 and m minutes to complete the 5
km race.

(a) Find the value of a.


(b) Write down the value of the median distance in kilometres (km).
(c) Find the value of b.
(d) Find m.
(e) The first 150 athletes that completed the race won a prize. Given that an
athlete took between 22 and m minutes to complete the 5 km race, calculate
the probability that they won a prize.

4. Events A and B are independent with P (A ∪ B) = 0.9 and P (A ∩ B) = 0.4. Find


P (A) and P (B) given that P (A) > P (B).

5. Six cards, 3 hearts, 2 diamonds and 1 club, are placed face down and randomly
shuffled. You win 10 dollars if you choose the club and you win 5 dollars if you

4
chose a diamond. Determine the greateet losing amount, should you choose a heart
such that the game is worth your while to play.

6. Suppose P (C) = 0.6 and P (D) = 0.7. Explain why C and D are not mutually
exclusive.

7. Bag X contains 3 black and 2 red marbles. Bag Y contains 4 black and 1 red
marble. A six-sided die has four sides marked X and 2 sides marked Y. The die
is rolled and then a bag is chosen based on the letter the die shows, and then two
marbles are selected from that bag without replacement. Determine the probability
that:

(a) both marbles are red


(b) two black marbles are picked from Bag Y
(c) the marbles came from Bag X, given that they are different colours.

8. Pot A has 4 silver and 7 gold coins, while Pot B has 4 silver and 2 gold coins. A
coin is randomly selected from Pot A and placed in Pot B. Then, a coin is randomly
selected from Pot B and placed in Pot A. Finally, a coin is randomly selected from
Pot A. Find the probability that this coin is gold.

9. Latoya and Michael are world travellers. On any given day the probability that
Latoya is in Paris is 0.99, while the probability that Michael is in Paris is 0.98.
What is the probability that Latoya is in Paris given that only one of them is
there?
2 5
10. Given P (X 0 | Y ) = , P (Y ) = and P (X 0 ∩ Y 0 ) = 0, find P (X).
3 6
11. It is estimated that 35% of deer carry the TPC gene. Of those that carry the TPC
gene, it is estimated that 58% carry the SD gene, while 23% of the deer without
the TPC gene carry the SD gene. If a deer is randomly chosen and is found to
carry the SD gene, what is the probability it does not carry the TPC gene?

12. Roughly 1 in 1,000,000 is a murderer. In an ongoing murder investigation a blood


sample is taken from the murder scene. A DNA test, which accurately identifies 99
out of 100 blood samples, is performed on a randomly selected person. The DNA
test falsely identifies 1 out of 1,000 samples (false positive. The test results in a
positive match. What is the probability that this person is the murderer? [Tree
diagram helps]

You might also like