Maths 2018
Maths 2018
INSTRUCTIONS TO CANDIDATE
• This exam consists of two sections: (I) Probability and (II) Statistics.
• Attempt two questions from each section, i.e., four questions in total.
All questions carry equal marks.
• All work must be shown clearly and logically using appropriate symbols and
probability notation. Failure to do so will lose marks.
• Write down the formula you intend to use at each stage before filling it in
with numbers.
a) Assume that A and B are independent events with Pr(A) = 0.65 and
Pr(B) = 0.88. Furthermore, let C be an event which is necessary for A,
i.e., given that A has occurred, C is certain to have occurred.
Calculate the following:
i) Pr(A ∪ B) ; (2 marks)
ii) What proportion of faulty devices does Michelle miss in her inspection?
Note: these are the faulty devices that she classifies as being in
working order. (4 marks)
c) Let W ∈ [0, ∞) represent wind speed in miles per hour (mph). Assume
that W has the following density function:
2
f (w) = 2 λ w e−λ w
where λ > 0.
2
i) Show that Pr(W > w) = e−λw . (3 marks)
iii) A wind turbine only operates when the wind speed is between 10 mph
and 60 mph. Calculate the probability that the wind turbine does not
operate. Note: use λ value from part (ii). (3 marks)
Part I: Probability Question 2
a) Let X ∼ Geometric(p).
et p
M(t) = .
1 − et (1 − p) (4 marks)
b) Clearly state the appropriate distribution, and then answer the associated
short question, in each of the parts below.
iii) Assume that 35% of people have the blood type A+. In a sample
of 100 randomly selected people, what is the probability that more
than 30 of them have this blood type? (3 marks)
c) Consider a game where a die is rolled and you win if a number divisible
by three appears. The game costs e10 to play and returns e25 (i.e., a
profit of e15) if you win.
Let X be the number of times you win, and let N be the number of times
you played the game. Calculate the expected profit under both of the
above strategies. (8 marks)
Part I: Probability Question 3
Briefly explain how the above result leads to the 95% confidence interval
σ
X ± 1.96 √
n
1 2 2 4 7 7 9 10 10 14
15 16 22 23 30 30 33 34 42 54
56 57 61 129 143
i) Construct a frequency table with 5 classes (note: let “0” be the lower
limit of the first interval). (4 marks)
iii) Calculate the mean and median for this sample. State which measure
is more appropriate (provide justification). (3 marks)
ii) What is the parameter here? (provide symbol and value) (2 marks)
iii) What is the statistic here? (provide symbol and value) (2 marks)
iv) Calculate a 99% confidence interval for the parameter; does this
support the researcher’s belief? (4 marks)
5 2 2 3 1 3
Design A Design B
sample size 10 12
mean 22.2 ◦ C 21.4 ◦ C
standard deviation 3.1 ◦ C 1.4 ◦ C
i) Before comparing means, the F test was carried out, and yielded a
p-value of 0.0161.
State the null and alternative hypotheses for this test, and your
conclusion based on the above p-value reported. (3 marks)
ii) Test the hypothesis that there is no difference in the mean operating
temperatures for the two designs using the 5% level of significance,
and provide your conclusion in both statistical and non-statistical
language. (11 marks)
iii) Based on the results of parts (i) and (ii), would you recommend either
Design A or Design B – or are both equivalent? (3 marks)
Part II: Statistics Question 3
Y = ln(Lifetime)
2
L = Lifetime
40
30
0
20
−2
10
0
5 6 7 8 9 10 5 6 7 8 9 10
X = Voltage X = Voltage
c) Comment on how well the regression model fits the data based on its
coefficient of determination. (2 marks)
d) Show that, from the equation in part (b), one can obtain the equation
L = L0 ω X . Hence, or otherwise, predict the lifetime of the component
when the voltage is 2V . Explain whether or not you believe the prediction
to be accurate, and why. (4 marks)
f) Using the confidence interval from part (e), produce a confidence interval
for ω (where ω appears in the model in part (d)). (3 marks)
Useful Formulae: Page 1
Histogram:
max(x) − min(x)
• class width =
number of classes
Numerical Summaries: P
xi
• x̄ =
n
x2i − n x̄2
P
2
• s =
n−1
n+1
• Position of Qk : ×k
4
• IQR = Q3 − Q1
• LF = Q1 − 1.5 × IQR
• UF = Q3 + 1.5 × IQR
Probability:
• Pr(Ac ) = 1 − Pr(A)
Random Variables:
P R
• E(X) = xi p(xi ) or x f (x) dx
• E(aX + b) = a E(X) + b
P P RR
• E(XY ) = i j xi yj p(xi , yj ) or x y f (x, y) dx dy
• V ar(aX + b) = a2 V ar(X)
p
• Sd(X) = V ar(X)
Cov(X, Y )
• Corr(X, Y ) = p
V ar(X) V ar(Y )
Discrete Distributions:
n λx
px (1 − p)n−x • p(x) = (1 − p)x−1 p
• p(x) = x
• p(x) = x!
e−λ
1
• E(X) = n p • E(X) = p
• E(X) = λ
1−p
• V ar(X) = n p (1 − p) • V ar(T ) = p2
• V ar(X) = λ
Useful Formulae: Page 3
Continuous Distributions:
1
• f (t) = λ e−λ t • f (x) = b−a
• f (x) = not needed
b−x
• Pr(T > t) = e−λ t • Pr(X > x) = b−a
• Pr(X > x) = tables
1 b+a
• E(T ) = λ
• E(X) = 2
• E(X) = µ
1 (b−a)2
• V ar(T ) = λ2
• V ar(X) = 12
• V ar(X) = σ 2
• Z ∼ Normal(µ = 0, σ = 1)
q
2 2
⇒ Difference: X1 − X2 ∼ Normal µ1 − µ2 , σ1 + σ2
p
• For X1 , . . . , Xn ∼ any distribution with µ = E(X) and σ = Sd(X) = V ar(X)
σ
⇒ Sample mean: X ∼ Normal µ, √ if n > 30
n
Useful Formulae: Page 4
Statistics and Standard Errors:
s
µ x̄ √ large / small ν =n−1
n
r
p p̂ p̂ (1 − p̂) large confidence
interval
n
r
p0 (1 − p0 ) large hypothesis
test
n
(a + b)2
µ1 − µ2 x̄1 − x̄2 s large / small ν = a2
s21 s2
2
+ n2b−1
+ 2 n1 −1
n1 n2
s21 s2
a= , b= 2
n1 n2
s
s2p s2p
+ small ν = n1 + n2 − 2
n1 n2
assuming
(n1 − 1) s21 + (n2 − 1) s22
where s2p = σ12 = σ22
n1 + n2 − 2
s
p1 − p2 p̂1 − p̂2 p̂1 (1 − p̂1 ) p̂2 (1 − p̂2 ) large confidence
+ interval
n1 n2
s
p̂c (1 − p̂c ) p̂c (1 − p̂c )
+ large hypothesis
n1 n2 test
x1 + x2
where p̂c =
n1 + n2
Confidence Intervals:
2 × Pr(Z > |z|) if Ha : µ 6= µ0
• p-value = Pr(Z < z) if Ha : µ < µ0
Pr(Z > z) if Ha : µ > µ0
ν1 = n top − 1, ν2 = n bottom − 1
P
( yi )2
yi2 −
P
• Syy = n
P P
P ( xi )( yi )
• Sxy = xi yi − n
Sxy
• r=p
Sxx Syy
• β̂0 = ȳ − β̂1 x̄
Sxy
• β̂1 =
Sxx
e2i
P
Syy − β̂1 Sxy
• s2e = =
n−2 n−2
s
s2e
• s(β̂1 ) =
Sxx
• ν = n − 2 (degrees of freedom)
• R2 = r 2