Stats3 Topic Notes
Stats3 Topic Notes
org/step
Algebra of Expectations
Not in the formula book (X, Y do not have to be independent for the first one):
• E(aX + bY + c) = aE(X) + bE(Y ) + c
• Var(aX + b) = a2 Var(X)
Distribution functions
The probability distribution function is also called the probability density function.
If you are given the probability distribution function of X, then you can find the distribution func-
tion of related random variables, such as Y = X 2 . You do this via the cumulative distribution
function, e.g.:
FY (y) = P(Y 6 y)
= P(X 2 6 y)
√
= P(X 6 y)
Z √y
= f(t) dt
−∞
Once you have the C.D.F. for Y you can differentiate (with respect to y) and hence find the P.D.F.
The probability generating function of X is GX (t) = E(tX ), so if the random variable takes integer
values greater than or equal to 0 we have:
We have GX (1) = 1 as this is the sum of all the possible probabilities. The probability P(X = r)
is the coefficient of tr in the P.G.F. expansion.
These formulae, and the generating functions for Binomial, Poisson and Geometric distributions
are given in the formula book.
Z
Substituting t = 0 gives MX (0) = f(x)dx = 1.
In a similar way you can differentiate with respect to t again and show that M00X (0) = E(X 2 ) and
2
hence Var(X) = M00X (0) − M0X (0) .
These formulae and the M.G.F.’s for the Uniform, Exponential and Normal distributions are given
in the formula book.
Sampling
Geometric Distribution
The Geometric distribution models the number of trials needed up to and including the first “suc-
cess”. It has one parameter, p, the probability of success. It is not on all A-level specifications, but
does sometimes appear in STEP questions.
For example, X could be the number of rolls of a dice until a six is rolled. Then the probability of
success is p = 16 and X ∼Geo 16 .
The probability of a six occurring on the rth roll (and not before) is P(X = r) = (1 − p)r−1 p =
5 r−1 1 th roll).
6 6 (as the first r − 1 rolls are all “not sixes” followed by a six on the r
Writing q = 1 − p we have:
1
=p× Sum of a infinite GP
1−q
1
=p× =1 as expected!
p
0 + 1 + 2q + 3q 2 + 4q 3 + . . . = (1 − q)−2 (*)
1 p
We then have E(X) = p × (1−q)2
= p2
= p1 .
2−p
Hence E(X 2 ) = p + 22 qp + 32 q 2 p + 42 q 3 p + · · · = p2
and
2
Var(X) = E(X 2 ) − E(X)
2
2−p 1
= 2
−
p p
1−p
=
p2