Binomial and Poisson Distributions
Binomial and Poisson Distributions
𝑘
𝜎 𝜎
Answer:
1. Yes
2. No
0% 0%
20
1 2 Countdown
Binomial or not?
Note: If the box were much larger then consecutive picks would be nearly independent
- Binomial then a good approximation.
e.g. if box of 1000 screws, with 500 galvanized
499 1
𝑃 ( second galvanized|first galvanized )= =0.4995 ≈
999 2
Mean and variance of a binomial distribution
If , 𝜇= 𝐸 ( 𝑋 )= 𝑛𝑝 = var(X) =
Derivation
Suppose first that we have a single Bernoulli trial. Assign the value 1 to success, and 0 to
failure, the first occurring with probability p and the second having probability 1 − p.
Since the n trials are independent, the total expected value is just the sum of the
expected values for each trial, hence .
Hence the variance for the sum of n independent trials is, by the rule for
summing the variances of independent variables,
Mean and variance of a
Polling binomial distribution
1. 6.3
2. 12.6
3. 25.3
4. 120
5. 160 0% 0% 0% 0% 0% 30
1. 2. 3. 4 5 Countdown
Polling
( 𝑘¿) 𝑝
𝑃 ( 𝑋 =𝑘 )=
𝑛 𝑘
( 1 − 𝑝 )𝑛 − 𝑘
The probability of events per unit time does not have to be constant for the total
number of events to be Poisson – can split up the total into a sum of the number of
events in smaller intervals.
Example: On average lightning kills three people each year
in the UK, What is the probability that only one person is
killed this year?
Answer:
Assuming these are independent random events, the
number of people killed in a given year therefore has a
Poisson distribution:
𝑃 ( 𝑋 = 𝑘)
𝑘
Let the random variable be the number of people killed in a year.
Reminder: 0% 0% 0% 0% 20
1 2 3 4 Countdown
Poisson distribution
−𝜆 𝑘 −6 5
𝑒 𝜆 𝑒 6
𝑃 ( 𝑋 =𝑘=5 )= =
𝑘! 5!
Mean and variance
Messages arrive at a switching centre at random and at an average rate of 1.2 per
second.
(b) For how long can the operation of the centre be interrupted, if the probability of
losing one or more messages is to be no more than 0.05?
Answer:
−𝜆 𝑘 − 2.4 5
𝑒 𝜆 𝑒 2.4
𝑃 ( 𝑌 =𝑘=5 )= = =0.060
𝑘! 5!
Question: (b) For how long can the operation of the centre be interrupted, if the
probability of losing one or more messages is to be no more than 0.05?
Answer:
(b) Let the required time = t seconds. Average rate of arrival is 1.2/second.
Poisson, with
The Poisson distribution is an approximation to B(n, p), when n is large and p is small
(e.g. if , say).
The probability of a certain part failing within ten years is 10 -6. Five million of the parts have
been sold so far.
What is the probability that three or more will fail within ten years?
Answer:
Evaluating the Binomial probabilities is rather awkward; better to use the Poisson
approximation.
For such small and large the Poisson approximation is very accurate
(exact result is also 0.875 to three significant figures).
Poisson Distribution Summary
Describes discrete random variable that is the number of independent and randomly
occurring events, with mean number . Probability of such events is
−𝜆 𝑘
𝑒 𝜆
𝑃 ( 𝑋 =𝑘 )=
𝑘!
if then where
Continuous Random Variables
A continuous random variable is a random variable which can take values measured on a
continuous scale e.g. weights, strengths, times or lengths.
𝑓 ( 𝑥)
Probability of in the range a to b.
Normalization:
Variance Variance of :
∞
¿ ∫ 𝑥 𝑓 ( 𝑥 ) 𝑑𝑥 −𝜇
2 2
−∞
Note: the mean and variance may not be well defined for distributions with broad tails.
Mean
𝑃 ( 𝑋 < 𝑥 )=0.5
The mode is the value of where is
maximum (which may not be unique). Mode
𝑃 ( 𝑋 > 𝑥 )=0.5
Median
Probability density function
The continuous random variable has the Uniform distribution between and , with if
1 1 x 2
𝑓 ( 𝑥 ) =¿ 2 1 otherwise
0
, for short.
f(x)
2 x
1
Occurrence of the Uniform distribution
1) Waiting times from random arrival time until a regular event (see later)
(b) Find the mean and standard deviation of the wait time.
Wait time can be anything between 0 and 8.33ms and each time in this range is as
likely as any other time.
Mid-point
Proof:
f(x)
𝑦
Let y be the distance from the mid-
point,
𝑤 𝑤
𝜃 2 +𝜃 1
2 2
𝜃 2 +𝜃 1 1 𝜃 1+ 𝜃 2
¿
2
+ ∫ 𝑦 𝑓 ( 𝑦 ) 𝑑𝑦¿
2
+∫ 𝑦
𝑤 𝑤
𝑑𝑦 ¿
2
+0
𝑤 −
− 2
2
Unsurprisingly the mean is the midpoint!
∞ 𝜇
𝜎 = ∫ ( 𝑥 − 𝜇 ) 𝑓 ( 𝑥 ) 𝑑𝑥
2
Variance:
2
f(x)
𝑦
−∞
𝑤
2
1
¿ ∫ 𝑦2
𝑤
𝑑𝑦
−
𝑤
2 x
2 1
[ ]
3 𝑤
1 𝑦 2
𝑤=𝜃 2 − 𝜃 1
¿ 𝑤
𝑤 3 −
2
¿
1 𝑤3 𝑤 3
3𝑤 8
+
8 (
=
𝑤2
12 )
( 𝜃2 − 𝜃1 )2
¿
12
Example: Disk wait times
(b) Find the mean and standard deviation of the wait time.
Answer: (c)
𝜇=4.2 ms For 2000 reads the mean total time is 2000 4.2ms = 8.3s.