s2 Chapter 1 Binomial Distribution
s2 Chapter 1 Binomial Distribution
Distribution
Dr J Frost ([email protected])
www.drfrostmaths.com
1 Outcomes
expressed as a set
2 A probability distribution
which maps outcomes to probabilities
However, there are certain ‘off-the-shelf’ probability distributions where the probability
function is already written for us and formulae are given for 𝐸[𝑋] and 𝑉𝑎𝑟[𝑋] – we have
some limited control over them by setting the values of some variables known as
‘parameters’.
There are 5 choices for the first beads. 4 for the second, and so on.
So 5 × 4 × 3 × 2 × 1 = 5! = 120 possibilities.
? the number of ways of
𝒏! is the factorial function, and gives us
arranging 𝒏 distinguishable objects.
Factorial and Choose Function
How many ways are there of arranging 2 red beads and 3 blue beads in a line?
Q2
We can’t distinguish each of the red beads, nor the blue beads.
4 Given that in the lottery you pick 6 Hard: How many possible lottery tickets are there
distinct numbers from 1 to 49, what’s the 8
where exactly 3 of the 6 numbers match (given there
probability you win the lottery?
𝟒𝟗 were again 49 numbers to choose from)
There are = 𝟏𝟑𝟗𝟖𝟑𝟖𝟏𝟔 possible There are
𝟔
possible ways of choosing the 3 numbers
𝟔
?
tickets. So probability is
𝟏 𝟑
which matched. However there are
𝟒𝟑
ways of choosing
𝟏𝟑𝟗𝟖𝟑𝟖𝟏𝟔
? 𝟑
𝟔 𝟒𝟑
the 3 numbers which didn’t match. So × =
𝟑 𝟑
𝟐𝟒𝟔𝟖𝟐𝟎
Probability based Questions
A fair die is rolled 8 times. Find the probability of:
a) No sixes
Q b) Only 3 sixes
c) 4 twos and 4 sixes
𝑃 𝑛𝑜 𝑠𝑖𝑥𝑒𝑠 =
5
8
?
6
𝟖
But there are = 𝟐𝟖 possible ways in which 6
𝟔 ?
people in the room could be left handed.
𝟖
So probability is × 𝟎. 𝟏𝟔 × 𝟎. 𝟗𝟐 = 𝟎. 𝟎𝟎𝟎𝟐𝟐𝟔𝟖 ≈
𝟔
𝟏 𝒊𝒏 𝟒𝟒 𝟎𝟎𝟎
Test Your Understanding
I throw a unfair coin 10 times, where the probability of heads is 𝑝. What’s the
probability of throwing 3 heads?
Q Let 𝑋 be a random variable which counts the number of heads seen in 10 throws.
Determine the probability distribution.
For convenience you can use 𝑞 as the probability of tails (where 𝑞 = 1 − 𝑝)
𝒙 𝟎 𝟏 𝟐 𝟑 𝟒 𝟓 𝟔 𝟕 𝟖 𝟗 𝟏𝟎
𝑃 𝑋=𝑥 𝑞10 10
𝑝𝑞 9
10 2 8 10 3 7 10 4 6 10 5 5 10 6 4 10 7 3 10 8 2 10 9 1 𝑝10
? ?
1 ?
2
𝑝 𝑞
?
3
𝑝 𝑞
?
4
𝑝 𝑞
?
5
𝑝 𝑞
?
6
𝑝 𝑞
?
7
𝑝 𝑞
?
8
𝑝 𝑞
9 ?
𝑝 𝑞
?
4. TWO outcomes in each trial, “success” and “failure”. We’re counting heads, so throwing
a heads is the “success”.
𝑋~𝐵 𝑛, 𝑝
𝑛 𝑥 𝑛−𝑥
𝑃 𝑋=𝑥 = 𝑝 1−𝑝
𝑥
FICT
Quickfire Questions
Show the calculation required to find the indicated probability given the distribution.
𝟏𝟎
𝑋~𝐵 10,0.3 𝑷 𝑿=𝟒 = 𝟎. 𝟑𝟒? 𝟎. 𝟕𝟔
𝟒
𝟏𝟎
𝑋~𝐵 10,0.2 𝑷 𝑿=𝟑 = 𝟎. 𝟐𝟑? 𝟎. 𝟖𝟕
𝟑
𝟓
𝑋~𝐵 5,0.1 𝑷 𝑿=𝟐 = 𝟎. 𝟏𝟐 ?𝟎. 𝟗𝟑
𝟐
1. We have a fixed No, not fixed. This is
number 𝑛 trials. known as a ‘Geometric
Distribution’ (which we
won’t cover)
2. Each trial has two
possibilities,
“success” and
“failure”.
?
Usually. But in my
? ? ?
3. The trials are Technically the probability of Only if balls drawn
story, genetics has having a girl increases if you
independent. previously had a girl, and vice with replacement.
an influence on
versa. But the probability is
handedness. still close to 0.5, so Binomial
Distribution is appropriate.
a How is 𝑋 distributed?
𝑿~𝑩 ?𝟓, 𝟎. 𝟐
b Determine the probability that I chose 3 red balls.
𝟓
𝑷 𝑿=𝟑 = 𝟐𝟑 𝟎. 𝟖𝟐 = 𝟎. 𝟎𝟓𝟏𝟐
𝟎.?
𝟑
(If you get these quickly, go on to Exercise 1B)
Exercise 1B
1 A student suggests using a binomial distribution
1 The random variable 𝑋~𝐵 8, 3 5 to model the following situations. Give a
Find description of the random variable, state any
assumptions that must be made and give
𝑃 𝑋 = 2 = 𝟎. 𝟐𝟕𝟑
possible values for 𝑛 and 𝑝.
?
𝑃 𝑋 = 5 = 𝟎. 𝟎𝟔𝟖𝟑
𝑃 𝑋 ≤ 1 = 𝟎. 𝟏𝟗𝟓 a A sample of 20 bolts is checked for defects from a
large batch. The production process should
2 produce 1% of defective bolts.
3 The random variable 𝑇~𝐵 15, 𝑿~𝑩 𝟐𝟎, 𝟎. 𝟎𝟏 assuming bolts being defective
Find
3
?
are independent from each other.
?
𝑃 𝑇 = 5 = 𝟎. 𝟎𝟎𝟔𝟕𝟎 b Some traffic lights have three phases: stop 48% of
?
𝑃 𝑇 = 10 = 𝟎. 𝟐𝟏𝟒 the time, wait or get ready 4% of the time and go
?
𝑃 3 ≤ 𝑇 ≤ 4 = 𝟎. 𝟎𝟎𝟏𝟕𝟖 48% of the time. Assuming that you only cross a
traffic light when it is in the go position, model
the number of times that you have to wait or
4 A balloon manufacturer claims that 95% stop on a journey passing through 6 sets of traffic
of his balloons will not burst when blown lights.
𝑿~𝑩 𝟔, 𝟎. 𝟓𝟐 assuming lights operate
up. You have 20 balloons. independently. ?
What is the probability that none of
them burst? c When Stephanie plays tennis with Tim on
𝟎. 𝟗𝟓𝟐𝟎 = 𝟎.? 𝟑𝟓𝟖 𝒕𝒐 𝟑𝒔𝒇 average one in eight of her serves is an ‘ace’. How
many ‘aces’ does Stephanie serve in the next 30
What is the probability exactly 2 burst? serves against Tim?
= 𝟎. 𝟏𝟖𝟗 ? 𝟏
𝑿~𝑩 𝟑𝟎, assuming serves are independent
𝟖
?
and probability of an ace is constant.
Overview So Far
Still to cover:
• The Cumulative Distribution Function 𝐹(𝑥)
• Calculating 𝐸 𝑋 and 𝑉𝑎𝑟 𝑋
S2 – Chapter 2 – Binomial Distribution
We therefore use tables (From Page 135 in the newer S2 Edexcel textbooks).
Quickfire Questions
𝑃 𝑋 < 5 = 𝑷 𝑿?≤𝟒 ?
𝑃 10 ≤ 𝑋 ≤ 20 = 𝑷 𝑿 ≤ 𝟐𝟎 − 𝑷 𝑿 ≤ 𝟗
𝑃 𝑋 ≥ 7 = 𝟏 − 𝑷 ?𝑿 ≤ 𝟔 ?
𝑃 𝑋 = 100 = 𝑷 𝑿 ≤ 𝟏𝟎𝟎 − 𝑷 𝑿 ≤ 𝟗𝟗
𝑃 𝑋 > 7 = 𝟏 − 𝑷 ?𝑿 ≤ 𝟕 ?
𝑃 20 < 𝑋 < 30 = 𝑷 𝑿 ≤ 𝟐𝟗 − 𝑷 𝑿 ≤ 𝟐𝟎
𝑃 10 ≤ 𝑋 < 20 = 𝑷 𝑿 ≤ 𝟏𝟗 ?− 𝑷 𝑿 ≤ 𝟗 ?
“at least 30” = 𝑷 𝑿 ≥ 𝟑𝟎 = 𝟏 − 𝑷 𝑿 ≤ 𝟐𝟗
?
“greater than 30” = 𝑷 𝑿 > 𝟑𝟎 = 𝟏 − 𝑷 𝑿 ≤ 𝟑𝟎
Test Your Understanding
An awkward Tiffin boy ventures into Tiffin Girls. He asks 20 girls out on the date. The
probability that each girl says yes is 0.3.
Determine the probability that he will end up with:
a) Less than 6 girls on his next date.
b) At least 9 girls on his next date.
Q
The boy considers the evening a success if he dated at least 9 girls that evening.
He repeats this process across 5 evenings.
c) Calculate the probability that he had at least 4 successful evenings.
(Note: You won’t be able to use your table for (c) as 𝑝 is not a nice round number – calculate prob directly)
You can start on your homework questions if you’ve done all of these (ask for sheet).
S2 – Chapter 2 – Binomial Distribution
Note that the textbook does this in a less methodical way: but the method above is
what you find in exam mark schemes, so ignore at your peril.
Test Your Understanding
At Camford University, students have 20 exams at the end of the year. All students
pass each individual exam with probability 0.45. Students are only allowed to continue
Q into the next year if they pass some minimum of exams out of the 20.
What do the university administrators set this minimum number such that the
probability of continuing to next year is at least 90%?
𝑋~𝐵 200, 0.1 gives a binomial distribution over the possible numbers of left-
handed people.
Based on what you learnt in Year 8/9, how many people would you expect on
average to be left-handed?
𝟐𝟎𝟎 × 𝟎. 𝟏 = 𝟐𝟎
i.e. We multiplied the number of trials 𝒏 and the probability of success 𝒑 to
work out the mean number of successes.
?
The variance is much harder to work out and we won’t give the proof here.
!If 𝑋~𝐵 𝑛, 𝑝
𝐸 𝑋 = 𝜇 = 𝑛𝑝 Bro Tip: It’s perhaps easier to
𝑉𝑎𝑟 𝑋 = 𝜎 2 = 𝑛𝑝 1 − 𝑝 remember variance as 𝑛𝑝𝑞
(where 𝑞 = 1 − 𝑝 is the
probability of failure, although
these are given the formula book)
Examples
A fair, 4-sided die has the numbers 1, 2, 3, 4 on its faces. The die is rolled 20 times. The random
variable 𝑋 represents the number of 4s obtained.
David believes that 35% of people in a certain town will vote for him in the next election and he
commissions a survey. Find the minimum number of people the survey should ask to have a
Q mean number of more than 100 voting for David.
𝟎. 𝟑𝟓𝒏 = 𝟏𝟎𝟎 𝒏 = 𝟐𝟖𝟓. 𝟕
?
So 𝟐𝟖𝟔 people should be asked.
An examiner is trying to design a multiple choice test. For students answering the test at
random, he requires that the mean score on the test should be 20 and standard deviation at
least 4. Find how many choices each question should have and the number of questions there
should be. Number of choices is fixed across questions and show be as small as possible.
Q 𝒏𝒑 = 𝟐𝟎
𝒏𝒑 𝟏 − 𝒑 ≥ 𝟒 → 𝟐𝟎 𝟏 − 𝒑 ≥ 𝟒 → 𝒑 ≤ 𝟎. 𝟐
So there should be 5 choices per question. ?
𝒏𝒑 = 𝟐𝟎 → 𝟎. 𝟐𝒏 = 𝟐𝟎 → 𝒏 = 𝟏𝟎𝟎
Exercise 1D
Using tables when 𝑝 > 0.5
Just as the z-table for the Normal Distribution only had probabilities greater than 0.5,
the Cumulative Distribution Function tables only have values for 𝑝 less than 0.5!
We similarly have to do some manipulation when 𝑝 > 0.5.
𝑃 ≥ 60 = 𝑷 𝒀 ≤? 𝟒𝟎
𝑋
𝑃 < 30 = 𝑃 𝑌 > 70 = 𝟏? − 𝑷 𝒀 ≤ 𝟕𝟎
𝑋
𝑃 > 20 = 𝑃 𝑌 < 80 =? 𝑷 𝒀 ≤ 𝟕𝟗
𝑋
𝑃 𝑋
≥ 30 = 𝑷 𝒀 ≤ 𝟕𝟎 ?
𝑃 𝑋
≤ 25 = 𝑃 𝑌 ≥ 75 = 𝟏? − 𝑷 𝒀 ≤ 𝟕𝟒
𝑃 𝑋
< 30 ≤ 0.2
→ 𝑷 𝒀 > 𝟕𝟎 ≤ 𝟎. 𝟐
𝑷 𝒀 ≤ 𝟕𝟎 ≥ 𝟎.? 𝟖
𝑃 𝑋 ≤ 65 ≥ 0.55
→ 𝑷 𝒀 ≥ 𝟑𝟓 ≥ 𝟎. 𝟓𝟓
𝑷 𝒀 ≤ 𝟑𝟒 ≤ ?𝟎. 𝟒𝟓
Test Your Understanding
Historically, 75% of people who have visited KGS have been eaten by bears.
If 30 Tiffin Boys visit KGS, determine the probability that at least 20 of them will be eaten by bears.
S2 May 2012 Q8 (Part (b) was very challenging according to the Examiner’s Report!)
𝑋~𝐵 50,0.6
𝑌~𝐵 50,0.4
𝑃 𝑋 < 𝑛 = 𝑃 𝑌 > 50 − 𝑛 ≥ 0.9
?
𝑃 𝑌 ≤ 50 − 𝑛 ≤ 0.1
50 − 𝑛 ≤ 15
𝑛 ≥ 35
𝑛 = 35
And more general probability questions…
In Joe’s café 70% of customers buy a cup of tea.
a) In a random sample of 20 customers find the probability that more than 15 buy a cup of
tea.
Let 𝑿 = the number out of 20 who buy a cup of tea.
𝑿~𝑩 𝟐𝟎, 𝟎. 𝟕
Let 𝒀 = the number of customers who do not buy a cup of tea.
𝒀~𝑩 𝟐𝟎, 𝟎. 𝟑
𝑷 𝑿 > 𝟏𝟓 = 𝑷 𝒀 ≤ 𝟒 = 𝟎. 𝟐𝟑𝟕𝟓
b) The proportion of customers who buy a chocolate muffin at Joe’s café is 0.35. Find the
probability that Joe sells the first chocolate muffin to his fifth customer.
The first four customers don’t buy a muffin, and the fifth does.
?
𝟎. 𝟔𝟓𝟒 × 𝟎. 𝟑𝟓 = 𝟎. 𝟎𝟔𝟐𝟓 (𝒕𝒐 𝟑𝒔𝒇)
c) Find the probability that in a random sample of 20 customers more than 15 buy a cup of
tea and a least 6 buy a chocolate muffin.
S2 May 2013 Q7
? 𝟎. 𝟏
𝑿~𝑩 𝒏,
𝒀~𝑩 𝟏𝟎, 𝟎. 𝟏
𝑷 𝒀≥𝟒
?
=𝟏−𝑷 𝒀≤𝟑
= 𝟎. 𝟎𝟏𝟐𝟖
𝑷 𝒀≥𝟏
=𝟏−𝑷 𝒀=𝟎
𝟏 − 𝟎. 𝟗𝒏 > 𝟎. 𝟗𝟓
?
𝟎. 𝟗𝒏 < 𝟎. 𝟎𝟓
𝒏 > 𝟐𝟖. 𝟒
𝒏 ≥ 𝟐𝟗
𝑃 𝑋 = 0 = 1 − 𝑝 12 = 0.05
12
1 − 𝑝 = 0.05 = 0.779
𝑝 = 0.221 ?
Exercise 1C