ChapterStat 2
ChapterStat 2
Kavikumar PhD
October 1, 2018
Outline
Introduction
Binomial Distribution
Poisson Distribution
Poisson Approximation to the Binomial Distribution
Probability
Distribution
Standard Normal
Distribution
Binomial Distribution
Binomial Distribution is a basic distribution that models chance
variation in repetitions of an experiment that has only two possible
outcomes.
Experiment
Success Failure
p q
Probability : p+q =1
Binomial Experiment: The experiment consists of repeating a trial
over certain number of times. Each trial had two possible
outcomes.
Binomial Distribution B(n, p)
r
Solution: The binomial 0distribution
1 is2given3by the4 following table
1 4 6 4 1
p(r) x 16 016 1 162 16 3 416
1 4 6 4 1
P(X = x) 16 16 16 16 16
ponding histogram is shown in Figure 19.1
The corresponding histogram is shown in Figure:
Problem 1
Suppose that in a box of 100 computer chips, the probability of a
chip to be defective is 3%. Inspection process for defective chips
consists of selecting with replacement 5 randomly chosen chips in
the box and to send the box for shipment if none of the five chips
is defective. Write down the random variable, the corresponding
probability distribution and then determine the probability that the
box described here will be allowed to be shipped.
Now,
Solution:.
We have n = 12 and p = 0.5.
Thus, Var (X ) = np(1 − p) = 6(1
√ − 0.5) = 3.
The standard deviation is σ = 3.
Example 1
Suppose a tennis player makes a successful first serve 70% of the
time. Assume that each service is independent of the others.
Assume he serves 80 times in a match:
1. What are the mean and SD of the number of good first serves
expected?
2. Verify that you can use a normal mode to fit the distribution
of the number of good serves.
Proof.
The first serve is a Bernoulli trial, with P(Good Serve)=0.7. Let
X =number of good serves in 80 times. Then X ∼ B(80, 0.7).
Then
1. µ = np = 80(0.70) √
= 56 and
√
SD(X ) = npq = 56 × 0.3 = 4.10
2. Since, np = 56 ≥ 10, nq = 24 ≥ 10, B(80, 0.7) may be
approximated by N(56, 4.10).
Example 2
Proof.
Let X be the number of mice that contract the disease of 5 mice
inoculated. For
n= 5, and p = 0.4, we have
5
P(X = x) = (0.4)x (0.6)5−x , x = 0, 1, 2, · · · , 5
x
1. P(X = 0) = 0.0778.
2. P(X < 2) = P(X ≤ 1) = 0.3370.
3. P(X > 3) = 1 − P(X ≤ 3) = 1 − 0.9130 = 0.0.870.
Example 6
In a study of brand recognition, 95% of consumers recognized
Coke. The company randomly selects 4 consumers for a taste test.
Let X be the number of consumers who recognize Coke of 4
consumers.
1. Write out the PMF table for this.
2. Find the probability that among the 4 consumers, 2 or more
will recognize Coke.
3. Find the expected number of consumers who will recognize
Coke.
4. Find the variance for the number of consumers who will
recognize Coke.
Solution: For n = 4 and p = 0.95, we have
4
P(X = x) = (0.95)x (0.05)4−x , x = 0, 1, 2, 3, 4
x
Solution
1.
X p(x)
4
0 (0.95)0 (0.05)4 = 0.0000062
0
4
1 (0.95)1 (0.05)3 = 0.00047
1
4
2 (0.95)2 (0.05)2 = 0.01354
2
4
3 (0.95)3 (0.05)1 = 0.17147
3
4
4 (0.95)4 (0.05)0 = 0.81451
4
2.
P(X ≥ 2) = P(X = 2) + P(X = 3) + P(X = 4)
= 0.01354 + 0.17147 + 0.81451
= 0.99952
Solution
3.
or
E (X ) = np = 4 × 0.95 = 3.8
4.
Var (X ) = np(1 − p) = 4 × (0.95) × (0.05) = 0.19
Example 7
A company installs new central heating furnaces, and has found
that for 15% of all installations a return visit is needed to make
some modifications. Six installations were made in a particular
week. Assume independence of outcomes for these installations.
1. What is the probability that a return visit was needed in all of
these cases?
2. What is the probability that a return visit was needed in none
of these cases?
3. What is the probability that a return visit was needed in more
than one of these cases?
Solutions:
1.
6
P(X = 6) = (0.15)6 (0.85)0
6
= 1 × (0.0000114) × 1 = 0.0000114
Example 7
2.
6
P(X = 0) = (0.15)0 (0.85)6
0
= 1 × (1) × 0.3771 = 0.3771
3.
I When b = 4,
y 0 1 2 3 4
profit -40 -25 -10 5 20 and
P(Y = y ) 0.017 0.087 0.195 0.260 0.441
e −λ λx
P(X = x) = , x = 0, 1, · · ·
x!
where: λ is mean number of event.
x is number of success event.
Mean and Variance of the Poisson Distribution
The mean E (X ) of the Poisson distribution is
∞
X ∞
X e −λ λx−1 X e λ λx ∞
e −λ λx
E (X ) = x. =λ =λ = λe −λ e λ = λ.
x! (x − 1)! x!
x=0 x=1 x=0
Then, we find
E (X 2 ) = E (X (X − 1)) + E (X ) = λ2 + λ.
Thus,
Var (X ) = E (X 2 ) − (E (X ))2 = λ.
Problem 1
Solution:
The probability that 4 accidents will occur on a given day is given
by
(2.1)4
P(X = 4) = e −2.1 ≈ 0.0992.
4!
Problem 2
The number of people entering a movie theatre averages one every
two minutes. Assuming that a Poisson distribution is appropriate.
1. What is the probability that no people enter between 12:00
and 12:05?
2. Find the probability that at least 4 people enter during
[12:00,12:05].
Solution:
1. Let X be the number of people that enter between 12:00 and
12:05. We model X as a Poisson random variable with
parameter λ, the average number of people that arrive in the
5-minute interval. But, if 1 person arrives every 2 minutes, on
average (so 1/2 a person per minute), then in 5 minutes an
average of 2.5 people will arrive. Thus, λ = 2.5. Now
2.50
P(X = 0) = e −2.5 = e −2.5 .
0!
Problem 2
2.
P(X ≥ 4) = 1 − (PX ≤ 3)
= 1 − P(X = 0) − P(X = 1) − P(X = 2) − P(X = 3)
2.50 2.51 2.52 2.53
= 1 − e −2.5 − e −2.5 − e −2.5 − e −2.5
0! 1! 2! 3!
Problem 3
The number of weekly life insurance sold by an insurance agent
averages 3 per week. Assuming that this number follows a Poisson
distribution, calculate the probability that in a given week the
agent will sell
1. some policies
2. 2 or more policies but less than 5 policies.
3. Assuming that there are 5 working days per week, what is the
probability that in a given day the agent will sell one policy?
Solution:
1. Let X be the number of policies sold in a week. Then
e −3 30
P(X ≥ 1) = 1 − P(X = 0) = 1 − ≈ 0.95021.
0!
2.
P(2 ≤ X < 5) = P(X = 2) + P(X = 3) + P(X = 4)
e −3 32 e −3 33 e −3 34
= + + ≈ 0.61611.
2! 3! 4!
Problem 3
e −0.6 (0.6)
P(X = 1) = ≈ 0.32929.
1!
Problem 4
Solution: Since the book has 1 misprint per 10 pages, the number
of misprints in a stack of 50 pages is 5. Thus, X is a Poisson
random variable with parameter
λ = 5.
Hence,
E (X ) = λ = 5
and √
σ= 5.
Problem 6
Let X represent the number of customers arriving during the
morning hours and let Y represent the number of customers
arriving during the afternoon hours at a diner. You are given:
1. X and Y are Poisson distributed.
2. The first moment of X is less than the first moment of Y by
8.
3. The second moment of X is 60% of the second moment of Y .
Calculate the variance of Y .
Solution: We have
E (X ) = E (Y ) − 8 ⇒ E (X )2 = E (Y )2 − 16E (Y ) + 64. But
E (X 2 ) = Var (X ) + E (X )2 = E (X ) + E (X )2 = 0.6E (Y 2 ) =
0.6(E (Y ) + E (Y )2 ). Hence
0.6(E (Y ) + E (Y )2 ) = E (Y )2 − 16E (Y ) + 64 yields the quadratic
equation 0.4E (Y )2 − 15.5E (Y ) + 56 = 0 whose roots are
E (Y ) = 4 and E (Y ) = 35. The value E (Y ) = 4 yields
E (X ) = 4 − 8 = −4 which is impossible since the expected value of
a Poisson random variable is always positive. Hence, E (Y ) = 35.
Problem 7
Suppose hurricanes arrive with a mean of 2.35 per year. Assuming
Poisson model,
1. What is the probability that no hurricanes next year?
2. What is the probability that during the next 2 years, there is
exactly one hurricane?
Solution: Let X =number of hurricanes in a year. Given,
X ∼ P(2.35). Hence, E (X ) = λ = 2.35.
e −2.35 (2.35)0
1. P(no hurricane next year) = P(X = 0) = =
0!
0.095.
2.
P(exactly one hurricane in next 2 year)
= P(one in I year) × P(no hurricane in II year)
+ P(no hurricane in I year) × P(one hurricane in 2 year)
= P(X1 = 1)P(X2 = 0) + P(X1 = 0)P(X2 = 1)
= 2.e −4.70 (2.35) = 0.04347,
Problem 8
X ∼ B(n, p) → X ∼ P(x, µ)
Problem 1
In a group of 100 individuals, let X be the random variable
representing the total number of people in the group with a
birthday on Thanksgiving day. Then X is a binomial random
variable with parameters n = 100 and p = 1/365. What is the
probability at least one person in the group has a birthday on
Thanksgiving day?
Solution: We have
100 1 0 364 100
P(X ≥ 1) = 1−P(X = 0) = 1− ≈ 0.2399.
0 365 365
Proof.
We have
λk+1
e −λ
P(X = k + 1) (k + 1)!
=
P(X = k) λk
e −λ
k!
λ
=
k +1
Normal Distribution
Definition
A random variable X has a normal distribution and it is referred to
as a normal random variable if and only if its probability density is
given by
1 1 x−µ 2
N(x; σ 2 ) = √ e − 2 ( σ ) for − ∞ < x < ∞
σ 2π
where σ > 0.
ntinuous Random Variables and Distributions
Normal Density Curves
rmal Density Curves
µ = 0; =1
µ = 0; = 0.6
0.6 µ = 0; =2
µ = 2; = 0.8
0.4
0.2
0
6 4 2 0 2 4 6
Example
Solution: We have
Z 950
1 (x−950)2
P(947 ≤ X ≤ 950) = √ e− 200 dx ≈ 0.118.
10 2π 947
The Standard Normal Distribution
Definition
Standard Normal Distribution is that normal variable, X transform
into Standard Normal Variable, Z with mean, µ = 0 and standard
deviation σ = 1.
X −µ
If X ∼ N(µ, σ 2 ), then Z = has a standard normal
σ
N(0, 1) distribution.
z curve
99th percentile
z curve
=
P(Z < a) = P(Z ≤ a) = 1 − P(Z < a)
For example P(Z < 1) = 1 − 0.1587 = 0.8413
−
P(Z > −a) = 1 − P(Z < a)
For example P(Z > −1) = 1 − 0.1587 = 0.8413
−
P(|Z | > a) = 2 × P(Z > a)
= 2×
P(|Z | < a) = 1 − 2 × P(Z > a)
For example P(|Z | < 1) = 1 − 2 × 0.1587 = 0.6827
−2×
P(a < Z < b) = P(Z > a) − P(Z > b)
P(0.25 < Z < 1.5) = 0.3345,
P(Z > 0.25) − P(Z > 1.5) = 0.4013 − 0.0668 = 0.3345
−
P(−a < Z < −b) = P(Z > b) − P(Z > a)
P(−1.5 < Z < −0.25) = 0.3345,
P(Z > 0.25) − P(Z > 1.5) = 0.4013 − 0.0668 = 0.3345
−
P(−a < Z < b) = 1 − P(Z > a) − P(Z > b)
P(−1 < Z < 2) = 0.8186,
1 − P(Z > 1) − P(Z > 2) = 1 − 0.1587 − 0.0228 = 0.8186
− −
Problem 1
0.15
0.10
0.05
0.00
0 5 10 15 20
x
Normal Approximation to Binomial
Solution: