3 ProbabilityDistributions
3 ProbabilityDistributions
• We write ~ Bernoulli () or
• Also,
• (show)
• (show)
• where
• We write
• We may show
•
•
• When then
• Practical Situations
• There are 100000 vehicles on the streets of Lucknow.
• On a given day, the probability of a vehicle meeting with an accident is 0.00005.
• accidents per day (accident rate)
• A machine produces 50000 parts per day, with 0.001 probability of a part being defective.
• 0 (average number of defective parts produced in a day)
Gaurav Garg (IIM Lucknow)
• The Poisson distribution and Binomial Distribution have some similarities
and differences.
• Binomial Distribution describes a distribution of two possible outcomes – Success
and Failure.
• Poisson Distribution focuses only on the number of discrete occurrences over some
interval.
• A Poisson experiment does not have a given number of trials as a binomial
experiment does.
• The Poisson describes the occurrence of rare events. E.g., no. of aircraft accidents.
• If the occurrences are too frequent, the time interval can be reduced enough so that the
occurrences are rare.
• Other Applications:
• No. of deaths from a disease (not in the form of an epidemic);
• No. of suicides reported in a particular city;
• No. of defective items in a packing manufactured by a good concern;
• No. of printing mistakes on each page of a book.
Gaurav Garg (IIM Lucknow)
• Example:
• In a book of 520 pages, 390 typographical errors occur.
• Assuming Poisson probability law for the number of errors per page, find the
probability that a random sample of 5 pages will contain no error.
• Ans: e – 3.75
• Example:
• Six coins are tossed 6400 times.
• Using the Poisson Distribution, find the approximate probability of getting six
heads 150 times.
• Ans:
• Then Poisson
• Example:
• Suppose that the number of calls at a customer care unit between 10:00 AM and
11:00 AM is a random variable X.
• X~ Poisson (2).
• Similarly, the calls arriving between 11:00 AM and 12:00 Noon is Y.
• Y ~ Poisson (6).
• X and Y are independent
• What is the probability that more than 5 calls come between 10:00 AM and 12:00
Noon?
• Ans: 0.8088
Gaurav Garg (IIM Lucknow)
Negative Binomial Distribution Discrete Distributions
• If we toss a coin a fixed number of times (say times).
• Number of heads out of tosses is a random variable following a binomial distribution.
• If we continue tossing the coin until we get a fixed number of heads (say r heads).
• Number of tosses needed to get heads is a random variable.
• This random variable is said to have a Negative Binomial Distribution.
• In general,
• we have a random experiment having two complimentary outcomes – Success and Failure
• Probability of Success (𝑝) remains constant across trials
• Trials of the experiment are independent of the one another.
• We repeat the trials till we get Successes.
• The number of trials
• Then,
•
•
•
Gaurav Garg (IIM Lucknow)
• A discrete random variable X is said to have a Negative Binomial
Distribution if its pmf is given by
• We write
• (!!!)
• (!!!)
• A particular case
• When is said to have a Geometric distribution with parameter .
• Negative Binomial Distribution may also be defined as the distribution of
=number of failures
Sample
Population X Red Balls + (n-X) White Balls
A Red Balls + (N – A) White Balls
• pmf of is
• We write X~U(a, b)
• (show)
• Var (show)
1/(b-a)
a t b
P(X ≤ t)
Gaurav Garg (IIM Lucknow)
• Example
• A company manufactures field rifles for the Defense forces.
• One of the production lines is producing firing pins with length varying from 0.85 inch
to 1.05 inch with equal probability.
• Firing pins with length less than 0.90 inch or greater than 1.00 inch do not meet the
specifications and must be discarded.
• Find the fraction of the pins that must be discarded.
• Required Fraction =
• Example
• Department of Transportation (DOT) invites the bids for the construction of a road.
• The DOT estimate of the cost of the job is .
• The qualifying bid can take any value in the interval with equal probabilities.
• What fraction of qualifying bids on road construction contracts are less than the DOT
estimate?
• 𝑋 ~ 𝑈 (2𝑑/5, 2𝑑)
• Ans = 𝑃(𝑋< 𝑑)×100 %= 37.5%
Gaurav Garg (IIM Lucknow)
Exponential Distribution Continuous Distributions
• Exponential distribution is used widely to model
• Waiting time in a bank’s queue;
• Life time of some component or human being; etc.
• We write
• Var
• is usually the average number of occurrences (or arrivals) per unit time.
Gaurav Garg (IIM Lucknow)
• Example
• A processing machine at Wiley Inc. breaks down on an average of once in four
weeks.
• What is the probability that the next breakdown will not occur for at least six
weeks after the previous breakdown?
• Assume that the time between breakdowns has an exponential distribution.
• Example
• A teller at New York Industrial Bank serves, on average, 30 customers per hour.
• Assume that the service time for a customer has an exponential probability
distribution.
• What is the probability that the next customer will take two to four minutes to be
served.
Gaurav Garg (IIM Lucknow)
• Memoryless Property of exponential distribution
• has exponential distribution with mean
• for any
• After 10 minutes:
• For exponential distribution, it is the same as , no matter how long you have waited.
Gaurav Garg (IIM Lucknow)
• Relationship between Poisson and Exponential Distributions
• If
• Then, the time-gap between any two successive occurrences follows Exp().
• We write X ~ N(μ, σ ).
• Var
μ
Gaurav Garg (IIM Lucknow)
= 10 = 10 = 10
=5 = 10 = 15
=5
= 10
= 15
=0
=5
= 15 = 10
=5 = 10 = 15
Gaurav Garg (IIM Lucknow)
• In a continuous distribution
• Probability is given by the area below the probability curve.
Area =0.341345
100 130
σ = 30
μ = 60
σ = 35 Area = 0.341345
60 95
σ = 35
• For a N(, ) distribution
• approximately 68% of all the values lie in (-, +)
• approximately 95.5% of all the values lie in (-2, +2)
• approximately 99.7% of all the values lie in (-3, +3)
σ σ 2σ 2σ
.15% of .15% of
area 99.7% area
of area
3σ 3σ
• Linearity Property of Normal Distribution
• are constants
• s a linear function of
• Then,
• Standard Normal Distribution
• Choose
• Then,
• Thus, we can transform any distribution to distribution
• is called standard normal distribution
• is called standard normal variable
X~N(10, σ)
(X+5)~N(15, σ)
(X-5)~N(5, σ)
=5 = 10 = 15
X/2~N(0, 5)
X~N(0, 10)
2X~N(0, 20)
=0
z value corresponding to a
N(0,1) N(,)
Za= (a-)/ a
• Computing Normal Probabilities
• Alternatively,
• Values of depend on .
• Statistical Table of values is available. Here
• Therefore,
•
a b
F(a) F(b)
• Example
• Daily cement production of a plant follows normal distribution with mean of 50
tons and standard deviation of 3.5 tons.
• On how many days in a year the production is expected to exceed 55 tons.
4
76 80
• Ans = 0.1587
• Example:
• Time required to finish an exam is known to be normally distributed with a mean
of 60 Min. and a Std Dev. of 12 minutes.
• How much time should be allowed in order for 90% of the students to finish?
.9 12
60 x
n=4 n=10
n=25
n=20
Gaurav Garg (IIM Lucknow)
• Let X ~ B(n, p),
• P(X=20) = P(19.5 < X < 20.5) = P(-1.1 < Z < -0.9) = 0.1841 – 0.1357 = 0.0484
Exponential ()
n ∞, p 0
Poisson ()
Bernoulli Trial Binomial (n, p)
n ≥ 20, p ≤ .05
large
n large > 10
np > 5, nq > 5
Normal (, )
??
Uniform (a, b)