Prob. Distribution
Prob. Distribution
Probability Distributions
Random Variables
• Random variable is a
function that maps every
outcome in the sample
space to a real number.
i =1
= VAR (X )
Probability Density Function (pdf)
• The probability density function, f(xi), is defined as
probability that the value of random variable X lies
between an infinitesimally small interval defined by xi
and xi + x
P( xi X xi + x)
f ( x) = lim
x →0 x
Cumulative Distribution Function (CDF)
• The cumulative distribution function (CDF) of a
continuous random variable is defined by
a
F (a) = P( X a) =
−
f ( x)dx
+
F () = f ( x ) dx = 1
−
b
P(a X b) = f ( x)dx = F (b) − F (a)
a
• The expected value of a continuous random variable,
E(X), is given by
+
E( X ) = xf ( x)dx
−
n n
n
Var( X ) = ( x − E ( X )) PMF( x) = ( x − E ( X )) 2 p x (1 − p) n − x = np(1 − p)
2
x =0 x =0 x
If the number of trials (n) in a binomial distribution is large, then it can be approximated
by normal distribution with mean np and variance npq.
Example
(d)The average number of customers who are likely to return the items is
E(X) = n × p = 20 × 0.1 = 2
(e) Variance of a binomial distribution is given by
Var(X) = n × p × (1 − p) = 20 × 0.1 × 0.9 = 1.8
and the corresponding standard deviation is 1.3416
Poisson Distribution
• Poisson distribution is used when we have to find the
probability of number of events
• The probability mass function of a Poisson distribution is given
by
e− k
P( X = k ) = , k = 0, 1, 2, ...
k!
• where is the rate of occurrence of the events per unit of
measurement
• Cumulative distribution function of a Poisson distribution is
given by
e− k
k
P[ X k ] =
i =0 k!
• The mean and variance of a Poisson random variable are given by E( X ) =
and Var( X ) =
Solution
P( X i) = 1 − P( X i) = 1 − [1 − (1 − p)i ] = (1 − p)i
P( X i + j X i) P( X i + j ) (1 − p)i + j
P( X i + j | X i ) = = = = (1 − p ) j
P( X i ) P( X i ) (1 − p)i
(a) Calculate the probability that the customer’s first purchase of milk
happens during the 5th visit.
(b) Calculate the average time between purchases of milk.
(c) If a customer has not purchased milk during the past 3 shopping
visits, what is the probability that the customer will not buy milk for
another 2 visits?
Solution
(a) Probability that the customer’s first purchase of milk happens
on 5th trip is given by
P( X = 5) = (1 − 0.2) 4 0.2 = 0.08192
(b) The average time between purchase of milk is
1 1
E( X ) = = =5
p 0.2
(c) Given that a customer has not purchased milk for the past 3
shopping visits, the probability that the customer will not buy
for another 2 visits is given by
f ( x) = e−x , 0
− x
F ( x) = 1 − e
• The parameter is the scale parameter and represents the rate of occurrence
of the event, (1/) is the mean time between events.
Probability density function of an exponential
distribution
P( X t + s | X t ) = P( X s )
P ( X t + s X t ) P ( X t + s ) e − (t + s ) − s
P( X t + s | X t ) = = = − t = e
P( X t ) P( X t ) e
Example
The time to failure of an avionic system follows an exponential
distribution with a mean time between failures (MTBF) of 1000
hours.
(a) Calculate the probability that the system will fail before 1000
hours.
(b) Calculate the probability that it will not fail up to 2000 hours.
(c) Calculate the time by which 10% of the systems will fail (that is
calculate P10 life)
Solution
(a) The probability that the system will fail by 1000 hours is
F (1000) = 1 − et 1
− 1000
In this case = 1/ 1000, t = 1000 so , F (1000) = 1 − e 1000 = 1 − e−1 = 0.6321
variance 2 + 2
1 2
z2
1 −
f ( z) = e 2
2
x2
z 1 −
F ( z) = e 2 dz
− 2
• By using the following transformation, any normal random
variable X can be converted into a standard normal variable
X −
Z=
• The random variable X can be written in the form of a standard
normal random variable using the relationship
X=+Z
• A simple approximation of standard normal CDF is given by
Tocher (1963)
e2 kz
P( Z z ) = F ( z )
1 + e 2 kz
where k = 2/
z 2
+ A z + A −z2 / 2
P( Z z ) = F ( z ) = 1 − 1 2 e
2 z 3 + B z 2 + B z + 2 A
1 2 2
Example
x− 90 − 68
Z= = = 1.8333
12
That is, F(X = 90) = F(Z = 1.8333). From standard normal distribution table,
we get for Z = 1.8333. The area under the standard normal distribution curve
is 0.9666. Thus , P(X 90) = 1− P(X 90) = 1 − F(90) = 1 – 0.9666 = 0.0334