Probability Distribution
Probability Distribution
PROBABILITY DISTRIBUTIONS
Introduction
While dealing with random variables and their probabilities it is often found that there exists a
functional relationship between the value taken by the random variable and the corresponding
probability. This initiates to express the relation between random variables and their probabilities whit
the help of mathematical functions. These functions are called as probability distributions. Depending
on the nature of the random variable distributions can de either discrete or continuous. If the random
variable X takes discrete values only, then its probability distribution is called a discrete probability
distribution or probability mass function (pmf). However if the random variable X, is such that it can
take any value within a given interval them the corresponding distribution is called as continuous
probability distribution or probability density function (pdf). Binomial distribution, Poisson
distribution, geometric distribution and negative binomial distribution are some examples of discrete
random variable. Examples of continuous distribution are normal distribution, beta distribution,
gamma distribution etc.
Bernoullian Trial
A particular trial having only two outcomes either success or failure in which the probability
of success is constant is called as Bernoullian trial. The sample space of a Bernoullian trial has only
two sample points, S = {success, failure}. Some examples of Bernoullian trials are:
(a) A toss of a single coin (head or tail)
(b) The throw of a die (even or odd)
(c) Result of a student in an examination (pass or fail)
(d) The selection of items produced in an industry (defective or nondefective)
Bernoulli’s Distribution
Let us define a random variable X, which represents the result of a Bernoullian trial. Thus, X
takes only two values.
Let, X = 1, if the result of the trial is a success.
= 0, if the trial results to failure.
Let p be the probability of success. So, we have, X takes the value 1 with probability p and 0 with
probability q. Accordingly, we have,
X 0 1
P(X = x) q p
=p
1 1
Now, E(X2) = x 2 P( X x) x 2 p x (1 p)1 x 1 p1 (1 p)11 p
i 0 i 0
V(X) = E(X ) [E(X)] = p p = p(1p)
2 2 2
Binomial distribution was discovered by James Bernoulli (16541705) in the year 1700. But
it was published in the year 1713.
Let a random experiment having only two outcomes either success or failure, be performed a
number of times (n, say) under identical conditions. Let X be the random variable that represents the
number of successes in n trials, with ‘p’ the probability of success which remains constant for each
time the random experiment is performed. Thus, ‘q =1-p’ is the probability of failure in any trial.
Under the above conditions the random variable X is said to follow binomial distribution if its
probability mass function is given by
So, we have
E(X) = np
(2)
Variance = E(X2) (E(X))2
Now,
n n n
E(X2) = x 2 P( X x) x 2 n C x p x q n x [ x( x 1) x]n C x p x q n x
x 0 x 0 x 0
n n n
n!
= x( x 1) n C x p x q n x x nC x p x q n x x( x 1)
x 0 x 0 x 0 x!(n x)!
p x q n x np [using (1)]
n
n(n 1) (n 2)!
= x( x 1) x( x 1) ( x 2)!(n x)! p
x 0
x
q n x np
n
(n 2)!
= n( n 1) p 2 ( x 2)!(n x)! p
x2
x2
q n x np n(n 1) p 2 ( p q ) 2 np
Recurrence Relation
The recurrence relation of the probabilities of binomial distribution is given by the following
expression:
nx p
P( X x 1) P( X x)
x 1 q
We know that, the binomial probability is given by
P(X=x) = nCx px qn x where x = 0,1,2,…,n and p + q =1
Now,
n!
x 1 n x 1
P( X x 1) n
C x 1 p q ( x 1)!(n x 1)! p
x n x
P( X x) n
Cx p q n! q
x!(n x)!
P( X x 1) x!(n x)! p x!(n x) (n x 1)! p
P( X x) ( x 1)!(n x 1)! q ( x 1) x!(n x 1)! q
P( X x 1) nx p
P( X x) ( x 1) q
Thus,
nx p
P ( X= x+1) = P( X = x )
( x 1) q
Note
The recurrence relation is used to calculate the probability of P(X=x+1) if the value of the parameters
of the distribution and P(X = x) is known.
Illustration 1: Five fair coins are tossed. Find the probability of (i) Exactly three heads. (ii) Atleast
three heads.
= P[X = 3] + P[ X = 4] + P[ X = 5]
= 5
C3 ( ½ )3 (1 - ½)5 3 + 5C4 ( ½ )4 (1 - ½)5 4 + 5C5 ( ½ )5 (1 - ½)5 5
1 . 16 .
= 54 1.( 1)2 + 5 1 . 1 + 1 . = 10 . + 5 . + = = 1.
2 8 2 16 2 32 32 32 32 32 2
Illustration 2: The mean of a binomial distribution is 6 and the standard deviation is given by (3/2).
Find the distribution.
Solution: We, know that for a binomial distribution with parameters n and p. We have mean = np and
Variance = npq
Also, Variance = (Standard Deviation)2 = 3/2
Thus, np = 6 and npq = 3/2
Now, q = npq/np = (3/2)/6 = ¼
p = 1 – q = 1- ¼ = ¾
Also, np = 6
n¾=6
n=8
So, the required distribution is
P(X=x) = 8Cx ( ¾)x ( ¼ ) n x where x = 0,1,2,…,n
Illustration 3: A random variable X follows binomial distribution with mean 5/3 and P( X=2 ) = P (X
= 1). Find variance, P(X = at least 1) and P(X = at most 1).
Illustration 4: The chances of a bomb hitting a target to that it will not are 3:2. Find the probability
that the target will hit atleast once in five shots.
Solution: Let X be a random variable which represents the number of shots required to hit a target out
of five shots.
By the question we have, n = 5, p = 3/5 and q = 1– p = 1 – (3/5) = 2/5
Now,
P( The target is hit atleast once in 5 shots) = P (X 1)
= 1 – P( X 0) = 1 – P(X = 0) [ Since, X cannot be less than zero
= 1 – 5C0 (3/5)0 (2/5)5 = 1- (2/5)5 = 3093/3125
Poisson Distribution
Poisson distribution was discovered by Simeon Denis Poisson (17811840) and it was
published in the year 1837. A random variable X is said to follow Poisson distribution if it assumes
only nonnegative values and if its probability mass function is given by
e x
P(X = x) = , x = 0,1,2,…, and >
x!
Let X be a random variable which follows binomial distribution with parameters n and p, so we have,
P(X=x) = nCx px qn x
Now,
n(n 1)(n 2)...( n x 1) x n x
Lt P(X=x) = Lt nCx px qn x = Lt p q
n n n x!
1 2 x 1
n x (1 )(1 )...(1 )
= Lt n n n px (1p) n x
n x!
1 2 x 1
(1 )(1 )...(1 ) n x
n n n x np
= Lt (np) 1
n x! n
n x
1 1 2 x 1 (
Lt 1 1 ...1 1
x! n n n n n
e x
e 1 e 2 e 3
Mean = E(X) = xP[ X x] x
x 0 x 0 x!
0 1
1!
2
2!
3
3!
...
2 3 2
= e[ + ...] = e [ 1+ ... ] = ee = (1)
1! 2! 1! 2!
Now,
e x
e x
E(X ) = x P[ X x] x
2 2 2
[ x( x 1) x]
x 0 x 0 x! x 0 x!
x x
e
e
e x
= x( x 1) x x( x 1) [Using (1)]
x 0 x! x 0 x! x 0 x( x 1) ( x 2)!
x 2
= e 2 ( x 2)! e e
x 2
2 2
(2)
Thus,
variance = E(X2) [E(X)]2 = E(X2) 2 Using (i)
= + =
2 2
Using (ii)
Thus, the mean and variance of a Poisson distribution are equal and their value is equal to .
e x 1
Thus, P(X = x+1) =
( x 1)!
So,
P( X x 1) e x 1 e x
P( X x) ( x 1)! x! x 1
P(X = x+1) = P( X = x) gives the required recurrence relation.
x 1
Properties of Poisson Distribution
1. The Poisson distribution is a discrete distribution where the random variable X takes the values
0,1,2,…
2. The Poisson distribution has one parameter i.e., .
3. The mean and variance of the Poisson distribution are equal that is, mean = variance = .
4. The standard deviation is equal to .
5. Skewness and kurtosis of Poisson distribution are 1/ and 1/ respectively.
6. The Poisson distribution may have either one or two modes.
7. The Poisson distribution is a positively skewed distribution as 1/ is always positive.
8. The distribution is said to be leptokurtic as 1/ is always positive.
9. The Poisson distribution may be obtained as a limiting case of Binomial distribution.
10. If X and Y are two independent Poisson variates with parameters 1 and 2 then X+Y is also a
Poisson variate with parameter 1 + 2.
Illustration 2: In a Poisson distribution we have the probability that X takes the value 0 is 0.1. Find
the mean of the distribution.
Solution: We know that the mean of the Poisson distribution is equal to its parameter .
Now,
So, putting x = 0 we have
P (X = 0 ) =
0.1 = e e = 10 = log e 10
Illustration 3: In a factory manufacturing blades it is found that on an average 2% of the blades are
defective. What is the probability that atmost 5 defective blades will be found in a box of 200 blades.
e 4 4 x 4x 4 4 4 2 43 4 4 45
5 5 0
4
= e 4
e
x 0 x! x 0 x! 0! 1! 2! 3! 4! 5!
= 0.0183 ( 1 + 4 + 8 + 10.6667+ 10.6667+ 8.5333)
= 0.785
Thus, required probability is 0.785
Illustration 4: Vijay Lodge, of Guwahati has three rooms only. The number of demands for a room is
Poisson distributed variate with mean 1.5. Calculate the proportion of days on which (i) neither room
is demanded (ii) some demand for rooms are refused because of non-availability of rooms.
Solution: Let X be the random variable which represents the number of demands made for rooms. So
the proportion of days on which there are x demands for rooms is given by,
e x e 1.5 1.5 x
P[ x demands for rooms ] = P[ X = x ] = = [since, mean = 1.5]
x! x!
e 1.5 1.5 0
P[ X = 0 ] = = e1.5 = 0.223
0!
The Normal distribution also called as the Gaussian distribution, is a continuous probability
distribution with two parameters and and is defined by the probability density function (p.d.f.)
1 x 2
1
2
P[ X = x ] = e
2
Here is the mean and is the standard deviation of the distribution. and e are two
mathematical constants having the approximate values 22/7 and 2.718 respectively.
The history of the distribution is very interesting. It was discovered by an English
Mathematician De-Moivre in 1733, used by Laplace, later in the year 1774 but the credit of the
distribution was wrongly attributed to Gauss, who first made the reference of the distribution in 1809
to study the errors in the measurement of Astronomy.
This is the most useful distribution in theoretical statistics because of its many important
characteristics. Most of the probability distributions of statistics whether discrete or continuous tends
to normal distribution especially when the number of observations are large. The probability curve of
normal distribution is known as Normal Curve. The curve is symmetrical about its mean (), bell-
shaped and the two tails extend to infinitely on either side.
If a random variable X is normally distributed with mean and standard deviation , then the
random variable Z = (X-)/ is called as the “standard normal variate” and the corresponding
distribution the standard normal distribution . It has the density function
z2
1
p( Z=z ) = e 2
where, - < z <
2
This is actually a special case of normal distribution with mean 0 and standard deviation 1.
As in all continuous probability distributions, the total area under the normal curve is 1. For a
continuous distribution we cannot calculate the probability at a point but we can calculate probability
with in a range. The probability that X lies between c and d, denoted by P(c X d), is given by the
area under the curve between the vertical lines at c and d. This is also equal to the area under
‘Standard normal curve’ between the vertical lines at the standardized values of c and d; i.e.
P(c X d) = Area under ‘standard normal curve’ between the vertical lines at c and d.
Where c = (c— )/ and d’ = (d — )/. Extensive tables showing the areas under standard normal
curve are available in this book (see Statistical Tables).
The cumulative distribution function (c.d.f.) of standard normal distribution, viz.
(z) = Probability that the standard normal variable takes a value less than or equal to z.
= Area under ‘standard normal curve’ to the left of the ordinate at z.
Mathematically,
z z2
1
(z) =
2
e 2
dz
The values of (z) are given in statistical tables only for positive values of z. For, negative values the
relation
( z) = 1 (z)
is used to find the area, because of the symmetry of standard normal curve about 0.
Probability Integral of Standard Normal Distribution
Normal distribution plays a very important role in statistical theory and its application becomes useful
for the following reasons:
1. Most of the distribution occurring in practice, for example binomial, Poisson, hypergeometric
distribution are approximated by Normal Distribution.
2. Even if a variable is not normally distributed it can sometimes be brought to normal form by
simple transformation of variable.
3. Many of the distributions of sample statistics tend to normality and as such they can be best
studied with the help of normal curve.
4. The proof of all the test of significance in sampling are based upon the fundamental assumption
that the population from which the sample has been drawn is normal.
5. The theory of normal curve can be applied to the graduation of the curves, which are not normal.
6. Normal distribution is extensively used in statistical quality control.
Note
1. If X follows binomial distribution with parameters n and p then the conditions under which
binomial distribution tends to normal distribution are:
(i) The number of trials (n) is infinitely large, i.e. n
(ii) Neither p nor q is very small.
2. If X follows Poisson distribution with parameter then the conditions under which Poisson
distribution tends to normal distribution is that the mean of the distribution i.e. .
SOLVED ILLUSTRATIONS (NORMAL DISTRIBUTION)
Illustration 1: The mean weight of 500 male students at a certain college is 151 lbs. and the standard
deviation is 15 lbs. Assuming that the weights are normally distributed, find how many students
weight (i) between 120 and 155 lbs., (ii) more than 155 lbs.
[Given (0.27) = 0.6064 and (2.07) = 0.9808, where (t) denotes the area under standard normal
curve to the left of the ordinate at t.]
Solution: The mean and the standard deviation are = 151 lbs., = l5 lbs.
(i) Proportion of students whose weights lie between 120 & 155 lbs. = Area under standard normal
curve between the vertical lines at the standardized values,
viz. z = (120 151)/15 = 2.07 and z = (155 151)/15 = 0.27.
P(120 x 155)
= (0.27) ( 2.07)
= (0.27) [1 (2.07)]
= 0.6064 1 + 0.9808 = 0.5872 [see Figure].
Solution: Here, the mean and the standard deviation are 30 and 5 respectively.
(i) When X = 26 we have Z = (X - )/ = ( 26 – 30)/ 5 = -0.8
and when X = 40 we have Z = (X - )/ = ( 40 – 30)/ 5 = 2
P (26 X 40)
= P (-0.8 X 2)
= P (-0.8 X 0) + P (0 X 2)
= P (0 X 0.8) + P (0 X 2)
[ from symmetry
= 0.2881+ 0.4772
[ from table
= 0. 7653
Illustration 3. In a distribution exactly normal, 7% of the items are under 35 and 89% are under 63.
What are the mean and Standard deviation of the distribution?
Solution: Let X be a random variable following normal distribution with mean and standard
deviation . Let Z = (X )/.
By, the question we have,
P[ X 35 ] = 0.07 and P[ X 63 ] = 0.89,
Now, P[ X 35 ] = P[(X )/ (35 )/]
= P[ Z (35 )/] = [(35 )/]
Thus,
[(35 )/] = 0.07
= 1 (1.48) [ value obtained from table.
= (1.48)
Thus, (35 )/ = 1.48
(1)
Similarly,
P[ X 63 ] = P[(X )/ (63 )/]
= P[ Z (63 )/] = [(63 )/]
Thus,
[(63 )/] = 0.89
= (1.23) [ value obtained from table.
2.7128
10.33
Putting the value of 10.33 in (1) we have
+ = 35 + 15.2884 = 50.2884
Thus, for the random variable X the mean is 50.2884 and standard deviation is 10.33.
Illustration 4: There are six hundred Economics students in the postgraduate classes of a university
and the probability for any student to need a copy of a particular book from the university library on
any day is 005. How many copies of the book should be kept in the university library so that the
probability may be greater than 090 that none of the students needing a copy from the library has to
come back disappointed? (Use normal approximation to the binomial distribution).
Formulae
1. Explain the concept of probability distributions. How does probability mass function differ from
probability density function?
2. Let X be a random variable with p.m.f. f(x) = (1 / 32) 5C2, where x = 0, 1, 2, ... 5. Find the mean
and standard deviation of the random variable.
[ Ans: 5/2 and 5/2]
3. Define Binomia1 distribution and state the conditions under which the distribution holds.
4. Derive the mean and variance of binomial distribution.
5. Obtain the recurrence relation for the probabilities of binomial distribution.
6. Four coins are tossed simultaneously. What is the probability of getting 2 heads and 2 tails?
[ Ans: 3/8 ]
7. Find the probability that in a family of 5 children there will be (i) at least one boy, (ii) at least one
boy and one girl. (Assume that the probability of a female birth is 1/2). [ Ans: (i) 31/32 (ii) 15/16]
8. In a shooting competition, the probability of a man hitting a target is 1/5. If he fires 5 times, what
is the probability of hitting the target at least twice? [ Ans: 821/3125]
9. Assume that on the average 30% of the candidates appearing in an examination from a certain
college get First Division. What is the probability that out of a group of 4 such candidates not more
than two will fail to get a First Division? [ Ans: 0.3483]
10. For a binomial distribution, the mean is 3 and the variance is 2. Find the values of n and p. Hence
find the probability that X (the variable value) is 5. [Ans: 9, 1/3, 224/2187]
11. For a binomial distribution, the mean and S.D. are respectively 4 and calculate the probability of
getting a nonzero value from this distribution. . [ Ans: 1 (0.75)10]
12. Write down the expressions which define Binomial, Poisson and Normal probability distributions.
Give 3 physical situations illustrating a Poisson random variable.
13. Derive the mean and variance of Poisson distribution.
14. State and prove the recurrence relation of probabilities of a Poisson variate.
15. State the conditions under which binomial distribution tends to Poisson distribution. Derive the
same.
16. State some examples of binomial distribution and Poisson distribution.
17. A random variable x follows Poisson distribution with parameter m = 2. Find the probabilities
P(x = 1), P(x 1), P(x < 1), P(x > 1), P(1 x 3). Given e-2 = 0.1353.
[ Ans: 0.2706, 0.4059, 0.1353, 0.5941, 0.7216]
18. The standard deviation of a Poisson distribution is 2. Find the probability that x =3. (Given e4 =
.0183).
[ Ans: 0.1952]
19. Is it possible that a Poisson distribution has the same mean and standard deviation? If so, what is
the probability that the variable takes the value zero? [ Ans: yes, e1]
20. For a Poisson distribution, Pr(x=0) Pr(x= 1). Find Pr(x>0). [Ans: 1 e1]
21. A discrete random variable x follows Poisson distribution such that P(x= 1) = P(x= 2). Find the
mean and variance of the distribution. [Ans: 2, 2]
22. The probability that a Poisson variate X takes a positive value is (1e2). Find the (i) Mean, (ii)
Mode, (iii) probability that X lies between 1 and 1.5. [ Ans: 2,1 and 3e2
23. If 3% of the bolts manufactured by a company are defective, what is the probability that in a
sample of 200 bolts, 5 will be defective? (Given e6 = 0.00248).
[ Ans: 0.16
24. Suppose that the number of telephone calls an operator receives from 11.00 am. to 11.05 a.m.
follows a Poisson distribution with m = 3. (i) Find the probability that the operator will receive no
calls in that time interval to-morrow. (ii) Find the probability that in the next 3 days the operator
will receive a total of 1 call in that time interval. (e = 2.7 18). [ Ans: 0.05, 0.0011]
25. The average number of misprints per page of a book is 2. Assuming Poisson distribution, what is
the probability that a particular page is free from misprints? if the book contains 1000 pages, how
many of the pages contain more than 2 misprints’? [ Ans: e2, 1000(1 5e2)]
26. State the importance of Normal distribution in statistics.
27. Explain some of the features of normal distribution.
28. Find the areas under the normal curve (i.e., the probabilities) in the following cases using table:
(i) between z = 0 and z = 1.8;
(ii) between z = 0.78 and z = 0;
(iii) between z = 0.85 and z= 2.15;
where z is a standard normal variate. [Ans: (i) 0.4641, (ii) 0.2823, (iii) 0.1819]
29. The mean height of 1000 students at a certain college is 165 cms. and SD is 10 cms. Assuming that
the height distribution is normal, find the number of students whose heights are (i) less than 172
cms; (ii) between 159 and 178 cms; and (iii) more than 173.2 cms.
[Ans: (i) 258; (ii) 629; (iii) 2061]
30. The mean of a normal distribution is 60 and 6% of the values are greater than 70. Find the
standard deviation of this distribution. (Given that the area under the standard normal curve
between z = 0 and z = 1.56 is 0.44 or z = to 1.56 = 0.94] [ Ans. 6.41]
@#@#@#@#@#@#@#@