Block 4
Block 4
13.1 INTRODUCTION
In Units 9 to 12, we have studied standard discrete distributions. From this unit
onwards, we are going to discuss standard continuous univariate distributions.
This unit and the next unit deal with normal distribution. Normal distribution
has wide spread applications. It is being used in almost all data-based research
in the field of agriculture, trade, business, industry and the society. For
instance, normal distribution is a good approximation to the distribution of
heights of randomly selected large number of students studying at the same
level in a university.
The normal distribution has a unique position in probability theory, and it can
be used as approximation to most of the other distributions. Discrete
distributions occurring in practice including binomial, Poisson,
hypergeometric, etc. already studied in the previous block (Block 3) can also
be approximated by normal distribution. You will notice in the subsequent
courses that theory of estimation of population parameters and testing of
hypotheses on the basis of sample statistics have also been developed using
the concept of normal distribution as most of the sampling distributions tend to
normality for large samples. Therefore, study of normal distribution is very
important.
Due to various properties and applications of the normal distribution, we have
covered it in two units – Units 13 and 14. In the present unit, normal
distribution is introduced and explained in Sec. 13.2. Chief characteristics of
normal distribution are discussed in Sec. 13.3. Secs. 13.4, 13.5 and 13.6
describes the moments, mode, median and mean deviation about mean of the
distribution.
Objectives
After studing this unit, you would be able to:
introduce and explain the normal distribution;
5
Continuous Probability
Distributions know the conditions under which binomial and Poisson distributions
tend to normal distribution;
state various characteristics of the normal distribution;
compute the moments, mode, median and mean deviation about mean
of the distribution; and
solve various practical problems based on the above properties of
normal distribution.
q p 1 2p
1 1 , and
npq npq
1 6pq
2 2 3 .
npq
6
From the above results, it may be noticed that if n , then moment Normal Distribution
coefficient of skewness ( 1 ) 0 and the moment coefficient of kurtosis i.e.
2 3 or 2 0 . Hence, as n , the distribution becomes symmetrical
and the curve of the distribution becomes mesokurtic, which is the main
feature of normal distribution.
Normal Distribution as a Limiting Case of Poisson Distribution
You have already studied in Unit 10 of this course that Poisson distribution is a
limiting case of binomial distribution under the following conditions:
i) n, the number of trials is indefinitely large i.e. n
ii) p, the constant probability of success for each trial is very small i.e. p 0.
iii) np is a finite quantity say ‘’.
As we have discussed above that there is a relation between the binomial and
normal distributions. It can, in fact, be shown that the Poisson distribution
approaches a normal distribution with standardized variable given by
X
Z as λ increases indefinitely.
For Poisson distribution, you have already studied in Unit 10 of the course that
32 2 1 1
1 3
3 1 1 ; and
2
4 3 2 1 1
2 2
2 3 2 2 3 .
2
Like binomial distribution, here in case of Poisson distribution also it may be
noticed from the above results that the moment coefficient of skewness
( 1 ) 0 and the moment coefficient of kurtosis i.e. 2 3 or 2 0 as
λ . Hence, as λ , the distribution becomes symmetrical and the curve
of the distribution becomes mesokurtic, which is the main feature of normal
distribution.
Under the conditions discussed above, a random variable following a binomial
distribution or following a Poisson distribution approaches to follow normal
distribution, which is defined as follows:
Definition: A continuous random variable X is said to follow normal
distribution with parameters ( ) and 2(>0) if it takes on any real
value and its probability density function is given by
2
1 x
1
f x e 2
, x ;
2
which may also be written as
1 1 x 2
= exp , x .
2 2
7
Continuous Probability
Distributions Remark
i) The probability function represented by f x may also be written as
f x; , 2 .
ii) If a random variable X follows normal distribution with mean and
variance 2, then we may write, “X is distributed to N(, 2)” and is
expressed as X N(, 2).
iii) No continuous probability function and hence the normal distribution
can be used to obtain the probability of occurrence of a particular value
of the random variable. This is because such probability is very small,
so instead of specifying the probability of taking a particular value by
the random variable, we specify the probability of its lying within
interval. For detail discussion on the concept, Sec. 5.4 of Unit 5 may be
referred to.
X
iv) If X ~ N , 2 , then Z is standard normal variate having
mean ‘0’ and variance ‘1’. The values of mean and variance of standard
normal variate are obtained as under, for which properties of
expectation and variance are used (see Unit 8 of this course).
X 1
Mean of Z i.e. E Z E = E X
1
E X
1
= 0 [ E(X) = Mean of X = ]
X
Variance of Z i.e. V(Z) = V
1 1
V X 2 V X
2
1 2
2
[ variance of X is 2]
= 1.
X
v) The probability density function of standard normal variate Z
1 12 z2
is given by z e , z .
2
8
vi) The graph of the normal probability function f x with respect to x is Normal Distribution
famous ‘bell-shaped’ curve. The top of the bell is directly above the
mean . For large value of , the curve tends to flatten out and for
small values of , it has a sharp peak as shown in (Fig. 13.1):
Fig. 13.1
μ = 40, σ 2 = 25 σ = ± 25
5 0always
Now, the p.d.f. of random variable X is given by
2
1 x
1
2 σ
f(x) = e
σ 2π
2
1 x 40
1
2 5
= e , x
5 2π
9
Continuous Probability
Distributions
(ii) Here we are given X ~ N ( 36, 20).
in usual notations, we have
μ 36, σ 2 20 σ 20
Now, the p.d.f. of random variable X is given by
2
1 x µ
1
2 σ
f(x) = e
σ 2π
2
1 x ( 36) 1
1 1 (x +36) 2
2 20
= e = e 40
20 2π 40π
1
1 (x +36) 2
= e 40 , x
2 10π
(iii) Here we are given X ~ N (0, 2).
in usual notations, we have
μ 0, σ 2 2 σ 2
Now, the p.d.f. of random variable X is given by
2 2
1 x µ 1 x 0
1
2 σ
1
2 2
f(x) = e = e
σ 2π 2 2π
1
1 x2
4
e , x
2 π
Example 2: Below, in each case, there is given the p.d.f. of a normally
distributed random variable. Obtain the parameters (mean and variance)
of the variable.
2
1 x 46
1
2 6
(i) f(x) = e , x
6 2π
1
1 (x 60) 2
(ii) f(x) = e 32 , x
4 2π
2
1 x 46
1
2 6
Solution: (i) f(x) = e , x
6 2π
Comparing it with,
2
1 x µ
1
2 σ
f(x) = e
σ 2π
we have
µ 46, 6
10
1
1 (x 60) 2 Normal Distribution
(ii) f(x) = e 32 , x
4 2π
2
1 X 60
1
2 4
= e
4 2π
Comparing it with,
2
1 x µ
1
2 σ
f(x) = e
σ 2π
we get
µ = 60, 4
11
Continuous Probability
Distributions iv) f x , being the probability, can never be negative and hence no portion
of the curve lies below x-axis.
v) Though x-axis becomes closer and closer to the normal curve as the
magnitude of the value of x goes towards or , yet it never touches it.
µ 32
β1 = 3
0, 2 24 3
µ2 2
i.e. the distribution is symmetrical and curve is always mesokurtic.
Note: Not only µ1 and µ 3 are 0 but all the odd order central moments
are zero for a normal distribution.
P X f x dx 0.6827,
1
Or P 1 Z 1 z dz = 0.6827,
1
2
P 2 X 2 f x dx 0.9544,
2
2
Or P 2 Z 2 z dz = 0.9544, and
2
3
P 3 X 3 f x dx 0.9973.
3
3
Or P 3 Z 3 z dz = 0.9973.
3
This property and its applications will be discussed in detail in Unit 14.
Let us now establish some of these properties.
n .
2 x 31 x
e.g. x e dx x e dx 3
0 0
1
1/ 2 x
1
x 1
and 0 x e dx 0 x 2 e dx 2
Some properties of the gamma function are
i) If n > 1, n n 1 n 1
ii) If n is a positive integer, then n n 1
1
iii) .
2
13
Continuous Probability
Distributions
Now, the first four central moments of normal distribution are obtained as
follows:
First Order Central Moment
As first order central moment (1) of any distribution is always zero [see Unit
3 of MST-002], therefore, first order central moment (1) of normal
distribution = 0.
Second Order Central Moment
2
2 = x f x dx
[See Unit 8 of MST-003]
2
1 x
2 1
x
2
. e dx
2
x
Put z x z
Differentiating
dx
dz
dx dz
Also, when x , we have z and
and when x , z
1
2 1 z2
2 z
2
e 2 dz
1
2 2
z2
2
ze dz
2
2
z
2 2 2
2
z e dz
2 0
z2
2
on changing z to – z, the integrand i.e. z e does not get changed 2
f z dz 2 f z dz if f z is even function of z
0
1
1 12
Now, put z 2 2t z 2 t 2 t 2 dz 2 t dt
2
14
1 Normal Distribution
2 1 2
2 2 (2t)e t 1
dt 2 2 t 2 e t dt
0 0
2t 2
2 2 32 1 t
t e dt
0
2 2 3
[By def. of gamma function]
2
2 2 1 1
[By Property (i) of gamma function]
2 2
2
[By Property (iii) of gamma function]
2
Third Order Central Moment
3
3 = x f x dx
2
1 x
3 1
x
2
e dx
2
x
Put z x z dx dz
and hence
3 1 12 z 2
3
z .
2
e dz
1
1 z2
3 z 3e 2
dz
2
1 1 1
z2 z2 z2
3 3 3
Now, as integrand z e 2 changes to z e 2 on changing z to – z i.e. z e 2
is an odd function of z.
Therefore, using the following property of definite integral:
a
we have,
1
3 3 0 = 0
2
15
Continuous Probability
Distributions Fourth Order Central Moment
2
1 x
4 4 1
x f x dx x
2
4 = e dx
2
x
Putting z
dx dz
4 1 12 z 2
4 z
2
e dz
1 1
4 z2 2 4 z2
z 4e 2
dz z 4
e 2
dz
2 2 0
z2
4
integrand z . e does not get changed on changing z to – z and hence it is
2
z2
Put t z2 = 2t
2
2zdz = 2dt
z dz = dt
dt dt
dz =
z 2t
2 4 1
4 (2t) 2 e t dt
2 0 2t
24 .4 2 t 1 4 4 32 t
= t e dt t e dt
2 2 0 t 0
4 4 52 1 t
t e dt
0
4 4 5
[By definition of gamma function]
2
4 4 3 3
[By Property (i) of gamma function]
2 2
4 4 3 1 1
[By Property (i) of gamma function]
22 2
3 4 1
[Using (Property (iii) of gamma function)]
2
3 4
16
Thus, the first four central moments of normal distribution are Normal Distribution
1 0, 2 2 , 3 0, 4 3 4 .
32 4 34 3 4
1 = 0, 2 = 3
23 22 2
2
4
Therefore, moment coefficient of skewness ( 1 ) 0
Now, let us obtain the mode and median for normal distribution in the next
section.
(x )
f(x) 0
2
x 0 as f (x) 0
17
Continuous Probability x
Distributions
Now differentiating (2) w.r.t. x, we have
x 1
f (x) f '(x) f (x)
2
f () f ( )
f (x) at x 0 2
2 0
x = is point where function has a maximum value.
Mode of X is .
Median
Let M denote the median of the normally distributed random variable X.
We know that median divides the distribution into two equal parts
M
1
f (x)dx f (x)dx
M
2
M
1
f (x)dx 2
2
µ 1 x M
1
1
e 2 dx f (x)dx
σ 2π
2
x
In the first integral, let us put z
Therefore, dx dz
Also when x z 0 , and
when x z .
Thus, we have
0 M
1 12 z 2 1
e dz f (x)dx
2
2
0 1 M
1 z2 1
e 2 dz f (x)dx
2
2
1 12 z2
M Z is s.n.v.with p.d.f. (z) e
1 1 2
+ f (x) dx
2 2 0
1
So (z)dz 1 (z)dz
2
M
f(x)dx = 0
µ
M as f (x) 0
18
Median of X Normal Distribution
2
1 x
1
2
x e dx
2
x
Put z x z
dx
dz
1
1 z2
M.D. about mean = | z | e 2 dz
2
1
z2
2
| z |e dz
2
1
z2
Now, | z | e 2 (integrand) is an even function z as it does not get changed on
changing z to –z, by the property,
a a
“ f (x)dx 2 f (x)dx, if f x is an even function of x ”, we have
a 0
1
z2
M.D. about mean = 2 z e 2 dz
2 0
Now, as the range of z is from 0 to ∞ i.e. z takes non-negative values,
z = z and hence
1
2 z2
2
M.D. about mean ze dz
2 0
z2
Put t z 2 2t 2zdz = 2dt zdz = dt
2
2 e t 2 2
M.D. about mean = e t
dt 2 0 1 =
2 0 1 0
19
Continuous Probability
Distributions 2
In practice, instead of , its approximate value is mostly used and that is
4
.
5
2 2 7 7 4
0.6364 0.7977 0.08 or (approx.)
22 11 5
Let us now take up some problems based on properties of Normal Distribution
in the next section.
X1 ~ N 1 , 12 and X 2 ~ N 2 , 22
then
X1 X 2 ~ N 1 2 , 12 22 , and
X1 X 2 ~ N 1 2 , 12 22 [See Property xiii (Section 13.3)]
i) X1 X 2 ~ N 0 0, 1 1
i.e. X1 X 2 ~ N 0, 2 , and
ii) X1 X 2 ~ N 0 0, 1 1
i.e. X1 X 2 ~ N 0, 2
2 2 2
Mean deviation about mean = = .5 = 5
Solution: Here = 0, 2 = 1 σ = 1.
first four central moments are:
1 0, 2 2 1, 3 0, 4 3 4 3.
Mean of X1 = E(X1 ) 40
Variance of X1 Var(X1 ) 25
Mean of X 2 = E(X 2 ) 60
Variance of X 2 V ar(X 2 ) 36
Now,
(i) Mean of X = E(X) E(2X1 3X 2 ) E(2X1 ) E(3X 2 )
4Var(X1 ) + 9Var(X 2 )
E(3X1 ) E( 2X 2 )
= 3E(X1 ) + ( 2)E(X 2 )
21
Continuous Probability
Distributions 13.8 SUMMARY
The following main points have been covered in this unit:
1) A continuous random variable X is said to follow normal distribution with
parameters ( ) and 2(>0) if it takes on any real value and its
probability density function is given by
2
1 x
1
f x e 2
, x
2
X
2) If X
N , 2 , then Z
is standard normal variate.
13.9 SOLUTIONS/ANSWERS
1 4
E 1) (i) Here we are given X ~ N ,
2 9
in usual notations, we have
1 4 2
, 2
2 9 3
Now, p.d.f. of r.v. X is given by
2
1 x
1
2
f(x) = e , x
2
2
1 x 1/2
1
2 2/3
= e
2
2π
3
22
2
9 2x 1 Normal Distribution
3
2 4
= e , x
2 2π
(ii) Here we are given X ~ N(40, 16)
in usual notations, we have
40, 2 16 4
Now, p.d.f. of r.v. X is given by
2
1 x
1
2
f(x) = e , x
σ 2π
2
1 x ( 40)
1
2 4
= e
4 2π
2
1 x + 40
1
2 4
= e , x
4 2π
x2
1
E 2) (i) f(x) = e 8, x
2 2π
2
1x
1
= e 24
2 2π
2
1 x 0
1
2 2
= e ...(1) , x
2 2π
Comparing (1) with,
2
1 x
1
2
f(x) = e , x
σ 2π
we get
0, 2
2, 2
30 = x 5 x 35 Mean = 35
Given that fourth moment about 35 is 768. But mean is 35, and hence the
fourth moment about mean = 768.
4 = 768
34 = 768
768
4 = 4 3 4
3
4 = 256 = (4)4 = 4.
24
Area Property of
UNIT 14 AREA PROPERTY OF NORMAL Normal Distribution
DISTRIBUTION
Structure
14.1 Introduction
Objectives
14.1 INTRODUCTION
In Unit 13, you have studied normal distribution and its chief characteristics.
Some characteristics including moments, mode, median, mean deviation about
mean have been established too in Unit 13. The area property of normal
distribution has just been touched in the preceding unit. Area property is very
important property and has lot of applications and hence it needs to be studied in
detail. Hence, in the Unit 14 this property with its diversified applications has
been discussed in detail. Fitting of normal distribution to the observed data and
computation of expected frequencies have also been discussed in one of the
sections i.e. Sec. 14.3 of this unit.
Objectives
After studing this unit, you would be able to:
describe the importance of area property of normal distribution;
explain use of the area property to solve many practical life problems; and
fit a normal distribution to the observed data and compute the expected
frequencies using area property.
25
Continuous Probability
Distributions
z1 z
1 12 Z2 1
e dz z dz
0 2 0
1 12 z 2
where z e is the probability density function of standard normal
2
z1 z
1 12 z2 1
variate and the definite integral e dz i.e. z dz represents the area
0 2 0
under standard normal curve between the ordinates at Z = 0 and Z = z1. (Fig.
14.2).
You need not to evaluate the integral to find the area. Table is available to find
such area for different values of z1.
Here, we have transformed the integral from
2
x1 1 x z1
1 2
1 12 z2
2 e dx to
0 2
e dz
26
i.e. we have transformed normal variate ‘X’ to standard normal variate (S.N.V.) Area Property of
X Normal Distribution
Z .
This is because, the computation of
2
x1 1 x
1 2
e dx requires construction of separate tables for different values of
2
and as the normal variate X may have any values of mean and standard
deviation and hence different tables are required for different and . So,
infinitely many tables are required to be constructed which is impossible. But
beauty of standard normal variate is that its mean is always ‘0’ and standard
deviation is always ‘1’ as shown in Unit 13. So, whatever the values of mean
and standard deviation of a normal variate be, the mean and standard deviation
on transforming it to the standard normal variate are always ‘0’ and ‘1’
respectively and hence only one table is required.
In particular,
P X f x dx [See Fig.14.3]
X
1 Z when X , Z
1
P 1 Z 1 z dz
1 when X , Z 1
1
2 z dz [By Symmetry]
0
P 2 X 2 f x dx See Fig.14.4
2
27
Continuous Probability
Distributions X
for Z , we have
2 2
P 2 Z 2 z dx 2 z dx Z 2 whenX 2
2 0 and Z 2 when X 2
From the table given in the
= 2 0.4772 Appendix at the end of the unit
= 0.9544
= 2 0.49865 = 0.9973
P[X lies within the range 3] = 0.9973
P[X lies outside the range 3] = 1 – 0.9973 = 0.0027
which is very small and hence usually we expect a normal variate to lie within
the range from – 3 to 3, though, theoretically it ranges from – to .
45, 2 16 16 4 0 always
X X 45
Now Z
4
45 45 0
(i) When X = 45, Z 0
4 4
53 45 8
(ii) When X = 53, Z= 2
4 4
41 45 4
(iii) When X = 41, Z 1
4 4
47 45 2
(iv) When X = 47, Z 0.5
4 4
Example 2: If the r.v. X is normally distributed with mean 80 and standard
deviation 5, then find
(i) P X 95 , (ii) P X 72 , (iii) P 60.5 X 90 ,
Solution: Here we are given that X is normally distributed with mean 80 and
standard deviation (S.D.) 5.
i.e. Mean = 80 and var iance 2 (S.D.) 2 25.
X X 80
If Z is the S.N.V., then Z
5
Now
95 80 15
(i) X = 95, Z = 3
5 5
P X 95 P Z 3 [See Fig.14.6]
= 0.5 P 0 Z 3
60.5 80 19.5
(iii) X = 60.5, Z 3.9
5 5
90 80 10
X = 90, Z 2
5 5
P 60.5 X 90 P 3.9 X 2 [See Fig.14.8]
= P 3.9 X 0 P 0 Z 2
normal curve is
= P 0 X 3.9 P 0 Z 2 symmetrical about
the line Z 0
30
Area Property of
Normal Distribution
= P 0 Z 3.4 P 0 Z 1
64 80 16
(v) X = 64, Z 3.2
5 5
76 80 4
X = 76, Z 0.8
5 5
P 64 X 76 P 3.2 Z 0.8 [See Fig.14.10]
31
Continuous Probability
Distributions
(b) What is the lowest weight of the 100 heaviest male students?
(Assuming that the weights are normally distributed)
Solution: Let X be a normal variate, “The weights of the male students of the
university”. Here, we are given that µ = 60 kg, σ = 16 kg, therefore,
X ~ N(60, 256).
We know that if X ~ N(µ, σ2), then the standard normal variate is given by
X
Z .
X 60
Hence, for the given information, Z
16
55 60
(a) i) For X = 55, Z 0.3125 0.31 .
16
Therefore,
P[X < 55] = P [Z < 0.31] = P [Z > 0.31] [See Fig. 14.11]
area on both
= 0.5 P [0 < Z < 0.31] sides of Z 0 is 0.5
Using table area
= 0.5 0.1217 under normal curve
= 0.3783
Number of male students having weight more than 70 kg = N P[X > 70]
= 1000 0.2643
= 264
45 60
iii) For X 45, Z 0.9375 0.94
16
65 60
For X 65, Z 0.3125 0.31
16
P 45 X 65 P 0.94 Z 0.31 [See Fig. 14.13]
33
Continuous Probability
Distributions
x1 60
Now, for X x1 , Z z1 (say) .
16
100
P [X x1 ] 0.1 [See Fig.14.14]
1000
P[Z z1 ] 0.1
Therefore, the lowest weight of 100 heaviest male students is 80.48 kg.
Example 4: In a normal distribution 10% of the items are over 125 and 35% are under
60. Find the mean and standard deviation of the distribution.
Solution:
Fig. 14.15: Area Representing the Items under 60 and over 125
Let X ~ N(µ, σ2), where µ and σ2 are unknown and are to be obtained.
34
Here we are given Area Property of
Normal Distribution
P[X > 125] = 0.1 and P[X < 60] = 0.35. [See Fig. 14.15]
X
We know that if X ~ N(µ, σ2), then Z .
60 vesign is taken because
For X = 60, Z z1 (say) ... (1) P[Z 0] P[Z 0] 0.5
125
For X 125, Z z 2 (say) ...(2)
Now P X 60 P Z z1 0.35
P[0 Z z1 ] 0.15
P[0 Z z 2 ] 0.40
60
0.39 … (3)
125
1.28 … (4)
(4) – (3) gives
125 60
1.28 0.39
65 65
1.67 38.92
1.67
From Eq. (4), 125 1.28 125 1.28 38.92 = 75.18
Hence mean 75.18; S.D. 38.92
Example 5: Find the quartile deviation of the normal distribution having mean µ
and variance 2 .
Solution: Let X N(, 2). Let Q1 and Q3 are the first and third quartiles. Now
as Q1, Q2 and Q3 divide the distribution into four equal parts, therefore, areas
35
Continuous Probability
Distributions
under the normal curve to the left of Q1 , between Q1 and Q2 (Median), between
Q2 and Q3 and to the right of Q3 all are equal to 25 percent of the total area. This
has been shown in Fig. 14.16.
and when
Q3
X Q3 , Z z1
Due to symmetry of normal curve, the values of Z corresponding to Q1 and Q3
are equal in magnitude because they are equidistant from mean.
Q3 z1 Q3 z1
X
E1) If X ~ N(150, 9) and Z is a S.N.V. i.e Z then find Z scores
corresponding to the following values of X
(i) X = 165 (ii) X = 120
36 E2) Suppose X ~ N (25, 4) then find
(i) P[X < 22], (ii) P [X > 23], (iii) P[X – 24< 3], and (iv) P[X – 21 > 2] Area Property of
Normal Distribution
E3) Suppose X ~ N (30, 16) then find in each case
(i) P[X ] 0.2492
(ii) P[X ] 0.0496
E4) Let the random variable X denote the chest measurements (in cm) of
2000 boys, where X ~ N(85, 36).
a) Then find the number of boys having chests measurement
i) less than or equal to 87 cm,
ii) between 86 cm and 90 cm,
iii) more than 80 cm.
b) What is the lowest value of the chest measurement among the 100
boys having the largest chest measurements?
E5) In a particular branch of a bank, it is noted that the duration/waiting time
of the customers for being served by the teller is normally distributed
with mean 5.5 minutes and standard deviation 0.6 minutes. Find the
probability that a customer has to wait
a) between 4.2 and 4.5 minutes, (b) for less than 5.2 minutes, and (c)
more than 6.8 minutes
E6) Suppose that temperature of a particular city in the month of March is
normally distributed with mean 24 C and standard deviation 6 C . Find
the probability that temperature of the city on a day of the month of
March is
(a) less than 20 C (b) more than 26 C (c) between 23 C and 27 C
37
Continuous Probability
Distributions X
(ii) Find the standard normal variate Z corresponding to each lower
limit. Suppose the values of the standard normal variate are obtained as z1,
z2, z3, …
(iii) Find P[Z z1], P[Z z2], P[Z z3],…i.e. the areas under the normal curve
to the left of ordinate at each value of Z obtained in step (ii). Using table
given in the Appendix at the end of the unit Z = zi may be to the right or left
of Z = 0.
If Z = zi is to the right of Z = 0 as shown in the following figure:
Then
P[Z zi] = 0.5 – P[zi Z 0]
= 0.5 – P[0 Z – zi] [Due to symmetry]
e.g. zi = 2 (say),
Then P[Z – 2] = 0.5 – P[–2 Z 0]
= 0.5 – P[0 Z – (–2)]
= 0.5 – P[0 Z 2]
(iv) Obtain the areas for the successive class intervals on subtracting the area
corresponding to every lower limit from the area corresponding to the
succeeding lower limit.
38
e.g. suppose 10, 20, 30 are three successive lower limits. Area Property of
Normal Distribution
Then areas corresponding to these limits are
P[X 10], P[X 20], P[X 30] respectively.
Now the difference P[X 30] – P[X 20] gives the area corresponding to
the interval 20-30.
(v) Finally, multiply the differences obtained in step (iv) i.e. areas
corresponding to the intervals by N (the sum of the observed frequencies),
we get the expected frequencies.
X f
0-10 3
10-20 5
20-30 8
30-40 3
40-50 1
Solution: First we are to find the mean and variance of the given frequency
distribution. This you can obtain yourself as you did in Unit 2 of MST-002 and
at many other stages. So, this is left an exercise for you.
You will get the mean and variance as
= 22 and 2 = 111 respectively
= 10.54
Hence, the equation of the normal curve is
2
1 x
1
f x e 2
2
2
1 x 22
1
= e 2 10.54 , x
10.54 2
39
Continuous Probability
Distributions
Expected frequencies are computed as follows:
The areas under the normal curve shown in the fourth column of the above tables
are obtained as follows:
there is no value
P[Z < –] = 0 to the left of
P[Z – 2.09] = 0.5 – P[–2.09 Z 0] [See Fig. 14.19]
= 0.5 – P[0 Z 2.09] [Due to symmetry]
From table given at
= 0.5 – 0.4817 the end of the unit
= 0.0183
40
Area Property of
Normal Distribution
Similarly
P[Z 1.71] = 0.5 + 0.4564 = 0.9564
P[Z 2.66] = 0.5 = 0.4961 = 0.9961
You can now try the following exercises:
E7) Fit a normal curve to the following distribution and find the expected
frequencies by area method.
41
Continuous Probability
Distributions
14.4 SUMMARY
The main points covered in this unit are:
1) Area property and its various applications has been discussed in detail.
2) Quartile deviation has also been obtained using the area property in an
example.
3) Fitting of normal distribution using area property and computation of
expected frequencies using area method have been explained.
14.5 SOLUTIONS/ANSWERS
E1) We are given X ~ N(150, 9)
in usual notations, we have
150, 2 9 3
X X 150
Now, Z
3
165 150 15
(i) When X = 165, Z= 5
3 3
120 150 30
(ii) When X = 120, Z= 10
3 3
E2) Here X ~ N(25, 4)
in usual notations, we have
Mean = 25, var iance 2 4 2
X X 25
If Z is the S.N.V then Z
2
22 25 3
i) X = 22, Z 1.5
2 2
P[X 22] P[Z 1.5] [See Fig. 14.21]
due to symmetry of
P[Z 1.5] normal curve
0.5 P[0 Z 1.5]
42
Area Property of
Normal Distribution
23 25 2
ii) X = 23, Z 1
2 2
P[X 23] P[Z 1] [See Fig.14.22]
due tosymmetry of
= P[Z 1] normal curve
= 0.5 P[0 Z 1]
x a b
iii) P[| X 24 | 3] P[ 3 X 24 3]
b x a b
= P[ 3 24 X 3 24)
= P[21<X 27]
21 25 4
X = 21, Z 2
2 2
27 25 2
X = 27, Z 1
2 2
P[| X 24 | 3] P[21 X 27] See Fig.14.23
P[ 2 Z 1]
43
Continuous Probability
Distributions
= P[–2< Z< 0] + P[0 < Z < 1]
= P[0 Z 2] P[0 Z 1]
= 0.4772 – 0.3413 = 0.1359
x a b x a b
x a b or x a b
= P[X 23or X 2 21]
y a
= P[X 23or X 19] y a
19 25 6
For X=19, Z 3
2 2
23 25 2
For X=23, Z 1
2 2
P[| X 21| 2] P[X 23or X 19] See Fig14.24
= P[Z 1or Z 3]
44
Area Property of
Normal Distribution
P 0 Z z1 0.2508
30
= 0.67
4
30 2.68
30 2.68= 32.68
30
ii) For X , Z z 2 (say) … (2)
4
Now P[X ] 0.0496
30
1.65
4
30 1.65 4
30 6.60
30 6.6 = 23.4
E4) We are given X ~ N(85, 36), N = 2000
i.e. 85cm, 2 36cm, N 2000
x
If X ~ N(µ, σ2) and Z then we know that Z ~ N(0, 1)
87 85 2
a) i) For X = 87, Z 0.33
6 6
Now P[X < 87] = P [Z < 0.33] [See Fig. 14.27]
= 0.5 + P [0 < Z < 0.33]
47
Continuous Probability
Distributions
b) Let x1 be the lowest chest measurement amongst 100 boys having the
largest chest measurements.
x 85
Now, for X x1 , Z 1 z1 (say) .
6
100
P[X x1 ] 0.05
2000
P Z z1 0.05 See Fig.14.30
Fig. 14.30: Area Representing the 100 Boys having Largest Chest Measurements
Therefore, the lowest value of the chest measurement among the 100
boys having the largest chest measurement is 94.84 cm.
E5) We are given
5.5 minutes, = 0.6 minutes
X
If X ~ N(, 2 ) and Z then we know that Z ~ N (0, 1)
4.2 5.5 1.3 13
a) For X = 4.2, Z 2.17
0.6 0.6 6
4.5 5.5 1.0 10 5
For X = 4.5, Z 1.67
0.6 0.6 6 3
48
Area Property of
Normal Distribution
Fig. 14.31: Area Representing Probability of Waiting Time between 4.2 and 4.5 Minutes
Fig. 14.32: Area Representing Probability of Waiting Time Less than 5.2 Minutes
49
Continuous Probability
Distributions
Fig. 14.33: Area Representing Probability of Waiting Time Greater than 6.8 Minutes
X
We know that if X ~ N(, 2 ), and Z then Z ~ N (0, 1)
20 24 4 2
a) For X = 20, Z 0.67
6 6 3
Fig. 14.34: Area Representing Probability of Temperature Less than 20 C
50
Area Property of
Normal Distribution
Fig. 14.35: Area Representing Probability of Temperature Greater than 26 C
Since, P[X > 26] = P [Z > 0.33] [See Fig. 14.35]
= 0.5 – P[0 < Z < 0.33]
From the table of areas
= 0.5 – 0.1293
under normal curve
= 0.3707
Therefore, probability that temperature of the city is more than
26 C is 0.3707
23 24 1
c) For X= 23, Z 0.17
6 6
27 24 3 1
For X= 27, Z 0.5
6 6 2
P[23 < X < 27] = P[–0.17 < Z < 0.5] [See Fig. 14.36]
= P [–0.17 < Z < 0] + P [0 < Z < 0.5]
= P[0 < Z < 0.17] + P[0 < Z < 0.5]
From the table of areas
= 0.0675 + 0.1915 under Normal Curve
= 0.2590
Therefore, probability that temperature of the city is between
23 C and 27 C is 0.2590
51
Continuous Probability
Distributions E7) Mean () 73, variance( 2 ) 39.75
and hence S.D. ( ) 6.3
The equation of the normal curve fitted to the given data is
2
1 x 73
1
f(x)= e 2 6.3
, x
(6.3) 2
Using area method,
The expected frequencies are obtained as follows:
Class Lower X Area under Difference Expected
interval limit Z= normal curve between frequency
X X 73 to the left of successive areas 40 col. V
z
6.3
Below 0 0.0197 – 0 0.8 1
60
= 0.0197
30
E8) P[X 40] 0.3,
100
33
P[40 X 50] 0.33, and
100
52
37 Area Property of
P[X 50] 0.37, Normal Distribution
100
Now, Let X ~ N(, 2 ),
Standard normal variate is
X
Z=
It is taken as ve as area to
40
When X = 40, Z z1 , (say) the left of this value is 30%
as probabilityis 0.3
It is taken as +ve as
50 area to the right of this
When X = 50, Z = z2 (say)
valueis given as37%
Now,
P[X 40] P[Z z1 ] 0.3
P[0 Z z1 ] 0.2
From table at the end of this unit,
P[0 Z z 2 ] 0.13
53
Continuous Probability
Distributions
40 50
0.525 and 0.33
40 0.525 and 50 0.33
Solving these equations for and , we have
11.7 and 46.14
54
APPENDIX Area Property of
Normal Distribution
AREAS UNDER NORMAL CURVE
The standard normal probability curve is given by
1 1
(z)= exp z 2 , z
2 2
The following table gives probability corresponding to the shaded area as shown
in the following figure i.e. P[0 Z z] for different values of z
TABLE OF AREAS
z 0 1 2 3 4 5 6 7 8 9
0.0 .0000 .0040 .0080 .0120 .0160 .0199 .0239 .0279 .0319 .0359
0.1 .0398 .0438 .0478 .0517 .0557 .0596 .0636 .0675 .0714 .0759
0.2 .0793 .0832 .0871 .0910 .0948 .0987 .1026 .1064 .1103 .1141
0.3 .1179 .1217 .1255 .1293 .1331 .1368 .1406 .1443 .1480 .1517
0.4 .1554 .1591 .1628 .1664 .1700 .1736 .1772 .1808 .1844 .1879
0.5 .1915 .1950 .1985 .2019 .2054 .2088 .2123 .2157 .2190 .2224
0.6 .2257 .2291 .2324 .2357 .2389 .2422 .2454 .2486 .2517 .2549
0.7 .2580 .2611 .2642 .2673 .2703 .2734 .2764 .2794 .2823 .2852
0.8 .2881 .2910 .2939 .2967 .2005 .3023 .3051 .3078 .3106 .3133
0.9 .3159 .3186 .3212 .3238 .3264 .3289 .3315 .3340 .3365 .3389
1.0 .3413 .3438 .3461 .3485 .3508 .3531 .3554 .3577 .3599 .3621
1.1 .3643 .3655 .3686 .3708 .3729 .3749 .3770 .3790 .3810 .3820
1.2 .3849 .3869 .3888 .3907 .3925 .3944 .3962 .3980 .3997 .4015
1.3 .4032 .4049 .4066 .4082 .4099 .4115 .4131 .4147 .4162 .4177
1.4 .4192 .4207 .4222 .4236 .4251 .4265 4279 .4292 .4306 .4319
55
Continuous Probability
Distributions
1.5 .4332 .4345 .4357 .4370 .4382 .4394 .4406 .4418 .4429 .4441
1.6 .4452 .4463 .4474 .4484 .4495 .4505 .4515 .4525 .4535 .4545
1.7 .4554 .4564 .4573 .4582 .4591 .4599 .4608 .4616 .4625 .4633
1.8 .4641 .4649 .4656 .4664 .4671 .4678 .4686 .4693 .4699 .4706
1.9 .4713 .4719 .4726 .4732 .4738 .4744 .4750 .4756 .4761 .4767
2.0 .4772 .4778 .4783 .4788 .4793 .4798 .4803 .4808 .4812 .4817
2.1 .4821 .4826 .4830 .4834 .4838 .4842 .4846 .4850 .4854 .4857
2.2 .4861 .4864 .4868 .4871 .4875 .4678 .4881 .4884 .4887 .4890
2.3 .4893 .4896 .4898 .4901 .4904 .4906 .4909 .4911 .4913 .4916
2.4 .4918 .4920 .4922 .4925 .4927 .4929 .4931 .4932 .4934 .4936
2.5 .4938 .4940 .4941 .4943 .4945 .4946 .4948 .4959 .4951 .4952
2.6 .4953 .4955 .4956 .4957 .4959 .1960 .4961 .4962 .4963 .4964
2.7 .4965 .4966 .4967 .4968 .4969 .4970 .4971 .4972 .4973 .4974
2.8 .4974 .4975 .4976 .4977 .4977 .4978 .4979 .4879 .4980 .4981
2.9 .4981 .4982 .4982 .4983 .4984 .4984 .4985 .4985 .4986 .4986
3.0 .4987 .4987 .4987 .4988 .4988 .4989 .4989 .4989 .4990 .4990
3.1 .4990 .4991 .4991 .4991 .4992 .4992 .4992 .4992 .4993 .4993
3.2 .4993 .4493 .4994 .4994 .4994 .4994 .4994 .4995 .4995 .4995
3.3 .4995 .4995 .4995 .4996 .4996 .4996 .4996 .4996 .4996 .4997
3.4 .4997 .4997 .4997 .4997 .4997 .4997 .4997 .4997 .4997 .4998
3.5 .4998 .4998 .4998 .4998 .4998 .4998 .4998 .4998 .4998 .4998
3.6 .4998 .4998 .4999 .4999 .4999 .4999 .4999 .4999 .4999 .4999
3.7 .4999 .4999 .4999 .4999 .4999 .4999 .4999 .4999 .4999 .4999
3.9 .5000 .5000 .5000 .5000 .5000 .5000 .5000 .5000 .5000 .5000
56
Continuous Uniform and
UNIT 15 CONTINUOUS UNIFORM AND Exponential Distributions
EXPONENTIAL DISTRIBUTIONS
Structure
15.1 Introduction
Objectives
15.1 INTRODUCTION
In Units 13 and 14, you have studied normal distribution with its various
properties and applications. Continuing our study on continuous distributions,
we, in this unit, discuss continuous uniform and exponential distributions. It
may be seen that discrete uniform and geometric distributions studied in Unit
11 and Unit 12 are the discrete analogs of continuous uniform and exponential
distributions. Like geometric distribution, exponential distribution also has the
memoryless property. You have also studied that geometric distribution is the
only discrete distribution which has the memoryless property. This feature is
also there in exponential distribution and it is the only continuous distribution
having the memoryless property.
The present unit discusses continuous uniform distribution in Sec. 15.2 and
exponential distribution in Sec. 15.3.
Objectives
After studing the unit, you would be able to:
define continuous uniform and exponential distributions;
state the properties of these distributions;
explain the memoryless property of exponential distribution; and
solve various problems on the situations related to these distributions.
1
for a x b
f x b a
0, otherwise
57
Continuous Probability The distribution is called uniform distribution since it assumes a constant
Distributions
(uniform) value for all x in (a, b). If we draw the graph of y = f(x) over x-axis
and between the ordinates x = a and x = b (say), it describes a rectangle as
shown in Fig. 15.1
1
ba
X
a b
Fig. 15.1: Graph of uniform function
For x b
x
F x P X x f x dx
a b
f x dx f x dx f x dx
a b
a b
1
0 dx
a
ba
dx 0 dx
b
1 b ba
=0+ x a 0 = 1.
ba ba
So,
0 for x a
x a
Fx for a x b
b a
1 for x b
58
On plotting its graph, we have Continuous Uniform and
Exponential Distributions
Fx
a b
Fig. 15.2: Graph of distribution function
b
1 x2 1 b2 a 2
=
b a 2 a b a 2 2
=
b a b a
ab
2b a 2
b b
1 1 x3 1 b3 a 3
= x2. dx = =
a
ba b a 3 a ba 3 3
b3 a 3
=
3 b a
b a b2 ab a 2
= x 3 y 3 x y x 2 xy y 2
3 b a
a 2 ab b 2
3
2
2 a 2 ab b 2 a b
2
Variance of X = E(X ) – [E(X)] =
3 2
2
4 a 2 ab b 2 3 a b
12
59
Continuous Probability
Distributions 4a 2 4ab 4b 2 3a 2 3b 2 6ab
12
2
b2 a 2 2ab b a
.
12 12
2
So, Mean =
ab
and Variance =
b a .
2 12
Let us now take up some examples on continuous uniform distribution.
Example 1: If X is uniformly distributed with mean 2 and variance 12, find
P[X < 3].
Solution: Let X U [a, b]
probability density function of X is
1
f x , a x b.
ba
Now as Mean = 2
ab
2
2
a+b=4 … (1)
Variance = 12
2
b a 12
12
2
b a 144
b – a = 12
Variance =
b a
12 0
144
12.
12 12 12
S.D. = 12
Thus, the coefficient of variation
S.D.
= 100 [Also see Unit 2 of MST-002]
Mean
12
100 = 57.74%
6
Example 3: Metro trains are scheduled every 5 minutes at a certain station. A
person comes to the station at a random time. Let the random variable X count
the number of minutes he/she has to wait for the next train. Assume X has a
uniform distribution over the interval (0, 5). Find the probability that he/she
has to wait at least 3 minutes for the train.
Solution: As X follows uniform distribution over the interval (0, 5),
probability density function of X is
1 1 1
f x , 0 x5
ba 50 5
Thus, the desired probability
5 5 5
1 1
P X 3 f x dx dx 1 dx
3 3
5 53
1 5 1 2
x 3 5 3 0.4
5 5 5
E2) A random variable X has a uniform distribution over (–2, 2). Find k for
1
which P[X > k] = .
2
61
Continuous Probability Now, let us discuss exponential distribution in the next section.
Distributions
x
ex x x
x
1 e 0 e e
0
0
e x 1 1 e x .
1 e x for x 0
So, F x .
0, elsewhere
x ex dx
0
ex
ex
x 1 dx [Integrating by parts]
0 0
62
Continuous Uniform and
1 ex
Mean 0 0 Exponential Distributions
0
1 1 1
2 0 1 2 .
Now, E X 2 x 2f x dx x 2 e x dx
0 0
x 2e x dx
0
ex ex
= x 2 2x dx [Integrating by parts]
0 0
2
0 0 x e x dx
0
2 2
0 0
x e x dx = x ex dx
2
E X
21
[E(X) is mean and has already been obtained]
2
2
2
2 1 2 1 1
Thus, Variance = E(X2) – [E(X)] 2 = 2
2 2 2
1 1
So, Mean = and Variance = 2 .
1 1 Mean
Remark 1: Variance = 2
Mean = Variance
.
So,
Value of Implies
<1 Mean < Variance
=1 Mean = Variance
>1 Mean > Variance
63
Continuous Probability Memoryless Property of Exponential Distribution
Distributions
Now, let us discuss a very important property of exponential distribution and
that is the memoryless (or forgetfulness) property. Like geometric distribution
in the family of discrete distributions, exponential distribution is the only
distribution in the family of continuous distributions which has memoryless
property. The memorless property of exponential distribution is stated as:
If X has an exponential distribution, then for every constant a 0, one has
P[X x + a X a] = P X x for all x i.e. the conditional probability of
waiting up to the time ' x a ' given that it exceeds ‘a’ is same as the
probability of waiting up to the time ‘ x ’. To make you understand the above
concept clearly let us take the following example: Suppose you purchase a TV
set, assuming that its life time follows exponential distribution, for which the
expected life time has been told to you 10 years (say). Now, if you use this TV
set for say 4 years and then you ask a TV mechanic, without informing him/her
that you had purchased it 4 years ago, regarding its expected life time. He/she,
if finds the TV set as good as new, will say that its expected life time is 10
years.
So, here, in the above example, 4 years period has been forgotten, in a way,
and for this example:
P[life time up to 10 years]
= P[life time up to 14 years | life time exceeds 4 years]
i.e. P[X 10] = P [X 14 X 4]
or P[X 10] = P[X 10 + 4 X 4]
Here a = 4 and x = 10.
Let us now prove the memoryless property of exponential distribution.
X x a X a
Proof: P X x a X a [By conditional probability]
P X a
where
P X x a X a P a X x a
xa xa
x
f x dx e dx
a a
xa
ex e x a e a
a
x a
e ea e x .ea ea
x e x a a
P[X a] = f x dx = e dx 0 e e
a a a
64
e a 1 e x Continuous Uniform and
P X x a X a 1 e x Exponential Distributions
ea
x
Also, P[ X x ] e x dx
0
x ex
Ae dx 1 A 1
0 (1) 0
–A [0 –1] = 1 A = 1
f x e x
=1
1 1
Hence, mean = 1,
1
1 1
and variance = 1.
2 1
So, the mean and variance are equal for the given exponential distribution.
Example 5: Telephone calls arrive at a switchboard following an exponential
distribution with parameter = 12 per hour. If we are at the switchboard, what
is the probability that the waiting time for a call is
i) at least 15 minutes
ii) not more than 10 minutes.
Solution: Let X be the waiting time (in hours) for a call.
f x ex , x 0
= 1– e 2 = 1– (0.1353) = 0.8647
Now, we are sure that you can try the following exercises.
E3) What are the mean and variance of the exponential distribution given
by:
f x 3e3x , x 0
E4) Obtain the value of k > 0 for which the function given by
f x 2e kx , x 0
66
We now conclude this unit by giving a summary of what we have covered in it. Continuous Uniform and
Exponential Distributions
15.4 SUMMARY
Following main points have been covered in this unit.
1) A random variable X is said to follow a continuous uniform (rectangular)
distribution over an interval (a, b) if its probability density function is given
by
1
for a x b
f x b a
0, otherwise
ab
2) For continuous uniform distribution, Mean and
2
2
variance
b a .
12
3) A random variable X is said to follow exponential distribution with
parameter > 0, if it takes any non-negative real value and its probability
density function is given by
ex for x 0
f x
0 , elsewhere
1 1
4) For exponential distribution, Mean = and Variance = 2 .
5) Mean > or = or < Variance according to whether > or = or < 1.
6) Exponential distribution is the only continuous distribution which has
the memoryless property given by:
P[X x + a X a] = P[X x].
15.5 SOLUTIONS/ANSWERS
E1) As X U[ a, a],
probability density function of X is
1 1 1
f x , a x a .
a ( a) a a 2a
1
i) Given that P[X > 4] =
3
a
1 1
2a dx 3
4
1 1
x a4
2a 3
67
Continuous Probability a4 1
Distributions
2a 3
3a – 12 = 2a
a = 12.
3
ii) P X 1
4
1
1 3
dx
a
2a 4
1 3
x 1 a
2a 4
1 3
1 a
2a 4
3
1+ a = a
2
2 2a 3a
a2
iii) P X 2 P X 2
X 2 X 2
X 2 or X 2
2 X 2 and
P 2 X 2 P X 2 or X 2
X 2 X 2
X 2or X 2
X 2or X 2
By Addition law of
P 2 X 2 P X 2 P X 2 probability for mutually
exclusive events
2 2 a
1 1 1
dx dx dx
2
2a a
2a 2
2a
1 1 1
2a
4 2 a a 2
2a 2a
4 (2 a) (a 2)
4 4 2a
2a = 8
a=4
E2) As X ~ U [ 2, 2],
1
f x , 2 x 2.
68 4
1 Continuous Uniform and
Now P X k Exponential Distributions
2
2
1 1
4dx 2
k
2k 1
4 2
2–k=2
k = 0.
E3) Comparing it with the exponential distribution given by
f x e x , x 0
We have = 3
1 1 1 1
Mean = and Variance = 2
3 9
E4) As the given function is exponential distribution i.e. a p.d.f.,
f x dx 1
0
we have
= 2 and = k
k=2
1
x
E5) Here P X x F x 1 e x = 1 – e 20
ii) P[First accident occurs on second week from starting of working day
on Tuesday till end of working day on Wednesday]
=P[First accident occurs after 7 working days
and before the end of 9 working days]
= P[7 < X 9]
= P[X 9] – P[X 7]
69
Continuous Probability 9 7
Distributions
1 e 20 1 e 20
9 7
20 20
e e
7 9
20 20
e e
e0.35 e 0.45
= 0.7047 – 0.6376 [See the table give at the end of Unit 10]
= 0.0671.
70
Gamma and Beta
UNIT 16 GAMMA AND BETA Distributions
DISTRIBUTIONS
Structure
16.1 Introduction
Objectives
16.1 INTRODUCTION
In Unit 15, you have studied continuous uniform and exponential
distributions. Here, we will discuss gamma and beta distributions. Gamma
distribution reduces to exponential distribution and beta distribution reduces
to uniform distribution for special cases. Gamma distribution is a
generalization of exponential distribution in the same sense as the negative
binomial distribution is a generalization of geometric distribution. In a sense,
the geometric distribution and negative binomial distribution are the discrete
analogs of the exponential and gamma distributions, respectively. The present
unit discusses the gamma and beta distributions which are defined with the
help of special functions known as gamma and beta functions, respectively.
So, before defining these distributions, we first define gamma and beta
functions in Sec. 16.2 of this unit. Then gamma distribution and beta
distribution of first kind followed by beta distribution of second kind are
discussed in Secs. 16.3 to 16.5.
Objectives
After studing this unit, you would be able to:
define beta and gamma functions;
define gamma and beta distributions;
discuss various properties of these distributions;
identify the situations where these distributions can be employed; and
solve various practical problems related to these distributions.
71
Continuous Probability
Distributions Beta Function
1
m 1 n 1
Definition: If m > 0, n > 0, the integral x 1 x
0
dx is called a beta
On the basis of the above discussion, you can try the following exercise.
E1) Express the following as a beta function:
1 1 1
i) x 3
1 x 2 dx
0
1
2 5
ii) x 1 x
0
dx
x2
iii) 1 x 5
dx
0
1
2
x
iv) 1 x 2
dx
0
72
Gamma Function Gamma and Beta
Distributions
Though we have defined Gamma function in Unit 13, yet we are again
defining it with more properties, examples and exercises to make you clearly
understand this special function.
n 1 x
Definition: If n > 0, the integral x e dx is called a gamma function and is
0
denoted by n
e.g.
2 x
(i) x e dx 2 1 3
0
x 1 3
(ii) xe dx 1
0 2 2
Some Important Results on Gamma Function
1. If n > 1, n n 1 n 1
2. If n is a positive integer, n n 1 !
1
3.
2
Relationship between Beta and Gamma Functions
10
(ii) 1 x
0
dx
1
x
(iii) x 2 e dx
0
Remark 1:
(i) It can be verified that
f x dx 1
0
Verification:
r ex x r 1
x dx
0 0 r
dx
r 1
ex x
dx
0 r
Putting x = y dx dy
Also, when x 0, y 0 and when x , y
1 y
e y r 1dx
r 0
1
r [Using gamma function defined in Sec. 16.2]
r
=1
(ii) If X is a gamma variate with two parameters r > 0 and > 0, it is expressed
as X γ(, r).
(iii) If we put r = 1, we have
e x x 0
f (x) ,x 0
1
e x , x 0
which is probability density function of exponential distribution.
e x .x r 1
f (x) , x 0, r 0
r
74
It is known as gamma distribution with single parameter r. This form of the Gamma and Beta
gamma distribution is also widely used. If X follows gamma distribution with Distributions
e x e x
x 1 dx [Integrating by parts]
1 5 5 1
ex
0 5e5 e x dx 5e5
5 1 5
5e 5 0 e5
= 6 e 5
75
Continuous Probability
Distributions
= 6 0.0070 [See the table given at the end of Unit 10]
= 0.042
ii) In this case r = 1, = 1 and hence
r ex .x r 1
P X 5 dx
5 r
(1)1 e x x 0 e x
dx e x dx 0 e5 0.0070
5 (1) 5 1 5
Alternatively,
As r = 1, so it is a case of exponential distribution for which
f x ex , x 0
e x
P X 5 e x x
dx 1e dx 0 e 5 0.0070
5 5 1 5
Here is an exercise for you.
E3) Telephone calls arrive at a switchboard at an average rate of 2 per minute.
Let X denotes the waiting time in minutes until the 4th call arrives and
follows gamma distribution. Write the probability density function of X.
Also find its mean and variance.
Let us now discuss the beta distributions in the next two sections:
m, n
m n
m n
Now, we are in a position to define beta distribution which is defined with the
help of beta function. There are two kinds of beta distribution beta
distribution of first kind and beta distribution of second kind. Beta distribution
of second kind is defined in next section of the unit whereas beta distribution
of first kind is defined as follows:
Definition: A random variable X is said to follow beta distribution of first
kind with parameters m > 0 and n > 0, if its probability density function is
given by
1 m 1 n 1
m, n x 1 x , 0 x 1
f (x)
0, otherwise
The random variable X is known as beta variate of first kind and can be
expressed as X 1(m, n)
76
Remark 5: If m = 1 and n = 1, then the beta distribution reduces to Gamma and Beta
Distributions
1 11
f x x11 1 x , 0 x 1
1,1
0
x 0 1 x
, 0 x 1
1,1
1
,0 x 1
1,1
11 0 0
But 1,1
2 1
Therefore, f (x)
11 1
1
f (x) 1, 0 x 1
1
,0 x 1
1 0
which is uniform distribution on (0, 1).
1
[p.d.f. of uniform distribution on (a, b) is f (x) , a x b]
ba
So, continuous uniform distribution is a particular case of beta
distribution.
Mean and variance of Beta Distribution of First Kind
Mean and Variance of this distribution are given as
m
Mean =
mn
mn
Variance = 2
m n m n 1
Example 4: Determine the constant C such that the function
6
f (x) Cx 3 1 x , 0 x 1 is a beta distribution of first kind. Also, find its
mean and variance.
Solution: As f x is a beta distribution of first kind.
1
f x 1
0
1
3 6
Cx 1 x
0
dx 1
1
6
C x 3 1 x dx 1
0
77
Continuous Probability
Distributions C 3 1, 6 1 1 [By definition of Beta distribution of first kind]
1
C
4, 7
47 m n
m, n
47 m n
11 10
4 7 3 6
10 9 8 7 6
840
3 2 6
6
Thus, f x 840x 3 1 x
7 1
= 840x 4 1 1 x
7 1
x 41 1 x
4, 7
1
[ 840 just obtained above in this example]
4, 7
m = 4, n = 7
m 4 4
Mean = ,
m n 4 7 11
mn
and Variance = 2
m n m n 1
47
2
4 7 4 7 1
28 7 7
12112 121 3 363
Now, you can try the following exercises.
E4) Using beta function, prove that
1
2 3
60x 1 x
0
dx 1
78
Gamma and Beta
16.5 BETA DISTRIBUTION OF SECOND KIND Distributions
Let us now define beta distribution of second kind.
Definition: A random variable X is said to follow beta distribution of second
kind with parameters m > 0, n > 0 if its probability density function is given
by
x m 1
mn
, 0x
f x m, n 1 x
0, elsewhere
x m 1
Remark 6: It can be verified that m, n 1 x mn
dx 1
0
Verification:
x m 1 1 x m 1
m, n 1 x dx dx
0
m n
m, n 0 1 x m n
x m-1
mn
dx is another form
0 1+x
1 of beta function.
m, n
m, n
(see Sec. 16.2 of this Unit)
=1
f x dx 1
0
79
Continuous Probability
Distributions kx 3
1 x dx 1
0
7
x 4 1
k 4 3
dx 1
0 1 x
k 4, 3 1
1 7 6 6 5 4
k 60
4,3 4 3 3 2 2
Here m = 4, n = 3
m 4 4
Mean = 2
n 1 3 1 2
m m n 1 4(4 3 1) 46
Variance = 2
2
6
n 1 n 2 (3 1) 3 2 4 1
E7) Obtain mean and variance for the beta distribution whose density is given
by
60x 2
f x 7
,0 x
1 x
16.6 SUMMARY
The following main points have been covered in this unit:
1) A random variable X is said to follow gamma distribution with
parameters r > 0 and > 0 if its probability density function is given by
r ex x r 1
, x0
f x r
0, elsewhere
1 m 1 n 1
m, n x 1 x , 0 x 1
f x
0, otherwise
m mn
Its mean and variance are and 2
, respectively.
mn m n m n 1
5) A random variable X is said to follow beta distribution of second kind
with parameters m > 0, n > 0 if its probability density function is given by:
x m 1
mn
,0 x
f x m, n 1 x
0, elsewhere
m m m n 1
Its Mean and Variance are , n 1; and 2
,n 2
n 1 n 1 n 2
respectively.
6) Exponential distribution is a particular case of gamma distribution and
continuous uniform distribution is a particular case of beta distribution.
16.7 SOLUTIONS/ANSWERS
1 1 1
1 1 2 3
1 x 2 dx B
3
E1) (i) x 1, 1 B ,
0 3 2 3 2
1 1
2 11 5 6 1
(ii) x 1 x dx x 1 x
0 0
dx
5
E2) e x .x 5/ 2 dx 1
0 2
7
2
81
Continuous Probability
Distributions 5 5 5 3 3 5 3 1 1
2 2 2 2 2 2 2 2 2
Result 1on gamma
function (See Sec. 16.2)
5 3 1
Result 3 on gamma function
2 2 2
15
8
1 10
(ii) x 1 x
0
dx = β(1 + 1, 10 + 1)
= β(2,11)
=
1!10 ! [Result 2 on gamma function]
12 !
=
10 ! =
1
1
12 1110 ! 12 11 132
1
x 1 1
(iii) 0 2 e dx 2 1 2
x
E3) Here = 2, r = 4.
r ex .x r 1
f (x) ,x 0
r
24.e2x .x 3
,x 0
4
16e2x .x 3
= ,x 0
3
8
= x 3e 2x , x 0
3
r 4
Mean = 2,
2
r 4
Variance = 2
2 1
2
1 1
3 4 1
E4) 60x 2 1 x dx 60 x 31 1 x dx 60 3, 4
0 0
82
34 2 3 60 2 3 2 Gamma and Beta
= 60 = 60 1 Distributions
7 6 6 5 4 3 2
1 1 1
E5) kx 2
1 x 2 dx 1
0
1 1
k 1, 1 1
2 2
1 2 1 2 2
k
1 3
, 1 3 1
1
2 2 2 2 2 2
Now, as the given p.d.f. of beta distribution of first kind is
2 12 1
f (x) x 1 x 2 , 0 x 1
1 3
1 1
x2 1 x 2
, 0 x 1
1 3
,
2 2
1 3
m , n
2 2
1
m 1
and hence mean = 2
mn 1 3 4
2 2
mn
Variance = 2
m n m n 1
1 3 3
3 1
2
2 4
2
2
1 3 1 3 2 3 4 4 3 16
1
2 2 2 2
x3 x 4 1
E6) 1 x 13/ 2
dx 5
dx
4
0 0 1 x 2
5 5
4 3
5 2 2
= 4,
2 5 13
4 2
2
5
6
2 6 32 64
=
13 11 9 7 5 5 13 11 9 7 5 15015
. . . .
2 2 2 2 2 2
83
Continuous Probability
Distributions 60x 2
E7) f x 7
,0 x
1 x
60x 31
3 4
,0 x
1 x
x 31 34 23 1
3 4
, 0x 3, 4
3, 4 1 x 6 6 60
m =3, n = 4
m 3
Hence, mean = 1
n 1 4 1
m m n 1 3 3 4 1 3 6
Variance = 2
= 2
= 1.
n 1 n 2 4 1 4 2 9 2
84