Normal Distribution Normal Probability Distribution: Mean Continuous Random Variable (X)
Normal Distribution Normal Probability Distribution: Mean Continuous Random Variable (X)
Lesson Objectives
At the end of this lesson you shall be able to:
define what a normal distribution is;
enumerate the properties of a normal distribution;
determine the family of normal distribution; and
discuss the relationship between the mean, median and the mode in a normal distribution;
NORMAL DISTRIBUTION
The normal distribution (or normal probability distribution) is the most important theoretical
continuous distribution used in statistics because:
1. it has played a central role in the development of inferential statistics;
2. many real-world random variables exhibit frequency (or relative frequency) distribution that
closely resemble normal distribution; and
3. it can conveniently be used to approximate many other probability distributions, such as binomial
and the Poisson.
The normal probability distribution is defined by the function called the normal probability density
function (or normal probability function), given below, which for the continuous random variable X can
take a specific values X = x.
2
− ( x−μ )
f ( x )=¿ 1 e , for −∞< x <∞
2
2σ
σ √2 π
where:
e (Euler’s constant) = 2.71828…
π = 3.14159…
μ= E ( X ) = mean of the normal distribution
σ =E [ ( X−μ ) ] = variance of the normal distribution
2 2
Density
Mean
68.3%
95.4%
99.7%
Example:
Use the normal probability density function to calculate f ( x) for X =1 for a normal distribution with the
parameters μ=0 and σ 2=1.
Solution:
2
− ( x−μ )
The equation is f ( x )=¿ 1 2σ
2
e
σ √2 π
2
− ( x−0 )
1
Inserting μ=0 and σ =1. Note that
2
f ( x )=¿ e 2 (1 )
√1 ( √ 2 π )
equation requires σ , hence σ =√ 1
2
−( 1−0)
1 2(1)
Insert the value of π (3.1416) and X =x=1 f ( x )=¿ e
√1 ( √ 2(3.1416))
−1
1 2
Simplify f ( x )=¿ e
√6.2832
INTRODUCTION TO NORMAL DISTRIBUTION
−1
1
f ( x )=¿ e 2
2.5066
−1
f ( x )=¿ 0.3989 e 2
−1
Insert the value of e (2.7183) f ( x )=¿ 0.3989 ( 2.7183 ) 2
Example:
For the two normal distributions (A and B) shown below, which has larger (a) μ, (b) σ and (c) σ 2?
Solution:
(a) Because in any random distribution the mean μ indicates the location on the horizontal axis
(continuous random variable) of the median of the distribution, we know that a vertical line
erected above μ ot f ( x) will divide the distribution into two mirror-image halves. From this you
can see in the figure above that the mean of B ( μ B) is to the right of the mean of A ( μ A ).
Therefore, as it is true for any rectangular Cartesian coordinate system that numbers along the
horizontal axis are positive and increasing to the right of the origin, we know that μ A < μ B .
(b) Standard deviation σ is a measure of the dispersion (or spread) of the values around the mean.
We also know from the empirical rule that 68.3% of the area in a normal distribution will
always lie within one standard deviation from the mean, and consequently the distribution will
have a more distinct peak or greater “ peakedness”. From this we can see on the figure that
σ A >σ B.
2 2
(c) Since σ A >σ B, hence σ A >σ B.
Kurtosis
Not all unimodal, symmetrical distributions are normal distributions. They may differ from a
normal in terms of kurtosis which is the degree of peakedness. Three distributions that differ in kurtosis
are shown below.
The middle distribution is a normal distribution, which is called mesokurtic (meso, middle). The
distribution on the left, which is flatter and less peaked than the normal with a relatively even distribution
of values and with shorter tails, is called platykurtic (platy, flat). The distribution to the right which is more
peaked than the normal with values concentrated in the middle and with long tails, is called leptokurtic
(lepto, slender).
A mesokurtic distribution has a kurtosis ( a 4) of exactly 3 or very near to 3 with an excess kurtosis (
g2 for population, G2 for sample) of 0 or very near to 0. A distribution with a kurtosis of less than 3 and an
excess kurtosis of less than 0 is platykurtic, while a distribution with a kurtosis larger than 3 and an excess
kurtosis larger than 0 is leptokurtic.
We compute the moment coefficient of kurtosis is computed using the following equation:
m4
kurtosis: a 4 = 2
(σ2)
where:
4
m4 =¿ ∑ f ( x−μ )
n
2
2 ∑f ( x−μ )
σ =variance=¿
n
n−1
Excess kurtosis (sample): G 2=¿
(n−2)(n−3)
[ ( n+1 ) g2 +6 ]
Example:
The following data describes college men’s height (sample size, n = 100) in terms of inches. From the data
set, μ=67.45. Determine the kurtosis of the said distribution.
m4
Solving for the kurtosis, a 4 a4 = 2
(σ2)
4
Solving m4 m4 =¿ ∑ f ( x−μ )
n
Insert ∑ f ( x−μ )4=19937.59 and
INTRODUCTION TO NORMAL DISTRIBUTION
19937.59
n = 100 m4 =¿
100
m4 =¿ 199.3759
∑ f ( x−μ )2
Solving σ 2 2
σ =¿
n
Insert∑ f ( x−μ )2=852.75 and
852.75
n = 100 σ 2=¿
100
σ 2=¿ 8.5275
199.3759
a4 =
( 8.5275 )2
199.3759
a4 =
72.7183
a 4=2.7418
But since our data is from a sample and not a population we will compute excess kurtosis for a sample, G 4 .
n– 1
G4 =
( n−2 ) ( n−3 )
[ ( n+1 ) g2 +6 ]
100 – 1
G4 = [( 100+1 ) (−0.2582)+6 ]
(100−2 ) (100−3 )
99
G4 = [ ( 101 ) (−0.2582)+6 ]
( 98 )( 97 )
99
G4 = [ −26.0782+6 ]
9506
G4 =0.0104(−20.0782)
G4 =−0.2088
Since the kurtosis (2.7418) is less than 3 and the excess kurtosis ( −0.2088 ) is less than zero, then the
distribution is not a normal distribution but a slightly platykurtic one.
ACTIVITY
1. Calculate f (x) for X =1 for the normal distribution with the following parameters:
a. μ=0 and σ 2=4
b. μ=0 and σ 2=9
c. μ=0 and σ 2=15