Example II: The Normal Distribution (A Continuous Random Variable)
Example II: The Normal Distribution (A Continuous Random Variable)
The Normal distribution is the most important in statistics. Many other distributions are
based upon or related to the Normal distribution. The Normal distribution is:
bell shaped
unimodal
symmetric
extends from - infinity to + infinity.
The parameters of the distribution are µ and ², like n and p for the Binomial. turns out to
be the mean of the distribution, 2 its variance. Note that the parameters have different
meanings from the Binomial. Different values of and 2 give rise to different Normal
distributions, changing the mean and/or variance.
Examples of variables which are Normally distributed are: IQ, height of men, height of
women, the sample mean, errors in a regression equation, etc. The Normal distribution tends
to arise when a random variable is influenced by many independent factors, none of which is
predominant. Hence height is determined by genes, diet, environment, etc. Men and women
are different because of the dominating influence of sex. Hence the height of people is not
(quite) Normally distributed.
Men's heights are approximately Normally distributed with mean 174 (cm) and variance 100,
i.e. H ~ N(174, 100). What proportion of men are over 180cm tall? Note this is equivalent to
the probability of picking out a man at random from the population who is over 180cm tall.
The answer corresponds to the area under the Normal distribution, to the right of H = 180.
180
To answer this, we could find the integral of the Normal distribution formula above, but this
is difficult mathematically. Fortunately, the answers have been tabulated, once we know how
many standard deviations 180 lies above the mean. This is calculated as the z-score:
X 180 174
z 0.6
10
Hence 180 lies 0.6 standard deviations above the mean. From tables of the standard Normal
distribution, we can then read off that 27.43% of the distribution lies above the value of z =
0.6 or, equivalently, above H = 180cm in our example.
The table gives areas under the standard Normal distribution, which has mean 0 and variance
(and standard deviation) 1, i.e. z ~ N(0,1). Thus we have established that
171 174
z 0.3 hence 38.21% (ignore the negative sign when consulting the table)
10
2
A B
85 100 110
Area A: z = (85 - 100)/10 = -1.5, hence 6.68% in the tail, hence A = 50% - 6.68% = 43.32%.