Stats Yr2 Chapter 3::: Normal Distribution
Stats Yr2 Chapter 3::: Normal Distribution
Distribution
[email protected]
www.drfrostmaths.com
@DrFrostMaths
180cm Height in cm ()
A variable with this kind of distribution is
said to have a normal distribution. For normal distributions we tend to draw
the axis at the mean for symmetry.
What does it look like?
𝑓 ( 𝑥)
We can set the mean and the standard deviation of the Normal
Distribution. If a random variable is normally distributed, then we write
Normal Distribution Q & A
Q1 For a Normal Distribution to be used, the
variable has to be:
?
continuous
𝜇− 𝜎 𝜇+ 𝜎 Height in cm ()
Just For Your InterestTM: The distribution, with a given mean and given standard
deviation , that ‘assumes the least’ (i.e. has the maximum possible ‘entropy’) is… the
Normal Distribution!
Difficult Proof: https://en.wikipedia.org/wiki/Differential_entropy#Maximization_in_the_normal_distribution
Extra Context: This is important in something called Bayesian Statistics. We often have to choose a suitable distribution for the
‘prior’ in the model (i.e. some ‘hidden’ variable). When making inferences based on observed data, we want to assume as little
as possible about any hidden variable, so using a Normal distribution therefore is the most mathematically appropriate choice.
The 68-95-99.7 rule
The 68-95-99.7 rule is a shorthand
used to remember the percentage
of data that is within 1, 2 and 3
standard deviations from the mean
respectively.
You need to memorise this!
!
of data is within one standard
deviation of the mean.
of data is within two standard
deviations of the mean.
of data is within three standard
deviations of the mean.
a b
8? 7.8 ?8 8.2
a b
68%
16%
70
?
100 130 100? 115
We know that 95% of data 68% lies within one standard deviation,
lies within 2 standard so there must be 16% at each tail.
deviations of the mean.
Exercise 3A
Pearson Stats/Mechanics Year 2
Pages 40-41
Getting normal values from your calculator
“Use of Technology” Monkey says:
If for some reason you haven’t got a Classwiz by this point, for Kong’s sake, buy one now. [Surreptitiously
pockets banana handed to me by Casio]. The instructions below assume you have a Classwiz.
b Use 93 for the lower limit and something Fro Note: vs makes no
difference as the
arbitrarily high (e.g. 100000) for the upper limit.
? distribution is continuous.
93 100
a ?
b Let be the number of adults among 30 who are eligible to join.
Froculator Note: You
? absolutely can’t use tables in a
formula booklet here. You’ll
need to use a calculator.
Exercise 3B
Pearson Stats/Mechanics Year 2
Pages 40-41
Inverse Normal Distribution
We now know how to use a calculator to value of the variable to obtain a
probability. But we might want to do the reverse: given a probability of being in a
region, how do we find the value of the boundary?
0.75
20 𝑎
DRAW A SKETCH!
Further Example
If the IQ of a population is distributed using .
a. Determine the IQ corresponding to the
top 30% of the population.
b. Determine the interquartile range of IQs.
a b
0.70 0.50
0.25
100 𝑎 𝑄1 100 𝑄3
? ?
𝑃 ( 𝑋>𝑘 ) =0.3 In general the quartiles of a
normal distribution are
approximately
Test Your Understanding
. Using your calculator,
a. determine the such that
b. determine the such that
c. determine the such that
d. determine the interquartile range of .
a c
0.75 ? 0.2
80 𝑎 𝑐76 80
b d
0.4 ? 0.50
𝑃 (?𝑋<𝑄1 )= 0.25∴𝑄1=75.28
0.2375 0.25
75 80 𝑏 𝑄1 80 𝑄3
Exercise 3C
Pearson Stats/Mechanics Year 2
Pages 46-47
Standard Normal Distribution
! is the number of standard deviations above the mean.
If again we use IQ distributed as then: This formula makes sense if you think about
(in your head!) the definition above. For an IQ of 130:
as expected.
IQ Z
100 0 ? represents the coding:
130 2 ? and . is known as a standard normal distribution.
85 -1 ?
165 4.333
?
62.5 -2.5
?
The 0 and 1 of are the result of the coding. If we’ve subtracted from each value the mean of
the normal distribution is now 0. If we’ve divided all the values by the standard deviation is
now
Standard Normal Distribution
The point of coding in this context is that all different possible normal distributions become a single
unified distribution where we no longer have to worry about the mean and standard deviation.
It means for example when we calculate , this will always give the same probability regardless of the
original distribution.
𝑃 ( 𝑋 ≥55 )
a b
? ?
‘Standardise’ to turn into .
5%
0 𝑧
Further Examples
(a) Determine
(b) Determine
(c) Determine the such that
(d) Determine the such that
a
𝑃 ( 𝑍>−1.3?)=𝑃 ( 𝑍<1.3 )
Fro Tip: Either changing <
to/from > or changing the sign
(+ to/from -) has the effect of
?
𝑃 ( 𝑎<𝑍<𝑏) = 𝑃 ( 𝑍<𝑏 ) − 𝑃 ( 𝑍<𝑎 )
c
From the second sketch we can see that
our value is in the bottom half, so it is
0.7 negative. However, our table only gives d By symmetry, if 0.6 at
positive values of .
the centre, must be 0.2
0 0.5244 From the graphs on the left, we can see
at each tail.
?
by symmetry that the value for the top
30% must be the negation of the -value
?
for the bottom 30% (i.e. top 70%).
0.6
𝑃 ( 𝑍 >𝑎 )=0.7
0.7 0.2 0.2
−𝑎 0 𝑎
−0.5244 0
Test Your Understanding
1 IQ is distributed with mean 100 and standard deviation 15. Using an
appropriate table, determine the IQ corresponding to the
(a) top 10% of people.
(b) bottom 20% of people.
a
2 If , determine, in terms of :
? (a)
(b)
b a
? ?
b
3 Find the such that:
(a) ?
(b)
𝜇20
0.2
𝑃 ( 𝑋>20 )= 0.2
?
?
?
Exercise 3E
Pearson Stats/Mechanics Year 2
Pages 51-53
Approximating a Binomial Distribution
𝑝(𝑥)
(#successes)
The graph shows the probability function for different Binomial Distributions. Which
one resembles another distribution and what distribution does it resemble?
When is close to 0.5, and is fairly large, it resembles a normal distribution.
?
The results in the distribution being symmetrical. e.g. For a fair coin toss with 10
throws, we’re just as likely to get 1 Head out fo 10 as we are 1 Tail.
Approximating a Binomial Distribution
If we’re going to use a normal distribution to approximate a Binomial distribution,
it makes sense that we set the mean and standard deviation of the normal
distribution to match that of the original binomial distribution:
𝜇=𝑛𝑝 ?
?
! If is large and close to 0.5, then the binomial distribution can be approximated by
the normal distribution where
The random variable represents the time to finish a race in hours. We’re
interested in knowing the probability Alice took 6 hours to the nearest hour.
How would you represent this time on a number line given hours is discrete?
And what about if hours was now considered to be continuous (as )?
Discrete:
Continuous: ?
3 4 5 6 7 8 9 10
We can’t just find when is continuous, because the probability is effectively 0. But would
seem a sensible interval to use because any time between 5.5 and 6.5 would have rounded to 6
hours were it discrete.
Continuity Corrections
If is a discrete variable, and is its continuous equivalent, how would you represent for
?
Discrete: 𝑋≥5
Continuous: ? 𝑌 ≥ 4.5
Notice the range has been
enlarged by an extra 0.5.
3 4 5 6 7 8 9 10
How would represent for ?
Discrete: 𝑋<9 → 𝑋 ≤ 8
Continuous: ? 𝑌 ≤ 8.5
3 4 5 6 7 8 9 10
! A continuity correction is approximating a
discrete range using a continuous one.
1. If > or <, convert to first.
2. Enlarge the range by 0.5.
Examples
Discrete Continuous
?
?
?
?
?
?
?
?
! A continuity correction is approximating a
discrete range using a continuous one.
1. If > or <, convert to first.
2. Enlarge the range (at each end) by 0.5.
Full Example
[Textbook - Edited] For a particular type of flower bulbs, 55% will produce yellow
flowers. A random sample of 80 bulbs is planted.
(a) Calculate the actual probability that there are exactly 50 flowers.
(b) Use a normal approximation to find a estimate that there are exactly 50 flowers.
(c) Hence determine the percentage error of the normal approximation for 50 flowers.
a
?
b Percentage error:
?
c
?
Test Your Understanding
Edexcel S2 Jan 2004 Q3
?
?
?
Exercise 3F
Pearson Stats/Mechanics Year 2
Page 55
Hypothesis Testing on the Sample Mean
0 1 2 3 4 5 6 7 8 9
150
100 as that of the variance of the population of
50 children?
0 No! On the left, we can see that how spread
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 6.5 7 7.5 8 8.5 9
out the sample means are depends on the
Mean of Sample of 10 Units
sample size. If the sample size is small, the
! For a random sample of size taken from ?
sample means are likely to vary quite a bit.
But with a larger sample size, we expect the
a random variable , the sample mean is different to be closer to the population
normally distributed with mean .
Examples
[Textbook] A certain company sells fruit juice in cartons. The amount of juice in a carton has
a normal distribution with a standard deviation of 3ml.
The company claims that the mean amount of juice per carton, , is 60ml. A trading inspector
has received complaints that the company is overstating the mean amount of juice per
carton and wishes to investigate this complaint. The trading inspector takes a random
sample of 16 cartons and finds that the mean amount of juice per carton is 59.1ml.
Using a 5% level of significance, and stating your hypotheses clearly, test whether or not
there is evidence to uphold this complaint.
Fro Note: Don’t
? As always, state the
distribution and its confuse and . The is
?
Let be the amount of juice per carton. distribution under .
the distribution over
amounts of drink in
Assuming , ? each individual
? Use to work out the
distribution of the
carton.
is the distribution
so there is insufficient ?
sample means, . over sample means,
evidence to reject and conclude that the i.e. the possible
sample means we
mean amount of juice in the whole population is less than 60 see as we take
ml. samples of 16
? cartons. might not
be normally
distributed, but will
be.
Finding the critical region
[Textbook] A machine products bolts of diameter where has a normal distribution with mean 0.580 cm
and standard deviation 0.015 cm. The machine is serviced and after the service a random sample of 50
bolts from the next production run is taken to see if the mean diameter of the bolts has changed from
0.580 cm. The distribution of the diameters of bolts after the service is still normal with a standard
deviation of 0.015 cm.
(a) Find, at the 1% level, the critical region for this test, stating your hypotheses clearly.
The mean diameter of the sample of 50 bolts is calculated to be 0.587 cm.
(b) Comment on this observation in light of the critical region.
?
Use the usual formula , except this
Critical region is or time we use the standard deviation of :
Observed value falls within critical region so sufficient evidence at 1%
level that mean diameter has changed from 0.580cm.
?
b
?
Test Your Understanding
Edexcel S3 June 2011 Q7a
?
Exercise 3G
Pearson Stats/Mechanics Year 2
Pages 58-60
Conditional Probabilities
This is not in the textbook. But given the recent Chapter 2 on Conditional
Probabilities and the fact that the type of question below occurred frequently in S1
papers, it seems worthwhile to cover!
𝑃 ( 𝑋>𝜇+15|𝑋>𝜇−15)
The intersection of “above ” and
“above ” is just “above ” because the
0.35
? stronger statements takes precedent.
If I said “my age is above 20, and
above 30”, this is equivalent to saying
𝜇−15 𝜇 𝜇+15 just “my age is above 30”.
Test Your Understanding
Edexcel S1 Jan 2013 Q4a,c
?
One Last Toughie…
IQ is distributed as .
A person is considered to be ‘pretty smart’ if their IQ is at least 130.
Determine the median IQ of ‘pretty smart’ people.