Stats2 Normal Distribution
Stats2 Normal Distribution
Studying Human
Performance
Mode
Mode
Median
Median
Mean Mean
Frequency
Frequency
s s
Scores Scores
15.87% 15.87%
2.28% 2.28%
0.13% 0.13%
-3 -2 -1 0 1 2 3
No. of Standard Deviations
above and below the mean
Calculating the standard deviation?
Sample Variance (S2)is an estimate of the variability of a set of data.
The average of the squared differences from the mean.
(1) Work out the mean for the data
(2) For each data entry, subtract the mean from it and square the result
(3) Add all the squared differences together
(4) Divide by N - 1
Standard deviation (S or SD) is an estimate of the average variability of a set of
data measured in the
same units of measurement as the original data.
Description of data: 18 male students,
Height: mean 182.2cm, s.d. 8.39cm
Standard deviation is the square root of the variance.
(5) Sqrt (S2)
Two things to note in the
calculation
• IN GENERAL, WHY IS THERE SO MUCH SQUARING AND SQUARE
ROOTING GOING ON IN STATISTICAL FORMULAE?
– Usually because we’re trying to work out a difference and sometimes there will
be a negative number we need to lose the sign off
WHY N-1, WHY NOT JUST CALCULATE THE MEAN OF THE DIFFERENCES (ie N)?
– This is called “Bessel’s correction”
– In some books you’ll see just “N” in the ‘Variance’ formula but this is the variance
of the whole population. If you can measure everyone in “the world”, use N.
– We calculate here the “Sample Variance”, and we use N-1. This is a convention to
recognise the fact we’re assuming our sample is representative but just a
sample. We get a slightly bigger estimated variance because we’re dividing by a
smaller number.
Hrm ok.
Assume that we know:
Mean height of French men (not on OK Cupid) = 171.5 cm
SD = 6.9 cm
So, we know that if that is the mean, 50% of the distribution is above, 50% is
below.
Since one SD = 6.9 cm, 171.5+6.9cm = 178.4cm.
15.87% of French men are taller than 178.4cm.
This is useful but what if I want to talk about percentages that aren’t 15.87 or
what about Standard Deviation that isn’t exactly 1, 2 or 3.
Using the normal distribution
Once the mean and SD of a normally distributed variable is known we have a
perfect description of an entire distribution. We can also then calculate the
percentile rank associated with any score.
𝑥 −𝑥 is the score
𝑧= is the mean
𝑆
S is the SD
Normal distribution table relates distances from the
mean in SD
• 1. Its unitsthe
called (z-scores) to percentages
“D. normal distribution table” in your set of
tables…
• 2. First two digits are the rows, next digit is the column.
then
Look up z=1.0
• Z = 1 is -0.34134
z = 2.0
• -0.47725
• 34.1+13.6 = 47.7
z = 1.5, -0.4332 or -43%
86% between +/- 1.5 SD
So…
• We can also read the table backwards. Find the value in the
table and match it to the z-score.
• So, 40% to the left of the mean. We need to find the closest
figure to -.4
• In the table this is .3997. z = 1.28
10% Mean
Practical application
• If we have anthropometric
data (e.g., mean and
standard deviation for
reach) we can calculate the
percentage of people who
can reach a given height or
alternatively how high to put
something that 80% (or 83%
or 95% or 99% etc) of
people can reach it.
Examples
• E.g., if French men’s height mean = 171.5cm, SD = 6.9cm.
• How tall are the bottom 10%?
𝑥 −𝑥
𝑧=
𝑆
𝑥 − 171.5
− 1.28= = 162.67 cm (or less)
6.9
Examples
• E.g., if French men’s height mean = 171.5cm, SD = 6.9cm.
• If a man is 175cm tall, what percentile is he in?
𝑥 −𝑥
𝑧=
𝑆
175 − 171.5
𝑧= = 0.51
6.9
Look-up 0.51 as z-score, 0.1950