Lecture 4
Lecture 4
X= 25 n=5 X =5
X X X
6 +1.00 This is an example of data
4 -1.00 with low variability
6 +1.00
5 0.00
4 -1.00
X= 25 n=5 X =5
X X X
8 +3.00 This is an example of data
1 -4.00 with higher variability
9 +4.00
5 0.00
2 -3.00
X= 25 n=5 X =5
Note:
• Let’s say we wanted to figure out the average
deviation from the mean. Normally, we would want
to sum all deviations from the mean and then divide
by n, i.e.,
X X
n
8 +3.00 9.00
1 -4.00 16.00
9 +4.00 16.00
5 0.00 0.00
2 -3.00 9.00
X = 25 X X = 0.00
2 = 50.00
XX
X
2
2
n
Population Standard Deviation
Square root of the variance 2
X
2
n
Sample Variance
• The sum of squared deviations from the mean
divided by the number of degrees of freedom (an
estimate of the population variance, n-1)
s
2 X x 2
n 1
Sample Standard Deviation
• Square root of the variance s2
s X x 2
n 1
Why use Standard Deviation and not
Variance!??!
• Normally, you will only calculate variance in
order to calculate standard deviation, as standard
deviation is what we typically want.
Mean
Median
Mode
Normal Distribution
• We study the normal distribution because many
naturally occurring events yield a distribution
that approximates the normal distribution.
Properties of Area Under the Normal
Distribution
• One of the properties of the Normal Distribution
is the fixed area under the curve.
• If we split the distribution in half, 50% of the
scores of the sample lie to the left of the mean (or
median, or mode), and 50% of the scores lie to
the right of the mean (or median, or mode).
• The mean, median, and mode always cut the
Normal Distribution in half, and are equal since
the Normal Distribution is unimodal and
symmetrical.
50% of 50% of
scores scores
• Z-score formula:
X
z
• The mean and standard deviation are
always notated in Greek letters.
z = (75-89)/5.2 z
z = (-14)/5.2
z = -2.69
• This means that a mark of 75% is actually 2.69
standard deviations BELOW the mean.
• The student would have done poorly on this test,
as compared to the rest of the class.
• z = 0 represents the mean score (which would be
89 in this example).
• z < 0 represents a score less than the mean (which
would be less than 89).
• z > 0 represents a score greater than the mean
(which would be greater than 89).
• A z-score expresses the position of the raw score
above or below the mean in standard deviation
sized units.
• E.g.,
z = +1.50 means that the raw score is 1 and one-half
standard deviations above the mean.
z = -2.00 means that the raw score is 2 standard
deviations below the mean.
Z-score Example
• If you write two exams, in Math and English, and
get the following scores:
Math 70% (class = 55, = 10)
English 60% (class = 50, = 5)
• Which test mark represents the better performance
(relative to the class)?
• Math mark:
z = (70-55)/10
z = +1.50
• English mark:
z = (60-50)/5 X
z = +2.00 z
Z-score Example Illustration
Mean
Z=0.00 Z=1.50 Z=2.00
The Answer
• Because: Z = +2.00 is greater than Z = +1.50, the
English class mark of 60% reflects a better
performance relative to that class than does the
Math class mark of 70%.
Z-score: Solving for X
• The z-score formula can be rearranged to solve
for X:
X X (z)( )
z
• This formula is used when you know the z-score
of a data point, and want to solve for the raw
score.
Example
• E.g., if a class midterm exam has = 65 and = 5,
what exam mark has a z-score value of 1.25?
X = (1.25)(5) + 65
X (z)( ) = 6.25 + 65
= 71.25