Review: - Measures of Central Tendency - Measures of Variation
Review: - Measures of Central Tendency - Measures of Variation
Review: - Measures of Central Tendency - Measures of Variation
2 2
s = Σ ( X - X̄ )
N
The Normal Curve
• The mean and standard deviation, in
conjunction with the normal curve allow for
more sophisticated description of the data
and (as we see later) statistical analysis
• For example, a school is not that
interested in the raw GRE score, it is
interested in how you score relative to
others.
• Even if the school knows the average
(mean) GRE score, your raw score still
doesn’t tell them much, since in a perfectly
normal distribution, 50% of people will
score higher than the mean.
• This is where the standard deviation is so
helpful. It helps interpret raw scores and
understand the likelihood of a score.
• So if I told you if I scored 710 on the
quantitative section and the mean score is
591. Is that good?
• It’s above average, but who cares.
• What if I tell you the standard deviation is
148?
• What does that mean?
• What if I said the standard deviation is 5?
• Calculating z-scores
Converting raw scores to z scores
What is a z score? What does it represent
Z = (x-µ) / σ
pi = ≈3.14159265
e = ≈2.71
Probability and the Normal Curve
• Why t?
The t Distribution
• Empirical testing and models shows that a
standard deviation from a sample
underestimates the standard deviation of
the true population
• This is why we use N-1 not N when
calculating the standard deviation and the
standard error
• So in reality, we are calculating t-scores,
not z-scores since we are not using the
true sd.
• So when we are using a sample and
calculating a 95% confidence interval (CI)
we need to multiply the standard error by t,
not 1.96
• How do we know what t is?
• Table in back of book
• Df = N - 1
Confidence Intervals for
Proportions
Calculate the standard error of the
proportion:
P 1 P
Sp =
N
95% conf. Interval =
P +/- (1.96)Sp
Example
• National sample of 531 Democrats and
Democratic-leaning independents, aged
18 and older, conducted Sept. 14-16, 2007
• Clinton 47%; Obama 25%; Edwards 11%
• P(1-P) = .47(1-.47) = .47(.53) = .2491
• Divide by N = .2491/531 = .000469
• Take square root = .0217
• 95% CI = .47 +/- 1.96 (.0217)
• .47 +/- .04116 or 0.429 to .511
Midterm
• Key terms from Schutt chapters 1-5
• Statistical Calculations by hand
– Mean, Median, Mode
– Variance/Standard Deviation
– Z-scores
– Standard errors and confidence intervals
using z or t