ch04 2013
ch04 2013
ch04 2013
INTRODUCTORY STATISTICS
FOR CRIMINAL JUSTICE
Chapter 4:
Variability
Instructor:
Dr. John J. Kerbs, Associate Professor
Joint Ph.D. in Social Work and Sociology
© 2013 - - DO NOT CITE, QUOTE, REPRODUCE, OR DISSEMINATE WITHOUT
WRITTEN PERMISSION FROM THE AUTHOR:
Dr. John J. Kerbs can be emailed for permission at [email protected]
Variability
The goal for variability is to obtain a measure of
how spread out the scores are in a distribution.
Variability indicates if scores are clustered
together or spread out over a large distance
Variability is usually defined in terms of
distance
Example: Distance between two scores or
distance between a score and a mean
(average)
A measure of variability usually accompanies a
measure of central tendency as basic descriptive
statistics for a set of scores.
Central Tendency and Variability
Central tendency describes the central
point of the distribution, and variability
describes how the scores are scattered
around that central point.
Together, central tendency and variability
are the two primary values that are used
to describe a distribution of scores.
Variability
Variability serves both as a descriptive
measure and as an important component
of most inferential statistics.
As a descriptive statistic, variability measures
the degree to which the scores are spread out
or clustered together in a distribution.
In the context of inferential statistics,
variability provides a measure of how
accurately any individual score or sample
represents the entire population.
Variability (Continued)
When the population variability is small, all
of the scores are clustered close together
and any individual score or sample will
necessarily provide a good representation
of the entire set.
On the other hand, when variability is large
and scores are widely spread, it is easy for
one or two extreme scores to give a
distorted picture of the general population.
Variability (Example)
Measuring Variability
Variability can be measured with
1. The range
2. The variance
3. The standard deviation
SS = (9+1+25+1) – [(10) 2 / 4 ]
SS = 36 – 25 = 11
Variance (σ2) = SS/N = 11/4 = 2.75
Standard Deviation (σ) =
Standard Deviation for Sample
As compared to the calculations for populations, the
variance (and thus the standard deviation) for samples is
computed by dividing the sum of the squared deviations
(SS) by n - 1, rather than N.
The value, n - 1, is know as degrees of freedom (df), which
determine the number of scores in the sample that are
independent and free to vary.
DF is used so that the sample variance will provide an
unbiased estimate of the population variance.
Without correcting the denominator in the calculations for
variance with the proper degrees of freedom, the
variance will be under-estimated (i.e., biased).
All of the prior steps for calculating SS still apply in relation
to both the use of definitional and computational formulas,
except that you will replace μ with M and N with n
Standard Deviation for Sample
Sample Variance
a.k.a., Estimated Population Variance
In the context of inferential statistics, the variance that exists in a set
of sample data is often classified as error variance to indicate that
the variance represents unexplained or uncontrolled differences
between scores.
Sample Standard Deviation
a.k.a., Estimated Population Standard Deviation
As “estimates,” there is always the potential for biases.
A sample statistic is biased if the average value of the statistic
either underestimates or overestimates the corresponding
population parameter.
A sample statistic is unbiased if the average value of the statistic
equals the population parameter.
NOTE: The average value of the statistic is obtained from all the
possible samples for a specific sample size, n.
Standard Deviation for Sample
Consider the following sample (n = 7, M = 5)
Score Deviation Squared Deviation
(X) (X - M) (X - M) 2
1 (1-5) = - 4 (-4)2 = 16
6 (6-5) = + 1 (+1)2 = 1
4 (4-5) = - 1 (-1)2 = 1
3 (3-5) = - 2 (-2)2 = 4
8 (8-5) = + 3 (+3)2 = 9
7 (7-5) = + 2 (+2)2 = 4
6 (6-5) = + 1 (+1)2 = 1
SSsample = 16+1+1+4+9+4+1 = 36
Variance (s2) = SS/(n-1) = 36/(7-1) = 36/6 = 6
Standard Deviation (s) =
Standard Deviation for Sample
with Computational Formula
Consider the following sample (n = 7, M = 5)
Score Score
(X) Squared (X ) Σ X = 35
2
1 1 (Σ X) 2 = 1225
6 36
4 16
Σ X 2
= 211
Do NOT confuse the
3 9
SS denominator with the
8 64 denominator of the
7 49 sample variance (s2)
6 36