Chapter 02
Chapter 02
and Statistics
Twelfth Edition
Chapter 2
Describing Data
with Numerical Measures
Some graphic screen captures from Seeing Statistics ® Copyright ©2006 Brooks/Cole
Some images © 2001-(current year) www.arttoday.com A division of Thomson Learning, Inc.
Describing Data with Numerical
Measures
• Graphical methods may not always be
sufficient for describing data.
• Numerical measures can be created for
both populations and samples.
– A parameter is a numerical descriptive
measure calculated for a population.
population
– A statistic is a numerical descriptive
measure calculated for a sample.
sample
xi 2 9 11 5 6 33
x 6.6
n 5 5
n 25 8/25
Relative frequency
• Median? 6/25
m2 4/25
mode 2
0
0 1 2 3 4 5
Quarts
MY APPLET
5 -4 16 ( x x ) 2
s2 i
12 3 9 n 1
6 -3 9
60
8 -1 1 15
4
14 5 25
Sum 45 0 60
s s 2 15 3.87
5 25 xi i
s2 n
12 144
n 1
6 36
452
8 64 465
14 196 5 15
Sum 45 465 4
s s 2 15 3.87
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Some Notes MY APPLET
x 44.9
12/50
Relative frequency
10/50
8/50
s 10.73 6/50
4/50
2/50
0
25 33 41 49 57 65 73
•Yes. Tchebysheff’s
•Do the actual proportions in the three
Theorem must be
intervals agree with those given by true for any data
Tchebysheff’s Theorem? set.
•Do they agree with the Empirical •No. Not very well.
Rule?
•The data distribution is not very
•Why or why not? mound-shaped, but skewed right.
Actual s = 10.73
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Measures of Relative Standing
• Where does one particular measurement stand
in relation to the other measurements in the data
set?
• How many standard deviations away from the
mean does the measurement lie? This is
measured by the z-score.
Suppose s = 2. s
xx xx 4
score
zz --score s s
ss
x 5 x9
x = 9 lies z =2 std dev from the mean.
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
z-Scores
• From Tchebysheff’s Theorem and the Empirical Rule
– At least 3/4 and more likely 95% of measurements lie within
2 standard deviations of the mean.
– At least 8/9 and more likely 99.7% of measurements lie
within 3 standard deviations of the mean.
• z-scores between –2 and 2 are not unusual. z-scores should not
be more than 3 in absolute value. z-scores larger than 3 in
absolute value would indicate a possible outlier.
-3 -2 -1 0 1 2 3
Somewhat unusual Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Measures of Relative Standing
• How many measurements lie below
the measurement of interest? This is
measured by the pth percentile.
p% (100-p) %
x
p-th percentile
10% 90%
$319
$319isisthe
the10
th
10th
$319 percentile.
percentile.
Q1 m Q3
*
Q1 m Q3
m
Q1 Q3
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Example
IQR = 340-292.5 = 47.5
Lower fence = 292.5-1.5(47.5) = 221.25
Upper fence = 340 + 1.5(47.5) = 411.25
MY APPLET
Outlier: x = 520
*
m
Q1 Q3
Copyright ©2006 Brooks/Cole
A division of Thomson Learning, Inc.
Interpreting Box Plots
Median line in center of box and whiskers
of equal length—symmetric distribution
Median line left of center and long right
whisker—skewed right
Median line right of center and long left
whisker—skewed left
22((xxi i))22
xx
( x x ) 22 ii
ss22 ( xi i x ) nn
nn11 nn11
3. Standard deviation
Population standard deviation :
Population standard deviation :
22