Chapter 3
Weighted Mean
The average number of days you worked per week is 4.4 days.
Copyright © 2020, 2015, 2013 Pearson Education, Inc.
3-9
Advantages and Disadvantages of Using the Mean to Summarize Data
Advantages:
• Simple to calculate
• Summarizes the data with a single value
Disadvantages:
• With only a summary value you lose information about
the original data.
• Sample 1 with n = 3: 999, 1000, 1001; mean = 1000
• Sample 2 with n = 3: 0, 1000, 2000; mean = 1000
• Just knowing the mean does not help you know what the
underlying data looks like.
• The value of the mean is sensitive to outliers (values
that are much higher or lower than most of the data).
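A minimal Python sketch of the two disadvantages listed above. The two samples are the ones from the slide; the extra value 10000 is a hypothetical outlier added only for illustration.

```python
from statistics import mean

# The two samples from the slide: same mean, very different spread.
sample_1 = [999, 1000, 1001]
sample_2 = [0, 1000, 2000]

mean_1 = mean(sample_1)
mean_2 = mean(sample_2)

# The gap between the largest and smallest value shows what the mean hides.
spread_1 = max(sample_1) - min(sample_1)
spread_2 = max(sample_2) - min(sample_2)

# Hypothetical outlier (not from the slide) to show how far it pulls the mean.
sample_with_outlier = sample_1 + [10000]
mean_with_outlier = mean(sample_with_outlier)

print(mean_1, mean_2)          # both samples share the same mean
print(spread_1, spread_2)      # but the spreads differ widely
print(mean_with_outlier)       # one extreme value shifts the mean
```

Both samples report the same mean even though one spans 2 units and the other spans 2000, and a single extreme value moves the mean well away from the rest of the data.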
Number of children    Frequency
0                     4
1                     5
2                     8
3                     4
4                     2
5                     1
The value that appears most often is 2 (occurs 8 times), so the mode = 2 children.
[Figure: frequency chart, horizontal axis from 0 to 5, with Mode = 2 marked]
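The mode can be read straight off the frequency table. The sketch below assumes the table is stored as a Python dictionary (a choice made here for illustration) and picks the value with the highest frequency.

```python
# Frequency table from the slide: number of children -> frequency.
frequency = {0: 4, 1: 5, 2: 8, 3: 4, 4: 2, 5: 1}

# The mode is the value with the largest frequency.
# Note: with ties, max() returns only the first such value.
mode = max(frequency, key=frequency.get)

print(f"mode = {mode} children (occurs {frequency[mode]} times)")
```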
Distribution Shape
• Symmetric
• Skewed: Left-Skewed or Right-Skewed
Choose the “Descriptive Statistics” option…
Measures of Variability
[Figure: number line from 0 to 14]
Range = Maximum - Minimum = 13 - 1 = 12
The Range
Advantages:
• Easy to calculate and understand
Disadvantages:
• Only based on two numbers in the data set
(Ignores the way in which data are distributed)
• Sensitive to outliers
Example:
Sample data ($x_i$): 4, 6, 8, 9, 11, 12, 12, 18
$n = 8$, Mean $= \bar{x} = 10$
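Short-cut formula for the sample variance:
$$s^2 = \frac{\sum x_i^2 - \frac{\left(\sum x_i\right)^2}{n}}{n - 1}$$
Short-cut formula for the population variance:
$$\sigma^2 = \frac{\sum x_i^2 - \frac{\left(\sum x_i\right)^2}{N}}{N}$$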
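As a check, the sketch below computes the sample variance of the example data (4, 6, 8, 9, 11, 12, 12, 18) two ways: from the squared deviations about the mean, and from the usual short-cut form, (Σx² − (Σx)²/n) / (n − 1). The variable names are illustrative.

```python
# Sample data from the example slide.
data = [4, 6, 8, 9, 11, 12, 12, 18]
n = len(data)
x_bar = sum(data) / n

# Definitional form: sum of squared deviations divided by n - 1.
var_definition = sum((x - x_bar) ** 2 for x in data) / (n - 1)

# Short-cut form: needs only the sum of x and the sum of x squared.
sum_x = sum(data)
sum_x_sq = sum(x * x for x in data)
var_shortcut = (sum_x_sq - sum_x ** 2 / n) / (n - 1)

print(x_bar)
print(var_definition, var_shortcut)   # the two forms should agree
print(var_shortcut ** 0.5)            # sample standard deviation
```

The short-cut form avoids computing each deviation, which is why it is convenient for hand calculation; for the population variance, the same sketch would divide by N instead of n − 1.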
Coefficient of Variation: $CV = \frac{s}{\bar{x}}(100)$
Nike:
Google:
Although Google had a larger standard deviation, it had the more consistent price (the smaller coefficient of variation).
Table 3.14, based on https://www.nasdaq.com/
The z-Score
[Figure: axis marked at -3, -2, -1, +1, +2, and +3 standard deviations from the mean]
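A z-score expresses how many standard deviations a value lies from the mean, z = (x − mean) / standard deviation. The sketch below is an illustration only: it reuses the earlier sample data (4, 6, 8, 9, 11, 12, 12, 18), and the helper name `z_score` is made up for this example.

```python
from statistics import mean, stdev

# Sample data reused from the variance example (an assumption for illustration).
data = [4, 6, 8, 9, 11, 12, 12, 18]
x_bar = mean(data)
s = stdev(data)  # sample standard deviation (divides by n - 1)


def z_score(x, center, spread):
    """Number of standard deviations that x lies above (+) or below (-) the center."""
    return (x - center) / spread


z_values = [round(z_score(x, x_bar, s), 2) for x in data]
print(z_values)
```

Values below the mean get negative z-scores, values above it get positive ones, and a value equal to the mean has a z-score of 0.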
Chebyshev’s Theorem
Chebyshev’s Theorem states that for any number z greater
than 1, the percent of the values that fall within z standard
deviations above and below the mean will be at least
$$\left(1 - \frac{1}{z^2}\right)(100)\%$$
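The Chebyshev bound, (1 − 1/z²) × 100%, is easy to tabulate. A small sketch follows; the function name `chebyshev_min_percent` is chosen here for illustration.

```python
def chebyshev_min_percent(z: float) -> float:
    """Minimum percent of values within z standard deviations of the mean."""
    if z <= 1:
        raise ValueError("Chebyshev's Theorem applies only for z greater than 1")
    return (1 - 1 / z ** 2) * 100


for z in (1.5, 2, 3):
    print(f"z = {z}: at least {chebyshev_min_percent(z):.1f}% of the values")
```

Because the theorem holds for any distribution, these are lower bounds: at least 75% of values lie within 2 standard deviations, whatever the shape of the data.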
Pages viewed      Frequency
1 to under 5      6
5 to under 9      12
9 to under 13     10
13 to under 17    4
The merchant would like to calculate the average number of
viewed pages.
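For grouped data like this, the mean is usually approximated by treating every observation in a class as if it sat at the class midpoint. The sketch below applies that approach to the four classes in the table; the midpoint method is an assumption about how the slide proceeds.

```python
# Each entry: ((lower bound, upper bound), frequency), taken from the table.
classes = [((1, 5), 6), ((5, 9), 12), ((9, 13), 10), ((13, 17), 4)]

total_frequency = sum(freq for _, freq in classes)

# Weight each class midpoint by its frequency.
weighted_sum = sum(((low + high) / 2) * freq for (low, high), freq in classes)

grouped_mean = weighted_sum / total_frequency
print(f"n = {total_frequency}, approximate mean = {grouped_mean} pages")
```

The result is an approximation, since the individual page counts inside each class are not known.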
Percentiles and Quartiles
=QUARTILE.EXC(array, quart)
where: array = the data range of interest
quart = 1, 2, or 3 (for the first, second, or third quartile)
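Outside Excel, a comparable calculation can be sketched in Python with `statistics.quantiles` and `method="exclusive"`, which places the quartiles at positions based on n + 1; that this matches QUARTILE.EXC exactly is assumed here rather than confirmed by the slides. The data are the eight sample values from the earlier variance example, reused only for illustration.

```python
from statistics import quantiles

# Sample data reused from the variance example (not the n = 15 data set).
data = [4, 6, 8, 9, 11, 12, 12, 18]

# n=4 splits the data into quartiles and returns the three cut points.
q1, q2, q3 = quantiles(data, n=4, method="exclusive")

print(q1, q2, q3)
```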
n = 15
Q1 = 2.37
Similarly, we find
Q2 = 3.27
Q3 = 4.26
Min     Q1      Q2      Q3      Max     Outlier
0.59    2.37    3.27    4.26    5.97    11.31
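One common way to flag an outlier such as 11.31 is the 1.5 × IQR rule: any value more than 1.5 interquartile ranges below Q1 or above Q3 is treated as an outlier. The slide does not state which rule it uses, so the sketch below is an assumption; the quartiles and values come from the summary above.

```python
# Quartiles from the slide.
q1, q3 = 2.37, 4.26

# Interquartile range and the fences of the 1.5 x IQR rule (assumed rule).
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Values called out on the slide: minimum, maximum, and the suspected outlier.
values = [0.59, 5.97, 11.31]
outliers = [v for v in values if v < lower_fence or v > upper_fence]

print(f"fences: [{lower_fence:.3f}, {upper_fence:.3f}]")
print("outliers:", outliers)
```

Under this rule only 11.31 falls outside the fences, which is consistent with the slide marking it as the outlier.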
Sample Covariance
Sample Correlation Coefficient
Scatterplot
Week    Number of sales representatives (x)    Number of cars sold (y)
1       2                                      4
2       5                                      10
3       3                                      7
4       4                                      7
5       3                                      6
6       4                                      8
Number of Sales Reps        Number of Cars Sold
x_i    x̄      (x_i - x̄)    y_i    ȳ    (y_i - ȳ)    (x_i - x̄)(y_i - ȳ)
2      3.5    -1.5         4      7    -3           4.5
5      3.5    1.5          10     7    3            4.5
3      3.5    -0.5         7      7    0            0
4      3.5    0.5          7      7    0            0
3      3.5    -0.5         6      7    -1           0.5
4      3.5    0.5          8      7    1            0.5
Σ (x_i - x̄)(y_i - ȳ) = 10
Covariance Calculations
Completing the calculation:
$$s_{xy} = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{n - 1} = \frac{10}{6 - 1} = 2$$
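The covariance table can be reproduced in a few lines of Python. This sketch uses the six weeks of data from the slides and the standard sample covariance, which divides the sum of cross-products by n − 1; the variable names are illustrative.

```python
# Weekly data from the slides.
x = [2, 5, 3, 4, 3, 4]    # number of sales representatives
y = [4, 10, 7, 7, 6, 8]   # number of cars sold

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sum of the products of the paired deviations (last column of the table).
cross_products = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

# Sample covariance: divide by n - 1.
s_xy = cross_products / (n - 1)

print(x_bar, y_bar)
print(cross_products, s_xy)
```

A positive covariance indicates that weeks with more sales representatives tended to have more cars sold.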