Describing Distributions With Numbers
Describing Distributions With Numbers
with Numbers
Section 1.2
What Does That Mean?
We are about to learn specific ways to calculate
center and spread of a distribution. You can
calculate these numerical values for any
quantitative variable. But to interpret these
measures of center and spread, and to choose
among the several methods you will learn, you
must think about the shape of the distribution
and the meaning of the data. The numbers, like
the graphs, are aids to understanding, not “the
answer” in themselves.
Measures of Center
Mean:
of a sample: of a population:
x
x i
x i
n n
Median:
the value that divides the data into
equal halves (*it may or may not
be a value in the data set)
The median The mean is
divides the the balance
distribution into point of the
two equal areas. distribution
Practice:
1. Find the mean and median for
each list and contrast their
behavior:
1. 1, 2, 6
2. 1, 2, 9
3. 1, 2, 297
Measures of Spread
Range: maximum - minimum
1
of a sample: sx
n 1
( xi x ) 2
1
of a population: x
n
( xi x ) 2
More About Standard Deviation
The differences of each value from
the mean are deviations: xx
n 1
( xi x ) 2
a. 1, 1, 1, 1 i. 0
b. 1, 2, 2 ii. 0.058
c. 1, 2, 3, 4, 5 iii. 0.577
d. 10, 20, 20 iv. 1.581
e. 0.1, 0.2, 0.2 v. 3.162
f. 0, 2, 4, 6, 8 vi. 3.606
g. 0, 0, 0, 0, 5, 6, 6, 8, 8 vii. 5.774
The Five-Number Summary
The Five-Number Summary includes:
minimum, Q1, median, Q3, maximum
Side-by-Side Boxplots:
maximum
Q1
median
Q3
minimum
Calculating Outliers
An observation is considered an Outlier if it falls
outside the interval:
xnew = a + bx
So What Does That Mean?
Adding the constant a shifts all
of the values of x left or right by
the same amount (the data is
recentered.)