Chapter 2 Handout Jan 30
Chapter 2 Handout Jan 30
• Sample: The subset of the population from which information is actually obtained
1. Following the 2014 national midterm election, 18% of the governors of the 50
United States were female.
2. The average score for a class of 28 students taking a calculus midterm was 72%.
4. Ty Cobb is one of Major League Baseball’s greatest hitter of all time with a career
batting average of 0.366
5. Only 12 men have walked on the moon. The average time these men spent on
the moon was 43.92 hours.
6. A study of 6076 adults in public rest rooms found that 23% did not wash their
hands before exiting.
7. Interviews of 100 adults 18 years or older, conducted nationwide, found that 44%
could state the minimum age requirement for the office of U.S. president.
• The mean of a variable is computed by adding all values of the variable in the data
set and dividing by the number of observations. If a population has N observations
and a sample has n, then
x1 + x2 + · · · + xN
– Population mean: µ =
N
x1 + x2 + · · · + xn
– Sample mean: x̄ =
n
• The median of a variable is the value that lies in the middle of the data when arranged
in ascending order.
– If n (or N ) is odd, the median is the value exactly in the middle: that is, at the
n+1
2
position.
n
– If n (or N ) is even, the median is the average of the two middle values: 2
and
n
2
+1
• The mode of a variable is the most frequent observation of the variable that occurs in
the data set. If no observation occurs more than once, we say the data have no mode.
(A set of data can have no mode, one mode, or more than one mode.)
• The range of a variable is the di↵erence between the largest and the smallest data
value.
2
• Use the data in Table 1 to answer the questions below.
1. Computer the population mean, µ (i.e., the average of all 10 test scores).
4. Remove Justine’s score from the set of student scores in Table 2 and calculate the
new median.
3
4
• The data in the table below represents the birth weights (in pounds) of 25 randomly
selected babies.
The mean x̄ =
4. The histogram below shows the birth weight data. Use it to determine the shape
of the data (is it skewed left, skewed right, or bell shaped (normal))?
5
• The sample variance is given by
• Example:
• Use technology to calculate the standard deviation of the birth weight data.
6
• Mean, median, standard deviation, and distribution shapes
7
• Quartiles divide data sets into fourths, or four equal parts.
8
• The interquartile range, IQR gives the spread of the middle 50% of the values or
observations.
– IQR = Q3 Q1
• Drawing a Boxplot
9
• Use the data in Table 2 to answer the questions below.
3. Find the upper and lower fences. Are there any outliers?
10
• The results of an experiment in which researchers placed colored boards at random
locations in a field and then counted the number of beetles attracted to the board in
a 48-hour period appear given below.
3. Which color had the smallest spread in the middle 50% of its data?
11
• Shapes of boxplots
• Choosing measures
– Some measures (such as the mean, the standard deviation, and the range) are
a↵ected by outliers.
– When data are quantitative, the distribution is reasonable symmetric, and there
are no outliers, use the mean and standard deviation.
– When data are quantitative, the distribution is skewed, and outliers are present,
use the five-number summary.
– When most frequent observation is desired or data is qualitative, use the mode.
12