Elementary Statistics
Elementary Statistics
Introduction to
Statistics
§ 1.1
An Overview of
Statistics
Data and Statistics
Data consists of information coming from
observations, counts, measurements, or responses.
Responses of all
students at Union
College (population)
Responses of
students in
survey (sample)
Parameter Population
Statistic Sample
Statistic
s
Descriptive Inferential
statistics statistics
Involves the Involves using
organization, a sample to
summarization, draw
and display of conclusions
data. about a
Larson & Farber, Elementary Statistics: Picturing the World, 3e 7
Descriptive and Inferential
Statistics
Example:
In a recent study, volunteers who had less than 6 hours of sleep were four times more likely to answer incorrectly on a science test than were participants
who had at least 8 hours of sleep. Decide which part is the descriptive statistic and what conclusion might be drawn using inferential statistics.
Qualitative Quantitativ
Data e Data
Consists of Consists of
attributes, numerical
labels, or measurements
nonnumerical or counts.
entries.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 10
Qualitative and Quantitative
Data
Example:
The grade point averages of five students are listed in the table. Which data are qualitative data and which are quantitative data?
Studen GPA
t
Sally 3.22
Bob 3.98
Cindy 2.75
Mark 2.24
Kathy 3.84
Qualitative Quantitative
data data
Larson & Farber, Elementary Statistics: Picturing the World, 3e 11
Levels of Measurement
The level of measurement determines which statistical
calculations are meaningful. The four levels of measurement
are: nominal, ordinal, interval, and ratio.
Nominal
Levels Lowest
Ordinal to
of
Measurement Interval highes
t
Ratio
Levels
of Ordinal
Measurement Arranged in order, but
differences between
data entries are not
meaningful.
Class Numbers on Top 50
standings: the back of songs
freshman, each player’s played on
sophomore, shirt the radio
junior, senior
Larson & Farber, Elementary Statistics: Picturing the World, 3e 14
Interval Level of
Measurement
Data at the interval level of measurement are quantitative.
A zero entry simply represents a position on a scale; the entry
is not an inherent zero.
Levels
of
Measurement Interval
Arranged in order, the
differences between data
entries can be calculated.
Temperature Years on a Atlanta
s timeline Braves
World
Series
victories
Larson & Farber, Elementary Statistics: Picturing the World, 3e 15
Ratio Level of Measurement
Data at the ratio level of measurement are similar
to the interval level, but a zero entry is meaningful.
All
members
in each
selected
group are
used.
Descriptive
Statistics
§ 2.1
Frequency
Distributions
and Their
Graphs
Frequency Distributions
A frequency distribution is a table that shows
classes or intervals of data with a count of the
number in each class. The frequency f of a class is
the number of data points in the class.
Class Frequency,
f
Upper
Lower 1–4 4
Class 5–8 5 Frequencies
Limits 9 – 12 3
13 – 16 4
17 – 20 2
Class Frequency,
f
5–1=4 1–4 4
9–5=4 5–8 5
13 – 9 = 9 – 12 3
17 – 13 =4 13 – 16 4
4 17 – 20
The class width 2is 4.
Ages of Students
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 31
Constructing a Frequency
Distribution
Example continued:
36 = 7.2 Round up to 8.
Class width
= 5
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 32
Constructing a Frequency
Distribution
Example continued:
3. The minimum data entry of 18 may be used for the
lower limit of the first class. To find the lower class
limits of the remaining classes, add the width (8) to
each lower limit.
The lower class limits are 18, 26, 34, 42, and
50. upper class limits are 25, 33, 41, 49, and
The
57.
4. Make a tally mark for each data entry in the
appropriate class.
Midpoint 1 4 5 2.5
2 2
=
Larson & Farber, Elementary Statistics: Picturing the World, 3e 35
Midpoint
Example:
Find the midpoints for the “Ages of Students” frequency
distribution.
Ages of Students
Class Frequency, f Midpoint
18 + 25 = 43
18 – 25 13 21.5
43 2 =
26 – 33 8 29.5 21.5
34 – 41 4 37.5
42 – 49 3 45.5
50 – 57 2 53.5
f 30
Larson & Farber, Elementary Statistics: Picturing the World, 3e 36
Relative Frequency
The relative frequency of a class is the portion or
percentage of the data that falls in that class. To find
the relative frequency of a class, divide the
frequency f by the sample size n.
Class frequency f
Relative frequency =
Sample size n
Relative
Frequency
Class Frequenc
,f
y
0.222
1–4 f 418
f 4
0.222
Relative frequency
n 18
Larson & Farber, Elementary Statistics: Picturing the World, 3e 37
Relative Frequency
Example:
Find the relative frequencies for the “Ages of Students”
frequency distribution.
Relative Portion of
Class Frequency, f Frequency students
18 – 25 13 0.433 f 13
26 – 33 8 0.267 n 30
34 – 41 4 0.133 0.433
42 – 49 3 0.1
50 – 57 2 0.067
f
f 30 1
n
Larson & Farber, Elementary Statistics: Picturing the World, 3e 38
Cumulative Frequency
The cumulative frequency of a class is the sum of the
frequency for that class and all the previous classes.
Ages of Students
Cumulative
Class Frequency, f Frequency
18 – 25 13 13
26 – 33 +8 21
34 – 41 +4 25
42 – 49 +3 28
Total number
50 – 57 +2 30 of students
f 30
Ages of Students
Class
Class Frequency, f Boundaries
The distance from 18 – 25 13 17.5 25.5
the upper limit of
the first class to 26 – 33 8 25.5 33.5
the lower limit of 34 – 41 4 33.5 41.5
the second class is
1. 42 – 49 3 41.5 49.5
Half this 50 – 57 2 49.5 57.5
distance is
0.5.
f 30
14 13 Ages of Students
12
10
8
8
f 6
4
4 3
2 2
0
17.5 25.5 33.5 41.5 49.5 57.5
Broken axis
Age (in years)
Larson & Farber, Elementary Statistics: Picturing the World, 3e 42
Frequency Polygon
A frequency polygon is a line graph that emphasizes the continuous change in frequencies.
14
Ages of Students
12
10
8 Line is
extended to the
f 6 x-axis.
4
2
0
13.5 21.5 29.5 37.5 45.5 53.5 61.5
Broken axis
Age (in years) Midpoints
0.5
0.433
Relative frequency
0.3
0.267
0.2
0.133
0.1
0.1 0.067
0
17.5 25.5 33.5 41.5 49.5 57.5
Age (in years)
Larson & Farber, Elementary Statistics: Picturing the World, 3e 44
Cumulative Frequency
Graph
A cumulative frequency graph or ogive, is a line
graph that displays the cumulative frequency of
each class at its upper class boundary.
30 Ages of Students
(portion of students)
24
Cumulative
frequency
18
The graph
ends at the
12 upper
boundary of
6 the last class.
0
17.5 25.5 33.5 41.5 49.5 57.5
Age (in years)
Larson & Farber, Elementary Statistics: Picturing the World, 3e 45
§ 2.2
More Graphs
and Displays
Stem-and-Leaf Plot
In a stem-and-leaf plot, each number is separated
into a stem (usually the entry’s leftmost digits) and a
leaf (usually the rightmost digit). This is an example
of exploratory data analysis.
Example:
The following data represents the ages of 30
students in a statistics class. Display the data in a
stem-and-leaf plot.Ages of Students
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21 Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 47
Stem-and-Leaf Plot
Ages of Students
Key: 1|8 =
1 888999 18
2 0 0 1 1 1 2 4 7 9 9 Most of the values lie
3 002234789 between 20 and 39.
4 469
5 14
This graph allows us to
see the shape of the data
as well as the actual
values.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 48
Stem-and-Leaf Plot
Example:
Construct a stem-and-leaf plot that has two lines for
each stem.
Ages of Students
1 Key: 1|8 =
1 888999 18
2 0011124
2 799
3 002234
3 789 From this graph, we can
4 4 conclude that more than
4 69 50% of the data lie
5 14 between 20 and 34.
5
Larson & Farber, Elementary Statistics: Picturing the World, 3e 49
Dot Plot
In a dot plot, each data entry is plotted, using a
point, above a horizontal axis.
Example:
Use a dot plot to display the ages of the 30 students
in the statistics class.
Ages of Students
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 50
Dot Plot
Ages of Students
1 1 2 2 2 3 3 3 3 4 4 4 5 5 5
5 8 1 4 7 0 3 6 9 2 5 8 1 4 7
Relative
Frequen
Type Frequen
cy
cy
Motor Vehicle 43,500 0.578
Falls 12,200 0.162
Poison 6,400 0.085
Drowning 4,600 0.061
Fire 4,200 0.056
Ingestion of
2,900 0.039
Food/Object n = 75,200
Firearms 1,400 0.019 Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 53
Pie Chart
Next, find the central angle. To find the central angle,
multiply the relative frequency by 360°.
Relative
Frequen
Type Frequen Angle
cy
cy
Motor Vehicle 43,500 0.578 208.2°
Falls 12,200 0.162 58.4°
Poison 6,400 0.085 30.6°
Drowning 4,600 0.061 22.0°
Fire 4,200 0.056 20.1°
Ingestion of
2,900 0.039 13.9°
Food/Object
Continued.
Firearms 1,400 0.019
Larson & Farber, Elementary Statistics: Picturing the World, 3e
6.7°
54
Pie Chart
Ingestion Firearms
3.9% 1.9%
Fire
5.6%
Drowning
6.1%
Poison
8.5% Motor
vehicles
Falls 57.8%
16.2%
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 58
Scatter Plot
Absences Grade
x y
Final 100
grade 90 8 78
2 92
(y) 80
5 90
70 12 58
60 15 43
50 9 74
6 81
40
0 2 4 6 8 10 12 14 16
Absences (x)
200
Minutes
150
100
50
0
Jan Feb Mar Apr May June
Month
“mu” “x-
bar”
53 32 61 57 39 44 57
Calculate the population mean.
Example:
Calculate the median age of the seven employees.
53 32 61 57 39 44 57
To find the median, sort the data.
32 39 44 53 57 57 61
The median age of the employees is 53 years.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 65
Mode
The mode of a data set is the data entry that occurs
with the greatest frequency. If no entry is repeated,
the data set has no mode. If two entries occur with
the same greatest frequency, each entry is a mode
and the data set is called bimodal.
Example:
Find the mode of the ages of the seven employees.
53 32 61 57 39 44 57
The mode is 57 because it occurs the most times.
Recalculate the mean, the median, and the mode. Which measure
of central tendency was affected when this new age was added?
Example:
Grades in a statistics class are weighted as follows:
Tests are worth 50% of the grade, homework is worth 30% of the
grade and the final is worth 20% of the grade. A student receives a
total of 80 points on tests, 100 points on homework, and 85 points
on his final. What is his current grade?
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 68
Weighted Mean
Example:
The following frequency distribution represents the
ages of 30 students in a statistics class. Find the
mean of the frequency distribution.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 70
Mean of a Frequency
Distribution
Class
midpoint
Class x f (x · f )
18 – 25 21.5 13 279.5
26 – 33 29.5 8 236.0
34 – 41 37.5 4 150.0
42 – 49 45.5 3 136.5
50 – 57 53.5 2 107.0
n= Σ=
30 909.0
(x f ) 909 30.3
x
n 30
The mean age of the students is 30.3 years.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 71
Shapes of Distributions
A frequency distribution is symmetric when a
vertical line can be drawn through the middle of a
graph of the distribution and the resulting halves are
approximately the mirror images.
A frequency distribution is uniform (or
rectangular) when all entries, or classes, in the
distribution have equal frequencies. A uniform
distribution is also symmetric.
A frequency distribution is skewed if the “tail” of the
graph elongates more to one side than to the other.
A distribution is skewed left (negatively skewed) if
its tail extends to the left. A distribution is skewed
right (positively skewed) if its tail extends to the
Larson & Farber, Elementary Statistics: Picturing the World, 3e 72
Symmetric Distribution
10 Annual Incomes
15,000
20,000
22,000
5
24,000 Income
4
25,000
25,000 f 3
2
26,000
28,000 1
30,000 0
$25000
35,000
mean = median = mode
= $25,000
Larson & Farber, Elementary Statistics: Picturing the World, 3e 73
Skewed Left Distribution
10 Annual Incomes
0
20,000
22,000
24,000 5
25,000 4
Income
25,000
f 3
26,000 2
28,000 1
30,000 0
35,000 $25000
mean = $23,500
median = mode = $25,000 Mean < Median
Larson & Farber, Elementary Statistics: Picturing the World, 3e 74
Skewed Right Distribution
10 Annual Incomes
15,000
20,000
22,000
5
24,000 Income
25,000 4
25,000 f 3
26,000 2
28,000 1
30,000 0
$25000
1,000,000
mean = $121,500
median = mode = $25,000 Mean > Median
Larson & Farber, Elementary Statistics: Picturing the World, 3e 75
Summary of Shapes of
Distributions
Symmetric Uniform
Mean = Median
Example:
The following data are the closing prices for a certain stock
on ten successive Fridays. Find the range.
Stock 56 5 57 58 61 63 63 67 6 67
6 7
The range is 67 – 56 = 11.
Example:
Stock Deviation
The following data are the closing x x–μ
prices for a certain stock on five 56 56 – 61 = –5
successive Fridays. Find the 58 58 – 61 = –3
deviation of each price. 61 61 – 61 = 0
63 63 – 61 = 2
The mean stock price is 67 67 – 61 = 6
μ = 305/5 = 61.
Σx = 305 Σ(x – μ) = 0
14 14
12 x=4 12 x =4
Frequency
Frequency
10 s = 1.18 10 s=0
8 8
6 6
4 4
2 2
0 0
2 4 6 2 4 6
Data value Data value
Larson & Farber, Elementary Statistics: Picturing the World, 3e 84
Empirical Rule (68-95-
99.7%)
Empirical Rule
For data with a (symmetric) bell-shaped distribution,
the standard deviation has the following
characteristics.
1. About 68% of the data lie within one standard
deviation of the mean.
2. About 95% of the data lie within two standard
deviations of the mean.
3. About 99.7% of the data lie within three standard
deviation of the mean.
34% 34%
2.35% 2.35%
13.5% 13.5%
–4 –3 –2 –1 0 1 2 3 4
68%
1 1 8
For k = 3: In any data set, at least 1 32 1 9 9 , or 88.9%, of the
(x x )2f
Sample standard deviation = s n 1
where n = Σf is the number of entries in the data set, and x is the
data value or the midpoint of an interval.
Example:
The following frequency distribution represents the
ages of 30 students in a statistics class. The mean
age of the students is 30.3 years. Find the standard
deviation of the frequency distribution.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 91
Standard Deviation for
Grouped Data
The mean age of the students is 30.3
years.
Class x f x – x (x – x )2 (x – x )2f
18 – 25 21.5 13 – 8.8 77.44 1006.72
26 – 33 29.5 8 – 0.8 0.64 5.12
34 – 41 37.5 4 7.2 51.84 207.36
42 – 49 45.5 3 15.2 231.04 693.12
50 – 57 53.5 2 23.2 538.24 1076.48
n = 30 2988.80
Median
Q1 Q2 Q3
0 25 50 75 100
Example:
The quartiles for 15 quiz scores are listed below. Find the
interquartile range.
Q1 = 37 Q2 = 43 Q3 = 48
28 37 43 48 55
28 32 36 40 44 48 52 56
Larson & Farber, Elementary Statistics: Picturing the World, 3e 98
Percentiles and Deciles
Fractiles are numbers that partition, or divide, an
ordered data set.
Example:
The test scores for all statistics finals at Union
College have a mean of 78 and standard deviation
of 7. Find the z-score for
a.) a test score of 85,
b.) a test score of 70,
c.) a test score of 78.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 100
Standard Scores
Example continued:
a.) μ = 78, σ = 7, x = 85
x 85 78
z 1.0 This score is 1 standard
7 deviation higher than the mean.
b.) μ = 78, σ = 7, x = 70
x 70 78
z
7 1.14 deviations lower than the mean.
This score is 1.14 standard
c.) μ = 78, σ = 7, x = 78
x 78 78
z 0 This score is the same as the
7 mean.
Probability
§ 3.1
Basic Concepts
of Probability
Probability Experiments
A probability experiment is an action through which
specific results (counts, measurements or responses)
are obtained.
Example:
Rolling a die and observing the
number that is rolled is a
probability experiment.
The result of a single trial in a probability
experiment is the outcome.
The set of all possible outcomes for an experiment is
the sample space.
Example:
The sample space when rolling a die has six
outcomes.
{1, 2, 3, 4, 5, 6}
Larson & Farber, Elementary Statistics: Picturing the World, 3e 105
Events
An event consists of one or more outcomes and is a
subset of the sample space.
Events are
Example: represented by
A die is rolled. Event A uppercase
is rollingletters.
an even
number.
A simple event is an event that consists of a single
outcome.
Example:
A die is rolled. Event A is rolling an even number.
Example:
A die is rolled.
Find the probability of Event A: rolling a 5.
There is one outcome in Event A: {5}
1
P(A) = 0.167
“Probability of 6
Event A.”
Example:
There are 5 red chips, 4 blue chips, and 6 white
chips in a basket. Find the probability of randomly
selecting a chip that is not blue.
4
P (selecting a blue chip) 0.267
15
4 11
1
P (not selecting a blue chip) 0.733
15 15
Larson & Farber, Elementary Statistics: Picturing the World, 3e 112
§ 3.2
Conditional
Probability and
the
Multiplication
Rule
Conditional Probability
A conditional probability is the probability of an
event occurring, given that another event has already
occurred.
P (B | “Probability of B, given A”
A)
Example:
There are 5 red chip, 4 blue chips, and 6 white
chips in a basket. Two chips are randomly
selected. Find the probability that the second chip
is red given that the first chip is blue. (Assume that
the first chip is not replaced.)
Because the first chip is selected and not
replaced, there are only 14 chips remaining.
5
P (selecting a red chip|first chip is 0.357
14
blue)
Larson & Farber, Elementary Statistics: Picturing the World, 3e 114
Conditional Probability
Example:
100 college students were surveyed and asked how
many hours a week they spent studying. The results are
in the table below. Find the probability that a student
spends more than 10 hours studying given that the
student is a male.
Less More
5 to 10 Total
then 5 than 10
Male 11 22 16 49
Female 13 24 14 51
Total 24 46 30 100
The sample space consists of the 49 male students.
Of these 49, 16 spend more than 10 hours a week
studying. 16
0.327
P (more than 10 hours|male)
49
Larson & Farber, Elementary Statistics: Picturing the World, 3e 115
Independent Events
Two events are independent if the occurrence of
one of the events does not affect the probability of
the other event. Two events A and B are
independent if
P (B |A) = P (B) or if P (A |B) = P (A).
Events that are not independent are dependent.
Example:
Decide if the events are independent or dependent.
Selecting a diamond from a standard
deck of cards (A), putting it back in the
deck, and then selecting a spade from the
deck13
(B).1 13 1 The occurrence of A does
P (B A ) and P (B ) . not affect the probability
52 4 52 4
of B, so the events are
independent.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 116
Multiplication Rule
The probability that two events, A and B will occur in
sequence is
P (A and B) = P (A) · P (B |A).
If event A and B are independent, then the rule can
be simplified to P (A and B) = P (A) · P (B).
Example:
Two cards are selected, without replacement, from
a deck. Find the probability of selecting a diamond,
and then selecting a spade.
Because the card is not replaced, the events are
dependent.
P (diamond and spade) = P (diamond) · P (spade |diamond)
13 13 169
0.064
52 51 2652
Larson & Farber, Elementary Statistics: Picturing the World, 3e 117
Multiplication Rule
Example:
A die is rolled and two coins are tossed.
Find the probability of rolling a 5, and flipping two
tails.
1
P (rolling a 5) = .
6
1
Whether or not the roll is a 5, P (Tail ) ,=
2
so the events are independent.
A and B
A
B A B
A B
1
4
2
A J9 2 B
3 10
J J A 7
K 4
J 5 8
6Q
Start
1 Coin
st
Tossed
Heads Tails 2 ways to flip the
2nd Coin coin
Tossed
Heads Tails Heads Tails 2 ways to flip the
coin
Example:
How many different surveys are required to cover all
possible question arrangements if there are 7
questions in a survey?
7! = 7 · 6 · 5 · 4 · 3 · 2 · 1 = 5040 surveys
Example:
Jessie wants to plant 10 plants along the sidewalk in
her front yard. She has 3 rose bushes, 4 daffodils,
and 3 lilies. In how many distinguishable ways can
the plants be arranged?
10! 10 9 8 7 6 5 4!
3!4!3! 3!4!3!
4,200 different ways to arrange the plants
Larson & Farber, Elementary Statistics: Picturing the World, 3e 132
Combination of n Objects Taken r at
a Time
A combination is a selection of r objects from a group
of n things when order does not matter. The
number of combinations of r objects selected
from a group of n objects is n !
Cr .
# in the
n
(n r)! r !
collection
# taken from
the collection
Example:
You are required to read 5 books from a list of 8.
In how many different ways can you do so if the
order doesn’t matter?
8! 8 7 6 5!
C5 = =
8
3!5! 3!5!
=56 combinations
Larson & Farber, Elementary Statistics: Picturing the World, 3e 133
Application of Counting
Principles
Example:
In a state lottery, you must correctly select 6 numbers (in any order)
out of 44 to win the grand prize.
a.) How many ways can 6 numbers be chosen from
the 44 numbers?
b.) If you purchase one lottery ticket, what is the
probability of winning the top prize?
44!
a.) 44C 6 7,059,052 combinations
6!38!
b.) There is only one winning ticket, therefore,
1
P (win) 0.00000014
7059052
Larson & Farber, Elementary Statistics: Picturing the World, 3e 134
Chapter 4
Discrete Probability
Distributions
§ 4.1
Probability
Distributions
Random Variables
A random variable x represents a numerical value
associated with each outcome of a probability distribution.
x
0 2 4 6 8 10
x
0 2 4 6 8 10
In Words In Symbols
1. The probability of each 0 P (x) 1
value of the discrete random
variable is between 0 and 1,
inclusive.
2. The sum of all the ΣP (x) = 1
probabilities is 1.
Guidelines
Let x be a discrete random variable with possible
outcomes x1, x2, … , xn.
1. Make a frequency distribution for the possible
outcomes.
2. Find the sum of the frequencies.
3. Find the probability of each possible outcome by
dividing its frequency by the sum of the
frequencies.
4. Check that each probability is between 0 and 1
and that the sum is 1.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 140
Constructing a Discrete Probability
Distribution
Example:
The spinner below is divided into two sections. The probability
of landing on the 1 is 0.25. The probability of landing on the 2
is 0.75. Let x be the number the spinner lands on. Construct a
probability distribution for the random variable x.
1 x P (x)
1 0.25 Each probability
2 is between 0 and
2 0.75
1.
The sum of the probabilities is
1.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 141
Constructing a Discrete Probability
Distribution
Example:
The spinner below is spun two times. The probability of landing
on the 1 is 0.25. The probability of landing on the 2 is 0.75.
Let x be the sum of the two spins. Construct a probability
distribution for the random variable x.
Sum of P(x)
Sum of Two
spins, P (x) 0.6
Spins
x 0.5
2 0.0625
Probability
3 0.375 0.
4
4 0.5625 0.
3
0.
2
0.
1
0 x
2 3 4
Sum
Larson & Farber, Elementary Statistics: Picturing the World, 3e 145
Mean
The mean of a discrete random variable is given by
μ = ΣxP(x).
Each value of x is multiplied by its corresponding
probability and the products are added.
Example:
Find the mean of the probability distribution for the sum of the two spins.
x P (x) xP (x)
2 0.062 2(0.0625) = ΣxP(x) = 3.5
5 0.125
The mean for
3 0.375 3(0.375) = the two spins is
4 0.562 1.125 3.5.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 146
Variance
The variance of a discrete random variable is given by
2 = Σ(x – μ)2P (x).
Example:
Find the variance of the probability distribution for the sum of the two spins.
The mean is 3.5.
σ σ2
x P (x) x–μ (x – P (x)(x –
μ)2 μ)2 0.376 0.613
2 0.062 –
5 1.5 2.25 0.141 Most of the sums
differ from the
3 0.375 – 0.25 0.094 mean by no more
4 0.562 0.5 0.25 0.141 than 0.6 points.
5 0.5
Larson & Farber, Elementary Statistics: Picturing the World, 3e 148
Expected Value
The expected value of a discrete random variable is
equal to the mean of the random variable.
Expected Value = E(x) = μ = ΣxP(x).
Example:
At a raffle, 500 tickets are sold for $1 each for two prizes of $100 and $50.
What is the expected value of your gain?
Gain,
P (x) E(x) = ΣxP(x).
x 1
$99 500 1 1 498
$99 $49 ( $1)
1 500 500 500
$49 500
$0.70
–$1 498
500
Because the expected value
Winning is negative, you can expect to
no prize
lose $0.70 for each ticket you
buy.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 150
§ 4.2
Binomial
Distributions
Binomial Experiments
A binomial experiment is a probability experiment
that satisfies the following conditions.
1. The experiment is repeated for a fixed number of
trials, where each trial is independent of other
trials.
2. There are only two possible outcomes of interest
for each trial. The outcomes can be classified as
a success (S) or as a failure (F).
3. The probability of a success P (S) is the same for
each trial.
4. The random variable x counts the number of
successful trials.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 152
Notation for Binomial
Experiments
Symbol Descriptio
n
n The number of times a trial is
repeated.
p = P (S) The probability of success in a single
trial.
q = P (F) The probability of failure in a single trial.
(q = 1 – p)
n 8 4 1 1 12 x 0,1,2,3,4,5,6,7,8
p q 1
52 13 13 13
Larson & Farber, Elementary Statistics: Picturing the World, 3e 154
Binomial Experiments
Example:
Decide whether the experiment is a binomial
experiment. If it is, specify the values of n, p, and q,
and list the possible values of the random variable x.
If it is not a binomial experiment, explain why.
• You roll a die 10 times and note the number the
die lands on.
3
p = the probability of selecting a red chip 0.3
10
q = 1 – p = 0.7 P (1) 3C 1(0.3)1(0.7)2
n=3 3(0.3)(0.49)
x=1 0.441
Larson & Farber, Elementary Statistics: Picturing the World, 3e 156
Binomial Probability
Distribution
Example:
A bag contains 10 chips. 3 of the chips are red, 5 of the chips are white, and 2 of the chips are blue. Four
chips are selected, with replacement. Create a probability distribution for the number of red chips selected.
3
p = the probability of selecting a red chip
0.3
10
q = 1 – p = 0.7
n=4 x P (x)
x = 0, 1, 2, 3, 4 0 0.240 The binomial
1 0.412 probability
2 0.265 formula is
3 0.076 used to find
4 0.008 each
probability.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 157
Finding Probabilities
Example:
The following probability distribution represents the probability of selecting 0, 1, 2, 3, or 4 red chips
when 4 chips are selected.
x P (x) P
0 0.24 0.5 (x) Selecting Red
1 0.412 Probability Chips
0.
2 0.265 4
0.
3 0.076 3
4 0.008 0.
2
0.
1
0 x
0 1 2 3 4
Number of red chips
Larson & Farber, Elementary Statistics: Picturing the World, 3e 159
Mean, Variance and Standard
Deviation
Population Parameters of a Binomial
Distribution
Mean: μ np
Variance: σ 2 npq
Standard σ npq
Example: deviation:
One out of 5 students at a local college say that they skip breakfast in the morning. Find the
mean, variance and standard deviation if 10 students are randomly selected.
p = 0.20 q = 0.80
a.) x = 3 b.) x = 3, 4
P (3) = (0.2)(0.8)3 – 1 P (3 or 4) = P (3) + P (4)
= (0.2)(0.8)2 0.128 + 0.102
= (0.2)(0.64) 0.230
= 0.128
Larson & Farber, Elementary Statistics: Picturing the World, 3e 163
Geometric Distribution
Example:
A fast food chain puts a winning game piece on every fifth package of
French fries. Find the probability that you will win a prize,
a.) with your third purchase of French fries,
b.) with your third or fourth purchase of French fries.
p = 0.20 q = 0.80
a.) x = 3 b.) x = 3, 4
P (3) = (0.2)(0.8)3 – 1 P (3 or 4) = P (3) + P (4)
= (0.2)(0.8)2 0.128 + 0.102
= (0.2)(0.64) 0.230
= 0.128
Larson & Farber, Elementary Statistics: Picturing the World, 3e 164
Poisson Distribution
The Poisson distribution is a discrete probability
distribution of a random variable x that satisfies the
following conditions.
1. The experiment consists of counting the number of
times an event, x, occurs in a given interval. The
interval can be an interval of time, area, or volume.
2. The probability of the event occurring is the same for
each interval.
3. The number of occurrences in one interval is
independent of the number of occurrences in other
Theintervals.
probability of exactly x occurrences in an interval is
μ xe μ
P (x )
x!
where e 2.71818 and μ is the mean number of
occurrences.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 165
Poisson Distribution
Example:
The mean number of power outages in the city of Brunswick is 4 per
year. Find the probability that in a given year,
a.) there are exactly 3 outages,
b.) there are more than 3 outages.
Normal curve
Total area = 1
x
μ 3σ μ 2σ μσ μ μ+σ μ + 2σ μ + 3σ
x
1 3 5 7 9 11 13
x
6 7 8 9 10
Height (in feet)
z
3 2 1 0 1 2 3
0.0 .5000 .5040 .5080 .5120 .5160 .5199 .5239 .5279 .5319 .5359
0.1 .5398 .5438 .5478 .5517 .5557 .5596 .5636 .5675 .5714 .5753
0.2 .5793 .5832 .5871 .5910 .5948 .5987 .6026 .6064 .6103 .6141
2.6 .9953 .9955 .9956 .9957 .9959 .9960 .9961 .9962 .9963 .9964
2.7 .9965 .9966 .9967 .9968 .9969 .9970 .9971 .9972 .9973 .9974
2.8 .9974 .9975 .9976 .9977 .9977 .9978 .9979 .9979 .9980 .9981
3.4 .0002 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003
3.3 .0003 .0004 .0004 .0004 .0004 .0004 .0004 .0005 .0005 .0005
0.3 .3483 .3520 .3557 .3594 .3632 .3669 .3707 .3745 .3783 .3821
0.2 .3859 .3897 .3936 .3974 .4013 .4052 .4090 .4129 .4168 .4207
0.1 .4247 .4286 .4325 .4364 .4404 .4443 .4483 .4522 .4562 .4602
0.0 .4641 .4681 .4724 .4761 .4801 .4840 .4880 .4920 .4960 .5000
z
0 1.23
1. Use the table to find
the area for the z-score.
z
0.75 0 1.23
Always
draw the
curve!
2.33 0
z
1.98 0 1.07
Normal
Distributions:
Finding
Probabilities
Probability and Normal
Distributions
If a random variable, x, is normally
distributed, you can find the probability that
x will fall in a given interval by calculating
the area under the normal curve for that
interval.
μ = 10
P(x < σ=5
15)
x
μ =10 15
x z
μ =10 15 μ =0 1
Same area
P(x < 15) = P(z < 1) = Shaded area under the curv
= 0.8413
Larson & Farber, Elementary Statistics: Picturing the World, 3e 189
Probability and Normal
Distributions
Example:
The average on a statistics test was 78 with a
standard deviation of 8. If the test scores are
normally distributed, find the probability that a
student receives a test score less than 90.
μ = 78 x - μ 90-78
σ=8 z =
σ 8
=1.5
P(x < 90)
(x > 85) = P(z > 0.88) = 1 P(z < 0.88) = 1 0.8106 = 0.18
60 < x < 80) = P(2.25 < z < 0.25) = P(z < 0.25) P(z < 2.25)
= 0.5987 0.0122 = 0.5865
Larson & Farber, Elementary Statistics: Picturing the World, 3e 192
§ 5.3
Normal
Distributions:
Finding Values
Finding z-Scores
Example:
Find the z-score that corresponds to a cumulative
area of 0.9973. Appendix B: Standard Normal Table
z .00 .01 .02 .03 .04 .05 .06 .07 .08
.08 .09
0.0 .5000 .5040 .5080 .5120 .5160 .5199 .5239 .5279 .5319 .5359
0.1 .5398 .5438 .5478 .5517 .5557 .5596 .5636 .5675 .5714 .5753
0.2 .5793 .5832 .5871 .5910 .5948 .5987 .6026 .6064 .6103 .6141
2.6 .9953 .9955 .9956 .9957 .9959 .9960 .9961 .9962 .9963 .9964
2.7
2.7 .9965 .9966 .9967 .9968 .9969 .9970 .9971 .9972 .9973 .9974
2.8 .9974 .9975 .9976 .9977 .9977 .9978 .9979 .9979 .9980 .9981
3.4 .0002 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003 .0003
0.2 .0003 .0004 .0004 .0004 .0004 .0004 .0004 .0005 .0005 .0005
Use the
closest
0.3 .3483 .3520 .3557 .3594 .3632 .3669 .3707 .3745 .3783 .3821
area.
0.2
0.2 .3859 .3897 .3936 .3974 .4013 .4052 .4090 .4129 .4168 .4207
0.1 .4247 .4286 .4325 .4364 .4404 .4443 .4483 .4522 .4562 .4602
0.0 .4641 .4681 .4724 .4761 .4801 .4840 .4880 .4920 .4960 .5000
Area = 0.75
z
μ =0 0.67
?
? 1.25
1.25 ( 1.41)0.1
1.11
1.11
The least a bag can weigh and still work in the machine is 1.11
ounces.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 198
§ 5.4
Sampling
Distributions and
the Central Limit
Theorem
Sampling Distributions
A sampling distribution is the probability
distribution of a sample statistic that is formed when
samples of size n are repeatedly taken from a
population.
Sample Sample
Sample Sample
Sample
Sample
Sample
Sample
Populati Sample
Sample
on
σx ,
2. The standard deviation of theσ,sample means, is equal to
the population standard σ σ
deviation, divided by the square
= x
root of n.
n
x selected.
5 10 15 20
Population Continued.
values
Larson & Farber, Elementary Statistics: Picturing the World, 3e 204
Sampling Distribution of Sample
Means
Example continued:
The population values {5, 10, 15, 20} are written on slips
of paper and put in a hat. Two slips are randomly
selected, with replacement.
c. List all the possible samples of size n = 2 and
calculate the mean of each.
Sample x
Sample mean, Sample x
Sample mean,
5, 5 5 15, 5 10 These means
5, 10 7.5 15, 10 12.5 form the
5, 15 10 15, 15 15 sampling
5, 20 12.5 15, 20 17.5 distribution
10, 5 7.5 20, 5 12.5 of the sample
10, 10 10 20, 10 15 means.
10, 15 12.5 20, 15 17.5
10, 20 15 20, 20 20
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 205
Sampling Distribution of Sample
Means
Example continued:
The population values {5, 10, 15, 20} are written on slips
of paper and put in a hat. Two slips are randomly
selected, with replacement.
d. Create the probability distribution of the sample
means.
x f Probability
5 1 0.0625
7.5 2 0.1250 Probability
10 3 0.1875 Distribution of
Sample Means
12.5 4 0.2500
15 3 0.1875
17.5 2 0.1250
20 1 0.0625
0.20 Distribution
The shape of the graph
0.15
is symmetric and bell
0.10
shaped. It approximates
0.05 a normal distribution.
x
5 7.5 10 12. 15 17. 20
Sample5mean 5
Larson & Farber, Elementary Statistics: Picturing the World, 3e 207
The Central Limit Theorem
If a sample of size n 30 is taken from a population
with any type of distribution that has a mean =
and standard deviation = ,
x x
the sample means will have a normal
distribution. x x
x x
x x x
x x x x x x
Larson & Farber, Elementary Statistics: Picturing the World, 3e 208
The Central Limit Theorem
If the population itself is normally distributed,
with mean = and standard deviation = ,
x
the sample means will have a normal
distribution for any sample size n.
xx
x x
x x x
x x x x x x
Larson & Farber, Elementary Statistics: Picturing the World, 3e 209
The Central Limit Theorem
In either case, the sampling distribution of sample
means has a mean equal to the population mean.
μx μ Mean of the
sample means
x
75 78 79
z
1.88
? 00.63
? Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 215
Probability and Normal
Distributions
Example continued:
x
75 78 79
z
?
1.88 0 0.63
?
P(75 < <x79) = P(1.88 < z < 0.63) = P(z < 0.63) P(z < 1.88)
= 0.7357 0.0301 = 0.7056
Approximately 70.56% of the 25 students will have a
mean score between 75 and 79.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 216
Probabilities of x and x
Example:
The population mean salary for auto mechanics is
= $34,000 with a standard deviation of =
$2,500. Find the probability that the mean salary for a
randomly selected sample of 50 mechanics is greater
μthan
x
$35,000.
=34000
x μx 35000 34000 =2.83
σ 2500 z =
σx = =353.55 σx 353.55
n 50
P ( x > 35000)
= P (z > 2.83)
= 1 P (z < 2.83)
= 1 0.9977= 0.0023
Normal
Approximations to
Binomial
Distributions
Normal Approximation
The normal distribution is used to approximate
the binomial distribution when it would be
impractical to use the binomial distribution to
find a probability.
Normal Approximation to a Binomial
Distribution
If np 5 and nq 5, then the binomial random
variableμ xis
npapproximately normally distributed with
mean
σ npq.
and standard deviation
c
x
Confidence Intervals
§ 6.1
Confidence
Intervals for the
Mean (Large
Samples)
Point Estimate for Population
μ
A point estimate is a single value estimate for a
population parameter. The most unbiased point
x.
estimate of the population mean, , is the sample
mean,
Example:
A random sample of 32 textbook prices (rounded to the nearest
dollar) is taken from a local college bookstore. Find a point
estimate for the population mean, .
34 34 38 45 45 45 45 54
56 65 65 66 67 67 68 74 x 74.22
79 86 87 87 87 88 90 90
94 95 96 98 98 101 110 121
Point estimate
for textbooks
•
74.22
interval estimate
0.05 0.05
z
zc = 1.645
zc z = 0 zc = z1.645
c
0.025 0.025
z
zc = z1.96
c
z = 0 zc =zc1.96
0.005 0.005
z
zc = 2.575
zc z = 0 zc = z2.575
c
Example:
A random sample of 32 textbook prices is taken from
x
a local college bookstore. The mean of the sample is
= 74.22, the sample standard deviation is s =
23.44, and the margin of error is E = 8.12.
Construct a 95% confidence interval for the mean
price of all textbooks in the bookstore.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 238
Confidence Intervals for μ
Example continued:
Construct a 95% confidence interval for the mean
price of all textbooks in the bookstore.
x =74.22 s = 23.44 E = 8.12
Left endpoint = ? Right endpoint
=?
• •
x =74.22 •
x E 74.22 8.12 x E 74.22 8.12
= 66.1 = 82.34
With 95% confidence we can say that the cost for all
textbooks in the bookstore is between $66.10 and
$82.34.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 239
Finding Confidence Intervals
for μ
Finding a Confidence Interval for a Population
Mean (n 30 or σ known with a normally
distributed
In Words population) In Symbols
1. Find the sample statistics n and
x. x
x
n
2. Specify , if known. Otherwise, if n (x x)2
30, find the sample standard deviation s
n 1
s and use it as an estimate for .
Use the
3. Find the critical value zc that
Standard Normal
corresponds to the given level of Table.σ
confidence. E zc
n
4. Find the margin of error E. L eft endpoint: x E
Right endpoint: x E
5. Find the left and right endpoints I nterval: x E x E
and form Larson
the confidence
& Farber, Elementaryinterval.
Statistics: Picturing the World, 3e 240
Confidence Intervals for μ (
Known)
Example:
A random sample of 25 students had a grade point
average with a mean of 2.86. Past studies have
shown that the standard deviation is 0.15 and the
population is normally distributed.
Construct a 90% confidence interval for the
population mean grade point average.
n = 25 x =2.86 = 0.15
σ 0.15
zc = 1.645 E zc 1.645 0.05
n 25
x E 2.86±0.05 2.81 < μ < 2.91
With 90% confidence we can say that the mean grade point
average for all students in the population is between 2.81
and 2.91. Larson & Farber, Elementary Statistics: Picturing the World, 3e
241
Sample Size
Given a c-confidence level and a maximum error of
estimate, E, the minimum sample size n, needed to
estimate , the population mean, is
2
zc
n .
E
If is unknown, you can estimate it using s provided
you have a preliminary sample with at least 30
members.
Example:
You want to estimate the mean price of all the
textbooks in the college bookstore. How many books
must be included in your sample if you want to be 99%
confident that the sample mean is within $5 of Continued. the
population Larson
mean? & Farber, Elementary Statistics: Picturing the World, 3e 242
Sample Size
Example continued:
You want to estimate the mean price of all the
textbooks in the college bookstore. How many books
must be included in your sample if you want to be 99%
confident that the sample mean is within $5 of the
population mean?
x =74.22 s = 23.44 zc = 2.575
2 2
zc 2.575 23.44
n
E 5
145.7 (Always round up.)
c = 0.95
t
tc = 2.776 tc =
2.776
We are 90% confident that the mean wait time for all
customers is between 86.9 and 103.1 seconds.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 250
Normal or t-Distribution?
Use the normal distribution
Yes σ with
Is n 30? E zc .
n
No
If is unknown, use s instead.
Is the population normally,
You cannot use the normal
or approximately No
distribution or the t-
normally, distributed?
distribution.
Yes
Use the normal distribution
Is known? Yes σ with
E zc .
n
No
x 450
pˆ 0.36
n 1250
where pq
ˆ ˆ.
E zc
n
2 (n 1)s 2
σ2
forms a chi-square distribution for samples of
any size n > 1.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 262
The Chi-Square
Distribution
Four properties of the chi-square distribution are as
follows.
1. All chi-square values χ2 are greater than or equal to
zero.
2. The chi-square distribution is a family of curves, each
determined by the degrees of freedom. To form a
confidence interval for 2, use the χ2-distribution with
degrees of freedom. To form a confidence interval for
2, use the χ2-distribution with degrees of freedom
equal to one less than the sample size.
3. The area under each curve of the chi-square
distribution equals one.
4. Find the critical value zc that corresponds to the given
level of confidence.
5. Chi-squareLarson
distributions are positively skewed.
& Farber, Elementary Statistics: Picturing the World, 3e 263
Critical Values for X2
There are two critical values for each level of
confidence. The value χ2R represents the right-tail
critical value and χ2L represents the left-tail critical
value.
1 c
1 1 2c 1 2c
2
X2 X2
X 2
R X 2
L
1 c c
The area between the left and
2 1 c right critical values is c.
2
X2
X 2
L X 2
R
and
A c-confidence interval for a population variance
and standard deviation is as follows.
and
Constructing a Confidence Interval for a Variance and
a Standard Deviation
In Words In Symbols
1. Verify that the population has a
normal distribution.
2. Identify the sample statistic n d.f. = n 1
and the degrees of freedom.
3. Find the point estimate s . 2 2 (x x )2
s
n 1
4. Find the critical value χ2R and χ2L Use Table 6 in
that correspond to the given Appendix B.
level of confidence.
Continued
Larson & Farber, Elementary Statistics: Picturing the World, 3e
. 268
Confidence Intervals for 2
and
Constructing a Confidence Interval for a Variance and
a Standard Deviation
In Words In Symbols
5. Find the left and right (n 1)s 2 2 (n 1)s 2
σ
endpoints and form the XR2
X L2
confidence interval.
(n 1)s 2 (n 1)s 2
6. Find the confidence interval for 2
σ
XR X L2
the population standard
deviation by taking the square
root of each endpoint.
Hypothesis Testing
with One Sample
§ 7.1
Introduction to
Hypothesis
Testing
Hypothesis Tests
A hypothesis test is a process that uses sample
statistics to test a claim about the value of a
population parameter.
If a manufacturer of rechargeable batteries
claims that the batteries they produce are good
for an average of at least 1,000 charges, a
sample would be taken to test this claim.
Actual Truth of H0
Decision H0 is true H0 is false
P is the area
to the left of
the test
statistic.
z
-3 -2 -1 0 1 2 3
Test
statistic
Larson & Farber, Elementary Statistics: Picturing the World, 3e 284
Right-tailed Test
2. If the alternative hypothesis contains the greater-
than symbol (>), the hypothesis test is a right-
tailed test.
H0: μ k
H a: μ > k
P is the area
to the right of
the test
statistic.
z
-3 -2 -1 0 1 2 3
Test
statistic
Larson & Farber, Elementary Statistics: Picturing the World, 3e 285
Two-tailed Test
3. If the alternative hypothesis contains the not-
equal-to symbol (), the hypothesis test is a two-
1
tailed test. In a two-tailed test, each
2 tail has an
area ofk P.
H0: μ =
H a: μ k
P is twice the
P is twice the area to the
area to the left right of the
of the negative positive test
test statistic. statistic.
z
-3 -2 -1 0 1 2 3
Test Test
statistic statistic
Larson & Farber, Elementary Statistics: Picturing the World, 3e 286
Identifying Types of Tests
Example:
For each claim, state H0 and Ha. Then determine whether
the hypothesis test is a left-tailed, right-tailed, or two-
tailed
a.) A test.
cigarette manufacturer claims that less than one-
eighth of the US adult population smokes
cigarettes.
H0: p 0.125
Ha: p < 0.125 Left-tailed test
(Claim)
b.) A local telephone company claims that the
average length of a phone call is 8 minutes.
H0: μ = 8 (Claim)
Ha: μ 8 Two-tailed test
Claim
Decision Claim is H0 Claim is Ha
There is enough evidence There is enough evidence
Reject H0 to reject the claim. to support the claim.
There is not enough There is not enough
Do not reject H0 evidence to reject the evidence to support the
claim. claim.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 288
Interpreting a Decision
Example:
You perform a hypothesis test for the following claim.
How should you interpret your decision if you reject
H0? If you fail to reject H0?
H0: (Claim) A cigarette manufacturer claims that
less than one-eighth of the US adult population
smokes cigarettes.
If H0 is rejected, you should conclude “there is sufficient
evidence to indicate that the manufacturer’s claim is false.”
If you fail to reject H0, you should conclude “there is not
sufficient evidence to indicate that the manufacturer’s claim
is false.”
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 290
Steps for Hypothesis
Testing
5. Find the P-value.
6. Use the following decision rule.
Is the P-value less than
or equal to the level of No Fail to reject H0.
significance?
Yes
Reject H0.
b.) 0.01?
0.0043
z
2.63 0
x
The test statistic is the sample mean and the
standardized test statistic is z.
x μ σ
z standard error σx
σ n n
When n 30, the sample standard deviation s can be
substituted for .
Larson & Farber, Elementary Statistics: Picturing the World, 3e 297
Using P-values for a z-Test
Using P-values for a z-Test for a Mean μ
In Words In Symbols
1. State the claim mathematically State H0 and Ha.
and verbally. Identify the null
and alternative hypotheses.
2. Specify the level of Identify .
significance. x μ
z
3. Determine the standardized σ n
test statistic.
Use Table 4 in
4. Find the area that corresponds Appendix B.
to z.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 298
Using P-values for a z-Test
Using P-values for a z-Test for a Mean μ
In Words In Symbols
5. Find the P-value.
a. For a left-tailed test, P = (Area in left tail).
b. For a right-tailed test, P = (Area in right tail).
c. For a two-tailed test, P = 2(Area in tail of test statistic).
z 1.43
The area to the right of P-value is greater than
z = 1.43 is P = 0.0764. = 0.01, fail to reject H0.
0 1.43 z
0.025 0.025
z
z0 = 1.96 0 z0 = 1.96
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 307
Testing with Rejection
Regions
Example continued:
A local telephone company claims that the average
length of a phone call is 8 minutes. In a random
sample of 58 phone calls, the sample mean was 7.8
minutes and the standard deviation was 0.5
minutes. Is there enough evidence to support this
Hclaim at (Claim)
0: = 8
= 0.05? Ha: 8
The standardized test statistic is The test statistic falls
x μ 7.8 8 in the rejection
z
σ n 0.5 58 region, so H0 is
3.05. rejected.
z
z0 = 1.96 0 z0 = 1.96
x μ
t
s n
pˆ μ pˆ p p
z ˆ
σ pˆ pq n
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 322
Hypothesis Test for
Proportions
Using a z-Test for a Proportion p
Verify that np 5 and nq 5.
In Words In Symbols
5. Determine any rejection
regions. p̂ p
z
6. Find the standardized test pq n
statistic.
If z is in the
7. Make a decision to reject or fail rejection region,
to reject the null hypothesis. reject H0.
Otherwise, fail to
8. Interpret the decision in the
reject H0.
context of the original claim.
Example:
Find the critical value for a two-tailed test when n =
26 and = 0.01.
There are 25 d.f. The areas to the right of the
1 1
critical values
2
are = 0.005
2
and 1 – = 0.995.
From Table 6, the critical values are χ2L = 10.520
and χ2R = 46.928.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 331
The Chi-Square Test
The χ2-test for a variance or standard deviation is
a statistical test for a population variance or standard
deviation. The χ2-test can be used when the
population is normal.
0.01
X2
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 335
Hypothesis Test for Standard
Deviation
Example continued:
A college professor claims that the standard
deviation for students taking a statistics test is less
than 30. 10 tests are randomly selected and the
standard deviation is found to be 28.8. Test this
professor’s claim at the = 0.01 level.
H0: 30 Ha: < 30 (Claim)
χ =
2 0.01 2(n 1)s 2 (10 1)(28.8)2
0 χ 2
σ 302
2.088 8.29
X2
X 2
0 = 2.088 Fail to reject H0.
At the 1% level of significance, there is not enough
evidence to support the professor’s claim.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 336
Hypothesis Test for
Variance
Example:
A local balloon company claims that the variance for
the time its helium balloons will stay afloat is 5 hours.
A disgruntled customer wants to test this claim. She
randomly selects 23 customers and finds that the
variance of the sample is 4.5 seconds. At = 0.05,
does she have enough evidence to reject the
company’s claim?
H0: 2 = 5 (Claim) Ha: 2 5
This is a two-tailed test with d.f.= 22 and = 0.05.
1 1
0.025 0.025
2 2
X2
X 2
L X 2
R Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 337
Hypothesis Test for
Variance
Example continued:
A local balloon company claims that the variance for
the time its helium balloons will stay afloat is 5 hours.
A disgruntled customer wants to test this claim. She
randomly selects 23 customers and finds that the
variance of the sample is 4.5 seconds. At = 0.05,
does she have enough evidence to reject the
company’s claim?
H0: 2 = 5 (Claim) Ha: 2 5
Hypothesis Testing
with Two Samples
§ 8.1
Testing the
Difference Between
Means (Large
Independent
Samples)
Two Sample Hypothesis
Testing
In a two-sample hypothesis test, two parameters
from two populations are compared.
H0: μ 1 =
μ2 H0 : μ 1 μ 2 H0 : μ 1 μ 2
Ha: μ1 Ha: μ1 > μ2 Ha: μ1 < μ2
μ2
μx x μx μx μ1 μ2
1 2 1 2
and
2 2
σ σ
σx x σx2 σx2 1 2 .
1 2 1
n1 n22
Sampling
x1 x2
distribution σx x
1 2
μ1 μ2 σx x
1 2
x1 x2
for
Larson & Farber, Elementary Statistics: Picturing the World, 3e 345
Two Sample z-Test
Two-Sample z-Test for the Difference Between
Means
A two-sample z-test can be used to test the difference
between two population means μ1 and μ2 when a large
sample (at least 30) is randomly selected from each
population and
x1 xthe samples are independent. The test
2
statistic is and the standardized test statistic is
z
x1 x2 μ1 μ2
where σx x
σ12 σ22
.
σx x
1 2
n1 n2 1 2
-3 -2 -1 0 1 2 3
z
z0 = 1.28 Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 349
Two Sample z-Test for the
Means
Example continued:
H0: 1 2 z0 = 1.28
Ha: 1 > 2 (Claim) z
-3 -2 -1 0 1 2 3
t
x1 x2 μ1 μ2
6. Find the standardized test σx x
1 2
statistic.
If t is in the
rejection region,
7. Make a decision to reject or fail reject H0.
to reject the null hypothesis. Otherwise, fail to
reject H0.
8. Interpret the decision in the
context of the& Farber,
Larson original claim.
Elementary Statistics: Picturing the World, 3e 357
Two Sample t-Test for the
Means
Example:
A random sample of 17 police officers in Brownsville
has a mean annual income of $35,800 and a standard
deviation of $7,800. In Greensville, a random sample
of 18 police officers has a mean annual income of
$35,100 and a standard deviation of $7,375. Test the
claim at = 0.01 that the mean annual incomes in
the two cities are not the same. Assume the
population variances are equal.
H0: 1 = 2 1 1
0.005 0.005
Ha: 1 2 (Claim)2 2
t
d.f. = n1 + n2 – 2 -3 -2 -1 0 1 2 3
t0 = 2.576
–t0 = –2.576
= 17 + 18 – 2 = Continued.
33 Larson & Farber, Elementary Statistics: Picturing the World, 3e 358
Two Sample t-Test for the
Means
Example continued:
H0: 1 = 2
Ha: 1 2 (Claim) -3 -2 -1 0 1 2 3
t
–t0 = –2.576 t0 = 2.576
σx x σˆ
1 1
n1 1s12 n2 1s22 1 1
1 2
n1 n2 n1 n2 2 n1 n2
17 178002 18 173752 1 1
17 18 2 17 18
7584.0355(0.3382)
2564.92 Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 359
Two Sample t-Test for the
Means
Example continued:
H0: 1 = 2
Ha: 1 2 (Claim) -3 -2 -1 0 1 2 3
t
–t0 = –2.576 t0 = 2.576
t
x1 x2 μ1 μ2 35800 35100 0
σx x 0.273
1 2
2564.92
Fail to reject H0.
There is not enough evidence at the 1% level to
support the claim that the mean annual incomes
differ.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 360
§ 8.3
Testing the
Difference Between
Means (Dependent
Samples)
Independent and Dependent
Samples
Two samples are independent if the sample
selected from one population is not related to the
sample selected from the second population. Two
samples are dependent if each member of one
sample corresponds to a member of the other
sample. Dependent samples are also called paired
samples or matched samples.
d
–t0 μd t0
n( d 2) ( d )2
sd
n(n 1)
d μd
t
sd n
7. Find the standardized test
statistic.
t0 = 2.015
The standardized test statistic is
d μd 7.167 0
t 1.714.
sd n 10.245 6
Note:
wherep x1 x2 and q 1 p. n1p, n1q , n2p, and n2q
n1 n2
must be at least 5.
x1 x2
4. Determine the rejection p
n1 n2
region(s).
Continued.
5. Find the weighted estimate of
Larson & Farber, Elementary Statistics: Picturing the World, 3e 378
Two Sample z-Test for
Proportions
Using a Two-Sample z-Test for the Difference
Between Proportions
In Words In Symbols
6. Find the standardized test z
(pˆ1 pˆ2) (p1 p2)
statistic. 1 1
pq
n1 n2
If z is in the
7. Make a decision to reject or fail rejection region,
to reject the null hypothesis. reject H0.
Otherwise, fail to
8. Interpret the decision in the
reject H0.
context of the original claim.
Correlation and
Regression
§ 9.1
Correlation
Correlation
A correlation is a relationship between two variables. The
data can be represented by the ordered pairs (x, y) where
x is the independent (or explanatory) variable, and y is
the dependent (or response) variable.
x
Example: 2 4 6
x 1 2 3 4 5 –2
y –4 –2 –1 0 2
–4
Larson & Farber, Elementary Statistics: Picturing the World, 3e 385
Linear Correlation
y y
As x As x
increases, y increases, y
tends to tends to
decrease. increase.
x x
Negative Linear Correlation Positive Linear Correlation
y y
x x
No Correlation Nonlinear Correlation
Larson & Farber, Elementary Statistics: Picturing the World, 3e 386
Correlation Coefficient
The correlation coefficient is a measure of the
strength and the direction of a linear relationship
between two variables. The symbol r represents the
sample correlation coefficient. The formula for r is
n xy x y
r .
2 2
n x 2 x n y 2 y
r = 0.91 r = 0.88
x
x
Strong negative correlation
Strong positive correlation
y
y
r = 0.42
r = 0.07
x
x
Weak positive correlation
Nonlinear Correlation
Larson & Farber, Elementary Statistics: Picturing the World, 3e 388
Calculating a Correlation
Coefficient
Calculating a Correlation Coefficient
In Words In Symbols
1. Find the sum of the x-values. x
2. Find the sum of the y-values. y
3. Multiply each x-value by its xy
corresponding y-value and find the
sum.
4. Square each x-value and find the x2
sum. y2
5. Square each y-value and find the n xy x y
r .
sum. n x x
2 2 2
n y y
2
60 There is a strong
0.986
50 74 positive linear
correlation between x
and y.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 390
Correlation Coefficient
Example:
The following data represents the number of hours
12 different students watched television during the
weekend and the scores of each student who took a
test the following Monday.
a.) Display the scatter plot.
b.) Calculate the correlation coefficient r.
Hours, x 0 1 2 3 3 5 5 5 6 7 7 10
Test score,
96 85 82 74 95 68 76 84 58 65 75 50
y
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 391
Correlation Coefficient
Example continued:
Hours, x 0 1 2 3 3 5 5 5 6 7 7 10
Test score,
96 85 82 74 95 68 76 84 58 65 75 50
y
y
100
80
Test score
60
40
20
x
2 4 6 8 10
Hours watching TV
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 392
Correlation Coefficient
Example continued:
Hours, x 0 1 2 3 3 5 5 5 6 7 7 10
Test score,
96 85 82 74 95 68 76 84 58 65 75 50
y
xy 0 85 164 222 285 340 380 420 348 455 525 500
x2 0 1 4 9 9 25 25 25 36 49 49 100
y2 9216 7225 6724 5476 9025 4624 5776 7056 3364 4225 5625 2500
x 54 y 908 xy 3724 x 2 332 y 2 70836
Hours, x 0 1 2 3 3 5 5 5 6 7 7 10
Test score,
96 85 82 74 95 68 76 84 58 65 75 50
y
Is the correlation coefficient significant at = 0.01?
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 396
Testing a Population Correlation
Coefficient
Example continued: Appendix B: Table 11
r 0.831 n = 0.05 = 0.01
4 0.950 0.990
n = 12
5 0.878 0.959
= 0.01 6 0.811 0.917
10 0.632 0.765
11 0.602 0.735
12 0.576 0.708 |r| > 0.708
13 0.553 0.684
Because, the population correlation is significant, there is
enough evidence at the 1% level of significance to conclude
that there is a significant linear correlation between the
number of hours of television watched during the weekend
and the scores of each student who took a test the following
Monday. Larson & Farber, Elementary Statistics: Picturing the World, 3e 397
Hypothesis Testing for ρ
A hypothesis test can also be used to determine
whether the sample correlation coefficient r provides
enough evidence to conclude that the population
correlation coefficient ρ is significant at a specified
level of significance.
A hypothesis test can be one tailed or two tailed.
H0: ρ 0 (no significant negative correlation)
Left-tailed test
Ha: ρ < 0 (significant negative
correlation)
H0: ρ 0 (no significant positive correlation)
Ha: ρ > 0 (significant positive
Right-tailed
test
correlation)
H0: ρ = 0 (no significant correlation)
Two-tailed test
Ha: ρ 0 (significant correlation)
Larson & Farber, Elementary Statistics: Picturing the World, 3e 398
Hypothesis Testing for ρ
Hours, x 0 1 2 3 3 5 5 5 6 7 7 10
Test score,
96 85 82 74 95 68 76 84 58 65 75 50
y
Test the significance of this correlation coefficient
significant at = 0.01?
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 402
Hypothesis Testing for ρ
Example continued:
H0: ρ = 0 (no correlation) Ha: ρ 0 (significant
correlation)
The level of significance is =
0.01.
Degrees of freedom are d.f. = 12 – 2 = 10.
The critical values are t0 = 3.169 and t0 =
3.169.
The standardized test statistic is
r 0.831 The test statistic falls
t
2 2
1 r 1 ( 0.831) in the rejection region,
n 2 12 2
so H0 is rejected.
4.72.
t
t0 = 3.169 0 t0 = 3.169
Predicted d
3
y-value
x
Each data point di represents the difference between the
observed y-value and the predicted y-value for a given x-value on
the line. These differences are called residuals.
x
x
Larson & Farber, Elementary Statistics: Picturing the World, 3e 414
Variation About a
Regression Line
The total variation about a regression line is the sum of
the squares of the differences between the y-value of
each ordered pair and the mean of y.
2
Total variation y i y
The explained variation is the sum of the squares of
the differences between each predicted y-value and the
mean of y.
Explained variation yˆ y
2
i
Example:
The correlation coefficient for the data that
represents the number of hours students watched
television and the test scores of each student is r
0.831. Find the coefficient of determination.
r 2 ( 0.831)2 About 69.1% of the variation in the
test scores can be explained by the
0.691
variation in the hours of TV watched.
About 30.9% of the variation is
unexplained.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 416
The Standard Error of
Estimate
When a ŷ-value is predicted from an x-value, the
prediction is a point estimate.
An interval can also be constructed.
The standard error of estimate se is the standard deviation of the observed yi -values about the predicted
ŷ-value for a given xi -value. It is given by
(y i yˆi )2
se
n 2
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 423
Prediction Intervals
Construct a Prediction Interval for y for a Specific
Value of x
In Words In Symbols
4. Find the standard error (y i yˆi )2
of estimate se. se
n 2
Left endpoint: ŷ – E
6. Find the left and right Right endpoint: ŷ + E
endpoints and form the Interval: ŷ – E < y < ŷ +
prediction interval. E
Hours, x 0 1 2 3 3 5 5 5 6 7 7 10
Test score,
96 85 82 74 95 68 76 84 58 65 75 50
y
ŷ = –4.07x + 93.97 se 8.11
Example:
A researcher claims that the distribution of favorite
pizza toppings among teenagers is as shown below.
Topping Frequency, f
Each outcome is Cheese 41% The probability for
classified into Pepperoni 25% each possible
categories. Sausage 15% outcome is fixed.
Mushrooms 10%
Onions 9%
Larson & Farber, Elementary Statistics: Picturing the World, 3e 433
Chi-Square Goodness-of-Fit
Test
A Chi-Square Goodness-of-Fit Test is used to test whether
a frequency distribution fits an expected distribution.
To calculate the test statistic for the chi-square goodness-of-
fit test, the observed frequencies and the expected
frequencies are used.
The observed frequency O of a category is the frequency for the category observed in the sample data.
The expected frequency E of a category is the calculated frequency for the category. Expected frequencies are obtained assuming the specified (or
hypothesized) distribution. The expected frequency for the ith category is
Ei = npi
where n is the number of trials (the sample size) and pi is the assumed probability of the ith category.
Continued.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 445
Expected Frequency
Example continued:
Age
Gender 16 – 20 21 – 30 31 – 40 41 – 50 51 – 60 61 and Total
older
Male 32 51 52 43 28 10 216
Female 13 22 33 21 10 6 105
Total 45 73 85 64 38 16 321
F
1 2 3 4
1 2 3 4 5 6
1 161.4 199.5 215.7 224.6 230.2 234.0
27
2 4.21 19.00
18.51 3.35 2.96
19.16 2.73
19.25 2.57
19.30 2.46
19.33
28 4.20 3.34 2.95 2.71 2.56 2.45
29 4.18 3.33 2.93 2.70 2.55 2.43
1 2 3 4 5 6
1 161.4 199.5 215.7 224.6 230.2 234.0
2 18.51 19.00 19.16 19.25 19.30 19.33
3 10.13 9.55 9.28 9.12 9.01 8.94
4 7.71 6.94 6.59 6.39 6.26 6.16
5 6.61 5.79 5.41 5.19 5.05 4.95
The critical
6 value 5.99is F5.14
0 = 4.53. 4.76 4.53 4.39 4.28
7 Larson5.59
& Farber, 4.74 4.35 Picturing
Elementary Statistics: 4.12the World,
3.973e 3.87 459
Two-Sample F-Test for
Variances
Two-Sample F-Test for Variances
A two-sample F-test is used to compare two
population σ12variances
and σ22 when a sample is
randomly selected from each population. The
populations must be independent and normally
distributed.
s12 The test statistic is
F 2
s2
s12 and s22
s12 s22.
where represent the sample variances with
The degrees of freedom for the numerator is
d.f.N = n1 – 1 and the degrees of freedom for the
2
s
denominator is d.f.D = n12 – 1, where n1 is the size of
2
the sample having s2. variance and n2 is the size of
the sample Larson
having variance
& Farber, Elementary Statistics: Picturing the World, 3e 460
Two-Sample F-Test for
Variances
σ12 and σ22
Using a Two-Sample F-Test to Compare
In Words In Symbols
1. Identify the claim. State the State H0 and Ha.
null and alternative
hypotheses.
Identify .
2. Specify the level of
significance. d.f.N = n1 – 1
d.f.D = n2 – 1
3. Identify the degrees of
freedom. Use Table 7 in
Appendix B.
Continued.
4. Determine the critical value.
Larson & Farber, Elementary Statistics: Picturing the World, 3e 461
Two-Sample F-Test for
Variances
σ12 and σ22
Using a Two-Sample F-Test to Compare
In Words In Symbols
5. Determine the rejection
region. s12
F 2
6. Calculate the test statistic. s2
If F is in the
7. Make a decision to reject or rejection region,
fail to reject the null reject H0.
hypothesis. Otherwise, fail to
reject H0.
1. The variance between samples MSB measures the differences related to the treatment given to
each sample and is sometimes called the mean square between.
2. The variance within samples MSW measures the differences related to entries within the same
sample. This variance, sometimes called the mean square within, is usually due to sampling
error.
If F is in the
7. Make a decision to reject or rejection region,
fail to reject the null reject H0.
hypothesis. Otherwise, fail to
reject H0.
Sum of Degrees
Mean
Variation square of F
squares
s freedom
SS B
Between SSB d.f.N MS B MS B MSW
d.f.N
SSW
Within SSW d.f.D MSW
d.f.D
d.f.D = N – k = 20 – 4 = 16