Estimating Proportions With Confidence: Some Definitions: Details For Proportions
Estimating Proportions With Confidence: Some Definitions: Details For Proportions
Confidence interval example from Fri lecture Remember population versus sample:
Gallup poll of n = 1018 adults found 39% • Population proportion: the fraction of the population
believe in evolution. So p̂ = .39 that has a certain trait/characteristic or the probability
of success in a binomial experiment – denoted by p.
A 95% confidence interval or interval estimate for The value of the parameter p is not known.
the proportion (or percent) of all adults who believe in • Sample proportion: the fraction of the sample
evolution is .36 to .42 (or 36% to 42%). that has a certain trait/characteristic – denoted by p̂ .
The statistic p̂ is an estimate of p.
Confidence interval: an interval of estimates
that is likely to capture the population value. The Fundamental Rule for Using Data for Inference:
Available data can be used to make inferences about a
Goal today: Learn to calculate and interpret confidence much larger group if the data can be considered to be
intervals for p and for p1 − p2 and learn general format. representative with regard to the question(s) of interest.
Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 3 Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 4
by Jessica Utts Feb 2010 by Jessica Utts Feb 2010
Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 5 Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 6
by Jessica Utts Feb 2010 by Jessica Utts Feb 2010
Multiplier and Confidence Level More about the Multiplier
• The multiplier is determined by the desired confidence
level.
• The confidence level is the probability that the procedure
used to determine the interval will provide an interval that
includes the population parameter. Most common is .95.
• If we consider all possible randomly selected samples of Note: Increase confidence level => larger multiplier.
same size from a population, the confidence level is the
fraction or percent of those samples for which the Multiplier, denoted as z*, is the
confidence interval includes the population parameter. standardized score such that the
See picture on board. area between −z* and z* under the
• Often express the confidence level as a percent. Common standard normal curve corresponds
levels are 90%, 95%, 98%, and 99%. to the desired confidence level.
Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 7 Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 8
by Jessica Utts Feb 2010 by Jessica Utts Feb 2010
Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 17 Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 18
by Jessica Utts Feb 2010 by Jessica Utts Feb 2010
Finding the formula for a 95% CI for Same holds for any confidence level;
a Proportion – use Empirical Rule: replace 2 with z*
For 95% of all samples, p̂ is within 2 st.dev. of p pˆ (1 − pˆ )
pˆ ± z ∗
Sampling distribution of p̂ tells us for 95% of all samples: n
where:
−2 standard deviations < p̂ − p < 2 standard deviations
Don’t know true standard deviation, so use standard error. • pˆ is the sample proportion
For approximately 95% of all samples,
−2 standard errors < p̂ − p < 2 standard errors • z* denotes the multiplier.
which implies for approximately 95% of all samples,
pˆ (1 − pˆ )
p̂ – 2 standard errors < p < p̂ + 2 standard errors •. is the standard error of pˆ .
n
Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 19 Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 20
by Jessica Utts Feb 2010 by Jessica Utts Feb 2010
Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 29 Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 30
by Jessica Utts Feb 2010 by Jessica Utts Feb 2010
95% confidence interval Section 10.5: Using confidence
intervals to guide decisions
• A 95% confidence interval for the difference is
.035 to .334 or 3.5% to 33.4%. • A value not in a confidence interval can be rejected
• We are 95% confident that the population as a likely value for the population parameter.
proportions of men and women who would date • When a confidence interval for p1 − p2 does not cover
someone they didn’t find attractive differ by 0 it is reasonable to conclude that the two population
between .035 and .334, with a lower proportion values differ.
for men than for women. • When confidence intervals for p1 and p2 do not
• We can conclude that the two population overlap it is reasonable to conclude they differ, but if
proportions differ because 0 is not in the interval. they do overlap, no conclusion can be made. In that
case, find a confidence interval for the difference.
Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 31 Copyright ©2004 Brooks/Cole, a division of Thomson Learning, Inc., updated 32
by Jessica Utts Feb 2010 by Jessica Utts Feb 2010