Chapter 8: Hypothesis Testing For Population Proportions
Chapter 8: Hypothesis Testing For Population Proportions
Population Proportions
Testing a claim
● Earlier we calculated a 95% confidence interval for the
true population proportion of UCLA students who
travelled outside the US.
(0.26, 0.44)
● A 95% confidence interval of 26% to 44% means that
● We are 95% confident that the true population
● p = 0.34 or p = 0.36?
● p = 0.25 or p = 0.45?
hypothesis
Hypothesis Testing for One Proportion
Research conducted a few years ago showed that 35% of UCLA students
had travelled outside the US. UCLA has recently implemented a new
study abroad program and results of a new survey show that out of the
100 randomly sampled students 42 have travelled abroad. Is there
significant evidence to suggest that the proportion of students at UCLA
who have travelled abroad has increased after the implementation of the
study abroad program?
● Population proportion used to be 0.35.
● New sample proportion is 0.42.
● We are testing the claim that the population proportion is now
greater than 0.35.
● The new data indicate that it may be since 0.42 is clearly grater than
0.35.
● But is this difference statistically significant, i.e. are the data
inconsistent enough?
● We do a formal hypothesis test to answer this question.
Setting Up Hypotheses
● Null hypothesis, denoted by Ho, specifies a population model
parameter of interest and proposes a value for that parameter
(p).
Ho: p = 0.35
● Alternative hypothesis, denoted by HA, is the claim we are
testing for.
HA: p > 0.35
● Even though we are testing for the alternative hypothesis, we
check to see whether or not the null hypothesis is plausible.
● If the null hypothesis is not plausible, we reject the null
hypothesis and conclude that there is sufficient evidence to
support the alternative. If the null hypothesis is plausible, we fail
to reject the null hypothesis and conclude that there isn't
sufficient evidence to support the alternative.
Why do we check if Ho is plausible and
not if HA is plausible?
● Think about the logic of jury trials:
● To prove someone is guilty, we start by assuming they are innocent.
● We retain that hypothesis until the facts make it unlikely beyond a
reasonable doubt.
● Then, and only then, we reject the hypothesis of innocence and declare
the person guilty.
● The same logic used in jury trials is used in statistical tests of hypotheses:
● We begin by assuming that the null hypothesis is true.
● Next we consider whether the data are consistent with this hypothesis.
● If they are, all we can do is retain the hypothesis we started with. If
they are not, then like a jury, we ask whether they are unlikely beyond
a reasonable doubt.
A Trial as a Hypothesis Test
● Hypothesis testing is very much like a court trial.
● This doesn't prove the hypothesis but we can say that the
data support it.
● Ho: Defendant is innocent
HA: Defendant is guilty
● We then present the evidence - collect data.
● Then we judge the evidence - “Could these data plausibly have
happened by chance if the null hypothesis were true?”
● If they were very unlikely to have occurred, then the evidence
raises more than a reasonable doubt in our minds about the
null hypothesis.
● Ultimately we must make a decision. How unlikely is unlikely?
A Trial as a Hypothesis Test
● If the evidence is not strong enough to reject the presumption of
innocent, the jury returns with a verdict of “not guilty”
● The jury does not say that the defendant is innocent.
● All it says is that there is not enough evidence to convict, to
reject innocence.
● The defendant may, in fact, be innocent, but the jury has no
way to be sure.
● Said statistically, we fail to reject the null hypothesis.
● We never declare the null hypothesis to be true, because we
simply do not know whether it's true or not.
● Therefore we never“accept the null hypothesis”
A Trial as a Hypothesis Test
● In a trial, the burden of proof is on the prosecution.
● In a hypothesis test, the burden of proof is on the
unusual claim.
● The null hypothesis is the ordinary state of affairs (the
status quo), so it's the alternative hypothesis that we
consider unusual (and for which we must gather
evidence).
How do we determine if Ho is plausible?
● In Statistics we can quantify our level of doubt.
● How unlikely is it to get a random sample of 100
Note: The p comes from the null hypothesis, i.e. it is value proposed for
the parameter in the null hypothesis.
Check the Conditions for the CLT
● Random and Independent: The sample is collected
randomly and the trials are independent of each other.
● Large Sample:
● The sample has at least 10 successes, np ≥ 10, and
● HA: p ≠ po
where
alternatively... Ho: pM − pF = 0
HA: pM − pF ≠ 0
Check the Conditions
The conditions are the same as in the CI for two proportions. We have
already checked these and found that the sampling distributions for
male and female voters follow a normal distribution.