Inference About A Population: - Population Mean - Population Proportion
Inference About A Population: - Population Mean - Population Proportion
Inference About A Population: - Population Mean - Population Proportion
•Population mean
•Population proportion
Inference About A Population…
Population
Sample
Inference
Statistic
Parameter
12.5
Example 1
12.6
IDENTIFY
Example 1
Our objective is to describe the population of the amount of
newspaper discarded per household, which is an interval
variable. Thus the parameter to be tested is the population
mean µ.
Example 1
x
The test statistic is: t
s/ n
ν = n −1
data: newspaper
t = 2.2369, df = 147, p-value = 0.0134
alternative hypothesis: true mean is greater than 2
95 percent confidence interval:
2.046905 Inf
sample estimates:
mean of x
2.180405
12.12
Check Required Conditions
The Student t distribution is robust, which means that if the
population is nonnormal, the results of the t-test and
confidence interval estimate are still valid provided that the
population is “not extremely nonnormal”.
25
20
F re q u e n c y
15
10
5
0
0 1 2 3 4
newspaper
Inference: Population Proportion…
When data are nominal, we count the number of occurrences of
each value and calculate proportions. Thus, the parameter of
interest in describing a population of nominal data is the
population proportion p.
12.15
Inference: Population Proportion…
When np and n(1–p) are both greater than 5, the
sampling distribution of is approximately normal
with
mean:
standard deviation:
Hence:
12.16
Inference: Population Proportion
Test statistic for p:
12.18
Example 2
In American presidential elections the candidate who
receives the most votes in a state receives the state’s
entire Electoral College vote.
12.20
IDENTIFY
Example 2
The problem objective is to describe the population of votes in
the state. The data are nominal because the values are
“Democrat” and “Republican.” Thus the parameter to be
tested is the proportion of votes in the entire state that are for
the Republican candidate. Because we want to determine
whether the network can declare the Republican to be the
winner at 8:01 P.M., the alternative hypothesis is
Example 2
The test statistic is
p̂ p
z
p(1 p) / n
12.22
>data2=read.csv("C:\\Users\\TOSHIBA\\Desktop\\eslasca\\ex2_week3_onepop
_prop.csv")
> votes=data2[,1]
> x=407
> n=765
> prop.test(x,n,alternative="greater")
Example 2
At the 5% significance level we reject the null
hypothesis and conclude that there is enough
evidence to infer that the Republican candidate
will win the state.
12.24
INTERPRET
Example 2
One of the key issues to consider here is the cost
of Type I and Type II errors.
12.25
INTERPRET
Example 2
Such an error would mean that a network would
announce at 8:01 P.M. that the Republican has
won and then later in the evening would have to
admit to a mistake.
Example 2
This is exactly what happened on the evening of the
U. S. presidential elections in November 2000.
Example 2
Several hours later they again admitted a mistake and finally
declared the race too close to call.
Fortunately for each network all the networks made the same
mistake.
However, if one network had not done this it would have developed
a better track record, which could have been used in future
advertisements for news shows and likely drawn more viewers.
12.28
Identifying Factors
Factors that identify the z-test and interval
estimator of p:
12.29