Hypothesis Testing 2: Testing Percentages and Differences of Means
Hypothesis Testing 2: Testing Percentages and Differences of Means
Example
Career unit claims 50% of population of graduates have a job by June. Sample n=30 gives p = 33% Test the hypothesis that careers unit claims are correct. Ho: 0.5 Ha: < 0.5 Significance level = 0.05
Critical value z = -1.65 SE(p) = [(0.5)(0.5)/30] = 0.09 as p = 0.33, converts to z score z = (0.33-0.5)/0.09 = -1.88 z = -1.88 < -1.65 so is in the rejection region Reject Ho. Accept Ha: less than 50% of graduates have a job by June
p=0.33
= 0.5
ps
-1.88 -1.65
=0 Standard Normal
Zs
SE (x1 -x2)
(x1 -x2)s
Standard Normal 1
Zs
-1.96
=0
1.96
SE (x1 -x2)
(x1 -x2)s
Standard Normal 1
Zs
-1.96
=0
1.96
3.32
Interpretation
Rejecting Ho: 1 = 2 The observed difference of sample mean wages is not due to chance We reject the idea that our two samples of daily wages were drawn from the same population of wages with only 1 population mean wage
Interpretation
Accepting H1: 1 2 means that the samples were drawn from 2 different populations of data The observed difference of sample means is significant and reflects a difference in the population means from which they were drawn It is likely that firm 1 pays more than firm 2 on average
USING EXCEL
Use Data Analysis add-in under tools Choose z test: two sample for means or if either sample is smaller than 30 choose t test: two sample assuming equal variances Highlight variable range for first sample Highlight variable range for second sample Choose sig level.
Difference of proportions
A similar test can be done when you have data on 2 samples which give 2 sample proportions or percentages You can test whether the observed difference in proportions is due to chance or due to differences in the populations from which the samples were drawn, the only difference is the standard error of the difference of proportions
SE(p1-p2)
(p1 -p2)s
Zs =0 Standard Normal
An example
Data is available from 2 surveys of the % of households with smokers n1 = 50 n2 = 50 and p1 = 20% p2 = 30% Survey 2 shows a higher % of smokers but is this difference due to chance or due to a difference in the populations from which the 2 surveys were drawn?
SE(p1-p2)
(p1 -p2)s
Zs
-2.58 -1.15
=0 Standard Normal
Interpretation
We accept the null Ho: 1 = 2 There is no difference between the population proportions from which the samples were drawn The apparent difference of proportions is due to chance Both samples are drawn from a population of people with an estimated population of 25% smokers
Final Points
You now know that observed sample data values do not necessarily mean that the population also has this value You also know that observed differences between 2 sample values does not necessarily mean that the difference is significant Be careful, observed patterns from sample data may be due to chance. Test them.
Final Points
You now know that observed sample data values do not necessarily mean that the population also has this value You also know that observed differences between 2 sample values does not necessarily mean that the difference is significant Be careful, observed patterns from sample data may be due to chance. Test them.
Final Points
You now know that observed sample data values do not necessarily mean that the population also has this value You also know that observed differences between 2 sample values does not necessarily mean that the difference is significant Be careful, observed patterns from sample data may be due to chance. Test them.