CH-3 HypothesisTesting
CH-3 HypothesisTesting
HYPOTHESIS TESTING
Introduction
Most statistical inference revolves around the parameters of a population. In Chapter II,
Estimation, we used the information obtained in a simple random sample to construct a
confidence interval estimate of the unknown value of a population parameter. In this
chapter (in hypothesis testing), we start with an assumed value of a population
parameter. Then sample evidence is used to test the reasonableness of hypotheses; i.e
to decide whether the assumed value is reasonable it should be accepted or whether it's
reasonable unreasonable it should be rejected; hence the statistical inferences made in
this chapter are referred to as hypothesis testing.
Example
1. The manager of a hotel has stated that the mean guest bill for a weekend is Birr 400
or less. A member of the hotel’s accounting staff has noticed that the total charges
for guest bills have been increasing in recent months. The accountant will use a
sample of weekend guest bills to test the manager’s claim. State the null and
alternative hypotheses
Solution:
Ho: μ Birr 400
Ha: Birr 400
2. Production workers at XY Company have been trained in their jobs by using two
different training programs. The company training director would like to know
whether there is a difference in mean productivity for workers trained in the two
programs. Required; Develop the null and alternative hypotheses.
Solution
Ho: 1= 2 or 1- 2 =0
Ha: 12 1-2 0
3. The manager at a drug store claims that the company’s employees are honest.
However, there have been many shortages from the cash register lately. Required:
Specify the null and alternative hypothesis
Solution:
Ho: Employees are honest
Ha: Employees are dishonest
Leads to two – tailed test leads to a right – tailed test Leads to a left
tests
“In many vsituations, the choice of Ho and Ha isn't obvious; in such cases, judgment on the part of the user is needed to
select the proper farm of Ho and Ha. However, the equality part of the expression (either =, or ) always appears in
the null hypothesis.
Example; in a given firm manager’s claims that there is no difference between the
mean income of males and the mean income of females.
Ho: there is no difference b/n the mean income of males & the mean income of females.
Ha: there is a difference in the mean income of males & the mean income of females.
If Ho is rejected and Ha accepted the mean income of males could be greater than or
less than that of females or vis versa. To accommodate these two possibilities, the 5%
level of significance representing the area of rejection is divided equally in to two tails of
the sampling distribution. If the level of significant is 0.05 each rejection region will have
0.025 probability. Note that the total area under the normal curve is one found by 0.95
+ 0.025 + 0.025. The standardized z-value (critical points) at the 0.05 level of
significance is ±1.96
Z/2=0.025 Z0.025=±1.96
Non-rejection
Rejection region Region or do not reject H 0 Rejection region
0.95 Probability
-1.96 0 +1.96
Z
0.025 Probability 0.025 Probability
A. Right–tailed/upper–tailed test: a right–tailed test will reject the null hypothesis if the
4
sample statistic is significantly higher than the hypothesized population parameter.
B. Left–tailed/lower–tailed test: a left–tailed test will reject the null hypothesis if the
sample statistic is significantly lower thanthe hypothesized population parameter.
Non-rejection
Region or do not reject H 0 Rejection region
Scale
of Z
0 1.6 45
0.95 Probability 0.05 Probability
Initial Value
The above chart portrays the rejection region for a test of significance. The level of
significance selected is 0.05.
The area where the Ho is not rejected includes the area to the left of 1.645
The area of rejection is to the right of 1.645
The 0.05 level of significant was chosen
The sampling distribution is for the test statistic Z , the standard normal deviate.
The value 1.645 separates the regions where the Ho is rejected & where it is not
rejected.
The value 1.645 is called the critical value. It is the corresponding value of the test
statistic for the selected level of significance i.e. z-value at the 0.05 level of
significance is 1.645.
5
One–Tailed Left–tailed/lower–tailed) tests of significance at 5% level of significance;
Non-rejection
Rejection region Region or do not reject H 0
0.95 Probability
Z
-1.6 45 0
0.05 Probability 0.95 Probability
Initial Value
The region of rejection is only in one tail of the curve. The above example indicates that
the region of rejection is in the left (lower) tail of the curve.
II. State the distribution (Z-table or t-table), type of tailed test, level of significance,
III. State/Determine the decision rule: Find the level of significance (z or t value
accordingly) from the table and determine the decision rule,
Example:
6
1. Matador-Addis Tyre Share Company claims that its tires have a mean life of 35,000
miles. A random sample of 36 of these tires is tested if the sample mean in 33,000
miles. Assume that the population standard deviation is 4,500 miles and the lives of
tires are approximately normally distributed. Test the share company’s claim using
a 5% level of significance.
Solution
1.Ho:=35,000 miles 2. Z–distribution, two tailed
Ha:≠35,000 miles test
3.=0.05 4. =33,000 miles
/2=0.025 =4,500 miles
Z0.025=±1.96 n= 36 tires
Reject Ho if |Sample Z|>1.96 SampleZ=?
Reject Ho, if the computed
value sample Z doesn't fall
between the critical value
±1.96
2. A fertilizer company claims that the useofits product will result in a yield of atleast
35 quintals of wheat per hectare, on average, Application at the fertilizer to a
randomly selected 36 sample hectares resulted in a yield of 34 quintals per hectare.
Assume the population standard deviation is 5 quintals and yields per hectare are
normally distributed. Test the company’s claim at 1% levelof significance.
Solution
1. Ho: ≥35 quintals
Ha: 35 quintals
7
2. Z–distribution, left–tailed test
3. =0.01
Z=Z0.01=2.33
Reject Ho if sample Z-2.33
4. =34 quintals
n=36
=5quintals
Sample Z =?
Example:
1. A contractor assumes that construction workers are idle for 75 minutes or less
perday. A random sample of 25 construction workers was taken and the mean idle
time was found to be 84 minutes per day with a sample standard deviation of 20
minutes. Assume that the population is approximately normally distributed, use a
5% level of significance to test the contractor’s assumption.
Solution
8
1. Ho:≤75
minutes Ha:75
minutes
2. t–distribution, Right–tailed test
3. =0.05
n=25
=n–1=25–1=24
t,= t0.05,24= 1.711
Reject Ho if sample t1.71
4. n=25
=84
minutes S
=20 minutes
Sample t =?
5. Reject Ho; because + 2.25 1.71 workers are idle for more than 75
minutes per day.
2. A director of a secretarial school claims that its graduates can type at least 50
words per minute on average. Suppose you want to hire some of these graduates if
the director’s claimis true; and you test the typing speed of18 of the graduates and
obtain a mean of 40 wards per minute with a sample variance of 720. Assuming the
typing speed for the graduates of the secretarial school is normally distributed, test
the director’s claim and decide whether to hire the graduates or not, using a 5%
level of significance.
Solution
1. Ho:50
words Ha:50
words
2. t– distribution, Left–tailed test
3. =0.05
9
n=18
=n–1
=18–1=17
t,=t0.05,17=1.74
Reject Ho if sample t-1.74
4. = 40 words
n=18
s2=720
Samplet=?
A proportion is a value between 0 & 1 that expresses the part of the whole that
possesses a given characteristic. The formula (methods) for proportions based on the
central limit theorem make possible the testing of hypotheses about the population
proportion in a manner similar to that of the formul used to test sample means. Similar
to that of hypothes is testing about a population mean, hypothesis testing about a
population proportion has three terms.
1. Ho:P=y 2. Ho:Py 3. Ho:Py
Ha:Py Ha:Py Ha:Py
Example:
1. A magazine claims that 25% of its readers are college students. A random sample of
200 readers is taken. It is found that 42 of these readers are college students. Use a
10% level of significance and test the magazine’s claim.
Solution
1. Ho:
P=0.25
Ha:
P0.25
2. Z–distribution; two tailed test
10
3. =0.1 /2=0.05
Z/2=Z0.05=1.64
Reject Ho if |sample Z| 1.64
4. n=200
x=42
=0.21
SampleZ=?
2. An Economist states that more than 35% of Addis’s labor force in unemployed. You
don’t know if the econom ist’s estimateis too high or too low. Thus, you want to test
the economist’s claim using a 5% level of significance. You obtain a random sample
of 400 people in the labor force, of whom 128 are unemployed. Would you reject
the economist’s claim?
Solution
1. Ho:P>0.35
Ha:P≤0.3
5
2. Z–distribution, left-tailed test
3. =0.05
Z0.05=-1.64
Reject Ho if sample Z ≤ -1.64
4. n=400
x=128
=0.32
SampleZ=?
3.6 GLOSSARY
Hypothesis: A statement about the value of population parameter.
11
Hypothesis testing: A procedure based on sample evidence and probability theory used
to determine whether the hypothesis stated is a reasonable statement and cannot be
rejected or is unreasonable and should be rejected.
Null hypothesis: Is a pre assumed statement about the population, which may be
accepted or rejected on the basis of a sampled data.
Alternative hypothesis: Is a hypothesis which we are inclined to accept in the event the
null hypothesis is rejected on the basis of a prescribed test.
Level of significance: The probability of rejecting the null hypothesis when it is actually
true or the probability of committing type I error.
Type I error: Rejecting the null hypothesis, HO, when it is actually true.
Type II error: Accepting the null hypothesis when it is actually false.
Test statistic: A value, determined from sample information, used to determine whether
or not to reject the null hypothesis
Critical value: The dividing point between the region where the null hypothesis is
rejected and the region where it is not rejected.
One-tailed-test: Is used when the rejection region is only in one tail or when the
alternative hypothesis states a direction. Eg. H 1: > 40 or H1 : < 40
Two-tailed-test: Used when the rejection region is in each tail (both tails) or used when
the alternative hypothesis does not state a direction. Eg. H 1 : 40
12