Ch5 MMW BSN
Ch5 MMW BSN
Ch5 MMW BSN
Disclaimer
Chapter 5
HYPOTHESIS TESTING
Introduction
A test of hypothesis is a method of using sample data to decide between two
competing claims (hypotheses) about a population characteristic. It concerns itself with
the decision-making rules for choosing alternatives while controlling and minimizing
the risks of wrong decisions.
Once a hypothesis has been formulated, we need a method for evaluating the
sample data to determine whether Ho should be accepted or rejected. The decision
rule that we use for this purpose is called a test procedure.
The hypothesis testing procedure is simple a decision rule that specifies for
every possible value of a statistic observable in a simple random sample of size n
whether the null hypothesis should be rejected or accepted. The set of possible values
of the sample statistic is referred to as the sample space. Therefore, the test procedure
divides the sample space into mutually exclusive parts called the acceptance region
and the rejection (or critical) region.
In carrying out a test of null hypothesis versus alternative hypothesis, the null
hypothesis will be rejected in favor of the alternative hypothesis only if the sample
evidence strongly suggests that Ho is false. If the sample does not contain such
evidence, Ho will not be rejected.
Two basic types of decision problems can be considered for hypothesis testing
procedures. The first problem is one in which we want to know whether the population
parameter has changed or differs from a particular value. Here, we are interested in
detecting whether the population parameter is either larger or smaller than a particular
value. The second type of hypothesis test is one in which we want to find out whether
1) the sample came from a population that has a parameter less than the hypothesized
value, or 2) from a population that has a parameter more than a hypothesized value.
These situations give rise to a one-tailed test. The type of test in which we want to
determine whether a population parameter has changed is referred to as two-tailed
test, since the null hypothesis can be rejected by observing a statistic that falls in either
of the two tails of the sampling distribution.
In summary, a one-tailed test of hypothesis is a test where the alternative
hypothesis specifies a one-directional difference from the parameter of interest while
a two-tailed test specifies a difference that can fall in either of the two tails of the
distribution. In other words, based on the alternative hypothesis, the one-tailed test is
directional, while the two-tailed test is non-directional.
What is a Hypothesis?
A hypothesis is a claim or statement about the population parameter
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes
Example of Hypothesis
1. The mean body temperature for patients admitted to elective surgery is not
equal to 370C.
The parameter of interest here is μ which is the mean body temperature for
patients admitted to elective surgery.
2. The proportion of registered voters in Quezon City favoring Candidate A
exceeds 0.60.
The parameter of interest here is p which is the proportion of registered voters
in Quezon City favoring Candidate A.
Null Hypothesis
denoted by Ho
the statement being tested
it represents what the experimenter doubts to be true
must contain the condition of equality and must be written with the symbol =,
≤, or ≥.
The null hypothesis corresponding to the common belief that the mean body
temperature is 37°𝐶 is expressed as
Ho: 𝜇 = 37°𝐶
We test the null hypothesis directly in the sense that we assume it is true and
reach a conclusion to either reject Ho or fail to reject Ho.
Alternative Hypothesis
denoted by Ha
Is the statement that must be true if the null hypothesis is false
the operational statement of the theory that the experimenter believes to be true and
wishes to prove
Is sometimes referred to as the research hypothesis
a. If you believe that your brand of refrigerator lasts longer than the mean of 14 years for
other brands, state the claim that > 14 , where 𝜇 is the mean life of your refrigerators.
b. An engineer hypothesizes that the mean number of defects can be decreased in a
manufacturing process of compact disks by using robots instead of humans for certain
tasks. The mean number of defective disks per 1000 is 18.
c. A psychologist feels that playing soft music during a test will change the results of the
test. The psychologist is not sure whether the grades will be higher or lower. In the
past, the mean of the scores was 73.
Solution:
a. 𝐻𝑜 : 𝜇 = 14 and 𝐻𝑎 : 𝜇 > 14
b. 𝐻𝑜 : 𝜇 = 18 and 𝐻𝑎 : 𝜇 < 18
c. 𝐻𝑜 : 𝜇1 = 73 and 𝐻𝑎 : 𝜇1 ≠ 73
Some Notes:
In this context of trying to support the goal of the research, the alternative hypothesis
is sometimes referred to as the research hypothesis.
Also in this context, the null hypothesis is assumed true for the purpose of conducting
the hypothesis test, but it is hoped that the conclusion will be rejection of the null
hypothesis so that the research hypothesis is supported.
There are two kinds of errors that can be made in significant testing. The type I error
(alpha error), considers a conclusion that is drawn stating that the null hypothesis FALSE
when in fact it is TRUE. This means that a true hypothesis can be incorrectly rejected. This
probability of a Type I error is called the significance level and is usually set by a researcher.
On the other hand, the type II error (beta error) occurs when the hypothesis is not rejected
when in fact it is false.
Level of Significance
Test Statistic
A test statistic is a quantity calculated from the sample data. It is used to decide
whether or not the null hypothesis should be rejected in a hypothesis test. The choice of the
test statistic will depend on the assumed probability model and the hypothesis under
consideration. The Z-test, t-test, chi-square test, F-test, and ANOVA are examples of test
statistic.
The critical or tabular value for a hypothesis test is a threshold to which the
value of the test statistic in a sample is compared to determine whether or not the null
hypothesis is rejected. The critical value can be on the right side of the mean or on the
left side of the mean for a one-tailed test. Its location depends on the inequality sign
of the alternative hypothesis. For example, in situation A, where the alternative
hypothesis is 𝐻𝑎 : 𝜇 > 14. Since the inequality sign is >, the null hypothesis will be
rejected only when the sample mean is significantly greater than 14. Hence, the critical
value must be on the right side of the mean.
The critical region or rejection region is a set if values if the test statistic for which the
null hypothesis is rejected in a hypothesis test; that is, the sample space for the test statistic
is partitioned into regions. The critical region will lead us to reject the hull hypothesis and
accept the other region represented by the alternative hypothesis. Therefore, if the observed
values of the test statistic is a member of the critical region, we conclude “reject Ho.” If it is not
a member of the critical region, then we can conclude “do not reject Ho.”
The p-value - the smallest level of significance at which Ho will be rejected based on
the information contained in the sample.
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes
An alternative form of decision rule (based on the p-value) is to reject Ho if the p-value
is less than or equal to the level of significance (𝛼).
p-value Decision
0.01 Reject Ho
0.05 Reject Ho
0.10 Do not reject Ho
Figure 1
Procedure for
Finding p value
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes
*Important Note:
Accept/Fail to Reject
A few textbooks continue to say “accept the null hypothesis” instead of “fail to reject
the null hypothesis.” The term accept is somewhat misleading, because it seems to imply
incorrectly that the null hypothesis has been proved, but we can never prove a null hypothesis.
The phrase fail to reject says more correctly that the available evidence isn’t strong enough to
warrant rejection of the null hypothesis. In this text we use the terminology fail to reject the null
hypothesis, instead of accept the null hypothesis.
Figure 2
Wording of Final Conclusion
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes
Exercise 5A
1. Define null and alternative hypotheses, and give an example of each.
2. What symbols are used to represent the null hypothesis and the alternative
hypothesis?
3. What is meant by a type I error? A type II error? How are they related?
4. In hypothesis testing, why can’t the hypothesis be proved true?
5. For each conjecture, state the null and alternative hypotheses.
a. The average age of community college students is 24.6 years.
b. The average age of attorneys is greater than 25.4 years.
c. The average score of high school basketball games is less than 88.
d. The average pulse rate of male marathon runners is less than 70 beats
per minute.
e. The average weight loss for a sample of people who exercise 30 minutes
per day for 6 weeks is 8.2 pounds.
6. Given the following value of alpha (𝛼), decide whether to reject or do not reject the null
hypothesis (H o)
If the sample size is 30 or morte sample units, we shall use the z-test formulas, as
follows:
A. z-Test using One-sample Mean
(𝑥̅ −𝜇)√𝑛
𝑧=
𝑠
𝑥̅ 1 −𝑥̅ 2
𝑧=
𝑠 2 𝑠 2
√1 +2
𝑛1 𝑛2
Example:
A manufacturer claims that the average life of batteries used in their electronic games
is 150 hours. It is known that the standard deviation of this type of battery is 20 hours. A
consumer wishes to test the manufacturer’s claim and accordingly tests 100 electronic games
using this battery and found out that the mean life is equal to 169 hours. Test the hypothesis
by using a level of significance 𝛼 = 5%.
Solutions:
Given:
𝜇 = 150 hours 𝑥̅ = 169 hours
𝑠 = 20 hours 𝑛 = 100 electronic games
Since the population standard deviation is known and the sample size is greater than
30, the z-test will be used.
To have a better understanding of the application of this test, we shall present both
the one-tailed and two-tailed tests.
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes
Step 5. Decision Rule: Reject Ho if computed z-value is greater than or equal to the tabular
value. Otherwise, do not reject.
Decision: Since the computed value is greater than the tabular value, we reject the
null hypothesis.
Step 6. Conclusion: The manufacturer’s claim that the average life of its batteries at 150
hours is not true at 𝛼 = 0.05.
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes
The bank wishes to test the null hypothesis that the two neighborhoods are the same
mean income. What should the bank conclude? Test the hypothesis using 𝛼 = 0.05.
Solutions: For the purpose of discussion, we present both the one-tailed and the two-tailed
tests in solving the problem.
10,800−10,300
= 2 2
√300 +400
100 100
500
=
√900+1,600
500
=
50
Computed 𝑧 = 10
Step 5. Decision: Since the computed value is greater than the tabular value, we reject the
null hypothesis.
Step 6. Conclusion: The average family income of neighborhood A is higher than that of
neighborhood B at 𝛼 = 0.05.
10,800−10,300
= 2 2
√300 +400
100 100
500
=
√900+1,600
500
=
50
Computed 𝑧 = 10
Step 5. Decision: Since the computed value is greater than the tabular value, we reject the
null hypothesis.
Step 6. Conclusion: The two neighborhoods, A and B, do not have the same average family
income at 𝛼 = 0.05.
When the sample size is less than 30 sample units, we shall use the t-test formulas,
as follows.
Example:
The personnel department of a company developed an aptitude test for a certain group
of semi-skilled workers. The individual test scores were assumed to be normally distributed.
The department asserted a tentative hypothesis that the arithmetic mean grade obtained by
this group of semi-skilled workers is 100. It was agreed that this hypothesis would be subjected
to a two-tailed test at 5% level of significance. The aptitude test was given to a sample of 16
semi-skilled workers and the results are:
𝑥̅ = 94 𝑠=5 𝑛 = 16 𝜇 = 100
Is the personnel department’s tentative hypothesis correct at 𝛼 = 5%?
Solution: Since the sample size is less than 30 units, we shall use the t-test using one-sample
mean.
Step 1. State the null and alternative hypothesis.
Ho: The arithmetic mean obtained by the semi-skilled workers is 100.
(𝐻𝑜 : 𝜇 = 100)
Ha: The arithmetic mean obtained by the semi-skilled workers is not equal to 100.
(𝐻𝑎 : 𝜇 ≠ 100)
Step 2. Specify the level of significance. Use 𝛼 = 0.05.
Step 3. Determine the test statistic to be used and the degrees of freedom (df)
𝑑𝑓 = 𝑛 − 1 (for one-sample mean)
𝑑𝑓 = 𝑛1 + 𝑛2 − 2 (for two groups of samples)
The t-test using one-sample mean shall be used since 𝑛 = 16 (𝑛 < 30). At 𝛼 = 0.05,
and 𝑑𝑓 = 𝑛 – 1 = 16 – 1 = 15, the t-tabular value is ±2.131 for a two-tailed test.
Step 4. Compute the value of t.
(𝑥̅ −𝜇)√𝑛
Computed 𝑡 =
𝑠
(94−100)√16
=
5
−6(4)
=
5
Computed 𝑡 = −4.80
Example: A taxi company is trying to determine whether the use of radial tires and
belted tires provide the same fuel consumption. Twelve cars were driven twice over a
prescribed test course; and for each test a car used a different type of tire (radial or belted) in
random order. The data obtained were recorded, as follows:
Type of Wire 𝑥̅ s
Radial 5.75 1.10
Belted 5.61 1.30
At 0.05 level of significance, can we conclude that cars equipped with radial tires and
belted tires provide the same fuel consumption? Assume that the data are normally distributed.
Solution:
Step 1. State the null and alternative hypothesis.
Ho: There is no significant difference on the fuel consumption of cars using radial and
belted tires.
(𝐻𝑜 : 𝜇1 = 𝜇2 )
Ha: There is no significant difference on the fuel consumption of cars using radial and
belted tires.
(𝐻𝑎 : 𝜇1 ≠ 𝜇2 )
Step 2. Specify the level of significance. Use 𝛼 = 0.05.
Step 3. Determine the test statistic to be used and the degrees of freedom (df).
The t-test using two-sample mean shall be used since 𝑛1 and 𝑛2 are both 12 , (n < 30)
for a two-tailed test. At 𝛼 = 0.05 using the degrees of freedom of 22 based on 𝑑𝑓 = 𝑛1 +
𝑛2 – 2 = 12 + 12 – 2 = 22, we obtain the t-tabular value of ±2.074 for a two-tailed test.
𝑥̅ 1 −𝑥̅ 2
Computed 𝑡 =
[𝑠 2 (𝑛 −1)]+[𝑠22 (𝑛2−1)] 1 1
√[ 1 1 ][𝑛 +𝑛 ]
[𝑛1 +𝑛2 ]−2 1 2
5.75−5.61
=
[(1.10)2 (12−1)]+[(1.3)2(12−1)] 1 1
√[ ][12+12]
[12+12]−2
0.14
=
13.31+18.59 1
√[ ][6]
22
0.14
=
31.90 1
√[ ][ ]
22 6
0.14
=
√(1.45)(0.1667)
0.14
=
√0.2417
0.14
=
0.4916
Computed 𝑡 = 0.2848
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes
Step 5. Decision
Since the |computed 𝑡 = 0.2848| < |𝑡𝑎𝑏𝑢𝑙𝑎𝑟 𝑣𝑎𝑙𝑢𝑒 = ±2.131|, we shall not
reject the null hypothesis.
Step 6. Conclusion
There is no significant difference on the fuel consumption of cars using the
radial tires and the belted tires at 𝛼 = 0.05, with 𝑑𝑓 = 22.
Exercise 5B
A. z-test Using One-Sample Mean
2. In a particular community, it is claimed that the mean household water usage for a
particular month is 45 cubic meters. The following year, a countrywide water
conservation campaign was conducted. Fifty homes were randomly selected and
found that the mean consumption is 47 cubic meters with a standard deviation of 2.5
cubic meters. Is there enough evidence to accept the claim that the mean household
water usage per month is 45 cubic meters at 0.01 level.
determine if there is a significant difference in job satisfaction between the two groups
of workers. Use 0.01 level.
Work Schedule 𝑥̅ s n
Fixed Shift 7.29 3.12 49
Rotating Shift 6.08 2.96 42
Gender 𝑥̅ s n
Men 20 35.5 11.9
Women 25 36.6 12.3
Republic of the Philippines
CATANDUANES STATE UNIVERSITY
College of Arts and Sciences
Virac, Catanduanes