Module 7A
Module 7A
Introduction
Hypothesis testing was introduced by Sir Ronald Fisher (1890 – 1962),
Jerzy Newman (1894 – 1981), Karl Pearson (1857 – 1936) and Egon Pearson (Karl
Pearson’s son). Hypothesis testing is a statistical method that is used in making
statistical decisions using experimental data. Hypothesis testing is basically an
assumption that we make about the population parameter. There are three
methods used to test hypothesis by traditional method, p – value method, or by
confidence interval method.
The traditional method will be discussed initially which is the most widely
used especially in statistics books and instructional manuals. The p – value
method becomes popular since the advent of modern computer and high -
powered statistical software’s. The confidence interval is explained after the p -
value and some illustrative examples were discussed to its relationships with
hypothesis testing.
Discussion
I. Procedure in Hypothesis Testing
All hypothesis testing situations start with stating the statistical hypothesis. A
statistical hypothesis is a conjecture about the population parameter. This
conjecture may or may not be true. There are two types of statistical
hypotheses: the null hypothesis and alternative hypothesis.
1|Page
Module 7A: Hypothesis Testing
A. Two Types of Statistical Hypotheses
Null Hypothesis, symbolized by H 0, is a statistical hypothesis testing
that assumes that the observation is due to a chance factor. In
hypothesis testing, null hypothesis is denoted by; H0: μ1 = μ2, which
shows that there is no difference between the two population
means (or parameters).
Alternative Hypothesis, symbolized by H 1, is the opposite of the null
hypothesis; it shows that observations are the result of real effect. It
states that there is a difference between two population means (or
parameter).
B. Level of Significance
In hypothesis testing, the level of significance refers to the degree of
significance in which we accept or reject the null hypothesis. In hypothesis
testing, 100% accuracy is not possible for accepting or rejecting a null
hypothesis. So, we therefore select a level of significance that is usually 1% and
5%. Level of significance is the maximum probability of committing a Type I
error. That is, P(Type I error) = α . This probability is symbolized by α (Greek
letter alpha).
After the significance level is chosen, a critical value is selected from a table
for the appropriate test statistic. The critical value determines the critical and
noncritical regions. The critical value is a value that separates the critical region
from the non-critical region. The critical value or rejection region is the range of
the values of the test value that indicates that there is significant difference and
that the null hypothesis (H0) should be rejected. On the contrary, noncritical or
nonrejection region is the range of the values of the test value that indicates that
the difference was probably due to chance and that the null hypothesis (H 0)
should be rejected.
Another table below represents some common phrases that are used in
hypotheses, conjunctures, and its corresponding symbols. The table served as a
guide in translating verbal conjunctures into mathematical symbols or
representations.
Is equal to Is not equal to
= Is the same as ≠ Is not the same
Is exactly the same as Is different from
2|Page
Module 7A: Hypothesis Testing
Is increased Is decreased
> Is greater than < Is less than
Is higher than Is lower than
Is at least Is at most
≥ Is not less than ≤ Is not more than
Is greater than or equal Is less than or
to equal to
D. The Critical Value Approach to Hypothesis Testing
The observed value of the statistic (sample observation) is compared to
critical values (population observation). These critical values are expressed as
standard z values.
Illustration of Critical
Values for α = 0.05
A Type I error occurs if one rejects the null hypothesis when it is true. In
hypothesis testing, Type I error is denoted by alpha ( α ). In hypothesis testing,
the normal curve that shows the critical region is called the alpha region.
A Type II error occurs if one does not reject the null hypothesis when it is
false. In hypothesis testing, the normal curve that shows the acceptance region
is called the beta region.
The following are the steps in conducting a hypothesis testing:
1. State the null hypothesis (H0) and the alternative hypothesis (H 1).
2. Choose the level of significance, α , and the sample size.
3. Determine the test statistic and sampling distribution.
4. Determine the critical values that divide the rejection and nonrejection
regions.
5. Collect the data and compute the value of the test statistic.
6. Make a statistical decision.
7. State the conclusion.
3|Page
Module 7A: Hypothesis Testing
The p – value (or probability value) is the probability of getting a sample
statistic or a mean extreme sample statistic in the direction of the H 1 when the
H0 is true. We cam also say that p – value is the actual area under the standard
normal distribution curve representing the probability of a particular sample
statistic or a more extreme sample occurring if the H 0 is true.
The p – value method for testing hypotheses differs from traditional method
in some parts. The steps for p – value method are:
1. State the null hypothesis (H0) and the alternative hypothesis (H 1).
2. Choose the level of significance, α , and the sample size.
3. Determine the test statistic and sampling distribution.
4. Compute the test value.
5. Determine the p – value.
6. Make a statistical decision.
7. State the conclusion.
There is a different approach in the decision rule when using p – value method.
If p – value ≤ α , reject the H0, and if the p – value > α , do not reject the
H0.
There are also some important guidelines for p – values.
If p – value ≤ 0.01, reject the H0, thus difference is highly significant.
If p – value > 0.01 and p – value ≤ 0.05, reject the H 0, thus difference is
significant.
If p – value > 0.05 and p – value ≤ 0.10, consider a consequence of
Type I error before rejecting the H0, thus difference is significant.
If p – value > 0.10, do not reject the H 0, thus difference is not
significant.
4|Page
Module 7A: Hypothesis Testing
x−μ
z= Formula 7A – 1
σ /√n
x−μ
z= Formula 7A – 2
s/√n
where: z = one sample z test σ = population standard deviation
x = sample mean s = sample standard deviation
μ = population mean n = number of observation in the sample
The denominator σ / √ n or s/ √ n represents the standard error of the mean. The
term standard error of the mean may sound uncomfortable, but it is to be
interpreted in the same way as any other standard deviation. We could just as
well say “the standard deviation of the mean”. The important thing is that
standard error of the mean means the standard deviation of the sampling
distribution of the means.
x=
∑x where: x = sample mean n = number observation in
n
the sample
Note: Calculate the sample standard deviation if σ is unknown.
4. Calculate the value of the one sample z test by using the Formula 7A – 1 if
σ is known, otherwise use Formula 7A – 2.
5. Statistical decision for hypothesis testing
If z computed < z critical, do not reject H0.
If z computed ≥ z critical, reject H0.
6. State the conclusion.
x=
∑x
n
s=
√ ∑ (x−x )2
n−1
where: x = sample mean
s = sample standard deviation
n = number observation in the sample
4. Calculate the value of the one sample t test by using
x−μ
t= Formula 7A – 3
s/√n
where: t = one sample t – test
x = sample mean
μ= population mean
s = sample standard deviation
n = number observation in the sample
5. Statistical decision for hypothesis testing
6|Page
Module 7A: Hypothesis Testing
If t computed < t critical, do not reject H0.
If t computed ≥ t critical, reject H0.
6. State the conclusion.
7|Page
Module 7A: Hypothesis Testing
Do not reject the null hypothesis at α = 0.01, since -2.303 > -2.878
6. State the conclusion.
Since we fail to reject the null hypothesis, we can conclude that the
starting salary of civil engineers is Php. 18, 000.
Example 3: A recent survey done by Philippine Housing Authority found that 35%
of the population owns their homes. In a random sample of 240 heads of
households, 78 responded that they owned their homes. At the 0.01 level of
significance, does that indicate a difference from the national proportion?
Solution:
Given: x = 78 n = 240 p = 35% = 0.35
Exercises
Answer the following. Show your complete solution.
1. A researcher claims that 20-year-old women on a special diet will have an
average weight of 110 pounds. A sample of 15 women has an average
weight of 112.5 pounds and a standard deviation of 5 pounds. At α = 0.01,
can the claim be rejected? Also, find the 99% confidence interval of the
true mean. (20pts)
Prepared by:
9|Page
Module 7A: Hypothesis Testing