Hypothesis Testing
Hypothesis Testing
Definition: Any statement made on the distribution of a population, which is either true or false, is
called a statistical hypothesis.
Remark: Unless one is given with the population, it cannot be determined with certainty whether a
hypothesis is true or false. Thus, in any test of hypothesis there are two contradicting statements. The
objective of the test is to determine which of these two statements is more acceptable as true.
Definition: The statement which is being tested is called the null hypothesis. In the event that the null
hypothesis is rejected, the alternative hypothesis is the one that is accepted in place of the null
hypothesis. The null and alternative hypotheses shall be denoted by Ho and Ha, respectively.
Remark:
1. The null and the alternative hypotheses must be non-overlapping so that whatever outcome of
the test of hypothesis, there will be no confusion so as to the state of the population that has
been accepted in the face of evidence coming from the sample.
2. Due to the fact that the true state of the nature is unknown, one is taking the risk of committing
an error or a wrong decision when conducting a test of hypothesis.
2. Determine the test statistics and the tail in the distribution of the test statistics where the
rejection region is to be located. From the direction of the inequality in the alternative
hypothesis, a test is classified as one-tailed if inequality is one-directional so that the rejection
region is only in one tail of the distribution. Otherwise, the test is two-tailed. That is, there is a
rejection region in each of the two tails of the distribution of the test statistic.
3. For a specified level of significance, state the decision rule. That is specify foe which values of
the test statistic Ho is to be rejected. This should be consistent with the alternative hypothesis.
4. Compute for the value of the test statistic for the available sample data
Teaching Performance
MALE (x1) FEMALE (x2) (x1)2 (x2)2
4.35 4.18 18.9225 17.4724
4.06 4.49 16.4836 20.1601
3.93 4.49 15.4449 20.1601
4.22 4.74 17.8084 22.4676
4.00 4.31 16.0000 18.5761
4.39 4.48 19.2721 20.0704
4.19 3.93 17.5561 15.4449
3.67 4.69 13.4689 21.9961
3.67 3.07 13.4689 9.4249
3.62 4.42 13.1044 19.5364
∑x1=40.1 ∑x2=42.8 ∑(x1)2=161.5298 ∑(x2)2=185.309
n(n − 1) n(n − 1)
otherwise reserve judgment.
10(161.5298) - (40.1)(40.1) 10(185.309) − (42.8)(42.8)
= =
Test Statistic 10(9) 10(9)
t = −1.516 = 0.080978 = 0.236111
Solve for t – value
Decision / Conclusion: x1 − x 2
t=
Since the absolute value of (n1 − 1) s + (n 2 − 1) s 22
2
1 1
+
1
the computed t - value is less n1 + n 2 − 2 n1 n 2
than 2.101, the null hypothesis
4.01 − 4.28
cannot be rejected. Hence, =
there is no significant 9(0.080978) + 9(0.236111) 1 1
+
difference between the 10 + 10 − 2 10 10
teaching performance when t = −1.516
grouped according to gender.
EXERCISES:
1. In a Mathematics test, 25 grade six children from a city school registered a mean score of 78 and
a standard deviation of 8. In the same test a group of 30 children from the rural area got a mean
of 72 with a standard deviation of 5. Is there a difference between the performance of the two
groups?
2. A personality test was administered to 15 boys and 10 girls in a private school. With the data
given below, test whether the personality of boys and girls significantly differ. Higher scores
indicate better personality traits.
Boys Girls
13 18
16 19
16 22
19 24
20 25
20 25
20 26
21 32
24 36
24 36
26
27
28
30
32
3. The following data are grades of the students from A.M and P.M class. Do their academic
performance vary significantly?
Example:
A new teaching strategy is being checked if it is effective by testing the students before and
after conducting the teaching strategy. The data are as follows:
Pretest PostTest di d i2
3 5 -2 4
0 1 -1 1
6 5 1 1
7 7 0 0
4 10 -6 36
3 9 -6 36
2 7 -5 25
1 11 -10 100
4 8 -4 16
∑ di = –33 ∑ di2= 219
n(n − 1)
d
reserve judgment.
9(219) − (−33)(−33)
=
Test Statistic 9(8)
t = −3.143
s d2 = 12.25
Decision / Conclusion: s d = 12.25
Since the absolute value of s d = 3 .5
the computed t - value is greater Solve for t-value
than 2.306, the null hypothesis is d − d0
rejected. Hence, there is a t=
sd / n
significant difference between
the pretest and posttest scores of − 3.667
=
the students. This means that the 3.5 / 9
new teaching strategy is more effective t = −3.143
than the old one.
EXERCISES:
Pre-Module Post-Module
Difference
Score Score
18 22
21 25
16 17
22 24
19 16
24 29
17 20
21 23
23 19
18 20
14 15
16 15
16 18
19 26
18 18
20 24
12 18
22 25
15 19
17 16
ONE-WAY Analysis of Variance (ANOVA)
(Comparing 3 or more different groups)
BRAND
A B C D E
21 35 45 32 45
35 12 60 53 29
32 27 33 29 31
28 41 36 42 22
14 19 31 40 36
Determine whether or not the five brands sell, on the average, the same number of cigarettes at
the given supermarket.
Pearson-r (Relationship between 2 variables)
Teaching Emotional
x2 y2 xy
Performance (x) Quotient (y)
Test Statistic:
The pearson-r value of 0.982
indicates that there is a very high/strong r=0.982
and positive relationship between the
Teaching performance(x) and Emotional Decision/Conclusion:
quotient(y). A positive relationship Since 0.982>0.632, reject Ho and
indicates that as the Teaching conclude that there is a significant
performance increases, the Emotional relationship between the teaching
quotient also increases. And if the performance and emotional
Teaching performance decreases, the quotient.
Emotional quotient also decreases, and
vice versa.
EXERCISES:
1. Nine students held their breath, once after breathing normally and relaxing for one
minute, and once after hyperventilating for one minute. The table indicates how long (in
sec) they were able to hold their breath. Is there a association between the two
variables?
Normal Hypervent
56 87
56 91
65 85
65 91
50 75
25 28
87 122
44 66
35 58
2. Is there a significant relationship between entrance exam and board exam results.
Entrance Board
Exam Exam
49 75
47 72.2
69 81
65 78.4
49 75
58 72.4
76 65.4
79 75
79 76.2
56 75.8
63 78.8
74 78.4
74 80.8
57 80.2
56 75.4
3. A dietetics student wanted to look at the relationship between calcium intake and knowledge
about calcium in sports science students. Is there a significant relationship between the 2
variables?
Knowledge Score Calcium Intake
10 450
42 1050
38 900
15 525
22 710
32 854
40 800
14 493
26 730
32 894
38 940
25 733
48 985
28 763
22 583
45 850
18 798
24 754
30 805
43 1085