Unit 8 Content
Unit 8 Content
Introduction
In the previous unit, we dealt with the process of hypothesis testing. The process
is fundamental to how statistics are used to connect empirical observations to
theories. It is now that we can apply it to two commonly used statistical tests for
comparing means. In this unit, you will learn about the different statistical tests
used for testing differences. In the testing hypothesis of comparison involving
one or two groups or variables, we will utilize one sample t-test, independent
samples t-test and paired samples t-test. Moreover, we have Analysis of
Variance (ANOVA) for comparing means of three or more groups. Emphasis
will be given to the interpretation of statistical analysis results, thus SPSS output
tables are provided to ease you from a mathematical approach (minimal
coverage/use of formula) and can focus and enjoy learning Statistics.
Learning Outcomes
At the end of this unit, you are expected to:
A. Use appropriate statistical test in testing the difference between and
among means;
B. Employ hypothesis testing in comparing mean difference; and
C. Interpret statistical analyses results utilizing SPSS outputs.
1
Unit 8: Inferential Statistics: Test of Difference
Topic 1: T-test
Learning Objectives
Presentation of Content
T-test
Some studies require the investigation on the differences between two means.
T-test is a statistical test that can be used to achieve this research objective. As
a parametric test, t-test assumes the normality of data distribution and the
other assumptions. Depending on the nature of samples, t-test can be classified
as follows:
a. One-Sample T-test
b. Independent Samples T-test
c. Paired Samples T-test
One-Sample T-test
By using a one-sample t-test, it is possible to test whether the mean of a
particular variable differs from a specified value (Wagner, 2015). Specifically,
is employed when we are comparing the population mean and the sample
mean. It will determine whether that difference is statistically significant or
not. The formula in determining the t-computed value is:
𝑥̅ − 𝜇 (𝑥̅ − 𝜇)√𝑛 − 1
𝑡= 𝑠 𝑜𝑟 𝑡=
⁄ 𝑛−1 𝑠
√
𝑑𝑓 = 𝑛 − 1
2
Unit 8: Inferential Statistics: Test of Difference
Where:
𝑥̅ = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛
𝜇 = 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛
𝑠 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
𝑛 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒
You can also watch a tutorial video on how to run one-sample t-test at:
https://youtu.be/C2Qa5d9ij0Y
Example:
Based from the records, Alice found out that the average highest year of
school completed by most of their students’ parents is 12 which corresponds to
the completion of high school education. She then randomly selected 120
respondents to test her hypothesis at 𝑎 = 0.05 that it should be higher.
Analysis using SPSS generated the following outputs:
One-Sample Statistics
One-Sample Test
Test Value = 12
Highest year of
1.493 119 .138 .25000 -.0816 .5816
school completed
Solution:
To answer the problem, we will test the hypotheses that:
𝐻0 : There is no difference between the mean highest year school
completed of the parent population and of the sample.
𝐻𝑎 : The mean highest year school completed of the parent population
and of the sample differ.
3
Unit 8: Inferential Statistics: Test of Difference
The first table reveals that the mean highest year of school completed by the
sample respondents is 12.25 with standard deviation of 1.83. One-sample test
results showed no significant difference between the two groups (𝑡 =
1.493, 𝑑𝑓 = 119, 𝑝 > 0.05). It means that the 𝐻0 should not be rejected,
hence, the average highest year of school completed by the sample
respondents is equal to the population mean.
Note: From the table, the p-value or the probability value is located under the
“Sig. (2-tailed)”. This value should be less than or equal to the set level of
significance to conclude that the two groups’ means differ.
𝑑𝑓 = 𝑛1 + 𝑛2 − 2
Data analysis employing independent samples t-test can also be watched at:
https://youtu.be/-qGFZFOQx7Q
Example:
From the 120 respondents, Alice is interested to which sex has higher number
of years completed. At 𝑎 = 0.05, she analyzed the data in SPSS and resulted
to the following. Is there a significant difference between the numbers of years
completed?
4
Unit 8: Inferential Statistics: Test of Difference
Group Statistics
Std. Error
Sex N Mean Std. Deviation Mean
Equality of
95% Confidence
Interval of the
Solution:
We will test the hypotheses that:
𝐻0 : The mean highest years completed by the male and female
respondents are equal.
𝐻𝑎 : The mean highest years completed by the male and female
respondents differ.
Descriptive statistics presented in the first table shows that the mean highest
years of school completed of the male and female respondents were 12.45
with 𝑆𝐷 of 1.56 and 12.03 with 𝑆𝐷 of 1.76 respectively. Independent samples
test analysis results indicated no significant difference between the male and
female respondents (𝑡 = 1.357, 𝑑𝑓 = 118, 𝑝 > 0.05). Hence, there is not
enough evidence to reject the null hypothesis. It is concluded that the means of
the number of years completed by male and female respondents are equal.
5
Unit 8: Inferential Statistics: Test of Difference
Where:
𝑑̅ = 𝑚𝑒𝑎𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒𝑠 𝑜𝑓 𝑡ℎ𝑒 𝑝𝑎𝑖𝑟𝑒𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠
𝑠𝑑 = 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒𝑠
𝑛 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒
You can watch a video on how to run using the software at:
https://youtu.be/vll22ZnFOP0
Example:
Teacher Alice believes that she can deliver the lesson effectively by
employing Collaborative Approach in teaching their topics on hypothesis
testing. She random sampled 50 students and administered the pre-test. After
the 2-week discussion, she gave the post-test and analyzed the data. She run
the test using SPSS and generated the following tables:
N Correlation Sig.
6
Unit 8: Inferential Statistics: Test of Difference
Paired Differences
95% Confidence
Pair Post-test -
1.88000 1.06215 .15021 1.57814 2.18186 12.516 49 .000
1 Pre-test
Solution:
The null and alternative hypotheses for the given problem are:
𝐻0 : There is no significant difference between the pre-test and post-test
mean scores of the respondents.
𝐻𝑎 : There is a significant difference between the pre-test and post-test
mean scores of the respondents.
Paired samples statistics revealed that in the pre-test, the mean score of the
respondents is 14.18 with SD of 2.27 while their score in the post-test
is 16.06 with SD of 2.27. Analysis employing paired samples t-test resulted to
a statistically significant difference between the pre-test and post-test
scores (𝑡 = 12.516, 𝑑𝑓 = 49, 𝑝 < 0.05). Therefore, the null hypothesis is
rejected. The post-test mean score is significantly higher than the pre-test
mean score. Moreover, the approach employed in teaching hypothesis testing
is effective.
7
Unit 8: Inferential Statistics: Test of Difference
Learning Objectives
Presentation of Content
Analysis of Variance
Analysis of Variance (ANOVA) is another inferential statistics for comparing
means like the t-test. However, it involves test for significance between mean
scores of at least two groups across one or more than one variable. It can be
used to test statistical significance using categorical independent variables in
conjunction with a continuous dependent variable.
One-way ANOVA
The analysis of variance to compare the means of three or more groups in the
light of one independent variable. It has two estimates of the population
variance enumerated as:
1. Between-Group Variance
2. Within-Group Variance
Between-Group Variance
This estimate involves the computation of variance by using the means of the
groups or between the groups.
Within-Group Variance
This estimate includes all the data in computing for the variance. It is not
affected by the differences in the means.
Note: If found out that there is no difference in the means, the between-group
variance estimate is approximately equal to the within-group variance
estimate, and the F-computed value will be approximately equal to 1.
F-computed Value
ANOVA is based on the comparison of variance between groups to the
variance within groups, emerging as the 𝐹 ratio or 𝐹 statistic. For the
computation, let us consider the following ANOVA table:
8
Unit 8: Inferential Statistics: Test of Difference
Degrees
Source of Sum of Mean
of F-statistic
Variation Squares Squares
Freedom
(∑ 𝑥)2 (∑ ∑ 𝑥)2 𝑆𝑆𝑏 𝑀𝑆𝑏
Between 𝑆𝑆𝑏 = ∑ − 𝑘−1 𝑀𝑆𝑏 = 𝐹=
𝑛 𝑛𝑡 𝑘−1 𝑀𝑆𝑤
(∑ 𝑥)2 𝑆𝑆𝑤
Within 𝑆𝑆𝑤 = ∑ ∑(𝑥 2 ) − ∑ 𝑛𝑡 − 𝑘 𝑀𝑆𝑤 =
𝑛 𝑛𝑡 − 𝑘
Total 𝑆𝑆𝑡 = 𝑆𝑆𝑏 + 𝑆𝑆𝑤 𝑛𝑡 − 1
You can also watch a video on how to carry out Analysis of Variance using
SPSS at: https://youtu.be/rS3k8ONVN-o
9
Unit 8: Inferential Statistics: Test of Difference
Example:
Mr. Manuel hypothesized that the students of the College of Teacher
Education enrolled for the school year 2020-2021 have equal knowledge on
the free tuition fee in tertiary education (R.A. 10931). He randomly selected
200 respondents from the current three year levels (there were no seniors
during this S.Y.) and conducted the survey. Test his hypothesis at 𝑎 = 0.5.
Refer to the succeeding SPSS output tables for the data analysis results.
Descriptives
Knowledge on R.A. 10931
95% Confidence
Interval for Mean
ANOVA
Knowledge on R.A. 10931
Sum of Squares df Mean Square F Sig.
10
Unit 8: Inferential Statistics: Test of Difference
Multiple Comparisons
Dependent Variable: Knowledge on R.A. 10931
Tukey HSD
Solution:
In this problem, we will test the hypotheses that:
𝐻0 : The mean scores (knowledge on RA 10931) of the three year levels
are equal.
𝐻𝑎 : At least two of the three year levels differ in mean scores
(knowledge on RA 10931).
Note: To identify pair or pairs with significant mean difference from the post
hoc analysis results, determine those who have probability value of less than
or equal to the set level of significance. If the mean difference is positive, the
group in the first column has the higher mean compared to the corresponding
group in the second column.
11