HYPOTHESIS TESTING - Edited
HYPOTHESIS TESTING - Edited
Introduction
Science and research are processes of formulating assumptions around a concept or problem and
then accepting or rejecting them after statistical scrutiny based upon a set methodology. The
assumption in the form of a statement is referred to as a hypothesis (Newey & McFadden, 1994).
A hypothesis is a well-informed guess about anything in the immediate environment. It ought to
be verifiable through experimentation or observation. There are many examples like a prediction
of the functionality of a new machine or concept on the successful teaching process or a fairer
way to administer standardized tests. Each concept or problem can have two logical outcomes –
positive or negative – that are labelled alternate/ experimental hypothesis (H 1) and the null
hypothesis (H0) respectively. The null hypothesis (Anderson, Burnham, & Thompson, 2000)
wishes to disprove and the alternate hypothesis wishes to prove. Inferential statistics is all about
testing hypotheses. Typically, the research hypothesis will be that there is a link between the
independent and dependent variables, or that therapy has a population-wide effect. The null
hypothesis, on the other hand, will typically be that there is no association between the
independent and dependent variables, or that the therapy has no real effect. In other words,
apparent variations in the samples are the result of chance. Many different types of statistical
tests are employed to investigate these possibilities. These tests are chosen based on the variables
being investigated and the types of samples.
The process of hypothesis testing goes through the following stages after the identification
of a research problem/ concept:
1. Framing of hypotheses
2. Ascertaining level of significance
3. Statistical testing for significance – Depending upon the type of research problem, type of
data, and purpose the researcher may want to test for mean, proportion, and/or variance
using appropriate statistical formulae/tests.
4. Critical value
5. Decision making
Framing of the hypothesis should meet the following criteria:
Testability: the hypothesis should be testable. There must be available parameters that
can be employed with the sample population to obtain data. The researcher must not
frame a hypothesis based on something that cannot be measured e.g. pain levels in
comatose patients.
The hypothesis should be specific and realistic in terms of the size and scope of what the
researcher aims to accomplish.
The wording of the hypothesis should be consistent with the research objectives.
The variables should be lucid, precise, and implicit.
Based upon the sign in the alternate hypothesis, for the first possibility, the researcher employs a
two-tailed test. For the other possibilities, one-tailed tests (right or left) are employed. This can
be also comprehended in terms of critical value/ value of significance that the researcher pre-
assigns.
Fig. 1 & 2: Representation of one and two-tailed tests
Actual Situation
Decision Taken H0 True H0 False
H0 Accepted No error Type-II error
Probability = 1 - Probability = β
Probability = Probability = 1 -β
During the process of hypothesis testing, the research may make two types of errors.
Type-I error: This error is deemed to occur when the researcher fails to accept a true null
hypothesis. It is also known as the error of commission because the researcher has taken an
unnecessary decision.
Type-II error: This error is deemed to occur when the researcher accepts a false null hypothesis.
The gravity of this error is more than the Type-I error. It is also known as an error of omission
because the researcher has failed to take the correct decision.
The errors in statistical testing can be comprehended with the help of Table 1.
It is important to test the hypotheses especially for ascertaining the statistical significance and
validity of the research statement. Further, the researcher will be able to understand a qualitative
problem in quantitative terms. This is the inferential part of statistics.
During the process of hypothesis testing, the problems that the researcher faces are:
1. Testing a hypothesis based upon a set of known observations against a new set of
observations.
2. Statistical evidence does not provide irrefutable proof regarding the truth of a hypothesis.
The whole process of hypothesis testing is built on the premise of disproving and
rejecting a hypothesis thereby giving scope to accept the alternate possibility.
3. The hypothesis must find logic in the context of the established existing knowledge.
4. The researcher may construe a positive correlation as a basis for causation, which may
actually be erroneous.
5. Some parametric tests depend upon reliable existing population data. The incompleteness
or variability in the existing population data will adversely impact the reliability of the
outcomes from these tests.
6. The researcher may have an ideological bias that may impel him to manipulate his data
and interpretation(s) to prove or disprove the hypothesis of his interest.
Most of the studies carried out in health sciences are either experimental or correlational by
design and explore a relationship between the variables. The former estimates the qualitative and
quantitative differences between variables whilst the latter observes the pattern(s) among them.
Once the researcher is clear about the design of his research, he can choose and apply the
appropriate statistical test(s). The following steps may be of help for this.
Step 1 – What is the nature of the variable?
The dependent and independent variables (Irlbeck, 2002) may be nominal, ordinal, or
continuous. The combination of variables that the researcher needs to interact with will dictate
the choice of statistical test that can be understood with the help of Table 2 as given below.
Table 2: Choosing the statistical test on basis of interacting variables.
Based on these three elements – study design, data type, and analysis type – the appropriate test
is chosen/ applied as illustrated from Figs. 2 and 3.
3. Wilcoxon Signed-Rank Sum Test – This is a non-parametric test that is used in place of
a one-sample t-test and paired t-test. It also suits continuous but ordinal data where the
researcher can rank the observations. Medians belonging to two data sets from dependent
samples (same or matched subjects)are compared in this analysis but not data from
independent samples (different groups) – for this Mann-Whitney U Test is used.
4. Mann-Whitney U Test – In this non-parametric test, the data from both independent
samples are first clubbed as if belonging to a single population and then ranked
preferably in ascending order. Hence, the null hypothesis essentially assumes that the
samples come from identical populations. For tied/ identical values, the mean of the two
values is given during the ranking process.
5. Z-test- It is a parametric statistical calculation to test that the means of two population
samples are different when their standard deviation is known. For a Z-test, the sample has
to be large (more than 30) and normally distributed. When the sample is small, the
standard deviation cannot be used reliably, and hence, the t-test is more suitable. Further,
as the sample size increases, it becomes distributed normally and better represents the
population from where the sample was drawn.
It is important to note that the t-test compares means using standard error while the z-test
compares means using standard deviation.
6. Karl Pearson’s coefficient of correlation: It is a parametric test that measures the
strength of association between two variables (same subject or matched group) that are
continuous data by nature. The actual values of the variable are known.
However, if the two data are ordinal (nominal/ categorical), Spearman rank correlation
needs to be applied. It is a non-parametric test in which the actual values of variables are not
known but ranks are available.
7. Linear Regression: This is a parametric analysis whereby the researcher predicts a
variable based on another variable. It works when the researcher has two sets of related
variables (Aalen, 1989). The interaction of variables is charted on a graph called
scattergram and the grouping of the dots (points of interaction) gives an estimate of the
correlation between the two variables and based on which further predictions can be
made.
Chapter Summery
Science and research are processes of formulating assumptions around a concept or problem. The
assumption in the form of a statement is referred to as a hypothesis. A hypothesis is a well-
informed guess about anything in the immediate environment. It ought to be verifiable through
experimentation or observation. Hypothesis testing is one of the most integral parts of research
paper/ thesis writing. The above mentioned tests can be used as per the requirement.
Key Terms
Hypothesis testing: Hypothesis testing is a type of statistical inference that employs sample data
to conclude a population parameter or probability distribution.
T-test: The one-sample t-test is a statistical hypothesis test used to see if an unknown population
mean differs from a given value.
ANOVA: An ANOVA test is used to determine whether or not survey or experiment results are
significant. In other words, they assist you in determining whether you should reject the null
hypothesis or accept the alternate hypothesis.
ANCOVA: The method of analysis of covariance is used to test the main and interaction effects
of categorical variables on a continuous dependent variable while controlling for the effects of
other continuous variables that co-vary with the dependent.
Questions:
1. What is meant by hypothesis testing?
2. What is hypothesis testing and examples?
3. What are the different steps in hypothesis testing?
4. Explain ANOVA and ANCOVA with suitable example?
5. What is T –test and Chi-Square Test ?
References:
Aalen, O. O. (1989). A linear regression model for the analysis of life times. Statistics in
medicine , 8 (8), 907--925.
Anderson, D. R., Burnham, K. P., & Thompson, W. L. (2000). Null hypothesis testing:
problems, prevalence, and an alternative. The journal of wildlife management , 912--923.
Irlbeck, S. A. (2002). Human performance technology: An examination of definitions through
dependent and independent variables. Performance Improvement Quarterly , 15 (2), 84--95.
Newey, W. K., & McFadden, D. (1994). Large sample estimation and hypothesis testing.
Handbook of econometrics , 4, 2111--2245.
Rao, C. (2002). Karl Pearson chi-square test the dawn of statistical inference. Goodness-of-fit
tests and model validity , 9--24.
Simar, L., & Wilson, P. W. (2002). Non-parametric tests of returns to scale. European Journal of
Operational Research , 139 (1), 115--132.
Square, M. (2018). Appendix A ANOVA Definitions. Laser Fabrication of Microstructured
Polymer-based Ultra Thin Layer Chromatography Platforms , 149.
Sudret, B., & Caniou, Y. (2013). Analysis of covariance (ANCOVA) using polynomial chaos
expansions. CRC Press/Balkema.
Tawali, T. (2018). THE EFFECT OF DEFINITIONS TOWARD STUDENTS’VOCABULARY.
Journal of Languages and Language Teaching , 6 (1), 34--41.