Reading Material - Hypothesis Testing
Reading Material - Hypothesis Testing
Think of a statistical test as being like a legal trial. The population is accused of the “crime” of
having an effect, and the sample is the criminal evidence. In many countries, a person accused of
a crime is assumed to be innocent until proven guilty. Similarly, we start by assuming the
population is “innocent” of having an effect.
In other words, the null hypothesis (i.e., that there is no effect) is assumed to be true until the
sample provides enough evidence to reject it. Null hypotheses often include phrases such as “no
effect,” “no difference,” or “no relationship.” When written in mathematical terms, they always
include an equality (usually =, but sometimes ≥ or ≤).
You can never know with complete certainty whether there is an effect on the population. Some
percentage of the time, your inference about the population will be incorrect. When you incorrectly
reject the null hypothesis, it’s called a type I error. When you incorrectly fail to reject it, it’s a type
II error.
Examples of null hypotheses
The table below gives examples of research questions and null hypotheses. There’s always more
than one way to answer a research question, but these null hypotheses can help you get started.
The alternative hypothesis (Ha) is the other answer to your research question. It claims that there’s
an effect on the population. Often, your alternative hypothesis is the same as your research
hypothesis. In other words, it’s the claim that you expect, or hope will be true.
The alternative hypothesis is a complement to the null hypothesis. Null and alternative hypotheses
are exhaustive, meaning that together they cover every possible outcome. They are also mutually
exclusive, meaning that only one can be true at a time.
Tip: Be careful with your words when you report the results of a statistical test in a research
paper or thesis. If you reject the null hypothesis, you can say that the alternative hypothesis
is supported. On the other hand, if you fail to reject the null hypothesis, then you can say that the
alternative hypothesis is not supported. Never say that you’ve proven or disproven a hypothesis.
Alternative hypotheses often include phrases such as “an effect,” “a difference,” or “a
relationship.” When alternative hypotheses are written in mathematical terms, they always include
an inequality (usually ≠, but sometimes < or >). As with null hypotheses, there are many acceptable
ways to phrase an alternative hypothesis.
The table below gives examples of research questions and alternative hypotheses to help you get
started with formulating your own.
General Test-specific
However, there are important differences between the two types of hypotheses, summarized in the
following table.
Definition A claim that there is no effect in the A claim that there is an effect in the
population. population.
Also known as H0 Ha
H1
Typical phrases • No effect • An effect
used • No difference • A difference
• No relationship • A relationship
• No change • A change
• Does not increase. • Increases
• Does not decrease • Decreases
Symbols used Equality symbol (=, ≥, or ≤) Inequality symbol (≠, <, or >)
p≤α Rejected Supported
p>α Failed to reject Not supported
The null hypothesis (H0) answers “No, there’s no effect in the population.”
The alternative hypothesis (Ha) answers “Yes, there is an effect in the population.”
The null and alternative are always claims about the population. That’s because the goal of
hypothesis testing is to reach conclusion about a population based on a sample. Often, we infer
whether there’s an effect in the population by looking at the sample. It’s important for your
research to write strong hypotheses. You can use a statistical test to decide whether the evidence
collected from the research supports the null or alternative hypothesis.
4 Step Process
• State the hypotheses.
• Formulate an analysis plan, which outlines how the data will be evaluated.
• Carry out the plan and analyze the sample data.
• Analyze the results and either reject the null hypothesis, or state that the null hypothesis is
plausible, given the data.
If the result of research is found statistically significant, it means that the observed result is unlikely
due to random factors or bias. The data found from samples are true for the whole population.
If the result of research is found statistically insignificant, it means that the current study’s results
aren’t strong enough to draw a conclusion. The effect might be real for the sample, but it may not
be for the overall population. The sample size used may be too small, or other factors might
influence grades.
Scene 1:
You decide to get tested for COVID-19 based on mild symptoms. There are two errors that could
potentially occur:
Type I error (false positive): The test result says you have coronavirus, but you actually don’t.
Type II error (false negative): The test result says you don’t have coronavirus, but you actually
do.
In statistics, a Type I error is a false positive conclusion, while a Type II error is a false negative
conclusion. Making a statistical decision always involves uncertainties, so the risks of making
these errors are unavoidable in hypothesis testing.
Scene 2:
You test whether a new drug intervention can alleviate symptoms of an autoimmune disease.
In this case:
• The null hypothesis (H0) is that the new drug has no effect on symptoms of the disease.
• The alternative hypothesis (H1) is that the drug is effective for alleviating symptoms of the
disease.
Then, you decide whether the null hypothesis can be rejected based on your data and the results of
a statistical test. Since these decisions are based on probability, there is always a risk of making
the wrong conclusion.
• If your results show statistical significance, that means they are very unlikely to occur if
the null hypothesis is true. In this case, you would reject your null hypothesis. But
sometimes, this may actually be a Type I error.
• If your findings do not show statistical significance, they have a high chance of occurring
if the null hypothesis is true. Therefore, you fail to reject your null hypothesis. But
sometimes, this may be a Type II error.
A Type I error happens when you get false positive results: you conclude that the drug intervention
improved symptoms when it actually didn’t. These improvements could have arisen from other
random factors or measurement errors.
A Type II error happens when you get false negative results: you conclude that the drug
intervention didn’t improve symptoms when it actually did. Your study may have missed key
indicators of improvements or attributed any improvements to other factors instead.
A Type I error means rejecting the null hypothesis when it’s actually true. It means concluding
that results are statistically significant when, in reality, they came about purely by chance or
because of unrelated factors.
The risk of committing this error is the significance level (alpha or α) you choose. That’s a value
that you set at the beginning of your study to assess the statistical probability of obtaining your
results (p value).
The significance level is usually set at 0.05 or 5%. This means that your results only have a 5%
chance of occurring, or less, if the null hypothesis is actually true.
If the p value of your test is lower than the significance level, it means your results are statistically
significant and consistent with the alternative hypothesis. If your p value is higher than the
significance level, then your results are considered statistically non-significant.
What Is a T-Test?
A t-test is an inferential statistical test used to determine if there is a significant difference between
the means of two groups and how they are related. The t-test is a test used for hypothesis testing
in statistics and uses t-statistics, the t-distribution values, and the degrees of freedom to determine
statistical significance.
Calculating T-value