Hypothesis Testing
Hypothesis Testing
Learning Outcomes:
After careful study of this chapter, you should be able to:
Understand the concept and general procedure of hypothesis testing.
Conduct and interpret hypothesis tests for a single population mean, population
standard deviation known.
Conduct and interpret hypothesis tests for two population means, population
standard deviations known.
Conduct and interpret hypothesis tests for a single population mean, population
standard deviation unknown.
Conduct and interpret hypothesis tests for two population means, population
standard deviations unknown.
Conduct and interpret hypothesis tests for a single population proportion.
Conduct and interpret hypothesis tests for two population proportions.
Conduct and interpret chi-square goodness-of-fit hypothesis tests.
Conduct and interpret chi-square test of independence hypothesis tests.
Hypothesis testing:
Hypothesis testing or significance testing is a method for testing a claim or hypothesis
about a parameter in a population, using data measured in a sample. In this method, we
test some hypotheses by determining the likelihood that a sample statistic could have been
selected, if the hypothesis regarding the population parameter were true.
https://www.sagepub.com/sites/default/files/upm-binaries/40007_Chapter8.pdf
Hypothesis: Any statement about any phenomenon is known as a hypothesis.
Statistical hypothesis: Statistical hypothesis is a hypothesis about population
characteristics that can be tested on the basis of sample data.
[From textbook]
There are two types of statistical hypotheses:
Null Hypothesis (H0) – A statistical hypothesis that states that there is no difference
between a parameter and a specific value, or that there is no difference between two
parameters.
Alternative Hypothesis (H1) – A statistical hypothesis that states the existence of a
difference between a parameter and a specific value, or states that there is a difference
between two parameters.
We tend to want to reject the null hypothesis so we assume it is true and look for enough
evidence to conclude it is incorrect. We tend to want to accept the alternative hypothesis. If
the null hypothesis is rejected then we must accept that the alternative hypothesis is true.
Note: H0 will ALWAYS have an equal sign (and possibly a less than or greater than symbol,
depending on the alternative hypothesis).
The alternative hypothesis has a range of values that are alternatives to the one in H0 .
The null and alternative hypotheses are stated together. The following are typical
hypothesis for means, where k is a specified number.
After stating the hypotheses, the researcher designs the study.
Select the correct statistical test
Choose an appropriate level of significance
Formulate a plan for conducting the study
Statistical Test – uses the data obtained from a sample to make a decision about whether
the null hypothesis should be rejected.
Test Value (test statistic) – the numerical value obtained from a statistical test.
https://math.ucdenver.edu/~ssantori/MATH2830SP13/Math2830-Chapter-08.pdf
Type I Error
A Type I Error occurs when the statistical test rejects the null hypothesis, when the null
should have been retained. In other words, you claim that you have found a difference (or
a relationship), when really, there was no difference (no relationship).
You can control for this type of error in several ways:
Increase your sample size
Select a smaller alpha
Type II Error
A Type II Error occurs when the statistical test retains (fails to reject) the null hypothesis,
when the null should have been rejected. In other words, you claim that you did not find
evidence of a difference (or a relationship), when really, there was difference (no
relationship).
You can control for this type of error in several ways:
Increase your sample size
Select a bigger alpha
Increase the strength of your independent variable with better measuring
Decrease the variability within your sample
https://courses.fortlewis.edu/courses/17335/pages/week-6-statistical-errors-
reading?module_item_id=438331
Significance level - is the maximum probability of committing a Type I error. This
probability is symbolized by
P(Type I error|H0 is true) =
Critical or Rejection Region – the range of values for the test value that indicate a
significant difference and that the null hypothesis should be rejected.
Acceptance region or Non-critical or Non-rejection Region – the range of values for the
test value that indicates that the difference was probably due to chance and that the null
hypothesis should not be rejected.
Critical Value (CV) – separates the critical region from the non-critical region, i.e., when
we should reject H0 from when we should not reject H0.
The location of the critical value depends on the inequality sign of the alternative
hypothesis.
Depending on the distribution of the test value, you will use different tables to find
the critical value.
Degrees of Freedom:
Degrees of freedom refers to the maximum number of logically independent values, which
are values that have the freedom to vary, in the data sample.
The formula for degrees of freedom equals the size of the data sample minus one:
Df=N−1
where:Df =degrees of freedom, N=sample size
Figure 8.2 shows that the alternative hypothesis is used to determine which tail or tails to
place the level of significance for a hypothesis test.
Step 3: Compute the test statistic. Suppose we measure a sample mean equal to 4 hours
per week that children watch TV. To make a decision, we need to evaluate how likely this
sample outcome is, if the population mean stated by the null hypothesis (3 hours per week)
is true. We use a test statistic to determine this likelihood. Specifically, a test statistic tells
us how far, or how many standard deviations, a sample mean is from the population mean.
The larger the value of the test statistic, the further the distance, or number of standard
deviations, a sample mean is from the population mean stated in the null hypothesis. The
value of the test statistic is used to make a decision in Step 4.
Step 4: Make a decision. We use the value of the test statistic to make a decision about the
null hypothesis. The decision is based on the probability of obtaining a sample mean, given
that the value stated in the null hypothesis is true. If the probability of obtaining a sample
mean is less than 5% when the null hypothesis is true, then the decision is to reject the null
hypothesis. If the probability of obtaining a sample mean is greater than 5% when the null
hypothesis is true, then the decision is to retain the null hypothesis. In sum, there are two
decisions a researcher can make:
1. Reject the null hypothesis. The sample mean is associated with a low probability of
occurrence when the null hypothesis is true.
2. Retain the null hypothesis. The sample mean is associated with a high probability of
occurrence when the null hypothesis is true.
The probability of obtaining a sample mean, given that the value stated in the null
hypothesis is true, is stated by the p value. The p value is a probability: It varies between 0
and 1 and can never be negative. In Step 2, we stated the criterion or probability of
obtaining a sample mean at which point we will decide to reject the value stated in the null
hypothesis, which is typically set at 5% in behavioral research. To make a decision, we
compare the p value to the criterion we set in Step 2.
When the p value is less than 5% (p < .05), we reject the null hypothesis. We will refer to p
< .05 as the criterion for deciding to reject the null hypothesis, although note that when p
= .05, the decision is also to reject the null hypothesis. When the p value is greater than 5%
(p > .05), we retain the null hypothesis. The decision to reject or retain the null hypothesis
is called significance. When the p value is less than .05, we reach significance; the decision
is to reject the null hypothesis. When the p value is greater than .05, we fail to reach
significance; the decision is to retain the null hypothesis. Figure 8.3 shows the four steps of
hypothesis testing.
https://www.sagepub.com/sites/default/files/upm-binaries/40007_Chapter8.pdf
The z test is a statistical test for the mean of a population. It can be used when n ≥ 30, or
when the population is normally distributed and σ is known.
https://math.ucdenver.edu/~ssantori/MATH2830SP13/Math2830-Chapter-08.pdf
Chi-square test (χ2 test)- chi-square test is used to compare two categorical variables.
Calculating the Chi-Square statistic value and comparing it against a critical value from the
Chi-Square distribution allows to assess whether the observed frequency are significantly
different from the expected frequency.
The hypothesis being tested for chi-square is-
Ho: Variable x and Variable y are independent
Ha: Variable x and Variable y are not independent.
https://medium.com/@anushka.da3/types-of-statistical-tests-b8ceb90e13b3