LR22 Test Statistic
LR22 Test Statistic
To compute the test statistic value in hypothesis testing, there is a need to get
utilized. This module presents ways how to deal with such tests appropriately.
Significance
when
Wait!
It would be easier to understand the lessons in this material if you have the prior
The concepts in the said topics will aid you in identifying appropriate form of
significance.
Let’s Remember
Variance is a measure of how data points differ from the mean. It is a measure
of how far a set of data (numbers) are spread out from their mean (average) value.
Variance means to find the expected difference of deviation from actual value and
the standard deviation gives a rough estimate of it. Standard deviation is obtained
after getting the square root of the variance. In this lesson, most problems and
Before we get into the core of the lesson, let us try to check your prior
t-distribution.
standard deviation.
______ 3. This is best to use in a statistical test if the population standard
deviation is known.
B. Direction: Identify the data present in the given situations that may be
High School students was more than P80. A sample of 35 students had
mean allowance of P100. At 𝛼= 0.01 test, it was the claim that the
students had allowance of more than P80. The standard deviation of the
population is P40.
randomly asked about the price of their cellular phone, data collected
3. The average number of ad clicks per day for Facebook before was
192,000 and the standard deviation was 100,000. Sixty-four (64) days
after the redesign, the mean number of ad clicks per day was 200,000.
by a light bulb company. Thinking that the average life of bulbs is less
light bulbs. The rest resulted to sample mean of 1,300 hours and
the average life of the company's light bulb is less than 1,500 hours?
data against the claim that the overall height of all entering HRM
C. Direction: Illustrate the rejection region given the critical value and
LESSON 1
hypothesis testing technique using a data sample. Using this technique, we check whether
the population parameter hypotheses were correct as well as the likelihood that the sample
statistic might be chosen. Setting up the null hypothesis and the alternative hypothesis as
two opposing hypotheses is the first step in the hypothesis testing procedure. A random
sample is chosen, and summary statistics are computed after determining the likelihood
that the sample data support the alternative hypothesis, using the appropriate test statistics.
You learned how to create null and alternative hypotheses in the previous module.
You are now prepared to examine statistical evidence to ascertain the correct test statistics
Let’s Learn
In the previous lessons, you learned the basic concepts of random sampling,
parameter, and statistic. In this lesson, we will use them to determine the needed data in
Definition of Terms:
TERM MEANING
populations
Population mean (µ) The mean of all the values in the population. If the
sample is randomly selected and sample size is large then
population mean
population.
Population variance It indicates how the population data points are spread out.
The test statistic shows how closely your observed data matches the distribution
expected under the null hypothesis of that statistical test. The use of test statistic is to
determine whether to accept or reject the null hypothesis. It compares the data with what
variance or standard deviation is known or unknown. The table below shows the
difference between z-test and t-test and will guide you on how to use the test.
z-test t-test
The population standard deviation (σ) is A t-test is used when the population
known.
The population is normally distributed.
the sample drawn belongs to the same It is used to validate a hypothesis that the
population.
The sample size is large (n ≥30)
Limit Theorem]
However, the Central Limit Theorem is used in Statistics and Probability dealing
with the distribution of a sample variable in approximating a normal distribution (i.e., a
“bell curve”) as the sample size becomes larger, assuming that all samples are identical
Put another way, CLT is a statistical premise that, given a sufficiently large sample
size from a population with a finite level of variance, the mean of all sampled variables
from the same population will be approximately equal to the mean of the whole
variances being approximately equal to the variance of the population as the sample size
unknown.
is unknown.
Summing it up, the following is a simplistic diagram that specify when to use the
z-test vs t-test.
If the population standard deviation is known, and the size of the sample is
The z-test and t-test are different statistical hypothesis tests that help determine
statistic is used to test for the null hypothesis in relation to whether there is a difference
between the populations means or proportions given the population standard deviation is
known, data belongs to normal distribution, and sample size is larger enough (greater
than 30). T-tests are used when the population standard deviation is unknown, the data
belongs to a normal distribution and the sample size is small (lesser than 30).
their electronic games is 150 hours. It is known that the standard deviation
manufacturer's claim and accordingly tested 100 electronic games using the
battery. It was found out that the mean is equal to 144 hours. Identify the
Solution: Use the diagram previously presented to determine the given data.
Is n ≥ 30 or n <30?
known unknown
✓ ✓ ✓
In the given problem, 100 electronic games were tested using the battery which
reveals a sample size of more than 30. Also, the population standard deviation is
Example 2: An English teacher wanted to test whether the mean reading speed
sample mean of 240 words per minute with a standard deviation of 5 words
per minute. At 0.05 significance level, is the reading speed different from
Solution: Based on the problem above, the sample size is 12, so n<30. Is the
words per minute with a standard deviation of 5 words per minute. You may
mistakenly get 5 as the standard deviation, but note that 5 in this problem is
given.
known unknown
✓ ✓ ✓
Example 3: A study was conducted to look at the average time students exercise.
A researcher claimed that in average, students exercise less than 15 hours per
month. In a random sample size n =115, it was found that the mean time
students exercise is x̄ = 11.3 hours per month with s = 6.43 hours per month.
Solution: The sample size in this problem n = 115, is extremely large and the
✓ ✓ ✓
known unknown
✓ ✓ ✓
to be used.
2. A school administrator claims that less than 50% of the students of the school
are dissatisfied by the community cafeteria service. Test this claim by using
sample data obtained from a survey of 500 students of the school where 54%
0.05.
Grade 6 pupils. In this sample, X = 80 and s = 10. The mean u and the
4.6, s=15. Test the null hypothesis that the mean of the population is 6
LESSON 2
One of the processes in hypothesis testing is the calculation of the test statistic.
of hypothesis testing, the test statistic is the value, determined by a computational formula
that is compared with a confidence coefficient (like 1.96 and 2.58). The decision that we
make depends on the computed test statistic whether we accept or reject the null hypothesis.
The image below shows the acceptance and rejection region under the normal curve
The REJECTION REGION, also known as critical region, is the set of all
values of the test statistic that causes us to reject the null hypothesis.
The NON-REJECTION REGION (or acceptance region is the set of all values
of the test statistic that causes us to fail to reject the null hypothesis.
acceptance region.
acceptance region.
coefficient is divided by 2.
hypothesis.
Is the sample size large enough to apply the Central Limit Theorem
(CLT)?
Step 6 Compare the computed test statistic and the critical value. Then, based on
the decision rule, decide whether to reject or not to reject the null
hypothesis.
For purposes of facilitating the examples below, the solution will go directly to
the steps necessary in this lesson as prior steps were discussed in the previous
Illustrative examples:
their electronic games is 150 hours. It is known that the standard deviation
battery. It was found that the mean is equal to 144 hours. Use α = 0.05.
and the sample size s =100 electronic games. The population standard deviation
is known; the sample indicates n ≥ 30; therefore, z-test will be used. Likewise, it
will use two-tailed test determined by the “average life of batteries used in their
The curve below shows the rejection region and the critical value
within the rejection region, we reject the null hypothesis. Otherwise, accept the
null hypothesis
Example 2: According to a survey, the average daily usage of social media
worldwide of global internet users amounts to 142 minutes per day. Sofia
conducts her own survey among her 10 friends to find out if her time spent
Solution: H0 : The average daily usage of social media of her friends is the same as the
H𝑎 : The average daily usage of social media of her friends is higher than the
The alternative hypothesis includes the term “higher than”, so this problem
is a one-tailed test. Investigating the problem, it will use t-test since the
population standard deviation and the sample size in less than 30. (Refer to the
previous lessons in finding the t-value). After computing for the t-value, which
in this case is, the computed t is 𝑡𝑐𝑜𝑚𝑝𝑢𝑡𝑒𝑑 = 1.593, determine the critical value.
(acceptance region). When the computed value is lower than the critical value, it is
under non-rejection rejection. Therefore, it will mean failing to reject the null
Example 3: Sketch the rejection region of the test hypothesis with critical
values of −2.715 and determine if the computed t-value of -1.52 lies in that
region.
Solution:
Direction: Read each problem or situation and determine if the computed t-value or z-value
lies in the rejection region or non-rejection region.
Grade 6 pupils. In this sample, sample mean = 80 and s = 10. The mean µ
the test were 75 and 15, respectively. Use the 95% confidence in determining
2. A school administrator claims that less than 50% of the students of the school
are dissatisfied by the community cafeteria service. Test this claim by using
sample data obtained from a survey of 500 students of the school where 54%
0.05.
Grade 6 pupils. In this sample, X = 80 and s = 10. The mean u and the
4. A librarian of a school claims that all their grade 8 students read an average
interviewed 50% of the employees and found out that their mean working
Let’s Remember
A. Direction: Tell which test statistic (z-test or t-test) must be used in each of
1. A statistical test that is used to check if the means of two data sets are different
1. The owner of a factory that sells a particular bottled fruit juice claims that the
average capacity of their product is 250 ml. To test the claim, a consumer
group gets a sample of 100 such bottles, calculates the capacity of each bottle,
and then finds the mean capacity to be 248 ml. The standard deviation s is 5
s=15. Test the null hypothesis that the mean of the population is 6 against the
s=15. Test the null hypothesis that the mean of the population is 6 against the
Answer Key
distribution.
1. t
2. t
3. t
4. z
5. z
B. Direction: Identify the data present in the given situations that may be used
C. Direction: Illustrate the rejection region given the critical value and identify
1. acceptance
2. rejection
3. rejection
4. acceptance
5. rejection
Direction: Read each problem or situation and identify the appropriate form of test
statistic to be used.
1. z-test
2. z-test
3. z-test
4. t-test
5. z-test
1. z-test
2. t-test
3. t-test
4. z-test
5. t-test
1. Rejection region
2. Non-rejection region
3. Non-rejection region
4. Rejection region
5. Rejection region
References