Statistics and Probability 2nd Quarter
Statistics and Probability 2nd Quarter
Probability
Quarter 2 – Module 1:
Testing Hypothesis
1
Republic Act 8293, Section 176 states that: No copyright shall subsist in any work
of the Government of the Philippines. However, prior approval of the government agency or
office wherein the work is created shall be necessary for exploitation of such work for profit.
Such agency or office may, among other things, impose as a condition the payment of
royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this module are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials from
their respective copyright owners. The publisher and authors do not represent nor claim
ownership over them.
Published by the Department of Education
Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio
2
Statistics and
Probability
Quarter 2 – Module 1:
Testing Hypothesis
3
Introductory Message
For the facilitator:
Welcome to the Statistics and Probability for Senior High School Alternative
Delivery Mode (ADM) Module.
This learning resource hopes to engage the learners into guided and
independent learning activities at their own pace and time. Furthermore,
this also aims to help learners acquire the needed 21st century skills while
taking into consideration their needs and circumstances.
In addition to the material in the main text, you will also see this box in the
body of the module:
As a facilitator, you are expected to orient the learners on how to use this
module. You also need to keep track of the learners' progress while allowing
them to manage their own learning. Furthermore, you are expected to
encourage and assist the learners as they do the tasks included in the
module.
4
For the learner:
Welcome to the Statistics and Probability for Senior High School Alternative
Delivery Mode (ADM) Module.
The hand is one of the most symbolical parts of the human body. It is often
used to depict skill, action, and purpose. Through our hands, we may learn,
create, and accomplish. Hence, the hand in this learning resource signifies
that as a learner, you are capable and empowered to successfully achieve
the relevant competencies and skills at your own pace and time. Your
academic success lies in your own hands!
This module was designed to provide you with fun and meaningful
opportunities for guided and independent learning at your own pace and
time. You will be enabled to process the contents of the learning resource
while being an active learner.
5
the answers to the exercises using the
Answer Key at the end of the module.
What I Have This includes questions or blank
Learned sentences/paragraphs to be filled in to
process what you learned from the
lesson.
If you encounter any difficulty in answering the tasks in this module, do not
hesitate to consult your teacher or facilitator. Always bear in mind that you
are not alone.
We hope that through this material, you will experience meaningful learning
and gain deep understanding of the relevant competencies. You can do it!
6
What I Need to Know
Are you ready now to study hypothesis testing using your ADM module?
Good luck and may you find it helpful.
What I Know
7
A. Level of error C. Level of acceptance
B. Level of hypothesis D. Level of significance
5. In a one-tailed test, in which critical value listed below will the computed
z of 2.313 fall in the acceptance region?
A. 1.383 C. 2.228
B. 1.533 D. 2.365
6. Which of the following would be an appropriate null hypothesis?
A. The mean of a sample is equal to 75.
B. The mean of a population is equal to 75.
C. The mean of a sample is not equal to 75.
D. The mean of a population is greater than 75.
11. Which of the following must be used as the level of significance if we want
a higher possibility of correct decision?
A. 1% B. 5% C. 10% D. 25%
8
13. Using a left-tailed test, which of the following value of z falls in the
rejection region where the critical value is – 1.725?
A. – 1.700 B. – 1.715 C. – 1.724 D. – 1.728
14. If the computed z-value is 2.015 and the critical value is 1.833, which of
the following statements could be true?
A. It lies in the rejection region, 𝐻𝑜 must be rejected.
B. It lies in the rejection region, we failed to reject 𝐻𝑜 .
C. It lies in the non-rejection region, 𝐻𝑜 must be rejected.
D. It lies in the non-rejection region, we failed to reject 𝐻𝑜 .
15. If the computed z-value is – 1.290 and the critical value is – 2.571, which
of the following statements could be true?
A. It lies in the rejection region, 𝐻𝑜 must be rejected.
B. It lies in the rejection region, we failed to reject 𝐻𝑜 .
C. It lies in the non-rejection region, 𝐻𝑜 must be rejected.
D. It lies in the non-rejection region, we failed to reject 𝐻𝑜 .
Lesson
1 Testing Hypothesis
Have you at a certain time asked yourself how you could possibly decide to
put a business in place and gain your expected profit? Or wonder if a judge
in a trial could have given a wrong decision in determining who’s guilty? Or
think if your classmates’ average weights differ significantly among your
age? Or imagine how a newly discovered medicine is being tested for human
treatment?
This lesson will help you make sound decisions in dealing with these
situations.
9
What’s In
Where Am I Now?
Directions: Identify the region where each of the given values falls.
Region B
Region A
Region C
1. 𝑡 = 1.95 ______________________________
2. 𝑡 = 0.15 ______________________________
3. 𝑡 = −1.45 ______________________________
4. 𝑡 = −2.4 ______________________________
5. 𝑡 = 2.73 ______________________________
10
What’s New
Keep Me Connected!
Directions: Analyze the situation below and answer the questions that follow.
11
2. What is the average daily usage of social media of her friends? Compare
it with the previous average usage.
3. Which of the two claims could probably be true? Why?
4. If Sofia computed the average daily internet usage of her friends to be
higher than the global survey, do you think it would be significantly
higher?
5. What is your idea of an average value being significantly higher than the
global average value?
6. What do you think is the difference between simple comparison of data
and hypothesis testing?
What Is It
Here are the examples of questions you can answer with a hypothesis test:
Does the mean height of Grade 12 students differ from 66 inches?
Do male and female Grade 7 and Grade 12 students differ in height
on average?
Is the proportion of senior male students’ height significantly
higher than that of senior female students?
Note: You can think of the null hypothesis as the current value of the
population parameter, which you hope to disprove in favor of your
alternative hypothesis.
12
Take a look at this example.
The school record claims that the mean score in Math of the incoming
Grade 11 students is 81. The teacher wishes to find out if the claim is true.
She tests if there is a significant difference between the batch mean score
and the mean score of students in her class.
Solution:
Let 𝜇 be the population mean score and 𝑥̅ be the mean score of
students in her class.
You may select any of the following statements as your null and
alternative hypothesis as shown in Option 1 and Option 2.
Option 1:
𝐻𝑜 : The mean score of the incoming Grade 11 students is 81 or 𝜇 = 81.
𝐻𝑎 : The mean score of the incoming Grade 11 students is not 81 or 𝜇 ≠ 81.
Option 2:
𝐻𝑜 : The mean score of the incoming Grade 11 students has no significant
difference with the mean score of her students or 𝜇 = 𝑥̅ .
𝐻𝑎 : The mean score of the incoming Grade 11 students has a significant
difference with the mean score of her students or 𝜇 ≠ 𝑥̅ .
Level of Significance
The level of significance denoted by alpha or 𝛂 refers to the degree of
significance in which we accept or reject the null hypothesis.
100% accuracy is not possible in accepting or rejecting a hypothesis.
The significance level α is also the probability of making the wrong decision
when the null hypothesis is true.
Do you know that the most common levels of significance used are 1%, 5%,
or 10%?
Some statistics books can provide us table of values for these levels of
significance.
13
Take a look at this example.
Maria uses 5% level of significance in proving that there is no
significant change in the average number of enrollees in the 10 sections for
the last two years. It means that the chance that the null hypothesis (𝐻𝑜 )
would be rejected when it is true is 5%.
𝛼 = 0.05
14
However, if the school registrar believes that the average number of enrollees
this school year is less than the previous school year, then you will have:
𝐻𝑜 : 𝜇 = 𝜇0
𝐻𝑎 : 𝜇 < 𝜇0
On the other hand, if the school registrar believes that the average number
of enrollees this school year is greater than the previous school year, then
you will have:
𝐻𝑜 : 𝜇 = 𝜇0
𝐻𝑎 : 𝜇 > 𝜇0
Now back to the two claims of Sofia, what do you think should be the type of
test in her following claims?
Claim A: The average daily usage of social media of her friends is
the same as the global average usage.
15
Illustration of the Rejection Region
The rejection region (or critical region) is the set of all values of the test
statistic that causes us to reject the null hypothesis.
The non-rejection region (or acceptance region) is the set of all values of
the test statistic that causes us to fail to reject the null hypothesis.
The critical value is a point (boundary) on the test distribution that is
compared to the test statistic to determine if the null hypothesis would
be rejected.
Non-Rejection
Region Rejection Region
Critical Value
Illustrative Example 1:
Now, let’s take a look at Sofia’s first claim. She assumed that the average
online usage of her friends is the same as the global usage (𝐻𝑜 ).
𝑥̅ −𝜇
She computed for the t-value using the formula 𝑡 = 𝑠 where 𝜇 = 142, 𝑥̅
√𝑛
= 152, s = 19.855, and n = 10.
𝑥̅ − 𝜇
𝑡= 𝑠
Use a scientific
√𝑛
This t-test formula calculator to
was discussed in 152 − 142 verify the
𝑡=
the last chapter. 19.855 computed t-
√10 value.
10
𝑡=
6.2787
𝑡 = 1.593
16
From the table of t-values, determine the critical value. Use df = n-1 = 9,
one-tailed test at 5% level of significance.
The critical t-value is 1.833.
How did we get that value?
Look at this illustration!
Now, you can sketch a t distribution curve and label showing the rejection
area (shaded part), the non-rejection region, the critical value, and the
computed t-value. This is how your t distribution curve should look like!
Rejection
Region
Non-Rejection
Region
1.593 1.833
(Computed Value) (Critical Value)
17
Illustrative Example 2:
A medical trial is conducted to test whether or not a certain drug reduces
cholesterol level. Upon trial, the computed z-value of 2.715 lies in the
rejection area.
Illustrative Example 3:
Sketch the rejection region of the test hypothesis with critical values of
±1.753 and determine if the computed t-value of –1.52 lies in that region.
Solution:
Draw a t-distribution curve. Since there are two critical values, it is a two
tailed test. Locate the critical values and shade the rejection regions.
Now, locate the computed t-value of –1.52. You can clearly see that it is not
at the rejection region as shown in the following figure. The computed t-value
is at the non-rejection region. Therefore, we fail to reject the null hypothesis,
𝐻𝑜 .
– 1.52
– 1.753 1.753
(critical value) (critical value)
18
Type I and Type II Errors
α
This is the region of Type II
error.
β = P [type II error]
= P [𝐻𝑜 is false, Fail to reject 𝐻𝑜 ]
Region where 𝐻𝑜 is
false
𝜷
To summarize the difference between the Type I and Type II errors, take a
look at the table below.
19
Now, complete the statements that follow.
Illustrative Example:
The Type I error is the first statement because he rejected the true
null hypothesis.
20
What’s More
Directions: State the null and the alternative hypotheses of the following
statements.
1. A medical trial is conducted to test whether or not a new medicine
reduces uric acid by 50%.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
2. We want to test whether the general average of students in Math is
different from 80%.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
3. We want to test whether the mean height of Grade 8 students is 58
inches.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
4. We want to test if LPIHS students take more than four years to graduate
from high school, on the average.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
5. We want to test if it takes less than 60 minutes to answer the quarterly
test in Calculus.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
6. A medical test is conducted to determine whether or not a new vaccine
reduces the complications of dengue fever.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
7. The enrolment in high school this school year increases by 10%.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
8. The intelligence quotient of male grade 11 students is the same as the
female students.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
21
9. The school want to test if the students in Grade 7 prefer online distance
learning as the method of instruction.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
10. The school librarian wants to find out if there was an increase in the
number of students accessing the school library.
𝐻𝑜 : ____________________________________________________
𝐻𝑎 : ____________________________________________________
Directions: Illustrate the rejection region given the critical value and
identify if the t-values lie in the non-rejection region or rejection
region.
22
2. critical t-value of −1.671
computed t-value of −2.45
23
7. critical t-value of −2.306
computed t-value of −2.110
24
1. What would be the consequence of a Type I error in this context?
They do not conclude that the They do not conclude that the
solar panel is effective when it is solar panel is effective when it is
not actually effective. actually effective.
They conclude that the solar They conclude that the solar panel
panel is effective when it is is effective when it is not actually
actually effective. effective.
Situation 2:
A resort owner does a daily water
quality test in their swimming pool. If
the level of contaminants is too high,
then he temporarily closes the pool to
perform a water treatment.
We can state the hypotheses for his
test as:
𝐻𝑜 : The water quality is acceptable.
𝐻𝑎 : The water quality is not acceptable.
The owner closes the pool when it The owner does not close the pool
does not need to be closed. when it does not need to be closed.
25
5. In terms of safety, which error has more dangerous consequences in this
setting?
26
What I Can Do
Directions: Cite five (5) situations in your community where you can apply
hypothesis testing. Then, just choose one situation and:
1. create a problem statement;
2. formulate the null and alternative hypothesis;
3. select the level of significance and sketch the rejection region; and
4. state the possible Type I and Type II errors.
Assessment
27
5. Which of the following describes a null hypothesis using two-tailed test?
A. 𝐻0 : 𝜇 = 𝜇0 B. 𝐻0 : 𝜇 ≠ 𝜇0 C. 𝐻0 : 𝜇 ≥ 𝜇0 D. 𝐻0 : 𝜇 ≤ 𝜇0
13. If the computed z-value is 1.286 and the critical value is 1.383, which of
the following statements could be true?
A. It lies in the rejection region, 𝐻𝑜 must be rejected.
B. It lies in the rejection region, hence we fail to reject𝐻𝑜 .
C. It lies in the non-rejection region, 𝐻𝑜 must be rejected.
D. It lies in the non-rejection region, hence we fail to reject𝐻𝑜 .
14. Using a left-tailed test, which of the following value of z will not fall in the
rejection region where the critical value is – 1.638?
A. – 1.637 B. – 1.639 C. – 1.641 D. – 1.706
28
15. If the computed z-value is 1.915 and the critical value is 1.812, which of
the following statements could be true?
A. It lies in the rejection region, 𝐻𝑜 must be rejected.
B. It lies in the rejection region, hence we fail to reject𝐻𝑜 .
C. It lies in the non-rejection region, 𝐻𝑜 must be rejected.
D. It lies in the non-rejection region, hence we fail to reject𝐻𝑜 .
Additional Activities
A medical trial is conducted to test whether or not a certain drug can treat a
certain allergy. Upon trial, the t-value is computed as 1.311. Sketch and
complete the table below to discuss the findings of the medical trial.
29
30
References
Textbooks
Albacea, Zita, Mark John Ayaay, Imelda Demesa, and Isidro David. Teaching
Guide for Senior High School: Statistics and Probability. Quezon City:
Commission on Higher Education, 2016.
Caraan, Avelino. Introduction to Statistics & Probability. Mandaluyong City:
Jose Rizal University Press, 2011.
Chan Shio, Christian Paul, and Maria Angeli Reyes. Statistics and
Probability for Senior High School. Quezon City: C & E Publishing Inc.,
2017.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E
Publishing Inc., 2017.
Jaggia, Sanjiv, and Alison Kelly. Business Statistics: Communicating with
Numbers. 2nd Ed. New York: McGraw-Hill Education, 2016.
Sirug, Winston. Statistics and Probability for Senior High School CORE
Subject A Comprehensive Approach K to 12 Curriculum Compliant.
Manila: Minshapers Co., Inc., 2017.
Online Resources
Khan Academy. “Consequences of Errors and Significance.” Accessed
February 2, 2019. https://www.khanacademy.org/math/ap-
statistics/tests-significance-ap/error-probabilities-
power/a/consequences-errors-significance
Minitab.com. “About the Null and Alternative Hypotheses.” Accessed
February 4, 2019. https://support.minitab.com/en-
us/minitab/18/help-and-how-to/statistics/basic-
statistics/supporting-topics/basics/null-and-alternative-hypotheses/
Minitab. com. “What are Type I and Type II Errors?” Accessed February 4,
2019. https://support.minitab.com/en-us/minitab/18/help-and-how-
to/statistics/basic-statistics/supporting-topics/basics/type-i-and-type-
ii-error/
Zaiontz, Charles. “Null and Alternative Hypothesis.” Accessed February 2,
2018. http://www.real-statistics.com/hypothesis-testing/null-
hypothesis/
31
Statistics and
Probability
Quarter 2 – Module 2:
Identifying Parameters for
Testing in Given Real-Life
Problems
32
What I Need to Know
With this module, you will learn how to identify the parameter to be
tested in a statistical hypothesis. The first step in hypothesis testing,
defining the parameter, will be given emphasis in this module.
What I Know
33
5. Which is the parameter in the given situation below? “The average age of
10 college students is 24 years.”
A. 24 years C. age of the college students
B. 10 college students D. the average age of 24 years
6. A parameter is...
A. a numerical value summarizing the sample data
B. a planned activity with results yielding a set of data
C. a numerical value that summarizes all the data of an entire
population
D. the set of values collected from the variable from each of the elements
that belongs to the sample
9. SWS survey was trying to see if people in the Philippines thought the
pollution was too high. Which choice best represents a parameter?
A. all people in the Philippines
B. 500 randomly selected residents of the Philippine
C. 71% of the residents surveyed who thought the pollution was too high
D. percentage of all people in the Philippines who thought the pollution
was too high
10. A research conducted on a certain company last year showed that 25% of
the employees would rather drink coffee than soft drinks during break
time. Which choice best represents a parameter?
A. 25 employees C. the percentage of employee
B. all employees D. coffee rather than soft drinks
34
A. average of all residents
B. disapproval rating below 21%
C. political committee paying a newspaper ad
D. percentage of city residents who express disapproval
14. Three percent (3%) of cars of a certain model have needed new engines
after being driven between 0 and 80 miles. The manufacturer hopes
that redesigning one of the engine's components has solved this
problem. What kind of parameter is illustrated in the problem?
A. mean C. proportion
B. variance D. standard deviation
For instance, you might be interested in the average age of your section
where you belong and found the average age was 17. Do you think this is an
example of parameter? To be able to answer this question, read and
understand this module.
35
What’s In
1. The ____________ in a set of data is the sum of the values divided by the
total number of values.
A. mean B. range C. variance D. standard deviation
3. The ________ is the item of data that appears most frequently in a set of
data.
A. mean B. mode C. median D. standard deviation
4. The measurement that shows how data are spread above and below the
mean is the _____.
A. mean B. range C. variance D. standard deviation
36
What’s New
Activity 2: Grouping!
Directions: Group the following symbols into two. Place the first group
inside Box A and the second group in Box B.
ഥ
𝒙 𝒑 𝒔 𝒔𝟐 ෝ
𝒑 𝝁 𝝈 𝝈𝟐
A B
Guide Questions:
1. What are the symbols that you placed in Box A? Box B?
2. How did you categorize each symbol or notation?
3. What mathematical principle did you consider in answering the
activity?
4. Which symbols seemed to be familiar to you and which are not?
What Is It
37
Different symbols are used to denote parameters. Based on Activity 2,
symbols are grouped as indicated in the table below.
Measure Statistic Parameter
𝒎𝒆𝒂𝒏 𝑥̅ (x-bar) 𝜇 (myu)
𝒗𝒂𝒓𝒊𝒂𝒏𝒄𝒆 𝑠2 𝜎 2 (sigma squared)
𝒔𝒕𝒂𝒏𝒅𝒂𝒓𝒅 𝒅𝒆𝒗𝒊𝒂𝒕𝒊𝒐𝒏 𝑠 𝜎 (sigma)
𝒑𝒓𝒐𝒑𝒐𝒓𝒕𝒊𝒐𝒏 𝑝̂ (p hat) 𝑝
In this claim, there are different parameters used but the parameter
to be tested in this hypothesis would be the average allowance of Senior
High School students since it relates to the population, not in sample.
Statistical hypothesis is a conjecture about the population parameter that’s
why you will look for the population mean, population standard deviation, or
population proportion but not sample mean.
38
What’s More
1. The television habits of children were observed and found out that the
standard deviation is 12.4 hours per week.
2. A newspaper article stated that students in the country take an average
of 4 years to finish their undergraduate degrees. Suppose that you
believe the mean time is longer, you conducted survey on 49 students.
The result obtained a sample mean of 5 with a sample standard deviation
of 1.2.
3. According to DOLE, registered nurses in government earned an average
monthly salary of ₱9,700. For that same year, a survey was conducted on
41 registered nurses to determine if the mean salary is higher than the
previous survey. The sample average was ₱10,000 with a sample
standard deviation of ₱2,500.
4. Records of the Department of Health (DOH) revealed that 14.7% of the
country's Filipino smokers have maintained their habit of smoking.
39
What I Have Learned
1. What is a parameter?
________________________________________________________________________
2. What are the two commonly used parameters? What are their symbols or
notations?
________________________________________________________________________
3. What are the other notations used as parameters?
________________________________________________________________________
4. To identify the parameter to be tested in a claim/hypothesis, what are
the concepts to consider?
________________________________________________________________________
What I Can Do
List down five (5) different real-life situations where hypothesis testing can
be done. Identify the parameter to be tested in each situation.
1. ________________________________________________________________________
2. ________________________________________________________________________
3. ________________________________________________________________________
4. ________________________________________________________________________
5. ________________________________________________________________________
Assessment
40
3. Anna wants to estimate the average shower time of teenagers. From the
sample of 50 teenagers, she found out that it takes 5 minutes for
teenagers to shower. What is the parameter?
A. sample of 50 teenagers C. average shower time of teenagers
B. 50 teenagers in 5 minutes D. took 5 minutes for teenagers to shower
11. A researcher claims that the mean monthly consumption of coffee per
person is more than 19 cups. In a sample of 60 randomly selected
people, the mean monthly consumption was 20. The standard deviation
41
of the sample was 4 cups. Which is the parameter to be tested in this
claim?
A. sample of 60 randomly selected people
B. mean consumption of 60 selected people
C. the mean consumption of coffee per person
D. the standard deviation of the sample which was 4 cups
12. A certified public accountant (CPA) claims that more than 30% of all
accountants advertise. What kind of parameter is used in this claim?
A. mean B. variance C. proportion D. standard deviation
Additional Activities
2. A car dealer claims that the average price of Honda Vios is at least
₱662,000.00. A client suspected that the claim is incorrect and found
42
that random sample of 15 similar vehicles has the mean price of
₱640,000.00 and standard deviation of ₱ 24,000.00. Is there enough
evidence to reject the dealer’s claim at 𝛼 = 0.05?
References
Books
Sirug, Winston S. Statistics and Probability for Senior High School CORE
Subject A Comprehensive Approach K to 12 Curriculum Compliant.
Manila: Mindshapers Co., Inc., 2017.
Online Resources
Mahesh. “Everything You Need to Know About Hypothesis Testing - Part I.”
Accessed May 20, 2020. https://towardsdatascience.com/everything-
you-need-to-know-about-hypothesis-testing-part-i-4de9abebbc8a
43
Statistics
Quarter 2 – Module 3:
Formulating Appropriate Null
and Alternative Hypotheses on a
Population Mean
44
What I Need to Know
In this module, you will learn how to formulate null and alternative
hypotheses on a population mean.
What I Know
2. A vacuum cleaner consumes less than 46 kwh per year. What hypothesis
test can you use in this claim?
A. left-tailed C. null hypothesis
B. right- tailed D. alternative hypothesis
45
4. The sign of the alternative hypothesis in a left-tailed test is always ________
A. Equal C. less than
B. not equal D. greater than
10. Which is the correct null hypothesis of the claim below? “Students take
an average of less than five (5) years to graduate from college.”
A. 𝐻𝑜 : µ = 5 B. 𝐻𝑜 : µ < 5 C. 𝐻𝑜 : µ ≠ 5 D. 𝐻𝑎 : µ < 5
11. In driver’s test, an average of 300 drivers pass on their first try. We want
to test if more than an average of 300 passes on the first try. Which
46
inequality symbols is correct (=, ≠, ≥, <, ≤, >) for the null and alternative
hypotheses - 𝐻𝑜 : µ __ 300 𝐻𝑎 : µ __ 300?
A. <, > B. =, ≠ C. ≤, ≥ D. = , >
13. In a commercial, a new diet program would like to claim that their
methods result in a mean weight loss of more than 22kgs in two (2)
weeks. To determine if this is a valid claim, they hire an agency that then
selects 25 people to be placed on this diet. What is the test of
hypothesis?
A. null C. one tailed- test
B. alternative D. two tailed- test
15. A survey reported that teenagers spend an average at most four (4)
hours each day on social media. The organization thinks that, currently,
the mean is higher. Fifteen (15) randomly chosen teenagers were asked
how many hours per day do they spend on social media. The sample
mean was 4.5 hours with a sample standard deviation of 2.0. What is
the test of hypothesis?
A. left-tailed test C. hypothesis test
B. two-tailed test D. right-tailed test
47
Lesson Formulating Appropriate Null
What’s In
1. The survey shows that the number of students (n) who have parents with
a house of their own is less than 20.
A. 𝑛 < 20 B. 𝑛 > 20 C. 𝑛 ≤ 20 D. 𝑛 ≥ 20
48
Guide Questions:
1. How did you find the previous activity? Was it easy or difficult?
2. What previously learned principle did you apply in the activity?
3. Were you able to determine the correct expression that correspond to
each situation? Elaborate.
4. Do you think you will apply these activities in formulating null and
alternative hypotheses?
What’s New
Directions: Examine the pictures below then answer the guide questions
that follow.
“Effect of a Fertilizer on Plant Growth”
49
Guide Questions:
What Is It
In statistical hypothesis testing, there are always two hypotheses: the null
and alternative hypotheses. Below is a comparison between the two.
50
𝑯𝒐 : µ = 𝒌 𝑯𝒐 : µ ≤ 𝒌 𝑯𝒐 : µ ≥ 𝒌
𝑯𝒂 : µ ≠ 𝒌 𝑯𝒂 : µ > 𝒌 𝑯𝒂 : µ < 𝒌
The claim used the word “less than” which as seen in the table above,
corresponds to the symbol (<). Therefore, the answer is n<20.
Note:
𝐻𝑜 always has = symbol in it. 𝐻𝑎 never has an = symbol in it. The choice of
symbol depends on the wording of the hypothesis test. However, be aware
that many researchers use = (equal sign) in the null hypothesis, even with
> or < as the symbol in the alternative hypothesis. Notice also that the
notation of alternative hypothesis complements the null hypothesis.
Illustrative Examples:
Solution: First, identify the parameter which is the mean height of all
Grade 11 students. Since it is a population mean, use the notation 𝝁.
The claim in this example is that the average weight is 169 cm which
translates to 𝝁 = 𝟏𝟔𝟗 and is considered as null hypothesis. To formulate
51
the alternative hypothesis, write the complement/opposite of the null
hypothesis which is the average weight is not equal to 169 cm.
Solution: In this example, the parameter is the average and the claim
is that the average is at least ₱730,000. The word at least has the
notation of (≥) which means that the claim is at the null hypothesis. In
52
the alternative hypothesis, you will use (<) as its complement.
Therefore:
𝑯𝑶 : µ ≥ ₱𝟕𝟑𝟎, 𝟎𝟎𝟎 or 𝑯𝑶 : µ = ₱𝟕𝟑𝟎, 𝟎𝟎𝟎 (claim)
𝑯𝒂 : µ < ₱𝟕𝟑𝟎, 𝟎𝟎𝟎
Solution: The parameter used in this example is average (µ) and the
claim is that average is at most 240 minutes. The word ‘at most’ has
the notation of (≤) which means that claim is at the null hypothesis.
The null hypothesis would be 𝑯𝟎 : µ ≤ 𝟐𝟒𝟎. To formulate the alternative,
use the notation (>) as the complement of (≤). Therefore, alternative
hypothesis is 𝑯𝒂 : µ > 𝟐𝟒𝟎.
𝑯𝑶 : µ ≤ 𝟐𝟒𝟎 or 𝑯𝑶 : µ = 𝟐𝟒𝟎 (claim)
𝑯𝒂 : µ > 𝟐𝟒𝟎
On the other hand, some hypotheses predict only that one value will
be different from another, without additionally predicting which will be
higher. The test of such a hypothesis is nondirectional or two-
tailed because an extreme test statistic in either tail of the distribution
(positive or negative) will lead to the rejection of the null hypothesis of no
difference.
One-Tailed Two-Tailed
Alternative hypothesis contains Alternative contains the
the greater than (>) or less than inequality (≠) symbol.
(<) symbols
It is directional (either right-tailed It has no direction.
or left-tailed)
53
The table below shows the null and alternative hypotheses stated
together with the directional test.
2. A piggery owner believes that using organic feeds on his pigs will
yield greater income. His average income from the previous year
was ₱120, 000. State the hypothesis and identify the directional
test.
In this example, the null hypothesis is 𝑯𝑶 ∶ 𝝁 = 𝟏𝟐𝟎, 𝟎𝟎𝟎 . You may
notice that the hypothesis used the phrase ‘greater income’ that is
associated with greater than. Therefore, 𝑯𝒂 ∶ 𝝁 > 𝟏𝟐𝟎, 𝟎𝟎𝟎. This
hypothesis uses inequality symbol (>) so it is one-tailed test and it uses
greater than which specifically called for the right-tailed test.
𝑯𝑶 ∶ 𝝁 = 𝟏𝟐𝟎, 𝟎𝟎𝟎 and 𝑯𝒂 ∶ 𝝁 > 𝟏𝟐𝟎, 𝟎𝟎𝟎 (right-tailed test)
54
What’s More
1. A school principal claims that the Grade 11 students in her high school
have a mean score of 92.
Parameter: ___________ Null Hypothesis: ___________
Claim: mean score of 92 Alternative Hypothesis: 𝐻𝑎 : µ ≠ 92
4. The Bureau of Internal Revenue claims that the mean wait time for
taxpayer during a recent tax filing is at most 8.7 minutes. A random
sample of 11 taxpayers has a mean wait time of 8.7 minutes and a
standard deviation of 2.7 minutes. Is there enough evidence to reject the
claim at a significance level of 0.10?
Parameter: mean Null Hypothesis: ___________
Claim: ___________ Alternative Hypothesis: 𝐻𝑎 : µ > 8.7
55
Activity 3. Let’ s Hypothesize
1. Mrs. Dela Cruz claims that her students scored an average of 91 in their
Mathematics quiz. The master teacher wants to know whether the
teacher’s claim is acceptable or not.
𝐻𝑜 : _________________________________________________
𝐻𝑎 : _________________________________________________
2. A car manufacturer claims that the mean selling price of all cars
manufactured is only ₱150,000. A consumer agency wants to test
whether the mean selling price of all the cars manufactured exceeds
₱150, 000.
𝐻𝑜 : _________________________________________________
𝐻𝑎 : _________________________________________________
4. A bus company in Manila claims that the mean waiting time for a bus
during rush hour is less than 12 minutes. A random sample of 30
waiting times has a mean of 15 minutes with a standard deviation of 4.8
minutes.
𝐻𝑜 : _________________________________________________
𝐻𝑎 : _________________________________________________
56
Activity 4. One-Tailed or Two-Tailed!
1. A used car dealer says that the mean price car in the Philippines is at
least ₱350,000.
Activity 5. Formu-Tail
57
4. The average price of a certain type of car is greater than ₱600,000.
𝐻𝑜 : _________________ 𝐻𝑎 : _________________ _______- tailed test
6. A study claims that the mean survival period for certain cancer patients
treated immediately with chemotherapy and radiation is 24 months.
𝐻𝑜 : _________________ 𝐻𝑎 : _________________ _______- tailed test
7. The average pre-school cost for tuition fees last year was ₱ 15,500. The
following year, 20 schools had a mean of ₱ 13, 100 and standard
deviation of ₱ 2,500.
𝐻𝑜 : _________________ 𝐻𝑎 : _________________ _______- tailed test
9. The principal of Mabundok High School claims that the students in his
school have above average intelligence. A random sample of 30 students’
IQ scores have a mean score of 113. The mean population IQ is 100 with
a standard deviation of 15. Is there an evidence to support his claim?
𝐻𝑜 : ________________ 𝐻𝑎 : __________________ _______-tailed test
10. The owner of BYD manufacturer claims that their batteries last an
average of at most 350 hours under normal use. A researcher randomly
selected 20 batteries from the production line and tested them. The
tested batteries had a mean life span of 270 hours with a standard
deviation of 50 hours.
𝐻𝑜 : ________________ 𝐻𝑎 : __________________ _______-tailed test
58
What I Have Learned
What I Can Do
Cite five (5) research questions used in real life and formulate your null and
alternative hypotheses.
Example: Is it true that turmeric can prevent viruses?
𝐻𝑜 : Drinking turmeric cannot prevent viruses.
𝐻𝑎 : Drinking turmeric can prevent viruses.
59
Assessment
4. If the researcher wishes to test the claim that the mean of the population
is 75, the appropriate null hypothesis is:
A. 𝜇 ≤ 75 B. 𝜇 ≥ 75 C. 𝜇 ≠ 75 D. 𝜇 = 75
60
8. “A modern approach in advertisement will not increase the demand for a
product.” This is an example of _______________ hypothesis.
A. Null C. alternative
B. Mean D. right-tailed
13. A teacher in Math announced that the mean score of Grade 9 students in
the first quarterly assessment in Mathematics was 89 and standard
deviation was 6. One student, who believed that the mean score was less
than this, randomly selected 30 students and computed the mean score.
What kind of test of hypothesis can describe this?
A. left-tailed B. two-tailed C. right-tailed D. multiple-tailed
61
15. Which directional test is illustrated in the given problem below?
In a recent survey, the average amount of money students have in their
wallet is ₱200.00 with standard deviation of 45. A teacher feels that the
average amount is lower. She surveyed 80 randomly selected students
and found that the average amount is ₱35.
A. left-tailed B. two-tailed C. alternative D. right tailed
Additional Activities
62
References
Textbooks
Online Resources
63
Statistics and
Probability
Quarter 2 – Module 4:
Identifying Appropriate Test
Statistics Involving Population
Mean
64
What I Need to Know
In the previous module, you have learned more about hypothesis. You
identified the two kinds of hypotheses and the directionality test of
hypothesis. The module also discussed about the notations commonly used
in formulating a hypothesis. You also accomplished activities identifying the
test of hypothesis to be used after formulating null and alternative
hypotheses.
This time, you are ready to identify the test statistic to be used when
the population variance is known and unknown.
What I Know
1. If the variance is unknown and the sample size is small, which test
statistic is appropriate?
A. t-test C. two-tailed test
B. z-test D. one-tailed test
65
3. Based on the Central Limit Theorem, when the sample (n) is extremely
large and the variance is known, what is the statistical test to be used?
A. t-test C. two-tailed test
B. z-test D. one-tailed test
8. The t-test for single sample mean may be used when all the following
conditions are true except ____________.
A. Sample size is less than 30.
B. Sample standard deviation (𝑠) is known.
C. Population standard deviation (𝜎) is known.
D. Data are approximately normally distributed.
10. A tire manufacturer tests the braking performance of one of its tire
models on a test track. From long-term records, the company knows the
value of σ. The company tried the tires on 10 different cars, recording the
stopping distance for each car on both wet and dry streets. Which test
statistic is appropriate to use?
A. t-test C. one-tailed test
B. z-test D. hypothesis test
66
11. “The average production of corn in the Philippines is 3,000 kgs. A new
plan on food has been developed and is tested on 60 plots. The mean
yield with the new plan on food is 3,200 kgs with standard deviation of
600 kgs. At α = 0.05 level of significance, can you conclude that the
production increased?” What test statistic is to be used on the given
problem?
A. t-test C. left-tailed test
B. z-test D. right-tailed test
12. In the given situation below, identify the population standard deviation.
“In a recent survey, the average amount of money a college student gets
is ₱200.00 with a standard deviation of ₱62.00. A teacher feels that the
average amount is higher. She surveys 80 randomly selected students
and finds that the average amount is ₱245.”
A. 𝜎 = 80 B. 𝜎 = ₱62.00 C. 𝜎 = ₱200.00 D. 𝜎 = ₱245.00
13. An agent believes that the average closing cost of purchasing a new home
is ₱328,250. She selects 40 new home sales at random and finds that the
average closing cost is ₱333,300. The standard deviation of the
population is ₱6,060. What is the test statistic appropriate to used?
A. t-test C. standard deviation
B. z-test D. Central Limit Theorem
14. What test static is appropriate to use in the given problem below? “A
random sample of 29 medical doctors showed that they work an average
of 55 hours per week with a standard deviation of 7.5 hours per week. If
the average is 48 hours per week, is this given evidence significantly
greater than the rest of the medical doctors?”
A. t-test C. variance
B. z-test D. two-tailed test
15. Last 2015, the government made a claim that the average income of the
Filipino people was ₱18,000. However, a sample was taken recently
showing an average income of ₱20,000 with a population standard
deviation of ₱1,300. Which test statistic is appropriate to use?
A. t-test C. one -tailed test
B. z-test D. two-tailed test
67
Lesson Identifying Appropriate Test
In the previous module, you were taught how to formulate null and
alternative hypotheses. You are now ready to analyze statistical hypothesis
to determine the correct test statistics to be used in computing the results
and making decisions.
What’s In
68
Follow-up Questions:
What’s New
Directions: Determine the needed data for each given problem. First, read
and understand the examples below before you proceed to the items that
follow.
Examples:
69
2. According to a cell phone company, the average price of cellular phone in
the Philippines is ₱12,999. However, in a sample of 20 costumers
randomly asked about the price of their cellular phone, data collected
showed an average of ₱9,999 and standard deviation of ₱7,999. Using
𝛼 = 0.05 level of significance, is there enough evidence proving that the
average price of cellular phone is less than ₱12,999?
𝜇 = 12,999 𝑥̅ = 9,999 𝑛= 20 𝑠 = 7,999
1. The average number of ad clicks per day for Facebook before was
192,000 and the standard deviation was 100,000. Sixty-four (64) days
after the redesign, the mean number of ad clicks per day was 200,000.
𝜇 = ______ 𝑥̅ = ______ 𝑛 = ______ 𝜎 = ______
Guide Questions:
70
What Is It
Example:
Now you already know how to get the data needed in choosing test
statistics. This time, you will determine what test statistic is appropriate in
computing test value in the hypothesis testing.
71
A test statistic is a random variable that is calculated from sample
data and used in a hypothesis test. You can use test statistics to determine
whether to reject or accept the null hypothesis. The test statistic compares
your data with what is expected under the null hypothesis.
To identify the test statistic, you must consider whether the
population standard deviation/variance is known or unknown. If the
population standard deviation σ is known, then the mean has a normal
distribution. Use z-test. If the population standard deviation σ is unknown,
then the mean has a t- distribution. Use t-test. Instead of the population
standard deviation, use the sample standard deviation.
z-test
In a z-test, the sample is assumed to be normally distributed. A z-score
is calculated with population parameters such as “population
mean” and “population standard deviation”. It is used to validate a
hypothesis that the sample drawn belongs to the same population. When the
variance is known and either the distribution is normal or sample size is
large, use a z-test statistic.
t-test
Like a z-test, a t-test also assumes a normal distribution of the
sample. A t-test is used when the population variance or standard deviation
are not known. When the variance is unknown and a sample size is less
than 30, use a t-test statistic assuming that the population is normal or
approximately normal.
72
The table shows what test statistic is appropriate when:
Population Variance Is Population Variance Is Central Limit Theorem
Known Unknown (CLT)
Population is normal or Population may not be
Population is normally
nearly normally normally distributed.
distributed.
distributed.
𝑛 ≥ 30 or considered
𝑛 ≥ 30 𝑛 < 30
sufficiently large
Population standard Sample standard
Variance is known/
deviation (𝜎) is known. deviation (s) is known.
unknown.
Population standard
deviation (𝜎) is unknown.
Use z-test by replacing
population standard
z-test t-test deviation (𝜎) by sample
standard deviation (𝑠) in
the formula.
Identifying Appropriate Test Statistic
𝒏 ≥ 𝟑𝟎 𝒏 < 𝟑𝟎
Illustrative Examples:
1. A manufacturer claimed that the average life of batteries used in their
electronic games is 150 hours. It is known that the standard deviation of
this type of battery is 20 hours. A consumer wished to test the
manufacturer’s claim and accordingly tested 100 electronic games using
the battery. It was found out that the mean is equal to 144 hours.
Here, the sample size (n) is 100 (extremely large) and population
standard deviation (20 hours) is known, then the appropriate test
statistic to be used is z-test.
73
The sample size (n) is 12 which is less than 30 and sample
standard deviation (5 words per minute) was given. Therefore, the
appropriate test is t-test.
Note:
The illustrative examples above used standard deviations instead of
variances. Variance is the square of the standard deviation and conversely,
the standard deviation is the square root of the variance. Hence, if the
standard deviation is known in the problem, then basically, variance is also
known.
What’s More
74
students. Among the sampled students, the average IQ is 108 with a
standard deviation of 10.
5. A new energy-efficient lawn mower engine was developed by a well-known
inventor. He claims that the engine will run continuously for 5 hours on
a single gallon of regular gasoline. From his stock of 2,000 engines, the
inventor selects a simple random sample of 50 engines for testing. The
engines run for an average of 295 minutes with a standard deviation of
20 minutes.
Activity 4. Check It Out!
Directions: Read and analyze each problem. On the table below, put a
check on the columns of the criteria that correspond to the given problem.
75
𝒏 ≥ 𝟑𝟎 𝒏 < 𝟑𝟎 𝝈 is known. 𝝈 is unknown. z-test t-test
1.
2.
3.
4.
5.
___________2. Based on the report of the school nurse, the average height of
Grade 11 students has increased. Five years ago, the average height of
Grade 11 students was 170cm with standard deviation of 38cm. She took a
random sample of 150 students and derived the average height of 165cm.
76
What I Have Learned
Complete the following sentences by filling each blank with the correct word
or phrase.
1. __________________ is a random variable that is calculated from sample
data and is used in a hypothesis test.
2. ____________ includes all of the elements from a set of data while
______________ consists of one or more observations drawn from the
population.
3. ___________ is a measure of variability calculated from every individual in
the population while ______________ is calculated from only some of the
individuals in a population.
4. The two common test statistics to be computed in hypothesis testing are
________________________ and ____________________________________.
5. A z-score is calculated with population parameters such as population
mean and ______________________.
6. A t-test is used when the __________________ or standard deviation is not
known.
7. The number of sample for z-test is ________________________ while
________________________ in t-test.
8. If the population standard deviation is known, use
______________________ and if it’s unknown, use
________________________.
9. The notations that need to be considered in identifying test statistics are
_____________________ and ____________________.
10. If the number of samples is sufficiently large and the variance is
unknown, then ________________________ is appropriate to be used.
What I Can Do
Make a comics strip on how to determine the appropriate tool when the
variance is known, variance is unknown, and when Central Limit Theorem
is used. Your work will be evaluated using the following rubric.
77
Clear Understanding of Mathematical Concept 30
Organization and Accuracy of Solution(s) 30
Clear Understanding of Vocabulary 10
Accuracy of Analysis 20
Presentation 10
Total 100
Assessment
78
7. If in a sample n=16 selected from a normal population, 𝑥̅ = 56 and 𝑠 = 12,
what statistical test is applicable to be used?
A. f-test C. z-test
B. t-test D. Central Limit Theorem
8. Based on Central Limit Theorem, the z-test for single sample may be
used when all the following conditions are TRUE except
_________________.
A. Sample size is less than 30.
B. Data are normally distributed.
C. Population standard deviation is known.
D. Population standard deviation is unknown.
9. What is the sample standard deviation if a simple random sample of 220
students is drawn from a population of 2,740 college students? Among
the sampled students, the average IQ score is 115 with standard
deviation of 10.
A. 10 B. 115 C. 220 D. 2,740
10. The supervisor of a certain company claimed that the mean workday of
his workers is 8.3 hours per day. A sample of 20 workers was taken and it
was found out that the mean workday is 8 hours with standard deviation
of 1 hour. At 0.01 level of significance, is the mean workday less than 8.3
hours?
What test statistic is to be used in the given problem?
A. z-test C. right-tailed test
B. t-test D. left-tailed test
12. A leader of an association of jeepney driver claims that the average daily
take-home pay of all jeepney drivers in Caloocan is ₱350.00. A random
sample of 100 jeepney drivers in Caloocan was interviewed and the take-
home pay was found to be ₱420.00. If 0.05 significance level was used to
find out whether the average take home pay is different from ₱350.00 and
population variance was assumed to be ₱92.00, what is the appropriate
test statistic?
A. t-test C. left-tailed test
B. z-test D. right-tailed test
13. L.V. Co. has an average sale of ₱37 million per week from their products
in all their outlets. An area manager found out that the average gross
sales from the 28 outlets under her jurisdiction is ₱32.5 million per week
with standard deviation of ₱1.5 million. Does the mean sales of all outlets
79
differ from the mean sales of the 28 outlets under her jurisdiction? In the
given problem, what statistical tool is suitable to use?
A. t-test C. ANOVA
B. z-test D. chi-square test
14. A cellular battery manufacturer claims that his battery when fully
charged has mean life of 24 hours with standard deviation of 4 hours. A
dealer randomly chose sample of 35 batteries to be tested and resulted to
22.5 hours mean life. In the given situation, 22.5 hours is __________.
A. sample mean C. sample standard deviation
B. number of sample D. population standard deviation
15. According to a study, there is an increase on average monthly expenses
of ₱250.00 for cell phone loads of Senior High School students in the city.
Is there a reason to believe that the amount increased if sample of 60
students has an average monthly expense of ₱280.00 and the population
standard deviation is ₱77.00? What is the tool to be used in computing
the test value?
A. z-test C. left-tailed test
B. t-test D. alternative test
Additional Activities
80
References
Textbooks
Caraan, Avelino Jr S. Introduction to Statistics & Probability: Modular
Approach. Mandaluyong City: Jose Rizal University Press, 2011.
Chua, Jedd Amerson S. Soaring 21st Century Mathematics: Statistics and
Probability. Quezon City: Phoenix Publishing House Inc., 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E
Publishing Inc, 2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE
Subject A Comprehensive Approach K to 12 Curriculum Compliant.
Manila: Mindshapers Co., Inc., 2017.
Online Resources
LaMorte, Wayne W. “Central Limit Theorem (CLT).” Accessed May 27, 2020
http://sphweb.bumc.bu.edu/otlt/MPHModules/BS/BS704_Probabili
ty/ BS704_Probability12.html
MacEwan University. “1 Hypotheses Test About µ If σ Is Not Known.”
Accessed May 27, 2020. https://academic.macewan.ca/burok/
Stat141/notes/ttests.pdf
Nigam, Vibhor. “Statistical Tests - When to Use Which?” Accessed May 26,
2020. https://towardsdatascience.com/statistical-tests-when-to-use-
which-704557554740
81
Statistics and
Probability
Quarter 2 – Module 5:
Identifying the Appropriate
Rejection Region for a Given
Level of Significance
82
What I Need to Know
Since you already know how to choose the test statistic applicable in
hypothesis testing, you are now ready to identify the appropriate rejection
region when population variance is known or unknown. In determining
rejection region, you will also be defining other statistical concepts such as
critical value.
What I Know
83
3. For a two-tailed test with variance unknown, n= 19, and 𝛼 = 0.05, what is
the critical value?
A. ±2.092 B. ±2.101 C. ±2.145 D. ±2.878
4. For a two-tailed test with a sample size of 40, the null hypothesis will be
rejected at 5% level of significance if the test statistic is:
A. 𝑧 ≤ −1.28 𝑜𝑟 𝑧 ≥ 1.28 C. 𝑧 ≤ −1.96 𝑜𝑟 𝑧 ≥ 1.96
B. 𝑧 ≤ −1.645 𝑜𝑟 𝑧 ≥ 1.645 D. 𝑧 ≤ −2.33 𝑜𝑟 𝑧 ≥ 3.33
5. If the alpha level is increased from 0.01 to 0.05, then the boundaries for
the critical region move farther away from the center of the distribution.
A. True C. both A and B
B. False D. cannot be determined
6. In the two-tailed test, the rejection region lies on ___________ of the
normal distribution.
A. center B. left tail C. right tail D. both tails
7. Given the illustration at the right, which of the following is NOT TRUE?
A. This is a left-tailed test.
B. This is a right-tailed test. 1.645
C. This has a critical value of 1.645.
D. This has a level of significance of 0.5.
8. Given the normal curve at the right, what is the rejection region?
A. 𝑧 ≤ 1.645 𝑜𝑟 𝑧 ≥ 1.645
B. 𝑧 ≥ −1.645 𝑜𝑟 𝑧 ≥ 1.645
C. 𝑧 ≥ −1.96 𝑜𝑟 𝑧 ≤ 1.96 -1.96 1.96
D. 𝑧 ≤ −1.96 𝑜𝑟 𝑧 ≥ 1.96
9. What is the critical value if the population variance is unknown, 𝑛 = 13,
𝛼 = 0.05, and it is a one-tailed test?
A. 𝑡 =1.782 B. 𝑡 =2.179 C. 𝑡 =2.681 D. 𝑡 =3.055
10. Given a two-tailed test, population variance is known, and 𝛼 = 0.10, what
is critical region?
A. 𝑧 ≥ 1.28 C. ≤ −2.33 or 𝑧 ≥ 2.33
B. 𝑧 ≤ −1.96 D. 𝑧 ≤ −1.645 or 𝑧 ≥ 1.645
11. Which of the following is the sketch of the normal curve if 𝑧 ≥ 1.645?
A. B. C. D.
84
13. In the given problem below, identify the rejection region.
It is claimed that the mean distance of a certain type of vehicle is 35
miles per gallon of gasoline with population standard deviation σ = 5
miles. What can be concluded about the claim using α = 0.1 if a random
sample of 49 such vehicles has sample mean, x̅ = 36 miles?
A. 𝑧 ≤ −1.28 C. 𝑧 ≤ −1.645 𝑜𝑟 𝑧 ≥ 1.645
B. 𝑧 ≥ 2.33 D. 𝑧 ≤ −2.575 𝑜𝑟 𝑧 ≥ 2.575
14. Based on the problem in no. 13, which is the correct graph?
A. B. C. D.
Before we discuss the topic, let us recall some concepts that will lead
you to the concept of rejection region.
85
What’s In
Directions: Given a standard normal curve, shade the required area with
color GREEN and for the remaining area, use color RED.
86
What’s New
Guide Questions:
1. How did you find the activity?
2. What are the similarities and differences of the two problems?
3. Have you encountered previously learned statistical concepts? If yes,
will you discuss those concepts?
4. Were you able to answer all the follow-up questions? If not, why?
5. What are the concepts that seemed to be familiar and unfamiliar to
you?
6. How do these concepts relate to the rejection region?
87
What Is It
Level of Significance
Test Type
𝛼 = 0.01 𝛼 = 0.025 𝛼 = 0.05 𝛼 = 0.10
left-tailed test −2.33 −1.96 −1.645 −1.28
right-tailed test 2.33 1.96 1.645 1.28
two-tailed test ±2.575 ±2.33 ±1.96 ±1.645
88
Examples:
Find the critical z values. In each case, assume that the normal distribution
applies.
1. left-tailed test with α= 0.01 𝒛 = −𝟐. 𝟑𝟑 (based on the table of critical value
of z)
2. two-tailed test with α=0.05 𝒛 = ±𝟏. 𝟗𝟔
3. right-tailed test with α=0.025 𝒛 = 𝟏. 𝟗𝟔
89
20 1.725 2.086 2.528 2.845
21 1.721 2.080 2.512 2.831
22 1.717 2.074 2.508 2.819
23 1.714 2.069 2.500 2.807
24 1.711 2.064 2.492 2.797
25 1.708 2.060 2.485 2.787
26 1.706 2.056 2.479 2.779
27 1.703 2.052 2.473 2.771
28 1.701 2.048 2.467 2.763
29 1.699 2.045 2.462 2.756
30 1.697 2.042 2.457 2.750
Examples:
a) Find the critical t-value for a left-tailed test with α= 0.05 and n =21.
Answer: 𝒕 = −𝟏. 𝟕𝟐𝟓
b) Find the critical t-value for a right-tailed test with α=0.01 and n = 17.
Answer: 𝒕 = 𝟐. 𝟓𝟖𝟑
c) Find the critical t-values for a two-tailed test with α=0.05 and n =26.
Answer: 𝒕 = ±𝟐. 𝟎𝟔𝟎
The “tails” of a test are the values outside of the critical values. In
other words, the tails are the ends of the distribution and they begin at the
greatest or least value in the alternative hypothesis (the critical values).
90
Rejection Region If Population Variance Is Unknown
A left-tailed test only has a tail on the left side of the graph.
rejection
region
A right-tailed test only has a tail on the right side of the graph
rejection
region
A two-tailed test has tails on both ends of the graph. This is a test
where the null hypothesis is a claim of a specific value.
rejection
rejection
region
region
Illustrative Examples:
Determine the critical values and the appropriate rejection region. Sketch
the sampling distribution.
91
hypothesis test is right-tailed, so the inequality symbol would be ≥. Hence, the
rejection region for a one-tailed test is z ≥ 1.645.
To sketch the graph, locate first the critical value of 1.645 which is
between the 1 and 2 in the normal curve. Then, shade the region greater than
the critical value because it is a right-tailed test.
rejection
region critical value
𝒛 = −𝟐. 𝟑𝟐𝟔
In the first three examples, you were able to find rejection region given
the hypothesis test, population variance known or unknown, number of
sample, and level of significance. The following example will discuss on how
to determine the appropriate rejection region in a real-life problem.
92
4. A survey reports a customer in the drive thru lane of one fast food
chain spends eight minutes to wait for his/her order. A sample of 24
customers at the drive thru lane showed mean of 7.5 minutes with a
standard deviation of 3.2 minutes. Is the waiting time at the drive
thru lane less than that of the survey made? Use 0.05 significance
level.
Hypotheses Hypothesis Population Level of Number z-value
Test Standard Significance of or t-
Known/Unknown Sample value
𝑯𝑶 : 𝝁 = 𝟖, left-tailed 𝛔 is unknown. 𝜶 = 𝟎. 𝟎𝟓 𝒏 = 𝟐𝟒 t-value
𝑯𝒂 : 𝝁 < 𝟖, test
A one-tailed test with 0.05 level of significance has 95% of the area
under the curve outside of the critical region. Since the variance is unknown,
we use t-score with df = 24-1=23 as the reference to determine the critical
value. This is a left-tailed test, so the critical value we need is negative. The
critical value is 2. 069 and the rejection region is 𝒕 ≤ −𝟐. 𝟎𝟔𝟗.
5. A banana company claims that the mean weight of their banana is 150
grams with a standard deviation of 18 grams. Data generated from a
sample of 49 bananas randomly selected indicated a mean weight of
153.5 grams per banana. Is there sufficient evidence to reject the
company’s claim? Use 𝛼 = 0.05.
Hypotheses Hypothesis Population Level of Number z-value
Test Standard Significance of or
Known/Unknown Sample t-value
𝑯𝑶 : 𝝁 = 𝟏𝟓𝟎 two-tailed σ is known. 𝜶 = 𝟎. 𝟎𝟓 𝒏 = 𝟒𝟗 z-
𝑯𝒂 : 𝝁 ≠ 𝟏𝟓𝟎 test value
93
After you find the appropriate rejection region, you will then
compute the standard (z or t) value based on the given data in the
hypothesis problem. If the computed value is in the rejection region,
then reject the null hypothesis and if not, do not reject the null
hypothesis. More discussions about this decision making will be on the
next module.
What’s More
94
Activity 4. Think Critically!
Directions: Identify the critical value of each given problem. Find the
rejection region and sketch the curve on a separate sheet of paper.
1. 𝐻𝑂 : 𝜇 = 90
𝐻𝑎 : 𝜇 ≠ 90
The sample mean is 69 and sample size is 35. The population follows a
normal distribution with standard deviation 5. Use 𝛼 = 0.05.
2. A survey reports the mean age at death in the Philippines is 70.95 years
old. An agency examines 100 randomly selected deaths and obtains a
mean of 73 years with standard deviation of 8.1 years. At 1% level of
significance, test whether the agency’s data support the alternative
hypothesis that the population mean is greater than 70.95.
3. The mean time costumer waits in line before checking in a grocery chain
is less than 10 minutes. To verify the performance of the store, the
obtaining mean time of 25 costumers is 9.5 minutes with standard
deviation of 1.6 minute. Use these data to test the null hypothesis that
the mean time is 10 minutes, at 0.01 level of significance.
4. A fast food restaurant cashier claimed that the average amount spent by
the customers for dinner is ₱125.00. Over a month period, a sample of 50
customers was selected and it was found that the average amount spent
for dinner was ₱130.00. Using 0.05 level of significance, can it be
concluded that the average amount spent by customers is more than
₱125.00? Assume that the population standard deviation is ₱7.00.
5. According to the radio announcer, the average price of kilogram of pork
liempo is more than ₱210.00. However, a sample of 15 prices randomly
collected from different markets showed an average of ₱215.00 and
standard deviation of ₱9.00. Using 0.05 level of significance, is there
sufficient evidence to conclude that the average price of pork liempo is
more than ₱210.00?
95
4. Z-score is used when the population standard deviation is _____________
while t-score is used when the population standard deviation
is_____________.
5. _____________, also known as the critical region, describes the entire area
of values that indicates you reject the null hypothesis.
6. The values outside the critical values are the _____________.
7. To determine the critical region if population variance is known, use table
for _____________ distribution while if the variance is unknown, use table
for _____________ distribution.
8. If the hypothesis test is a right-tailed test, then the z-values or t-values
on the rejection region are _____________ the critical value.
9. When the given hypothesis test is a two-tailed test, then the rejection
regions are on ___________________ tails of the distribution.
10. To sketch the graph of the rejection region, locate first the _____________.
What I Can Do
96
Assessment
3. For a two-tailed test with variance unknown, n= 16, and 𝛼 = 0.05, what is
the critical value?
A. ±2.092 B. ±2.134 C. ±2.145 D. ±2.145
4. For a one-tailed test with a sample of 15, the null hypothesis will not be
rejected at 5% level of significance if the test statistics is:
A. 𝑡 ≤ −1.761 B. 𝑡 ≤ −1.753 C. 𝑡 ≤ −1.703 D. 𝑡 ≤ −1.697
5. If the level of significance decreased from 0.1 to 0.05, then the
boundaries for the critical region move farther away from the center of
the distribution.
A. true B. false C. both A and B D. cannot be determined
6. In a right-tailed test, the rejection lies in the ________ tails of distribution.
A. up B. left C. right D. down
7. Based on the graph, which of the following is TRUE?
A. This is a two-tailed test.
B. This is a right-tailed test.
−1.725
C. Level of significance is 0.025.
D. The rejection region is 𝑡 ≤ -1.725.
8. What is the rejection region of the given normal curve at the right?
A. 𝑧 ≥ 1.28
B. 𝑧 ≥ 1.645
C. 𝑧 ≥ 1.96
D. 𝑧 ≤ 2.33
9. Given a left-tailed test, population standard deviation is unknown, 𝑛 =
27, 𝛼 = 0.01, what is the critical value?
97
A. 𝑡 = −2.528 B. 𝑡 = −2.479 C. 𝑡 = −1.706 D. 𝑡 = 2.479
10. What is the critical value if the population variance is known, 𝑎 = 0.025,
and it is a two-tailed test?
A. 𝑧 = ±1.28 B. 𝑧 = ±1.645 C. 𝑧 = ±1.96 D. 𝑧 = ±2.33
11. Which of the following is the correct illustration of rejection region 𝑡 ≤
−1.943?
A. B. C. D.
12. Which of the following is the sketch of the normal curve if 𝑧 < −1.645 𝑜𝑟
𝑧 > 1.645?
A. B. C. D.
13. Given the graph of the normal curve at the right, what are the directional
test of hypothesis and critical z value if 𝛼 = 0.01?
A. two-tailed test, ±2.33
B. two-tailed test, ± 2.575
C. left-tailed test, −1.645
D. right-tailed test, 1.645
A. C.
B. D.
98
Additional Activities
Activity 5. Do It Now!
Directions: Read and analyze the given problem. Supply the data being
asked for on the items that follow.
1. Effects of drug and alcohol on the nervous system have been the subject
of significant research. A neurologist wants to test the effect of a drug by
injecting 100 rats with a unit dose of the drug, subjecting each rat to
stimulus, and recording its response time. It has been found out that the
mean is x̅ = 1.05 with standard deviation of s = 0.5. The mean response
time of a rat not to respond is 1.2 seconds. She wishes to test whether
the mean response time for drug-injected rats differs from 1.2 seconds.
Assume that the population is normal using α = 0.05.
99
References
Textbooks
Online Resources
Bognar, Matt. “Normal Curve Generator.” Accessed May 29, 2020
https://homepage.divms.uiowa.edu/~mbognar/applets/normal.html
Ku Leuven. “Critical Region.” Accessed May 28, 2020 https://lstat.
kuleuven.
be/training/coursedescriptions/Goodyear/critical_region.pdf
LibreTexts. “Testing Hypothesis.” Accessed May 29, 2020
https://stats.libretexts.org/Bookshelves/Introductory_Statistics/
Book%3A_Introductory_Statistics_(Shafer_and_Zhang)/08%3A_Testin
g_Hypotheses/8.E%3A_Testing_Hypotheses_(Exercises)
Stephanie, Glen. “Critical Values: Find a Critical Value in Any Tail.”
Accessed May 28, 2020 https://www.statisticshowto.com/
probability-and-statistics/find-critical-values/
100
Statistics and
Probability
Quarter 2 - Module 6:
Computing of Test Statistic on
Population Mean
101
What I Need to Know
In this module, you will learn how to compute the test statistic on a
population mean particularly the t-test and z-test. It is a skill that you need
to develop to be able to determine whether you reject the null hypothesis or
otherwise (to be discussed in the next module). Perform each activity
independently. If you find any difficulty in answering the exercises, you may
ask the assistance of your teacher or you may consult your peers.
After going through this module, you are expected to:
1. determine the appropriate test statistic to be used in the given
problem/situation; and
2. compute for the test statistic value (population mean).
What I Know
102
5. When finding the z-computed value, which formula should be used for
hypothesis testing?
𝑥̅ −𝜇 𝑥̅ −𝜇 𝜇− 𝑥̅ 𝜇− 𝑥̅
A. 𝑧 = 𝜎 B. 𝑧 = 𝑠 C. 𝑧 = 𝑠 D. 𝑧 = 𝜎
√𝑛 √𝑛 √𝑛 √𝑛
103
12. Find the computed value.
A. -5.916 B. -4.950 C. 4.950 D. 5.916
13. Which test statistic will be used if the sample size is 15?
A. t-test C. cannot be determined
B. z-test D. neither t-test nor z-test
14. Which statistical method can you use when you have a normal
distribution of data?
A. t–test only C. either t–test or z–test
B. z–test only D. neither t–test nor z–test
15. A tire manufacturer claims that its tires have a mean life of 40,000 km. A
random sample of 46 of these tires is tested and the sample mean is
38,000 km. Assume that the population’s standard deviation is 2,000 km
and the lives of the tires are approximately normally distributed.
Determine the computed value at 5% level of significance.
A. -6.782 B. -3.033 C. 3.033 D. 6.782
How do you find this pre-test? Did you encounter both familiar and
unfamiliar terms? Kindly compare your answer in the Answer Key on the
last part of this module
104
Lesson
Computing Test Statistic on
6 Population Mean
Do you still remember when to use t-test? How about z-test? Answer
the activity that follows for a short review on t-test and z-test.
What’s In
Is It T or Z?
105
What’s New
t-Test vs z-Test
YES NO
1.Use: Is the sample
size above
__________
30?
YES NO
2. Use: 3. Use:
__________ __________
I think you are very much ready for this topic. Read, analyze, and
study the given examples carefully.
What Is It
There are two specific test statistics used for hypothesis testing
concerning means: z-test and t-test.
106
On the other hand, t- test is used when 𝑛 < 30, the population is
normal or nearly normal, and sample standard deviation (𝑠) is unknown.
The formula for the t- value is:
𝑥̅ − 𝜇
𝑡= 𝑠
√𝑛
where: 𝑥̅ = sample mean 𝜇 = population mean
𝑛 = sample 𝑠 = sample standard deviation
The degrees of freedom is 𝑛 − 1 or 𝑑𝑓 = 𝑛 − 1.
Example 1: Compute the z-value given the following information. Use one-
tailed test and 0. 05 level of significance.
𝑥̅ = 70 𝜇 = 71.5 𝜎=8 𝑛 = 100
Solution: Since σ is known and n ≥ 30, we will use z-test. Thus, we have:
1.5
𝑧=
0.8
𝐳 = 𝟏. 𝟖𝟕𝟓
Therefore, the computed z-value is 1.875.
Example 2: In the first semester of the school year, a random sample of 200
students got a mean score of 81.72 with a population standard deviation of
15 in Statistics and Probability test. The population mean is 79.83. Use 0.05
level of significance.
Solution: To answer the problem, let us first identify the given. We have:
𝑥̅ = 81.72 𝜇 = 79.83 𝜎 = 15 𝑛 = 200
Since σ is known and n ≥ 30, we will use z-test.
107
𝑥̅ − 𝜇 Use the formula for z-test.
𝑧= 𝜎
√𝑛
81.72 − 79. 83
𝑧= Substitute the given value to the
15
√200 formula.
1. 89
𝑧=
15 Simplify.
14. 14
1. 89
𝑧=
1.06
Therefore, the computed z-value is
𝐳 = 𝟏. 𝟕𝟖𝟑 1.783.
−5
𝑧= Simplify.
30
12.25
−5
𝑧=
2.45 Therefore, the computed z – value
𝐳 = − 𝟐. 𝟎𝟒𝟏 is -2.041.
108
Example 4: Compute the t-value given the following information:
𝑥̅ = 129.5 𝜇 = 127
𝑠=5 𝑛 = 12
Solution: Since σ is unknown and n < 30, we will use t-test. Thus, we have:
109
Now, it’s your turn to answer the following exercises.
What’s More
Directions: Find the computed z-value of the following. Write your answer
to the nearest thousandths. Show your solutions.
4. 𝑥̅ = 45 000 5. 𝑥̅ = 1.72
𝜇 = 46 100 𝜇 = 1.83
𝜎 = 1 795 𝜎 = 1.05
𝑛 = 50 𝑛 = 36
110
Activity 2: Find My t-Value!
Directions: Compute the t-value of the following. Write your answer to the
nearest thousandths. Show your solutions.
4. 𝑥̅ = 1.83 5. 𝑥̅ = 30. 18
𝜇 = 1. 27 𝜇 = 31. 23
𝑠 = 2.15 𝑠 = 3.15
𝑛 = 10 𝑛 = 23
111
1. 𝑥̅ = 7.7 2. 𝑥̅ = 19.8 3. 𝑥̅ = 12.5
𝜇 = 8.1 𝑠=4 𝑠=3
𝜎=5 𝜇 = 18.3 𝜇 = 10.75
𝑛 = 135 𝑛 = 11 𝑛 = 18
Directions: Determine the test statistic used. Then, find the value of the
following based on the given information.
112
1. 𝐻𝑜 : 𝜇 = 85 𝐻𝑎 : 𝜇 ≠ 85
The sample mean is 83, the sample size is 39, and the standard
deviation is 5. Use 𝛼 = 0.05.
2. 𝐻𝑜 : 𝜇 = 7. 5 𝐻𝑎 : 𝜇 > 7. 5
The sample mean is 8.3 and the sample size is 52. The population
follows a normal distribution with standard deviation 3.17. Use 𝛼 =
0.01.
3. 𝐻𝑜 : 𝜇 = 15 𝐻𝑎 : 𝜇 < 15
The sample mean is 10, the sample standard deviation is 6.1, and the
sample size is 9. Use 𝛼 = 0.05.
5. 𝐻𝑜 : 𝜇 = 215 𝐻𝑎 : 𝜇 ≠ 215
The population is approximately normal. The sample mean is 219.3,
the sample standard deviation is 13.12, and the sample size is 22.
Use 𝛼 = 0.05.
6. 𝐻𝑜 : 𝜇 = 15 𝐻𝑎 : 𝜇 ≠ 15
The population is approximately normal. The sample mean is 15.3,
the sample standard deviation is 2.5, and the sample size is 12. Use 𝛼
= 0.05.
7. 𝐻𝑜 : 𝜇 = 65 𝐻𝑎 : 𝜇 > 65
The sample mean is 63, the sample size is 43, and the standard
deviation is 4. Use 𝛼 = 0.05.
113
8. 𝐻𝑜 : 𝜇 = 25 𝐻𝑎 : 𝜇 < 25
The sample mean is 23.75, the sample standard deviation is 4.5, and
the sample size is 12. Use 𝛼 = 0.05.
Directions: What new realizations did you have about the computation of test
statistic? To answer the question, complete the sentences below.
1. The __________ is used if the sample size is large, 𝑛 ≥ ___, and the
population standard deviation (𝜎) is __________.
2. The formula of the z-test is __________.
3. The __________ is used when 𝑛 < 30, the population is normal or nearly
normal, and sample standard deviation (𝑠) is __________.
4. The formula of the t-test is __________.
5. The formula for degree of freedom is __________.
114
What I Can Do
The Corona!
Directions: Determine the test statistic to be used, then find its computed
vale.
115
Assessment
116
7. What is the sample size?
A. 0. 15 B. 0. 48 C. 0. 5 D. 36
11. How many samples are best when dealing with t-test?
A. cannot be determined C. smaller than 30
B. exactly 30 D. equal or larger than 30
13. Which statistical method can you use when you have a normal
distribution of data?
A. t-test only C. either t-test or z-test
B. z-test only D. neither t-test nor z-test
117
Additional Activities
2. The maximum heart rate of a person at the age of 20 is 200 beats per
minute. Conduct a survey with your neighbors whose age is between 15-
20. Collect a data of 10 samples, then compute its value for 𝛼 = 0. 01?
118
References
Textbooks
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commision on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc.,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Online Resources
Lauer, Stephen A., Kyra H. Grantz, Qifang Bi, Forrest K. Jones, Qulu Zheng,
Hannah R. Meredith, Andrew S. Azman, Nicolas G. Reich, and Justin Lessler.
“The Incubation Period of Coronavirus. Disease 2019 (COVID-19) From
Publicly Reported Confirmed Cases: Estimation and Application,” Annals of
Internal Medicine 172, no. 9 (2020); 577-582. Accessed May 21, 2020
https://www.acpjournals.org/doi/10.7326/M20-0504
119
Statistics and
Probability
Quarter 2 - Module 7:
Drawing Conclusion About
Population Mean Based on
Test Statistic Value and
Critical Region
120
What I Need to Know
In this module, you will learn how interpret the result based on the
computed value of t-test and z-test. Perform each activity independently. If
you find any difficulty in answering the exercises, you may consult your
peers or ask the assistance of your teacher.
121
What I Know
4. If the t-computed value is 1.093 and the critical value is 1.699, what will
be the decision?
A. Reject both hypotheses. C. Do not reject the null hypothesis.
B. Reject the null hypothesis. D. Support the alternative hypothesis.
5. On the given figure below, the t-computed value is 2.130. What
conclusion can be drawn?
122
6. What does it mean when the null hypothesis is rejected?
A. The null hypothesis is incorrect.
B. The alternative hypothesis is correct.
C. There is sufficient evidence to support the null hypothesis.
D. There is sufficient evidence to disprove the null hypothesis.
7. If the z-computed value is 2.505 and the critical value is 2.011, what will
be the decision?
A. Reject both hypotheses. C. Support the null hypothesis.
B. Reject the null hypothesis. D. Do not reject the null hypothesis.
8. From the given figure below, the z-computed value is 1.375. What
conclusion can be drawn?
123
C. rejecting both the null and alternative hypotheses
D. supporting both the null and alternative hypotheses
124
Drawing Conclusion About
Lesson
Population Mean Based on
7 Test Statistic Value and
Critical Region
The final step in testing hypothesis is to interpret the results or draw
conclusions out of the computed value. In this module, you will decide
whether you reject or not the null hypothesis.
What’s In
Fact or Bluff?
Directions: Write FACT if the statement is true and BLUFF if not. Then,
answer the guide questions that follow.
___________ 1. The notation 𝜇 and 𝜎 are sample values.
___________ 2. The alternative hypothesis is a statement that there is no
significance difference between the two given properties.
___________ 3. In the given, 𝐻𝑜 : 𝜇 = 21.5 and 𝐻𝑎 : 𝜇 > 21.5 show a one-tailed
test since it shows direction of the distribution.
___________ 4. The rejection region for a hypothesis test is also called the
critical region.
___________ 5. The two types of significance test are one-tailed and two-
tailed test.
___________ 6. The level of significance refers to the degree of significance in
which we reject or fail to reject the null hypothesis.
___________ 7. We don’t need to set the level of the significance because we
can get 100% accuracy level in testing hypothesis.
___________ 8. A two-tailed test shows that the null hypothesis may be
rejected when the test value is on the critical region on either
side of the distribution.
___________ 9. Hypothesis testing is basically testing an assumption that we
make about the population parameter.
125
__________ 10. In a two-tailed test, the null hypothesis should not be rejected
when the test value is on either of the two critical regions.
Guide Questions:
1. Were you able to answer the questions correctly? If yes, you may proceed
to the next question. If not, you may go back to the previous discussion
so that you can recall the different terms related to hypothesis testing.
3. After obtaining the computed value of your statistic, how will you
interpret the result?
What’s New
Which Is Greater?
Directions: Write the symbols greater than (>), less than (<), or equal to (=)
in the following numbers. Then, answer the questions that follow.
126
Were you able to write the correct symbols? If not, which part was
confusing? Why do you think so?
You must know how to use these symbols in preparation for this
lesson.
What Is It
Level of Significance
Type of Test
𝜶 = 1% 𝜶 = 2.5% 𝜶 = 5% 𝜶 = 10%
127
12 1.782 2.179 2.681 3.055
13 1.771 2.160 2.650 3.012
14 1.761 2.145 2.624 2.977
15 1.753 2.131 2.602 2.947
16 1.746 2.120 2.583 2.921
17 1.740 2.110 2.567 2.898
18 1.734 2.101 2.552 2.878
19 1.729 2.093 2.539 2.861
20 1.725 2.086 2.528 2.845
21 1.721 2.080 2.512 2.831
22 1.717 2.074 2.508 2.819
23 1.714 2.069 2.500 2.807
24 1.711 2.064 2.492 2.797
25 1.708 2.060 2.485 2.787
26 1.706 2.056 2.479 2.779
27 1.703 2.052 2.473 2.771
28 1.701 2.048 2.467 2.763
29 1.699 2.045 2.462 2.756
30 1.697 2.042 2.457 2.750
31 1.695 2.040 2.453 2.744
32 1.694 2.037 2.449 2.738
33 1.692 2.035 2.445 2.733
34 1.691 2.032 2.441 2.728
35 1.690 2.030 2.438 2.724
36 1.688 2.028 2.434 2.719
37 1.687 2.026 2.431 2.715
38 1.686 2.024 2.429 2.712
39 1.685 2.023 2.426 2.708
40 1.684 2.021 2.423 2.704
42 1.682 2.018 2.418 2.698
44 1.680 2.015 2.414 2.692
46 1.679 2.013 2.410 2.687
48 1.677 2.011 2.407 2.682
50 1.676 2.009 2.403 2.678
60 1.671 2.000 2.390 2.660
Infinity 1.645 1.960 2.326 2.576
128
In general, if the absolute value of the computed value is greater
than the absolute value of the critical value, we reject the null hypothesis
and support the alternative hypothesis. But if the absolute value of the
computed value is less than the absolute value of the critical value, we do
not reject or we fail to reject the null hypothesis and the alternative
hypothesis is not supported.
In a right-tailed test, if the computed value is greater than the
critical value, we reject the null hypothesis and support the alternative
hypothesis. But if the computed value is less than the critical value, we do
not reject or we fail to reject the null hypothesis and the alternative
hypothesis is not supported.
In a left-tailed test, if the computed value is less than the critical
value, we reject the null hypothesis and support the alternative
hypothesis. But if the computed value is greater than the critical value, we
do not reject or we fail to reject the null hypothesis and the alternative
hypothesis is not supported.
Rejecting the null hypothesis doesn’t mean that it is incorrect or the
alternative hypothesis is correct. The collected data suggest a sufficient
evidence to disprove the null hypothesis, hence we reject it.
Similarly, a failure to reject the null hypothesis does not mean that it
is true -only that the test did not prove it to be false. There is an insufficient
evidence to disprove the null hypothesis; hence we do not reject it.
Decision: 1.645
The computed z-value is 1.875 which is greater than the critical value of
1.645. Therefore, we reject the null hypothesis and support the alternative
hypothesis.
129
Example 2: Compute for its value given the following information. Use 𝛼 =
0.01. Interpret the result.
𝐻𝑜 : 𝜇 = 127 𝑥̅ = 124.5 𝜇 = 127
𝐻𝑎 :𝜇 < 127 𝑠=5 𝑛 = 12
Rejection Acceptance or
𝑥̅ − 𝜇 −2. 5
𝑡= 𝑡= Region Non-Rejection
𝑠 5 Region
ξ𝑛 3.46
124. 5 − 127 −2.5
𝑡= 𝑡=
5 1.44
ξ12 𝐭 = −𝟏. 𝟕𝟑𝟔
-6 -5 -4 -3 -2 -1 0 1 2 3 4 5
-2.718
Decision:
The computed t-value is greater than the t-critical value at 𝛼 = 0.01 (i.
e.−1.736 > −2.718. Since we have a left-tailed test, our conclusion is that we
fail to reject the null hypothesis.
It is a two-tailed test, since it does not mention about the direction of the
distribution. Since σ is unknown and n < 30, we will use t-test. The degree of
freedom (df = n - 1) is 25 and 𝛼 = 2.5%. Therefore, the t-critical value from
Table 2 is 2.485. Thus, we have:
130
Non-Rejection
𝑥̅ − 𝜇 900 Region Rejection Region
𝑡= 𝑠 𝑡=
1 250
ξ𝑛 5.10
10 900 − 10 000 900
𝑡= 𝑡=
1 250 245. 10
ξ26 𝐭 = 𝟑. 𝟔𝟕𝟏
-5 -4 -3 -2 -1 0 1 2 3 4 5
-2.485 2.485
Decision:
The absolute value of the computed t-value is greater than the absolute of the
critical t-value at 𝛼 = 0.025 (i.e. |3.671|> |2.485|). Therefore, we reject the null
hypothesis.
Conclusion:
We can conclude that there is enough evidence to reject the claim of the
government that P10,000 is the monthly expenses of a Filipino family with
four members.
What’s More
131
Activity 2: Find Me
Directions: Complete the table below. Use Table 1: z-Critical Value and
Table 2: t-Critical Value. The first item is done for you.
Type of Test 𝜶 Sampl Compute Critical Decision
e Size d Value Value
1. one-tailed Reject the null
0.05 n = 17 2.015 1.746
hypothesis.
2. 0.01 n ≥ 30 1.361 ±2.575
3. two-tailed 0.05 n = 27 3.026
4. one-tailed Do not reject
0.1 2.318 2.552 the null
hypothesis.
5. one-tailed n ≥ 30 1.008 ±1.960
132
Activity 4: Interpret Me
Directions: Draw a conclusion from the given information.
1. 𝐻𝑜 : 𝜇 = 80 𝐻𝑎 : 𝜇 ≠ 80
The sample mean is 83, the sample size is 39, and the standard deviation
is 5. Use 𝛼 = 0.05.
2. 𝐻𝑜 : 𝜇 = 7. 5 𝐻𝑎 : 𝜇 > 7. 5
The sample mean is 8.3 and the sample size is 52. The population follows
a normal distribution with standard deviation of 3.17. Use 𝛼 = 0.01.
3. 𝐻𝑜 : 𝜇 = 10 𝐻𝑎 : 𝜇 > 10
The sample mean is 15, the sample standard deviation is 6.1, and the
sample size is 9. Use 𝛼 = 0.05.
5. 𝐻𝑜 : 𝜇 = 215 𝐻𝑎 : 𝜇 ≠ 215
The population is approximately normal. The sample mean is 219.3, the
sample standard deviation is 13.12, and the sample size is 22. Use
𝛼 = 0.05.
133
What I Have Learned
What I Can Do
134
Assessment
135
7. If the z-computed value is 1.253 and the critical value is 1.645, what will
be the decision?
A. Reject the null hypothesis.
B. Do not reject the null hypothesis.
C. Reject both the null and alternative hypotheses.
D. Support both the null and alternative hypotheses.
8. On the given figure below, the z-computed value is 2.431. What
conclusion can be drawn?
A. Reject the null hypothesis.
B. Fail to reject the null hypothesis.
C. Reject both the null and alternative
hypotheses.
D. Support both the null and alternative
hypotheses.
.
9. In a right-tailed test, if the critical value is greater than the computed
value, then we __________________________________.
A. reject the null hypothesis
B. fail to reject the null hypothesis
C. reject both the null and alternative hypotheses
D. support both the null and alternative hypotheses
10. A drink vending machine is adjusted so that, on average, it dispenses
200ml of fruit juice with a standard deviation of 13ml into a plastic cup.
However, the machine tends to go out of adjustment and periodic checks
are made to determine the average amount of fruit juice being dispensed.
The operator thinks that the amount dispensed is less than 200 ml. So to
verify, a sample of 25 drinks is taken to test the adjustment of the
machine and a mean of 195 is obtained. For α = 5%, an appropriate
decision rule would be _________________________________.
A. retain the null hypothesis C. support the null hypothesis
B. reject the null hypothesis D. fail to reject the null hypothesis
136
C. Reject both the null and alternative hypotheses.
D. Support both the null and alternative hypotheses.
For nos. 14-15, refer to the given statement:
Additional Activities
137
References
Textbooks
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De
Mesa. Teaching Guide for Senior High School: Statistics and Probability.
Quezon City: Commision on Higher Education, 2016.
Online Resources
HackMath.net. “Normal Distribution Calculator.” Accessed May 22, 2020
https://www.hackmath.net/en/calculator/normaldistribution?mean=
0&sd=1&area=above&above=1.645&below=&ll=&ul=&outsideLL=&out
sideUL=&draw=Calculate
138
Statistics and
Probability
Quarter 2 – Module 8:
Solving Problems Involving Test
of Hypothesis on Population
Mean
139
What I Need to Know
What I Know
140
4. What should be the decision if the computed z-value lies in the critical
region?
A. Reject the null hypothesis.
B. Reject the alternative hypothesis.
C. Do not reject the null hypothesis.
D. Do not reject the alternative hypothesis.
5. The mean height of women is greater than 64" (inches). Which of the
following represents the null and alternative hypotheses?
A. H0: μ > 64" C. H0: μ < 64"
Hₐ: μ ≠ 64" Hₐ: μ > 64"
B. H0: μ > 64" D. H0: p = 64"
Hₐ: μ ≠ 64" Hₐ: p > 64"
6. What is the last step in the hypothesis testing procedure?
A. Draw conclusion.
B. Choose the level of significance.
C. State the null and alternative hypotheses.
D. Determine the test statistic and compute it.
7. A one sample t-test is conducted on Ho: μ = 81.6. The sample has a
sample mean = 84.1, s = 3.1, n = 25, and α = .01. State your null and
alternative hypotheses.
A. H0: μ = 81.6 C. H0: μ < 81.6
Hₐ: μ ≠ 81.6 Hₐ: μ > 81.6
B. H0: μ = 81.6 D. H0: p = 64"
Hₐ: μ < 81.6 Hₐ: p > 81.6
8. Perform a hypothesis test on the null hypothesis where μ = 6.9. A
random sample of 25 items is selected. The sample mean is 7.1 and the
sample standard deviation is 2.4. It can be assumed that the population
is normally distributed at α = .01.
A. There is enough evidence to reject the claim.
B. There is enough evidence to support the claim.
C. There is not enough evidence to reject the claim.
D. There is not enough evidence to support the claim.
9. In a right-tailed test, what will you do if the critical value is greater than
the computed value?
A. Reject the null hypothesis.
B. Reject the alternative hypothesis.
C. Do not reject the null hypothesis.
D. Fail to reject the alternative hypothesis.
141
10. When the null hypothesis is rejected, which of the following statements is
true?
A. The null hypothesis is incorrect.
B. The alternative hypothesis is true.
C. There is enough evidence against the null hypothesis.
D. There is a very small probability that the given null hypothesis is true.
11. What does it mean when we failed to reject the null hypothesis?
A. The conclusion is not significant.
B. The null hypothesis is definitely correct.
C. There is enough evidence to back up the null hypothesis.
D. There is insufficient evidence to disagree with the null hypothesis.
12. If the t-computed value is 1. 093 and the critical value is 1.699, what will
be the decision?
A. Reject the null hypothesis.
B. Support the null hypothesis.
C. Do not reject the null hypothesis.
D. Support the alternative hypothesis.
13. What is the first step in the hypothesis testing procedure?
A. Draw conclusion.
B. Choose the level of significance.
C. State the null and alternative hypotheses.
D. Determine the test statistic and compute it.
14. What will you do if the computed value is greater than the critical value?
A. Reject the null hypothesis.
B. Support the null hypothesis.
C. Do not reject the null hypothesis.
D. Support the alternative hypothesis.
15. If the computed z-value is 1.130 and the critical value is 1.96, what
conclusion can be drawn?
A. Fail to reject the null hypothesis.
B. Reject both the null and alternative hypotheses.
C. Reject the null hypothesis in favor of the alternative hypothesis.
D. Fail to reject both the null hypothesis and alternative hypothesis.
How did you find this pre-test? Did you encounter both familiar and
unfamiliar terms? Kindly compare your answer in the Answer Key on the
last part of this module.
If you got a perfect score or 100%, skip this module and proceed to
the next one. But if you missed even a single point, please continue with
this module as it will enrich your knowledge in hypothesis testing.
142
Lesson Solving Problems Involving
What’s In
A H Y P O T H E S I S N A V Q N T
L C W A A O A N S A I D Q A U O Y
T A N R D N S U Z D R E A R K I P
E R T A G U V P T T H I R I G R E
R S A M P L E M E A N S S A T Y I
N I R E Q L U S S I O N E N I N E
A G T T E S T I T L S W A C T W R
T N S E P A J K K W L E T E K A R
I I O R L S K L O Y O R O Q S F O
V F O N E T A I L E D T E S T G R
E I R C E I L F Y G Q U A X P H S
N C C R I T I C A L R E G I O N T
S A M P L E S I Z E I L I Z L U Y
I N A U D L R W O E L P Q P S E U
L C P O P U L A T I O N M E A N D
W E L E V E L C S E N E R X Y L J
143
Since you already know the different terms related to hypothesis
testing, you are now ready to solve problems.
In decision making, what are the factors that you need to consider?
Do you think of the consequences of your actions?
Statistics can help us in making decisions. Included in the process is
forming reliable conclusions and the decision making starts with the testing
of the hypothesis. Let us enhance your decision-making skills by answering
the next activity.
What’s New
Every day, we are faced with all sorts of decisions. Sometimes the
decisions are small, like what to wear or what to eat. But sometimes the
decisions are bigger, like what course you are going to take up or which
university you are going to enrol in college. The test of hypothesis will aid
you in the decision-making process so you can make the right choices for
better results.
What Is It
144
In general, if the absolute value of the computed value is greater than
the absolute value of the critical value, we reject the null hypothesis and
support the alternative hypothesis. But if the absolute value of the
computed value is less than the absolute value of the critical value, we fail
to reject the null hypothesis and the alternative hypothesis is not
supported.
𝑥̅ −𝜇
𝑧= 𝜎
√𝑛
165−155
𝑧= 52
√50
10
𝑧= 7.35
𝐳 = 𝟏. 𝟑𝟔𝟏
145
Step 3: Find the critical value and draw the critical region. Use the z-critical
value table.
The alternative hypothesis is directional. Hence, the one-tailed test
(right-tailed test) shall be used. From the z-value table at 0.05 level of
significance, the critical value is 1.645.
Non-Rejection
Region
Rejection Region
1.361 1.645
𝑥̅ −𝜇
𝑡= 𝑠
√𝑛
135−120
t= 38
√25
15
𝑡= 7.6
𝒕 = 𝟏. 𝟗𝟕𝟒
146
Step 3: Find the critical value and draw the critical region.
The alternative hypothesis is non-directional. Hence, the two-tailed
test shall be used. From the t-value table at 0.10 level of significance, the
critical value is ±1.711.
Rejection Region Non-Rejection Rejection Region
Region
- 1.711 1.711
𝑥̅ −𝜇
𝑧= 𝜎
√𝑛
95−99
𝑧= 15
√40
−4
𝑧= 2.37
𝐳 = −𝟏. 𝟔𝟖𝟖
147
Step 3: Find the critical value and draw the critical region. Use the z-critical
value table. The alternative hypothesis is directional. Hence, the one-tailed
test (left-tailed test) shall be used. From the z-value table at 0.05 level of
significance, the critical value is -1.645.
Non-Rejection
Region
Rejection Region
-1.645
What’s More
𝑡= 9
√ 20
148
𝑡=
𝑡 = ____
Step 3: ____________________________________________________
From the t-value table at 0.05 level of significance, the critical value is
_______________.
Acceptance Region
or Non-Rejection Rejection Region
Region
-2.093 2.093
149
that the average amount spent per day by a Filipino household has
increased? Assume normality over the population.
150
What I Can Do
Assessment
151
2. If the t-computed value is 2.430 and the critical value is 2.011, what will
be the decision?
A. Reject the null hypothesis. C. Reject the alternative hypothesis.
B. Support the null hypothesis. D. Do not reject the null hypothesis.
3. What is the third step in the hypothesis testing procedure?
A. Draw conclusion.
B. State the null and alternative hypotheses.
C. Determine the test statistic and compute it.
D. Find the critical value for the test; then draw the critical region.
4. In a left-tailed test, what will you do if the critical value is less than the
computed value?
A. Reject the null hypothesis.
B. Do not reject the null hypothesis.
C. Reject the alternative hypothesis.
D. Do not reject the alternative hypothesis.
5. The t-computed value is 1.875 and the critical value is 2.080. What
conclusion can be drawn?
A. Reject the null hypothesis.
B. Fail to reject the null hypothesis.
C. Reject the alternative hypothesis.
D. Fail to reject the alternative hypothesis.
6. What does it mean if a result is said to be significant at 1% level?
A. The null hypothesis is 99% true.
B. The null hypothesis is 99% wrong.
C. We fail to reject the false null hypothesis 1% of the time.
D. There is a 1% probability that a true null hypothesis is rejected.
7. It is a value that separates the acceptance region from the rejection region
in a normal curve when testing the hypothesis?
A. t-value C. critical value
B. z-value D. computed value
8. What should you do if the computed z-value lies in the critical region?
A. Reject the null hypothesis.
B. Reject the alternative hypothesis.
C. Do not reject the null hypothesis.
D. Do not reject the alternative hypothesis.
9. The mean height of women is less than 64" (inches). Which of the following
represents the null and alternative hypotheses?
A. H0: μ > 64" C. H0: μ < 64"
Hₐ: μ < 64" Hₐ: μ ≠ 64"
152
B. H0: μ = 64" D. H0: p = 64"
Hₐ: μ ≠ 64" Hₐ: p > 64"
10. In the hypothesis testing procedure, drawing conclusion should always be
the __________ step.
A. first B. second C. third D. last
11. A one sample t-test is conducted on Ho: μ = 81.6. The sample has a mean
of 84.1, s = 3.1, n = 25, and α = .01. What conclusion can be drawn?
A. Reject Ho. C. Fail to reject Ho.
B. Reject Ha. D. Fail to reject Ha.
12. Perform a hypothesis test where the null hypothesis is that the μ = 6.9. A
random sample of 16 items is selected. The sample mean is 7.1 and the
sample standard deviation is 2.4. It can be assumed that the population is
normally distributed at α = 0.05.
A. There is enough evidence to reject the claim.
B. There is enough evidence to support the claim.
C. There is not enough evidence to reject the claim.
D. There is not enough evidence to support the claim.
13. If the computed t-value is 2.130 while the critical value is 2.086, what
conclusion can be drawn?
A. Reject both the null and alternative hypotheses.
B. Fail to reject the null and alternative hypotheses.
C. Reject the null hypothesis in favor of the alternative hypothesis.
D. Fail to reject the null and the alternative hypothesis is not supported.
14. After formulating the hypotheses, what is the next step in the hypothesis
testing procedure?
A. Draw conclusion.
B. Choose the level of significance.
C. Determine the test statistic and compute it.
D. Find the critical value and draw the critical region.
15. Find the critical value(s) for a two-tailed test with α = 0.05.
A. z = -1.65 B. z = ±0.06 C. z = 1.65 D. z = ±1.96
153
Additional Activities
In this activity, complete the1-4-3 chart by writing down what are being
asked.
1 – 4 – 3 LIST
One (1) thing I really love about this topic:
1.
Four (4) important reasons why I love this topic:
1.
2.
3.
4.
Three (3) things I still need to understand about this topic:
1.
2.
3.
154
References
Textbooks
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De
Mesa. Teaching Guide for Senior High School: Statistics and Probability.
Quezon City: Commision on Higher Education, 2016.
Online Resources
Encourage Play. “A Simple and Fun Game in Making Decisions.” Accessed
May 22, 2020 https://www.encourageplay.com/blog/a-simple-and-fun-
game-to-practice-making-decisions
HackMath.net. “Calculators.” Accessed May 22, 2020
https://www.hackmath.net/en/calculator/normaldistribution?mean=0
&sd=1&area=above&above=1.645&below=&ll=&ul=&outsideLL=&outside
UL=&draw=Calculate
Lenhart, Kira. “Country Infographic Rubric.” Accessed June 17, 2020
https://i.pinimg.com/originals/0b/04/12/0b041201f9fcc8c8ef4decc42
8765529.png
155
Statistics and
Probability
Quarter 2 – Module 9:
Formulating Appropriate
Null and Alternative Hypotheses
on a Population Proportion
156
What I Need to Know
In the previous lessons, you have studied how to formulate appropriate null
and alternative hypotheses concerning population means. Also, you’ve
learned how to draw correct conclusions after solving given problems based
on the test statistic and the rejection region.
In this module, you will have a short recall about population proportions
and all other related concepts with their equivalent symbols like test
statistic, rejection region, p-value, level of significance, etc.
After going through this module, you are expected to:
What I Know
1. A certain telephone company with a target of 5,000 items found that 300
out of the 500 items they randomly chose and tested failed to meet the
quality control guidelines. The company would like to test if more than
25% of the target might be out of assurance. What is the appropriate
alternative hypothesis at α = 0.05 level?
157
out that 35 of them smoke. Test if the claim has decreased at α = 0.01
level. Formulate the correct alternative hypothesis.
A. Ho: p ≠ 0.20 C. Ha: p = 0.20
B. Ho: p < 0.20 D. Ha: p < 0.20
4. In a public junior high school, a study found out that 40% of Grade 7
students are overweight. Is this lower for grade level age if a sample of
100 students was randomly chosen at 0.05 level of significance? What is
the appropriate alternative hypothesis?
A. The proportion of ships arriving into the port this year with
contaminated cargo is equal to 0.15.
B. The proportion of ships arriving into the port this year with
contaminated cargo is less than 0.15.
C. The proportion of ships arriving into the port this year with
contaminated cargo is not equal to 0.15.
D. The proportion of ships arriving into the port this year with
contaminated cargo is greater than 0.15.
6. Which of the following words suggests a right-tailed test?
A. smaller C. increased
B. different D. unequal
158
B. Ha: p > 0.21 D. Ha: p < 0.20
159
15. Which of the following null and alternative hypothesis are correctly
written in symbols?
A. Ho: p = .10 C. Ho: p < .30
Ha: p < .20 Ha: p = .30
How did you find this pre-test? Did you encounter both familiar and
unfamiliar terms? Kindly compare your answers in the Answer Key on the
last part of this module.
If you got a perfect score or 100%, skip this module and proceed to
the next one. But if you missed even a single point, please continue with
this module as it will enrich your knowledge in formulating hypotheses on
the population proportion.
160
What’s In
1. Alternative Hypothesis 𝑛
2. Sample Proportion 𝛼
3. Null Hypothesis 𝑝̂
4. Population Proportion 𝐻𝑎
5. Sample Size 𝑞
6. Value of 1 − 𝑝 𝐻0
7. Level of Significance 𝑝
2. Were you able to recall the different necessary symbols used in testing
hypothesis correctly?
161
What’s New
What Is It
Once you already know that you are dealing with a population
proportion, you can conduct the hypothesis test. You can start with the first
step of a hypothesis test which is to determine the hypotheses. In order to
formulate null and alternative hypotheses concerning population
proportions, you can write them in sentence form or you can use different
symbols. Here, you will use the symbol p for the population proportion.
162
Remember that the hypotheses are claims about the population
proportion, p. The null hypothesis states that the proportion is equal to a
specific value or the hypothesized proportion, po. On the other hand, the
alternative hypothesis is the competing claim that the population proportion
is less than, greater than, or not equal to po.
As a reminder, the null hypothesis is always a statement of equality.
The alternative hypothesis is always a statement of inequality, using the
symbols <, >, or ≠. Moreover, the hypotheses are stated in such a way that
they are mutually exclusive. That is, if one is true, the other must be false;
and vice versa.
If you are going to write the null hypothesis in sentence form, you
will usually use “is” or “is equal to”. In symbols, you are going to use:
HO : p = po
When testing for population proportions, there are three (3) possible
alternative hypotheses. They are based on the wording of the question
instructing you what to hypothesize. (See illustrative examples below.)
163
One-Tailed Two-Tailed
Alternative hypothesis contains Alternative contains the
the greater than (>) or less than inequality (≠) symbol.
(<) symbols
It is directional (either right-tailed It has no direction.
or left-tailed)
The next table below shows the null and alternative hypotheses stated
together with the types of hypothesis tests.
Illustrative Examples:
Example 1. It has been claimed that 40% of students in a particular senior
high school dislike Mathematics. When a survey was conducted by a
researcher, it showed that 145 of 800 students dislike Mathematics. Test if
the claim was different at α = 0.05 level.
Null Hypothesis (Ho):
In this example, the hypothesized proportion is 40% or 0.40. Hence,
the null hypothesis will be,
The proportion of students who dislike Mathematics is 40%.
In symbols, you can write,
Ho: p = 0.40
Alternative Hypothesis (Ha):
Our cue word here is “different” which means “not the same” or “not
equal”. Therefore the alternative hypothesis is,
The proportion of students who dislike Mathematics is not equal
to 40%.
In symbols, you can write,
Ha: p ≠ 0.40
164
Example 2. A certain senior high school plans to open STEM (Science and
Technology, Engineering, and Mathematics) as an academic track only if
60% of the students in their junior high school will enrol on the following
academic year. A survey conducted among a random sample of students
revealed that 450 out of 1000 students will enrol. Is the expected enrolment
significantly lower than the desired enrolment? Test at α = 0.05 level.
165
Alternative Hypothesis (Ha):
The word “larger” is synonymous to “greater” hence your alternative
hypothesis will be,
The proportion of qualified applicants in a particular job
interview was larger than 40%.
Or in symbols
Ha: p > 0.40
What’s More
166
Ho : p = 0.35
Ha : p > 0.35
_____, __________3. A research found out that 5% of the senior high school
students in a certain school are working students. A researcher randomly
selected 35 out of 300 students who are working. Is there a percentage
increase in the number of senior high school students who are working? Use
α = 0.01 level.
Ho : p = 0.05
Ha : p ≠ 0.05
_____, __________4. Before the national elections, 75% of the voters in a
certain town said that they preferred older senatorial candidates running for
senatorial positions than younger candidates. After a certain survey was
conducted, 910 out of 1,300 randomly selected voters preferred older
senatorial candidates. Does this claim indicate that voters in that town have
a greater interest in older candidates than in younger ones? Use α = 0.05.
Ho : p = 0.75
Ha : p > 0.75
_____, __________5. A researcher claimed that 55% of elementary students
would rather play than read books during break time. Another researcher
was assigned to verify the claim. He randomly selected 300 students. Two
hundred seventy-four (274) of them said they would rather play during
break time than read books.
At 0.10 level, is there enough evidence to conclude that the percentage
of elementary students who would rather play than read books has changed
to 55%?
Ho : p = 0.55
Ha : p < 0.55
ii. Ho _______________________________________________________________________________________________
Ha _______________________________________________________________________________________________
167
iii. Ho _______________________________________________________________________________________________
Ha _______________________________________________________________________________________________
iv. Ho _______________________________________________________________________________________________
Ha _______________________________________________________________________________________________
v. Ho _______________________________________________________________________________________________
Ha _______________________________________________________________________________________________
2. A drug manufacturer claims that fewer than 10% of patients who take
its new drug for treating certain pneumonia will experience nausea.
In a random sample of 250 patients, 23 experienced nausea. Perform
a significance test at the 5% significance level to test this claim.
168
What I Have Learned
What I Can Do
13
169
Assessment
170
A. Ha : p > .38 C. Ha : p < .38
B. Ha : p = .38 D. Ha : p ≠ .38
10. In a certain senior high school, a study found that 972 out that 1,100
Grade 12 students use smartphones. Using α = 0.10 level, is there
enough evidence to conclude that the percentage of students who are
using smartphones is different from 25%? What is the appropriate
alternative hypothesis?
A. Ha: p > .25 C. Ha: p = .25
B. Ha: p < .25 D. Ha: p ≠ .25
12. The president of a certain food chain claims that 70% of his 20,000
customers are very satisfied with the service they receive. In order to
test the claim, a survey was conducted among 100 customers randomly
chosen. Among them, 90% said that they are satisfied. Is there enough
171
evidence to say that 70% of the customers are satisfied at 0.05 level of
significance? Formulate the null hypothesis.
A. Ho: p ≠ 0.70 C. Ho: p < 0.70
B. Ho: p = 0.70 D. Ho: p > 0.70
Additional Activities
172
References
Textbooks
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De
Mesa. Teaching Guide for Senior High School: Statistics and Probability.
Quezon City: Commission on Higher Education, 2016.
Online Resources
Minitab.com. “About the Null and Alternative Hypotheses.” Accessed
February 4, 2019. https://support.minitab.com/en-us/minitab/
18/help-and-how-to/statistics/basic-statistics/supporting-topics/
basics/null-and-alternative-hypotheses/
Minitab.com. “What Are Type I and Type II Errors?” Accessed February 4,
2019. https://support.minitab.com/en-us/minitab/18/help-and-
how-to/statistics/basic-statistics/supporting-topics/basics/type-i-
and-type-ii-error/
Zaiontz, Charles. “Null and Alternative Hypothesis.” Accessed February 2,
2018.http://www.real-statistics.com/hypothesis-testing/null-
hypothesis/
173
Statistics and
Probability
Quarter 2 – Module 10:
174
What I Need to Know
As you may recall, the Central Limit Theorem tells that if the sample
size is sufficiently large, then the mean of the random sample from a
population has a sampling distribution that is approximately normal, even
when the original population is not normally distributed. This means that
regardless of the shape of the original distribution, the sampling distribution
of the mean approaches a normal distribution as long as the sample is large
enough. Remember that the Central Limit Theorem is not limited to sample
means only. It can also be applied to sample proportions.
This module deals on identifying the appropriate form of test statistics
involving population proportion when the Central Limit Theorem is to be
used. However, the activities are limited to estimating the population
proportion and sample proportion as preparation in solving for the
appropriate test statistics.
After going through this module, you are expected to:
1. define population proportion and sample proportion;
2. determine the value of the population proportion and sample
proportion;
3. identify the appropriate form of the test statistic when the Central
Limit Theorem is to be used; and
4. relate population proportion in real-life situations.
175
What I Know
176
A. 8 customers C. all customers
B. 20 customers D. all customers who do not order dessert
9. Which assumption/s must be considered in testing hypothesis involving
proportion?
I. The conditions for binomial experiment are met.
II. The expression np>5 and nq>5 are both satisfied.
III.The sample size must be greater than or equal to 30.
A. I and II B. I and III C. II and III D. III only
14. In problem no. 13, what is the value of the sample proportion, 𝑝̂ ?
A. 0.60 B. 0.67 C. 0.70 D. 0.75
15. In a learning study, 1,200 respondents were asked if they can assimilate
concepts while watching television wherein 586 said YES. What is the
proportion of those who said yes?
A. 0.40 B. 0.49 C. 0.51 D. 0.58
How did you find this pre-test? Did you encounter both familiar and
unfamiliar terms? Kindly compare your answers in the Answer Key on the
last part of this module.
If you got a perfect score or 100%, skip this module and proceed to
the next one. But if you missed even a single point, please continue with
this module as it will enrich your knowledge in hypothesis testing involving
population proportion.
177
Lesson Identifying Appropriate Test
What’s In
178
Activity 1: Remind Me Please…
1 2 3 4
A. C.
Ratio Mean
Fraction Average
Percentage Calculation
B. D.
Part Rate
.
Section Percentage
Calculation Measurement
179
Notes to the Teachers
It is encouraged that the learners be asked to provide a
separate activity notebook where they will write their answers to
all assessments and activities in Modules 9-14 in which topics
are all interrelated. Through this, learners’ progress can easily be
monitored and parts of the lesson where intervention is needed
can be identified.
What’s New
Now to start this lesson, accomplish the activity below. Do not forget
to keep your answers because we will be using them in our discussions.
180
Column A Column B
1. What is the survey 500 senior citizen
all about? residents
What Is It
Dealing with various problems or situations oftentimes leads to
confusion. In this section, take note that problems involving proportions,
unlike in population mean and sample mean, never use terms such as
“average” and “mean” but “percentage” instead. Let us first define what
population proportion is.
Population Proportion and Sample Proportion
Population proportion (p) is a part of the population with a particular
attribute or trait expressed as a fraction, decimal, or percentage of the whole
population. In symbol:
181
𝐧𝐮𝐦𝐛𝐞𝐫 𝐨𝐟 𝐦𝐞𝐦𝐛𝐞𝐫𝐬 𝐢𝐧 𝐭𝐡𝐞 𝐩𝐨𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧 𝐰𝐢𝐭𝐡 𝐚 𝐩𝐚𝐫𝐭𝐢𝐜𝐮𝐥𝐚𝐫 𝐚𝐭𝐭𝐫𝐢𝐛𝐮𝐭𝐞
p=
𝐧𝐮𝐦𝐛𝐞𝐫 𝐨𝐟 𝐦𝐞𝐦𝐛𝐞𝐫𝐬 𝐢𝐧 𝐭𝐡𝐞 𝐩𝐨𝐩𝐮𝐥𝐚𝐭𝐢𝐨𝐧
p= ____ %
To change percent to
decimal, see examples
below:
1. 12% = 0.12
2. 5% = 0.05
3. 12.5% = 0.125
On the other hand, there are cases where we still need to calculate 𝒑
̂.
Examples of these kinds are:
- “70 out of 200 residents are married.”
- “150 out of 500 listeners are interviewed.”
- “10 out of 1000 bulbs are defective.”
In this case, we need to solve for the value of the sample proportion
̂ (read as “p hat”).
𝒑
Sample proportion (𝒑 ̂ ) is the ratio of the number of elements in the
sample possessing the characteristics of interest over the number of
elements in the sample or n. It is computed by the formula:
182
where: ̂ is the proportion of the number of successes in n samples
𝒑
and read as “p hat”.
x represents the number of “successes” in n samples; and
n represents the size of the sample.
The example below will help you understand better how we can easily
estimate the value of the sample proportion.
Remember that in a situation
describing a population
proportion/sample proportion, the
words “mean” or “average” are not
used.
Illustrative Example:
For a class project, a Grade 12 STEM student wants to estimate the
percentage of students in his school who are registered voters. From 45%
Grade 12 students, he surveys 500 students and finds that 200 are
registered voters. Determine the value of p and compute for the sample
proportion.
Solution:
The population proportion is the rate or percent used from the entire
Grade 12 students. Therefore:
Population Proportion, p = 45% = 0. 45
To find the sample proportion ( 𝒑̂ ), identify the ff:
Surveyed Grade 12 students = n = 500
Registered Grade 12 students = x = 200
183
2. The conditions np ≥ 5 and nq ≥ 5 are both satisfied so that the
binomial distribution of sample proportion can be approximated by a
normal distribution with 𝜇 = 𝑛𝑝 and 𝜎 = √𝑛𝑝𝑞. (However, the specific
number varies from source to source, some authors use 10 instead of 5
depending on how good an approximation one wants.)
Likewise, the second assumption served as the basis to determine
whether the sample size from the population proportion is sufficiently large
or not. Remember that this time, the condition that sample be large is not n
to be at “least 30” but it should satisfy the second assumption. For a large
size of sample proportions, the Central Limit Theorem (CLT) can be used.
Bear in mind that if the sample size is sufficiently large, then the mean of
the random sample from a population has a sampling distribution that is
approximately normal, even when the original distribution is normally
distributed and n ≥ 30.
Now, let us check the assumptions from the previous situation:
1. It is evident that the responses have only two outcomes: “registered
voter” (success) or “not registered voter” (failure). Therefore, the first
assumption is met.
2. To be able to satisfy the second condition, we find the hypothesized
value of the population proportion p = 0.45 while n = 500. To get q, q
= 1 – p which makes q = 1 – 0.45 = 0.55.
184
Therefore, the formula for the value of z-test statistic for population
proportion would be:
𝑝̂−𝑝 𝑝̂−𝑝
Zcom = or Zcom =
𝑝𝑞 𝑝 ( 1−𝑝 )
√𝑛 √
𝑛
where:
zcom is the z-test statistic for proportion.
𝑥
𝑝̂ is the sample proportion ( 𝑛 ).
p is the hypothesized value of the population proportion.
n is the sample size or the number of observations in the
sample.
q is equal to 1 – p.
Remember this formula because you are going to use this in Module
12 where the actual computation for the test statistic involving population
proportion will be held.
What’s More
Activity 3: I Can
Directions: In each item, complete the set of solutions.
1. The iCare Center for Internet & Society at Kaliwanagan Province
recently conducted a study analyzing the privacy management habits
of 80% teen internet users. In a group of 50 teens, 13 are reported to
have more than 500 friends on Facebook. Determine the value of p
and sample proportion 𝒑 ̂.
Solution:
𝒙
p = ___ % ̂=
𝒑
𝒏
= ____ ̂ = _____
𝒑
̂ = ______
𝒑
185
are against the new legislation. Determine the value of p and the
sample proportion 𝒑
̂.
Solution:
𝒙
p = ___ % ̂=
𝒑
𝒏
= ____ ̂ = _____
𝒑
̂ = ______
𝒑
4. A poll taken prior to election day finds that 45% registered voters
intend to vote for Mayumi Caliwanagan as barangay chairperson of
Brgy. Kapatagan. A concerned citizen surveyed that 380 out of 700
registered voters favored for Mayumi. Determine the value of p and
the sample proportion 𝒑̂.
Solution:
𝒙
p = ___ % ̂=
𝒑
𝒏
= ____ ̂ = _____
𝒑
̂ = ______
𝒑
5. A survey to the pet owners in Green Village is taken and 40% of those
surveyed say they have dogs as their pet for protection for self/family.
A group of 180 pet owners are interviewed and 100 said that they
have dogs for protection of self/family. Determine the value of p and
the sample proportion 𝒑̂.
Solution:
𝒙
p = ___ % ̂=
𝒑
𝒏
= ____ ̂ = _____
𝒑
̂ = ______
𝒑
186
What I Have Learned
What I Can Do
187
Activity 4.2. Puzzle
Direction. Identify the statements describe below. Write your answer in the
puzzle box. Copy the box.
POPROP Puzzle
Across: Down:
1. Sample _______ 1. _________ Proportion
3. Symbolized as q
4. Opposite of ‘Failure’ 2. An experiment with 2
6. Central _______Theorem outcomes only
5. Same as percentage
7. It is symbolized as n
8. Test statistic for population
proportion
9. The symbol 𝑝̂ is read as ___
3 5
188
2. It is believed that in the coming election, 65% of the voters in the
Province of Kaunlaran will vote for the administrative candidate for
governor. Out of 1,170 randomly selected voters, 640 indicated that
they would vote for the administrative candidate.
10. A certified public accountant (CPA) claims that more than 25% of all
accountants advertise. A sample of 112 accountants in Metro Manila
showed that 40 use some form of advertising.
189
Assessment
190
9. In problem no. 8, what is the value of 𝑝̂ ?
A. 0.28 B. 0.35 C. 3.57 D. 60.0
10. In a certain senior high school, it is estimated that approximately 15% of
the students ride bicycles in going to school. In a random sample of 90
senior high students, 19 are found to ride bicycles in coming to class.
What is the value of the population proportion?
A. 0.15 B. 0.21 C. 0.29 D. 4.74
191
Additional Activities
192
References
Textbooks
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon
City: Commision on Higher Education, 2016.
Arciaga, Ronald L., and Dan Andrew H. Magcuyao. Statistics and Probability. Pasay
City: JFS Publishing Services, 2016.
Caraan, Avelino Jr S. Introduction to Statistics & Probability: Modular Approach.
Mandaluyong City: Jose Rizal University Press, 2011.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc.,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Stephens, Larry J. Schaum’s Outline Of Theory And Problems Of Beginning Statistics.
McGraw-Hill Companies, Inc.,1998.
Online Resources
Bluman, Allan. “Elementary Statistics A Step by Step Approach.” Accessed May 24,
2020https://www.academia.edu/35770135/_Allan_Bluman_Elementary_St
atistics_A_Step_By_St_BookFi.org_1_
193
Statistics and
Probability
Quarter 2 – Module 11:
Identifying Appropriate
Rejection Region Involving
Population Proportion
194
What I Need to Know
Recall that the normal curve evolves from the probability distribution.
With the area under the curve being 1, it serves as a mathematical model in
hypothesis testing. The areas are the probability value that we will need in
decision-making on whether to accept or reject the hypothesis.
This module will help you identify the appropriate rejection region for a
given level of significance when the Central Limit Theorem is to be used.
What I Know
195
5. It is the range of the values of the test value indicating that there is
significant difference and that the null hypothesis (Ho) should be rejected.
A. critical value C. level of significance
B. rejection region D. non-rejection region
6. It is the basis for the critical or rejection region dictated by the alternative
hypothesis.
A. critical value C. acceptance region
B. rejection region D. level of significance
8. The z-score value in the critical region means that you should _______.
A. reject the null hypothesis
B. not reject the null hypothesis
C. reject the alternative hypothesis
D. not reject the alternative hypothesis
12. What is the critical value(s) for a two-tailed test with α = 0.05?
A. z = -1.64 B. z = ±0.06 C. z =1.64 D. z = ± 1.96
For nos. 13-15, refer to the situation below.
During the previous year, 55% of the people believed that there was
an improvement in the country’s economy. This year, only 280 out of 500
randomly selected people believe that there is an improvement in the
country’s economy. Using 0.05 level of significance, answer the following
questions to determine if this indicates a decrease in the number of people
who believe that there is an improvement in the country’s economy.
196
15. Which of the following shows the appropriate rejection region?
A. C.
-2.325 -1.285
B. D.
-1.645 -1.960
How did you find this pre-test? Did you encounter both familiar and
unfamiliar terms? Kindly compare your answers in the Answer Key on the last
part of this module.
If you got a perfect score or 100%, skip this module and proceed to the
next one. But if you missed even a single point, please continue with this
module as it will enrich your knowledge in hypothesis testing involving
population proportion.
197
Lesson Identifying Appropriate
11 Rejection Region Involving
Population Proportion
One part in testing hypothesis is determining if the results of a theory or
the hypothesis from the experiment is probably true or statistically significant.
To be able to do this, the rejection region or critical region will be employed.
Every rejection region can be drawn on a probability distribution. Its image
can be done using the normal curve. It can either be one-tailed or two-tailed
rejection region. More specifically, one-tailed rejection region can be left-tailed
or right-tailed. Now, how can we determine the rejection regions? Let us find
out!
What’s In
Directions: The following are the common terms or phrases used to describe
whether the alternative hypothesis (Ha) is directional or non-directional such
as right-tailed, left-tailed, or two-tailed. Copy the table and write each under
the group where it should belong.
198
Left –Tailed Two – Tailed Right – Tailed
1
2
3
4
5
6
7
8
9
10
199
What’s New
3. The hypothesis that more than 45% of the barangay population is male
residents
_________ ; __________
4. The claim that less than 35% of the students are fluent Filipino language
speakers
_________ ; __________
200
What Is It
There are two ways to test the hypothesis: with a p-value approach and
with a critical value approach. Here, we will consider the rejection region with
the critical value approach. The critical value enables us to reject or not the
null hypothesis. Also, it is calculated through alpha ( α ) levels and symbolized
by Z or Ztab.
This is the first statement in Activity 2: “The hypothesis that less than
20% of the population are right-handed” wherein Ha: p < 0.20 and it indicates a
left-tailed rejection region. Illustrating it in the normal curve, we will come up
with the picture below:
Rejection
Region Non-Rejection
(α) Region This is the
critical value.
Ztab
The illustration above is for you to visualize how the statement would
look like when put into the normal curve. Notice that the line represented by
ztab separates the curve into two regions. The shaded part is the rejection
region while the non-shaded part is the non-rejection region or the acceptance
region/area. Therefore, it is important that we determine the value of z tab or
the critical value. Now, let us proceed!
Let us now describe the following important terms that we will be
needing in our discussion.
201
Level of Significance
Test Type
𝛼 = 0.01 𝛼 = 0.025 𝛼 = 0.05 𝛼 = 0.10
left-tailed test −2.33 −1.96 −1.645 −1.28
right-tailed test 2.33 1.96 1.645 1.28
two-tailed test ±2.575 ±2.33 ±1.96 ±1.645
Rejection Region
- the range of the values of the test value which indicates that there is a
significant difference and that the null hypothesis (Ho) should be
rejected
Non-Rejection Region
- the range of the values of the test value which indicates that the
difference was statistically insignificant and that we failed to reject the
null hypothesis (Ho)
Illustrative Example1:
A sample of 100 students is randomly selected from Pinagpala High
School and 18 of them said they are left-handed. Test the hypothesis that less
than 20% of the students are left-handed by using 𝛼 = 0.05 as the level of
significance.
What to do:
a. Identify the level of significance.
b. Formulate the alternative hypothesis, Ha.
c. Determine the critical value, ztab.
d. Illustrate the rejection region in the normal curve.
202
Solution:
a. The level of significance is 𝛼 = 0.05.
b. The alternative hypothesis is Ha: p < 0.20.
It is one directional or left-tailed as determined by the term “less than”.
c. To determine the critical value using the table, we consider the
intersection of the row for the left-tailed test and the column for = 0.05.
Hence, the table tells us that the critical value is – 1.645.
d. Illustrating it under the normal curve makes:
Rejection
Region
.
𝛼 = 0.05 Non-rejection
Region
-3 -2 -1.645 0
-1.645 1 2 3
From here, you will decide whether the null hypothesis will be
rejected or not, although that Region
part will be discussed in the next module.
Illustrative Example 2:
The claim is made that 40% of tax filers use computer software to file
their taxes. In a sample of 50 tax filers, 14 used computer software to file their
taxes. If Ha: p < 0.40 at α = 0.025 where p is the population proportion who
use computer software to file their taxes. Determine the critical value, Ztab
and illustrate the rejection region in the normal curve.
Solution:
Rejection
Region α = 0.025
Non-rejection
Region
Ztab = - 1.96
203
Illustrative Example 3:
In Kalinga Special Education School, a sample of 144 students was
chosen and among them, 48 are diagnosed with Attention Deficit Hyperactivity
Disorder (ADHD). At 𝛼 = 0.01, test the hypothesis that the proportion of ADHD
students in the school is not 0.40.
When a
What to do: statement did not
a. Identify the level of significance. specify any cue
word that describes
b. Formulate the alternative hypothesis, Ha: p ≠ po. direction, then it is
non-directional or
c. Determine the critical value.
two-tailed.
d. Illustrate the rejection region in the normal curve.
Solution:
a. The level of significance is 𝛼 = 0.01.
b. The alternative hypothesis is p ≠ 0.40 due to the expression “is not 0.40”.
This explains why it is non-directional or two-tailed.
c. To determine the critical value using the table, we consider the intersection
of the row for the two-tailed test and the column for 𝛼 = 0.01. Hence, the
table tells us that the critical value is ±2.575.
d. Illustrating the rejection region in the normal curve gives:
Rejection
Region Acceptance 𝛼 0.01
= = 0.005
Region 2 2
𝛼
2
204
What’s More
A. Directions: Determine the critical value and illustrate the rejection region
under the normal curve by using the given information.
1. Ha: p ≠ 0.52
𝛼 = 0.05
5. Ha: p ≠ 0.46
α = 0.05
205
Activity 3.2. Be Quick!
Direction. The following are the different critical values under the
various level of significance and tails. By using their respective codes,
tell the direction of their tail and the corresponding level of significance.
Your answer will be a combination of codes, tail and α. Set time limit.
Direction:
A. Fill in each blank with the correct word or phrase to complete the
statement.
1. The range of the values of the test value which indicates that there is a
significant difference and that the null hypothesis (Ho) should be
rejected is called ____________.
2. The basis for the critical or the rejection region dictated by the
alternative hypothesis is called ________________.
3. The _______________ separates the rejection region from the non-rejection
region.
4. The _______________ is the range of the values of the test value which
indicates that the difference was statistically insignificant and that we
failed to reject the null hypothesis (Ho).
5. The __________ is the symbol used to represent the critical value.
B. Carefully read and answer the following questions
1. Is it true that if the rejection region is two-tailed, α needs to be divided
by 2 to be able to identify the rejection region?
206
2. The computed value should be negative if the rejection region is right-
tailed. Is it true? Explain.
3. A 0.01 level of significance means that the size of the rejection region is
10%. Is this correct? Why?
4. If a problem does not indicate any term of direction, it is non-directional
or two-tailed. Is it true or false?
5. In a right-tailed test, what is the critical value at α = 0.10?
What I Can Do
Directions: The following are the steps in creating the rejection region in
testing hypothesis for population proportion. Arrange them in their best order
by writing the codes indicated in each. What is the mystery word?
Steps 1 2 3 4 5 6 7 8
Answer
Directions: Carefully read and analyze the following situations. Identify the
information being asked. Then, determine the critical value and shade the
area of the rejection region under the normal curve.
1. Suppose that in the past, 40% of all adults favored capital punishment. Do
we have reason to believe that this proportion has increased if in a random
207
sample of 150 adults, 80 favored capital punishment? Use a 0.05 level of
significance.
Ha: _______
α = _______
α = _______
Ha: _______
α = _______
Critical Value: _________
5. A sample poll of 300 voters from Town A showed that 56% were in favor of a
given candidate. At a significance level of 0.10, determine the critical value
and illustrate the rejection region.
Ha: _______
α = _______
Critical Value: _________
208
Time to sum up
what you’ve learned
Assessment today.
Good luck!
4. Which of the following is the critical value if the level of significance is 0.01
tailed to the right?
A. 2.330 B. 2.325 C. 2.320 D. 2.315
5. It is the range of the values of the test value which indicates that there is
significant difference and that the null hypothesis (Ho) should be rejected.
A. critical value C. level of significance
B. rejection region D. non-rejection or acceptance region
6. What graphical model is appropriate for decision-making?
A. bell shape C. normal curve
B. test statistic D. graphical representation
7. It separates the rejection region from the acceptance region.
A. critical value C. acceptance region
B. rejection region D. level of significance
209
A. at most B. effective C. different D. not the same as
10. A farmer believes that using organic fertilizer on his plants will yield a
greater income. His income increased by 20% from last year. State the
alternative hypothesis in symbols.
A. Ha: p < 0.02 C. Ha: p > 0.20
B. Ha: p > 0.02 D. Ha: p < 0.20
12. What is the critical value(s) for a left-tailed test with α = 0.01 level of
significance?
A. –2.325 B. –1.960 C. –1.645 D. –1.285
A. C.
B. D.
210
Additional Activities
2. What do you think will your conclusion be if the computed test statistic
(Zcom) is found outside the rejection region?
211
References
Textbooks
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commision on Higher Education, 2016.
Arciaga, Ronald L., and Dan Andrew H. Magcuyao. Statistics and Probability. Pasay
City: JFS Publishing Services, 2016.
Caraan, Avelino Jr S. Introduction to Statistics & Probability: Modular Approach.
Mandaluyong City: Jose Rizal University Press, 2011.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc.,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability. Malaysia:
Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Online Resources
Statistics How To. “Rejection Region (Critical Region) for Statistical Tests.” Accessed
May 19, 2020 https://www.statisticshowto.com/rejection-region/
212
Statistics and
Probability
Quarter 2 – Module 12:
Computing Test Statistic Value
Involving Population Proportion
213
What I Need to Know
214
What I Know
215
8. When should you NOT use z-test?
A. when you are testing for a mean
B. when you are given the population standard deviation
C. when you are ONLY given the sample standard deviation
D. when you are testing a proportion/percentage of a population
9. When performing a test about population proportion, what test statistic
would you need to use?
A. t-test B. z-test C. chi-square D. standard deviation
10. Considering the pandemic, a survey is held to 1,000 randomly chosen
students in which more than 80% are in favor of holding online classes.
What is the value of the sample proportion 𝑝̂ ?
A. 0.013 B. 0.160 C. 0.640 D. 0.800
11. Which of the following is NOT included in the computation of the z-test for
population proportion?
A. n B. p C. 𝑝̂ D. 𝜇
12. What is the formula to find the z-test for population proportion?
𝑝̂−𝑝 𝑝̂−𝑝
A. z = 𝑝 ( 1−𝑝 ) C. z = 𝑝 ( 1−𝑝 )
√ √
𝑛 𝜇
𝑝̂− 𝜇 𝑝−𝑝̂
B. z = D. z =
𝑝 ( 1−𝑝 ) 𝑝 ( 1−𝑝 )
√ √
𝑛 𝑛
216
Lesson
Computing Test Statistic Value
12 Involving Population Proportion
What’s In
1. α Level of Significance
3. 𝑝̂ Sample Proportion
4. p Population Proportion
5. n Size of Samples
6. Rejection Region
217
Notes to the Teacher
In this module, all situations presented involve sample
sizes which are large enough so that CLT or Central Limit
Theorem can be applied. Therefore, there is no need for the
learners to present or prove the two assumptions presented and
discussed in the lesson. Instead, the lesson will mainly focus on
calculating for the value of z-test statistics involving population
proportion.
What’s New
Here, you will use the concepts that you have learned in module 10.
Directions: Look for the word by carefully reading and answering the guide
questions that follow. Choose your answer from the list. Copy the answer
box and write the letter that corresponds in your answer.
Guide Questions.
1. What part of the whole population own their homes?
2. What is the value of p?
3. What is the size of the sample, n?
4. How many owned their homes, x?
5. Compute for the value of 𝒑
̂.
1 2 3 4 5
218
What Is It
It is observable that the previously cited situation did not use nor
mention words like “mean” or “average” but “percentage” instead. Also, it
utilized count data. Problems such as this involves population proportion.
Inferences involving proportions are made in the context of probability of
“success”, p, in a binomial distribution.
219
Z – Test Statistic for Population Proportion
Remember that the formula for the value of z-test statistic for
population proportion would be:
𝑝̂−𝑝 𝑝̂−𝑝
Zcom = 𝑝𝑞
or Zcom =
𝑝 ( 1−𝑝 )
√𝑛 √
𝑛
where:
zcom is the z-test statistic for proportion.
𝑥
𝑝̂ is the sample proportion ( 𝑛 ).
p is the hypothesized value of the population proportion.
n is the sample size or the number of observations in the
sample.
q is equal to 1 – p.
220
Illustrative Example 2:
Determine the value of Zcom given the following information:
p = 0.42
Sample Size: n = 150
Sample Proportion: 𝑝̂ = 0.45
Solution:
Zcom = ?
𝑝̂ = 0.45
p = 0.42
n = 150
q = 1 – p = 1 – 0.42 = 0.58
0.45−0.42
=
0.42 ( 0.58 )
√
150
0.03
=
0.2436
√
150
0.03
=
√0.001624
0.03
=
0.0403
Zcom = 0.7444
Illustrative Example 3:
The claim is made that 40% of tax filers use computer software to file
their taxes. In a sample of 50, 14 used computer software to file their taxes.
To test Ho: p = 0.4 versus Ha: p > 0.4 at α= 0:05 where p is the population
proportion who use computer software to file their taxes. And to test using
the binomial distribution and test using the normal approximation to the
binomial distribution. Determine first the value of zcom.
221
Solution:
First, determine the value of the following:
Zcom = ?
𝑥 14
𝑝̂ = = = 0.28
𝑛 50
p = 40% = 0.40
n = 50
q = 1 – p = 1 – 0.40 = 0.60
What’s More
Problem A:
An insurance industry report indicated that 30% of those persons
involved in minor traffic accidents this year have been involved in at least
one traffic accident in the last five years. Believing it was too large, an
advisory group decided to investigate this claim. A sample of 200 traffic
accidents this year showed that 56 persons were also involved in another
accident in the last five years. Determine the value of zcom.
Solution:
First, prepare the data that will be used in the formula.
222
Zcom = ?
p = percentage of those who were involved in an accident
during the last five years
= 30%
p = 0.03
number of persons involved in another accident in the last five years
𝑝̂ =
number of persons involved in accidents in the last five years 1
56
= = 0.28
200
n = 200
q = 1 – p = 1 – 0.30 = 0.70
Substitute the values in the formula.
𝑝̂−𝑝
Zcom =
𝑝 ( 1−𝑝 )
√
𝑛
0.28− 0.30
= 2
0.30 ( 0.70 )
√
200
−0.20
= 0.2010 3
√ 200
−0.20
= 4
√0.00105
−0.02 5
= 0.0324
Zcom = 0.6173 6
Problem B:
In the website of Sweet Choco, it was stated that an ideal bag of
chocolates contains 24% white chocolates. Suppose we counted the number
of white chocolates in 40 chocolate sachet packs and the proportion from
the sample is found to be 23.04%, what is the value of z?
Solution:
As preparation for our solution, let us again identify first the needed
data for the formula.
Zcom = ?
p = 24% = 0.24
1
𝑝̂ = 23.04% = .02304
n = 40
q = 1 – p = 1 – 0.24 = 78
Then, substitute these values in the formula.
𝑝̂−𝑝
Zcom =
𝑝 ( 1−𝑝 )
√
𝑛
223
0.2304−0.24
= 2
0.24 ( 0.76 )
√
40
−0.0096
= 0.1724
3
√
40
−0.0096
= 4
√0.00456
−0.0069
= 5
0.0675
6
Zcom = 0.1422
Problem C:
Solution:
Zcom = ?
p = 90% = 0.90
2000
𝑝̂ = = .80
2500
1
n = 2500
q = 1 – p = 1 – 0.90 = 0.10
Substituting these values in the formula,
𝑝̂−𝑝
Zcom =
𝑝 ( 1−𝑝 )
√
𝑛
0.80−0.90
= 2
0.90 ( 0.10 )
√
2500
−0.01
= 0.09
√ 3
2500
224
−0.10
= 4
√0.00036
−0.10
= 5
0.06
Zcom = − 16.67 6
225
What I Can Do
n = _____ ______
q = 1 – p = ______
zcom = ?
n = ______ √
𝑝 ( 1−𝑝 )
𝑛
q = 1 – p = ______
zcom = ?
226
3. Health-care coverage for employees varies with company size. It is
reported that 30% of all companies with fewer than 10 employees
provide health benefits for their employees. A sample of 50 companies
with fewer than 10 employees is selected. It is found that 19 of the 50
companies surveyed provide health benefits for their employees.
Identify the needed data:
Solution:
p = _____
𝑝̂ = _____ Zcom =
𝑝̂−𝑝
n = _____ ______ √
𝑝 ( 1−𝑝 )
𝑛
q = 1 – p = ______
zcom = ?
n = _____ ______ √
𝑝 ( 1−𝑝 )
𝑛
q = 1 – p = ______
zcom = ?
5. A group of online shoppers were surveyed and 70% said that they
spent around P1000 to P1500 every month on the internet shopping.
From 400 respondents, 235 said that they consumed P1000 – P1500
in online shopping.
Identify the needed data:
Solution:
p = _____
𝑝̂ = _____ Zcom =
𝑝̂−𝑝
𝑝 ( 1−𝑝 )
n = _____ ______ √
𝑛
q = 1 – p = ______
zcom = ?
6. A poll taken just prior to election day finds that 389 of 700 registered
voters intend to vote for Kris P. Bacon for mayor of a certain city. The
poll resulted that 50% of all voters intend to vote for Kris.
Identify the needed data:
p = _____ Solution:
𝑝̂ = _____ Zcom =
𝑝̂−𝑝
n = _____ ______ √
𝑝 ( 1−𝑝 )
𝑛
q = 1 – p = ______
zcom = ?
227
Assessment
1. Which of the following is NOT necessary in solving the statistic z-test for
population proportion?
A. Mean C. size of the sample
B. sample proportion D. population proportion
228
company has decided to give free milk tea during break time. In a recent
study, out of 100 randomly sampled employees, 28% said that they would
rather drink milk tea than coffee.
8. What is the value of the hypothesized population proportion?
A. 0.25 B. 0.28 C. 0.75 D. 100
9. What is the value of the sample proportion?
A. 0.25 B. 0.28 C. 0.75 D. 100
10. What is the z-value?
A. 0.5682 B. 0.6065 C. 0.6928 D. 0.7713
11. Which of the following should NOT be considered in using z-test for
proportion?
A. when you are testing for the mean
B. when you are given the sample proportion
C. when you are testing a proportion/percentage of a population
D. when each sample point can result in just two possible outcomes:
success or failure
12. Which of the following is NOT included in the computation of the z-test
for population proportion?
A. N B. 𝜎 C. 𝑝̂ D. 𝑝
For nos. 13-15, refer to the problem below.
In the recent city triathlon, the sponsors have encouraged more
women to participate in the event. A sample is chosen randomly and among
70 runners, 32 are women. The sponsors would somewhat like to be 90%
certain that at least 40% of the participants are women.
229
Additional Activities
230
References
Textbooks
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon
City: Commision on Higher Education, 2016.
Arciaga, Ronald L., and Dan Andrew H. Magcuyao. Statistics and Probability. Pasay
City: JFS Publishing Services, 2016.
Caraan, Avelino Jr S. Introduction to Statistics & Probability: Modular Approach.
Mandaluyong City: Jose Rizal University Press, 2011.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc.,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Stephens, Larry J. Schaum’s Outline of Theory and Problems Of Beginning Statistics.
McGraw-Hill Companies, Inc.,1998.
Stephens, Larry J.Theory and Problems of Statistics Fourth Edition. The McGraw-
Hill Companies, Inc. 2008.
Online Resources
CliffsNotes. “Test for a Single Population Proportion.” Accessed May 28, 2020
https://www.cliffsnotes.com/study-guides/statistics/univariate -inferential-
tests/test-for-a-single-population-proportion
Quizziz. “Hypothesis Tests for Population Proportions.” Accessed May 27, 2020
https://quizizz.com/admin/quiz/5c70658f40b384001a7dc327/ hypothesis-
tests-for-population-proportions
Quizizz. “Population Proportion.” Accessed May 29, 2020 https://quizizz. com/
admin/search/population%20proportion
UCI Donald Bren School of Information & Computer Sciences. “Sample Multiple
Choice Questions.” Accessed May 28, 2020
https://www.ics.uci.edu/~jutts/8/SampleFinalMCKey.pdf
231
Statistics and
Probability
Quarter 2 - Module 13:
Drawing Conclusions About Population
Proportion Based on Test Statistic
Value and Rejection Region
232
What I Need to Know
On the previous lessons, you were already taught how to compute test
statistic concerning population proportions as well as how to determine the
rejection or non-rejection region by using an illustration on a curve.
What I Know
1. What do you call the part of the sample or the proportion of individuals in a
sample sharing a certain trait?
A. sample mean
B. sample variance
C. sample proportion
D. sample standard deviation
233
3. Which of the following is usually expressed as a fraction, decimal, or percentage
of the whole population which has a certain trait or characteristic?
A. sample mean
B. population mean
C. sample proportion
D. population proportion
A. 𝑝̂ B. n C. p D.
7. When the computed z-value (zcom) is 3.16 at α = 0.05 level of significance, which
of the following will be the correct decision?
A. Reject the null hypothesis.
B. Fail to reject the null hypothesis.
C. Reject the alternative hypothesis.
D. Accept the alternative hypothesis.
234
9. What is the relationship between a Type I error and the null hypothesis (Ho)?
A. A Type I error corresponds to rejecting the null hypothesis when it is
true.
B. A Type I error corresponds to rejecting the null hypothesis when it is false.
C. A Type I error corresponds to failing to reject the null hypothesis when it
is false.
D. A Type I error corresponds to failing to reject the null hypothesis when it
is true.
10. What distribution do we use when testing claims about population proportions?
A. F
B. Z
C. t
D. chi
11. Researchers used the given data below and the results to test the claim that
more than 50% of adults support the tax increase.
n = 200 𝑝̂ = 56.5
Ho : p = 0.5 Ha : p > 0.5
z ≈ 1.84 P-value of approximately 0.033
α = 0.05 level
235
13. The given problem is a ____________________________.
A. one-tailed test
B. one-sided test
C. non-directional
D. cannot be determined
15. What is the phrase that best completes the conclusion below?
What’s In
DIRECTIONS: Determine the value of sample proportion (𝑝̂ ) using the given sample
size (n) and the number of elements or observed values (X). Each
number has a corresponding letter below. After you solve for 𝑃̂,
write its corresponding value and letter on the blanks to decode the
secret message. The formula to be used is provided in the box
below.
FORMULA:
𝑋
𝑝̂ =
𝑛
236
1. n = 100 ; X = 48 𝑝̂ = ____ _____
2. n = 225 ; X = 214 𝑝̂ = ____ _____
3. n = 450 ; X = 356 𝑃̂ = ____ _____
4. n = 1000 ; X = 772 𝑃̂ = ____ _____
5. n = 1330 ; X = 988 𝑃̂ = ____ _____
6. n = 2020 ; X = 1915 𝑃̂ = ____ _____
7. n = 2500 ; X = 2301 𝑃̂ = ____ _____
8. n = 3 000 ; X = 2 650 𝑃̂ = ____ _____
9. n = 3 800 ; X = 3 316 𝑃̂ = ____ _____
10. n = 10 000 ; X = 8 900 𝑃̂ = ____ _____
LEGEND:
E – 0.95 W - 0.48 M – 0.92
A – 0.88 L – 0.79 O – 0.77
H – 0.89 T – 0.87 V – 0.74
Guide Questions:
What’s New
237
What Is It
In drawing conclusions, there are two different approaches that you may
apply: the critical z-approach (computed z-value) and the P-value approach.
In applying the first approach which is determining the critical value (which
you were already taught in the previous modules), you need to consider the
following:
If the computed z-statistic (zcom) is > or < the tabular value (ztab), reject
the null hypothesis (Ho).
If the computed z-statistic (zcom) falls in the rejection region, reject the
null hypothesis (Ho).
If the computed z-statistic (zcom) does not fall in the rejection region,
fail to reject the null hypothesis (Ho).
Illustrative Example:
Example 1
a. Ho : p = 0.85
Ha : p < 0.85
b. Level of Significance: α = 0.01
c. Computed Test Statistic:
238
𝑋
𝑝̂ =
𝑛
325
=
400
̂=
𝒑 0.81
𝑝̂−𝑝
z=
𝑝(1−𝑝)
√
𝑛
0.81−0.85
=
0.85 (1−0.85)
√
400
z = -2.24
Using the Areas Under the Normal Curve Table, the critical value is
-2.326 at α = 0.01 level. There is a negative sign in the value due to the
direction of the alternative hypothesis.
d. DECISION: Since the computed test statistic (zcom) z = -2.24 does not fall
in the rejection region, fail to reject the null hypothesis (Ho).
P-VALUE APPROACH
What is P-value?
239
P-value approach uses the following basic procedures:
Illustrative Example:
Given:
Ho: p = 0.5 = 0.05 n= 25,468
Ha: p > 0.5
Solution:
z = 5.49
P=P(Z≥5.49)=0.0000⋯≈0
CONCLUSION: Because the p-value is smaller than the significance
level α=0.05, we can reject the null hypothesis. Again, we
would say that there is sufficient/enough evidence to
conclude that boys are more common than girls in the
entire population at α=0.05 level.
As should always be the case, the two approaches (critical value approach
and p-value approach) lead to the same conclusion.
240
OTHER ILLUSTRATIVE EXAMPLES USING TWO-TAILED TEST
Example 1
Given:
a. n= 50
b. = 0.01 significance level
c. H0 : The proportion of students that want to go to the zoo is 85%.
(H0: p = 0.85)
Ha: The proportion of students that want to go to the zoo is not 85%.
(Ha: p ≠ 0.85 )
d. p = 0.7554
Example 2
Given:
a. n= 150
b. = 0.1 significance level
c. Ho : The proportion of households that have three or more cell phones is
30%. (Ho : p = 0.3)
Ho : The proportion of households that have three or more cell phones is
different from 30%. (Ha : p ≠ 0.3)
d. 𝑝̂ = 0.287
e. Zcom = 0.347
241
DECISION/CONCLUSION: Fail to reject the null hypothesis (Ho). There is
insufficient evidence supporting that the proportion of households with three or
more cell phones is different from 30%.
NOTE:
Conclusions are answers in sentence form which include: 1) whether there is
enough evidence or not (based on the decision); 2) the level of significance; and 3)
whether the original claim is supported or rejected.
Conclusions are based on the original claim which may be the null or
alternative hypothesis. The decisions are always based on the null hypothesis.
Original Claim
H0 Ha
Decision "REJECT" "SUPPORT"
NOTE:
If the null hypothesis isn’t rejected, this doesn’t necessarily mean that it’s
true. It simply means that there is not enough evidence to justify rejecting it.
242
Notes to the Teacher
Students should be aware of p-value approach
since many statistical packages give the p-value but
not the critical value. One advantage of p-value is
that we can immediately know at what level the
testing becomes significant. For example, a p-value of
0.03 would be rejected at 0.01 level of significance,
but it would fail to be rejected at 0.05 level of
significance. Remember, we must decide first on the
level of significance before calculating the test
statistic and finding the p-value.
What’s More
243
2. A research states that 28% of college degrees are from engineering courses.
A researcher doesn’t believe that this is correct. A sample of 1,000 graduates
was used and it was found out that 295 have finished engineering courses.
Test the claim if it has increased at α = 0.10 level. What is the correct null
hypothesis?
a. Ho : p = 0.28
Ha : p > 0.28
b. Level of Significance: α = 0.10
c. Computed Test Statistic: zcom = ____
d. Critical Value: 1.282
e. DECISION: Since the computed test statistic zcom = ____ does not fall in
the rejection region, _________________ the null hypothesis (H o).
Guide:
Given:
Ha: The proportion of students who are overweight is less than 25%.
(Ho : p < .25)
244
Problem 1
Given:
Ho: The proportion of employees in a shoe factory who smoke cigarette is
30%. ( Ho : p = .30)
245
What I Can Do
Directions: Read job vacancies posts on the classified ads section of a newspaper.
Then, draw conclusions about the type of people who will apply for
each job. Write your conclusions based on facts and include the
newspaper clippings where you got the information. You will be
graded using the given rubric below.
RUBRIC
CATEGORY 4 3 2 1
Focus and There is a The idea is The idea is The idea is not
Support for clear and well- clear but the quite clear clear and not
Topic focused topic supporting and not supported
which is details/facts supported with needed
relevant and are not with needed details/facts.
supported complete. details/facts.
with
details/facts.
Grammar & There are no There are 1-3 There are 4-6 There are
Spelling errors in errors in errors in more than 6
grammar or grammar or grammar or errors in
spelling. spelling. spelling. grammar or
spelling.
246
Assessment
Directions: Choose the best answer to the given questions or statements. Write
the letter of your choice on a separate sheet of paper.
247
5. What is the correct decision?
A. There is no possible decision.
B. Reject the null hypothesis.
C. Fail to reject the null hypothesis.
D. Change the alternative hypothesis.
A. -1.75
B. -0.08
C. 0.08
D. 1.75
248
D. Accept both null and alternative hypotheses.
249
Additional Activities
Directions: Read and analyze the following statements. Write ACCEPT if the
statement is correct and write REJECT if it is incorrect. Write your
answer on a sheet of paper.
____________1. The claim being assessed in a hypothesis test is the null hypothesis.
____________2. Critical value is the probability that the null hypothesis is true given
the observed results.
____________4. When p-value is greater than alpha (0.05 used), we fail to reject Ho.
250
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commission on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc.,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Online Resources
https://online.stat.psu.edu/statprogram/reviews/statistical-concepts/proportions
https://people.richland.edu/james/lecture/m170/ch09-int.html
https://www.khanacademy.org/math/ap-statistics/tests-significance-ap/one-
sample-z-test-proportion/v/comparing-p-value-to-significance-level-example
251
Statistics and
Probability
Quarter 2 – Module 14:
Solving Problems Involving Test
of Hypothesis on Population
Proportion
252
What I Need to Know
What I Know
253
2. In problem no. 1, which of the following would be the null hypothesis?
A. The sample proportion is 0.85.
B. The population proportion is 0.85.
C. The sample proportion is not equal to 0.85.
D. The population proportion is not equal to 0.85.
5. What is the critical value (in a test about proportions) for a left-tailed test with
α = 0.05 and n ≥ 30?
A. Zcom = -2.33
B. Zcom = -1.96
C. Zcom = -1.645
D. Zcom = 2.58
6. Suppose the P-value for a hypothesis test is 0.0304. Using a = 0.05, what is the
appropriate conclusion?
A. Reject the null hypothesis.
B. Fail to reject the null hypothesis.
C. Reject the alternative hypothesis.
D. Accept the alternative hypothesis.
8. Tina wants to know if the proportion of people who buy suman is affected at all
by her open microphone reading. If p=0.8 before her reading, what is the
appropriate set of hypotheses?
A. Ho: p = 0.8
Ha: p > 0.8
B. Ho: p = 0.8
Ha: p < 0.8
254
C. Ho: p ≠ 0.8
Ha: p = 0.8
D. Ho: p = 0.8
Ha: p ≠ 0.8
255
15. Which of the following is the best decision and conclusion based on the results
of the test statistic? The computed z-statistic or zcom is 1.35.
A. Since the computed test statistic zcom = 1.35 does not fall in the rejection
region, do not reject the null hypothesis. Therefore, we conclude that at
0.05 level of significance, there was not enough evidence that the number of
people who believed that there was an improvement in our economy has
increased.
B. Since the computed test statistic z = 1.35 does not fall in the rejection
region, reject the null hypothesis. Therefore, we conclude that at 0.05 level
of significance, there was not enough evidence that the number of people
who believed that there was an improvement in our economy has increased.
C. Since the computed test statistic z = 1.35 falls on the rejection region, do
not reject the null hypothesis. Therefore, we conclude that at 0.05 level of
significance, there was enough evidence that the number of people who
believed that there was an improvement in our economy has increased.
D. Since the computed test statistic z = 1.35 does not fall in the rejection
region, do not reject the null hypothesis. Therefore, we conclude that at
0.05 level of significance, there was enough evidence that the number of
people who believed that there was an improvement in our economy has
increased.
What’s In
Directions: Read, analyze, and identify the given on the following problems
involving population proportions.
1. It has been claimed that 30% of students in a particular senior high school
dislike Mathematics. When a survey was conducted by a researcher, it showed
that 150 of 1,000 students dislike Mathematics. Test if the claim was different
from the population at α = 0.01 level.
256
Given:
a. Ho : _______________(symbols)
___________________________________________(statement)
b. Ha : _______________(symbols)
___________________________________________(statement)
c. Level of Significance = __________
d. n = ________
e. X = ________
f. 𝑝̂ = ________
2. In a public senior high school, a survey conducted last year by the barangay
health workers showed that 10% of the students drink alcohol. This year, a new
survey was conducted randomly on 320 students from the same school and it
was found out that 28 of them drink alcohol. Determine if the claim that there is
a decrease on the proportion of senior high school students who drink alcohol is
true. Use α = 0.05.
Given:
a. Ho : _______________(symbols)
___________________________________________(statement)
b. Ha : _______________(symbols)
___________________________________________(statement)
c. Level of Significance = __________
d. n = ________
e. X= _______
f. 𝑝̂ = ________
What’s New
PROBLEM:
257
________ DECISION: Does not fall in the rejection region; fail to reject the Ho
________ computed z-statistic: zcom = -1.15 and critical z-value: -1.645
What Is It
Just like in puzzles, you need to think of different ways on how you will be
able to solve it. Same with solving problems involving test of hypotheses on
population proportions, you need to follow important steps in order to arrive at the
correct answer.
Here are the five (5) steps in solving problems for a test of hypothesis on the
population proportion.
Ho : p = po
Ha : p < po or Ha : p > po or Ha : p ≠ po
258
Remember:
STEP 5. DECISION/CONCLUSION:
259
NOTE:
Illustrative Examples
Example 1: Every year, the assigned teachers determine the Body Mass Index
(BMI) of students. In a certain public junior high school, a study finds
that 10% of Grade 7 students observed are underweight. A sample of
780 Grade 7 students were randomly chosen and it was found out
that 125 of them are underweight. Is this claim different for their
grade level age? Use 0.05 level of significance.
SOLUTION:
𝑋
𝑝̂ =
𝑛
125
=
780
̂=
𝒑 0.16
𝑝̂−𝑝
z=
𝑝(1−𝑝)
√
𝑛
0.16−0.10
=
0.10 (1−0.10)
√
780
260
0.06
=
0.03
zc = 5.61
𝛼 0.05
= = 0.25
2 2
𝑍𝛼
NOTE: Using the Areas Under the Normal Curve Table, critical
2
𝑣𝑎𝑙𝑢𝑒𝑠 at 0.05 level of significance are ± 1.96.
Rejection Region
𝛼 𝛼
= 0.25 = 0.25
2 2
Rejection Region
261
STEP 5: Make a decision whether to reject or fail to reject the null
hypothesis. Draw a conclusion.
DECISION: Since the computed test statistic zcom = 2.0 is greater than the
critical value or it falls in the rejection region, reject the null
hypothesis.
CONCLUSION: Therefore, we conclude that at 0.05 level of significance,
there is enough evidence that the percentage of Grade 7
students who are underweight is different from 10%.
What’s More
1. = 0.05
Ho: p = 0.6
Ha: p ≠ 0.6
2. = 0.05
p = 0.45
right-tailed test
262
What I Have Learned
1. _____________________________________________________________________________
2. _____________________________________________________________________________
3. _____________________________________________________________________________
4. _____________________________________________________________________________
5.______________________________________________________________________________
What I Can Do
A. Give three (3) best experiences in your life wherein you think you made the
right decisions. Share some things, ideas, or techniques that you considered
before finally deciding. You are going to present your answers through a collage
in a short bond paper. (Use recyclable materials like old magazines, newspaper,
etc.)
263
Assessment
1. A hypothesis test is done in which the alternative hypothesis is that more than
10% of the population is left-handed. The calculated p-value for the test is 0.25.
Which statement is correct?
3. A potato chip producer and a supplier of potatoes agree that each shipment
must meet certain quality standards. If the producer is convinced that more
than 8% of the potatoes in the shipment have blemishes, the truck will be sent
away and another one would have to be sent. In a recent shipment, an SRS of
80 potatoes was selected and 7 had blemishes. Use α = 0.01.
264
4. In problem no. 3, what will be the correct conclusion regarding the claim?
A. There is no sufficient evidence that more than 8% of the potatoes in the
shipment have blemishes. Therefore, the truck should be returned.
B. There is sufficient evidence that more than 8% of the potatoes in the
shipment have blemishes. Therefore, the truck should be returned.
C. There is no sufficient evidence that more than 8% of the potatoes in the
shipment have blemishes. Therefore, the truck should not be returned.
D. There is sufficient evidence that more than 8% of the potatoes in the
shipment have blemishes. Therefore, the truck should not be returned.
8. If the computed p-value is greater than the given , which is the correct
decision?
A. Reject the null hypothesis.
B. Fail to reject the null hypothesis.
C. Reject both null and alternative hypotheses.
D. Fail to reject both null and alternative hypotheses.
265
9. From the correct decision in no. 8, what should be your conclusion?
A. There is a missing data.
B. There is an error in the claim.
C. There is sufficient evidence to claim that the reading comprehension rate is
higher during the current year than the previous year.
10. Which of the following will NOT result to a decision of rejecting the null
hypothesis?
A. The z-score is located at the rejection region.
B. The p-value is equal to the level of significance.
C. The test statistic is smaller or larger than the critical value.
D. The p-value is greater than the level of significance.
11. Why do you need to set the level of significance in solving problems for test of
hypothesis?
A. to determine the test statistic
B. to identify the margin of error
C. to easily compute the critical value
D. to make the probability of making a Type I error small
12. Which is true about using critical value approach and P-value approach?
A. They are used only for proportions.
B. They will give you different decisions.
C. They are used as alternative solutions.
D. They both have the same results used for drawing conclusions.
B. Ho : p = 0.08
Ha : p < 0.08
C. Ho : p = 0.08
Ha : p > 0.08
D. Ho : p = 0.08
Ha : p ≥ 0.08
266
14. This test is a _____________________.
A. left-tailed test
B. one-tailed test
C. right-tailed test
D. two-tailed test
267
Additional Activities
1. One thousand five hundred (1,500) randomly selected pine trees were tested for
traces of the Bark Beetle infestation. It was found that 153 of them showed
such traces. Test the hypothesis that more than 10% of the pine trees have
been infested. (Use 5% level of significance.)
=0.5
Ho : p = 0.10
Ha : p > 0.10
zcom = 1.645
ERROR: ___________________
CORRECTED: _____________
2. A sample of 100 students were randomly selected from Pinagpala High School
and 18 of them said they are left-handed. Test the hypothesis that less than
20% of the students are left-handed by using 𝛼 = 0.05 as the level of
significance.
Ho: p = 0.20
Ha: p ≠ 0.20.
zcom = 1.96
𝛼 = 0.05.
It is a one-directional or left-tailed test.
ERROR: __________________
CORRECTED: ____________
3. Newborn babies are more likely to be boys than girls. A random sample found
13,173 boys were born among 25,468 newborn children. The sample
proportion of boys was 0.5172. Is this sample evidence that the birth of boys is
more common than the birth of girls in the entire population?
Ho : p = 0.5
Ha: p > 0.5
Zcom = 5.49
It is a non-directional or two-tailed test.
ERROR: ___________________
CORRECTED: _____________
268
4. Traditionally, about 70% of students in a Statistics course at ECC are
successful. If only 15 students in a class of 28 randomly selected students are
successful, is there enough evidence at 5% level of significance to say that
students of a particular instructor are successful at a rate of less than 70%?
Ho : p = 0.70
Ha: p < 0.70
P-value = 0.0289
ERROR: ____________________
CORRECTED: ______________
Since P-value < , we fail to reject the null hypothesis (Ho).
5. For a class project, a Grade 12 STEM student wants to estimate the percentage
of students who are registered voters in his school. From 45% Grade 12
students, he surveys 500 students and finds that 200 are registered voters.
Test the claim at = 0.05 if there is enough evidence proving that there is a
change in the percentage of registered voters.
= 0.05
Ho : p ≠ 0.45
Ha: p ≠ 0.45
It is a non-directional test.
ERROR: ____________________
CORRECTION: _____________
269
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commission on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc.,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Online Resources
270
Statistics and
Probability
Quarter 2 – Module 15:
Illustrating the Nature of
Bivariate Data
271
What I Need to Know
Are you ready now to study bivariate data using your ADM module? Good
luck and may you find it helpful.
272
What I Know
273
6. “A teacher computed that the mean percentage of score of Grade 8-
Integrity on their 50-item test in Mathematics is 72.50.” What type of
data is illustrated on the situation?
A. bivariate
B. multivariate
C. trivariate
D. univariate
274
11. Which of the following is not used to describe data that fall under
univariate category?
A. mean
B. mode
C. correlation analysis
D. measure of dispersion
15. “Rommel got the following grades on his 9 subjects: six 90s, one 92,
and two 89s. Without computing the average, he estimated that his
general average would be around 90.” Based on the given situation,
what is/are the variable/s?
A. 9 subjects
B. grades on his 9 subjects
C. average on his 9 subjects
D. his general average and his 9 subjects
275
Lesson
Illustrating the Nature of
15 Bivariate Data
A variable is an attribute or characteristic that may take more than
one value which can either be measured or classified. The height and weight
of students, number of hours students spend in studying at home, and daily
allowance of students are examples of variables. From such variables,
information are collected and analyzed. If we are given a bivariate data, the
degree of association between the two variables can be determined.
In this lesson, we will deal with the nature of variables and data
collected.
What’s In
Where Am I Now?
Directions: Identify the variables involved in the following situations.
Situation Variable/s
Involved
Example: Height
Luffy measured the height of his 10
classmates and determined their average
height.
1. Zorro surveyed his cousins’ shoe sizes
and weight.
2. Nami conducted a survey to determine
the number of household members in
their barangay.
3. Sanji interviewed 10 students about
their daily money allowance and
weight.
4. Teacher Kim recorded his students’
scores from IQ and math tests.
5. Karina recorded her daily profit in
selling cassava cake.
276
From the activity, answer the following questions.
277
What’s New
Math Analogy!
Directions: Examine the following sets of words or phrase. Look at the first
pair and examine how the two concepts relate to each other. Then, select the
best word/phrase that would complete the second pair to show the same
relationship.
Guide Questions:
278
What Is It
Data that involve two variables are called bivariate data. The
statistical procedure used to determine and describe the relationship
between two variables is called correlation analysis.
279
What’s More
Activity 1.1
280
Activity 1.2
Situation Variable/s
281
Activity 1.3: Univariate or Bivariate?
Activity 1.4:
282
What I Have Learned
What I Can Do
283
Assessment
284
5. Determine the variables involved in the situation below:
“Asta’s goal for the summer vacation is to have a healthy and fit body.
He recorded the number of minutes he spends daily in doing
abdominal exercises and his weight for 30 days. He found out that the
longer he does abdominal exercise, the more weight he loses.”
A. Asta’s weight
B. Number of days spent and weight
C. Number of minutes doing abdominal exercise and weight
D. Number of minutes doing abdominal exercise and weight loss
285
10. Which of the following can be used to describe data that fall under
univariate category?
A. scatter plot
B. scatter diagram
C. correlation analysis
D. measure of central tendency
11. A Grade 11 student learned from his Economics subject that when
the supply of a product is limited, its price gets higher than the
average price. On the other hand, if there is an increase in supply, its
price gets lower. What type of data is being presented?
A. bivariate
B. multivariate
C. trivariate
D. univariate
12. “A teacher found out that 80% to 90% of the students in class
decided to enroll for the incoming school year despite the threat of
COVID-19 infections in their city.” What type of data is presented
above?
A. bivariate
B. univariate
C. multivariate
D. cannot be determined due to lack of data
286
14. “Carla got the following grades on her 8 subjects: three 87s, one 90,
two 89s, and two 85s. Without computing the average, she estimated
that his general average would be around 87.” Based on the given
situation, what type of data is presented?
A. bivariate
B. multivariate
C. trivariate
D. univariate
15. “A student asked his 30 classmates their Body Mass Index (BMI) and
the number of glasses of water they drink daily. He found out that
those students who consume 8-12 glasses of water daily have normal
BMI.” What type of data is presented on the situation above?
A. bivariate
B. multivariate
C. trivariate
D. univariate
287
Additional Activities
Data collected from surveys, studies, and the likes can involve one,
two, or more variables. These quantitative variables are anything
measurable like the height of students, weight, test scores, and many
(1)
more. If data involves only one variable, it is called __________ data, while
(2)
if data involve two variables, it is called _________ data.
Data that involve two variables are usually described through the use
(4)
of __________ analysis and graphs like scatterplot or scatter diagram.
288
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De
Mesa. Teaching Guide for Senior High School: Statistics and Probability.
Quezon City: Commision on Higher Education, 2016.
Online Resources
289
Statistics and
Probability
Quarter 2 – Module 16:
Constructing a Scatter Plot
290
What I Need to Know
In the previous module, you were able to differentiate bivariate data from
univariate data as well as identify the variable/s that are present on given
situations. Illustrating the bivariate data and identifying the variables involved are
important especially in dealing with researches and studies that you will later
encounter on different fields. In this ADM module, we will specifically deal with
bivariate data and how presented variables are related as you construct a diagram
called “scatter plot”.
Are you ready now to study how to construct a scatter plot using your ADM
module? Good luck and may you find it helpful.
291
What I Know
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
5. Joana read an article from the Parenting.Firstcry.com that the height of the
offspring of a couple is inherited from their father. She surveyed the height
of her 10 classmates and their father to construct a scatter plot. Which
variable should be plotted on the x-axis?
A. height of their father
B. height of their mother
C. height of her classmates
D. annot be determined
292
6. In constructing a scatter plot, which of the variables should assume the
values of y-coordinate?
A. consistent variable
B. dependent variable
C. independent variable
D. cannot be determined
8. Which of the following chart can be used BEST to show the correlation
between two variables?
A. bar graph
B. line graph
C. pictograph
D. scatter plot
10. The trend of the points in a scatter plot suggests the ______ of the
correlation of the variables.
A. direction
B. shape
C. strength
D. variation
11. The closeness of the points around the line in a scatter plot suggests the
_________ of correlation of the variables.
A. form
B. mean
C. shape
D. variation and strength
293
12. If the points on a scatter plot follow a trend of a line, then there is _______
correlation between the variables involved.
A. curvilinear
B. form
C. linear
D. non-linear
13. Luffy finds out that the IQ score of a child is inherited from his/her mother.
What is the independent variable on the situation above?
A. IQ score
B. IQ score of the father
C. IQ score of the child
D. IQ score of the mother
14. Which of the following shows the scatter plot of the data below?
A. C.
B. D.
15. From number 14, what is the variable that will assume the values of
x-coordinate?
A. average grade
B. grades
C. order of the subject
D. cannot be determined
294
Lesson
Step 3: Mark the point where y-coordinate stops. That point is where
the x and y-coordinate met. Name point A.
295
What’s In
Where Am I Now?
Directions: Plot the following points in a Cartesian plane or xy-plane.
1. S (3, 2)
2. C (1, 4)
3. A (- 2, 3)
4. T ( -3,3)
5. E (- 2, -2)
6. R (2, -4)
7. P (2,2)
8. L (3, -1)
9. O (0,3)
296
What’s New
Directions: Refer to the Cartesian plane below. Determine the coordinate of the
points to decode the answer to the question given. Then, write the
name of the point to its corresponding coordinates.
From the activity, what do you call the graph showing the relationship of the
variables of bivariate data?
297
What Is It
Example 1:
The given numbers are the age of a person in years and his/her
corresponding weight.
Age of a 11 12 13 14 15 16 17 18 19 20
person (x)
Weight (y) 40 42 38 35 45 51 48 48 50 47
298
Example 2:
From the data given, the independent variable is the order of the subject and
the dependent variable is the average grade. From this, order of the subject will be
plotted on the x-axis and grades will be plotted on the y-axis as illustrated below.
299
Example 3:
A researcher asked for the weight of 10 students together with the weight of
their mother (biological) and created a scatter plot as presented below.
Weight of mother 65 69 74 78 59 81 76 80 81 75
Weight of student 52 55 62 63 47 66 63 69 68 65
On the given, the independent variable is the weight of the mother while the
dependent variable is the weight of the student. The scatter plot is presented below.
300
What’s More
Activity 1.1: Construct the scatter plot of the following data by plotting the
points.
Daily allowance 35 55 60 65 45 55 70 70 77
Weight 40 38 45 43 60 41 63 57 60
Age 16 18 20 24 26 30 45
Number of
minutes 50 65 70 35 45 60 70
80
70
number of minutes
60
50
40
30
20
10
0
0 10 20 30 40 50
Age
3. A
301
researcher interviewed 10 students about their height and the height of their
father. The results are as follows:
Height 71 69 67 68 68 66 70 72 65 60
of the father
(x)
Height 71 69 69 65 66 63 68 70 60 58
of the
student (y)
80
70
Height of the student
60
50
40
30
20
10
0
58 60 62 64 66 68 70 72 74
Height of father
1. Matalino High School is known for students who excel in Math. A researcher
recorded the IQ of the students and their scores on a 50- item Math test as the
focus of his study.
60
______________________
50
40
30
20
10
0
0 20 40 60 80 100 120 140
______________________________
302
2. ABM 11 students believe on the value of thriftiness. That’s why they conducted
a research on selected elementary learners regarding the amount they save from
their daily allowance and their corresponding weight as follows.
Amount saved in
10 8 15 20 5 3 5 25 10 15
peso
Weight in kilogram 38 40 37 36 42 41 39 35 36 37
43
42
______________________
41
40
39
38
37
36
35
34
0 5 10 15 20 25 30
________________________
3. The data below are aptitude test scores (x) and scores in a long test in
Mathematics (y) of 15 students.
x 38 35 30 28 25 24 20 18 16 15 12 10 8 7 5
y 25 20 17 15 12 15 18 10 12 10 10 10 7 6 5
30
_________________________
25
20
15
10
0
0 5 10 15 20 25 30 35 40
_________________________
303
Activity 1.3: Based on the scatter plot, determine the data and supply them
on the table below.
1. 50
45
40
35
Scores in Math test
30
25
20
15
10
5
0
0 20 40 60 80 100 120 140
No.of minutes spent in studying
Table:
2. 87
86
85
84
83
Grades
82
81
80
79
78
77
76
0 20 40 60 80 100
Number of minutes spent in playing ML
Table:
Number of
minutes spent
in playing ML
Grades
304
3.
40
35
Price of Lumpia 30
25
20
15
10
5
0
0 50 100 150 200
No.of produced Lumpia
Table:
Number of
produced
lumpia
Price of
lumpia
Activity 1.4
Directions: Construct the scatter plot based on the given data. Create the scatter
plot on a separate sheet of graphing paper.
1.
Age of COVID-19
recovered patients 26 28 32 36 40 43 45 22 50 54 34 65
Number of days they
spent in the hospital 8 8 9 10 12 12 10 8 14 16 10 18
2.
Number of glasses
of water drank by
students per day 10 8 12 11 15 7 12 13 9 7 5 12
Number of glasses
of soft drinks
drank by students
per day 2 4 2 1 0 4 2 3 4 5 4 2
305
What I Have Learned
What I Can Do
Ask 15 of your classmates regarding their arm span and their height. Make a
scatter plot of the data collected on a separate sheet of paper. Your output will be
graded according to the rubrics below:
Neatness of the Output 10 points
Accuracy of the Scatter Plot 25 points
Presentation of Data 15 points
Total: 50 points
306
Assessment
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
1. It is a graphical representation that shows the correlation of two variables using
points or dots.
A. bar graph
B. line graph
C. pictograph
D. scatter plot
2. The direction of the points on a scatter plot determines the ______ of the
correlation of the variables.
A. form
B. strength
C. trend
D. variation
The _______ variable is for the x-axis while the ________ variable is for the y-axis.
A. consistent, inconsistent
B. independent, dependent
C. inconsistent, consistent
D. dependent, independent
92
90
88
86
84
82
80
78
76
74
0 2 4 6 8 10 12
307
A. x 2 3 4 5 6 7 8 9 10
y 82 78 81 80 79 85 90 76 75
B. x 2 3 4 5 6 7 8 9 10
y 80 78 81 82 79 83 80 76 75
C.
x 2 3 4 5 6 7 8 9 10
y 80 78 81 80 79 85 90 76 80
D.
x 2 3 4 5 6 7 8 9 10
y 84 78 81 80 79 85 90 76 82
5. Which of the following shows the scatter plot of the data below?
A. C.
B. D.
6. In constructing a scatter plot, which of the variables should assume the values
of x-coordinate?
A. consistent variable
B. dependent variable
C. independent variable
D. cannot be determined
308
7. The direction of the points around the line in a scatter plot suggests the
_________ correlation of the variables.
A. form
B. shape
C. trend
D. variation/strength
9. The form of the points in a scatter plot suggests the ______ of the correlation of
the variables whether linear or non-linear.
A. direction
B. shape
C. strength
D. variation
10. The relationship of the variables in a scatter plot is described in terms of its
______________.
A. shape, trend, and variance
B. mean, mode, and median
C. form, trend, and variation
D. form, direction, and standard deviation
11. Nami studied the results of the IQ test scores of children and the recorded IQ
scores of their parents. She noticed that the IQ of a child is more likely related
to the mother than that of the father. If a scatter plot will be constructed, what
is the independent variable on the situation above?
A. IQ score
B. IQ score of the father
C. IQ score of the child
D. IQ score of the mother
309
12. Which scatter plot is appropriate for the given data below?
x 14 13 17 16 13 15 16
y 160 154 164 162 158 165 170
A.
B.
C.
D.
310
13. Which of the tables contain the correct data based on the scatter plot below?
A.
x 1 2 3 4 5 6 7
y 92 90 87 90 92 91 88
B.
x 1 2 3 4 5 6 7
y 90 92 85 88 90 90 88
C.
x 1 2 3 4 5 6 7
y 90 92 87 88 91 90 88
D.
x 1 2 3 4 5 6 7
y 92 90 88 87 91 90 88
14. Choose the data that best complete the table based on the scatter plot below.
x 30 40 50 50 60 60 80
y
311
A.
y 90 92 88 90 87 90 88
B.
y 92 90 88 91 90 90 87
C.
y 91 92 90 88 92 90 88
D.
y 89 92 90 88 91 92 88
15. Mary Joy read from an article that the height of the offspring of a couple is
inherited from the father. She surveyed the height of her 10 classmates and
their fathers to construct a scatter plot. Which variable should be plotted on the
x-axis?
A. height of her classmates
B. height of their mothers
C. height of their fathers
D. cannot be determined
312
Additional Activities
313
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commision on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Online Resources
Rourke, Emily O. “Performance Based Learning and Assessment Task Tuition Cost
Activity.” Accessed May 25, 2020. https://www.radford.edu/rumath-
smpdc/Performance/src/Emily O’Rourke - Tuition Cost Activity.pdf
314
Statistics and
Probability
Quarter 2 – Module 17:
Describing the Shape (Form), Trend
(Direction), and Variation (Strength)
Based on a Scatter Plot
315
What I Need to Know
Are you ready now to study bivariate data using your ADM module? Good luck and
may you find it helpful.
316
What I Know
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
1. If the points on the scatter graph follow a trend of rising from right to left,
how will you describe the correlation of the variables involved?
A. moderate C. positive
B. negative D. zero
2. Joan noticed that the value of one variable corresponds to either low or high
value of the second variable of a set of bivariate data.
What conclusion can you draw from the direction of correlation?
A. The variables have positive correlation.
B. The variables have negative correlation.
C. The variables have moderate correlation.
D. The variables have zero correlation or negligible correlation.
3. Shanks observed that the points on the scatter plot are close AROUND the
trend line. What conclusion can he draw based from the scatter plot?
A. The variables have no correlation.
B. The variables have strong correlation.
C. The variables have perfect correlation.
D. The variables have moderate correlation
4. Zorro noticed that in constructing his scatter plot, the high values of one
variable correspond to low values of the second variable. What conclusion
can you draw from his data?
A. There is zero correlation between the variables.
B. There is a perfect correlation between the variables.
C. There is a positive correlation between the variables.
D. There is a negative correlation between the variables.
5. If the points on the scatter graph rise from left to right, then the variable
involved has a ______ correlation.
A. moderate C. positive
B. negative D. zero
6. Joan noticed that the high value of one variable corresponds to low value of
the second variable or low value of the first corresponds to high value of
second variable. What conclusion can you draw from the direction of
correlation?
317
A. The variables have zero correlation.
B. The variables have positive correlation.
C. The variables have negative correlation.
D. The variables have moderate correlation.
7. Given the scatter plot below, describe the variation of correlation of the
variables involved.
95
90
85
80
0 10 20 30 40 50 60
9. The strength of the correlation is associated with the ______ of the points to
around the trend line on a scatter plot.
A. closeness C. form
B. direction D. number
10. If the points on the scatter plot fall almost on the trend line, then the
variables are said to have _______ correlation.
A. negative C. positive
B. perfect D. strong
11. Noah noticed that the points on the scatter plot follow a trend of rising from
right to left. He also noticed that the points are scattered moderately from
the trend line. What is the correlation of the variables involved?
A. strong negative C. weak negative
B. strong positive D. weak positive
318
12. Estimate the strength of correlation
of the scatter plot on the right
A. strong negative
B. strong positive
C. weak negative
D. weak positive
13. What conclusion can you draw from the scatter plot below?
14. Complete the statement: “Variables have _____ positive correlation if the
points fall closely to the trend line.”
A. negligible
B. moderate
C. strong
D. weak
15. Zorro noticed that in constructing his scatter plot, the points are scattered
and do not follow any direction. What conclusion can be drawn on the
situation?
A. The variables have perfect correlation.
B. The variables are not related or associated.
C. The variables are moderately and negatively related.
D. The variables involved are strongly and positively related
319
Lesson Form (Shape), Trend (Direction),
What do you think happens with the height of a person as he grows older?
Does the person get taller as he ages or is there a certain period when he gets a bit
shorter after he stopped growing taller? Likewise, will your monthly electric bills get
higher if you continually increase your monthly electric consumption? To describe
the relationship of these variables, one way is to graph its scatter plot and analyze
the shape, trend, and variation of the scatter plot being formed.
What’s In
Where am I Now?
Directions: Create a scatter plot for each of the following situations.
Weight in kg. 23 45 25 37 50 35 40 45 37 35
IQ scores 85 87 93 95 87 97 105 110 115 120
140
120
100
IQ SCORES
80
60
40
20
0
0 10 20 30 40 50 60 70
WEIGHT
320
2. An ABM student interviewed 10 students regarding the amount they save
from their allowance and their weight. Data are shown on the table below.
Amount
saved in peso 10 8 15 20 5 3 5 25 10 15
Weight in kg. 38 40 37 36 42 41 39 35 36 37
43
42
41
40
Weight
39
38
37
36
35
34
0 5 10 15 20 25 30
Amount in peso
321
What’s New
Activity 1. Directions: Study the scatter plot on each situation below and answer
the guide questions.
Situation 1: Teacher Koro recorded the IQ scores of his 10 students and their
average scores in Mathematics. He also constructed a scatter plot of his collected
data as shown below.
Test
scores 23 20 33 34 25 34 36 39 45 49
IQ
scores 85 87 93 95 87 97 105 110 115 130
140
120
100
80
IQ Scores
60
40
20
0
0 10 20 30 40 50 60
Test Score
Situation 2: Enrique plotted the age and height of 10 individuals in the graph
below.
Age 15 18 23 29 35 40 45 56 65 70
Height 146 150 152 155 157 154 153 148 146 145
158
156
154
Height
152
150
148
146
144
0 10 20 30 40 50 60 70 80
Age
322
Guide Questions:
Do You Remember?
1.
2.
A line has a positive slope if the line rises from left to right. A line has a
negative slope if the line rises from right to left. A line with a zero slope is parallel
to x-axis.
Using the same concept, the variables have positive correlation if the
points on the scatter plot follow a trend of rising from left to right portion of the
graph. The variables have negative correlation if the points on the scatter plot
follow a trend of rising from right to left. Finally, the variables have no or
negligible correlation if the points are scattered with no trend or direction of
rise.
323
Activity 2: Positive, Negative, or Zero?
Directions: Determine the trend of correlation based on the scatter plots below.
2.
3.
4.
324
5.
6.
70
60
50
40
30
20
10
0
0 50 100 150
7.
7.
8.
Aside from the form and trend of points on a scatter plot, the correlation of
the variables can also be described by the closeness of the points on scatter plot.
This is called the variation or simply the strength of correlation of the variables.
325
Activity 3: Stop, Look, and Observe!
Directions: Observe the closeness and the direction of the points on each scatter
plot. Then, answer the questions that follow by writing the letter that corresponds
to your answer.
a. b. c.
d. e. f.
326
5. The scatter plot that shows no correlation or negligible correlation
is _______.
A. scatter plot a C. scatter plot e
B. scatter plot c D. scatter plot f
6. Which of these scatter plots shows negative correlation with points
close to one another?
A. scatter plot c C. scatter plot d
B. scatter plot d D. scatter plot e
7. Which of the scatter plots shows a negative correlation with points
moderately dispersed?
A. scatter plot a C. scatter plot c
B. scatter plot b D. scatter plot d
8. The graph with positive correlation with points moderately spread
apart is ______.
A. scatter plot b C. scatter plot d
B. scatter plot c D. scatter plot f
9. The graph that shows points following no direction and correlation
is ______.
A. scatter plot a C. scatter plot d
B. scatter plot b D. scatter plot e
What Is It
327
Trend Graph Direction of Description
the Points
A positive
Positive The points correlation
Correlation follow a exists when
trend rising high values of
from left to one variable
right. correspond to
high values of
another
variable or low
values of one
variable
correspond to
low values of
another
variable.
328
The closeness of the points around the trend line determines the variation or
strength of the correlation between the variables involved. The closer the points to
the trend line, the stronger the correlation of the variables is. The strength of
correlation between two variables can be perfect, strong, weak, or no/negligible
correlation. To summarize the strength of correlation, refer to the table below.
329
Weak Negative The points in this
Correlation correlation are
scattered a bit far
from the trend line
from right to left.
Two variables can also have perfect positive or perfect negative correlation.
In a scatter plot, the variables with perfect correlation will show points that fall into
a straight line.
330
What’s More
Activity 1.1: Forms of a Scatter Plot
Directions: Determine whether the form of the given scatter plot is linear or
curvilinear.
2.
50
40
30
20
10
0
0 20 40 60 80
3.
40
35
30
25
20
15
10
5
0
0 20 40 60 80
331
4.
80
60
40
20
0
0 20 40 60 80
5.
60
50
40
30
20
10
0
0 20 40 60 80
Directions: Examine the given variables below and determine the trend of
correlation as to positive, negative, or no/negligible correlation.
332
8. Height Weight of a person
of a person
9. Speed of a car Distance travelled
10. Salary Number of overtime
of an employee rendered
1. 2. 3.
_____________________ ____________________
4. 5. 6.
7. 8. 9.
333
Activity 1.4: Matchy-Matchy!
Directions: Match Column A with Column B by choosing the letter of the
description under Column B pertaining to the corresponding strength
of correlation listed under Column A. To decode the Word of the Day
below, arrange the letters of your answers accordingly from 1-8.
COLUMN A COLUMN B
_____ 4. Strong Positive Correlation H. points fall far from the trend
line that rises from right to left
334
What I Have Learned
The correlation of variables can be determined by studying its scatter plot. The
scatter plot can be described through its form, also known as (1)____. The form of
scatter plot is either linear or (2)____.
In terms of (3)____of correlation, it could be categorized as positive or negative
correlation depending on the behavior of the points. The variables can also have no
correlation.
The strength of correlation also known as (4)____determines the closeness of the
points in a line. If the points are plotted and they form a line, then there is
(5)____ correlation.
What I Can Do
Directions: Identify pair of variables that fall under the following strengths and
directions of correlation. Explain your answer.
Strength of Correlation Variables involved
335
Assessment
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
1. If the points on the scatter graph rise from left to right, then the variables
involved have a ______ correlation.
A. moderate C. positive
B. negligible D. zero
2. The strength of the correlation is associated with the ______ of the points to
the trend on a scatter plot.
A. closeness C. form
B. direction D. number
3. Noah noticed that the points on a scatter plot follow a trend rising from right
to left. He also noticed that the points are plotted closely around the trend
line. What is the correlation of the variables involved?
A. strong negative C. weak negative
B. strong positive D. weak positive
4. What conclusion can you draw from the scatter plot below?
5. If the points on the scatter plot fall almost in line, then the variables are said
to have ____ correlation.
A. negative C. positive
B. perfect D. strong
6. Joan noticed that the high value of one variable corresponds to high value of
the second variable or low value of the first corresponds to low value of
second variable. What conclusion can you draw from the direction of
correlation?
A. The variables have zero correlation.
336
B. The variables have positive correlation.
C. The variables have negative correlation.
D. The variables have moderate correlation.
9. What can you say about the relationship of the variables shown on the
scatter plot below?
10. If the points on the scatter plot fall almost on the trend line, rising from
right to left, then the variables are said to have _______ correlation.
A. perfect negative C. perfect positive
B. strong negative D. strong positive
11. Noelle noticed that the points on a scatter plot follow a trend of rising from
left to right. He also noticed that the points are moderately scattered from
the trend line. What is the correlation of the variables involved?
337
A. strong negative correlation C. weak negative correlation
B. strong positive correlation D. weak positive correlation
12. Estimate the strength of correlation of the scatter plot on the right.
A. strong negative
B. strong positive
C. weak negative
D. weak positive
13. What conclusion can you draw from the scatter plot below?
14. Complete the statement: “Variables have _____ and ______correlation if the
points rise from left to right falling closely to the trend line.”
A. negative, perfect C. positive, strong
B. negative, strong D. positive, perfect
15. Robin constructed a scatter plot based on the data she collected. What
conclusion can he draw about the relationship of the variables based on the
scatter plot?
338
Additional Activities
Directions: Create a scatter plot based on the given data. Then, determine the
form, trend, and variation of the scatter plot.
1. Age 12 14 15 16 18 19 20 23 24 25
Weight
40 43 48 47 47 49 52 55 50 58
(in kg)
2.
Father's height
166 170 158 178 162 156 175 180 175 183
(cm)
Son's height
160 162 150 165 157 156 170 175 172 180
(cm)
339
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commision on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Online Resources
Rourke, Emily O. “Performance Based Learning and Assessment Task Tuition Cost
Activity.” Accessed May 25, 2020. https://www.radford.edu/rumath-
smpdc/Performance/src/Emily O’Rourke - Tuition Cost Activity.pdf
340
Statistics and
Probability
Quarter 2 – Module 18:
Calculating the Pearson’s
Sample Correlation Coefficient
341
What I Need to Know
This module was designed and written with you in mind. It is here to help you
master computing Pearson’s sample correlation coefficient r. The scope of this
module permits its use in many different learning situations. The language used
recognizes the diverse vocabulary level of students. The concepts are arranged to
follow the standard sequence of the learning area.
Are you ready now to study about the calculation of Pearson’s sample correlation
coefficient using your ADM module? Good luck and may you find it helpful.
342
What I Know
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
6. Based on the bivariate data below, which among the choices is the correctly
constructed table?
X 2 8 11 9
Y 13 20 22 5
343
A. C. X Y XY X2 Y2
X Y XY X2 Y2
13 2 2 13
20 8 8 20
22 11 11 22
5 9 9 5
B. D.
X Y X2 Y2 X Y XY X3 Y3
2 13 2 13
8 20 8 20
11 22 11 22
9 5 9 5
A. C.
X Y XY X2 Y2 X Y XY X2 Y2
1 18 18 1 324 1 18 1 324 18
2 13 26 4 169 2 13 4 169 26
3 7 21 9 64 3 7 9 64 21
6 39 68 14 557 6 39 14 557 68
B. D.
X Y XY X2 Y2 X Y XY X2 Y2
1 18 18 324 1 1 18 1 18 324
2 13 26 169 4 2 13 4 26 169
3 7 21 64 9 3 7 9 21 64
6 39 68 557 14 6 39 14 68 557
8. Using the following summation values below, what is the value of Pearson r ?
n=4 ∑ 𝑋 = 10 ∑ 𝑌 = 15 ∑ 𝑋𝑌 = 39 ∑ 𝑋2 = 30 ∑ 𝑌 2 = 65
A. -0.02
B. 0
C. 0.23
D. 1
9. Using the following summation values below, what is the value of Pearson r ?
n=3 ∑𝑋 = 6 ∑ 𝑌 = 39 ∑ 𝑋𝑌 = 68 ∑ 𝑋2 = 14 ∑ 𝑌 2 = 557
A. -1
B. -0. 74
C. 0
D. 0.39
344
X 1 2 3
Y 5 9 8
10. Which of the following is the CORRECT completed table for the bivariate
data?
A. C.
X Y XY X2 Y2 X Y XY X2 Y2
1 5 1 25 5 1 5 5 1 25
2 9 4 81 18 2 9 18 4 81
3 8 9 64 24 3 8 24 9 64
6 22 14 170 47 6 22 47 14 170
B. D.
X Y XY X2 Y2 X Y XY X2 Y2
1 5 1 5 25 1 5 5 25 1
2 9 4 18 81 2 9 18 81 4
3 8 9 24 64 3 8 24 64 9
6 22 14 47 170 6 22 47 170 14
11. When you substitute all the summation ( ∑ ) values in the formula for
Pearson r, which among the choices is its best representation?
4(47)−(6)(22)
A. 𝑟 =
√[3(14)−222][3(170)− 62]
4(47)−(6)(22)
B. 𝑟 =
√[3(14)−62 ][3(170)− 222]
3(47)−(6)(22)
C. 𝑟 =
√[3(14)−222][3(170)− 62]
3(47)−(6)(22)
D. 𝑟 =
√[3(14)−62 ][3(170)− 222]
X 3 2 0 1 3
Y 10 24 21 15 28
13. Which of the following is the CORRECT completed table for the bivariate
data?
X Y XY X2 Y2
10 3 30 9 100
24 2 48 4 576
345
A. 21 0 0 0 441 C. X Y XY X2 Y2
15 1 15 1 225 10 3 30 100 9
28 3 84 9 784 24 2 48 576 4
98 9 177 23 2126 21 0 0 441 0
15 1 15 225 1
28 3 84 784 9
98 9 177 2126 23
B. X Y X2 Y2 XY2 D. X Y XY X2 Y2
3 10 30 9 100 3 10 30 9 100
2 24 48 4 576 2 24 48 4 576
0 21 0 0 441 0 21 0 0 441
1 15 15 1 225 1 15 15 1 225
3 28 84 9 784 3 28 84 9 784
9 98 177 23 2126 9 98 177 23 2126
14. When you substitute all the summation ( ∑ ) values in the formula for
Pearson r, which among the choices is its best representation?
5(177)−(9)(98)
A. 𝑟 =
√[5(23)−92 ][5(2126)− 982 ]
5(177)+(9)(98)
B. 𝑟 =
√[5(23)−92 ][5(2126)− 982 ]
5(2126)−(9)(98)
C. 𝑟 =
√[5(23)−92 ][5(2126)− 982 ]
5(2126)+(9)(98)
D. 𝑟 =
√[5(23)+92 ][5(2126)+ 982 ]
346
Lesson
Calculating the Pearson’s
18 Sample Correlation Coefficient
In the previous lesson, we learned about bivariate data and pairs of variables
that are related to each other. We also learned how to construct the scatter plots of
these bivariate data and determine the strength and direction of their association or
relationship based on how the points are scattered. In this module, you will focus on
the correlation of bivariate data. Check your readiness for this lesson by answering
the following exercises.
What’s In
Directions: Identify the trend and strength of correlation of the scatter plots below.
Choose your answer from the choices inside the box.
1. 3.
2. 4.
347
5.
We can only estimate the direction and strength of the relationship between
variables using a scatter plot. Is there a way to get the exact direction and strength
of the relationship between variables? Just like any other measurement, correlation
between two variables can be represented by a single number. This number can
determine exactly whether the relationship is negative or positive. It can also tell
exactly the degree or strength of the relationship. Let’s try the next activity.
What’s New
The following tables show the bivariate data x and y. Without constructing a
scatter plot, tell whether they have positive, negative, or no/negligible correlation.
Then, briefly explain your answer.
1. ____________________________
x 1 2 3 4 5 6 ____________________________
y 5 10 10 15 25 30 ____________________________
2. ____________________________
x 1 3 11 10 6 9 ____________________________
y 14 6 12 11 10 9 ____________________________
3. ____________________________
x 10 8 6 4 2 1 ____________________________
y 16 19 26 24 29 36 ____________________________
348
Guide Questions:
1. How do you assess the bivariate data do determine the trend of its
correlation?
___________________________________________________________________________
___________________________________________________________________________
2. Do you think it is easy to determine the trend of its correlation? Why and
why not?
___________________________________________________________________________
___________________________________________________________________________
3. Is there a way to get the exact number that will represent its correlation?
___________________________________________________________________________
___________________________________________________________________________
What Is It
Illustrative Example:
Teachers of Pag-asa National High School instilled among their students the
value of time management and excellence in everything they do. The table below
shows the time in hours spent in studying (X) by six Grade 11 students and their
scores in a test (Y). Solve for the Pearson’s sample correlation coefficient r.
X 1 2 3 4 5 6
Y 5 10 10 15 25 30
349
The next section will guide you on how to compute the Pearson product
moment correlation r.
STEPS SOLUTION
1. Construct a table as shown on
the right side. X Y XY X2 Y2
1 5
2 10
3 10
4 15
5 25
6 30
3.
a. Get the sum of all entries in the X Y XY X2 Y2
X column. This is ∑ 𝑿.
1 5 5 1 25
b. Get the sum of all entries in the 2 10 20 4 100
Y column. This is ∑ 𝒀.
3 10 30 9 100
c. Get the sum of all entries in the 4 15 60 16 225
XY column. This is ∑ 𝑿𝒀.
5 25 125 25 625
d. Get the sum of all entries in the 6 30 180 36 900
X2 column. This is ∑ 𝑿𝟐 .
∑ 𝑿= ∑ 𝒀= ∑ 𝑿𝒀= ∑ 𝑿𝟐 = ∑ 𝒀𝟐 =
e. Get the sum of all entries in the 21 95 420 91 1,975
Y2 column. This is ∑ 𝒀𝟐.
4. Substitute the values obtained Here n = 6 because there are six (6)
from Step 3 in the formula: pairs of values.
6(420) − (21)(95)
=
√[6(91) − (21)2 ][6(1,975) − (95)2 ]
350
2,520 − 1,995
=
√[546 − 441][11,850 − 9,025]
You may use your
calculator here!
525
=
√[105][2,825]
525
=
√296,625
r ≈ 0.96395 or 0.96
In the next module, we will interpret the strength of value of computed r and
we will involve more real-life problems to solve using Pearson r. In the meantime,
let’s focus on computing the Pearson’s sample correlation coefficient r.
351
What’s More
X 1 3 4 5 7
Y 35 20 15 10 15
X Y XY X2 Y2
1 35 35
3 20 9 400
4 15 60 225
5 10 25
7 15 105 225
∑ 𝑋= ∑ 𝑌= ∑ 𝑋𝑌= ∑ 𝑋2 = ∑ 𝑌2 =
20 ______ 310 ______ 2,175
5(310) − (20)(____)
= Be careful in
√[5(_____) − (20)2 ][5(2,175) − (____)2 ] substituting the
values, make sure
1,550 − ________
= they are correct.
√[______ − 400][10,875 − ________] Always double check.
−_____
=
√[_____][_____]
−_____
=
√_____________
−_____
=
_____
𝒓 ≈ −𝟎. 𝟖𝟏
352
Activity 1.2 Complete Me!
Directions: Complete the table below. Then, fill in the blanks in the formula to
arrive at the computed Pearson r.
X Y XY X2 Y2
15 5 225
23 3
11 8 64
9 10 100
15 8 64
20 20 400
∑ 𝑋= ∑ 𝑌= ∑ 𝑋𝑌= ∑ 𝑋2 = ∑ 𝑌2 =
_____ _____ 842 1,581 _____
___(842) − (____)(____)
𝑟= r ≈ 0.03
√[___(1,581)−(_____)2 ][____(_____)− (____)2 ]
1.
n=5 ∑ 𝑋 = 17 ∑ 𝑌 = 85 ∑ 𝑋𝑌 = 375 ∑ 𝑋2 = 75 ∑ 𝑌 2 = 1,875
Letter Box
Y T I L R
1 0.73 -0.14 0.31 0
DECODE…
Let me guide you because…
3 2 1
353
Activity 1.4 You Can Do It!
In Mapalad Integrated High School, a guidance counselor believes that
aptitude score is related to performance. The following sample data obtained from
six students show their aptitude and performance score. Compute the Pearson r.
Show your solution. Aptitude Quarterly Assessment
Score (X) Score (Y)
8 14
15 5
11 8
7 12
5 2
10 11
What I Can Do
354
Assessment
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
1. Which of the following is used to measure the strength of the association between
bivariate data?
A. z – value
B. diagram
C. Pearson - b
D. Pearson’s sample correlation coefficient
(∑ 𝑥𝑦)+(∑ 𝑥)(∑ 𝑦)
D. 𝑟 =
√[(∑ 𝑥 2 )+(∑ 𝑥)2 ][(∑ 𝑦 2 )+(∑ 𝑦)2 ]
355
A. 1 9 C. X Y XY X2 Y2
2 15 10 -1
13 0
9 1
15 2
B. X Y XY X2 Y2 D. X Y XY X3 Y3
-1 15 -1 15
0 9 0 9
1 13 1 13
2 10 2 10
A. C.
X Y XY X2 Y2 X Y XY X2 Y2
2 1 2 4 1 2 1 2 4 1
4 3 12 16 9 4 3 12 16 9
6 5 30 36 25 6 5 30 36 25
12 9 35 56 44 12 9 44 56 35
B. X Y XY X2 Y2 D. X Y X2 Y2 XY
2 1 2 1 4 2 1 2 4 1
4 3 12 9 16 4 3 12 16 9
6 5 30 25 36 6 5 30 36 25
12 9 44 35 56 12 9 44 56 35
8. Using the given summation values below, what is the value of Pearson r?
n=3 ∑𝑋 = 6 ∑ 𝑌 = 30 ∑ 𝑋𝑌 = 60 ∑ 𝑋2 = 14 ∑ 𝑌 2 = 450
A. -0.06
B. 0
C. 0.11
D. 1
9. Using the given summation values below, what is the value of Pearson r?
n = 5 ∑ 𝑋 = 10 ∑ 𝑌 = 15 ∑ 𝑋𝑌 = 90 ∑ 𝑋2 = 60 ∑ 𝑌 2 = 135
A. – 1
B. 0
C. 0.99
D. 1
For numbers 10-12, refer to the following X 1 2 3
bivariate data: Y 10 8 9
10. Which of the following is the CORRECT completed table for the bivariate data?
X Y XY X2 Y2
1 10 100 10 1
2 8 64 16 4
356
3 9 81 27 9
A. C. X Y XY X2 Y2
6 27 245 53 14
1 10 100 10 1
2 8 64 16 4
3 9 81 27 9
6 27 245 53 14
B. D.
X Y XY X2 Y2 X Y XY X2 Y2
1 10 10 1 100 1 10 10 1 100
2 8 16 4 64 2 8 16 4 64
3 9 27 9 81 3 9 27 9 81
6 27 53 14 245 6 27 53 245 14
11. When you substitute all the summation ( ∑ ) values in the formula for
Pearson r, which among the choices is its best representation?
3(53)−(6)(27)
A. 𝑟 =
√[3(14)−62 ][3(245)− 272]
3(53)−(6)(27)
B. 𝑟 =
√[3(14)+62 ][3(245)+ 272 ]
3(27)−(6)(27)
C. 𝑟 =
√[3(14)−62 ][3(245)− 272 ]
3(53)−(6)(27)
D. 𝑟 =
√[3(6)−142 ][3(27)− 2452 ]
B. X Y XY X2 Y2 D. X Y XY X2 Y2
-2 8 64 4 -16 -2 8 -16 64 4
0 5 25 0 0 0 5 0 25 0
3 8 64 9 24 3 8 24 64 9
4 13 169 16 52 4 13 52 169 16
1 20 400 1 20 1 20 20 400 1
6 54 722 30 80 6 54 80 722 30
357
14. When you substitute all the summation ( ∑ ) values in the formula for
Pearson r, which among the choices is its best representation?
5(80)−(6)(54)
A. 𝑟 =
√[5(30)−62 ][5(722)− 542 ]
5(80)−(6)(54)
B. 𝑟 =
√[5(30)+62 ][5(722)+ 542 ]
5(80)−(6)(54)
C. 𝑟 =
√[5(6)−302 ][5(54)− 7222]
(6)(54)−5(80)
D. 𝑟 =
√[5(30)−62 ][5(722)− 542 ]
Additional Activities
An ice cream vendor records the maximum daily temperature and the
number of ice creams he sells each day. An eight-day result is shown in the
table below.
Maximum
Temperature 26 28 24 28 23 24 27 32
(OC)
Number of
Ice Creams 21 38 42 47 29 19 52 56
Sold
1. Display the data in a scatter plot and identify the trend of correlation.
2. Compute the Pearson’s sample correlation coefficient r.
3. Interpret the result of the data.
358
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commision on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability. Malaysia:
Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Ubarro, Arvie D., Josephine Lorenzo S. Tan, Renato Guerrero, Simon L. Chua, and
Roderick V. Baluca. Soaring 21st Century Mathematics Precalculus. Quezon City,
Philippines: Phoenix Publishing House Inc., 2016.
Online Resources
Project Maths Development Team. “Teaching & Learning Plans: The Correlation
Coefficient.” Accessed May 23, 2020.
https://www.projectmaths.ie/documents/T&L/CorrelationCoefficient.pdf
359
Statistics and
Probability
Quarter 2 – Module 19:
Solving Problems Involving
Correlation Analysis
360
What I Need to Know
In the previous lessons, you already familiarized yourself with the concepts
about correlation of bivariate data. You learned about constructing a scatter plot and
identifying the form, direction, and strength of correlation. Also, you already know
how to compute for the Pearson’s sample correlation coefficient. Hence, you’ll have a
recall first on calculating correlation coefficient r. Then, the proceeding activities will
help you master solving problems involving correlation analysis especially in
interpreting Pearson’s r.
361
What I Know
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
2. Miguel needs to analyze the strength of the relationship between two variables.
What is the correct statistical method he needs to conduct?
A. z-test
B. Pearson’s r
C. Pearson’s co
D. regression analysis
4. Given the scatter plot below, describe the strength of correlation of the
variables involved.
A. The variables have no
correlation.
B. The variables have perfect
positive correlation.
C. The variables have strong
positive correlation.
D. The variables have weak
positive correlation.
362
6. Which of the following interpretation best describes the given scatter plot?
A. As the values of x increases,
the values of y increases.
B. As the values of x increases,
the values of y remain the
same.
C. As the values of x decreases,
the values of y decreases.
D. As the values of x increases,
the values of y decreases.
7. A recent study was conducted in school and it was found out that as a
student’s number of absences increase, the academic performance tends to
decline. Which of the following values of r is appropriate to represent the
correlation of the variables in the given statement?
A. -0.48
B. 0
C. 0.15
D. 1
9. Researchers found out that the correlation coefficient r between the number
of hours, people are watching television and their body weight was nearly 0.
Which of the following best describes the result?
A. The longer the time of watching television, the more a person gains
weight.
B. The longer the time of watching television, the more a person loses
weight.
C. The lesser the time of watching television, the more a person gains
weight.
D. There is no relationship between the number of hours watching
television and weight of a person.
A trainer wants to find out the relationship between the person’s height and
pulse rate. The table below shows the gathered data from 7 people.
363
10. What is the computed Pearson’s sample correlation coefficient?
A. -0.75
B. -0.28
C. -0.17
D. 0.123
12. Based on the findings, which of the following best describes the result?
A. The taller a person, the higher his pulse rate.
B. The taller a person, the lower is his pulse rate.
C. The shorter a person, the lower is his pulse rate.
D. There is no correlation between a person’s height and his pulse rate.
15. Based on the findings, which of the following best describes the result?
A. As a student spends more time in studying, the higher is the final exam
grade.
B. A student spending less time in studying tends to have a higher final
exam grade.
C. As a student spends more time in studying, the lower is the final exam
grade.
D. There is no correlation exists between the hours spend in studying and
the final exam grade of the students.
364
Lesson
Solving Problems Involving
19 Correlation Analysis
Correlation analysis is one of the most important statistical tools that you may
consider employing in conducting your research studies. In this module, you will
learn how to solve problems involving correlation analysis. Remember your previous
lessons on describing the form, direction, and strength of association between two
variables and on calculating the Pearson’s r. These skills you learned in the previous
modules will help you understand the concepts presented here. You will dig deeper
in the concept of correlation analysis by solving problems and interpreting the results
in the context of the problems presented.
What’s In
What is the value of Pearson’s r? What does this value mean in terms of
student’s number of absences and their final average grade? Can you interpret the
value of Pearson’s r? To give you more background on problems involving correlation
analysis, let’s have another activity.
365
What’s New
Direction/ Degree/
Estimated
Bivariate Data Trend of Strength of
r Value
Correlation Correlation
1. age of a child and his clothing
size
2. volume of alcohol intake and
level of safety in driving a car
3. weight of a person and his/her
skill level in memorizing poem
4. hours spent working out at the
gym and the volume of body
fats
5. score in Quarterly Assessment
in Mathematics
and numbers of hours spent in
studying Mathematics
Choices:
Guide Questions:
2. How did you come up with the trend of the correlation in each item?
______________________________________________________________________________
______________________________________________________________________________
3. Did you find it easy to pick from the choices for the estimated r value? Why?
______________________________________________________________________________
______________________________________________________________________________
366
4. What helped you decide the degree/strength of correlation?
______________________________________________________________________________
______________________________________________________________________________
5. Knowing that r value ranges from -1 to 1, make a scale for the degree/strength of
correlation as to no/negligible, weak, strong, and perfect. What are your
considerations in deciding the boundaries for each category?
______________________________________________________________________________
______________________________________________________________________________
What Is It
In the first activity in What’s In, you solved for the value of r and identified its
trend. In What’s New activity, you identified the trends, estimated the values of r,
and based on the values, you chose the correct descriptions of the strength of the
correlation. So now, we will interpret r value by looking at the scale that gives both
strength and direction of correlation.
Using the correlation scale, we can determine the strength of the correlation
coefficient r. For example, you have r = 0.63 which means that there is a “strong
positive correlation” between the two variables. To interpret, we can simply state it
this way: “As x values increase, y values also increase and vice versa.”
367
In interpreting the linear relationship of two bivariate data, refer to the value
of r and the scale presented above. We can state our interpretation in different ways.
In order for you to solve problems involving correlation analysis, you must know how
to calculate the value of r and interpret this value using the scale. Since computing
for r value is a necessary skill, you may go back to the previous lesson if you feel that
you haven’t mastered it yet. Otherwise, proceed to the following examples of solving
for r.
Scenario: Filipino employees are known for being persistent and hardworking. That
is why they truly value every single cent of their salary. Here are some
situations showing the relationship between the salary and spending of a
Filipino employee.
Computed
Bivariate Data Interpretation
Pearson’s r
Temperature and the There is a strong negative correlation
number of hot chocolate -0.781 between the temperature and the
products sold number of hot chocolate products sold.
Amount of coffee intake There is a weak positive correlation
and number of hours 0.426 between the amount of coffee intake
you stay awake and number of hours you stay awake.
Height and salary of There is no correlation between the
0
teachers height and salary of teachers.
368
If data are in a scatter plot, we can determine the strength of correlation and
value of r by estimating it. Refer to the given examples below:
Estimated
Estimated
Scatter Plot Strength of the
Value of r
Correlation
The value of r should
be in the range
between
0.5 and 1.
Strong Positive
Correlation We can say 0.8 or
0.75
as long as it is within
the range in the
correlation scale.
The value of r should
be in the range
between
0 and -0.5
Weak Negative
Correlation
We can say -0.39
as long as it is within
the range in the
correlation scale.
The closeness of the points around the trend line determines the strength
of the correlation. The closer the points to the trend line, the stronger the
correlation of the variables is.
This is comparable to Filipino family ties, right? The closeness of each family
member will lead them to stronger family relationship.
369
What’s More
Perfect Negative
Correlation
2.
Strong Negative
Correlation
No Correlation
370
3.
Strong Positive
Correlation
Weak Positive
Correlation
4.
Strong Negative
Correlation
Weak Negative
Correlation
LETTER BOX
R S A I L K
DECODE…
PEARSON
1 2 3 4
Directions: The following are the results of some other researches with their
correlation coefficients. Make an interpretation for each survey. Item
number 1 is already given as example.
2. In another survey, the correlation coefficient r between engine size and fuel
consumption was found to be -0.261.
Interpretation: ___________________________________________________________
___________________________________________________________
3. In a survey, the correlation between the number of hours per week students
spent studying and their performance in an exam was found to be 0.72.
Interpretation: ___________________________________________________________
___________________________________________________________
371
4. In another survey, the correlation between the numbers of hours per week
students spent studying and their performance in an exam was found to be
0.483.
Interpretation: ____________________________________________________________
____________________________________________________________
Interpretation: ____________________________________________________________
____________________________________________________________
Interpretation: ____________________________________________________________
____________________________________________________________
Directions: Read the following situations. Using each data, calculate the Pearson’s
sample correlation coefficient. After obtaining Pearson’s r, analyze and
interpret the result. (Show your solution.)
1. The table shows the data obtained from six students of Mapalad Integrated
High School in a study about the number of hours a student exercises each
week and the score s/he gets in a test.
372
Notes to the Teacher
Tell your students to be cautious in interpreting
correlations. One can easily conclude that because two variables
have a strong correlation, one variable causes the change in the
other. In other words, correlation does not imply causation
(cause-and-effect relationship).
Also, the size of the sample affects the size of a
correlation. Generally, we calculate the sample correlation to
infer the true correlation of our population. The larger the
sample, the more reliable is the obtained correlation. That’s why
a significant test should be done to assess the sample
correlation’s reliability.
373
What I Can Do
The following are data on the height of a father and his eldest son, in inches:
Assessment
Directions: Choose the best answer to the given questions or statements. Write the
letter of your choice on a separate sheet of paper.
2. Miguel needs to analyze the strength of the relationship between two variables.
What is the correct statistical method he needs to conduct?
A. z-test
B. Pearson co
C. Pearson alpha
D. Pearson’s sample correlation coefficient
374
4. Given the scatter plot below, describe the strength of correlation of the
variables involved.
A. The variables have perfect positive
correlation.
B. The variables have strong positive
correlation.
C. The variables have strong negative
correlation.
D. The variables have weak negative
correlation.
6. Which of the following interpretation best describes the given scatter plot?
A. As x values increase,
y values increase.
B. As x values increase,
y values decrease.
C. As x values increase, y values
remain the same.
D. As x values decrease,
y values increase.
7. There is a survey indicating that as weather gets colder, air conditioning costs
decrease. Which of the following values of r is appropriate to the result of the
survey?
A. -0.93
B. 0
C. 0.19
D. 0.87
8. The correlation coefficient r between the number of bags of popcorn and the
number of sodas sold at each performance of the circus over one week was
found to be 0.62. Which conclusion can be drawn from the result?
A. There is no correlation between popcorn sales and soda sales.
B. There is a strong positive correlation between popcorn sales and soda
sales.
C. There is a weak positive correlation between popcorn sales and soda
sales.
D. There is a strong negative correlation between popcorn sales and soda
sales.
9. An ornithologist, a person who studies every aspect of birds, found out that the
correlation coefficient r between wing length and tail length of 12 different
species of bird was 0.43. Which conclusion can be drawn from the result?
A. A bird with longer wing length has shorter tail length.
B. A bird with shorter wing length has longer tail length.
C. A bird with longer wing length also has longer tail length.
D. There is no correlation between bird’s wing length and tail length.
375
For numbers 10-12, refer to the following situation:
Price in Peso 11 12 13 16 18 19 20
Demand 38 31 26 23 20 20 17
12. Based on the findings, which of the following best describes the result?
A. As price of good increases, the quantity of demand increases.
B. As price of good decreases, the quantity of demand decreases.
C. As price of good increases, the quantity of demand decreases.
D. As price of good increases, the quantity of demand remains the same.
Score in Score in
Mathematics Science
(X) (Y)
12 13
10 9
5 8
7 8
11 14
6 7
376
14. What is the strength of correlation?
A. strong negative
B. perfect positive
C. strong positive
D. weak positive
15. Based on the findings, which of the following best describes the result?
A. A student who is good in Mathematics is also good in Science.
B. A student who is good in Mathematics is not good in Science.
C. A student who is not good in Mathematics is good in Science.
D. There is no correlation between the performance of the students in
Mathematics and Science.
Additional Activities
1. Compute the correlation coefficient of the following bivariate data. Then, give a
conclusion based on the results.
Number Selling
of years price
owned (Y)
(X)
1 23
2 20
3 17
4 14
5 11
2. As shown in the table below, a person’s heart rate during exercise changes as he
gets older. Compute the Pearson’s r, then interpret the result.
377
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commision on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability. Malaysia:
Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Ubarro, Arvie D., Josephine Lorenzo S. Tan, Renato Guerrero, Simon L. Chua, and
Roderick V. Baluca. Soaring 21st Century Mathematics Precalculus. Quezon City,
Philippines: Phoenix Publishing House Inc., 2016.
Online Resources
378
Statistics and
Probability
Quarter 2 – Module 20:
Identifying Dependent and
Independent Variables
379
What I Need to Know
This module was designed and written with you in mind. In this module, you
will be introduced to the two types of variables: dependent and independent.
Furthermore, you will learn how to distinguish the two.
Are you ready now to study using your ADM module? Good luck and may
you find it helpful.
380
What I Know
5. Marco believes that health is wealth. That is why as part of his routine, he
jogs every morning. The number of calories he burns during a jog depends
on the distance he jogs. In the given situation, which are the variables?
A. Marco
B. number of calories
C. distance Marco jogs
D. number of calories Marco burns and distance Marco jogs
381
6. Which of the following concepts is TRUE about dependent and independent
variables?
A. Dependent variable is the one being studied and measured.
B. Dependent variable is the one you do not expect to change.
C. Dependent variable causes a change in independent variable.
D. Dependent variable is the variable the experimenter changes or
controls.
7. In graphing the scatter plot, where do you plot the independent variable?
A. origin
B. x-axis
C. y-axis
D. z-axis
8. Andrew knows that through education, he can achieve his dreams. That is
why to support his schooling, Andrew is doing a part-time job. For each hour
of working, he earns Php40. What is the independent variable?
A. Php 40
B. part-time job
C. amount earned
D. hours spent in doing part-time job
9. Jeny and Camille are preparing relief goods to serve at an evacuation center.
The more relief goods they prepare, the more people they will be able to
serve. Which of the following is the dependent variable?
A. Jeny and Camille
B. evacuation center
C. number of relief goods prepared
D. number of people they will be able to serve
10. Suppose a study found that spending time with friends or family decreases
the amount of stress someone is feeling and allows them to perform better
on tests. In the given statement, which represents the dependent variable?
A. total number of tests
B. spending time with friends or family
C. spending stress with friends or family
D. amount of stress someone is feeling
382
12. Which of the following sentence structures is CORRECT in featuring
dependent and independent variables?
A. How far you can drive depends on the weight of the car.
B. The amount of gas you have depends on your driver’s license.
C. The amount of gas you have depends on how far you can drive.
D. How far you can drive depends on the amount of gas you have.
13. If the dependent variable is the plant’s height, which of the following
CANNOT possibly be the independent variable?
A. amount of water provided to the plant
B. type of fertilizer provided to the plant
C. exposure of plant to classical music
D. amount of sunlight received by the plant
15. If the dependent variable is the speed of goldfish growth, which of the
following can possibly be the independent variable?
A. type of food
B. type of water
C. size of aquarium
D. all of the above
383
Lesson
Dependent and Independent
20 Variables
What’s In
Univariate or
Statements
Bivariate
1. A researcher looked into salary/income and civil status of
government employees.
3. The school nurse recorded the age and the blood pressure
of the teachers.
Guide Questions:
1. How were you able to differentiate univariate from bivariate data?
___________________________________________________________
2. Which among the statements above may deal with relationship between
variables?
___________________________________________________________
When we are examining bivariate data, the two variables could depend on
each other and one variable could influence another. In this case, try to answer the
next activity.
384
What’s New
Column A Column B
____ 1. A.
____ 2. B.
____ 3. C.
____ 4. D.
____ 5. E.
385
Guide Questions:
1. How did you match each picture in Column A to Column B? What are the things
you considered?
______________________________________________________________________
3. Suppose you didn’t eat breakfast, what could possibly happen to you?
______________________________________________________________________
5. Which situation do you think should happen first before another situation
happens? Explain your answer.
_______________________________________________________________________
386
What Is It
In the previous activity, you became familiar with bivariate data. Bivariate
data always involve two variables. One of these variables is the dependent variable
and the other one is the independent variable.
For example, the researcher wants to determine the effects of use of social
media in the academic performance of students in Mathematics.
The bivariate data in the study are use of social media and academic
performance. The academic performance depends on the use of social media, or we
can say that academic performance is affected by the use of social media.
Therefore, independent variable here is the use of social media and the
dependent variable is the academic performance.
affects
INDEPENDENT DEPENDENT
VARIABLE VARIABLE
depends on
387
Let’s have more examples.
Identify the dependent and independent variables in the next statements.
Taking a nap in the afternoon makes people more relaxed and less
irritable for the rest of the day. If you want to be more relaxed and less
irritable, it depends on the number of hours you take a nap in the afternoon.
affects
INDEPENDENT DEPENDENT
VARIABLE VARIABLE
depends on
affects
INDEPENDENT DEPENDENT
VARIABLE VARIABLE
improving frequency of
fundamental using Math app
basic skills and (Math Tricks)
retention
depends on
388
What’s More
1.
kilowatts used in a
electricity bill
household
2.
person’s running time it takes to run a
speed mile
3.
genes they inherit
height of a person
from their parents
number of
chance of
cigarettes a person 1.
developing cancer
smokes
2.
taking care of our
global warming
natural resources
389
tardiness, cutting 3.
chance of dropping classes, and
out absenteeism of a
student
4.
educational opportunities for
attainment high-paying jobs
5.
total calories and number of junk
fat foods you eat
Direction: Complete the following statements. In nos. 3-4, choose the expression
on the parentheses that best completes the sentences.
391
What I Can Do
Independent Dependent
Research Title
Variable Variable
Example:
Math Minute and Peer Tutoring in Math Minute student’s mastery
Raising the Level of Student’s and peer tutoring in solving basic
operation
Mastery in Solving Basic
of integers
Operations of Integers
1. Organizational Commitment and
Teaching Performance
of Elementary Teachers in Rizal
2. Conceptual, Interpersonal,
and Technical Skills of Bank
Managers: Their Relationship
to Operational Efficiency
3. Increasing Mathematics
Achievement Through
Contextualized and Localized
Materials
4. Impact of Blended Learning
on Student Achievement in Social
Studies
5. Effectiveness of Exposing
Students to Classical Music
on Reading Comprehension
392
Assessment
Directions: Choose the best answer to the given questions or statements. Write
the letter of your choice on a separate sheet of paper.
1. What do you call a data that involves two variables?
A. bivariate
B. constant
C. dependent
D. independent
3. Which of the following variables may change or vary and can be considered
as outcome?
A. bivariate
B. constant
C. dependent
D. independent
6. Which of the following concepts below is NOT TRUE about dependent and
independent variables?
A. The dependent variable causes change in the independent variable.
B. The independent variable causes change in the dependent variable.
C. The dependent variable is affected by the independent variable.
D. The dependent variable depends on the independent variable.
393
7. In graphing the scatter plot, where do you plot the dependent variable?
A. origin
B. x-axis
C. y-axis
D. z-axis
8. You want to determine which type of fertilizer helps plants grow the fastest,
so you add a different brand of fertilizer to each plant and see how tall they
grow. What is the independent variable?
A. plants
B. type of fertilizer
C. how plants grow
D. fruit of the plant
9. Students took a test after they studied either in silence or with the television
turned on. What is the dependent variable?
A. silence
B. subject
C. score on test
D. television turned on
10. A team from the Department of Public Works and Highways (DPWH) is
painting lines on the freeway. The number of hours needed to finish the
project depends on the length of road that needs to be lined. In the given
statement, which represents the independent variable?
A. road
B. length of the road that needs to be lined
C. number of hours need to finish the project
D. Department of Public Works and Highways
12. According to a study, the more time people spend using social media, the
less able they are to express themselves in conversation. In the given
statement, which represents the dependent variable?
A. more time for themselves
B. using social media in conversation
C. more time people spend in using social media
D. less able to express themselves in conversation
394
C. The type of soil used depends on the type of grass.
D. Grass doesn’t grow in any type of soil used.
15. If the dependent variable is “increasing the ability to learn in school”, what is
the best possible independent variable?
A. travelling every week
B. taking selfies three times a day
C. eating breakfast in the morning
D. watching television for 12 hours
395
Additional Activity
Directions: Give five (5) research proposal titles that involve bivariate data. Identify
the dependent and independent variables in the title.
1. ______________________________________________________________________
______________________________________________________________________
2. ______________________________________________________________________
______________________________________________________________________
3. ______________________________________________________________________
______________________________________________________________________
4. ______________________________________________________________________
______________________________________________________________________
5. ______________________________________________________________________
______________________________________________________________________
396
References
Books
Albacea, Zita VJ., Mark John V. Ayaay, Isidoro P. David, and Imelda E. De Mesa.
Teaching Guide for Senior High School: Statistics and Probability. Quezon City:
Commision on Higher Education, 2016.
De Guzman, Danilo. Statistics and Probability. Quezon City: C & E Publishing Inc,
2017.
Punzalan, Joyce Raymond B. Senior High School Statistics and Probability.
Malaysia: Oxford Publishing, 2018.
Sirug, Winston S. Statistics and Probability for Senior High School CORE Subject A
Comprehensive Approach K to 12 Curriculum Compliant. Manila: Mindshapers
Co., Inc., 2017.
Ubarro, Arvie D., Josephine Lorenzo S. Tan, Renato Guerrero, Simon L. Chua, and
Roderick V. Baluca. Soaring 21st Century Mathematics Precalculus. Quezon
City, Philippines: Phoenix Publishing House Inc., 2016.
Online Resources
Carpentieri, Danielle. “Grades 7-8 Independent and Dependent Variables.”
Accessed May 20, 2020.
https://digitalcommons.pace.edu/cgi/viewcontent.cgi?article=1003&context
=middle_math.
Khan Academy. “Dependent and Independent Variables Review.” Accessed January
21, 2019. https://www.Khanacademy.Org/Math/Pre-Algebra/Pre-Algebra-
Equations Expressions/Pre-Algebra-Dependent-Independent/A/Dependent-
And-Independent-Variables-Review
Northern Arizona University. “Understanding Variables.” Accessed January 25,
2019.
http://Jan.Ucc.Nau.Edu/~Mid/Edr610/Class/Variables/Variables/Lesson3-
1-1.Html
397
For inquiries or feedback, please write or call:
398