MATH - Stat Proba Q3 - 2022 2023 1
MATH - Stat Proba Q3 - 2022 2023 1
STATISTICS &
PROBABILITY
QUARTER 3 – MODULE 1
LESSONS 1-10
i
(DO_Q3_STAT&PROB_MODULE1_LESSONS1-10)
RESOURCE TITLE: Statistics & Probability
Alternative Delivery Mode
Quarter 3 – Lessons 1-10
Second Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of
the Government of the Philippines. However, prior approval of the government agency or office
wherein the work is created shall be necessary for exploitation of such work for profit. Such
agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this module are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials from their
respective copyright owners. The publisher and authors do not represent nor claim ownership
over them.
ii
SENIOR HIGH SCHOOL
STATISTICS &
(LEARNING AREA)
PROBABILITY
(QUARTER NUMBER)
(MODULE
QUARTER 3 –NUMBER)
MODULE 1
LESSON 1:
Exploring Random Variables
3 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Introductory Message
This Self-Learning Module (SLM) is prepared so that you, our dear learners, can
continue your studies and learn while at home. Activities, questions, directions,
exercises, and discussions are carefully stated for you to understand each lesson.
Each SLM is composed of different parts. Each part shall guide you step-by-
step as you discover and understand the lesson prepared for you.
Pre-tests are provided to measure your prior knowledge on lessons in each
SLM. This will tell you if you need to proceed on completing this module or if you
need to ask your facilitator or your teacher’s assistance for better understanding of
the lesson. At the end of each module, you need to answer the post-test to self-check
your learning. Answer keys are provided for each activity and test. We trust that
you will be honest in using these.
In addition to the material in the main text. Notes to the Teacher are also
provided to our facilitators and parents for strategies and reminders on how they can
best help you on your home-based learning.
Please use this module with care. Do not Put Unnecessary marks on any part
of this SLM. Use a separate sheet of paper in answering the exercises and tests. And
read the instructions carefully before performing each task.
If you have any questions in using this SLM or any difficulty in answering the
tasks in this module, do not hesitate to consult your teacher or facilitator.
Thank you.
iv (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Targets:
1. illustrate random variables (discrete and continuous) (M11/12SP-IIIa-1);
2. distinguish between a discrete and a continuous random variable(M11/12SP-
IIIa-2);
3. find the possible values of a random variable(M11/12SP-IIIa-3); and
4. illustrate a probability distribution for a discrete random variable and its
properties(M11/12SP-IIIa-4).
Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
______1.) It is a random variable that has countable set of possible outcomes.
A. constant B. continuous C. discrete D. finite
Three coins are tossed and the random variable Y gives the number of tails.
Lesson
1 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
What’s In
Directions: Determine whether the given is countable or measurable. Write your
answer in the space provided below:
What’s New
Random Variable
2 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Random Variable
➢ also called as stochastic variable
➢ a set of possible values from a random experiment
➢ essentially a variable, usually denoted as X or any capital letter of the
alphabet because its value is not constant
➢ assumes different values due to chance
What is It
There are many examples of random variables in our life. Read each example
below:
Below are examples of discrete random variables because their possible values are
obtained through counting.
Example 1. Suppose two coins are tossed. Let Y be the random variable
representing the number of tails that occur. Find the values of the random variable
Y.
Illustration:
So, the possible values of the random variable (range space) Y are 0,1 and 2.
Example 2. Anne conducted an experiment and tested four cell phones at random.
She wants to find out the number of defective cell phones that occur. Thus, to each
outcome in a sample space she assigned a value. These are 0, 1, 2, 3 and 4. If there
is no defective cell phone, she assigned number 0; if there is 1 defective cell phone,
she assigned number 1; if there are two defective cell phones; she assigned 2; she
assigned 3, if there are three defective cell phones; and 4 if the four cell phones are
all defective. The number of defective cell phones is a random variable.
3 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Illustration: Let D represent the defective cell phone and N represent the non-defective cell
phone. If we let X be the random variable representing the number of defective cell phones,
can you show the values of the random variable X?
The possible values of this random variable (range space) are 0, 1, 2, 3 and 4.
Example 2. XYZ machine is run and the recorded time it starts to experience
a glitch G illustrates a continuous random variable since the value of the
variable may be assigned using measurement.
Probability Distribution
The following properties are observed given that 𝑝𝑖 are individual probabilities
for each value (𝑥𝑖 ) of the random variable:
4 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Sample space: {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}
Example 2: Suppose two coins are tossed. Let Y be the random variable representing
the number of tails that occur. Find the values of the random variable Y.
What’s More
Example 1: An investor has 5 stocks that she follows each day. The random variable
being studied is Y. Based on the table below, what is P(0)?
Y 0 1 2 3 4 5
P (Y) ? 0.27 0.34 0.11 0.07 0.02
Solution:
Since the sum of all the individual probabilities in the distribution is equal
to 1,
P(0) + P(1) + P(2) + P(3) + P(4) + P(5)= 1
P(0) + 0.27 + 0.34 + 0.11 + 0.07 + 0.02= 1
P(0) + 0.81= 1
P(0) = 1 - 0.81
P(0) = 0.19
Therefore, the probability that the investor has zero (0) stocks is 0.19.
Example 2: Tell whether the given values can serve as the values of a probability
distribution of the random variable X that can take on only the values 1, 2, and 3.
P(1) = 0.08, P(2) = 0.12 and P(3) = 0.17
5 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Solution:
Since the sum of all the individual probabilities in the distribution is equal to 1,
P(1) + P(2) + P(3) =1
0.08 + 0.12 + 0.17 =1
0.37 ≠1
Since their sum is not equal to 1, we can say that the given values cannot serve as
the values of a probability distribution of the random variable X that can take on
only the values 1,2, and 3.
4. It is a statistical function that describes all the possible values and likelihoods
that a random variable can take within a given range. Like any other statistical
distribution, a probability mass function may be graphed using a histogram.
________________________________
5. The sum of all the individual probabilities in the distribution is equal to _____.
What I Can Do
Roxas family has three children. Let A represent the number of boys. Construct a
probability distribution for the random variable A.
6 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Assessment
______4.)
X 0 2 4 6 8
P(X) 1 1 1 1 1
6 6 3 6 6
______5.)
X 1 2 3 5
P(X) 1 1 1 1
4 8 4 8
Problem solving.
1. Two balls are drawn in succession without replacement from an urn
containing
2. 2 green balls and 6 yellow balls. Let M be the random variable representing
the number of yellow balls.
a. What are the possible values of this random variable (range
space)?
b. Construct a probability distribution.
Additional Activities
Five coins are tossed, and the random variable X gives the number of heads.
Determine the following:
7 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
(DO_Q3_STAT&PROB_MODULE1_LESSON1) 8
What I know Additional
Activities
1. C 1. 4
2. B 2. 3
3. C 3. 4
4. C 4. 0
5. C 5. {0, 1, 2, 3, 4, 5}
What’s In
a. Countable
1. Number of soda cans recycled
2. Number of chairs
3. Number of students in a class
b. Measurables
1. Height
2. Weight
3. Volume of water
4. The time required to run a mile
5. Amount of solution in alcohol
What I have learned
1. Random variable
2. Discrete random variable
3. Continuous random variable
4. Probability distribution
5. 1
Assessment
A. B.
1. Discrete 1. NPD
2. Continuous 2. NPD
3. Discrete 3. PD
4. Discrete 4. PD
5. Discrete 5. NPD
Answer Key
SENIOR HIGH SCHOOL
STATISTICS &
(LEARNING AREA)
PROBABILITY
(QUARTER NUMBER)
(MODULE
QUARTER 3 –NUMBER)
MODULE 1
LESSON 2:
Computing the Mean and
Variance of a Discrete
Probability Distribution
9 (DO_Q3_STAT&PROB_MODULE1_LESSON1)
Targets:
1. compute probabilities corresponding to a given random variable (M11/12SP-
IIIa-6);
2. illustrate the mean and variance of a discrete random variable (M11/12SP-
IIIb-1); and
3. calculate the mean and variance of a discrete random variable M11/12SP-
IIIb-2).
Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
______1.) The sum of the product of each value of a discrete random variable X and
its probability is referred to as its _____.
A. standard deviation C. mean
B. variance D. variables
______2.) Consider the given discrete probability distribution. Find the probability
that M equals 5.
M 3 5 7 9
P(M) 0.12 ? 0.26 0.04
A. 0.58 C. 1
B. 0.28 D. 0.48
______3.) Mang Memong’s bakery has determined a probability distribution for the
number of ensaymada it sells in a given day. The distribution is as follows:
Number sold in a 0 5 10 15
day
Probability (number 0.13 0.20 0.32 0.35
sold)
Find the number of ensaymada that Mang Memong’s bakery expects to sell in a day.
A. 5.25 C. 5.52
B. 1 D. 9.45
_______4.) Let V is denoted as the number of heads in three tosses of a coin.
Determine the variance of this random variable.
A. 3.5 C. 1.5
B. 2.5 D. 0.5
_______5.) A random variable Y can take only two values, 2 and 5 such that P(2) =
0. 40 and P(5)= 0.60. Determine the mean of Y.
A. 3 C. 2.8
B. 3.8 D. 2.45
10 (DO_Q3_STAT&PROB_MODULE1_LESSON2)
Lesson Computing the Mean and
2 Variance of a Discrete
Probability Distribution
In this lesson you will learn to compute probabilities corresponding to a given
random variable, illustrate the mean and variance of a discrete random variable; and
calculate the mean and variance of a discrete random variable (M11/12SP-IIIa-6-
M11/12SP-IIIb-1-2).
What’s In
Directions: Construct a probability distribution table and determine the
probabilities given using the scores of 50 students in a test.
The scores of 50 students in a test is shown below:
SCORES NUMBER OF
STUDENTS
50 4
43 12
35 15
25 10
24 9
P(X) 4 12 15 10 9
50 50 50 50 50
X 50 43 35 25 24
Note: In this material, decimals are rounded off into two decimal places.
What is It
12 (DO_Q3_STAT&PROB_MODULE1_LESSON2)
Step 3. Sum up the product of X and P(X).
𝝁 = ∑ 𝑿 ∙ 𝑷(𝑿)
𝜇 = 0 + 0.5 + 0.5
𝝁 =1
Example 2. Calculate the mean and variance for a random variable X defined as
the sum of the scores on the two dice.
Illustration:
The range
space/possible values of
Y are 2, 3, 4, 5, 6, 7, 8,
9, 10, 11, 12.
1.19 1.12
= 0.06 + 0.18 + 0.32 + 0.55 + 0.84 + 1.19 + 1.12 + 0.99 + 0.8 + 0.66 + 0.36
𝝁 = 7.07
13 (DO_Q3_STAT&PROB_MODULE1_LESSON2)
Variance (𝝈𝟐 ) = ∑(𝑿 − 𝝁)2∙ 𝑷(𝑿)
= 0.77 + 0.99 +
0.75 + 0.47 + 0.16 + 0 + 0.12 +
0.41 + 0.69 + 0.93 + 0.73
𝝈𝟐 = 6.02
What’s More
A survey conducted by a researcher showed the number of students’ siblings
of Grade 11 – Albert Bandura (GAS) in Bignay National High School-SHS.
Number of siblings V 1 2 3 4
Probability P(V) 0.09 0.36 0.43 0.12
14 (DO_Q3_STAT&PROB_MODULE1_LESSON2)
2. How do we solve for the variance of a random
variable?______________________________________________________________________
_______________________________________________________________________________
________________________________________________________.
What I Can Do
The probability distribution below shows the number of hours spent by students
using mobile phones in a day. Compute the mean and variance.
Y 6 8 10
Assessment
A. The table below shows the probability distribution of a discrete random variable B.
B 0 1 2 3 4 5
7–8: 9 – 10 :
Y 0 1 2 Z 1 2 3
15 (DO_Q3_STAT&PROB_MODULE1_LESSON2)
A. The probabilities that the Elijah will buy 2, 3, 4, 5 or 6 items in ABC grocery
3 1 1 2 3
store are , , , and respectively. What is the average number of items
10 10 10 10 10
that Elijah will buy?
Additional Activities
A die and a coin are tossed together. Let variable C be the number that head
will appear. Compute for the mean and the variance.
16 (DO_Q3_STAT&PROB_MODULE1_LESSON2)
(DO_Q3_STAT&PROB_MODULE1_LESSON2) 17
What I have learned
1. To solve for the mean of a discrete random
variable, follow steps 1-3. First, construct
probability distribution table then, multiply X
and P(X) and lastly, sum up the product of X and
P(X)
2. To solve for the variance of random variable,
follow the steps 1-4. First, compute the mean
value of the random variable. Second, subtract
each value from the mean and square the
differences then, multiply the squared differences
by the corresponding probabilities. Lastly, add all
the products.
Assessment
A.
1. 0.17
2. 0.27
3. 0.88
4. 0.03
5. 0.7
6. 2.74
B.
7. 1.13
8. 0.35
9. 2.09
10. 0.54
C. 4.1
Additional Activities
Mean: 0.5
Variance: 1
Answer Key
SENIOR HIGH SCHOOL
STATISTICS &
(LEARNING AREA)
PROBABILITY
(QUARTER NUMBER)
(MODULE
QUARTER 3 –NUMBER)
MODULE 1
LESSON 3:
Solving Problems involving
the Mean and Variance of a
Discrete Probability
Distribution
18 (DO_Q3_STAT&PROB_MODULE1_LESSON2)
Targets:
1. interpret the mean and variance of a discrete random variable (M11/12SP-
IIIb-3), and
2. solve problems involving the mean and variance of probability distributions
(M11/12SP-IIIb-4).
Do this Pre-Test: Choose the letter of the best answer. Write your answer in
your notebook.
______1.) In a pizza parlor, the following probability distribution was obtained for the
number of toppings ordered on a large pizza. Find the mean and standard deviation
for the random variable.
A 0 1 2 3 4
P(A) 0.30 0.40 0.20 0.06 0.04
______4.) A’s Burger house has a probability distribution of sales as shown below:
P P(P)
25 0.20
40 0.20
50 0.60
Using this probability distribution, what is the expected value and variance?
A. Expected Value: 43 C. Expected Value: 43
Variance: 46 Variance: 96
B. Expected Value: 43 D. Expected value: 43
Variance: 69 Variance: 60
19 (DO_Q3_STAT&PROB_MODULE1_LESSON3)
_______5.) Three patients are treated with a specific medicine. The probabilities for
0, 1, 2, or 3 successes are 0.24, 0.12, 0.30, and 0.34 respectively. What is the
average number of successes in this treatment?
A. 1.71 B. 1.72 C. 1.73 D. 1.74
What’s In
The probabilities of a factory machine manufacturing 0, 1, 2 defective
parts in one day are 0.90, 0.07, and 0.03 respectively. Find the mean and
variance.
X 0 1 2
= 0 + 0.07 + 0.06
𝝁 = 0.13
𝝈𝟐 = 0.18
20 (DO_Q3_STAT&PROB_MODULE1_LESSON3)
What’s New
What is It
Solutions:
Mean (𝝁) = ∑ 𝑿 ∙ 𝑷(𝑿) = 3(0.3) +
X P(X) X∙ X- 𝝁 (X- (X- 𝝁)2∙ 𝑷(𝑿)
4(0.1) + 5(0.6) = 4.3
𝑷(𝑿) 𝝁)𝟐
Variance (𝝈𝟐 )
3 3 0.9 -1.3 1.69 0.51
= 0.3
10 = [(3)² (0.3) + (4) ² (0.1) + (5) ²
(0.6)] – (4.3) ²
4 1
= 0.1 0.4 -0.3 0.09 0.01
10
= 19.3 – 18.49
5 6
= 0.6 3 0.7 0.49 0.29
10 𝝈𝟐 = 0.81
21 (DO_Q3_STAT&PROB_MODULE1_LESSON3)
Interpretation: So, the average number of numbered balls that will be picked is
4.3. Although the bag will never show a ball with number 4.3, this implies that
picking a ball many times, the theoretical mean would be 4.3.
What’s More
The owner of Malinamnam Restaurant asks the staff to record the number
of people who order their specialty in weekdays. The table below shows the
probability distribution. Solve for the: a) mean, b) variance, and c) standard
deviation.
W 1 2 3 4 5
= [(12 ∙ 0.13) + (22 ∙ 0.15) + (32 ∙ 0.25) + (42 ∙ 0.19) + (52 ∙ 0.28)] (3.34)2
= (0.13 + 0.6 + 2.25 + 3.04 + 7) – 11.16
= 13.02 - 11.16
𝝈𝟐 = 1.86
22 (DO_Q3_STAT&PROB_MODULE1_LESSON3)
What I Can Do
The probability distribution of Q is shown below. Find the mean, variance and
standard deviation.
Mean (𝝁) = (1∙ 0.3) + (2 ∙ 0.1) + (3 ∙ 0.2) + (4 ∙ 0.4) = 2.7
Variance (𝝈𝟐 ) = [(12∙ 0.3) + (22∙ 0.1) + (32∙ 0.2) + (42∙ 0.4)] − (2.7)2 = 1.61
Assessment
A. Directions: Given the table below; calculate the mean, variance and standard deviation.
D 1 2 3
P(D) 0.41 ? 0.41
23 (DO_Q3_STAT&PROB_MODULE1_LESSON3)
Additional Activities
Suppose three coins are flipped. Let X be the random variable representing
the number of tails that occur.
Find:
a) Construct a probability distribution table of the random variable X.
b) Solve for the mean.
c) Compute the variance
d) Give the value of standard deviation
24 (DO_Q3_STAT&PROB_MODULE1_LESSON3)
(DO_Q3_STAT&PROB_MODULE1_LESSON3) 25
What I know
1. A
2. C
3. A
4. C
5. D
What I have learned
Expected value, probability Distribution, variance, standard deviation, variance
Assessment
A.
1. 0.18
2. 0.82
3. 2
4. 0.91
B. 1. 0.8, 0.89
2. 1.44, 1.2
3. 0.57, 0.75
Answer Key
SENIOR HIGH SCHOOL
STATISTICS &
PROBABILITY
QUARTER 3 – MODULE 1
LESSON 4:
Normal Random Variable
26 (DO_Q3_STAT&PROB_MODULE1_LESSON3)
Targets:
Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
_______1. The total area under the normal curve is______?
a. 2 b. 1 c. 4 d. 3
_______2. It is a random variable where the data can take infinitely many values.
a. discrete random
b. continuous random variable
c. both discrete and continuous random variable
d. none of these
_______3. Let 𝑥 = 8 be a Normal random variable with parameters 𝜇 = 1 𝑎𝑛𝑑 𝜎 = 3.
Convert it to Z-score.
a. 1.325 b. 5.322 c. 0 d. 2.333
_______4. What proportion of cases in a normally distributed population will have z-
scores greater than 1.0 or less than -1.0?
a. 0.68 b. 0.50 c. 0.32 d. 0.10
_______5. In normal distribution, define the relationship of the mean, median and
the mode.
a. mean>median>mode b. mean<median<mode
c. mean=median=mode d. no relation found
27 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
Lesson
In this lesson you will learn to identify regions under the normal curve corresponding
to different standard normal values, converts a normal random variable to a standard
normal variable and vice versa, computes probabilities and percentiles using the
standard normal table.M11/12SP-IIIc-3-4, M11/12SP-IIIc-d-1
What’s In
Warm Up Activity
Analyze this activity. Take note of your analyzation on your lecture notebook.
Each of these students below made a mistake when calculating the mean. Describe each
mistake.
Siblings Frequency
0+1+2+3+4+5
0 1 Student 1: 6
= 2.5
1 4 1+4+6+4+2+1
Student 2: 18
=1
2 6
1+4+6+4+2+1
Student 3: =3
3 4 6
0+1+2+3+4+5
4 2 Student 4: = 0.83
18
5 1
Total 18
This activity will make you realized what is the common mistake of the students like you on
finding the mean from the frequency distribution table which is very important in the
discussion of the normal distribution which includes mean and standard deviation. You may
recall the concepts from your basic statistic lessons in finding the right way on computing for
the mean from the frequency distribution table.
28 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
Continuous Random Variable can assume infinitely many values
corresponding to points on a line interval.
29 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
Normal Distribution Standard Normal Distribution
30 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
Empirical Rule
What can we say about the distribution of values around the mean? There are
some general rules:
But to find the probability beyond the empirical rule is possible through the
use of the standardized normal table (See appendices).
The following is the general procedure for finding probabilities under the
normal curve as to find P(a < X < b) when X is distributed normally:
1. Draw the normal curve for the problem in terms of X
2. Translate x-values to z-values
3. Use the standard normal table
31 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
Note that the distribution is the
same, only the scale has changed.
We can express the problem in
original units (X) or in standardized
units (Z)
Example 2: Suppose X is normal with mean 8.0 and standard deviation 5.0.
a. Find 𝑃(𝑥 < 8.6) b. Find 𝑃(𝑥 > 8.6) c. Find 𝑃(80 < 𝑥 < 8.6)
Solution (a): 𝑃(𝑥 < 8.6)
𝑥−𝜇 8.6−8.0
Step1 Step 2. 𝑧 = 𝜎
= 5.0
= 0.12
Step 3:
32 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
Solution c. 𝑃(80 < 𝑥 < 8.6)
Step1. Step 3.
2. Describe the relationship between mean, median and mode when you say that
normal distribution is symmetric at the mean?
3. What is the area under the whole normal curve?
4. How many percent of observations in the population lie within 1, 2, and 3
standard deviation of the mean, respectively and as empirical rule?
5. What is the z-score formula?
2. 95% of the students at school are between 1.1m and 1.7m tall. Assuming that the
data is normally distributed, calculate the mean and standard deviation? Construct the
normal curve corresponding to the data.
Solution: The mean is halfway
between 1.1m and 1.7m:
1.1𝑚 + 1.7𝑚
𝜇= = 1.4𝑚
2
33 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
95% is 2 standard deviations either side of the mean (a total of 4
standard deviations). So:
1.7𝑚 − 1.1𝑚
𝜎= = 0.15𝑚
4
Additional Activities
1. The distribution of the heights of adult Filipino men is approximately Normal with
a mean of 69 inches and a standard deviation of 2.5 inches. Between what heights
do the 95% of the middle men fall?
3. Students can pass a test if they obtain a score of 50% or more. The marks of a
large number of students were sampled and the mean and standard deviation
were calculated as 42% and 8% respectively. Assuming that this data is normally
distributed, what percentage of students pass the test?
34 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
(DO_Q3_STAT&PROB_MODULE1_LESSON4) 35
Short Answer
Additional Activities 1. b
1. 64 inches to 74 inches 2. b
3. d
2. The maximum and minimum
4. a
time to load the truck
5. c
3. 16% Short Answer
Problem Solving 1. (continuous random variable)
1. 𝜇 = 76𝑘𝑔 , 𝜎 = 7𝑘𝑔 2. (false)
2. 𝜇 = 1200𝑐𝑚 , 𝜎 = 8 𝑐𝑚 3. bell curve)
3. 𝜎 = 6.5 4. (True)
4. a= 90.49% b= 9.34% c= 5. (False)
80.81%
Answer Key
SENIOR HIGH SCHOOL
STATISTICS &
PROBABILITY
QUARTER 3 – MODULE 1
LESSON 5:
Random Sampling and
Sampling Distributions of
Statistics
36 (DO_Q3_STAT&PROB_MODULE1_LESSON4)
Targets:
Do this Pre-Test: Answer the following pre-test question to measure what you
already know about the foregoing lessons. Write your answer on
your lecture notebook.
5. x̄, 𝑠 2 , and s are commonly used symbols to represent the mean, variance and
standard deviation of the ___________.
a. population b. sample d. statistics
Lesson
Random Sampling and Sampling
5 Distributions of Statistics
In this lesson, the learners will understand the concept of random sampling and
sampling distributions of statistics. Parameter and sample will also be defined as
an extension of the lesson M11/12SP-IIId-2 M11/12SP-IIId-3.
37 (DO_Q3_STAT&PROB_MODULE1_LESSON5)
What’s In
A random sample is a sample that is chosen randomly. It could be more accurately called a
randomly chosen sample, as you have learned from your previous disciplines.
Random samples are used to avoid bias and other unwanted effects. Of course, it isn’t quite
as simple as it seems: choosing a random sample isn’t as simple as just picking 100 people
from 10, 000 people. But what is the process of choosing a sample? How can you determine
if you are on the right track in selecting your subjects for study? Do you know something
about this concept?
38 (DO_Q3_STAT&PROB_MODULE1_LESSON5)
What is parameter?
Parameters are numbers that summarizes data for an entire population. It is any
numerical quantity that characterizes a given population or aspect of it. It tells
something about the whole population.
What is statistics?
Statistics are numbers that summarize data from a sample. It is a single measure
of some attribute of a sample. It is any function (attribute) of a sample.
Example:
39 (DO_Q3_STAT&PROB_MODULE1_LESSON5)
The button might say “RANDOM” (SHARP). Other makes may have a button
“Ran” or “Ran#” or “RanInt”. Whichever you have, selecting and pressing
ENTER repeatedly gives random numbers.
Generate a Random Number between 0 and 99.
40 (DO_Q3_STAT&PROB_MODULE1_LESSON5)
Exercise what you have learned by answering the questions below: write
your answer on your activity notebook.
1. How many of each of the 3 types of computer component should be taken
in a sample of 100 categorized by type of component?
Type A B C D
Number 300 260 40 600
2. A manager samples the receipts of every fifth person who goes through the
line. Out of 50 people, 4 had a misprices item. If 600 people go to this store
each day, how many people would you expect to have a mispriced item?
Additional Activities
Choose the best answer. Give an explanation to your choice.
1. MJ wants to find out where the greatest number of people buy fast food for dinner.
She surveys every fift person on a random street and asks them where they get food
for dinner regularly? What would have been an improvement in MJ’s experiment?
41 (DO_Q3_STAT&PROB_MODULE1_LESSON5)
(DO_Q3_STAT&PROB_MODULE1_LESSON5) 42
PreTest
Application 1. b
3. yes 2. c
4. parameter 3. b
5. statistic 4. a
Short Answer 5. b
1. statistics
2. parameter
3. random sampling
4. True
5. True
Answer Key
SENIOR HIGH SCHOOL
STATISTICS &
PROBABILITY
QUARTER 3 – MODULE 1
LESSON 6-7:
Sampling Distribution of the
MEAN
43 (DO_Q3_STAT&PROB_MODULE1_LESSON5)
TARGETS:
1. Construct sampling distribution of the mean
2. Compute the mean and variance of the sampling distribution of the mean
Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
1. A random variable is a variable usually X that has a __________ numerical value
determined by chance for each outcome of a procedure or experiment
a. small b. large c. single d. double
2. A probability distribution is a graph, table, or function that gives the _______ for
each value of the random variable.
a. input c. outcome
b. output d. probability
3. The mean of a probability distribution is also called __________.
a. extreme value c. extraneous value
b. expected value d. erroneous value
4. In a probability distribution the probability of each random variable must be
between __________.
a. 0 and 1 b. 0 and 10 c. -1 and 0 d. 1 and 2
5. In a probability distribution the sum of all the probabilities must be equal to
__________.
a. 100 b. 10 c. 1 d. 0.99
44 (DO_Q3_STAT&PROB_MODULE1_LESSON6-7)
What’s In
Let’s say we have a finite population of size N=5 that consists of the numbers
1, 2, 3, 4, and 5. (We know that for this population, the mean, µ = 3 and the variance, σ 2 =
2). Now let’s take a random sample of size n=3, say 1, 4, and 2, and compute the mean.
1+4+2
Sample: 1, 4, 2 𝑋̅ = = 2.3
3
What can you say about the sample mean, 𝑋̅ and the population mean, µ?
How many different samples of the same size, n=3, do you think we can get from a
population of N=5? (Clue: nCr → 5C3)
In this module we will try to list all the possible samples that we can get from a given finite
population, and present them in a tabular form.
Let’s say we have a finite population of size N=5 that consists of the numbers
1, 2, 3, 4, and 5. (We know that for this population, the mean, µ = 3 and the variance,
σ2 = 2). Now let’s take a random sample of size n=3, say 1, 4, and 2, and compute
the mean.
1+4+2
Sample: 1, 4, 2 𝑋̅ = 3
= 2.3
What can you say about the sample mean, 𝑋̅ and the population mean, µ?
How many different samples of the same size, n=3, do you think we can get from
a population of N=5? (Clue: nCr → 5C3)
In this lesson we will try to list all the possible samples that we can get from a given
finite population, and present them in a tabular form.
We need to learn about sampling distribution because it is an essential element
in Statistical Inference. We remind you that Statistical Inference is the process of
using sample results to draw conclusions about the characteristics of a population.
45 (DO_Q3_STAT&PROB_MODULE1_LESSON6-7)
1, 3, 4 2.7
1, 3, 5 3
1, 4, 5 3.3
2, 3, 4 3
2, 3, 5 3.3
2, 4, 5 3.7
3, 4, 5 4
There are 10 possible samples of size 3 that can be drawn from our given population.
Remember: The number of samples of size n that can be drawn from a population of
size N is given by NCn. ( Look for the function nCr in your scientific calculator.)
From the above hanging questions, let’s compute for the mean and variance.
46 (DO_Q3_STAT&PROB_MODULE1_LESSON6-7)
̅:
Mean of the Sampling Distribution of 𝑿
̅ ̅
µ𝑥̅ = ∑ 𝑋 • P(𝑋)
= 2(0.1) + 2.3(0.1) + 2.7(0.2) + 3(0.2) + 3.3(0.2) + 3.7(0.1) + 4(0.1)
=3
̅:
Variance of the Sampling Distribution of 𝑿
𝜎2 𝑁−𝑛
σ2𝑥̅ = ( )
𝑛 𝑁−1
Consider a population of 4 typists from a certain company who were asked to type the same
page of a manuscript. The number of errors made by each typist is presented below.
47 (DO_Q3_STAT&PROB_MODULE1_LESSON6-7)
3. The sampling distribution of the mean is the probability distribution of
the __________.
4. __________is the process of using sample results to draw conclusions
about the characteristics of a population.
5. If 𝑋̅ represents the mean of the toss of two fair dice, the probability of
getting a mean equal to 6 is __________.
Problem Solving.
There are 10 students enrolled in a preschool. Their ages are
Student Age Student Age
A 4 F 5
B 5 G 4
C 4 H 4
D 3 I 3
E 5 J 6
a) Compute for the mean, µ and variance, σ2 of the ages of the students in
this preschool.
Additional Activities
1. Let X represent the result of the toss of a fair die. Find the following
probabilities. (Keller/Warrack, 2003, p. 281)
a. P(X=2) b. P(X=5) c. P(X=1) d. P(X=4)
2. Let 𝑋̅ represent the mean of the toss of two fair dice. Find the following
probabilities. (Keller/Warrack, 2003, p. 281)
a. P(𝑋̅=3) b. P(𝑋̅=4) c. P(𝑋̅=1) d. P(𝑋̅=6)
3. An experiment consists of tossing five balanced dice. Find the following
probabilities. (Keller/Warrack, 2003, p. 281)
a. P(𝑋̅=1) b. P(𝑋̅=6)
48 (DO_Q3_STAT&PROB_MODULE1_LESSON6-7)
(DO_Q3_STAT&PROB_MODULE1_LESSON6-7) 49
Assessment: 1. a) 1/6 b) 1/6 c) 1/6 d) 1/6
2. a) 5/36 b) 5/36 c) 1/36 d) 1/36
3. a) 1/7776 = .0001286
b) 1/7776 = .0001286
Enrichment: a) µ = 4.3 σ2 = 0.81
b)
𝑋 P(𝑋)
3 1/45
3.5 8/45
4 12/45
4.5 14/45
5 7/45
5.5 3/45
Answer Key
SENIOR HIGH SCHOOL
STATISTICS &
PROBABILITY
QUARTER 3 – MODULE 1
LESSON 8:
The Central Limit Theorem
50 (DO_Q3_STAT&PROB_MODULE1_LESSON6-7)
TARGETS:
1. Illustrate the Central Limit Theorem (CLT)
2. Define sampling distribution of the mean using CLT
3. Solve problems involving sampling distribution of the mean
Do this Pre-Test: Choose the letter of the best answer. Write your answers on your
notebook.
1. From a finite population of N=7, the number of samples of size n=3 that can
be drawn is __________.
a. 25 b. 30 c. 35 d. 40
2. From a finite population of N=10, the number of samples of size n=5 that
can be drawn is __________.
a. 252 b. 254 c. 256 d. 258
3. In a sampling distribution of the mean, as the sample size increases the
variance __________.
a. decreases b. increases c. remains unchanged
4. The standard deviation of the sampling distribution of the sample means is
known as ___________.
a. Standard estimate of the mean b. standard error of the mean
5. A __________ is one that consists of a finite or fixed number of elements,
measurements, or observations.
a. Finite Population b. Frequent Population
Lesson
51 (DO_Q3_STAT&PROB_MODULE1_LESSON8)
The table below shows the results that we obtained from the last lesson when we
studied the sampling distribution of the mean. From a finite population of N=5, we
drew samples of different sizes and computed the Mean and Variance of each
sampling distribution of the mean.
POPULATION SAMPLING DISTRIBUTION OF THE MEAN
N=5 n=2 n=3 n=4
MEAN µ=3 µ𝑥̅ = 3 µ𝑥̅ = 3 µ𝑥̅ = 3
VARIANCE σ2 = 2 σ2𝑥̅ = 0.75 σ2𝑥̅ = 0.33 σ2𝑥̅ = 0.12
What do you notice about the values of the variance of the sampling distribution as
the sample size increases?
Do you think the same thing happens to the standard deviation?
Now consider the experiment of throwing a fair die so many times until we get
a very large population and our muscles ache because of continuous throwing.
In this case we have an infinite population. (Keller/Warrack, p.270)
Let X be the random variable that gives the number of dots showing after each
throw, the probability distribution for the random variable is given below.
X 1 2 3 4 5 6
P(X) 1/6 1/6 1/6 1/6 1/6 1/6
Population Mean, µ = ∑ X • P(X)
1 1 1 1 1 1
= 1( ) + 2( ) + 3( ) + 4( ) + 5( ) + 6( )
6 6 6 6 6 6
= 3.5
1+6+4+5+3+5+5+5+1+2
10 Throws, n = 10 → 𝑋̅ = = 3.7
10
5+3+1+4+6+1+3+3+4+3+5+2+5+5+6+4+2+2+4+6+6+4+2+1+1
25 Throws, n = 25 → 𝑋̅ =
25
= 3.5
That’s right! As the number of throws of the die increases, the mean becomes
closer and closer to 3.5, the population mean. In other words, as the sample size
increases, the probability that the sample mean will be close to 3.5 also increases.
52 (DO_Q3_STAT&PROB_MODULE1_LESSON8)
In the last lesson, we observed in the sampling distribution of the mean that as the
sample size increases, the variance decreases.
We expect the same thing in our experiment today. Notice that as the sample size
increases, 𝑋̅ becomes closer to µ=3.5. As the sample size increases, we are likely to obtain
𝑋̅’s from our samples that crowd around the µ of the population which is 3.5. When this
happens, we know that the variance will decrease.
This does not happen by coincidence. This can be explained by the formula
𝜎2
σ2𝑥̅ = where σ2𝑥̅ = variance of the sampling distribution
𝑛
σ2 = variance of the population
n = sample size
Another thing that happens as n gets larger is that the sampling distribution of 𝑋̅
becomes increasingly bell shaped. (Keller/Warrack, p.274)
All these things that we learned today can be summarized by the following theorem.
If random samples of size n are drawn from a population, then as n becomes larger, the
sampling distribution of the mean approaches the normal distribution, regardless of the
shape of the parent population. (Belecina, et. al., p.119)
Because of this phenomenon, we can use the Central Limit Theorem to solve
a lot of real-life problems. This amazing theorem justifies the use of the Normal
Curve and the Z-table.
𝑋−µ
From our original formula of Z= 𝜎 , we now have this new formula if
we want to compute for the probability that 𝑋̅ will take some values in the
sampling distribution of 𝑋̅.
FORMULA
𝑋̅−µ
Z= where 𝑋̅ = mean
𝜎/√𝑛
µ = population mean
σ = population standard deviation
n = sample size
Example
The scores on a Statistics midterm exam are normally distributed with a mean
of 78 and a standard deviation of 6. If a student is selected at random
(a) what is the probability that his/her score is higher than 90?
(b) what is the probability that his/her score is less than 75?
(c) what is the probability that a class of 30 students will have an average
score of less than 75?
SOLUTION:
Given: µ = 78 σ=6
The population is normally distributed.
(a) P(X>90) = P(Z> ?) = ?
𝑋−µ 90−78 12
Let’s solve for Z: Z= 𝜎 = 6 = 6 =2
Use the Z-table to determine the required area,
53 (DO_Q3_STAT&PROB_MODULE1_LESSON8)
P(X>90) = P(Z>2) = 0.4772
Therefore, the probability that his/her score is higher than 90 is 0.0228 or
2.28%.
(c) In this question we are given a sample of size 30, and we are asked to
determine the probability that the mean 𝑋̅ of this sample will be less than 75.
As explained above, the Central Limit Theorem allows us to use the Z-table
in this type of problem.
P(𝑋̅<75) = P(Z<?) = ?
𝑥̅ −µ 75−78
Let’s convert our 𝑋̅ to Z: Z = 𝜎/√𝑛 = 6/√30 = -2.74
Use the Z-table to determine the required area.
P(𝑋̅<75) = P(Z<-2.74) = 0.0031
Therefore, the probability that the sample will have a mean of less than 75 is
0.0031 or 0.31%.
The supervisor of a bottling company has observed that the amount of softdrink in
each Litro bottle is actually a normally distributed random variable , with a mean
of 1006ml and a standard deviation of 10ml. (Keller/Warrack, p.277)
(a) If a customer buys one bottle , what is the probability that the bottle will
contain more than 1000ml?
(b) If a customer buys 4 bottles, what is the probability that the mean amount
of the 4 bottles will be greater than 1000ml?
SOLUTION:
Given: µ = 1006 σ = 10 n=4
The population is normally distributed.
(a) P(X>1000) = P(Z>?) = ?
𝑥−µ 1000−1006
Let’s convert X to Z: Z= 𝜎 = 10
= -6/10 = -3/5 = -0.6
Use the ZX-table to determine the required area.
P(X>1000) = P(Z>-0.6) = 0.7257
Therefore, the probability that a bottle will contain more than 1000ml is
0.7257 or 72.57%.
(b) P(𝑋̅>1000) = P(Z>?) = ?
𝑥̅ −µ 1000−1006
Let’s convert 𝑋̅ to Z: Z = 𝜎/√𝑛 = 10/√4 = -1.2
Use the Z-table to determine the required area.
P(𝑋̅>1000) = P(Z>-1.2) = 0.8849
Therefore, the probability that the mean of the 4 bottles is 0.8849 or 88.49%.
54 (DO_Q3_STAT&PROB_MODULE1_LESSON8)
Short Answer. Supply the correct answer.
__________1. In a sampling distribution, as the sample size increases the sampling
distribution of the mean approaches the __________ distribution.
__________2. In a sampling distribution, if the parent population is normal, then the
_________ is normally distributed for all sample sizes.
__________3. If the parent population is nonnormal, then the Mean is approximately
normal only for __________ values of n (sample size).
__________4. If the population is extremely nonnormal, the sampling distribution will
also be __________ even for moderately large values of n.
__________5. In many practical situations, a sample size of _________may be
sufficiently large to allow us to use the normal distribution as an
approximation for the sampling distribution of the Mean.
Application
The time it takes a group of Grade 11 students to complete the Final Exam in
Statistics and Probability is known to be normally distributed. The mean is 58
minutes and the standard deviation is 4 minutes.
(a) What is the probability that a randomly selected student will complete
the examination in less than 54 minutes?
(b) If 5 randomly selected students take the exam, what is the probability
that the mean time it takes the group to finish will be less than 54?
Additional Activities
55 (DO_Q3_STAT&PROB_MODULE1_LESSON8)
(DO_Q3_STAT&PROB_MODULE1_LESSON8) 56
ASSESSMENT
1. (a) P(X<54) = P(Z<?) = ?
𝑥−µ 54−58
Let’s convert X to Z: Z= = =-1
𝜎 4
Use the Z-table to determine the required area.
P(X<54) = P(Z< -1) = 0.1587
Therefore, the probability that he/she will finish the exam in less than 54 minutes is 0.1587
or 15.87%.
(b) P(𝑋̅<54) = P(Z<?) = ?
𝑥̅ −µ 54−58
Let’s convert 𝑋̅ to Z: Z= = = - 2.24
𝜎/√𝑛 4/√5
Use the Z-table to determine the required area.
P(𝑋̅<54) = P(Z<- 2.24) = 0.0125
Therefore, the probability that the 5 students will have a mean time of less than 54
minutes is 0.0125 or 1.25%.
2. (a) P(X>475) = P(Z>?) = ?
𝑥−µ 475−450 25 5
Let’s convert X to Z: Z= = = or = 0.83
𝜎 30 30 6
Use the Z-table to determine the required area.
P(X>475) = P(Z> 0.83) = 0.2033
Therefore, the probability that a pack of this instant noodles will have a
cholesterol content of more than 475mg is 0.2033 or 20.33%.
(b) P(𝑋̅>475) = P(Z>?) = ?
𝑥̅ −µ 475−450
Let’s convert 𝑋̅ to Z: Z= = = 2.64
𝜎/√𝑛 30/√10
Use the Z-table to determine the required area.
P(𝑋̅>475) = P(Z>2.64) = 0.0041
Therefore, the probability that the 10 packs will have a mean cholesterol content of
more than 475mg is 0.0041 or 0.41%.
ENRICHMENT
a) 6.125
b) 0.4219
c)0.75
Answer Key
SENIOR HIGH SCHOOL
STATISTICS &
PROBABILITY
QUARTER 3 – MODULE 1
LESSON 9:
The t-distribution
57 (DO_Q3_STAT&PROB_MODULE1_LESSON8)
What I Need to Know
Targets:
1. Illustrate the t-distribution (M11/12SP-IIIg-2); and
2. Identify critical values (percentiles) using the t-table (M11/12SP-IIIg-5).
What I Know
Read and analyze each item carefully. Choose the letter that correspond to your
answer and write it in your notebook.
For items 1-2, refer to the statements below:
i. The t-distribution is a family of curves.
ii. As the sample size decreases, the t-distribution approaches normal distribution.
iii. The t-curve is asymmetrical about the mean.
iv. The measures of central tendency in a t-distribution are all equal and at the
center.
1. Which of the above statements is/are TRUE?
A. i only B. i and iv C. iii only D. ii and iii
2. Which of the above statements is/are FALSE?
A. i only B. i and iv C. ii only D. ii and iii
3. Which of the following refers to the number of values that can vary in a certain
analysis or computation?
A. sample size C. degrees of freedom
B. confidence level D. t-value
4. How do we compute for the degrees of freedom?
A. Add one to the sample size.
B. Subtract one from the sample size.
C. Divide the sample standard deviation by the sample size.
D. Divide the sample size by the sample standard deviation.
5. Which of the following gives the upper confidence limit of an interval estimate of
the population mean when 𝜎 is unknown?
𝑠 𝑠 𝑠 𝑠
A. 𝑋 − 𝑡 ( 𝑛) B. 𝑋 + 𝑡 ( 𝑛) C. 𝑋 ± 𝑡 ( 𝑛) D. 𝑡 ( 𝑛)
√ √ √ √
6. The size of a sample is 19. What is the degrees of freedom?
A. 18 B. 20 C. 34 D. 100
7. If the degrees of freedom is 26, what must be the sample size?
A. 25 B. 26 C. 27 D. 28
Consider the situation below to answer item numbers 8 to 10.
A certain sample whose size is 10 is known to have a mean of 73.25 with a standard
deviation of 2.5.
8. What is the standard deviation?
A. 2.5 B. 10 C. 99% D. 73.25
9. Find 𝑡𝛼/2 for a 99% confidence interval.
A. 1.833 B. 2.228 C. 2.262 D. 3.250
B.
10. Which of the following gives the interval estimate for the population mean with
99% confidence?
A. 71.801 < 𝜇 < 74.699 C. 70.681 < 𝜇 < 75.819
B. 71.489 < 𝜇 < 75.011 D. 71.462 < 𝜇 < 75.038
58 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
Lesson
The t-distribution
9
In this lesson, the learners will understand the concept of t-distribution. The process
of identifying critical values or percentiles to be used in constructing confidence
intervals using the t-table will also be presented.
What’s In
Previously, when 𝜎 is known and the sample size is at least 30 or the sample size is
less than 30 but is taken from a population that is approximately normally
distributed, the confidence interval can be easily determined with the help of the z-
distribution.
Unfortunately, in most cases, we seldom know the population standard deviation 𝜎
and what we have at hand is just the sample standard deviation 𝑠. Finding a
confidence interval for the population mean 𝜇 using only 𝑠 becomes a different task
to do.
Thus, this situation needs the help of a more suited distribution which is the t-
distribution.
What’s New
The general expression for estimating the population mean using 𝑠 is given by
𝑠
𝑋 ± 𝑡 ( ).
√𝑛
59 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
after a sample statistic has been computed. It also identifies the curve to be used as
the t-distribution. This concept is seen on the figure below.
To decide which t-distribution curve to follow, we must compute for the degrees of
freedom. The degrees of freedom is computed by 𝑑𝑓 = 𝑛 − 1, where 𝑛 is the sample
size.
Note that when 𝜎 is unknown, we can only use 𝑠 to find a confidence interval for 𝜇 if
the following conditions are satisfied:
i. the sample is randomly selected; and
ii. either 𝑛 ≥ 30 or the population from which the sample is taken is
approximately normal if 𝑛 < 30.
The confidence interval limits are obtained using the general expression presented
above.
What is It
Sample Problems:
1. Find 𝑡𝛼/2 for a 95% confidence interval of a sample with size 16.
60 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
Solution:
In a t-distribution table, the values represent the proportions of areas in the tails
of the t-curve. We note that 𝑛 = 16 and that the confidence interval is 95%, so 𝛼 =
0.05. Thus, 𝛼/2 = 0.025, and we wish to find 𝑡0.025 .
Thus, 𝑡𝛼/2 for a 95% confidence interval of a sample with size 16 is 2.131.
2. Find the 99% confidence interval for a sample with 𝑛 = 26, 𝑋 = 2, and 𝑠 = 3.
Solution:
First, we determine 𝑡𝛼/2 as illustrated above. Clearly, when 𝑛 = 26, our 𝑑𝑓 = 25.
Also, given that the confidence interval is 99%, then 𝛼 = 0.01, so 𝛼/2 = 0.005 .
Referring to the t-table below, we have 𝑡𝛼/2 = 2.787.
61 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
𝑠
Now, we compute for the confidence interval limits using 𝑋 ± 𝑡 ( 𝑛).
√
Substituting the given values, we have as follows:
𝑠 3
𝑋 ± 𝑡 ( 𝑛) = 2 ± (2.787) ( ) = 2 ± 1.640 .
√ √26
The upper confidence interval limit is 2 + 1.640 = 3.640 while the lower
confidence interval limit is 2 – 1.640 = 0.360. Thus, the required interval
estimate for the population mean is given by 0.360 < 𝜇 < 3.640 at 99%
confidence interval.
Solution:
From the given, the sample was taken randomly and that the
population from which it was taken is normally distributed. Thus, we can
use 𝑠 and the t-distribution to find an interval estimate for 𝜇.
62 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
It follows that the confidence interval limits are:
Upper: 65 + 1.006 = 66.006 and Lower: 65 – 1.006 = 63.094.
Therefore, with 95% confidence, the mean height of the population lies
in the interval 63.094 < 𝜇 < 66.006.
What’s More
1. Find 𝑡𝛼/2 for a 95% confidence interval of a sample with size 20.
2. Find the 99% confidence interval for a sample with 𝑛 = 17, 𝑋 = 5.6, and
𝑠 = 2.3.
What I Can Do
63 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
Assessment
Read and analyze each item carefully. Choose the letter that correspond to your
answer and write it in your notebook.
For items 1-2, refer to the statements below:
j. The t-distribution is a family of curves.
ii. As the sample size decreases, the t-distribution approaches normal distribution.
iii. The t-curve is asymmetrical about the mean.
iv. The measures of central tendency in a t-distribution are all equal and at the
center.
1. Which of the above statements is/are TRUE?
A. i only B. i and iv C. iii only D. ii and iii
64 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
Additional Activities
The final grade in Mathematics 10 of 25 male and 25 female students, who were
randomly selected, were recorded. The corresponding means and standard
deviations for these group samples are shown in the table below.
Find an interval estimate of the population mean for each group using: (a) 95%
confidence interval; (b) 99% confidence interval.
65 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
(DO_Q3_STAT&PROB_MODULE1_LESSON9) 66
What I Know What’s More
1. B 6. A 1. 2.093
2. D 7. C 2. 3.971 < µ < 7.229
3. C 8. A
What I can Do
4. B 9. D
63.625 < µ < 66.375
5. B 10. C
Assessment Additional Activities
1. B 6. A a. 83.840 < µ < 85.160
2. D 7. C b. 81.202 < µ < 83.999
3. C 8. A
4. B 9. D
5. A 10. D
Answer Key
Almeda,Capistrano, Ferry Sarte. (2010). Elementary Statistics. Quezon City: University of the
Philippines Press.
Belecina, R., Baccay, E., & Mateo, E. (2016). Statistics and Probability. Manila,Philippines:
REX Book Store Inc.
Bluman, A. (2018). Elementary Statistics: A Step by Step Approach 10th edition. McGraw
Hill. New York, USA.
Canlapan, R. (2016). Statistics and Probability. Makati, Philippines: Diwa Learning System
Inc.
Keller, Warrack. (2003). Statistics For Management and Economics. California USA:
Thomson Learning, Inc.
Levine, et. al. (2005). Statistics: A Handbook for Managers. New Jersey: Prentice Hall.
PERCDC Learnhub
Walpole, R., Myers, R., Myers, S., and Ye, K., (2012). Probability and Statistics for Engineers
and Scientists 9th edition. Pearson Education Inc. Massachusetts, USA.
67 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
SENIOR HIGH SCHOOL
STATISTICS &
PROBABILITY
QUARTER 3 – MODULE 1
LESSON 10:
Confidence Level and
Sample Size
68 (DO_Q3_STAT&PROB_MODULE1_LESSON9)
What I Need to Know
Targets:
5. Identify the length of a confidence interval (M11/12SP-IIIj-1);
6. Compute for the length of a confidence interval (M11/12SP-IIIj-2);
7. Compute for an appropriate sample size using the length of the interval
(M11/12SP-IIIj-3); and
8. Solve problems involving size determination (M11/12SP-IIIj-4).
What I Know
Read and analyze each item carefully. Choose the letter that corresponds to your
answer and write it in your notebook.
1. How is the expected length of a confidence interval related to the margin of error?
A. The expected length is half the expected length.
B. The expected length is twice the margin of error.
C. The expected length is equal to the margin of error.
D. The expected length is two more than the margin of error.
2. What is the expected length of the confidence interval if the margin of error
was found to be 0.181?
A. 0.181 B. 0.362 C. 0.724 D. 2.362
3. Which of the following is the correct formula for determining the margin
of error?
𝑧𝛼/2 𝑧𝛼/2 𝑧𝛼/2 𝑧𝛼/2
A. 𝐸 = ⋅𝜎 B. 𝐸 = C. 𝐸 = D. 𝐸 =
√𝑛 √𝑛 ⋅𝜎 √𝑛⋅𝜎 √𝑛
4. A certain sample has a size of 200 and a standard deviation of 1.4. If a 99%
confidence interval is to be made, what is the value of the margin of error?
A. 0.193 B. 0.098 C. 0.117 D. 0.255
5. How long is the confidence interval that may be constructed in item number 4?
A. 0.196 B. 0.234 C. 0.386 D. 0.510
6. Which of the following is the correct formula for determining the appropriate
sample size?
𝑧 ⋅𝜎 𝑧 2 𝑧 2 𝑧 ⋅𝜎 2
A. 𝑛 = 𝛼/2𝐸
B. 𝑛 = ( 𝛼/2
𝐸
) ⋅𝜎 𝛼/2
C. 𝑛 = ( 𝐸⋅𝜎 ) D. 𝑛 = ( 𝛼/2
𝐸
)
Refer to the problem below to answer item numbers 7 to 10.
The standard deviation of the temperature of all the mall goers from a certain city on
a Tuesday morning is 0.16°C. How many of them must be chosen to get an accurate
estimate of the mean temperature within 0.015 at 95% confidence level?
7. What does 0.16 refer to?
A. population mean C. sample mean
B. population standard deviation D. sample standard deviation
8. To which quantity does 0.015 refer?
A. length of the interval C. mean
B. margin of error D. standard deviation
9. What needs to be determined in the above problem?
A. length of interval C. population mean
B. margin of error D. sample size
10. Which of the following is the correct answer to the problem?
A. 437 B. 438 C. 757 D. 758
69 (DO_Q3_STAT&PROB_MODULE1_LESSON10)
Lesson
Confidence Level and
10 Sample Size
In this lesson, the learners will understand the relationship between confidence level
and sample size. The process of determining an appropriate sample size will also be
presented.
What’s In
You may recall that a confidence interval is an interval estimate of the true
population mean composed of a range of values that possibly contains it. In the
estimation of this parameter, the concepts of margin of error, confidence level, and
sample size seem to equally interplay.
It is interesting to note that researchers wanted to come up with sufficiently enough
number of samples for both practical and theoretical reasons. Determining the
minimum sample size needed precisely ensures that at a certain confidence level
this sample will have an accurate estimate of the population mean under a given
margin of error. This implies that the sample size mainly depends on two things:
confidence level and the margin of error of the confidence interval.
What’s New
Researchers often use the 90%, 95% and 99% as the conventional levels of
confidence for a certain interval estimate. While for the length, we can anticipate how
long the confidence interval will be for any given sample size and confidence level
using the margin of error.
When the population standard deviation is known, the formula for the margin of
𝑧
error 𝐸 is given by 𝐸 = 𝛼/2
𝑛
⋅ 𝜎. This indicates the number that must be added to or
√
subtracted from the sample mean to have the limits of a confidence interval. Thus,
the expected length 𝐿 of a confidence interval is simply 𝑳 = 𝟐𝑬.
Alternatively, when the population standard deviation is unknown, the sample
standard deviation 𝑠 may be used as a substitute. In this case, the margin of error
𝑧
may be computed as 𝐸 ≈ 𝛼/2 𝑛
⋅ 𝑠.
√
70 (DO_Q3_STAT&PROB_MODULE1_LESSON10)
As an illustration, suppose we want to find the expected length of a 95% confidence
interval for estimating the population mean of a sample with 𝑛 = 120 and 𝜎 = 1.5.
Substituting these values to the formula for the margin of error, we have,
𝑧𝛼/2 1.96
𝐸= ⋅𝜎 = ⋅ 1.5 = 0.268
√𝑛 √120
Thus, we expect the length of the confidence interval to be 𝐿 = 2𝐸 = 2(0.268) = 𝟎. 𝟓𝟑𝟔.
What is It
Using the above formula for the margin of error, we can derive a formula for
the suitable sample size 𝑛 given a certain confidence level and margin of error. This
derivation is shown below.
𝑧𝛼/2
𝐸 = 𝑛 ⋅𝜎 [Formula for the Margin of Error]
√
√𝑛 𝑧 √𝑛 √𝑛
(𝐸) = [ 𝛼/2 ⋅ 𝜎] [Multiply both sides by ]
𝐸 √𝑛 𝐸 𝐸
𝒛𝜶/𝟐 ⋅𝜎
√𝑛 = ( 𝐸
) [Simplify]
2 𝒛𝜶/𝟐 ⋅𝜎 2
(√𝑛) = ( 𝐸
) [Square both sides]
𝒛𝜶/𝟐 ⋅𝝈 𝟐
𝒏= ( 𝑬 ) [Simplify]
The last equation above is used to determine the appropriate sample size. As
an illustration, let us find out how large a sample should be if one wants to be 99%
confident that the interval estimate of the population mean is accurate to within 0.05
if the population standard deviation is known to be 2.5.
Substituting the known values to the above formula, we obtain the following:
𝒛𝜶/𝟐 ⋅ 𝝈 𝟐 2.58 ⋅ 2.5 2
𝒏=( ) =( ) = 𝟏𝟔 𝟔𝟒𝟏
𝑬 0.05
Sample Problems:
Solution:
First, we note that the population standard deviation is 0.75 years. Also, the
margin of error is set to 0.02 while the confidence level is 95%. Using the formula
for finding the minimum sample size suited to accurately estimate the population
mean, we have,
71 (DO_Q3_STAT&PROB_MODULE1_LESSON10)
𝒛𝜶/𝟐 ⋅ 𝝈 𝟐 1.96 ⋅ 0.75 2
𝒏=( ) =( ) = 5 402.25
𝑬 0.02
2. Based on previous school records, the standard deviation of the height of Grade
10 students in a particular city is 1.2 inches. A researcher wanted to be sure on
estimating the population mean height to within 0.035 of its true value. Using
99% confidence interval, find the sample size that the researcher needs.
Solution:
Clearly, we have the following given information: 𝜎 = 1.2, 𝐸 = 0.035, and the
confidence level is 99%. Thus, we may calculate the value of 𝑛 as shown below.
What’s More
1. What is the relationship between the length of a confidence interval and the
margin of error?
___________________________________________________________________________
__________________________________________________________________________.
2. What quantities are needed to compute for the appropriate sample size that
can fairly be used as an estimate of a certain parameter?
___________________________________________________________________________
__________________________________________________________________________
.
3. What are the things to be considered in solving problems involving sample
size determination?
___________________________________________________________________________
__________________________________________________________________________.
72 (DO_Q3_STAT&PROB_MODULE1_LESSON10)
What I Can Do
Assessment
Read and analyze each item carefully. Choose the letter that corresponds to your
answer and write it in your notebook.
1. How is the expected length of a confidence interval related to the margin of error?
A. The expected length is equal to the expected length.
B. The expected length is twice the margin of error.
C. The expected length is half the margin of error.
D. The expected length is two more than the margin of error.
2. What is the expected length of the confidence interval if the margin of error was
found to be 0.362?
A. 0.181 B. 0.362 C. 0.724 D. 2.362
3. Which of the following is the correct formula for determining the margin of error?
𝑧𝛼/2 𝑧𝛼/2 𝑧𝛼/2 𝑧𝛼/2
A. 𝐸 = 𝑛 ⋅ 𝜎 B. 𝐸 = 𝑛 ⋅𝜎 C. 𝐸 = 𝑛⋅𝜎 D. 𝐸 = 𝑛
√ √ √ √
4. A certain sample has a size of 200 and a standard deviation of 1.4. If a 95%
confidence interval is to be made, what is the value of the margin of error?
A. 0.194 B. 0.098 C. 0.117 D. 0.255
5. How long is the confidence interval that may be constructed in item number 4?
A. 0.196 B. 0.234 C. 0.388 D. 0.510
6. Which of the following is the correct formula for determining the appropriate
sample size?
𝑧𝛼/2 ⋅𝜎 𝑧𝛼/2 2 𝑧𝛼/2 2 𝑧𝛼/2 ⋅𝜎 2
A. 𝑛 = B. 𝑛 = ( ) ⋅𝜎 C. 𝑛 = ( ) D. 𝑛 = ( )
𝐸 𝐸 𝐸⋅𝜎 𝐸
73 (DO_Q3_STAT&PROB_MODULE1_LESSON10)
8. To which quantity does 0.015 refer?
A. length of the interval C. mean
B. margin of error D. standard deviation
9. What needs to be determined in the above problem?
A. length of interval C. population mean
B. margin of error D. sample size
10. Which of the following is the correct answer to the problem?
A. 437 B. 438 C. 757 D. 758
Additional Activities
74 (DO_Q3_STAT&PROB_MODULE1_LESSON10)
(DO_Q3_STAT&PROB_MODULE1_LESSON10) 75
What I Know What’s More
1. B 6. D 37 443
2. B 7. B
3. A 8. B
What I can Do
4. D 9. D
4 516
5. D 10. B
Assessment Additional Activities
1. B 6. D 1.a. 2 033
2. C 7. B b. 3 522
3. A 8. B 2. a. 4 058
4. A 9. D b. 7 031
5. C 10. D
Answer Key
Almeda,Capistrano, Ferry Sarte. (2010). Elementary Statistics. Quezon City: University of the
Philippines Press.
Belecina, R., Baccay, E., & Mateo, E. (2016). Statistics and Probability. Manila,Philippines:
REX Book Store Inc.
Bluman, A. (2018). Elementary Statistics: A Step by Step Approach 10th edition. McGraw
Hill. New York, USA.
Canlapan, R. (2016). Statistics and Probability. Makati, Philippines: Diwa Learning System
Inc.
Keller, Warrack. (2003). Statistics For Management and Economics. California USA:
Thomson Learning, Inc.
Levine, et. al. (2005). Statistics: A Handbook for Managers. New Jersey: Prentice Hall.
PERCDC Learnhub
Walpole, R., Myers, R., Myers, S., and Ye, K., (2012). Probability and Statistics for Engineers
and Scientists 9th edition. Pearson Education Inc. Massachusetts, USA.
76 (DO_Q3_STAT&PROB_MODULE1_LESSON10)
For inquiries or feedback, please write or call:
Department of Education – SDO Valenzuela
77 (DO_Q3_STAT&PROB_MODULE1_LESSON10)