Statistics & Probability: Quarter 3
Statistics & Probability: Quarter 3
Statistics & Probability: Quarter 3
STATISTICS &
PROBABILITY
QUARTER 3
MODULE 1
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of
the Government of the Philippines. However, prior approval of the government agency or office
wherein the work is created shall be necessary for exploitation of such work for profit. Such
agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this module are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials from their
respective copyright owners. The publisher and authors do not represent nor claim ownership
over them.
ii
Targets:
1. Illustrate random variables (discrete and continuous) (M11/12SP-IIIa-1);
2. Distinguish between a discrete and a continuous random variable
(M11/12SP-IIIa-2);
3. Find the possible values of a random variable(M11/12SP-IIIa-3); and
4. Illustrate a probability distribution for a discrete random variable and its
properties (M11/12SP-IIIa-4).
Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
______1.) It is a random variable that has countable set of possible outcomes.
A. constant B. continuous C. discrete D. finite
Three coins are tossed and the random variable Y gives the number of tails.
DO_Q3_STATISTICS&PROBABILITY_
1 GRADE 11_LESSON1
Lesson
Exploring Random Variables
1
Random Variable
- a function that associates a numerical value with every outcome of
an experiment
Below are examples of discrete random variables because their possible values
are obtained through counting.
Example 1. Suppose two coins are tossed. Let Y be the random variable representing
the number of tails that occur. Find the values of the random variable, Y.
Illustration:
So, the possible values of the random variable (range space) Y are 0,1 and 2.
DO_Q3_STATISTICS&PROBABILITY_
2
GRADE 11_LESSON1
Example 2. Anne conducted an experiment and tested four cell phones at random.
She wants to find out the number of defective cell phones that occur. Thus, to each
outcome in a sample space she assigned a value. These are 0, 1, 2, 3 and 4. If there
is no defective cell phone, she assigned number 0; if there is 1 defective cell phone,
she assigned number 1; if there are two defective cell phones; she assigned 2; she
assigned 3, if there are three defective cell phones; and 4 if the four cell phones are
all defective. The number of defective cell phones is a random variable.
Illustration: Let D represent the defective cell phone and N represent the non-
defective cell phone. If we let X be the random variable representing the number of
defective cell phones, what are the values of the random variable X?
The possible values of this random variable (range space) are 0, 1, 2, 3 and 4.
Below are examples of continuous random variable.
Example 1. Suppose an experiment is conducted to determine the distance that a
certain type of car will travel using 20 liters of gasoline over a prescribed test course.
If distance D is a random variable, then we have an infinite number of distances that
cannot be equated to the number of whole numbers.
Example 2. XYZ machine was operated, and the recorded time it starts to experience
a glitch G illustrates a continuous random variable since the value of the variable may
be assigned using measurement.
PROBABILITY DISTRIBUTION
➢ also known as probability mass function/discrete probability distribution
➢ a table that gives a list of probability values along with their associated value in
the range of a discrete random variable
The following properties are observed given that 𝑝𝑖 are individual probabilities
for each value (𝑥𝑖 ) of the random variable:
1. Each probability value ranges from 0 to 1, in symbols, 0≤ 𝒑𝒊 ≤ 𝟏.
2. The sum of all the individual probabilities in the distribution is equal to 1.
Thus, n
𝑝𝑖 + 𝑝2 + 𝑝3 +……𝑝𝑛 = ∑ 𝑝𝑖 = 1.
i=1
Example 1. Three coins are tossed, and the random variable Z gives the number of
heads. Find the values of the random variable Z.
DO_Q3_STATISTICS&PROBABILITY_
3
GRADE 11_LESSON1
Sample space: {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}
Example 1: An investor has 5 stocks that she follows each day. The random variable
being studied is Y. Based on the table below, what is P(0)?
Y 0 1 2 3 4 5
P (Y) ? 0.27 0.34 0.11 0.07 0.02
Solution:
Since the sum of all the individual probabilities in the distribution is equal
to 1,
P(0) + P(1) + P(2) + P(3) + P(4) + P(5)= 1
P(0) + 0.27 + 0.34 + 0.11 + 0.07 + 0.02= 1
P(0) + 0.81= 1
P(0) = 1 - 0.81
P(0) = 0.19
Therefore, the probability that the investor has zero (0) stocks is 0.19.
Roxas family has three children. Let A represent the number of boys.
Construct a probability distribution for the random variable A.
DO_Q3_STATISTICS&PROBABILITY_
4
GRADE 11_LESSON1
A. Directions: Classify the following random variables. In your notebook, write
DISCRETE if it is a discrete random variable. Otherwise, write
CONTINUOUS.
____________1.) number of defective computers
____________2.) weight of newborns in a hospital
____________3.) the time students had breakfast
____________4.) number of hours spent in watching TV
____________5.) the amount of time a student spends on Facebook
Targets:
1. Compute probabilities corresponding to a given random variable
(M11/12SP-IIIa-6);
2. Illustrate the mean and variance of a discrete random variable
(M11/12SP-IIIb-1); and
3. Calculate the mean and variance of a discrete random variable
(M11/12SP-IIIb-2).
Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
______1.) The sum of the product of each value of a discrete random variable X and
its probability is referred as its _____.
A. standard deviation C. mean
B. variance D. variables
______2.) Consider the given discrete probability distribution. Find the probability
that M equals 5.
M 3 5 7 9
P(M) 0.12 ? 0.26 0.04
A. 0.58 C. 1
B. 0.28 D. 0.48
DO_Q3_STATISTICS&PROBABILITY_
5
GRADE 11_LESSON2
______3.) Mang Memong’s bakery has determined a probability distribution for the
number of ensaymada it sells in a given day. The distribution is as follows:
DO_Q3_STATISTICS&PROBABILITY_
6
GRADE 11_LESSON2
➢ It is also sometimes called the expected value or expectation of X denoted by E(X).
Sample Problems:
Example 1. Calculate the mean and variance for a random variable X defined as the
number of tails in two tosses of a coin.
2 0.25 0.5
Step 3. Sum up the product of X and P(X).
𝝁 = ∑ 𝑿 ∙ 𝑷(𝑿)
𝜇 = 0 + 0.5 + 0.5
𝝁 =1
DO_Q3_STATISTICS&PROBABILITY_
7
GRADE 11_LESSON2
Example 2. Calculate the mean and variance for a random variable X defined as
the sum of the scores on the two dice.
Illustration:
X 2 3 4 5 6 7 8 9 10 11 12
P(X) 1/36 2/36= 3/36= 4/36= 5/36= 6/36= 5/36= 4/36= 3/36= 2/36= 1/36=
=0.03 0.06 0.08 0.11 0.14 0.17 0.14 0.11 0.08 0.06 0.03
X ∙ P(X) 0.06 0.18 0.32 0.55 0.84 1.19 1.12 0.99 0.8 0.66 0.36
Variance
(𝝈𝟐 ) = ∑(𝑿 − 𝝁)2∙ 𝑷(𝑿)
= 0.77 + 0.99 + 0.75 + 0.47
+ 0.16 + 0 + 0.12 + 0.41 + 0.69
+ 0.93 + 0.73
𝝈𝟐 = 6.02
DO_Q3_STATISTICS&PROBABILITY_
8
GRADE 11_LESSON2
A survey conducted by a researcher showed the number of students’ siblings
of Grade 11 – Albert Bandura (GAS) in Bignay National High School-SHS.
Y 6 8 10
DO_Q3_STATISTICS&PROBABILITY_
9
GRADE 11_LESSON2
A. The table below shows the probability distribution of a discrete random
variable B.
B 0 1 2 3 4 5
Y 0 1 2
Targets:
1. Interpret the mean and variance of a discrete random variable
(M11/12SP-IIIb-3); and
2. Solve problems involving the mean and variance of probability distributions
(M11/12SP-IIIb-4).
Do this Pre-Test: Choose the letter of the best answer. Write your answer in
your notebook.
______1.) In a pizza parlor, the following probability distribution was obtained for the
number of toppings ordered on a large pizza. Find the mean and standard
deviation for the random variable.
A 0 1 2 3 4
P(A) 0.30 0.40 0.20 0.06 0.04
DO_Q3_STATISTICS&PROBABILITY_
10
GRADE 11_LESSON3
______2.) Michaella is a star player on her college soccer team. She loves to score
goals and she does this on a regular basis. The probability that she will
score 0, 1, 2, or 3 goals are 0.25, 0.35, 0.25, and 0.15 respectively. What is
her most likely number of goals?
A. 1 B. 1.2 C. 1.3 D. 1.4
______3.) Two fair coins are tossed. If Z represents the number of tails, what is the
expected value of Z?
A. 1 B. 1.5 C. 0.5 D. 2
______4.) A’s Burger house has a probability distribution of sales as shown below:
P P(P)
25 0.20
40 0.20
50 0.60
Using this probability distribution, what is the expected value and
variance?
A. Expected Value: 43 C. Expected Value: 43
Variance: 46 Variance: 96
B. Expected Value: 43 D. Expected value: 43
Variance: 69 Variance: 60
_______5.) Three patients are treated with a specific medicine. The probabilities for
0, 1, 2, or 3 successes are 0.24, 0.12, 0.30, and 0.34 respectively. What
is the average number of successes in this treatment?
A. 1.71 B. 1.72 C. 1.73 D. 1.74
DO_Q3_STATISTICS&PROBABILITY_
11
GRADE 11_LESSON3
To get the variance of a discrete probability distribution we can use this
alternative formula: Variance (𝝈𝟐 ) = ∑ 𝑿𝟐 ∙ 𝑷(𝑿) - 𝝁𝟐
The standard deviation of a discrete probability distribution is given by the
formula:
Standard deviation (𝝈)=√∑ 𝑿𝟐 ∙ 𝑷(𝑿) − 𝝁𝟐
where: X = value of the random variable
P(X) = probability of the random variable X
𝜇 = mean of the probability distribution
The owner of Malinamnam Restaurant asks the staff to record the number
of people who order their specialty in weekdays. The table below shows the
probability distribution. Solve for the following:
a) mean b) variance c) standard deviation
12 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON3
W 1 2 3 4 5
DO_Q3_STATISTICS&PROBABILITY_
13
GRADE 11_LESSON3
A. Directions: Given the table below; calculate the mean, variance and standard
deviation.
D 1 2 3
P(D) 0.41 ? 0.41
Targets:
1. Identify regions under the normal curve corresponding to different
standard normal values (M11/12SP-IIIc-3);
2. Convert a normal random variable to a standard normal variable and vice versa
(M11/12SP-IIIc-4); and
3. Compute probabilities and percentiles using the standard normal table
(M11/12SP-IIId-1).
Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
Lesson
Normal Random Variable
4
In this lesson you will learn to identify regions under the normal curve
corresponding to different standard normal values, converts a normal random
variable to a standard normal variable and vice versa, computes probabilities and
percentiles using the standard normal table. (M11/12SP-IIIc-3-4, M11/12SP-IIIc-
d-1)
But to find the probability beyond the empirical rule is possible through the
use of the standardized normal table (See appendices).
The Cumulative Standardized
Normal table gives the probability
less than a desired value for Z (i.e.,
from negative infinity to Z)
The following is the general procedure for finding probabilities under the
normal curve as to find P (a < X < b) when X is distributed normally:
1. Draw the normal curve for the problem in terms of X
2. Translate x-values to z-values
3. Use the standard normal table
Example 2: Suppose X is normal with mean 8.0 and standard deviation 5.0.
DO_Q3_STATISTICS&PROBABILITY_
17 GRADE 11_LESSON4
a. Find 𝑃(𝑥 < 8.6) b. Find 𝑃(𝑥 > 8.6) c. Find 𝑃(80 < 𝑥 < 8.6)
Solution (a): 𝑃(𝑥 < 8.6)
𝑥−𝜇 8.6−8.0
Step1 Step 2. 𝑧 = = = 0.12
𝜎 5.0
Step 3:
Solution (c):
Step 3
Step 3.
DO_Q3_STATISTICS&PROBABILITY_
18
GRADE 11_LESSON4
The percentage of students at school between 1.1m and 1.7m tall is 2. 95%.
Assuming that the data is normally distributed, calculate the mean and standard
deviation. Construct the normal curve corresponding to the data.
1.1𝑚 + 1.7𝑚
𝜇= = 1.4𝑚
2
95% is 2 standard deviations either side of the mean (a total of 4
standard deviations). So:
1.7𝑚 − 1.1𝑚
𝜎= = 0.15𝑚
4
Short Answer. Supply the correct answer. Write your answer in your notebook.
_________1. It is a random variable where the data can take infinitely many values.
_________2. True or False. Getting the probability distribution in discrete random
variable is the same in continuous random variable.
_________3. The graphical form of the probability distribution for continuous
random variable is a________________.
Problem Solving
1. Sixty eight percent of the marks in a test are between 51 and 64. Assuming this
data is normally distributed, what is the standard deviation?
2. Using the table of values find: (a) 𝑃(𝑍 ≤ 1.31), (b) 𝑃(𝑍 ≥ 1.32), (c)
Targets:
1. Illustrate random sampling (M11/12SP-IIId-2); and
2. Distinguish between parameter and statistic (M11/12SP-IIIc-3).
DO_Q3_STATISTICS&PROBABILITY_
19
GRADE 11_LESSON5
Do this Pre-Test: Answer the questions below to measure what you already know
about the next lessons. Write your answer on your lecture
notebook.
1. Sampling is about ______ a sample from a population.
A. adding B. selecting C. including
2. Simple random sampling is a ______ sampling method.
A. confusing B. non-probability C. probability
3. It is defined as the characteristic of a sample.
A. Parameter B. Statistic C. Random sampling
4. It refers to the numbers that summarize data for entire population.
A. Parameter B. Statistic C. Random sampling
5. x̄, 𝑠 2 , and s are commonly used symbols to represent the mean, variance and
standard deviation of the ___________.
A. population B. sample C. statistics
20 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON5
Multistage random sample is constructed by taking a series of simple
random samples in stages. This type of sampling is often more practical than simple
random sampling for studies requiring “on location” analysis such as door-to-door
surveys.
What is parameter?
What is statistic?
Mean x̄ 𝜇
Variance s2 𝜎2
Standard Deviation s 𝜎
1. A new business with 100 employees wants to know whether staff would like to
have childcare facilities on site. An estimate could be made by asking a sample of
20 employees if they would use the childcare facilities and multiplying the
number who said yes by 5. Decide which people to ask.
Solution:
To get a random sample of 20 people, we could assign each person a number from
1 to 100 and then select 20 numbers using a random method.
One Random method is to write the 100 numbers on separate slips, put them in
a bag, shake them, and take 20 of them out without looking.
DO_Q3_STATISTICS&PROBABILITY_
21
GRADE 11_LESSON5
Alternative solution using technology:
The button might say “RANDOM” (SHARP). Other makes may have a button “Ran”
or “Ran#” or “RanInt”. Whichever you have, selecting and pressing ENTER
repeatedly gives random numbers.
Generate a Random Number between 0 and 99.
Exercise what you have learned by answering the questions below. Write
your answer on your activity notebook.
1. How many of each of the 3 types of computer component should be taken in a
sample of 100 categorized by type of component?
Type A B C D
2. A manager samples the receipts of every fifth person who goes through the line.
Out of 50 people, 4 had a mispriced item. If 600 people go to this store each day,
how many people would you expect to have a mispriced item?
Targets:
1. Identifies sampling distributions of statistics (sample mean)
(M11/12SP-IIId-4); and
2. Finds the mean and variance of the sampling distributions of the sample mean
(M11/12SP-IIId-5.
Do this Pre-Test: Choose the letter of the correct answer. Write your answer in
your notebook.
1. A random variable is a variable usually denoted by X that has a __________
numerical value determined by chance for each outcome of a procedure or
experiment.
A. small B. large C. single D. double
2. A probability distribution is a graph, table, or function that gives the _______ for
each value of the random variable.
A. input B. output C. outcome D. probability
3. The mean of a probability distribution is also called __________.
A. extreme value C. extraneous value
B. expected value D. erroneous value
4. In a probability distribution, the probability of each random variable must be
between __________.
A. 0 and 1 B. 0 and 10 C. -1 and 0 D. 1 and 2
5. In a probability distribution, the sum of all the probabilities must be equal to
__________.
A. 100 B. 10 C. 1 D. 0.99
DO_Q3_STATISTICS&PROBABILITY_
23
GRADE 11_LESSON6
Lesson
Sampling Distribution of the MEAN
6
In this lesson, the learners will deepen their concept of sampling distribution
of the mean. Skills on the computation of the mean and variance of the sampling
distribution of the mean will also be developed.
Let’s say, we have a finite population of size N=5 that consists of the numbers
1, 2, 3, 4, and 5. (We know that for this population, the mean, µ = 3 and the variance,
σ2 = 2). Now let’s take a random sample of size n=3, say 1, 4, and 2, and compute
the mean.
1+4+2
Sample: 1, 4, 2 𝑋̅ = 3
= 2.3
What can you say about the sample mean, 𝑋̅ and the population mean, µ?
How many different samples of the same size, n=3, do you think we can get
from a population of N=5? (Clue: nCr → 5C3)
In this lesson we will try to list all the possible samples that we can get from a given
finite population, and present them in a tabular form.
We need to learn about sampling distribution because it is an essential
element in Statistical Inference. We remind you that Statistical Inference is the
process of using sample results to draw conclusions about the characteristics of a
population.
Definition: The Sampling Distribution of the Mean is the probability
distribution of the sample means.
DO_Q3_STATISTICS&PROBABILITY_
24
GRADE 11_LESSON6
Remember: The number of samples of size n that can be drawn from a
population of size N is given by NCn. (Look for the function nCr in your scientific
calculator.)
Now, let us make a frequency distribution of the sample means.
Sample Mean, 𝑋̅ Frequency
2 1
2.3 1
2.7 2
3 2
3.3 2
3.7 1
4 1
Total = 10
At this point, we simply use the classical definition of probability to compute
for the probability of getting each sample mean, P(𝑋̅).
DO_Q3_STATISTICS&PROBABILITY_
25
GRADE 11_LESSON6
Consider a population of 4 typists from a certain company who were asked to
type the same page of a manuscript. The number of errors made by each typist is
presented below.
Typist Number of Errors
Anna 1
Betty 3
Carol 2
Denice 4
Required: List all the possible samples of size n=2, and then check if µ𝑥̅ = µ.
(Levine, et. al., 2005, pp. 253-254)
Problem Solving
There are 10 students enrolled in a preschool. Their ages are
Student Age Student Age
A 4 F 5
B 5 G 4
C 4 H 4
D 3 I 3
E 5 J 6
a) Compute for the mean, µ and variance, σ2 of the ages of the students in this
preschool.
DO_Q3_STATISTICS&PROBABILITY_
26
GRADE 11_LESSON6
Targets:
Do this Pre-Test: Choose the letter of the best answer. Write your answers on your
notebook.
1. From a finite population of N=7, the number of samples of size n=3 that can be
drawn is __________.
A. 25 B. 30 C. 35 D. 40
2. From a finite population of N=10, the number of samples of size n=5 that can be
drawn is __________.
A. 252 B. 254 C. 256 D. 258
3. In a sampling distribution of the mean, as the sample size increases, the variance
__________.
A. -1 B. decreases C. increases D. remains unchanged
4. The standard deviation of the sampling distribution of the sample means is
known as ___________.
A. Variance C. Standard Deviation
B. Standard estimate of the mean D. Standard error of the mean
5. A __________ is one that consists of a finite or fixed number of elements,
measurements, or observations.
A. Finite Population C. Frequent Population
B. Infinite Population D. cannot be determined
Lesson
7 The Central Limit Theorem
This lesson will provide information, activities and exercises which will lead the
leaners to the concept of central limit theorem and will enhance skills on solving for
the problems involving the sampling distributions of the mean.
The table below shows the results that we obtained from the last lesson
when we studied the sampling distribution of the mean. From a finite population of
N=5, we drew samples of different sizes and computed the Mean and Variance of
each sampling distribution of the mean.
POPULATION SAMPLING DISTRIBUTION OF THE MEAN
N=5 n=2 n=3 n=4
MEAN µ=3 µ𝑥̅ = 3 µ𝑥̅ = 3 µ𝑥̅ = 3
VARIANCE σ2 = 2 σ2𝑥̅ = 0.75 σ2𝑥̅ = 0.33 σ2𝑥̅ = 0.12
What do you notice about the values of the variance of the sampling
distribution as the sample size increases? Do you think the same thing happens to
27 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON7
the standard deviation? Now consider the experiment of throwing a fair die so many
times until we get a very large population and our muscles ache because of
continuous throwing. In this case we have an infinite population. (Keller/Warrack,
p.270)
Let X be the random variable that gives the number of dots showing after each
throw, the probability distribution for the random variable is given below.
X 1 2 3 4 5 6
P(X) 1/6 1/6 1/6 1/6 1/6 1/6
Population Mean, µ = ∑ X • P(X)
1 1 1 1 1 1
= 1( ) + 2( ) + 3( ) + 4( ) + 5( ) + 6( )
6 6 6 6 6 6
= 3.5
1+4+6+4+4
5 Throws, n = 5 → 𝑋̅ = = 3.8
5
1+6+4+5+3+5+5+5+1+2
10 Throws, n = 10 → 𝑋̅ = = 3.7
10
5+3+1+4+6+1+3+3+4+3+5+2+5+5+6+4+2+2+4+6+6+4+2+1+1
25 Throws, n = 25 → 𝑋̅ =
25
= 3.5
That’s right! As the number of throws of the die increases, the mean becomes
closer and closer to 3.5, the population mean. In other words, as the sample size
increases, the probability that the sample mean will be close to 3.5 also increases.
Why do you think is this so?
(CLUE: It has something to do with the averaging process.)
In the last lesson, we observed in the sampling distribution of the mean that
as the sample size increases, the variance decreases.
We expect the same thing in our experiment today. Notice that as the sample
size increases, 𝑋̅ becomes closer to µ=3.5. As the sample size increases, we are likely
to obtain 𝑋̅’s from our samples that crowd around the µ of the population which is
3.5. When this happens, we know that the variance will decrease.
This does not happen by coincidence. This can be explained by the formula
𝜎2
σ2𝑥̅ = 𝑛
where σ2𝑥̅ = variance of the sampling distribution
σ2 = variance of the population
n = sample size
28 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON7
Another thing that happens as n gets larger is that the sampling distribution
̅
of 𝑋 becomes increasingly bell shaped. (Keller/Warrack, p.274)
All these things that we learned today can be summarized by the following
theorem.
THE CENTRAL LIMIT THEOREM
If random samples of size n are drawn from a population, then as n becomes
larger, the sampling distribution of the mean approaches the normal distribution,
regardless of the shape of the parent population. (Belecina, et. al., p.119)
Because of this phenomenon, we can use the Central Limit Theorem to solve
a lot of real- life problems. This amazing theorem justifies the use of the Normal
Curve and the Z-table.
𝑋−µ
From our original formula of Z= , we now have this new formula if
𝜎
we want to compute for the probability that 𝑋̅ will take some values in the sampling
distribution of 𝑋̅.
FORMULA
𝑋̅ −µ
Z= where 𝑋̅ = mean
𝜎/√𝑛
µ = population mean
σ = population standard deviation
n = sample size
Example
The scores on a Statistics midterm exam are normally distributed with a mean of 78
and a standard deviation of 6. If a student is selected at random
(a) what is the probability that his/her score is higher than 90?
(b) what is the probability that his/her score is less than 75?
(c) what is the probability that a class of 30 students will have an average score of
less than 75?
SOLUTION:
Given: µ = 78 σ=6
The population is normally distributed.
(a) P(X>90) = P(Z> ?) = ?
𝑋−µ 90−78 12
Let’s solve for Z: Z= = = =2
𝜎 6 6
Use the Z-table to determine the required area,
P(X>90) = P(Z>2) = 0.4772
Therefore, the probability that his/her score is higher than 90 is 0.0228 or
2.28%.
DO_Q3_STATISTICS&PROBABILITY_
29
GRADE 11_LESSON7
(c) In this question we are given a sample of size 30, and we are asked to determine
the probability that the mean 𝑋̅ of this sample will be less than 75. As explained
above, the Central Limit Theorem allows us to use the Z-table in this type of problem.
P(𝑋̅<75) = P(Z<?) = ?
𝑥̅ −µ 75−78
Let’s convert our 𝑋̅ to Z: Z= = = -2.74
𝜎/√𝑛 6/√30
Use the Z-table to determine the required area.
P(𝑋̅<75) = P(Z<-2.74) = 0.0031
Therefore, the probability that the sample will have a mean of less than 75 is
0.0031 or 0.31%.
Therefore, the probability that a bottle will contain more than 1000ml is
0.7257 or 72.57%.
(b) P(𝑋̅>1000) = P(Z>?) = ?
𝑥̅ −µ 1000−1006
Let’s convert 𝑋̅ to Z: Z= = = -1.2
𝜎/√𝑛 10/√4
Use the Z-table to determine the required area.
P(𝑋̅>1000) = P(Z>-1.2) = 0.8849
Therefore, the probability that the mean of the 4 bottles is 0.8849 or 88.49%.
Targets:
1. Illustrate the t-distribution (M11/12SP-IIIg-2).
2. Identify critical values (percentiles) using the t-table (M11/12SP-IIIg-5).
Read and analyze each item carefully. Choose the letter that corresponds to
your answer and write it in your notebook.
1. Which of the following refers to the number of subjects or observations taken
from a population?
A. sample size C. degrees of freedom
B. confidence level D. t-value
2. Given the sample size, how do we compute the degrees of freedom?
A. Add one to the sample size.
B. Subtract one from the sample size.
C. Divide the sample standard deviation by the sample size.
D. Divide the sample size by the sample standard deviation.
3. The size of a sample is 19. What is the degrees of freedom?
A. 18 B. 20 C. 34 D. 100
Consider the situation below to answer item numbers 4 and 5.
A certain sample whose size is 10 is known to have a mean of 73.25 with a
standard deviation of 2.5.
4. Find 𝑡𝛼/2 for a 99% confidence interval.
A. 1.833 B. 2.228 C. 2.262 D. 3.250
5. Which of the following gives the interval estimate for the population mean with
99% confidence?
A. 71.801 < 𝜇 < 74.699 C. 70.681 < 𝜇 < 75.819
B. 71.489 < 𝜇 < 75.011 D. 71.462 < 𝜇 < 75.038
DO_Q3_STATISTICS&PROBABILITY_
31 GRADE 11_LESSON8
Lesson
8 The t-Distribution
In this lesson, the learners will understand the concept of t-distribution. The
process of identifying critical values or percentiles to be used in constructing
confidence intervals using the t-table will also be presented.
Previously, when 𝜎 is known and the sample size is at least 30 or the sample
size is less than 30 but is taken from a population that is approximately normally
distributed, the confidence interval can be easily determined with the help of the z-
distribution.
Unfortunately, in most cases, we seldom know the population standard
deviation 𝜎 and what we have at hand is just the sample standard deviation 𝑠.
Finding a confidence interval for the population mean 𝜇 using only 𝑠 becomes a
different task to do.
Thus, this situation needs the help of a more suited distribution which is the
t-distribution.
The general expression for estimating the population mean using 𝑠 is given by
𝒔
𝑿 ± 𝒕 ( ).
√𝒏
DO_Q3_STATISTICS&PROBABILITY_
32
GRADE 11_LESSON8
Characteristics of the t-distribution
➢ The variance is greater than 1.
➢ The t-distribution curve is bell-shaped.
➢ The curve is symmetrical about the mean.
➢ The distribution curve is asymptotic to the x-axis.
➢ The mean, median, and mode are equal to zero and are located at the center of
the distribution.
➢ As the sample size increases, the t-distribution approaches the normal
distribution (see Figure 1)
Sample Problems:
1. Find 𝑡𝛼/2 for a 95% confidence interval of a sample with size 16.
Solution:
In a t-distribution table, the values represent the proportions of areas in the tails
of the t-curve. We note that 𝑛 = 16 and that the confidence interval is 95%, so 𝛼 =
0.05. Thus, 𝛼/2 = 0.025, and we wish to find 𝑡0.025 .
First, we compute for the degrees of freedom as follows:
𝑑𝑓 = 𝑛 − 1
𝑑𝑓 = 16 − 1
𝑑𝑓 = 15
Next, we refer to the t-table below, we get the intersection of df=15 and 𝑡0.025 .
Thus, 𝑡𝛼/2 for a 95% confidence interval of a sample with size 16 is 2.131.
1. A sample composed of 20 male senior high school students were taken
randomly from a normally distributed population. Their heights were taken, and
it was found out that the mean is 65 inches with a standard deviation of 2.15
inches. With 95% confidence, find an interval estimate for the population mean.
Solution:
DO_Q3_STATISTICS&PROBABILITY_
33
GRADE 11_LESSON8
From the given, the sample was taken randomly and that the population from
which it was taken is normally distributed. Thus, we can use 𝑠 and the t-
distribution to find an interval estimate for 𝜇.
Now, we find 𝑡𝛼/2 where 𝛼 = 0.05. With 𝑑𝑓 = 20 − 1 = 19, it can be verified from
the t-table that 𝑡𝛼/2 = 2.093.
Next, we compute for the confidence interval limits as shown below:
𝑠 2.15
𝑋 ± 𝑡 ( ) = 65 ± (2.093) ( ) = 65 ± 1.006
√𝑛 √20
It follows that the confidence interval limits are:
Upper: 65 + 1.006 = 66.006 and Lower: 65 – 1.006 = 63.094.
Therefore, with 95% confidence, the mean height of the population lies in the
interval 𝟔𝟑. 𝟎𝟗𝟒 < 𝝁 < 𝟔𝟔. 𝟎𝟎𝟔.
Read and analyze each item carefully. Choose the letter that corresponds to
your answer and write it in your notebook.
1. Which of the following refers to the number of values that can vary in a certain
analysis or computation?
A. confidence level C. sample size
B. degrees of freedom D. t-value
2. How do we compute the degrees of freedom?
A. Add one to the sample size.
B. Subtract one from the sample size.
C. Divide the sample standard deviation by the sample size.
D. Divide the sample size by the sample standard deviation.
3. The size of a sample is 17. What is the degrees of freedom?
A. 16 B. 18 C. 34 D. 100
Consider the situation below to answer item numbers 4 and 5.
A certain sample whose size is 10 is known to have a mean of 73.25 with a
standard deviation of 2.5.
4. Find 𝑡𝛼/2 for a 95% confidence interval.
A. 1.833 B. 2.228 C. 2.262 D. 3.250
DO_Q3_STATISTICS&PROBABILITY_
34
GRADE 11_LESSON9
5. Which of the following gives the interval estimate for the population mean with
95% confidence?
A. 71.801 < 𝜇 < 74.699 C. 70.681 < 𝜇 < 75.819
B. 71.489 < 𝜇 < 75.011 D. 71.462 < 𝜇 < 75.038
Targets:
1. Identify the length of a confidence interval (M11/12SP-IIIj-1).
2. Compute for the length of a confidence interval (M11/12SP-IIIj-2).
3. Compute for an appropriate sample size using the length of the interval
(M11/12SP-IIIj-3).
4. Solve problems involving size determination (M11/12SP-IIIj-4).
Read and analyze each item carefully. Choose the letter that corresponds to your
answer and write it in your notebook.
1. How is the expected length of a confidence interval related to the margin of error?
A. The expected length is equal to the expected length.
B. The expected length is twice the margin of error.
C. The expected length is half the margin of error.
D. The expected length is two more than the margin of error.
Refer to the problem below to answer item numbers 2 to 5.
The standard deviation of the temperature of all the mall goers from a certain
city on a Tuesday morning is 0.16°C. How many of them must be chosen to get
an accurate estimate of the mean temperature within 0.015 at 95% confidence
level?
2. What does 0.16 refer to?
A. population mean C. sample mean
B. population standard deviation D. sample standard deviation
3. To which quantity does 0.015 refer?
A. length of the interval C. margin of error
B. mean D. standard deviation
4. What needs to be determined in the above problem?
A. margin of error C. length of interval
B. population mean D. sample size
5. Which of the following is the correct answer to the problem?
A. 437 B. 438 C. 757 D. 758
Lesson
Confidence Level and Sample Size
9
In this lesson, the learners will understand the relationship between
confidence level and sample size. The process of determining an appropriate sample
size will also be presented.
DO_Q3_STATISTICS&PROBABILITY_
35
GRADE 11_LESSON9
You may recall that a confidence interval is an interval estimate of the true
population mean composed of a range of values that possibly contains it. In the
estimation of this parameter, the concepts of margin of error, confidence level, and
sample size seem to equally interplay.
When the population standard deviation is known, the formula for the margin
𝑧
of error 𝐸 is given by 𝐸 = 𝛼/2
𝑛
⋅ 𝜎. This indicates the number that must be added to or
√
subtracted from the sample mean to have the limits of a confidence interval. Thus,
the expected length 𝐿 of a confidence interval is simply 𝐿 = 2𝐸.
Alternatively, when the population standard deviation is unknown, the sample
standard deviation 𝑠 may be used as a substitute. In this case, the margin of error
𝑧
may be computed as 𝐸 ≈ 𝛼/2 ⋅ 𝑠.
√𝑛
As an illustration, suppose we want to find the expected length of a 95%
confidence interval for estimating the population mean of a sample with 𝑛 = 120 and
𝜎 = 1.5. Substituting these values to the formula for the margin of error, we have,
𝑧𝛼/2 1.96
𝐸= ⋅𝜎= ⋅ 1.5 = 0.268
√𝑛 √120
Thus, we expect the length of the confidence interval to be 𝐿 = 2𝐸 = 2(0.268) =
𝟎. 𝟓𝟑𝟔.
Using the above formula for the margin of error, we can derive a formula for
the suitable sample size 𝑛 given a certain confidence level and margin of error. This
derivation is shown below.
𝑧
𝐸 = 𝛼/2
𝑛
⋅𝜎 [Formula for the Margin of Error]
√
√𝑛 𝑧 √𝑛 √𝑛
(𝐸) = [ 𝛼/2 ⋅ 𝜎] [Multiply both sides by ]
𝐸 √𝑛 𝐸 𝐸
𝑧𝛼/2 ⋅𝜎
√𝑛 = ( 𝐸
) [Simplify]
2 𝑧𝛼/2 ⋅𝜎 2
(√𝑛) = ( 𝐸
) [Square both sides]
𝑧𝛼/2 ⋅𝜎 2
𝑛=( 𝐸
) [Simplify]
DO_Q3_STATISTICS&PROBABILITY_
36
GRADE 11_LESSON9
The last equation above is used to determine the appropriate sample size. As
an illustration, let us find out how large a sample should be if one wants to be 99%
confident that the interval estimate of the population mean is accurate to within 0.05
if the population standard deviation is known to be 2.5.
Substituting the known values to the above formula, we obtain the following:
𝑧𝛼/2 ⋅ 𝜎 2 2.58 ⋅ 2.5 2
𝑛=( ) =( ) = 𝟏𝟔, 𝟔𝟒𝟏
𝐸 0.05
Sample Problems:
1. It is known that the length of life of the batteries produced by a certain
manufacturing company is normally distributed with a standard deviation of
0.75 years. How many sample batteries must be taken to estimate the true mean
length to within 0.02 years at 95% confidence?
Solution:
First, we note that the population standard deviation is 0.75 years. Also, the
margin of error is set to 0.02 while the confidence level is 95%. Using the formula
for finding the minimum sample size suited to accurately estimate the population
mean, we have,
𝑧𝛼/2 ⋅ 𝜎 2 1.96 ⋅ 0.75 2
𝑛=( ) =( ) = 5, 402.25
𝐸 0.02
Hence, 5, 403 batteries must be taken to ensure an accurate estimate of
the population mean.
2. Based on previous school records, the standard deviation of the height of
Grade 10 students in a particular city is 1.2 inches. A researcher wanted to be
sure on estimating the population mean height to within 0.035 of its true value.
Using a 99% confidence interval, find the sample size that the researcher needs.
Solution:
Clearly, we have the following given information: 𝜎 = 1.2, 𝐸 = 0.035, and the
confidence level is 99%. Thus, we may calculate the value of 𝑛 as shown below.
𝑧𝛼/2 ⋅ 𝜎 2 2.58 ⋅ 1.2 2
𝑛=( ) =( ) = 7 824.67
𝐸 0.035
Thus, the researcher needs to take 7, 825 Grade 10 students as a sample.
DO_Q3_STATISTICS&PROBABILITY_
37
GRADE 11_LESSON9
Read and analyze each item carefully. Choose the letter that corresponds to your
answer and write it in your notebook.
1. How is the expected length of a confidence interval related to the margin of error?
A. The expected length is half the margin of error.
B. The expected length is twice the margin of error.
C. The expected length is equal to the margin of error.
D. The expected length is two more than the margin of error.
38 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON9