Statistics & Probability: Quarter 3

Download as pdf or txt
Download as pdf or txt
You are on page 1of 40

SENIOR HIGH SCHOOL

STATISTICS &
PROBABILITY
QUARTER 3
MODULE 1

DO_Q3_STATISTICS & PROBABILITY_GRADE 11_MODULE1


i
RESOURCE TITLE: Statistics & Probability
Alternative Delivery Mode
Quarter 3 – Lessons 1-9
Second Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of
the Government of the Philippines. However, prior approval of the government agency or office
wherein the work is created shall be necessary for exploitation of such work for profit. Such
agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this module are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials from their
respective copyright owners. The publisher and authors do not represent nor claim ownership
over them.

Published by the Department of Education

Development Team of the Module

Writers: MARY ANN B. ORIO, Dalandanan National SHS, SDO-Valenzuela


JHERALD Q. GABICA, Bignay National SHS, SDO-Valenzuela
VIRGILIO G. VENTURA, Caruhatan National SHS, SDO-Valenzuela
DAISY LYN F. MARIANO, Parada National SHS, SDO-Valenzuela
Reviewers:
Content Editor: REBECCA M. BIÑAS, Malinta National SHS, SDO-Valenzuela
Language Editor: HELEN P. ADVENCOLA, Valenzuela City School of Mathematics
and Science JHS, SDO- Valenzuela
Illustrator:
Layout Artist: RAPHAEL A. LOPEZ
Management Team:
MELITON P. ZURBANO, Assistant Schools Division Superintendent (OIC-SDS)
FILMORE A. CABALLERO, CID Chief
JEAN A. TROPEL, Division EPS In-Charge of LRMS
EDNA LLANERA, Division SHS Focal Person
MARILYN B. SORIANO, Division Mathematics Coordinator

Printed in the Philippines by ________________________

Department of Education – National Capital Region – SDO VALENZUELA

Office Address: Pio Valenzuela St., Marulas, Valenzuela City


Telefax: (02) 292 – 3247
E-mail Address: [email protected]

ii
Targets:
1. Illustrate random variables (discrete and continuous) (M11/12SP-IIIa-1);
2. Distinguish between a discrete and a continuous random variable
(M11/12SP-IIIa-2);
3. Find the possible values of a random variable(M11/12SP-IIIa-3); and
4. Illustrate a probability distribution for a discrete random variable and its
properties (M11/12SP-IIIa-4).

Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
______1.) It is a random variable that has countable set of possible outcomes.
A. constant B. continuous C. discrete D. finite

______2.) The height and weight of a student-respondent represent what type of


random variable?
A. constant B. continuous C. discrete D. finite

For numbers 3-5, refer to the experiment below.

Three coins are tossed and the random variable Y gives the number of tails.

_____3.) What could be the values of random variable Y?


A. 1, 2, 3 B. 0, 1, 2 C. 0, 1, 2, 3 D. 1, 2, 3

_____4.) Based on the random experiment, the value of P(Y=0) is ____


2 3 1 1
A. 3 B. 8 C. 4 D. 8

_____5.) The value of P(Y=2) should be____.


2 3 1 1
A. 3 B. 8 C. 4
D. 8

DO_Q3_STATISTICS&PROBABILITY_
1 GRADE 11_LESSON1
Lesson
Exploring Random Variables
1

Random Variable
- a function that associates a numerical value with every outcome of
an experiment

Discrete Random Variable Continuous Random Variable


- one which may take on only - one which takes an infinite
a countable number of number of possible values,
distinct values typically measurable quantities
.
RANDOM VARIABLE
➢ also called as stochastic variable
➢ a set of possible values from a random experiment
➢ essentially a variable, usually denoted as X or any capital letter of the alphabet
because its value is not constant
➢ assumes different values due to chance

A random variable is a discrete random variable if its set of possible outcomes


is countable. Mostly, discrete random variables represent count data, such as
number of soda cans, number of chairs and number of students in a class.

A random variable is a continuous random variable if it takes on values on a


continuous scale. Often, continuous random variables represent measured data
such as height, weight, volume of water, amount of solution in an alcohol and the
time required to run a mile.

Below are examples of discrete random variables because their possible values
are obtained through counting.
Example 1. Suppose two coins are tossed. Let Y be the random variable representing
the number of tails that occur. Find the values of the random variable, Y.

Illustration:

So, the possible values of the random variable (range space) Y are 0,1 and 2.

DO_Q3_STATISTICS&PROBABILITY_
2
GRADE 11_LESSON1
Example 2. Anne conducted an experiment and tested four cell phones at random.
She wants to find out the number of defective cell phones that occur. Thus, to each
outcome in a sample space she assigned a value. These are 0, 1, 2, 3 and 4. If there
is no defective cell phone, she assigned number 0; if there is 1 defective cell phone,
she assigned number 1; if there are two defective cell phones; she assigned 2; she
assigned 3, if there are three defective cell phones; and 4 if the four cell phones are
all defective. The number of defective cell phones is a random variable.
Illustration: Let D represent the defective cell phone and N represent the non-
defective cell phone. If we let X be the random variable representing the number of
defective cell phones, what are the values of the random variable X?

Value of the Random Value of the Random


Possible Possible
Variable X (Number of Variable X (Number of
Outcomes Outcomes
defective cell phone) defective cell phone)
NNNN 0 DNNN 1
NNND 1 DNND 2
NNDN 1 DNDN 2
NNDD 2 DNDD 3
NDNN 1 DDNN 2
NDND 2 DDND 3
NDDN 2 DDDN 3
NDDD 3 DDDD 4

The possible values of this random variable (range space) are 0, 1, 2, 3 and 4.
Below are examples of continuous random variable.
Example 1. Suppose an experiment is conducted to determine the distance that a
certain type of car will travel using 20 liters of gasoline over a prescribed test course.
If distance D is a random variable, then we have an infinite number of distances that
cannot be equated to the number of whole numbers.
Example 2. XYZ machine was operated, and the recorded time it starts to experience
a glitch G illustrates a continuous random variable since the value of the variable may
be assigned using measurement.
PROBABILITY DISTRIBUTION
➢ also known as probability mass function/discrete probability distribution
➢ a table that gives a list of probability values along with their associated value in
the range of a discrete random variable

The following properties are observed given that 𝑝𝑖 are individual probabilities
for each value (𝑥𝑖 ) of the random variable:
1. Each probability value ranges from 0 to 1, in symbols, 0≤ 𝒑𝒊 ≤ 𝟏.
2. The sum of all the individual probabilities in the distribution is equal to 1.
Thus, n
𝑝𝑖 + 𝑝2 + 𝑝3 +……𝑝𝑛 = ∑ 𝑝𝑖 = 1.
i=1
Example 1. Three coins are tossed, and the random variable Z gives the number of
heads. Find the values of the random variable Z.

DO_Q3_STATISTICS&PROBABILITY_
3
GRADE 11_LESSON1
Sample space: {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}

The range space is the set {0,1, 2, 3}.


1
P (0) = = 0.125
8
3 Number of heads (Z) 0 1 2 3
P (1) = 8
= 0.375 P(Z) 0.125 0.375 0.375 0.125
3
P (2) = = 0.375
8
1
P (3) = 8
= 0.125

Let’s learn more about the properties of Discrete Probability Distribution.

Example 1: An investor has 5 stocks that she follows each day. The random variable
being studied is Y. Based on the table below, what is P(0)?

Y 0 1 2 3 4 5
P (Y) ? 0.27 0.34 0.11 0.07 0.02

Solution:
Since the sum of all the individual probabilities in the distribution is equal
to 1,
P(0) + P(1) + P(2) + P(3) + P(4) + P(5)= 1
P(0) + 0.27 + 0.34 + 0.11 + 0.07 + 0.02= 1
P(0) + 0.81= 1
P(0) = 1 - 0.81
P(0) = 0.19

Therefore, the probability that the investor has zero (0) stocks is 0.19.

Roxas family has three children. Let A represent the number of boys.
Construct a probability distribution for the random variable A.

Value of the Random Value of the Random


Possible Possible
Variable A (Number of Variable A (Number of
Outcome Outcome
Boys) Boys)
BBB 3 GBB 2
BBG 2 GBG 1
BGB 2 GGB 1
BGG 1 GGG 0

DO_Q3_STATISTICS&PROBABILITY_
4
GRADE 11_LESSON1
A. Directions: Classify the following random variables. In your notebook, write
DISCRETE if it is a discrete random variable. Otherwise, write
CONTINUOUS.
____________1.) number of defective computers
____________2.) weight of newborns in a hospital
____________3.) the time students had breakfast
____________4.) number of hours spent in watching TV
____________5.) the amount of time a student spends on Facebook

B. Directions: Write PD if the given represents a probability distribution and


NPD if does not. Write your answer in your notebook.

______1.) P(1) = 0, P(2) = 0.71, P(3) = 0.39


______2.) P(1) = 0.42, P(2) = 0.31, P(3) = 0.37
______3.) X 0 2 4 6 8
1 1 1 1 1
P(X)
6 6 3 6 6

Targets:
1. Compute probabilities corresponding to a given random variable
(M11/12SP-IIIa-6);
2. Illustrate the mean and variance of a discrete random variable
(M11/12SP-IIIb-1); and
3. Calculate the mean and variance of a discrete random variable
(M11/12SP-IIIb-2).

Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.
______1.) The sum of the product of each value of a discrete random variable X and
its probability is referred as its _____.
A. standard deviation C. mean
B. variance D. variables
______2.) Consider the given discrete probability distribution. Find the probability
that M equals 5.
M 3 5 7 9
P(M) 0.12 ? 0.26 0.04
A. 0.58 C. 1
B. 0.28 D. 0.48

DO_Q3_STATISTICS&PROBABILITY_
5
GRADE 11_LESSON2
______3.) Mang Memong’s bakery has determined a probability distribution for the
number of ensaymada it sells in a given day. The distribution is as follows:

Number sold in a day 0 5 10 15


Probability (number sold) 0.13 0.20 0.32 0.35
Find the number of ensaymada that Mang Memong’s bakery expects to sell
in a day.
A. 5.25 C. 5.52
B. 1 D. 9.45
_______4.) Let V is denoted as the number of heads in three tosses of a coin.
Determine the variance of this random variable.
A. 3.5 C. 1.5
B. 2.5 D. 0.5
_______5.) A random variable Y can take only two values, 2 and 5 such that P(2) =
0. 40 and P(5)= 0.60. Determine the mean of Y.
A. 3 C. 2.8
B. 3.8 D. 2.45

Lesson Computing the Mean and Variance of


2 a Discrete Probability Distribution

Mean of a Discrete Probability Distribution


The mean 𝜇 of a discrete random variable is the central value or average of its
corresponding probability mass function. It is computed using the formula:
𝝁 = ∑ 𝑿 ∙ 𝑷(𝑿)

Variance of Discrete Probability Distribution


The variance of a discrete probability distribution is given by the formula:
𝝈𝟐 = ∑(𝑿 − 𝝁)2 ∙ 𝑷(𝑿)
Note: In this material, decimals are rounded off into two decimal places.

Mean of a Discrete Random Variable

➢ The mean can be regarded as a measure of ‘central location’ of a random variable.


It is the weighted average of the values that X can take, with weights provided by
the probability distribution.

DO_Q3_STATISTICS&PROBABILITY_
6
GRADE 11_LESSON2
➢ It is also sometimes called the expected value or expectation of X denoted by E(X).

Variance of a Discrete Random Variable

➢ It shows the variability or the scatterings of the random variables.


➢ It shows the distance of a random variable from its mean.

Sample Problems:
Example 1. Calculate the mean and variance for a random variable X defined as the
number of tails in two tosses of a coin.

Steps to compute the Mean (𝝁):


Step 1. Construct Probability Step 2. Multiply X and P (X).
Distribution Table.
X P(X) X∙ 𝑷(𝑿)
X 0 1 2 0 0.25 0

P (X) ¼ = 0.25 2/4 = ½ = 0.50 ¼ = 0.25 1 0.50 0.5

2 0.25 0.5
Step 3. Sum up the product of X and P(X).
𝝁 = ∑ 𝑿 ∙ 𝑷(𝑿)
𝜇 = 0 + 0.5 + 0.5
𝝁 =1

Steps to compute the Variance( 𝝈𝟐 ) ∶


The following steps will simplify the process of computing the variance.
Step 1. Compute the mean value of the random variable.
Step 2. Subtract each value from the mean and square the differences.
Step 3. Multiply the squared differences by the corresponding probabilities.
Step 4. Add all the products.

Note: From our previous example, 𝜇 =1


X P(X) (X-𝝁) (𝐗 − 𝝁)𝟐 (𝐗 − 𝝁)𝟐 ∙ 𝑷(𝑿)

0 0.25 0 -1= -1 (−1)2 =1 (1)(0.25) = 0.25

1 0.50 1-1= 0 (0)2 =0 (0)(0.50) = 0

2 0.25 2-1 = 1 (1)2 =1 (1)(0.25) = 0.25

𝝈𝟐 = ∑(𝑿 − 𝝁)2∙ 𝑷(𝑿) = 0.25 + 0 + 0.25 𝝈𝟐 = 𝟎. 𝟓

DO_Q3_STATISTICS&PROBABILITY_
7
GRADE 11_LESSON2
Example 2. Calculate the mean and variance for a random variable X defined as
the sum of the scores on the two dice.

Illustration:

The range space/possible


values of Y are 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 12.

X 2 3 4 5 6 7 8 9 10 11 12

P(X) 1/36 2/36= 3/36= 4/36= 5/36= 6/36= 5/36= 4/36= 3/36= 2/36= 1/36=
=0.03 0.06 0.08 0.11 0.14 0.17 0.14 0.11 0.08 0.06 0.03
X ∙ P(X) 0.06 0.18 0.32 0.55 0.84 1.19 1.12 0.99 0.8 0.66 0.36

Mean (𝝁) = ∑ 𝑿 ∙ 𝑷(𝑿)


= 0.06 + 0.18 + 0.32 + 0.55 + 0.84 + 1.19 + 1.12 + 0.99 + 0.8 + 0.66 + 0.36
𝝁 = 7.07

Variance
(𝝈𝟐 ) = ∑(𝑿 − 𝝁)2∙ 𝑷(𝑿)
= 0.77 + 0.99 + 0.75 + 0.47
+ 0.16 + 0 + 0.12 + 0.41 + 0.69
+ 0.93 + 0.73

𝝈𝟐 = 6.02

DO_Q3_STATISTICS&PROBABILITY_
8
GRADE 11_LESSON2
A survey conducted by a researcher showed the number of students’ siblings
of Grade 11 – Albert Bandura (GAS) in Bignay National High School-SHS.

Calculate the mean and variance.

Number of siblings (V) 1 2 3 4

Probability P(V) 0.09 0.36 0.43 0.12

Mean (𝝁) = ∑ 𝑽 ∙ 𝑷(𝑽) = 1(0.09) + 2(0.36) + 3(0.43) + 4(0.12)


= 0.09 + 0.72 + 1.29 + 0.48
𝝁 = 2.58

V P(V) V- 𝝁 (V- 𝝁)2 P(V) ∙(V- 𝝁)2

1 0.09 -1.58 2.50 0.23 Variance (𝝈𝟐 )


2 0.36 -0.58 0.34 0.12 𝝈𝟐 = 0.23 + 0.12 + 0.08 + 0.24
3 0.43 0.42 0.18 0.08 𝝈𝟐 = 0.67
4 0.12 1.42 2.02 0.24

The probability distribution below shows the number of hours spent by


students using mobile phones in a day. Compute the mean and variance.

Y 6 8 10

P(Y) 0.14 0.52 0.34

Solutions: Y P(Y) Y∙ 𝑷(𝒀) Y- 𝝁 (Y- 𝝁)2 (Y- 𝝁)2∙ 𝑷(𝒀)

6 0.14 0.84 -2.4 5.76 0.81

8 0.52 4.16 0.4 0.16 0.08

10 0.34 3.4 1.6 2.56 0.87

Mean (𝝁) = 𝟖. 𝟒 Variance (𝝈𝟐 ) = 1.76

DO_Q3_STATISTICS&PROBABILITY_
9
GRADE 11_LESSON2
A. The table below shows the probability distribution of a discrete random
variable B.
B 0 1 2 3 4 5

P(B) 0.03 0.17 0.22 0.31 0.15 0.12

For items 1-3. Find the following probabilities:


1. P(B=1) =________ 2. P(B≤ 4) = __________ 3. P(0<B<4) = ________
B. Compute for the mean and variance of the given probability distribution
below.

Y 0 1 2

P(Y) 0.12 0.63 0.25

4. Mean: _____ 5. Variance: _____

Targets:
1. Interpret the mean and variance of a discrete random variable
(M11/12SP-IIIb-3); and
2. Solve problems involving the mean and variance of probability distributions
(M11/12SP-IIIb-4).

Do this Pre-Test: Choose the letter of the best answer. Write your answer in
your notebook.
______1.) In a pizza parlor, the following probability distribution was obtained for the
number of toppings ordered on a large pizza. Find the mean and standard
deviation for the random variable.
A 0 1 2 3 4
P(A) 0.30 0.40 0.20 0.06 0.04

A. mean: 1.14; standard deviation: 1.04


B. mean: 1.30; standard deviation: 2.38
C. mean: 1.30; standard deviation: 1.54
D. mean: 1.54; standard deviation: 1.30

DO_Q3_STATISTICS&PROBABILITY_
10
GRADE 11_LESSON3
______2.) Michaella is a star player on her college soccer team. She loves to score
goals and she does this on a regular basis. The probability that she will
score 0, 1, 2, or 3 goals are 0.25, 0.35, 0.25, and 0.15 respectively. What is
her most likely number of goals?
A. 1 B. 1.2 C. 1.3 D. 1.4
______3.) Two fair coins are tossed. If Z represents the number of tails, what is the
expected value of Z?
A. 1 B. 1.5 C. 0.5 D. 2
______4.) A’s Burger house has a probability distribution of sales as shown below:
P P(P)
25 0.20
40 0.20
50 0.60
Using this probability distribution, what is the expected value and
variance?
A. Expected Value: 43 C. Expected Value: 43
Variance: 46 Variance: 96
B. Expected Value: 43 D. Expected value: 43
Variance: 69 Variance: 60
_______5.) Three patients are treated with a specific medicine. The probabilities for
0, 1, 2, or 3 successes are 0.24, 0.12, 0.30, and 0.34 respectively. What
is the average number of successes in this treatment?
A. 1.71 B. 1.72 C. 1.73 D. 1.74

Lesson Solving Problems Involving the Mean


3 and Variance of a Discrete
Probability Distribution

The probabilities of a factory machine manufacturing 0, 1, 2 defective parts in one


day are 0.90, 0.07, and 0.03 respectively. Find the mean and variance.
X 0 1 2
P(X) 0.90 0.07 0.03

Solutions: 𝑀𝑒𝑎𝑛(𝝁) = ∑ 𝑿 ∙ 𝑷(𝑿)


= [(0 )(0.90)] + [(1)(0.07)] + [(2)(0.03)]
= 0 + 0.07 + 0.06
𝝁 = 0.13
X P(X) X-𝝁 (𝐗 − 𝝁)𝟐 (𝐗 − 𝝁)𝟐 ∙ 𝑷(𝑿)
0 0.90 -0.13 (−0.13)2 = 0.02 (0.02)(0.90) = 0.018
1 0.07 0.87 (0.87)2 =0.76 (0.76)(0.07) = 0.05
2 0.03 1.87 (1.87)2 =3.50 (3.50)(1.87) = 0.11
Variance (𝝈𝟐 ) = ∑(𝑿 − 𝝁)2∙ 𝑷(𝑿)
= 0.02 + 0.05 + 0.11
𝝈 = 0.18
𝟐

DO_Q3_STATISTICS&PROBABILITY_
11
GRADE 11_LESSON3
To get the variance of a discrete probability distribution we can use this
alternative formula: Variance (𝝈𝟐 ) = ∑ 𝑿𝟐 ∙ 𝑷(𝑿) - 𝝁𝟐
The standard deviation of a discrete probability distribution is given by the
formula:
Standard deviation (𝝈)=√∑ 𝑿𝟐 ∙ 𝑷(𝑿) − 𝝁𝟐
where: X = value of the random variable
P(X) = probability of the random variable X
𝜇 = mean of the probability distribution

The Mean and Variance of a Discrete Probability Distribution


The Mean, also called expected value, summarizes the probability
distribution of a discrete random variable. It also gives the behavior of the random
variable under observation. Variance is another important characteristic in
describing a given probability distribution. It can measure the degree of homogeneity
or heterogeneity of a given set of data.
A data is nothing without the knowledge of interpreting it. So, in this module,
you’ll learn how to interpret the mean and variance of a discrete probability
distribution.
Example 1: A plastic bag contains 10 balls. Three are numbered 3, one is numbered
4, and six are numbered 5. The balls were mixed, and one is recorded. Then it is
replaced. If the experiment is repeated many times, find the mean and the variance.
Solutions:
Mean (𝝁) = ∑ 𝑿 ∙ 𝑷(𝑿) = 3(0.3) +
X P(X) X∙ X- 𝝁 (X- (X- 𝝁)2∙
4(0.1) + 5(0.6) = 4.3
𝑷(𝑿) 𝝁)𝟐 𝑷(𝑿)
Variance (𝝈𝟐 )
3 3 0.9 -1.3 1.69 0.51 = [(3)² (0.3) + (4) ² (0.1) + (5) ²
= 0.3 (0.6)] – (4.3) ²
10
= 19.3 – 18.49
4 1
= 0.1 0.4 -0.3 0.09 0.01
10 𝝈𝟐 = 0.81
Standard deviation (𝝈) = √0.81
5 6
= 0.6 3 0.7 0.49 0.29
10 =0.9
Interpretation: So, the average
number of numbered balls that will be picked is 4.3. Although the bag will never
show a ball with number 4.3, this implies that picking a ball many times, the
theoretical mean would be 4.3.

The owner of Malinamnam Restaurant asks the staff to record the number
of people who order their specialty in weekdays. The table below shows the
probability distribution. Solve for the following:
a) mean b) variance c) standard deviation

12 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON3
W 1 2 3 4 5

P(W) 0.13 0.15 0.25 0.19 0.28

a.) Mean (𝜇) = ∑ 𝑋 ∙ 𝑃(𝑋) = 1(0.13) + 2(0.15) + 3 (0.25) + 4(0.19) + 5(0.28)


= 0.13 + 0.30 + 0.75 + 0.76 + 1.4
𝝁 = 3.34
b.) To answer this, we will use the alternative solution:
Variance (𝜎 2 ) = ∑(𝑊)2 ∙ 𝑃(𝑊) - 𝜇2
= [(12 ) (0.13)] + [(22 )(0.15)] + [(32 )(0.25)] + [(42 )(0.19)] +
[(52 )(0.28)] − (3.34)2
= (0.13 + 0.6 + 2.25 + 3.04 + 7) – 11.16
= 13.02 - 11.16
𝝈 = 1.86
𝟐

c.) Standard deviation (𝝈) = √𝜎 2 = √1.86 = 1.36


Interpretation: We can say that the average number of customers who will order
their specialty is about 3.34.

Organizers of a concert are budgeting ticket sales to a maximum of 4 tickets


per customer. Let Q be the number of tickets purchased by a random customer. The
probability distribution of Q is shown below. Find the mean, variance and standard
deviation.

Mean (𝝁) = [(1)(0.3)] + [(2)(0.1) + [(3)(0.2)] + [(4)(0.4)] = 2.7

Variance (𝝈𝟐 ) = {[(12)(0.3)] + [(22)(0.1)] + [(32)(0.2)] + [(42)(0.4)]} − (2.7)2 = 1.61

Standard Deviation (𝝈) = √1.61= 1.27

Interpretation: If we look at in large number of customers, then each


customer on the average will have purchased about 2.7 tickets (theoretical
mean).

DO_Q3_STATISTICS&PROBABILITY_
13
GRADE 11_LESSON3
A. Directions: Given the table below; calculate the mean, variance and standard
deviation.
D 1 2 3
P(D) 0.41 ? 0.41

1. P(2): ____________ 3. Variance: ___________


2. Mean: ___________ 4. Standard deviation: ___________

B. Directions: Find the variance and standard deviation of the following:


1. A probability distribution of X, which can take only the values 1,
2 1 2
2, and 3, given that P(1) = 5 , P(2) = 5, and P(3) = 5.

Variance: _________ Standard Deviation: __________

Targets:
1. Identify regions under the normal curve corresponding to different
standard normal values (M11/12SP-IIIc-3);
2. Convert a normal random variable to a standard normal variable and vice versa
(M11/12SP-IIIc-4); and
3. Compute probabilities and percentiles using the standard normal table
(M11/12SP-IIId-1).

Do this Pre-Test: Choose the letter of the best answer. Write your answer in your
notebook.

_______1. The total area under the normal curve is______?


A. 2 B. 1 C. 4 D. 3
_______2. It is a random variable where the data can take infinitely many values.
A. Discrete random
B. Continuous random variable
C. Both discrete and continuous random variable
D. None of these
_______3. Let 𝑥 = 8 be a Normal random variable with parameters 𝜇 = 1 𝑎𝑛𝑑 𝜎 = 3.
Convert it to Z-score.
A. 1.325 B. 5.322 C. 0 D. 2.333
_______4. What proportion of cases in a normally distributed population will have z-
scores greater than 1.0 or less than -1.0?
A. 0.68 B. 0.50 C. 0.32 D. 0.10
_______5. In normal distribution, define the relationship of the mean, median and
DO_Q3_STATISTICS&PROBABILITY_
14
GRADE 11_LESSON4
the mode.
A. Mean > median > mode C. Mean < median < mode
B. Mean = median = mode D. No relation found

Lesson
Normal Random Variable
4
In this lesson you will learn to identify regions under the normal curve
corresponding to different standard normal values, converts a normal random
variable to a standard normal variable and vice versa, computes probabilities and
percentiles using the standard normal table. (M11/12SP-IIIc-3-4, M11/12SP-IIIc-
d-1)

When random variable is discrete, we can assign a probability to each value


that it can take and obtain its probability distribution. However, not all random
variables are discrete like of what the previous lessons are about.
Continuous Random Variable can assume infinitely many values
corresponding to points on a line interval. Therefore, we must use a different
approach to generate the probability distribution for a continuous random variable.
The graphical form of probability
distribution for continuous random
variable is a smooth curve (bell curve),
this curve is also known as probability
density function. As we observed, if
continuous random variable has a bell-
shaped probability distribution, it is
also known as normal random variable
and its probability is called normal
distribution.

Illustration: The Normal Distribution


You can visit the website below for more detailed explanation:
https://www.youtube.com/watch?v=79duxPXpyKQ&feature=youtu.be

Characteristics of A Normal Distribution

➢ The total area under the curve is 1.


➢ A normal distribution curve is unimodal.
➢ The curve is symmetrical about the mean.
DO_Q3_STATISTICS&PROBABILITY_
15
GRADE 11_LESSON4
➢ The normal distribution curve is a bell-shaped.
➢ The distribution curve is asymptotic to the x-axis.
➢ The mean, median, and mode are equal and located at the center of the
distribution.

In order to avoid unnecessary calculations and graphing the scale a Normal


Distribution curve is Normal Distribution to Standard Normal Distribution converted
to a standard scale called the z score or standard unit scale.
Normal Distribution Standard Normal Distribution

Standard Normal Distribution


This graph gives the Standard Normal Graph with a standardized scale.

The area between the Standard


Normal Curve and the z – axis
between - and + is 1.

Standard Units (z-scores)


z – scores define the position of a score in relation to the mean using the
standard deviation as a unit of measurement.
z – scores are very useful for comparing data points in different distributions.
The z – score is the number of standard deviations by which the score departs
from the mean. This standardizes the distribution.
The formula for the z-scores is x – data point
𝒙−𝝁
𝒛 = 𝝈 , where: 𝜇 – population mean
𝜎 – standard deviation of the population
Normal Probabilities – is the area under the normal curve.

Note that the area of any individual


value is zero.

Probability as the Area Under the Normal Curve

The total area under the curve is


1.0, and the curve is symmetric, so
half is above the mean, half is
below.
16 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON4
Empirical Rule
What can we say about the distribution of values around the mean? There are
some general rules:

But to find the probability beyond the empirical rule is possible through the
use of the standardized normal table (See appendices).
The Cumulative Standardized
Normal table gives the probability
less than a desired value for Z (i.e.,
from negative infinity to Z)

The ROW shows The COLUMN gives the value of Z to


the value of Z to the second decimal point
the first decimal
point The value within the table gives the
PROBABILITY

The following is the general procedure for finding probabilities under the
normal curve as to find P (a < X < b) when X is distributed normally:
1. Draw the normal curve for the problem in terms of X
2. Translate x-values to z-values
3. Use the standard normal table

Let’s learn more about Normal Probability Distribution by analyzing the


following examples:
Example 1:
If X is distributed normally with mean of 100 and standard deviation of 50, the Z
value for X = 200 is?
𝑥−𝜇 200−100
Solution: 𝑧= 𝜎
= 50
= 2.0

This says that X = 200 is two standard deviations (2 increments of 50 units)


above the mean of 100 Note that the distribution is the same,
only the scale has changed. We can
express the problem in original units (X)
or in standardized units (Z)

Example 2: Suppose X is normal with mean 8.0 and standard deviation 5.0.

DO_Q3_STATISTICS&PROBABILITY_
17 GRADE 11_LESSON4
a. Find 𝑃(𝑥 < 8.6) b. Find 𝑃(𝑥 > 8.6) c. Find 𝑃(80 < 𝑥 < 8.6)
Solution (a): 𝑃(𝑥 < 8.6)
𝑥−𝜇 8.6−8.0
Step1 Step 2. 𝑧 = = = 0.12
𝜎 5.0

Step 3:

Solution (b): 𝑃(𝑥 > 8.6)


Step 2. 8.6 [x] is equivalent to 0.12 [z]
Step1.
from solution (a)

Solution (c):

Step 3

Solution c. 𝑃(80 < 𝑥 < 8.6)


Step1. Step 2. z-value of 80 is 0 since it is the
mean and z-value of 86 is 0.12
as solution (a)

Step 3.

DO_Q3_STATISTICS&PROBABILITY_
18
GRADE 11_LESSON4
The percentage of students at school between 1.1m and 1.7m tall is 2. 95%.
Assuming that the data is normally distributed, calculate the mean and standard
deviation. Construct the normal curve corresponding to the data.

Solution: The mean is halfway


between 1.1m and 1.7m:

1.1𝑚 + 1.7𝑚
𝜇= = 1.4𝑚
2
95% is 2 standard deviations either side of the mean (a total of 4
standard deviations). So:
1.7𝑚 − 1.1𝑚
𝜎= = 0.15𝑚
4

Short Answer. Supply the correct answer. Write your answer in your notebook.
_________1. It is a random variable where the data can take infinitely many values.
_________2. True or False. Getting the probability distribution in discrete random
variable is the same in continuous random variable.
_________3. The graphical form of the probability distribution for continuous
random variable is a________________.
Problem Solving
1. Sixty eight percent of the marks in a test are between 51 and 64. Assuming this
data is normally distributed, what is the standard deviation?
2. Using the table of values find: (a) 𝑃(𝑍 ≤ 1.31), (b) 𝑃(𝑍 ≥ 1.32), (c)

Targets:
1. Illustrate random sampling (M11/12SP-IIId-2); and
2. Distinguish between parameter and statistic (M11/12SP-IIIc-3).

DO_Q3_STATISTICS&PROBABILITY_
19
GRADE 11_LESSON5
Do this Pre-Test: Answer the questions below to measure what you already know
about the next lessons. Write your answer on your lecture
notebook.
1. Sampling is about ______ a sample from a population.
A. adding B. selecting C. including
2. Simple random sampling is a ______ sampling method.
A. confusing B. non-probability C. probability
3. It is defined as the characteristic of a sample.
A. Parameter B. Statistic C. Random sampling
4. It refers to the numbers that summarize data for entire population.
A. Parameter B. Statistic C. Random sampling
5. x̄, 𝑠 2 , and s are commonly used symbols to represent the mean, variance and
standard deviation of the ___________.
A. population B. sample C. statistics

Lesson Random Sampling and Sampling


5 Distributions of Statistics
In this lesson, the learners will understand the concept of random sampling
and sampling distributions of statistics. Parameter and sample will also be defined
as an extension of the lesson.

Random Sampling is the process of choosing a representative sample from a


target population and collecting data from that sample in order to understand
something about the population as a whole. The types of random sampling are
simple, stratified and multistage.
Simple random sampling is the basic sampling techniques where we select
a group of subjects (a sample) for study from a larger group (a population). A simple
random sample is meant to be an unbiased representation of a group. It is considered
a fair way to select a sample from a larger population since every member of the
population has an equal chance of getting selected. Example: names of 25 employees
being chosen from a company of 250 employees.
Stratified random sampling is a factor which divides the population into
sub-populations (groups/strata) and we may expect the measurement of interest to
vary among the different sub-populations.
The word Strata is the plural form of stratum, which means subgroup. If there
are two or more subgroup, it is called strata. We use stratified random sampling for
population that has a distinct element because in a stratified random sampling, we
are dividing the population into a certain group based on their classification.
Example: a farmer wishes to milk each cow type in his herd which consists of four
(4) breeds. He could divide his cows into four subgroups and collect milk.

20 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON5
Multistage random sample is constructed by taking a series of simple
random samples in stages. This type of sampling is often more practical than simple
random sampling for studies requiring “on location” analysis such as door-to-door
surveys.

What is parameter?

Parameter refers to numbers that summarize the data for an entire


population. It is any numerical quantity that characterizes a given population or
aspect of it. It tells something about the whole population.

What is statistic?

Statistic refers to numbers that summarize data from a sample. It is a single


measure of some attribute of a sample. It is any function (attribute) of a sample.

What is the difference between a parameter and a statistic?

Parameter is a characteristic of a population, while statistic is a


characteristic of a sample. A Population Parameter is a summary measure to
describe the characteristics of the whole population. It is usually denoted by Greek
letters. A Sample Statistic is a summary measure computed from a sample to
describe the characteristic of the whole population.

Measure Statistic Parameter

Mean x̄ 𝜇

Variance s2 𝜎2

Standard Deviation s 𝜎

Activity Understanding Sampling Distribution

1. A new business with 100 employees wants to know whether staff would like to
have childcare facilities on site. An estimate could be made by asking a sample of
20 employees if they would use the childcare facilities and multiplying the
number who said yes by 5. Decide which people to ask.
Solution:
To get a random sample of 20 people, we could assign each person a number from
1 to 100 and then select 20 numbers using a random method.
One Random method is to write the 100 numbers on separate slips, put them in
a bag, shake them, and take 20 of them out without looking.

DO_Q3_STATISTICS&PROBABILITY_
21
GRADE 11_LESSON5
Alternative solution using technology:
The button might say “RANDOM” (SHARP). Other makes may have a button “Ran”
or “Ran#” or “RanInt”. Whichever you have, selecting and pressing ENTER
repeatedly gives random numbers.
Generate a Random Number between 0 and 99.

3. A shipment to a warehouse consists of 3,500 MP3 players. The manager chooses


a random sample of 50 MP3 players and finds that 3 are defective. How many
MP3 players in the shipment are likely to be defective?
Solution: We will use ratio and proportion:
𝑑𝑒𝑓𝑒𝑐𝑡𝑖𝑣𝑒 𝑀𝑃3𝑠 𝑖𝑛 𝑎 𝑠𝑎𝑚𝑝𝑙𝑒 𝑑𝑒𝑓𝑒𝑐𝑡𝑖𝑣𝑒 𝑀𝑃3𝑠 𝑖𝑛 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛
=
𝑠𝑖𝑧𝑒 𝑜𝑓 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒 𝑜𝑓 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛
3 𝑥
= 𝑥 = 210
50 3,500
Based on the sample, you can predict that 210 MP3 players in the shipment
would be defective.

Exercise what you have learned by answering the questions below. Write
your answer on your activity notebook.
1. How many of each of the 3 types of computer component should be taken in a
sample of 100 categorized by type of component?

Type A B C D

Number 300 260 40 600

2. A manager samples the receipts of every fifth person who goes through the line.
Out of 50 people, 4 had a mispriced item. If 600 people go to this store each day,
how many people would you expect to have a mispriced item?

Short Answer. Supply the correct answer. DO_Q3_STATISTICS&PROBABILITY_


_______1. It is a characteristic of a sample.
GRADE 11_LESSON5
22
_______2. These are numbers that summarize data for entire population.
_______3. It is used to acquire a section of the population to perform an experiment
or observational study.
_______4. True or False. A magician picks 6 children out of 30 attendees is an
example of a random sampling.
_______5. True or False. The governor who picks scholars among all the students
in his province is not an example of random sampling.
Application
_______6. At a fundraising event, every attendee’s name is entered for a chance to
win a prize. Names are selected at random, then returned to the jar,
giving individuals multiple opportunities to win the prizes. Is this an
example of random sampling?
_______7. If I surveyed all the girls in a particular school, and find out that 60% of
the girls have blue eyes, for the population of girls in the school, 60%
blue eyes represent ______.
_______8. If a survey of 500 16-year-old boys in a city finds that their average.

Targets:
1. Identifies sampling distributions of statistics (sample mean)
(M11/12SP-IIId-4); and
2. Finds the mean and variance of the sampling distributions of the sample mean
(M11/12SP-IIId-5.

Do this Pre-Test: Choose the letter of the correct answer. Write your answer in
your notebook.
1. A random variable is a variable usually denoted by X that has a __________
numerical value determined by chance for each outcome of a procedure or
experiment.
A. small B. large C. single D. double
2. A probability distribution is a graph, table, or function that gives the _______ for
each value of the random variable.
A. input B. output C. outcome D. probability
3. The mean of a probability distribution is also called __________.
A. extreme value C. extraneous value
B. expected value D. erroneous value
4. In a probability distribution, the probability of each random variable must be
between __________.
A. 0 and 1 B. 0 and 10 C. -1 and 0 D. 1 and 2
5. In a probability distribution, the sum of all the probabilities must be equal to
__________.
A. 100 B. 10 C. 1 D. 0.99

DO_Q3_STATISTICS&PROBABILITY_
23
GRADE 11_LESSON6
Lesson
Sampling Distribution of the MEAN
6
In this lesson, the learners will deepen their concept of sampling distribution
of the mean. Skills on the computation of the mean and variance of the sampling
distribution of the mean will also be developed.

Let’s say, we have a finite population of size N=5 that consists of the numbers
1, 2, 3, 4, and 5. (We know that for this population, the mean, µ = 3 and the variance,
σ2 = 2). Now let’s take a random sample of size n=3, say 1, 4, and 2, and compute
the mean.
1+4+2
Sample: 1, 4, 2 𝑋̅ = 3
= 2.3

What can you say about the sample mean, 𝑋̅ and the population mean, µ?
How many different samples of the same size, n=3, do you think we can get
from a population of N=5? (Clue: nCr → 5C3)
In this lesson we will try to list all the possible samples that we can get from a given
finite population, and present them in a tabular form.
We need to learn about sampling distribution because it is an essential
element in Statistical Inference. We remind you that Statistical Inference is the
process of using sample results to draw conclusions about the characteristics of a
population.
Definition: The Sampling Distribution of the Mean is the probability
distribution of the sample means.

Let us now construct a sampling distribution of the mean.


Given population: 1, 2, 3, 4, 5 (N=5) Size n=3
Sample Mean, 𝑋̅
1, 2, 3 2
1, 2, 4 2.3
1, 2, 5 2.7
1, 3, 4 2.7
1, 3, 5 3
1, 4, 5 3.3
2, 3, 4 3
2, 3, 5 3.3
2, 4, 5 3.7
3, 4, 5 4
There are 10 possible samples of size 3 that can be drawn from our given
population.

DO_Q3_STATISTICS&PROBABILITY_
24
GRADE 11_LESSON6
Remember: The number of samples of size n that can be drawn from a
population of size N is given by NCn. (Look for the function nCr in your scientific
calculator.)
Now, let us make a frequency distribution of the sample means.
Sample Mean, 𝑋̅ Frequency
2 1
2.3 1
2.7 2
3 2
3.3 2
3.7 1
4 1
Total = 10
At this point, we simply use the classical definition of probability to compute
for the probability of getting each sample mean, P(𝑋̅).

SAMPLING DISTRIBUTION of the MEAN


Mean, 𝑋̅ Probability, P(𝑋̅)
2 1/10 = 0.1
2.3 1/10 = 0.1
2.7 2/10 or 1/5 = 0.2
3 2/10 or 1/5 = 0.2
3.3 2/10 or 1/5 = 0.2
3.7 1/10 = 0.1
4 1/10 = 0.1
Total=1.0
(Belecina, et. al, 2016, pp. 115-117)
Based on the table above,
a) What is the probability of selecting a sample whose mean equals 2?
b) What is the probability of selecting a sample whose mean equals 3.7?

Activity 2.1 Computing for Mean and Variance of Sampling Distribution


From the above hanging questions, let’s compute for the mean and variance.
̅:
Mean of the Sampling Distribution of 𝑿
µ𝑥̅ = ∑ 𝑋̅ • P(𝑋̅)
= 2(0.1) + 2.3(0.1) + 2.7(0.2) + 3(0.2) + 3.3(0.2) + 3.7(0.1) + 4(0.1)
=3
̅:
Variance of the Sampling Distribution of 𝑿
σ2𝑥̅ = ∑(𝑋̅ - µ𝑥̅ )2 • P(𝑋̅)
= (2-3)2 (0.1) + (2.3-3)2 (0.1) + (2.7-3)2 (0.2) + (3-3)2 (0.2) + (3.3-3)2 (0.2) +
(3.7-3)2 (0.1) + (4-3)2 (0.1)
= 0.33

DO_Q3_STATISTICS&PROBABILITY_
25
GRADE 11_LESSON6
Consider a population of 4 typists from a certain company who were asked to
type the same page of a manuscript. The number of errors made by each typist is
presented below.
Typist Number of Errors
Anna 1
Betty 3
Carol 2
Denice 4
Required: List all the possible samples of size n=2, and then check if µ𝑥̅ = µ.
(Levine, et. al., 2005, pp. 253-254)

Short Answer. Supply the correct answer.


1. The total number of samples of size n that can be drawn from a finite
population of size N is __________.
2. The mean of the sampling distribution of the mean is always equal to the
__________.
3. The sampling distribution of the mean is the probability distribution of the
__________.
4. __________is the process of using sample results to draw conclusions about the
characteristics of a population.
5. If 𝑋̅ represents the mean of the toss of two fair dice, the probability of getting a
mean equal to 6 is __________.

Problem Solving
There are 10 students enrolled in a preschool. Their ages are
Student Age Student Age
A 4 F 5
B 5 G 4
C 4 H 4
D 3 I 3
E 5 J 6

a) Compute for the mean, µ and variance, σ2 of the ages of the students in this
preschool.

b) Construct the sampling distribution of the mean by selecting a sample of


size n=2 students from this preschool.
c) Compute for the mean and variance of the sampling distribution.

(Almeda, et. al., 2010, p. 390)

DO_Q3_STATISTICS&PROBABILITY_
26
GRADE 11_LESSON6
Targets:

1. Illustrates the Central Limit Theorem (CLT) (M11/12SP-IIIe-2);


2. Define sampling distribution of the mean using CLT ) (M11/12SP-IIIe-2); and
3. Solve problems involving sampling distribution of the mean ) (M11/12SP-IIIf-1).

Do this Pre-Test: Choose the letter of the best answer. Write your answers on your
notebook.
1. From a finite population of N=7, the number of samples of size n=3 that can be
drawn is __________.
A. 25 B. 30 C. 35 D. 40
2. From a finite population of N=10, the number of samples of size n=5 that can be
drawn is __________.
A. 252 B. 254 C. 256 D. 258
3. In a sampling distribution of the mean, as the sample size increases, the variance
__________.
A. -1 B. decreases C. increases D. remains unchanged
4. The standard deviation of the sampling distribution of the sample means is
known as ___________.
A. Variance C. Standard Deviation
B. Standard estimate of the mean D. Standard error of the mean
5. A __________ is one that consists of a finite or fixed number of elements,
measurements, or observations.
A. Finite Population C. Frequent Population
B. Infinite Population D. cannot be determined

Lesson
7 The Central Limit Theorem
This lesson will provide information, activities and exercises which will lead the
leaners to the concept of central limit theorem and will enhance skills on solving for
the problems involving the sampling distributions of the mean.

The table below shows the results that we obtained from the last lesson
when we studied the sampling distribution of the mean. From a finite population of
N=5, we drew samples of different sizes and computed the Mean and Variance of
each sampling distribution of the mean.
POPULATION SAMPLING DISTRIBUTION OF THE MEAN
N=5 n=2 n=3 n=4
MEAN µ=3 µ𝑥̅ = 3 µ𝑥̅ = 3 µ𝑥̅ = 3
VARIANCE σ2 = 2 σ2𝑥̅ = 0.75 σ2𝑥̅ = 0.33 σ2𝑥̅ = 0.12
What do you notice about the values of the variance of the sampling
distribution as the sample size increases? Do you think the same thing happens to

27 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON7
the standard deviation? Now consider the experiment of throwing a fair die so many
times until we get a very large population and our muscles ache because of
continuous throwing. In this case we have an infinite population. (Keller/Warrack,
p.270)
Let X be the random variable that gives the number of dots showing after each
throw, the probability distribution for the random variable is given below.
X 1 2 3 4 5 6
P(X) 1/6 1/6 1/6 1/6 1/6 1/6
Population Mean, µ = ∑ X • P(X)
1 1 1 1 1 1
= 1( ) + 2( ) + 3( ) + 4( ) + 5( ) + 6( )
6 6 6 6 6 6
= 3.5

Population Variance, σ2 = ∑ (X - µ)2 • P(X)


1 1 1
= (1 - 3.5)2 ( ) + (2 - 3.5)2 ( ) + (3 – 3.5)2 ( )
6 6 6
1 1 1
+ (4 – 3.5)2 ( ) + (5 – 3.5)2 ( ) + (6 – 3.5)2 ( )
6 6 6
= 2.92
At this point, we will ask you to get your die, throw it 5 times, (then 10 times and then
25 times) record the results and take the average or the mean. (The values below are the
results of our own experiment).

1+4+6+4+4
5 Throws, n = 5 → 𝑋̅ = = 3.8
5

1+6+4+5+3+5+5+5+1+2
10 Throws, n = 10 → 𝑋̅ = = 3.7
10

5+3+1+4+6+1+3+3+4+3+5+2+5+5+6+4+2+2+4+6+6+4+2+1+1
25 Throws, n = 25 → 𝑋̅ =
25
= 3.5

What do you notice about the means 𝑋̅’s that we obtained?

That’s right! As the number of throws of the die increases, the mean becomes
closer and closer to 3.5, the population mean. In other words, as the sample size
increases, the probability that the sample mean will be close to 3.5 also increases.
Why do you think is this so?
(CLUE: It has something to do with the averaging process.)

In the last lesson, we observed in the sampling distribution of the mean that
as the sample size increases, the variance decreases.
We expect the same thing in our experiment today. Notice that as the sample
size increases, 𝑋̅ becomes closer to µ=3.5. As the sample size increases, we are likely
to obtain 𝑋̅’s from our samples that crowd around the µ of the population which is
3.5. When this happens, we know that the variance will decrease.
This does not happen by coincidence. This can be explained by the formula
𝜎2
σ2𝑥̅ = 𝑛
where σ2𝑥̅ = variance of the sampling distribution
σ2 = variance of the population
n = sample size

28 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON7
Another thing that happens as n gets larger is that the sampling distribution
̅
of 𝑋 becomes increasingly bell shaped. (Keller/Warrack, p.274)
All these things that we learned today can be summarized by the following
theorem.
THE CENTRAL LIMIT THEOREM
If random samples of size n are drawn from a population, then as n becomes
larger, the sampling distribution of the mean approaches the normal distribution,
regardless of the shape of the parent population. (Belecina, et. al., p.119)

Activity 3.1 Solving Problems on Central Limit Theorem

Because of this phenomenon, we can use the Central Limit Theorem to solve
a lot of real- life problems. This amazing theorem justifies the use of the Normal
Curve and the Z-table.
𝑋−µ
From our original formula of Z= , we now have this new formula if
𝜎
we want to compute for the probability that 𝑋̅ will take some values in the sampling
distribution of 𝑋̅.
FORMULA
𝑋̅ −µ
Z= where 𝑋̅ = mean
𝜎/√𝑛
µ = population mean
σ = population standard deviation
n = sample size
Example

The scores on a Statistics midterm exam are normally distributed with a mean of 78
and a standard deviation of 6. If a student is selected at random
(a) what is the probability that his/her score is higher than 90?
(b) what is the probability that his/her score is less than 75?
(c) what is the probability that a class of 30 students will have an average score of
less than 75?
SOLUTION:
Given: µ = 78 σ=6
The population is normally distributed.
(a) P(X>90) = P(Z> ?) = ?
𝑋−µ 90−78 12
Let’s solve for Z: Z= = = =2
𝜎 6 6
Use the Z-table to determine the required area,
P(X>90) = P(Z>2) = 0.4772
Therefore, the probability that his/her score is higher than 90 is 0.0228 or
2.28%.

(b) P(X<75) = P(Z<?) = ?


𝑥−µ 75−78 −3
Let’s convert X to Z: Z= = = = -0.5
𝜎 6 6
Use the Z-table to determine the required area.
P(X<75) = P(Z<-0.5) = 0.3085
Therefore, the probability that his/her score is less than 75 is 0.3085 or 30.85%.

DO_Q3_STATISTICS&PROBABILITY_
29
GRADE 11_LESSON7
(c) In this question we are given a sample of size 30, and we are asked to determine
the probability that the mean 𝑋̅ of this sample will be less than 75. As explained
above, the Central Limit Theorem allows us to use the Z-table in this type of problem.
P(𝑋̅<75) = P(Z<?) = ?
𝑥̅ −µ 75−78
Let’s convert our 𝑋̅ to Z: Z= = = -2.74
𝜎/√𝑛 6/√30
Use the Z-table to determine the required area.
P(𝑋̅<75) = P(Z<-2.74) = 0.0031

Therefore, the probability that the sample will have a mean of less than 75 is
0.0031 or 0.31%.

The supervisor of a bottling company has observed that the amount of


softdrink in each Litro bottle is actually a normally distributed random variable ,
with a mean of 1006ml and a standard deviation of 10ml. (Keller/Warrack, p.277)
(a) If a customer buys one bottle , what is the probability that the bottle will
contain more than 1000ml?
(b) If a customer buys 4 bottles, what is the probability that the mean amount
of the 4 bottles will be greater than 1000ml?
SOLUTION:
Given: µ = 1006 σ = 10 n=4
The population is normally distributed.
(a) P(X>1000) = P(Z>?) = ?
𝑥−µ 1000−1006
Let’s convert X to Z: Z= 𝜎 = 10
= -6/10 = -3/5 = -0.6
Use the ZX-table to determine the required area.
P(X>1000) = P(Z>-0.6) = 0.7257

Therefore, the probability that a bottle will contain more than 1000ml is
0.7257 or 72.57%.
(b) P(𝑋̅>1000) = P(Z>?) = ?
𝑥̅ −µ 1000−1006
Let’s convert 𝑋̅ to Z: Z= = = -1.2
𝜎/√𝑛 10/√4
Use the Z-table to determine the required area.
P(𝑋̅>1000) = P(Z>-1.2) = 0.8849

Therefore, the probability that the mean of the 4 bottles is 0.8849 or 88.49%.

Short Answer. Supply the correct answer.


_______1. In a sampling distribution, as the sample size increases the sampling
distribution of the mean approaches the __________ distribution.
_______2. In a sampling distribution, if the parent population is normal, then the
_________ is normally distributed for all sample sizes.
_______3. If the parent population is nonnormal, then the Mean is approximately
normal only for __________ values of n (sample size).
DO_Q3_STATISTICS&PROBABILITY_
30
GRADE 11_LESSON7
_______4. If the population is extremely nonnormal, the sampling distribution will
also be __________ even for moderately large values of n.
_______5. In many practical situations, a sample size of _________may be sufficiently
large to allow us to use the normal distribution as an approximation for
the sampling distribution of the Mean.
Application
The time it takes a group of Grade 11 students to complete the Final Exam in
Statistics and Probability is known to be normally distributed. The mean is 58
minutes and the standard deviation is 4 minutes.
(a) What is the probability that a randomly selected student will complete the
examination in less than 54 minutes?
(b) If 5 randomly selected students take the exam, what is the probability that the
mean time it takes the group to finish will be less than 54?

Targets:
1. Illustrate the t-distribution (M11/12SP-IIIg-2).
2. Identify critical values (percentiles) using the t-table (M11/12SP-IIIg-5).

Read and analyze each item carefully. Choose the letter that corresponds to
your answer and write it in your notebook.
1. Which of the following refers to the number of subjects or observations taken
from a population?
A. sample size C. degrees of freedom
B. confidence level D. t-value
2. Given the sample size, how do we compute the degrees of freedom?
A. Add one to the sample size.
B. Subtract one from the sample size.
C. Divide the sample standard deviation by the sample size.
D. Divide the sample size by the sample standard deviation.
3. The size of a sample is 19. What is the degrees of freedom?
A. 18 B. 20 C. 34 D. 100
Consider the situation below to answer item numbers 4 and 5.
A certain sample whose size is 10 is known to have a mean of 73.25 with a
standard deviation of 2.5.
4. Find 𝑡𝛼/2 for a 99% confidence interval.
A. 1.833 B. 2.228 C. 2.262 D. 3.250
5. Which of the following gives the interval estimate for the population mean with
99% confidence?
A. 71.801 < 𝜇 < 74.699 C. 70.681 < 𝜇 < 75.819
B. 71.489 < 𝜇 < 75.011 D. 71.462 < 𝜇 < 75.038

DO_Q3_STATISTICS&PROBABILITY_
31 GRADE 11_LESSON8
Lesson
8 The t-Distribution
In this lesson, the learners will understand the concept of t-distribution. The
process of identifying critical values or percentiles to be used in constructing
confidence intervals using the t-table will also be presented.

Previously, when 𝜎 is known and the sample size is at least 30 or the sample
size is less than 30 but is taken from a population that is approximately normally
distributed, the confidence interval can be easily determined with the help of the z-
distribution.
Unfortunately, in most cases, we seldom know the population standard
deviation 𝜎 and what we have at hand is just the sample standard deviation 𝑠.
Finding a confidence interval for the population mean 𝜇 using only 𝑠 becomes a
different task to do.
Thus, this situation needs the help of a more suited distribution which is the
t-distribution.
The general expression for estimating the population mean using 𝑠 is given by
𝒔
𝑿 ± 𝒕 ( ).
√𝒏

The distribution of these values is


called t-distribution. The t-distribution
is indeed a family of curves that is based
on the concept of the so-called degrees
of freedom, which is related to the
sample size. The degrees of freedom,
usually denoted by 𝑑𝑓, refers to the
number of values that are free to vary
after a sample statistic has been
computed. It also identifies the
particular curve to be used as the t-
distribution. This concept is seen on the
figure below.

Figure 1. The t-distribution as a family of curves


To decide which t-distribution curve to follow, we must compute the degrees
of freedom. The degrees of freedom is computed by 𝒅𝒇 = 𝒏 − 𝟏, where 𝑛 is the sample
size.
Note that when 𝜎 is unknown, we can only use 𝑠 to find a confidence interval for
𝜇 if the following conditions are satisfied:
i. the sample is randomly selected; and
ii. either 𝑛 ≥ 30 or the population from which the sample is taken is
approximately normal if 𝑛 < 30.
The confidence interval limits are obtained using the general expression
presented above.

DO_Q3_STATISTICS&PROBABILITY_
32
GRADE 11_LESSON8
Characteristics of the t-distribution
➢ The variance is greater than 1.
➢ The t-distribution curve is bell-shaped.
➢ The curve is symmetrical about the mean.
➢ The distribution curve is asymptotic to the x-axis.
➢ The mean, median, and mode are equal to zero and are located at the center of
the distribution.
➢ As the sample size increases, the t-distribution approaches the normal
distribution (see Figure 1)

Sample Problems:
1. Find 𝑡𝛼/2 for a 95% confidence interval of a sample with size 16.
Solution:
In a t-distribution table, the values represent the proportions of areas in the tails
of the t-curve. We note that 𝑛 = 16 and that the confidence interval is 95%, so 𝛼 =
0.05. Thus, 𝛼/2 = 0.025, and we wish to find 𝑡0.025 .
First, we compute for the degrees of freedom as follows:
𝑑𝑓 = 𝑛 − 1
𝑑𝑓 = 16 − 1
𝑑𝑓 = 15
Next, we refer to the t-table below, we get the intersection of df=15 and 𝑡0.025 .

Thus, 𝑡𝛼/2 for a 95% confidence interval of a sample with size 16 is 2.131.
1. A sample composed of 20 male senior high school students were taken
randomly from a normally distributed population. Their heights were taken, and
it was found out that the mean is 65 inches with a standard deviation of 2.15
inches. With 95% confidence, find an interval estimate for the population mean.
Solution:
DO_Q3_STATISTICS&PROBABILITY_
33
GRADE 11_LESSON8
From the given, the sample was taken randomly and that the population from
which it was taken is normally distributed. Thus, we can use 𝑠 and the t-
distribution to find an interval estimate for 𝜇.
Now, we find 𝑡𝛼/2 where 𝛼 = 0.05. With 𝑑𝑓 = 20 − 1 = 19, it can be verified from
the t-table that 𝑡𝛼/2 = 2.093.
Next, we compute for the confidence interval limits as shown below:
𝑠 2.15
𝑋 ± 𝑡 ( ) = 65 ± (2.093) ( ) = 65 ± 1.006
√𝑛 √20
It follows that the confidence interval limits are:
Upper: 65 + 1.006 = 66.006 and Lower: 65 – 1.006 = 63.094.
Therefore, with 95% confidence, the mean height of the population lies in the
interval 𝟔𝟑. 𝟎𝟗𝟒 < 𝝁 < 𝟔𝟔. 𝟎𝟎𝟔.

Answer the following.


1. Find 𝑡𝛼/2 for a 99% confidence interval of a sample with size 20.
2. Find the 99% confidence interval for a sample with 𝑛 = 17, 𝑋 = 7.6, and 𝑠 = 2.3.

Answer the given problem below.


A sample composed of 20 male senior high school students were taken
randomly from a normally distributed population. Their heights were taken, and it
was found out that the mean is 65 inches with a standard deviation of 2.15 inches.
With 99% confidence, find an interval estimate for the population mean.

Read and analyze each item carefully. Choose the letter that corresponds to
your answer and write it in your notebook.
1. Which of the following refers to the number of values that can vary in a certain
analysis or computation?
A. confidence level C. sample size
B. degrees of freedom D. t-value
2. How do we compute the degrees of freedom?
A. Add one to the sample size.
B. Subtract one from the sample size.
C. Divide the sample standard deviation by the sample size.
D. Divide the sample size by the sample standard deviation.
3. The size of a sample is 17. What is the degrees of freedom?
A. 16 B. 18 C. 34 D. 100
Consider the situation below to answer item numbers 4 and 5.
A certain sample whose size is 10 is known to have a mean of 73.25 with a
standard deviation of 2.5.
4. Find 𝑡𝛼/2 for a 95% confidence interval.
A. 1.833 B. 2.228 C. 2.262 D. 3.250
DO_Q3_STATISTICS&PROBABILITY_
34
GRADE 11_LESSON9
5. Which of the following gives the interval estimate for the population mean with
95% confidence?
A. 71.801 < 𝜇 < 74.699 C. 70.681 < 𝜇 < 75.819
B. 71.489 < 𝜇 < 75.011 D. 71.462 < 𝜇 < 75.038

Targets:
1. Identify the length of a confidence interval (M11/12SP-IIIj-1).
2. Compute for the length of a confidence interval (M11/12SP-IIIj-2).
3. Compute for an appropriate sample size using the length of the interval
(M11/12SP-IIIj-3).
4. Solve problems involving size determination (M11/12SP-IIIj-4).

Read and analyze each item carefully. Choose the letter that corresponds to your
answer and write it in your notebook.
1. How is the expected length of a confidence interval related to the margin of error?
A. The expected length is equal to the expected length.
B. The expected length is twice the margin of error.
C. The expected length is half the margin of error.
D. The expected length is two more than the margin of error.
Refer to the problem below to answer item numbers 2 to 5.
The standard deviation of the temperature of all the mall goers from a certain
city on a Tuesday morning is 0.16°C. How many of them must be chosen to get
an accurate estimate of the mean temperature within 0.015 at 95% confidence
level?
2. What does 0.16 refer to?
A. population mean C. sample mean
B. population standard deviation D. sample standard deviation
3. To which quantity does 0.015 refer?
A. length of the interval C. margin of error
B. mean D. standard deviation
4. What needs to be determined in the above problem?
A. margin of error C. length of interval
B. population mean D. sample size
5. Which of the following is the correct answer to the problem?
A. 437 B. 438 C. 757 D. 758

Lesson
Confidence Level and Sample Size
9
In this lesson, the learners will understand the relationship between
confidence level and sample size. The process of determining an appropriate sample
size will also be presented.

DO_Q3_STATISTICS&PROBABILITY_
35
GRADE 11_LESSON9
You may recall that a confidence interval is an interval estimate of the true
population mean composed of a range of values that possibly contains it. In the
estimation of this parameter, the concepts of margin of error, confidence level, and
sample size seem to equally interplay.

It is interesting to note that researchers wanted to come up with a sufficient


enough number of samples for both practical and theoretical reasons. Determining
the minimum sample size needed precisely ensures that at a certain confidence level
this sample will have an accurate estimate of the population mean under a given
margin of error. This implies that the sample size mainly depends on two things:
confidence level and the margin of error of the confidence interval.
Researchers often use the 90%, 95% and 99% as the conventional levels of
confidence for a certain interval estimate. While for the length, we can anticipate how
long the confidence interval will be for any given sample size and confidence level
using the margin of error.

When the population standard deviation is known, the formula for the margin
𝑧
of error 𝐸 is given by 𝐸 = 𝛼/2
𝑛
⋅ 𝜎. This indicates the number that must be added to or

subtracted from the sample mean to have the limits of a confidence interval. Thus,
the expected length 𝐿 of a confidence interval is simply 𝐿 = 2𝐸.
Alternatively, when the population standard deviation is unknown, the sample
standard deviation 𝑠 may be used as a substitute. In this case, the margin of error
𝑧
may be computed as 𝐸 ≈ 𝛼/2 ⋅ 𝑠.
√𝑛
As an illustration, suppose we want to find the expected length of a 95%
confidence interval for estimating the population mean of a sample with 𝑛 = 120 and
𝜎 = 1.5. Substituting these values to the formula for the margin of error, we have,
𝑧𝛼/2 1.96
𝐸= ⋅𝜎= ⋅ 1.5 = 0.268
√𝑛 √120
Thus, we expect the length of the confidence interval to be 𝐿 = 2𝐸 = 2(0.268) =
𝟎. 𝟓𝟑𝟔.

Finding Appropriate Sample Size

Using the above formula for the margin of error, we can derive a formula for
the suitable sample size 𝑛 given a certain confidence level and margin of error. This
derivation is shown below.
𝑧
𝐸 = 𝛼/2
𝑛
⋅𝜎 [Formula for the Margin of Error]

√𝑛 𝑧 √𝑛 √𝑛
(𝐸) = [ 𝛼/2 ⋅ 𝜎] [Multiply both sides by ]
𝐸 √𝑛 𝐸 𝐸
𝑧𝛼/2 ⋅𝜎
√𝑛 = ( 𝐸
) [Simplify]
2 𝑧𝛼/2 ⋅𝜎 2
(√𝑛) = ( 𝐸
) [Square both sides]
𝑧𝛼/2 ⋅𝜎 2
𝑛=( 𝐸
) [Simplify]
DO_Q3_STATISTICS&PROBABILITY_
36
GRADE 11_LESSON9
The last equation above is used to determine the appropriate sample size. As
an illustration, let us find out how large a sample should be if one wants to be 99%
confident that the interval estimate of the population mean is accurate to within 0.05
if the population standard deviation is known to be 2.5.
Substituting the known values to the above formula, we obtain the following:
𝑧𝛼/2 ⋅ 𝜎 2 2.58 ⋅ 2.5 2
𝑛=( ) =( ) = 𝟏𝟔, 𝟔𝟒𝟏
𝐸 0.05
Sample Problems:
1. It is known that the length of life of the batteries produced by a certain
manufacturing company is normally distributed with a standard deviation of
0.75 years. How many sample batteries must be taken to estimate the true mean
length to within 0.02 years at 95% confidence?
Solution:
First, we note that the population standard deviation is 0.75 years. Also, the
margin of error is set to 0.02 while the confidence level is 95%. Using the formula
for finding the minimum sample size suited to accurately estimate the population
mean, we have,
𝑧𝛼/2 ⋅ 𝜎 2 1.96 ⋅ 0.75 2
𝑛=( ) =( ) = 5, 402.25
𝐸 0.02
Hence, 5, 403 batteries must be taken to ensure an accurate estimate of
the population mean.
2. Based on previous school records, the standard deviation of the height of
Grade 10 students in a particular city is 1.2 inches. A researcher wanted to be
sure on estimating the population mean height to within 0.035 of its true value.
Using a 99% confidence interval, find the sample size that the researcher needs.
Solution:
Clearly, we have the following given information: 𝜎 = 1.2, 𝐸 = 0.035, and the
confidence level is 99%. Thus, we may calculate the value of 𝑛 as shown below.
𝑧𝛼/2 ⋅ 𝜎 2 2.58 ⋅ 1.2 2
𝑛=( ) =( ) = 7 824.67
𝐸 0.035
Thus, the researcher needs to take 7, 825 Grade 10 students as a sample.

Answer the problem below.


It is known that the length of life of the batteries produced by a certain
manufacturing company is normally distributed with a standard deviation of 0.75
years. How many sample batteries must be taken to estimate the true mean length
to within 0.02 years at 99% confidence?

Answer the problem below.


Based on previous school records, the standard deviation of the height of
Grade 10 students in a particular city is 1.2 inches. A researcher wanted to be sure
on estimating the population mean height to within 0.035 of its true value. Using a
95% confidence interval, find the sample size that the researcher needs.

DO_Q3_STATISTICS&PROBABILITY_
37
GRADE 11_LESSON9
Read and analyze each item carefully. Choose the letter that corresponds to your
answer and write it in your notebook.
1. How is the expected length of a confidence interval related to the margin of error?
A. The expected length is half the margin of error.
B. The expected length is twice the margin of error.
C. The expected length is equal to the margin of error.
D. The expected length is two more than the margin of error.

Refer to the problem below to answer item numbers 2 to 5.


The standard deviation of the temperature of all the mall goers from a certain
city on a Tuesday morning is 0.16°C. How many of them must be chosen to get an
accurate estimate of the mean temperature within 0.015 at 99% confidence level?

2. What does 0.16 refer to?


A. population mean C. sample mean
B. population standard deviation D. sample standard deviation
3. To which quantity does 0.015 refer?
A. length of the interval C. margin of error
B. mean D. standard deviation
4. What needs to be determined in the above problem?
A. margin of error C. length of interval
B. population mean D. sample size
5. Which of the following is the correct answer to the problem?
A. 437 B. 438 C. 757 D. 758

Almeda, Capistrano, Ferry Sarte. (2010). Elementary Statistics. Quezon City:


University of the Philippines Press.
Belecina, R., Baccay, E., & Mateo, E. (2016). Statistics and Probability.
Manila,Philippines: REX Book Store Inc.
Bluman, A. (2018). Elementary Statistics: A Step by Step Approach 10th edition.
McGraw Hill. New York, USA.
Canlapan, R. (2016). Statistics and Probability. Makati, Philippines: Diwa Learning
System Inc.
Keller, Warrack. (2003). Statistics For Management and Economics. California USA:
Thomson Learning, Inc.
Levine, et. al. (2005). Statistics: A Handbook for Managers. New Jersey: Prentice Hall.
PERCDC Learnhub
Walpole, R., Myers, R., Myers, S., and Ye, K., (2012). Probability and Statistics for
Engineers and Scientists 9th edition. Pearson Education Inc. Massachusetts,
USA.

For inquiries or feedback, please write or call:


Department of Education – SDO Valenzuela
Office Address: Pio Valenzuela Street, Marulas, Valenzuela City
Telefax: (02) 8292-4340
Email Address: [email protected]

38 DO_Q3_STATISTICS&PROBABILITY_
GRADE 11_LESSON9

You might also like