MODULE Chapter I Random Variables and Probability Distribution1
MODULE Chapter I Random Variables and Probability Distribution1
Mathematics
Statistics and Probability
Introduction
In the study of basic probability, you have learned that an experiment is any activity which
can be done repeatedly under similar condition. The set of all possible outcomes of an experiment
is called the sample space. You have also learned how to mathematically list the possible outcome
of a given experiment. In tossing a coin, for instance, the possible outcomes are turning up a head
or a tail.
This module will help you understand the process of illustrating random variables as well as
distinguishing between discrete and continuous random variables. God bless and enjoy learning.
Competency
Procedure/Learning Experience
Activity
Complete the following table. The first one is done for you.
Experiment Number X Possible Value of X
Two cards are drawn Sum of the numbers 4, 5, 6, 7, 8, 9, 10,
from a deck. on the cards 11, 12, 13, 14, 15,
16, 17, 18, 19, 20
Roll a pair of dice Sum of the number of
dots on the top faces
Toss a pair of coin Number of tosses until
repeatedly the coin lands head
Height of individuals Height of your father
Analysis
•
This time, you need to reflect on the following questions because it will help you to
understand the appropriate way on illustrating and classifying a random variable. Answer
the following questions:
1. How do you find the activity/?
2. Where you able to complete the table? If yes, how? If no, why?
4. In your own words how will you differentiate between discrete and continuous random
variables?
Abstraction
Discrete Continuous
From the example in the table above, Examples 1, 2, and 3 are all discrete while example
4 is continuous.
Introduction
We first learned about the notion of variables in the introduction of Algebra, and we
know from our Algebra lessons that a variable is a placeholder for real number values that
can be assigned to it. Some examples of variables includes X = number of heads or Y =
number of cell phones or Z = running time to movies.
If three coins are tossed, what numbers can be assigned for the frequency of heads
that will occur? If three cards are drawn from a deck, what number can be assigned for the
frequency of face cards that will occur? These questions may be answered using random
variables which you will learn in this module.
Competency
Abstraction
Application
This time, it’s your turn to shine. Answer the problem below.
The households of a local community were surveyed about the number of occupants
who are working. It was found out that 25 households have one occupant working, 18 have
two occupants working, 12 have three occupants working, and 5 have four occupants
working. Let X be the number of occupants working from a randomly selected household.
What are the possible values for the random variable?
Reflection
This time, it’s your turn to shine. Answer the following as directed.
1. How do you find the values of a random variables?
2. How is this variable, as described in Algebra, similar to a random
variable? How do they differ?
Competency
Objective
Procedure/Learning Experience
Abstraction
A random variable is a set whose elements are the numbers assigned to the outcomes
of an experiment. It is usually denoted by uppercase letters such as X, whose elements
are denoted by lower case letters x1, x2, x3 and so on.
Probability distribution of a discrete random variable specifies the probability of each
possible value of the random variable. The distribution functions of discrete random
variables are concentrated as a mass for a particular value, and generally known as
Probability Mass Function.
2. The sum of all the probabilities of all values of the random variable
must be equal to 1. In symbol, we write it as ΣP(X) = 1
Probability Histogram
Histogram
Construct a histogram for this probability distribution. A histogram is a bar graph. To
construct a histogram for a probability distribution, follow these steps.
Plot the values of the random variable along the horizontal axis.
a) In plotting the probabilities along the vertical axis, you can change fractions to
decimals (ex. ¼=0.25 and ½= 0.5)
0.6
0.5
0.4
P(Y)
0.3
0.2
0.1
0
0 1 2
No. of Green Balls (Y)
Ex. 2.Write the probability mass function of the random variable X defined by the outcomes
of rolling a fair die and construct its corresponding histogram.
a) Since a die is fair, each of the 6 outcomes has a probability 1/6 or approximately 0.17,
thus, the probability mass function is:
X 1 2 3 4 5 6
P(X) 1/6 1/6 1/6 1/6 1/6 1/6
b) Histogram
0.18
0.16
0.14
0.12
0.10
P(X)
0.08
0.06
0.04
0.02
0.00
1 2 3 4 5 6
Ex. 3.Let X be a random variable giving the number of girls in a randomly selected three-
child family. Assuming that boys and girls are equally likely, construct the probability
distribution of X and its corresponding histogram.
a) Possible Outcomes Value of the Random Variable X
(No. of Girls)
BBB 0
BBG 1
BGB 1
GBB 1
BGG 2
GBG 2
GGB 2
GGG 3
b)
No. of Girls 0 1 2 3
Probability P(X) 1/8 or 0.125 3/8 or 0.375 3/8 or 0.375 1/8 or 0.125
ΣP(X)= 1/8 +3/8
+ 3/8 + 1/8 = 8/8 =1
c) Histogram
0.4
0.35
0.3
0.25
P(X)
0.2
0.15
0.1
0.05
0
0 1 2 3
No. of Girls (X)
4. X 1 5 7 8 9
P(X) 1/3 1/3 1/3 1/3 1/3
Σ P(X)= 1/3 + 1/3 +1/3 +1/3 +1/3=5/3
This is not a probability distribution because ΣP(X) ≠ 1
X 1 3 5 7
5. P(X) 0.35 0.25 0.28 0.12
Σ P(X)= 0.35 + 0.25 + 0.28 + 0.12 =1
This is a probability distribution because ΣP(X) = 1
Reflection
Look back and reflect on what you have done, then answer the following questions.
1. Why should the sum of the probabilities in a probability distribution always equal to 1?
2. What is the shape of most probability distributions? Why do you think so?
If you took an examination, do you sometimes think of where would be your score lie?
Would it be below or above the average score? Afterwards, you would like also to know how
far you are from the average score. Finally, you would ask yourself what is the meaning of
those locations from the center or mean of the probability distribution? If your interest caught
by these questions, this learning module is design for you.
In this module, you will understand the process on how to illustrate the mean and
variance of a discrete random variable. Similarly, calculating the mean and the variance of
a discrete random variable will be discussed. You may find this module as piece of good
fortune in acquiring brand new learning.
Competency
Objective
Procedure/Learning Experience
Activity
𝑋 𝑝(𝑋)
0 0.03
1 0.05
2 0.12
3 0.30
4 0.28
5 0.22
Based on the data above, observe, analyze and answer the following questions:
1. Is the probability of x lies between 0 and 1?
2. What is the sum of all probabilities of x?
3. Is there a negative probability? Is it possible to have a negative
probability?
4. What is the average or mean of the probabilities of discrete random
variable?
5. What are the variance and the standard deviation of the probability distribution from
the data above?
Analysis
•
This time, you need to reflect on the following questions because it will help you to
understand the appropriate way on illustrating and calculating the mean, variance and
standard deviation of a discrete random variable. Answer the following questions:
1. How did you find the given activity?
2. Were you able to get the correct answer for each question? If yes, Very Good! If not,
which question/s you find difficult?
3. What mathematical skills help you to answer all 5 questions?
4. Do you think the concepts that you will acquire in this module would be helpful to
solve real-life problems?
5.
Abstraction
𝑬(𝒙) = 𝝁𝒙 = ∑ 𝒙 ∙ 𝑷(𝑿) = 𝟑. 𝟒𝟏
So, it indicates that the expected value or mean of the random variable is 3.41.
For question 5, you will need to understand the process on computing the variance
and the standard deviation of the discrete random variable. In order to compute those,
always remember this;
first subtracting the mean from each possible x value to obtain the deviations, then squaring
each deviation and multiplying the result by the probability of the corresponding x value, and
then finally adding these quantities. In symbol,
symbol,
From the data generated.
Applying the concepts from the activity given above, let us complete the table below.
0 ¼ or 0.25 0 -1 1 0.25
1 ½ or 0.5 0.5 0 0 0
2 ¼ or 0.25 0.5 1 1 0.25
𝝁𝒙 = ∑ 𝒙 ∙ 𝑷(𝑿) = 𝟏 ∑(𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿) = 𝟎. 𝟓𝟎
We have seen that the Expected value E(x) is just the average or mean (µ) of random
variable x. It’s sometimes called a “weighted average” because more frequent values of X
are weighted more highly in the average. It’s also how we expect X to behave on-average
over the long run. The mean of a random variable X is a measure of the central location of
the distribution of X. If we are summarizing features of the distribution of X, it is clear that
location is not the only relevant feature. The second most vital feature is the spread of the
distribution.
If values of X is closer to its mean, μX are very likely and values further away from μX
have very small probability, then the distribution of X will be closely concentrated around μX.
In this case, the spread of the distribution of X is small. On the other hand, if values of X take
some distance from its mean μX are likely, the spread of the distribution of X will be large.
These ideas head to the most significant measure of spread, the variance, and a
closely related measure, the standard deviation.
You, as a student have met the concepts of variance and standard deviation when
summarizing data. These were the sample variance and the sample standard deviation. The
difference here is that we are referring to properties of the distribution of a discrete random
variable.
where the sum is taken over all values of x for which P(x) > 0 but less than 1. So, the variance
of X is the weighted average of the squared deviations from the mean μ, where the weights
are given by the probability function P(x) of X.
The standard deviation of X is defined to be the square root of the variance of X. That
is,
Unlike the mean, there is no simple direct interpretation of the variance or standard
deviation. The variance is parallel to the moment of inertia in physics, but that is not
necessarily widely understood by students. What is important to understand is that, in
relative terms:
➢ a small standard deviation (or variance) means that the distribution of the random
variable is concentrated narrowly around the mean
➢ a large standard deviation (or variance) means that the distribution is spread out, with
some chance of observing values at some distance from the mean.
Remember that the variance cannot be negative, because it is an average of squared
quantities. This is appropriate, as a negative spread for a distribution does not make sense.
Hence, variance is greater than 0 and standard deviation also greater than 0 always.
Reflection
Life is not measured by the number of breaths we take, but by the moments that take
our breath away and by the chances we bravely face.