Stat 115 - Chapter 1
Stat 115 - Chapter 1
Preliminaries
03 The Binomial
Distribution 04 The Normal
Distribution
00 STAT 114 REVIEW
POPULATION vs SAMPLE
Population data = 𝑿𝟏 , 𝑿𝟐 , … , 𝑿𝑵
Parameter a summary measure describing a particular characteristic of the
population that is computed using population data
∑$
!"# "!
Example population mean, 𝜇 =
#
$ ∑$
!"# "! %&
%
population variance, 𝜎 =
#
Sample data = 𝑿𝟏 , 𝑿𝟐 , … , 𝑿𝒏
Statistic a summary measure describing a particular characteristic of the
sample that is computed using sample data
∑&
!"# "!
Example sample mean, 𝑋$ =
'
$ ∑& ( %
!"# "! %"
sample variance, 𝑠 =
'%) 4
REMARKS
o Both the parameter and statistic are summary measures that
are computed using data.
o If you have population data, then the computed summary
measure is a parameter. If you only have sample data, then
the computed summary measure is a statistic.
REMARKS
o In a statistical inquiry, the answer to the research problem is based
on the value of the parameter that describes the characteristic of
interest of the population under study.
o However, the value of this parameter can only be computed using
population data.
o If you only have sample data, you cannot compute for the value of
the parameter.
Descriptive vs Inferential
Descriptive Statistics comprise those methods concerned with
collecting, describing, and analyzing a set of data without
drawing conclusions or inferences about a larger group
Ω
METHOD State a rule that the
elements must satisfy in
List down all the elements order to belong in the set
belonging in the set then then enclosing this rule in
enclosing them in braces. braces.
Illustration
Consider the random experiment of tossing a coin twice.
Using H to denote a head and T to denote a tail, we can
specify the sample space by roster method as follows:
Ω = {𝐻𝐻, 𝐻𝑇, 𝑇𝐻, 𝑇𝑇}
We can also use the rule method to specify the same sample
space as follows:
Ω = 𝑥, 𝑦 𝑥 ∈ 𝐻, 𝑇 , 𝑦 ∈ 𝐻, 𝑇 }
Illustration
Consider the random experiment of tossing a coin twice.
Recall that we defined our sample space previously as follows:
Ω = {𝐻𝐻, 𝐻𝑇, 𝑇𝐻, 𝑇𝑇}
But take note that, in the same experiment, instead of
recording what comes up on the first and second tosses, what
we can also do is to count the number of heads that will come
up in the two tosses. In this case, the sample space is
Ω = 0,1,2
Ω = 𝑥 𝑥 ∈ {0,1,2}}
REMARKS
o The previous illustration shows that the description of the sample
space is NOT UNIQUE.
o There are many ways in which we can specify the collection of all
possible outcomes of the experiment.
Which representation must we choose to use?
o Well, naturally, the choice depends on the characteristic of interest
and whatever will facilitate the assignment and computation of
probabilities.
Example
Simple random sampling is an example of a probability sampling
method.
We will denote a sample of size 2 by the set {x1, x2}, where x1 and x2
are the two distinct elements included in the sample.
W = {{a,b},{a,c},{a,d},{a,e},{b,c},{b,d},{b,e},{c,d},{c,e},{d,e}}
Suppose a sample of size n=2 will be selected using SRSWR. Specify the sample
space.
We will denote a sample of size 2 by an ordered pair, (x1,x2), where x1 is the
element selected on the first draw while x2 is the element selected on the second
draw.
W ={(a,a),(a,b),(a,c),(a,d),(a,e),(b,a),(b,b),(b,c),(b,d),(b,e),(c,a), (c,b),(c,c),(c,d), (c,e), (d,a),
(d,b), (d,c), (d,d), (d,e),(e,a), (e,b),(e,c),(e,d), (e,e)}
By definition of SRSWR, all the 25 sample points (samples) will be given equal
chances of selection.
Example
In general, a sample of size 𝑛 selected using SRSWR will be
denoted by an ordered n-tuple (x1,x2,…,xn) where xi is the
element selected on the ith draw.
When the sample of size 𝑛 is selected from a population of size
𝑁 using SRSWR, then the sample space will contain 𝑵𝒏 ordered
n-tuples and by definition, all of them will be given equal
chances of selection.
EVENT
An event is a subset of the sample space whose probability is
defined.
We say that an event occurred if the outcome of the
experiment is one of the sample points belonging in the event.
Otherwise, the event did not occur.
We will use any capital Latin letter (A,B,C,...) to denote an event of interest.
Illustration
Consider the experiment of rolling a die. The sample space is given to
be:
Ω = {1, 2, 3, 4, 5, 6}
Let A = event of observing odd number of dots in a roll of a die
= {1, 3, 5}
B = event of observing even number of dots in a roll of a die
= {2, 4, 6}
C = event of observing less than 3 dots in a roll of a die
= {1, 2}
Example
The population consists of 𝑁 = 5 children: a=Janine, b=Josiel, c=Jan, d=Eryl, and
e=Eariel. Suppose a sample of size n=2 will be selected using SRSWR.
Suppose the sample selected was (a,d). Did event A occur? Did event B occur?
Impossible and Sure Events
The impossible event is the empty set ∅.
The sure event is the sample space Ω.
§ Two subsets of the sample space that will always be events are the empty set and the
sample space. Their probabilities are always defined.
§ Remember that an event occurs if the outcome of the experiment belongs in it. But ∅
is the empty set so it does not contain any sample point and thus it is impossible for
this event to happen. On the other hand, Ω is the sample space so it contains all
possible outcomes of the experiment and thus we are sure that it will always occur.
Other Events
Aside from the impossible event and sure event, the other subsets of the sample space are
also required to be events.
A C A∪B A∩B
“A complement” “A union B” “A intersection B”
the collection of sample the collection of sample the collection of sample
points in the sample points that belong in at points that belong in
space that do not belong least one of A and B both A and B.
in A occurred if only event occurred if both events
occurred if event A did A occurred, only event A and B occurred
not occur B, or both A and B simultaneously
Other Events
A1∪A2∪...∪An A1∩A2 ∩... ∩An
“the union of n events” “the intersection of n events”
the collection of sample points the collection of sample points
that belong in at least one of A1, that belong in each one of A1,
A2, ..., An A2, ..., An
Probability theory:
Set theory: A B
A and B are disjoint A Events A and B are mutually
ΩΩ exclusive
EXERCISES
Exercise Three new antivirus software are being developed to target
and clean computer virus. We then define the following
01
events:
A = event that Antivirus A detects the virus
B = event that Antivirus B detects the virus
C = event that Antivirus C detects the virus
Express the following events in terms of A, B, and C:
1. event where Antivirus A did not detect the virus
2. event where Antivirus B and C detects the virus
3. event where all three Antivirus detect the virus
4. event where Antivirus A detects the virus but not Antivirus
B
5. event where at least one of Antivirus B or C detects the
virus
Exercise Consider the random experiment of tossing a fair
coin 4 times.
02 1. Specify the sample space.
2. Let A be the event of observing heads on the
first 2 tosses. What are the elements of A?
3. Let B be the event of observing exactly two
heads. What are the elements of B?
4. Suppose you tossed a fair coin 4 times and
observed HHTT. Did A occur? Did B occur?
Did A ∪ B occur? Did A ∩ B occur?
5. Are A and B mutually exclusive?
PROBABILITY
The probability of an event A , denoted by P(A), is a function that
assigns a measure of chance that event A will occur and must
satisfy the following properties:
(a) Nonnegativity. 0 ≤ P(A) ≤ 1 for any event A
(b) Norming Axiom. P(Ω) = 1
(c) Finite Additivity. If A can be expressed as the union of n
mutually exclusive events, that is,
A = A1 ∪ A2 ∪ ... ∪ An, then
P(A) = P(A1) + P(A2) + ... + P(An).
INTERPRETATION
o A probability measure that is close to 1 means that the event has a very large chance of
occurrence. On the other hand, if the probability measure is close to 0, then the event has a
very small chance of occurrence.
o A probability of 0.5, the midpoint of the interval [0,1], means that the event has a 50-50
chance of occurrence, that is, the chance that the event will occur is just the same as the
chance that the event will not occur.
o In fact, if you are sure that an event is going to happen, then it must be assigned a
probability of 1. Similarly, the probability of the impossible event must always be equal to 0.
0 0.5 1
Impossible Hmmm... Certain
Exercise The definition of probability function is useful not only in computing
for probabilities but also in determining if our assignment of
03
probabilities is valid or not. Find the errors in each of the following
assignments of probabilities:
1. The probabilities that a couple will have 0, 1, 2, 3, or 4 or more
children are, 0.42, 0.36, 0.25, 0.12, and -0.15 respectively.
2. A person tosses a biased die three times with at least one dot
on each side. The probability that the sum of the number of
dots in all three tosses is 2 is 0.0625.
3. The probability that a selected student will pass the Physics
exam is 0.35 and the probability that this student will fail the
same exam is 0.62.
4. The probabilities that a salesperson will sell exactly 0, 1, 2, or
3 or more items on any given day are, 0.23, 0.42, 0.25, and
0.20, respectively.
Approaches to Assigning Probabilities
P(A) = 9/25
Example
The population consists of 𝑁 = 5 children: a=Janine, b=Josiel, c=Jan, d=Eryl, and
e=Eariel. Suppose a sample of size n=2 will be selected using SRSWR.
B = event that Janine and Jan are both included in the sample
= {(a,c), (c,a)}
P(B) = 2/25
Relative Frequency
The method of using a posteriori or relative frequency assigns probabilities to
events by repeating the experiment a large number of times and using the following
rule:
If a random experiment is repeated many times under uniform conditions, use the
empirical probability of an event A to assign its probability as follows:
The a posteriori definition of the probability of event A is the limiting value of its
empirical probability if we repeat the process endlessly.
REMARKS
o For any event A, the a posteriori approach defines the P(A) as the limiting value of the
relative frequency of occurrence of event A if we repeat the process endlessly.
o The advantage of using a posteriori probabilities instead of a priori probabilities is that its
use is not restricted to random experiments that generate a sample space containing
equiprobable outcomes.
o The advantage, on the other hand, of using a priori probabilities instead of a posteriori
probabilities is that its use does not require us to perform the actual experiment and can be
determined prior it.
Example
Consider the experiment of tossing a coin. Let A=event that a head comes up. The first
few outcomes of one possible sequence of trials is as follows:
Trial No. Outcome Relative Frequency
For the first few trials, the values of the
1 H 1/1 = 1 relative frequencies will fluctuate.
2 H 2/2 = 1
3 T 2/3 = 0.67 However, the relative frequencies will tend
4 H ¾ = 0.75 to stabilize to its limit as n becomes large.
5 T 3/5 = 0.60
6 H 4/6 = 0.67
7 T 4/6 = 0.57
8 T 4/8 = 0.5
9 H 5/9 = 0.56
10 H 6/10 = 0.60
Subjective Probability
Subjective probability assigns probabilities to events by using
intuition, personal beliefs, and other indirect information.
This method is more personal in its approach to assigning probabilities. The assignments
may vary from one person to another, depending on their personal assessment of the
available information on the situation at hand.
But whatever the assignments may be, these measures must still conform to Kolmogorov’s
definition of a probability.
Ideally, we want to use objective methods in assigning probabilities. However, it is
sometimes impractical or not at all possible to use a priori or a posteriori probabilities.
Exercise Consider the experiment of tossing a biased coin so that
the chance of observing a tail is twice the chance of
…
1st stage 2nd stage 3rd stage kth stage
n1 distinct n2 distinct n3 distinct nk distinct
possible possible possible possible
outcomes outcomes outcomes outcomes
Then there are n1 x n2 x n3 x ... x nk possible outcomes of the experiment.
Example
How many different 7-place license plates are possible if the first 3 places are to be
occupied by letters and the last 4 by numbers?
Assume that the letters and numbers CAN be repeated.
26 26 26 10 10 10 10
A, B, C, …, A, B, C, …, A, B, C, …, 0, 1, 2, …, 0, 1, 2, …, 0, 1, 2, …, 0, 1, 2, …,
X, Y, Z X, Y, Z X, Y, Z 7, 8, 9 7, 8, 9 7, 8, 9 7, 8, 9
26 25 24 10 9 8 7
A, B, C, …, A, B, C, …, A, B, C, …, 0, 1, 2, …, 0, 1, 2, …, 0, 1, 2, …, 0, 1, 2, …,
X, Y, Z X, Y, Z X, Y, Z 7, 8, 9 7, 8, 9 7, 8, 9 7, 8, 9
Permutation a b b c a c
P(3,2) or 3P2 b a c b c a
Combination
C(3,2) or $ a b b c a c
%
Permutation and Combination
The number of distinct r-permutations that we can form from
the n distinct elements of set Z is
𝑛!
𝑃 𝑛, 𝑟 = 𝑛 ∗ 𝑛 − 1 ∗ 𝑛 − 2 ∗ ⋯ ∗ (𝑛 − 𝑟 + 1) =
𝑛−𝑟 !
05
and green. He has three pairs of pants. One is red, the others
are white and blue. How many ways can he match his shirts
with his pants?
2. In a Science exam, a student has a choice of 8 questions out of
10. How many ways can a student choose a set of 8 questions
if he chooses arbitrarily?
3. A class consists of 10 boys and 15 girls. An examination is
given, and the students are ranked according to their
performance. Assume that no two students obtain the same
score.
a) How many different rankings are possible?
b) If the men were ranked just among themselves and the women
among themselves, how many different rankings are possible?
Example
A nongovernment organization is awarding 5 scholarships to children of
poor families. Fifty children are qualified for the scholarship. Among
these 50 children, only 10 are boys while the other 40 are girls.
a) How many ways can the nongovernment agency select the 5 children
who will be awarded the scholarship?
The experiment can be divided into two stages: (i) selection of boys and (ii) selection of
girls. The first stage has C(10,4) = 210 possible outcomes while the second stage has
C(40,1) = 40 outcomes.
A A B A B
Ω Ω Ω
𝑃(𝐴! ) = 1 – 𝑃(𝐴) 𝑃(𝐴 ∩ 𝐵𝐶 ) = 𝑃(𝐴) – 𝑃(𝐴 ∩ 𝐵) 𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) − 𝑃(𝐴 ∩ 𝐵)
A B A B A B
Ω Ω Ω
𝑃 𝐴∩𝐵 " = 𝑃(𝐴" ∪ 𝐵" )
𝑃(𝐴 ∪ 𝐵) = 𝑃(𝐴) + 𝑃(𝐵) 𝑃 𝐴∪𝐵 " = 𝑃 𝐴" ∩ 𝐵"
Exercise Suppose A and B are events for which it is known
that 𝑃 𝐴 = 0.6, 𝑃 𝐵 = 0.7, and 𝑃 𝐴 ∩ 𝐵 = 0.4.
07 Compute the probabilities of the following events:
a. 𝑃 𝐴 ∪ 𝐵
b. 𝑃 𝐴 ∩ 𝐵"
c. 𝑃 𝐵 ∩ 𝐴"
"
d. 𝑃 𝐴 ∩ 𝐵
e. 𝑃 𝐴" ∩ 𝐵"
f. 𝑃 𝐴 ∩ 𝐵" ∪ 𝐵 ∩ 𝐴"
Event Composition Method
Here, the probabilities are computed by expressing the event of interest as a
composition of other events.
Step 1: Define the basic events. The basic events are those events in the
problem that cannot be expressed as a composition of other events.
Step 2: List the known probabilities of events as stated in the problem.
Step 3: Express the event of interest as a composition of the basic events using
the set operations.
Step 4: Use theorems or formulas for the computation of the probabilities.
Example
The probability that a randomly selected student sleeps in Stat 101 is
0.60 and the probability that he sleeps in Math 20 is 0.85. If the
probability that he sleeps in at least one of the two courses is 0.95,
a) What is the probability that the selected student sleeps in both
courses?
b) What is the probability that the selected student is awake in both
courses?
Example
First, define the basic events:
Let S = event that the selected student sleeps in Stat 101
M = event that the selected student sleeps in Math 20
Given: 𝑃 𝑆 = 0.60 𝑃 𝑀 = 0.85 𝑃(𝑆 ∪ 𝑀) = 0.95
a) What is the probability that the selected student sleeps in both courses?
𝑃 𝑆∩𝑀 =𝑃 𝑆 +𝑃 𝑀 −𝑃 𝑆∪𝑀
= 0.60 + 0.85 − 0.95
= 0.5
b) What is the probability that the selected student is awake in both courses?
𝑃 𝑆 ' ∩ 𝑀' = 𝑃 𝑆 ∪ 𝑀 '
=1−𝑃 𝑆∪𝑀
= 1 − 0.95 = 0.05
Exercise Danielle, a health worker, is studying the prevalence
of certain diseases in a particular community. Based
08 on her previous studies, she came up with the
following figures: 10% of the people in the
community will contract disease A sometime during
their lifetime; 25% will contract disease B; and 5%
will contract both diseases. Find the probability that
a randomly selected person from this community
will contract:
a. At least one of the 2 diseases
b. Disease B but not disease A
c. Exactly one of the 2 diseases
Independent Events
Two events A and B are said to be independent events if and only if
𝑃 𝐴 ∩ 𝐵 = 𝑃 𝐴 × 𝑃(𝐵)
𝑃 𝐴∩𝐵
“Walang kinalaman sa = 𝑃 𝐴 𝑃(𝐵)
isa’t isa” Independent
Example
The probability that a Japanese industry will put up a plant in Cebu is 0.7. The
probability that it will put up a plant in Bataan is 0.3, and the probability that it will put
up a plant in at least one of the two provinces is 0.79.
Define the following events:
C = event that a Japanese industry will locate in Cebu
B = event that a Japanese industry will locate in Bataan
Are C and B independent events?
Solution:
Given: 𝑃(𝐶) = 0.7, 𝑃(𝐵) = 0.3, and 𝑃(𝐶 ∪ 𝐵) = 0.79. We compute for 𝑃(𝐶 ∩ 𝐵)
𝑃 𝐶 ∩ 𝐵 = 𝑃 𝐶 + 𝑃 𝐵 − 𝑃 𝐶 ∪ 𝐵 = 0.7 + 0.3 − 0.79 = 0.21
Now, 𝑃 𝐶 𝑃 𝐵 = 0.7 0.3 = 0.21.
Since 𝑃 𝐶 ∩ 𝐵 = 0.21 = 𝑃 𝐶 𝑃(𝐵), then C and B are independent events.
Exercise At the annual Idol Star Athletics Championships,
three idols compete in the Team Archery. Each
09 idol fires one shot at the target in an attempt to
hit the bullseye. Let 𝐴> denote the event that
person j hits the bullseye, j=1,2,3. If 𝐴? , 𝐴% , and
𝐴$ are independent with 4𝑃(𝐴? ) = 𝑃(𝐴% ) =
2𝑃(𝐴$ ) = 0.8 , obtain the probability that
a) all of them hits the bullseye
b) exactly one of them hits the bullseye
02 Random Variable and
its Distribution
Random Variable
Definition
A function whose value is a real number that is determined by each sample point in
the sample space is called a random variable.
An uppercase letter, say X, will be used to denote a random variable and its
corresponding lowercase letter, x in this case, will be used to denote one of its
values.
X(•): 𝛀 → ℝ
REMARKS
o The use of the term variable is consistent with the way we use this word in
mathematics and the way we defined it in Stat 114 as the characteristic of
interest whose value varies.
o The addition of the term “random” emphasizes the requirement that the realized
or actual value of the random variable depends on the outcome of a random
experiment.
(Absolutely)
Discrete
continuous
random
random
variable
variable
Discrete Random Variable
If a sample space contains a finite number of sample points or has as
many sample points as there are counting/natural numbers, then it is
called a discrete sample space.
A random variable defined over a discrete sample space is called a
discrete random variable.
𝛀 = 𝟏, 𝟐, … , 𝒏 𝛀 = {𝟏, 𝟐, 𝟑, … }
Discrete Random Variable
Here are some examples of discrete random variables:
1) number of people vaccinated in a random day
2) number of heads in 5 tosses of a coin
3) number of fake news shared in a random day
4) number of COVID-19 active cases in a random week
5) number of Bondee friends of a random person
Probability Mass Function
The probability mass function (PMF) of a discrete random variable,
denoted by f(.), is a function defined for any real number x as
𝑓(𝑥) = 𝑃(𝑋 = 𝑥)
The values of the discrete random variable X for which f(𝑥) > 0 are
called its mass points.
We can use the probability mass function to compute for probabilities expressed in
terms of X. Also, we can use it to compute for important summary measures like the
mean and standard deviation.
Example
Define X = number of voters who will elect Renzo.
{A1,A2,A3} {A1,A2,A4} {A1,A2,A5} {A1,A2,A6} {A1,A3,A4}
3 3 2 2 3
{A1,A3,A5} {A1,A3,A6} {A1,A4,A5} {A1,A4,A6} {A1,A5,A6}
2 2 2 2 1
{A2,A3,A4} {A2,A3,A5} {A2,A3,A6} {A2,A4,A5} {A2,A4,A6}
3 2 2 2 2
{A2,A5,A6} {A3,A4,A5} {A3,A4,A6} {A3,A5,A6} {A4,A5,A6}
1 2 2 1 1
x 1 2 3
f(x) 1/5 3/5 1/5
Valid PMF
We must always check if a probability mass function (PMF) is
valid or not.
A valid PMF must satisfy the following properties:
üIt is greater than 0 when it is evaluated at the mass points.
üThe sum of its values evaluated at all mass points is equal to
1.
Using the PMF to compute probabilities
x 0 1 2 3 4 5 6 7 8 9
f(x) 0.08 0.09 0.09 0.14 0.18 0.16 0.09 0.06 0.06 0.05
Unlike the discrete random variable, the continuous random variable has infinite mass points. It is not defined as P(X=x). However, we still
consider the PDF as the counterpart of the PMF for the continuous case because we will also use it to compute probabilities and summary
measures.
Graph of the PDF
o The graph of the PDF is always above
the 𝑥 -axis because the function
X=a X=b
cannot take on negative values.
f(x)
o If we remove the lines 𝑥 = 𝑎 and 𝑥 = 𝑏
and measure the whole area below P(a ≤ X ≤ b)
𝑓(𝑥) and above the 𝑥-axis, this area is
always exactly equal to 1.
Note that the discrete random variable doesn’t share this property.
Graph of the PDF
o P(X=a) is just the same as X=a X=b
P(a≤X≤a). In this case, we
will let b=a. Then, the area f(x)
representing P(X=a) will be
P(a ≤ X ≤ b)
0 because we will only be
left with a single line.
a b
Example
The PDF of a continuous random variable, X, is given by:
0.25, 3<𝑥<7
𝑓 𝑥 =R
0, 𝑒𝑙𝑠𝑒𝑤ℎ𝑒𝑟𝑒
Find the following probabilities:
a. P(4 < X < 6.5) = (length)(width) = (0.25)(2.5) = 0.625
b. P(X ≥ 5) = (0.25)(2) = 0.5
Notes about the PMF and PDF
o The PMF of the discrete random variable and the PDF of the continuous random
variable are what we refer to as the distribution of the random variable.
o The distribution of the random variable X provides us with complete information
about the behavior of the random variable X.
o Although we cannot predict with certainty what the realized value of the random
variable X will be, we can use its distribution to compute for the probability of any
event expressed in terms of the random variable X. We will learn how to do this in
Stat 121. In Stat 115, we will use the CDF of a continuous random variable to
compute for probabilities.
Notes about the CDF
1, 𝑥≥7
F 𝑥 = C0.25𝑥 − 0.75, 3≤𝑥<7
0, 𝑥<3
Find the following probabilities using this CDF.
a) P(X≥5)
𝑃 𝑋 ≥5 =1−𝐹 5
= 1 − 0.25 5 − 0.75
= 0.5
b) P(4<X<6.5)
𝑃 4 < 𝑋 < 6.5 = 0.25 6.5 − 0.75 − 0.25 4 − 0.75
= 0.625
Exercise The CDF of a continuous random variable X is as
follows:
11 1, 𝑤ℎ𝑒𝑛 𝑥 ≥ 1
𝐹 𝑥 = W𝑥 I, 𝑤ℎ𝑒𝑛 0 ≤ 𝑥 < 1
0, 𝑤ℎ𝑒𝑛 𝑥 < 0
Find the following probabilities using the CDF above:
a) P(X > 0.25)
b) P(0.3 < X < 0.7)
c) P(0.4 ≤ X < 1.25)
d) P(X ≤ 0.5)
Expected Value of X
Let X be a discrete random variable with probability mass function (PMF):
x x1 x2 … xn
f(x)=P(X=x) f(x1) f(x2) … f(xn)
x 2 4 6 8 10 12
f(x)=P(X=x) 1/25 3/25 8/25 6/25 5/25 2/25
12 x
f(x)=P(X=x)
-1 0 1 2
1/10 2/10 5/10 2/10
Reading
Assignment
Properties of the Mean
o 𝐸(𝑋 − 𝜇) = 0
o 𝐸(𝑎𝑋) = 𝑎𝐸(𝑋)
o 𝐸(𝑏) = 𝑏
o 𝐸(𝑋 + 𝑌) = 𝐸(𝑋) + 𝐸(𝑌)
o 𝐸(𝑋 – 𝑌) = 𝐸(𝑋) – 𝐸(𝑌)
o If X and Y are independent random variables: 𝐸(𝑋𝑌) = 𝐸(𝑋)𝐸(𝑌)
Properties of the Variance
Let a and b be constants
o 𝑉𝑎𝑟 𝑎𝑋 = 𝑎% 𝑉𝑎𝑟 𝑋
o 𝑉𝑎𝑟 𝑏 = 0
o 𝑉𝑎𝑟 𝑎𝑋 + 𝑏 = 𝑎% 𝑉𝑎𝑟 𝑋 since variance is not affected by the addition/subtraction of a constant.
The random variable in a binomial experiment is the number of times a success has
occurred in a total of 𝒏 trials.
Example X = number of items correctly answered out of 10 items
Y = number of games won out of 8 games
Example
Tossing a coin 20 times to see how many tails occur.
a) It consists of observing the outcomes of a sequence of 20 tosses
(trials).
b) Each toss (trial) can result in one of only 2 possible outcomes:
heads or tails
c) The probability of success is the same for each trial which is 0.5.
d) The trials are independent. Even if you get heads or tails for the
previous tosses, the probability of a success (getting heads or tails)
is still 0.5.
Binomial Distribution
A discrete random variable X is said to follow a Binomial distribution if its probability mass
function (PMF) is given by
&
𝑝V 1 − 𝑝 &WV , 𝑥 = 0,1,2, … , 𝑛
𝑓 𝑥 =a V
0, 𝑒𝑙𝑠𝑒𝑤ℎ𝑒𝑟𝑒
where n and p are the parameters of the distribution, p is the probability of success (any
value between "0" and "1"), and n is the number of trials (any positive integer).
Probability
of success
15 , 1 − 0.25 -./, ,
𝑝 𝑥 =X 𝑥 (0.25) 𝑥 = 0,1,2, … , 15
0, 𝑒𝑙𝑠𝑒𝑤ℎ𝑒𝑟𝑒
a) What is the probability that the student will get a perfect score?
15
𝑃 𝑋 = 15 = (0.25)-. 1 − 0.25 -./-. = 0.00000000093
15
b) What is the probability that the student will get at least 3 correct answers?
𝑃 𝑋 ≥3 =1−𝑃 𝑋 <3 =1− 𝑃 𝑋 =0 +𝑃 𝑋 =1 +𝑃 𝑋 =2
= 1 − [𝑝 0 + 𝑝 1 + 𝑝 2 ] = 1 – (0.01336 + 0.0668 + 0.1559) = 0.7639
c) What is the student’s expected number of correct answers?
The expected number of correct answers is 𝑛𝑝 = (15)(0.25) = 3.75 𝑜𝑟 4
Exercise A fair die is rolled and the number of the dots on the face
that comes up is observed. What is the probability that
13 the number of dots is a perfect square (1 or 4) in 3 out of
the 6 tosses?
Exercise Westex Oil Corporation spends an extensive amount of
time, effort, and money in searching for oil. On the basis
14 of past experience, the firm has 15% rate of success of
striking oil in each drilling operation and the results of
the drilling operations are independent of each other. In
the year 2013, the corporation plans to drill 20 new
wells. What is probability that they will strike oil in at
least 3 of the 20 drilling operations in 2013?
04 Normal Distribution
Normal Distribution
A continuous random variable X is said to be normally distributed if its probability
density function (PDF) is given by
𝟏 𝟏 𝑿#𝝁 𝟐
#
𝒇 𝒙 = 𝒆 𝟐 𝝈
𝝈 𝟐𝝅
for any real number x. The constants, μ and σ2, are such that μ is any real number
and σ2>0. The values, e and π, are mathematical constants, wherein, e=2.71828
and π=3.14159.
Many consider the normal distribution as the most important distribution in Statistics. And we will see why...
Graph of the Normal Distribution
𝑋 ~ 𝑁(𝜇, 𝜎 ! ) → 𝑌 ~ 𝑁(𝑎𝜇 + 𝑏, 𝑎! 𝜎 ! )
)#*
Classic Example: 𝑍 = ~𝑁(0,1)
+
Property of the Normal Distribution
o Any random variable X that follows a normal distribution with mean µ and
variance s2 can be transformed into a standard normal random variable Z with
mean 0 and variance 1.
o The transformation is the familiar formula that we use to compute for the z-score
in Stat 114:
𝑋−𝜇
𝑍=
𝜎
o If X~N(μ,σ2), then
𝑋−𝜇 𝑎−𝜇 𝑎−𝜇
𝑃 𝑋≤𝑎 =𝑃 ≤ =𝑃 𝑍≤
𝜎 𝜎 𝜎
where Z is a standard normal random variable.
Example
Suppose X~Normal(μ=5, σ2=4).
JW) eW)
a) 𝑃 𝑋 ≤ 6 = 𝑃 ≤ = 𝑃 𝑍 ≤ 0.5 = Φ 0.5 = 0.6915
L L
𝑋−𝜇
𝑃 𝜇 − 2𝜎 < 𝑋 < 𝜇 + 2𝜎 = 𝑃 −2𝜎 < 𝑋 − 𝜇 < 2𝜎 = 𝑃 −2 < <2
𝜎
= 𝑃(−2 < 𝑍 < 2) = Φ(2) − Φ(−2) = 0.9772 − 0.0228 = 0.9544
𝑋−𝜇
𝑃 𝜇 − 3𝜎 < 𝑋 < 𝜇 + 3𝜎 = 𝑃 −3𝜎 < 𝑋 − 𝜇 < 3𝜎 = 𝑃 −3 < <3
𝜎
= 𝑃(−3 < 𝑍 < 3) = Φ(3) − Φ(−3) = 0.9987 − 0.0013 = 0.9974
za Value
The value za (read as “z sub alpha”) satisfies the condition that P(Z > za) = a.
This is equivalent to saying that P(Z ≤ za) = 1-a
a a
1-2a
-za za
za Values: Bottom of Table B.1
Example:
z0.05= 1.645
P(Z > 1.645) = 0.05 , P(Z < 1.645) = 0.95
P(Z < -1.645) = 0.05 , P(-1.645 < Z < 1.645) = 0.90.
0.05 0.05
0.90
-1.645 0 1.645
Importance of the Normal Distribution
𝑋 − 5 7.5 − 5
𝑃 𝑋 > 7.5 = 𝑃 > = 𝑃 𝑍 > 2.5
1 1
= 1 − Φ(2.5) = 1 − .9938 = 0.0062.
Example
Exercise 3 (page 373)
Suppose that the IQ’s of applicants of a certain science high school follow a
normal distribution with mean of 120 and a standard deviation of 9.
a. One of the requirements of the school in accepting a student is that
the student’s IQ must be at least 115. What proportion of the
applicants will be rejected on the basis of their IQ?
Let X be the IQ of a selected applicant
Given: X~Normal(µ=120, s2=92)
Example
Find P(X<115).
𝑋 − 120 115 − 120
𝑃 𝑋 < 115 = 𝑃 <
9 9
5
= 𝑃 𝑍 < − ≈ 𝑃 𝑍 < −.56
9
= Φ −0.56
= 0.2877
Example
Suppose X~Normal(µ=10, s2=25). Find the 99th percentile.
99
𝑃 𝑋≤𝑐 = = 0.99
100
𝑐 − 10
𝑃 𝑍≤ = 0.99
5