STA110 Lecture Notes - 100436
STA110 Lecture Notes - 100436
PART 3
Lecture Notes
◆ Introduction to Probability
◆ Use of Permutation and Combination in
Evaluating Probability
◆ Sampling from Tables of Random Numbers
◆ Types and Distribution of Random Variables
C. O. Odijie
Department of Statitsics
University of Benin
Benin City
Nigeria
© 2021 C. O. Odijie
.
© 2021 C. O. Odijie
All Rights Reserved.
Email: [email protected]
Special Features
◆ The soft copy of this document is very flexible to navigate via links and
cross-references. Some are highlighted for quick reference. Feel free to click
any of them and return to your original position with keyboard shortcut
Alt+ i.e. Alternate + left arrow key if you are using a PDF reader that
supports the shortcut.
◆ There are copious examples on each topic and/or sub-topic.
◆ There are a good number of tutorial questions for practice under each topic
or after a group of sub-topics.
◆ There are special KEYNOTES to help summarise or emphasize important
points to note.
◆ Separated definitions, remarks, etc., which are numbered within sections e.g.
Remark 3.1 means “Remark 1 of section 3” for easy cross-referencing.
Literature
The student is encouraged to consult good textbooks and materials e.g. those by
Kroese and Chan (2014), Kroese (2018) and Blitzstein and Hwang (2019) among
others. See bibiography for details.
CONTENTS CONTENTS
Contents
Introduction to Probability 6
1 Random Experiment 6
1.1 Sample Space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2 Events . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.3 Events as Set Operations . . . . . . . . . . . . . . . . . . . . . . . . 9
1.4 Mutually Exclusive Events . . . . . . . . . . . . . . . . . . . . . . . 9
1.5 Mutually Exhaustive Events . . . . . . . . . . . . . . . . . . . . . . 9
1.6 Independent Events . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.7 Dependent Events . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Tutorial Questions Set 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2 Probability 12
2.1 Classical Definition of Probability . . . . . . . . . . . . . . . . . . . 12
2.2 Probability of Equally Likely Events . . . . . . . . . . . . . . . . . 12
2.3 Addition Law of Probability . . . . . . . . . . . . . . . . . . . . . . 15
2.4 Multiplication Law of Probability . . . . . . . . . . . . . . . . . . . 15
2.5 Conditional Probability . . . . . . . . . . . . . . . . . . . . . . . . . 16
Tutorial Questions Set 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Bibliography 56
Appendix 57
Introduction to Probability
1 Random Experiment
A basic notion in probability theory is that of a random experiment. A ran-
dom experiment is one whose outcomes cannot be predetermined (i.e. out-
comes cannot be determined in advance). In other words, the outcomes depend on
chance/probability. Although the outcomes cannot be predetermeined, they can
still be analyzed because of the knowledge of their general nature.
An outcome is the result of a single trial of a random experiment. A very basic
example of a random experiment is the flip of a coin. It is a random experiment
because the outcome (a head appearing or a tail appearing) cannot be known
before the flip, even though it is general known that each outcome would always
be one of these two. You may have seen this random experiment several times
being carried out before the start of a football match. Other examples of random
experiments include:
Examples Set 1
(a) counting the number of cars that pass through UNIBEN Main Gate
between 8 a.m. and 4 p.m. in a day,
(b) randomly selecting 50 Statistics students in 100 Level in a particular
session and measuring their heights,
(c) measuring the weight of 100 randomly selected yam tubers harvested by a
farmer,
(d) tossing a fair die,
(e) measuring the amount of rainfall in Benin City in May.
The goal of probability theory is to describe these random experiments with ap-
propriate mathematical models with which they can be analyzed. The models
are made up of three building blocks: a sample space, a set of events, and a
probability. Let us examine each of these building blocks one after the other.
experiment. For example, although we cannot tell which face of a fair die would
show in a single toss, we know that it will definitely be any one of the numbers 1,
2, 3, 4, 5 or 6. This set of possible outcomes is what we refer to as sample space.
Definition 1.1 (Sample Space) The sample space Ω of a random experiment
is the set of all possible outcomes of the experiment.
Examples Set 2
The following are some examples of random experiments and their sample spaces.
1. Throw two fair dice one after the other,
Ω tp1, 1q, p1, 2q, . . . , p1, 6q, p2, 1q, p2, 2q, . . . , p2, 6q, p3, 1q, p3, 2q . . . , p3, 6q, . . . ,
p6, 1q, p6, 2q, . . . , p6, 6qu.
2. The number of arriving calls at a customer care centre during a specifi
ed time interval,
Ω t0, 1, 2, 3, . . . u, the set of non-negative integers.
3. The lifetime of an electric bulb (measured in days),
Ω R , the set of positive real numbers.
4. Flip two fair coins together,
Ω tHH, HT, T H, T T u.
5. Measure the height (in centimeters) of 10 randomly selected students,
Ω tpx1 , x2 , . . . , x10 q : xi ¡ 0, i 1, 2, . . . , 10u.
Remark 1.1 In the last example, px1 , x2 , . . . , x10 q represents the outcome that the
height of the first selected student is x1 , the height of the second selected student is
x2 , and so on, where each student’s height must be positive (denoted by xi ¡ 0, i
1, 2, . . . , 10). It is wiser to use the largest possible sample space when the limits
are not certain. This is the case of example 3. Of course, we know that an electric
bulb will not have a lifetime that spans the positive real axis (i.e. last forever) but
it is better to assign the whole set of positive real numbers since we are not certain
what the maximum life span of an electric bulb would be. A similar explanation
goes for the heights of students which may not even exceed 300cm (3 meters) in
example number 5.
1.2 Events
When the outcomes of a random experiment are considered as well-defined subsets
relative to the sample space, they are called events.
Examples Set 3
Let us consider some events in the random experiment of throwing two fair dice
given in the first example of Examples Set 2 of sub-section 1.1. First, the sample
space is given as
$ ,
'
'p q
1, 1 , p1, 2q, p1, 3q, p1, 4q, p1, 5q, p1, 6q,/
/
'
'
'
'p q
2, 1 , p2, 2q, p2, 3q, p2, 4q, p2, 5q, p2, 6q,/
/
/
/
Ω
&
p3, 1q, p3, 2q, p3, 3q, p3, 4q, p3, 5q, p3, 6q,.
'
' p4, 1q, p4, 2q, p4, 3q, p4, 4q, p4, 5q, p4, 6q,/
/
'
'
'
' p5, 1q, p5, 2q, p5, 3q, p5, 4q, p5, 5q, p5, 6q,/
/
/
/
%
p6, 1q, p6, 2q, p6, 3q, p6, 4q, p6, 5q, p6, 6q -
We define the following events.
1. The event A of obtaining a sum of face numbers equal to 12 is given as
A tp6, 6qu.
2. The event B that the second die is 2 is given as
B tp1, 2q, p2, 2q, p3, 2q, p4, 2q, p5, 2q, p6, 2qu.
3. The event C that both dice show equal face numbers is given as
C tp1, 1q, p2, 2q, p3, 3q, p4, 4q, p5, 5q, p6, 6qu
4. The event D that a sum of face numbers greater than 12 appears is the
empty set tu or ϕ. That is, an impossible event in this case.
5. The event E that a sum of face numbers less than or equal to 12 appears is
the sample space Ω itself.
Some events (called compound events) combine more than one simple events.
Examples of compound events defined on the same sample space Ω in Examples
Set 3 are given in the next set of examples.
Examples Set 4
1. The event F of obtaining a sum of face numbers equal to 12 or a sum of face
numbers equal to 8 is given as
F tp2, 6q, p3, 5q, p4, 4q, p5, 3q, p6, 2q, p6, 6qu
2. The event G that 2 appears on the second die or 1 appears on the first die
is given as
G tp1, 2q, p2, 2q, p3, 2q, p4, 2q, p5, 2q, p6, 2q, p1, 1q, p1, 3q, p1, 4q, p1, 5q, p1, 6qu.
3. The event H that the first die shows an odd number and the second die
shows an even number, and is given as
H tp1, 2q, p1, 4q, p1, 6q, p3, 2q, p3, 4q, p3, 6q, p5, 2q, p5, 4q, 5, 6qu.
Remark 1.2 Note that the events in Examples Set 3 are all simple events unlike
those in Examples Set 4 which are compound events. The event G for instance is a
union of the separate simple events “the event that 2 appears on the first die” and
“the event that 1 appears on the second die”. We can use set operations (mainly
union and intersection) to describe compound events as will be seen later. At this
point, the student is assumed to have a basic knowledge of set notations.
A typical pair of events that are both mutually exclusive and exhaustive is any
given event and its compliment (e.g. A and A1 ).
KEYNOTES 1
1. A X B ϕ but A X C ϕ and B X C ϕ,
2. pA X B 1q X pA1 X B q ϕ i.e. pA X B 1q and pA1 X B q are disjoint events,
3. A pA X B 1 q Y pA X B q & B pA1 X B q Y pA X B q ,
4. A Y B A Y pA1 X B q B Y pA X B 1 q.
These deductions will be very useful in the proof of some properties of probability
later.
2 Probability
A very important tool in the model for a random experiment is the specification
of the probability of occurence for the events in the sample space of an experi-
ment.This probability tells us how likely it is that a particular event will occur.
Definition 2.1 (Probability) A probability P is a rule or function which as-
signs a non-negative real number to each outcome/event of the sample space Ω of
a random experiment, and which satisfies the following axioms.
Axiom 1. PpAq ¥ 0 for an event A Ω.
Axiom 2. PpΩq 1.
Axiom 3. For any sequence A1 , A2 , A3 , . . . of mutually disjoint events in Ω, i.e.
Ai X Aj ϕ, for i j, we have that
8
¤ 8̧
P Ai PpAi q
i 1
i 1
npAq
PpAq
npΩq
where npAq means the number of elements (single outcomes) in A and npΩq means
the number of elements in the sample space, Ω.
If A and B are events in any sample space Ω and npAq ¥ npB q then PpAq ¥ PpB q.
npAq
The proof is directly from classical definition of probability. That is
npΩq
¥ nnppBΩqq
since in the numerators, npAq ¥ npB q for a fixed denominator npΩq
4. PpA1 q 1 PpAq.
Proof.
Clearly, B A YpA1 X B q where A and pA1 X B q are disjoint events. Thus applying
axiom 3, we have
PpB q PpAq PpA1 X B q, and hence PpAq ¤ PpB q follows, since PpA1 X B q ¥ 0.
Property 4. Now, Ω A Y A1 (students should verify this with the aid of venn
diagram, or otherwise, and also verify that A and A1 are mutually exclusive and
exhaustive with respect to the sample space Ω. See definitions 1.3 and 1.4). Since
A X A1 ϕ, applying axiom 3 yields PpΩq PpAq PpA1 q, and we have that
PpA1 q 1 PpAq because PpΩq 1 by axiom 2.
B pA1 X B q Y pA X B q (2.1)
and
AYB A Y p A1 X B q (2.2)
Now, since A and pA1 X B q are disjoint, we can apply axiom 3 to equation (2.2) to
obtain:
Next, since pA1 X B q and pA X B q are disjoint events, we apply axiom 3 to equation
(2.1) to obtain
PpB q PpA1 X B q PpA X B q,
ñ
PpA1 X B q PpB q PpA X B q (2.4)
Finally, substituting PpA1 X B q from equation (2.4) into equation (2.3), we have
PpA Y B q PpAq PpB q PpA X B q■
When the two events are mutually exclusive, that is, pA X B q ϕ implying
that PpA X B q Ppϕq 0, the addition law reduces to
where PpB |Aq is called the conditional probability that event B occurs given
that event A has occurred and is defined in definition 2.2.
Note that PpA and B q PpA X B q as may be seen in the literature. Note also that
A X B B X A by commutative property of intersection.
When events A and B are independent, the Multiplication Law reduces to
Or alternatively,
PpA X B q PpAq PpB q (2.10)
Similarly,
PpA|B q PpAq (2.13)
Examples Set 5
A. Addition Law (With Mutually- and Non-Mutually Exclusive Events)
1. You draw one card from a standard deck of cards, find the probability that
it is (a) either an ace or a diamond? (b) either a king or a queen (c) either a
card numbered 5 or a heart (d) odd-numbered card or a joker. (See appendix
for a pictoral sample of a deck of cards)
Now,
(a) PpA Y B q PpAq PpB q PpA X B q 0.6 0.4 0.24 0.76.
(b) The events A and B are not mutually exclusive. This is because their
probability of occuring together is not zero. i.e. PpA X B q 0.24 0
(c) A and B are independent. This is because PpA X B q PpAq PpB q i.e.
0.24 = (0.6)(0.4)
(d) A and B are not mutually exhaustive because PpA Y B q 0.76 1
PpΩq. Stated in words, the probability of their union is 0.76, which is
less than 1, which is the probability of the sample space Ω. It means
their union is not equal to or does not exhaust the sample space.
(e) - (h) are left as exercise for the students to tackle following (a) - (d).
3. In a class of 50 students, 27 are boys and 23 are girls. On a particular test,
14 boys and 15 girls made an A grade. If a student is chosen at random from
the class, what is the probability of choosing a girl or a student who made
an A grade?
Let G be the event of picking a girl and A be the event of picking a student
who made an A grade.Then,
PpX q 1{52.
Now,
PpX and Y q PpX q PpY |X q (multiplication rule or law)
After drawing the first card being queen of hearts, there are now 12 hearts
left in the hearts suit and 51 cards left in the pack. Hence, the probability
that the second card is a heart given that the first card is the queen of hearts
is given by
PpY |X q 12{51,
so that
PpH on the coin and 5 on the dieq PpH on the coinq Pp5 on the dieq
p1{2q p1{6q
1{12 or 0.0833.
6. A card is chosen at random from a deck of 52 cards. It is then replaced and
a second card is chosen at random. What is the probability of choosing a
joker and then a seven?
Now, the two events are independent because the first card was replaced
before choosing the second. So,
C. Conditional Probability
7. A Statistics lecturer gave his class tests on two courses. 25% of a the class
passed both tests and 42% of the class passed the first test. What percent
of those who passed the first test also passed the second test?
A little care has to be taken here not to make the mistake of evaluating
the required probability as under multpiplication law. Let’s get out the
probabilities of the various events to see a clear picture of the problem.
Let the events that the first and second tests were passed be A and B,
respectively. Then, from the problem,
PpA X B q 0.25 (% that passed both tests)
PpAq 0.42 (% that passed first test)
Clearly now, those who passed the first test, who also passed the second test
ñ those who passed the second test given/knowing that they have passed
the first test and is given by definition as
PpB |Aq PpA X B q{PpAq
Substituting values, we have:
PpB |Aq p0.25q{p0.42q 0.5952.
That is about 59.52% 60% passed the second test given that they passed
the first test.
8. The probability that it is Friday and a student is absent is 0.05. Assume
a typical 5 school days per week. What is the probability that a student
selected from attendance register at random is absent given that the day is
Friday? Interprete your result in terms of % chance.
Let F be the event that a day choosen at random is a Friday and A be the
event that a student is absent. Then
PpF q 1{5 0.2 (one of five days)
PpA X F q 0.05
The required probability is
There is 25% chance that a student selected randomly is absent given that
the day is Friday.
9. If the probability that Jane buys a new pair of shoes given that she travels
abroad is 0.75. What is the probability that Jane travels abroad and buys a
new pair of shoes if the probability that she travels abroad is 0.25.
Let B be the event that Jane buys a new pair of shoes and T be the event
that Jane travels abroad.
Given probablilities are
PpB |T q 0.75, PpT q 0.25
Required probability is PpB X T q
Using the definition of conditional probability, we have
PpB |T q PpB X T q{PpT q
That is
PpB X T q
0.75
0.25
6 PpB X T q p0.75qp0.25q 0.1875.
17. Show that PpA Y B q PpAq PpB q PpAq PpB q if A and B are independent
and
PpA Y B q PpAq PpB q PpAq PpB |Aq if A and B are dependent.
18. Show that P pA Y B q|A P pA Y B q|B 1
19. Given that X and Y are independent events and PpY |X q 0.25, Pp X q
0.65, Find PpX Y Y q
20. Given that PpS and T q .4, PpT |S q .5, find PpS q
21. *Two cards are drawn from a deck of 52 cards one after the other without
replacement. What is the probability that one is a spade and the other is a
numbered red suit card? (Hint: 1. Notice that there is no particular order
in which the two events must occur. 2. Study Figure 4 properly to get the
events correctly).
On the other hand, when the order of arrangement does not matter in a selection
from a set, we are dealing with combination. Considering the same letters as before,
there is only one way of combining the letters A, B and C, that is tA, B, C u. As
long as each of the letters appears in the selection or combination, the position
(order) in which a letter appears does not matter in the combination. Suppose
again that we are to take only two letters at a time. Then we would have AB, AC,
and BC. Obviously the reverse ordering arrangements (BA, CA, and CB) are not
necessary here. Mathematically stated,
KEYNOTES 3
n
1. A k-combination from a set of n items is written as or nC k and
k
defined as
n
k
k!pnn! kq!
2. A k-arrangement from a set of n items is written as nP k and defined
as
Pk
pn kq! npn 1qpn 2q . . . pn k 1q
n n!
So far, the objects being arranged have been distinct. If some of the set of objects
to be permuted have identical constituents, the permutation procedure will be
different as in the following example.
Example 3.1
In how many ways can the letters of the word BEER be arranged?
The word BEER has four letters and two are identical. Suppose we label the two
identical E’s as E1 and E2 . All the letters would be distinct and there will be
4
P 4 4! 24 ways of arranging the word. But since the two E’s are actually
identical, the number of permutations will be reduced because we cannot identify
which of the E’s is used where. E.g. the arrangements BE1 RE2 and BE2 RE1 , if
the E’s were labeled, would just be BERE as the E’s are actually identical. So
will every other arrangement where the two identical letters alternate positions be
reduced to just one. That is, the two E’s would be arranged in 2! ways if they were
Other forms of permutation with special restrictions like a particular letter must
be in a specific position can be carefully handled with a combination of the rules
learned so far. The student is advised to read up more about permutations and
combinations as this is onlya brief review.
Summarily,
or
n
(b) nC k
Pk
k!
4. Total arrangements of n items where p items are identicals of one kind, q
items are identicals of another kind,
n!
p!q!
5. Cyclic permutation of n distinct items is given by pn 1q!.
on permutation and combination to know the number of ways that outcomes can
possibly be obtained. Then probabilities can be evaluated as ratios of possible
outcomes to total outcomes. Permutation and combination often relate to random
experiments where drawing or selection is done without replacement. Let’s see
an example to explain this.
Example 4.1
Consider a problem of drawing three marbles from an urn containing four marbles
without replacement. In how many ways can this been done?
So, there are 24 ways of drawing drawing 3 marbles from an urn containing 4
different marbles.
Suppose we replace each marble selected at random before selecting the next one.
There would always be four marbles to pick from and the result of selecting three
marbles from the urn would be 4 4 4 64. That is there would be 43 64
ways of selecting 3 marbles from an urn containing 4 marbles if replacement is
allowed after each random selection.
NOTE: Ordered vectors such as p4, 3, 2q, p3, 2, 4q, p2, 3, 4q, . . . , are usually used to
represent permutations of a single set of combination t4, 2, 3u, say. Here, p2, 3, 4q
p4, 3, 2q p3, 2, 4q . . . being vectors but t3, 2, 4u t4, 3, 2u t2, 3, 4u . . . as
equal sets denoting a single combination where the order of arrangement of the
elements does not matter.
Examples Set 6
1. What is the probability that in 5 tosses of a fair coin we get exactly 3 Heads?
2. Draw 10 cards at random from a deck of cards. What is the probability that
we draw 3 Hearts and 5 Spades?
3. What is the probability that all 20 people in a group have different birthdays?
(Assume 365 days per year)
4. What is the probability of choosing an ATM PIN of 4 distinct digits? Assume
that every possible arrangement of four digits is eligible as an ATM PIN.
5. Find the probability that a word chosen at random from all the possible
permutations of the word AV AT AR starts with letter T
Solution
In all the the problems, let Ω denote the sample space and let A be the required
event so that npΩq is the total number of outcomes and npAq is the number of
outcome in event A.
1. Tossing a fair coin five times gives npΩq 25 32 (a coin has two sides i.e.
two possible outcomes, then five tosses)
The event A of getting exactly 3 Heads can be obtained in
5
3
10 ways.
That is, the combination tH, H, H, T,T u of 5 outcomes where 3 outcomes
must be H can be arranged in
5!
3!2!
5
3
since we have a total of 5 letters
out of which 3 H’s are identical and 2 T ’s are identical. (Refer to the last
bullet item of KEYNOTES 4)
6 PpAq 10
32
0.3125
13 13 26
6 PpAq 3
5
52
2
0.0076
10
3. The probability that out of 20 people in a group, all have different birthdays
(based on 365 days in a year) is
365
P 20
36520
0.5886. How?
First, taking 365 days in a year, the first person can have any of the days as
birthday. Next, for the second person to not share the first person’s birthday,
we must “remove” the first person’s birthday from the days of the year.The
the second person now has 364 days left. This reasoning carries through to
the 20th person who now has 346 days left to have a birhtday if it must not
“clash” with the previous 19 birthdays and we have:
365 364 363 362 346
Pp20 people have different birthdaysq
365 365 365 365 365
365
The numerator is clearly P 20 while the denominator is 36520
4. npΩq 104 . One way of seeing this clearly is the usual
(number of outcomes for each trial of the experiment)number of trials .
That is (10 digits to pick from)4 slots or trials .
Another way is to list all the possible outcomes (PINs) and see how many
are there. That is, how many numbers are there from 0000, 0001, 0003, . . . ,
9999? The answer is 9999 0000 1 10, 000 104 numbers because any
sequence of consecutive natural numbers in ascending order a1 , a2 , . . . , an has
pan a1 1q terms. e.g. 4, 5, 6, 7, 8 has 8 4 1 5 terms which can easily
be verified here by counting them.
For the required event A, there are 10 digits from which 4 distinct digits
can be selected in this problem and the order of the digits matters, i.e.
select four distinct digits and then arrange into all possible 4! ordered forms.
For instance (4,1,3,5) will be a different PIN from (1,4,5,3), both having four
distinct digits as required. So the outcomes in event A involves a permutation
problem. See again summary “Note” of KEYNOTES 4.
So, npAq 10P 4
5040
6 PpAq 10000
5040
0.5040
(Source: https://mathbitsnotebook.com/Algebra2/Statistics/STrandomtable.html)
of such computer packages include Statistical packages like Minitab, SPSS, R, and
others such as MATLAB, Mathematica, etc.
Examples Set 7
1. Simulate a random experiment of flipping a coin 10 times. Start from
column 1, entry 2 of Table 1 and consider only the first digit of each group
along the columns.
2. Simulate an experiment of rolling a fair die 10 times. Start selection from
row 3, group entry 4 and sample row-wise, considering all digits in each
group.
Solution
1. Let 1 represent a head and let 0 represent a tail. We are required to start
from column 1, entry 2. That is the entry group 46132 in Table 1 and we are
to consider only the first digit of each column. Reading downwards and then
from up to down the next column (column 2), etc, (up to the 4th column in
this example) we have the result as follows:
1, 0, 0, 1, 0, 1, 0, 0, 0, 0 = H, T, T, H, T, H, T, T, T, T
2. We are to start sampling from row 3 and entry 4, sampling along rows and
considering all digits (not only first digits of groups).
Now, the 3rd row from 4th entry reads: 97402 27569 90184 . . .
We are picking any of the numbers 1, 2, 3, 4, 5, or 6 to represent the face
numbers of a fair die till we pick the 10th eligible number from all the digits
of each 5-digit group.
Our result is: 4, 2, 2, 5, 6, 1, 4, 2, 3, 3
This second example buttresses Remark 5.1.
Assignment
1. A teacher wishes to to measure the heights of 8 pupils randomly selected
from his class. Design and perform an experiment, with the aid of a random
number table, to select the pupils to partake given that there are 30 pupils
in the class.
2. Perform the experiment in question 1 of Examples Set 7 ten times, using
different starting points and sampling direction of your choice. Take record
of the number of heads xi in the ith experiment. Then find the mean of the
number of heads in the i 10 experiments. Is the mean close to 0.5?
If you can perform the experiment a large number of times, say N ¥ 100,
1 ¸
N
your mean number of heads xi will be 0.5 for sure (i.e. the coin
N i1
turns up heads about np n = half of the total number of times the coin
1
2
was thrown in each experiment in the long run of N experiments). This is a
practical demonstration of the expectation of a binomial random variable X
(representing the number of heads in the throw of a fair coin some number
of times, n, in this case).
◆ normal distribution
◆ exponential distribution
◆ Student’s t distribution
◆ chi-square distribution
X ∽ N pµ, σ 2 q
»x
PpX ¤ xq F pxq f ptqdt (6.2)
8
So, for X ∽ N pµ, σ 2 q in our case, the CDF is given by
» x
# +
PpX ¤ xq F pxq ?1 2 σ µ
1t 2
exp dt (6.3)
σ 2π
8
It is not easy to integrate equation 6.3 to compute probabalities for values of X in
a given interval. So, the normal random variable X is usually standardized using
the following transformation
Z X σ µ (6.4)
Z ∽ N p0, 1q
is called the standard normal random variable with mean µ 0 and variance
σ 2 1. Its probability distribution function is obtained by substituting σ 1 and
the tranformation 6.4 into the PDF in equation 6.1 and is given by
f pz q ? 12 z2 8 8
1
exp , z (6.5)
2π
The CDF of the the standard normal random variable Z is thus given by
» z
Φpz q ?1 exp 21 t2 dt (6.6)
2π
8
The CDF of Z in equation 6.6 is clearly simplier than that of X in 6.3. Infact,
cumulative probabilities PpZ ¤ z q Φpz q (see figure 3) for different values of z
have been computed and tabulated in standard normal tables popularly called the
Z-score table or Z table.
Given a problem under normal distribution therefore, it is better to tranform to
X µ
standard normal distribtuion via Z before solving, since we cannot solve
σ
the the normal CDF algebraically or analytically.
6.3.3 How to read a standard normal table for z-value given Φpz q
Suppose we are to find the z-value up to which area under the standard normal
curve from 8 is Φpz q 0.9307. Since 0.9307 ¡ 0.5, we search under the positive
Z-scores for the given probability, 0.9307. We can only find a value close enough
in this example and that is 0.9306. So we trace to the left of the table and to the
top to get 1.4 and 0.08 respectively (see Table 2). We add these two values to get
the required z-value, i.e. 1.48. This value is sometimes written as Φ1 pz q. Hence
Φ1 p0.9307q 1.48.
Examples Set 8
1. Find Pp1 ¤ X ¤ 4q given that X ∽ N p2, 25q.
2. Given that X ∽ N p6, σ 2 q and PpX ¡ 16q 0.0228, find σ 2 of X.
3. The scores in a test are estimated to be randomly distributed with mean 55
and variance 36. If the cut-off mark if set at 60, what percentage of
students who took the test scored below the cut-off mark?
4. Find k such that Pp|Z | ¤ k q 0.95 where Z ∽ N p0, 1q
5. Find µ and σ 2 given that PpX ¤ 2q 0.5 and PpX ¥ 2q 0.7881 for a
normal random variable X.
Solution
1. µ 2, σ 2 25 ñ σ 5, Z X σ µ
Hence,
¤ 4q P 1 5 2 ¤ Z ¤ 4 5 2
Pp1 ¤ X
Pp0.2 ¤ Z ¤ 0.4q
Φp0.4q Φ p0.2q
Φp0.4q 1 Φp0.2q (property 4 applied)
Φp0.4q Φp0.2q 1
0.6554 0.5793 1
6 Pp1 ¤ X ¤ 4q 0.2347
2. µ 6, σ2 ? Pp X ¡ 16q 0.0228, Z X σ µ .
Thus,
¡ 16 σ 6 0.0228
Pp X ¡ 16q P Z
1P Z 0.0228
10
σ
10
1Φ 0.0228
σ
10
Φ
σ
0.9772
¤ 2q P Z ¤ 2 σ µ 0.5 and
5. PpX
6 2 µ 0 and 2 µ 0.8
σ σ
ñ µ 2 and σ 5
So, mean µ 2 and variance σ 2 25
(c) Find the probability that a car chosen at random on the road travels
with a speed between the mean speed and the speed limit.
3. The ages Y of pupils in a class are normally distributed as Y ∽ N p10, 4q.
Find the probability that the age of a pupil selected at random
(a) exceeds the mean age of pupils in the class
(b) lies within 2σ of the mean age.
(c) is above 12 years.
4. Suppose the weight of male adults follows a normal distribution with mean
weight 70 Kg and variance 49. Find the probability that a male adult selected
at random
(a) weighs at least 60 Kg
(b) weighs less than 60 Kg
(c) weighs between 90 Kg and 98 Kg
1. Mean EpX q
1
λ
2. Variance VpX q
1
λ2
Example 6.1 Given that jobs arrive at a business center at the rate of 2 jobs
per minute on the average, what is the probability that the center waits less than
30 seconds for the next job? What is the maximum waiting time between two job
arrivals with 95% confidence?
λ 2 (rate of job arrival is 2 jobs per minute on the average). Waiting for 30
seconds waiting for 0.5 minutes.
Let T be the wating time between two jobs. The PDF of T is given by
f pt; λq 2e2t , t ¥ 0
and then, the required probabilty is
» 0.5
PpT ¤ 0.5q 2e2tdt
0
1 e2p0.5q
1 e1
0.6321
Let t* be the maximum wait time between two jobs with 95% confidence. Then
probability that the waiting time T does not exceed t* with 95% confidence is
given by
Pp T¤ tq 0.95
1 e2t 0.95
e2t 0.05
2t lnp0.05q
lnp0.05q
t
2
1.5 minutes
This means that the business center is 95% confident that it will not wait more
than 1 minute 30 seconds between any two jobs.
Find,
(a) The expected lifetime of the device
(b) the lifetime L which a typical device is 70% certain to exceed
(c) If five devices are to a dealer, find the probability that at least one of them
will have a lifetime less than L.
Solution
f ptq et , t ¥ 0 implies that λ 1. Hence,
(a) Expected value EpT q 1{λ 1
(b) We are to find L such that PpT ¡ Lq 0.7
That is,
1 PpT ¤ Lq 0.7
1 p1 eL q 0.7
eL 0.7
6 L lnp0.7q
0.3567 years
This means that with 70% confidence, the least lifetime of a typical device
is 0.3567 years.
(c) Assuming that the lifetime of each device is independent of another’s, then
Ppa device’s lifetime is not Lq PpT ¥ Lq 0.7
and by independence,
Now,
Ppat least one of five devices’ lifetime is Lq
1 Ppnone of five devices’ lifetime is Lq
1 p0.7q5
0.8319
q 1 p or p q 1
We write X ∽ B pn, pq to mean that the random variable X follows the binomial
distribution with parameters n and p.
The mean of a binomial variable X is given by EpX q np and the variance,
VpX q npp1 pq or npq where q 1 p
Example 6.3
According to a consumer survey in a village 15% of young adults (18 -24 years of
age) own a mobile phone with internet access . In a random sample of 30 young
adults from the village, let X be a number who own a mobile phone with
internet access. If X follows a binomial distribution,
(a) what is the value of p?
(b) What is the probability that exactly half own a mobile phone with internet
access?
(c) What is the probability that they all have a mobile phone with internet
access?
Solution
(a) Let X be the number of young adults chosen at random who have a mobile
phone with internet access. Clearly, either a young adult selected at
random has a mobile phone with internet access or does not and 15%
belong to those who have. Therefore the probability of success p 0.15.
(b) Half of 30 young adults selected is 15.
Pp X 15q 30 p0.15q15p0.85q15
15
5.93 106
(b) All of them.
PpX 30q 30 p0.15q30p0.85q0
30
1.92 1025
Example 6.4
Let us revisit problem 1 of Examples Set 6.
The probability of obtaining exactly 3 Heads in 5 tosses of a fair coin can be
evaluated with the binomial distribution where p 1{2 per trial and n 5 trials,
so that given H as the random variable representing number of heads in 5 tosses
of a fair coin, we have
PpH 3q p0.5q3p0.5q2
5
3
53 p0.5q5
0.3125
which is the same as the result in solution of problem 1 of Examples Set 6
Example 6.5
In a survey with questionnaires, 25% of the respondents answered a YES to a par-
ticular question. 5 questionnaires are picked at random. What is the probability
that (a) at least 3 of the respondents answered a YES to the question (b) exactly
2 answered a YES to the question (c) none answered a YES to the question (d)
all five respondents answered a YES to the question (e) at least one respondent
answered a YES to the question.
Let X denote the number of respondents chosen at random who answered a YES
to the question. Now,
p 0.25 ñ q 1 p 0.75, n 5, PpX kq p p1 pqnk
n k
k
where
e 2.71828 . . . is the Euler’s number
k is the number of occurrences
k! k pk 1qpk 2q 2 1 is the factorial of k.
Interestingly, the mean and variance of the Poisson distribution are equal
and the same as the rate paramater λ. That is
EpX q VpX q λ, for a Poisson random variable, X.
In this topic we shall write X ∽ P oispλq to mean X follows a Poission distribution
with mean λ.
Example 6.7 The number of accidents per week in a particular district is known
to follow a Poisson distribution with rate parameter 0.5. Find the probability
that
(a) in a particular week there will be:
(i) less than 2 accidents,
(ii) more than 2 accidents,
(b) in a three week period there will be no accidents
Let Y be the number of accidents in one week. Then Y ∽ P oisp0.5q with PMF
given by
k 0.5
PpY kq 0.5 k!e , k P t0, 1, 2, . . . u
(a) (i)
PpY 2q PpY ¤ 1q
PpY 0q PpY 1q
0 0.5
0.51 e0.5
0.5 0!e 1!
0.6065 0.3033
0.9098
C. O. Odijie STA110 (Introduction to Statistics) Lecture Notes Page 50
Random Variable Tutorial Questions Set 8
(ii)
PpY ¡ 2q 1 PpY ¤ 2q
1 rPpY 0q Pp Y 1q PpY 2qs
0.50 e0.5 0.51 e0.5 0.52 e0.5
1 0! 1! 2!
1 r0.6065 0.3033 0.0758s
0.0144
(b) The time interval specified is one week. Since events in a Poisson distribution
are assumed to be independent,
Ppno accident in 3 weeksq Ppno accident in one week Ppno accident in one week
Ppno accident in one week
PpY 0q
3
p0.6065q3
0.2231
Let a success be the event that an item drawn belong to K. Then the distribu-
tion of the number of successes X is called the hypergeometric distribution with
parameters N , K, n.
Stated differently, the hypergeometric distribution describes the probability of ob-
taining k successes in n random draws from a population of size N wherein K
items are of the success state or feature.
A random variable X follows the hypergeometric distribution if its PMF is given
by
K N K
nk
PpX kq P t0, 1, 2, . . . , nu
k
, k (6.12)
N
n
where,
N is the population size,
K is the number of success states in the population,
n is the number of draws/trials.
k
is the number of observed successes,
n
is a binomial coefficient (defined here in subsection 6.5).
k
The hypergeometric distribution is the “drawing without replacement” counterpart
of the binomial distribution.
The mean and variance of the hypergeometric distribution with parameters N , K,
n are given, respectively, as
nK pN K qpN nq
EpX q and VpX q
nK
N N 2 pN 1q
Example 6.8 A bag contains 5 red balls and 4 blue balls. 3 balls are selected
randomly one after the other from the bag without replacement. What is the
probability of getting exactly 2 red balls?
Success here is picking a red ball. Hence,
K 5, n 3 N 9
Let X be the number of red balls selected in 3 trials. The PMF of X is given by
5 4
3k
PpX kq P t0, 1, 2, 3u
k
, k
9
3
5 4
32
6 PpX 2q 2
9
3
5 4
29 1 Ñ p:q
3
0.4762
Let us demonstrate how the use of permutation and combination from section 3
leads to this beautiful and easy-to-use PMF of hypergeometric distribution with
this example.
Now we first find all possible permutaions of the 3 balls drawn i.e. ordered
arrangements of the balls as follows: (let R = red and B = blue)
RRR RRB RBR BRR RBB BRB BBR and BBB
We can see that 2 red balls can be obtained in any of the following 3 ways
RRB or RBR or BRR
Recall that there is no replacement and so the events are dependent. Each
selection reduces the total number in the bag as well as the number of balls of
the type selected. For instance RRB means first pick is red (with prob = 5/9)
and second pick is red which was first picked (with prob = 4/8) and third pick is
blue not yet picked (with prob = 4/7). Continuing in this manner we evaluate
the conditional probabilities of the boxed expression as follows:
5 4 4 4 5 4
9 8 7
5 4 4
9 8 7
9 8 7
4 9 5 8 4 7 4 9 5 8 4 7
54
987
4
54
4 987 3
p4 3q 9 5 8 4 7
5
p4P 2q 9PP 2
3
Now, the term in () that is different from the result (:) of our example is
C2
4 2!2!
.
3!
Evaluating this term gives
4! 2!2!
4!
4P 1 4C 1
2!2! 3!
3!
55
BIBLIOGRAPHY BIBLIOGRAPHY
Bibliography
Blitzstein, J. K. and Hwang, J. (2019). Introduction to Probability (Second
Edition). Chapman & Hall/CRC, Boca Raton.
Kroese, D. and Chan, J. (2014). Statistical Modeling and Computation. Springer,
New York.
Kroese, D. P. (2018). A short introduction to probability.
https://people.smp.uq.edu.au/DirkKroese/asitp.pdf.
Appendix
2. a. t1H, 2H, 3H, 4H, 5H, 6H, 1T, 2T, 3T, 5T, 4T, 6T u b. tu or ϕ c. t6H, 6T u
d. (i) t1H, 1T, 2H, 2T, 3H, 3T, 5H, 5T u (ii) t3H, 3T, 5H, 5T u (iii) t2H, 2T, 4H, 4T, 6H, 6T u
(iv) t1H, 1T, 4H, 4T, 6H, 6T u (v) t1H, 1T, 2H, 2T, 4H, 4T, 6H, 6T u (vi) t4H, 4T, 6H, 6T u
e. (i) The event that an odd-numbered face or a prime-numbered face ap-
pears on the die. (ii) The event that a face both odd-numbered and prime-
numbered (i.e. odd-prime numbered face) appears on the die (iii) The event
that an odd-numbered face does not appear on the die. (iv) The event that
a prime-numbered face does not appear on the die. (v) The event that a
face both odd-numbered and prime-numbered does not appear on the die.
(vi) The event that an odd-numbered face or a prime-numbered face does
not appear on the die.
c c
¤
n £
n £
n ¤
n
4. Ai Aci and AciAi
i i i i
Alternatively, in expanded form: pA1 Y A2 Y Y An qc Ac1 X Ac2 X X Acn
and pA1 X A2 X X An qc Ac1 Y Ac2 Y Y Acn
Tutorial Questions Set 2 (pp. 21-23)
2 1 3 2 2 1 1 1 4
1. 0.26 2. 3. 4. 5. 6. 0.82 7. 8. 9. 10. 18% 11. 12.
7 2 5 3 7 49 256 221 663
16. PpAq 19. 0.7375 20. 0.8 21.
1 8 1 3
13. 14. 15.
114 16575 783 17
x
57
exp 5722
(b) PpX xq , x P t0, 1, 2, . . . u (c) i. 0.0750
57 22
4. Yes 5. (a)
22 x!
ii. 0.2691 iii. 0.4793 6. (a) 0.1647
©2021 C. O. Odijie
All Rights Reserved.
Email: [email protected]