Probability Distributions
Probability Distributions
Probability Distributions
Title
Example
For instance, in an experiment of tossing a coin thrice and we are only concerned
with the outcome of the number of heads occurring in the experiment we may associate
the number 0, 1, 2, and 3 to the number of head that may occur in a particular outcome.
To represent these values we may want to use a variable, a random variable. Random
since we are not definite about the values of our variable. We just know the possible
values it may take.
If we let X be the random variable that represents the number of tails in the
outcome then we have the following:
Given sample space S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}
Sample points TTT HTT, THT, TTH HHT, HTH, THH HHH
x 0 1 2 3
RECALL:
The sample space of a given experiment is the set of all possible outcomes. And
so if we define our random variable based on that sample space we can categorize a
random variable in the following manner: Discrete and Continuous.
First we define the following:
Discrete Sample Space - If a sample space contains a finite number of
possibilities or an unending sequence with as many elements as there whole numbers, it
is called a discrete sample space.
Continuous Sample Space – If a sample space contains an infinite number of
possibilities equal to a number of points on a line segment, it is called a continuous
sample space.
Example
Example
1. In an experiment of tossing a coin three times the following sample space is obtained:
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}. We define the random variable
X, as the number of head in an outcome. We summarize the result of the experiment and
identify the values of our random variable as well as the associated probability with each
value of the random variable.
Sample points TTT HTT, THT, TTH HHT, HTH, THH HHH
x 0 1 2 3
1 3 3 1
f(x)
8 8 8 8
2. Find the probability distribution given a random variable x defined as the sum of
the numbers when a pair of dice is tossed.
The following table illustrates all possible outcomes when a pair of dice is tossed and the
associated probability distribution for the
Outcomes for the Sum of Two Dice
x x x x x x
1, 1 2 2, 1 3 3, 1 4 4, 1 5 5, 1 6 6, 1 7
1, 2 3 2, 2 4 3, 2 5 4, 2 6 5, 2 7 6, 2 8
1, 3 4 2, 3 5 3, 3 6 4, 3 7 5, 3 8 6, 3 9
1, 4 5 2, 4 6 3, 4 7 4, 4 8 5, 4 9 6, 4 10
1, 5 6 2, 5 7 3, 5 8 4, 5 9 5, 5 10 6, 5 11
1, 6 7 2, 6 8 3, 6 9 4, 6 10 5, 6 11 6, 6 12
x 2 3 4 5 6 7 8 9 10 11 12
1 2 3 4 5 6 5 4 3 2 1
f(x)
36 36 36 36 36 36 36 36 36 36 36
F(x)
G(x)
n
2 E X 2 xi 2 f xi
i 1
Example
1. Find the mean and variance of H, where H is a random variable which
represents the number of automobiles that are used for official business purpose on any
given workday by a certain company. The probability distribution for H is as follows:
H 1 2 3
F(H) = P(H=h) 0.3 0.4 0.3
Solution
n
E H xi f xi 10.3 2 0.4 30.3 2
i 1
n
2 E H 2 hi 2 f hi 1 22 0.3 2 22 0.4 3 22 0.3 0.6
i 1
2. A shipment of 7 television sets contains 2 defectives were delivered by Tan
Electronic Company at a certain mall in Manila. If Dusit Hotel makes a random purchase
of 3 of the sets. If G is the number of defective sets purchased by the hotel, find the mean
and variance of G.
Solution
First we have to determine the probabilities associated with each value of the random
variable G.
g 0 1 2
f g
Exercises
1. In an experiment of selecting 3 persons to form a committee from a set of 4 boys and 3
girls. Let H represent the number of boys on the committee.
2. Find the number of expected Jazz records when 4 records are selected at random from
a collection consisting of 5 jazz records, 2 classical records, and 3 polka records.
3. A coin is tossed three times. Let Y be the random variable that represents the number
of tails. Find the probability distribution of Y. Find the mean and variance of the
probability distribution of the random variable Y.
4. In an experiment of tossing a dice first and then tossing a coin, where the coin is tossed
once if the dice resulted in an even number and twice if the dice resulted to an odd
number. Find the probability distribution of the random variable Y, where Y represents
the number or heads in the outcome.
Binomial Distribution
A binomial experiment has the following properties:
The experiment consists of n repeated trials
Each trial results in an outcome that may be classified as a success or a failure
The probability of a success, denoted by p, remains constant from trial to trial.
The repeated trials are independent.
Usually if the first 3 conditions are already met, the last condition is presumably a
forgone conclusion. For a random variable X to have a binomial distribution, the
conditions of a binomial experiment must be satisfied.
The number x of success of a random variable X in n trials of a binomials experiment is
called a binomial random variable.
If a binomial experiment can result in a success with probability p and the failure
with the probability q = 1- p, then the probability distribution of the binomial random
variable X, the number of success in n independent trials is
n
b x; n; p p x q n x for x 0 ,1,2 ,3,..., n
x
n n!
Note: n C x
x n x ! x!
The mean and variance of the binomial distribution b x; n; p are given by the formulas
np and 2 npq
Image taken from the book “The Cartoon Guide to Statistics by Larry
Solution:
Suppose X is the random variable representing the number of 2’s occurring in tossing a
dice 5 times.
Check if the conditions of the binomial experiment are satisfied.
The experiment consists of n repeated trials
There are 5 repeated trials of tossing a dice
Each trial results in an outcome that may be classified as a succe ss or a failure
The outcome can be classified as a success when the result of the dice is 2 and a
failure if the outcome is not 2.
The probability of a success, denoted by p, remains constant from trial to trial.
1
The probability of a success on each of the 5 trials is and the probability of failure
6
5
is .
6
The repeated trials are independent.
We conclude that the trials are independent from one another since the result of the
first toss does not affect of the resul t of the next toss.
1 5
Thus we have, n = 5, q , q and x 3 .
6 6
3 53
1 n x n x 5 1 5
b x 3; n 5, p p q 0.032
6 x 3 6 6
2. A survey in Cavite indicated that nine out of ten cars carry automobile liability
insurance. If 4 cars in Cavite are involved in accidents, what is the probability that:
Solution
If we consider the random variable X to be present the number of automobiles carrying
liability insurance out of the 4 cars involved in an accident. Checking if the conditions of
the binomial experiment are satisfied, we have the following
The experiment consists of n repeated trials
The repeated trial can than can be considered is the checking of the automobile if it
has a liability insurance. Thus, there are 4 repeated inspections whether the 4 accidents of
automobiles carries with them a liability insurance.
Each trial results in an outcome that may be classified as a success or a failure
The inspection can result to a success if the automobile associated in the accident
carries a liability insurance otherwise the result is considered a failure.
The probability of a success, denoted by p, remains constant from trial to trial.
9
The probability of a success on each of the 4 trials is and the probability of
10
1
failure is .
10
The repeated trials are independent.
We conclude that the trials are independent from one another since the result of the
first inspection does not affect of the result of the next inspection.
1 9
Based from the information given, we have, n = 4, q , p .
10 10
Hypergeometric Distribution
N
k
n
The mean and variance of the Hypergeometric distribution h x; N , n , k are given by the
formulas
nk 2 N n nk N k
and
N N 1 N N
Example
1. If 5 cards are dealt from a standard deck of 52 playing cards what is the
probability that 3 will be hearts?
Solution:
Clearly we can label all heart cards as our success. Hence, k = 13, since there are 13 heart
cards. And since we are selecting 5 cards from the deck our sample size n = 5. Thus we
have the following:
k N k 13 39
x n x 3 2
h x 3; N 52 , n 5, k 13 0.0815
N 52
n 5
2. If 7 cards are dealt from an ordinary deck of 52 playing cards, what is the
probability that
a) Exactly 2 of them will be face cards?
b) At least 1 of them will be a queen?
Poisson Distribution
Example
1. The average number of days school is closed due to floods during the rainy season
in a city in Pampanga is 4. What is the probability that the schools in this particular city
in Pampanga will close for 6 days during a rainy season?
Solution:
4
e 46
p x 6; 4 0.1042
6!
2. The average number of dagang bukid per acre in a 5-acre rice field in Baguio is
estimated to be 10. Find the probability that a given acre contains more than 3 dagang
bukid.
Solution:
To find the probability that a given acre contains more than 3 dagang bukid, we
need to find the probability of its complement, since it is easier to find. And just use the
theorem on probabilities for complementary events. Suppose X is our random variable
representing the number of dagang bukid in a 2 acre rice field in Baguio. Thus we have
the following: P x 3 1 P x 3
EXERCISES
1 On the average, the intersection of Taft Avenue and Buendia results in 3 traffic
accidents per month. What is the probability that in any given month at this intersection
a. Exactly 5 accidents will occur?
b. Less than 3 accidents will occur?
c. At least 2 accidents will occur?
2 A basketball player’s shooting average is 0.25, what is the probability that he gets
exactly 1 shoot in his next 5 times attempt to shoot the ball
3 A multiple-choice quiz has 10 questions, each with 4 possible answers of which
only one is correct. What is the probability that sheers guess work yields from 3 to 6
correct answers?
4 If probability that a patient recovers from a leukemia is 0.4. And if 15 people are
known to have contracted this disease, what is the probability that
b) At least 13 survive
c) From 3 to 5 person survive
d) Exactly 5 survive.
5 In a Metro Manila, MMDA says that the need for money to by drugs is given as
the reason for 55% of all thefts. What is the probability that exactly 2 of the next 4 theft
cases-reported to MMDA resulted from the need for money to buy drugs?
6 A homeowner plants 5 bulbs selected at random from a box containing 5 rose
bulbs and 4 sampaguita bulbs. What is the probability that he planted 2 sampaguita bulbs
and 3 rose bulbs?
7. A professor in biology gave a multiple choice quiz with 10 items, each with 5
possible answers and only one of which is correct.
a) What is the probability that a student took the test my merely guessi ng and got a score
of 5?
b) What is the probability that merely guessing the answers from the test would yield a
score of 4 to 8?
c) What is the probability that merely guessing the answers from the test would yield a
score of at least 5?
8. What is the probability that a waiter will refuse to serve alcoholic drinks to only 2
minors if he randomly checks the Identification cards of 5 students from among the 10
students where 4 of which are not of legal age?
9. The average number of patients arriving a t the emergency room of Philippine General
Hospital (PGH) on Monday nights between 9:00 pm up to 12:00 midnight is 5. If we
assume that the patients arrive at random and independently, what is the probability that
less than 5 patients arrive at the emergency room of PGH on a Monday night from 9:00
pm to 12:00 midnight?
10 . A box contains10 red marbles and 15 blue marbles and 5 marbles are selected at
random from the box.
a) What is the probability of obtaining at least 3 red marbles?
b) What is the probability of obtaining at most 2 blue marbles?
c) What is the probability of obtaining exactly 1 red marble?
11. Suppose that the average number of earthquakes experienced in Mindanao is 10 per
year. What is the probability that on a given year, Mindanao will experience at least 5
earthquakes?
12. In certain computer shop, the typist commits on the average two typographical error
per page. What is the probability that the typist makes
a) 3 or more errors
b) at least 1 error
c) no errors
13. In Davao, the probability that a household has a Pomelo tree in their backyard is 0.35.
Find the probability that 4 out of the 10 randomly selected houses has a Pomelo tree in
their backyard.
14. Batanes is hit by 8 storms per year on the average. What is the probabili ty that on a
certain year, Batanes will be hit by at least 5 storms?
15. Warranty records show that the probability that a new car needs repair in the first 90
days is 0.10. If a sample of ten new cars is selected,
a. what is the probability that none needs a warranty repair?
b. what is the probability that at least 3 needs a warranty repair?
c. what is the probability that from 5 to 8 (inclusive) needs a warranty repair?
d. what is the probability that at most 6 needs a warranty repair?
16. The quality control manager of Mandy's Cookies is inspecting a batch of chocolate
chip cookies that has just been baked. If the production process is in control, the average
number of chip parts per cookie is 6.0. What is the probability that in any particular
cookie being inspected,
a. exactly 5 chip parts will be found?
b. more than 3 chip parts will be found?
c. less than 7 chip parts will be found?
5.3 NORMAL DISTRIBUTION
NOTE:
If X is a normal random variable with mean and variance 2 , then the
equation of the normal curve is
2
1 x
1
N x; , e 2 , for x , where
2
3.14159...and e 2.71828...
REMARK
It is difficult to compute for the probabilities of a normal random variable using
the above formula. However, another way of calculating such probabilities is through the
transformation of a normal random variable to its corresponding standard normal
random variable. By transforming a normal random variable to a standard normal
random variable we can now determine probabilities of the said random variable. Thus
we define the standard normal random variable and its distribution.
b) P17 X 21
Solution:
17 18 21 18
P 17 X 21 P Z
2 .5 2 .5
P 0.4 Z 1.2
P Z 1 . 2 P Z 0 . 4
0.8849 0.3446
0.5403
c) The value of k such that P X k 0.2578
Solution:
To find the value of k, we use the formula for transforming the random variable X
to a standard normal random variable that is;
k 18
P X k P Z 0.2578
2 .5
By referring to our standard normal table, we would find that the value of z is
0.65 such that the area under the curve or the probability is 0.2578. Thus,
k 18
0.65 which implies that k 0.65 * 2.5 18 16.18
2 .5
= -
Transforming X to Z we have the following:
X 45 50 5 X 62 50 12
Z1 0 .5 Z2 1 .5
10 10 10 10
Thus we have,
P45 x 62 P 0.5 Z 1.2 PZ 1.2 PZ 0.5
0.8849 0.3085
0.5764
Exercises
1. Given a normally distributed random variable X with mean 18 and standard deviation
of 2.5, find the value of k such that P X k 0.1539
2. A certain type of storage battery last on the average 3.0 years, with a standard
deviation of 0.5 years. Assuming that the battery lives are normally distributed, find the
probability that a given battery will last less than 2.3 years.
3. An electrical firm manufactures light bulbs that have a length of life that is normally
distributed with mean equal to 800 hours and a standard deviation of 40 hours. Find the
probability that a bulb burns between 778 and 834 hours.
4. If the average height of miniature poodles is 30 centimeters, with a standard deviation
of 4.1 cm, what percentage of miniature poodles exceeds 35 cm in height, assuming that
the height follows a normal distribution and can be measured to any desired degree of
accuracy?
5. The quality grade-point averages of 300 college freshmen follow approximately a
normal distribution with a mean of 2.1 and a standard deviation of 0.8. How many of
these freshmen would you expect to have a score between 2.5 and 3.5 inclusive if the
point averages are computed to the nearest tenth?
6. A set of final examination grades in an introductory statistics course was found to be
normally distributed, with a mean of 73 and a variance of 64.
a. What is the probability of getting a grade of 91 or less in this exam?
b. What percentage of students scored between 81 and 89?
c. Only 5% of the students taking the test scored higher than what grade?
7. Plastic bags used for packaging produce re manufactured so that the breaking strength
of the bag is normally distributed with a mean of 5 pounds per square inch and a standard
deviation of 1.5 pounds per square inch.
a. What proportion of the bags produced have a mean breaking strength of between 5
and 5.5 pounds per square inch?
b. What is the probability that a randomly selected bag will have a mean breaking
strength of at least 6 pounds per square inch?
c. What percentage of the bags have a mean breaking strength of less than 4.17
pound per square inch?
d. Between what two values symmetrically distributed around the mean will 95% of
the breaking strengths fall?
8. If we know that the length of time it takes a college student to find a parking spot in
the university parking lot follows a normal distribution with a mean of 3.5 minutes and a
standard deviation of 1 minute, find the probability that if we select 36 randomly
selected college students, the average time it would take for them to find a parking spot is
a) less than 3.2 minutes?
b) between 3.4 and 3.7 minutes?
c) more than 3.8 minutes?
Summary
1. A random variable is defined to be a function whose value is a real number determined
by each element in the sample space is called a random variable.
2. A Discrete Random Variable is a random variable which is defined on a discrete sample
space while a Continuous Random Variable is a random variable defined on a
continuous sample space.
3. Some of the discrete probability distributions are the following: Binomial distribution,
Hypergeometric distribution, and Poisson distribution.
4. Properties of the binomial experiment
The experiment consists of n repeated trials
Each trial results in an outcome that may be classified as a success or a failure
The probability of a success, denoted by p, remains constant from trial to trial.
The repeated trials are independent.
5. The most widely used continuous distribution is the normal distribution. However
calculation of probabilities in this type of distribution is difficult to derive even with the
use of computers. For this reason it is necessary to transform the random variable into a
standardized random variable, that is, standard normal random variable.
Facts and Figures in Statistics
Cartoon illustration taken from the book “Cartoon Guide to Statistics” by Larry Cognick and Woollcott
Smith
The central limit theorem explains why the normal distribution is the most widely
used distribution. It is applicable to the stock market fluctuations, students’ grades,
price of canned goods, weight of people in a city, amount of mercury in a river, thus
practically everywhere. For instance, the price of canned goods are influenced by the
price of gasoline, price of tin can used for packing, labor cost in producing the
goods, type of product to be placed in the canned good, location of the factory that
manufactures the canned goods, etc. These are all unrelated factors that influence
the price of the canned goods but when considered together, the effect you’ll get is a
normal distribution
Chapter Review
Write the letter that corresponds to the correct answer.
For #’s 1-4, given the following probability distribution of a random variable x
x 0 1 2 3
f(x) 0.23 0.25 0.41 0.11
5. In an experiment where the probability of a success is 0.3, if you are interested in the
probability of 2 successes out of 5 trials, the correct probability is
a) 0.0774. b) 0.1600. c) 0.2613. d) 0.0016.
16. There are 18 toys in a basket, of which 10 are cars and 8 are balls. A child randomly
picks 3 toys without replacement. Let X be a random variable, the number of balls
selected. What is the distribution of X?
a. Binomial c. Normal
b. Poisson d. Hypergeometric
17. The basketball player Shaq makes 45% of the free throws he tries. Find the
probability that in the next 4 throws, he will make exactly 3 hits?
a. 0.2 b. 0.3 c. 0.4 d. 0.5
18. In DLSU, there are 9 candidates from 2 political parties, 5 from TAPAT and 4 from
SANTUGON, aiming for 6 Student Council positions. Assuming that all candidates are
equally qualified for the positions, find the probability that 3 TAPAT candidates and 3
SANTUGON candidates will be elected for these positions.
a. 0.2865 b. 0.3589 c. 0.4768 d. 0.5556
19. An average of 0.8 accident per day occurs in a certain city. What is the probability
that no accident will occur in this city on given day?
a. 0.4493 b. 0.3980 c. 0.25 d. 0
20. Suppose that a random variable X has a normal distribution with mean 40 and
standard deviation 5. What is the probability that X is below 30?
a. 0.0228 a. 0.1587 c. 0.8413 d. 0.9772