Probability Distributions
Probability Distributions
For example, in a random experiment of tossing 3 coins, we get the following outcomes
Sample Points HHH HHT HTH THH HTT THT TTH TTT
For random variable, we are not interested in a particular sample point/ outcome of a random
experiment but we are interested to the numerical description of the outcome.
Suppose we are interested only in the number of heads which appear for the above example;
we assign one of the numbers (0, 1, 2, 3) to each non-numerical outcome corresponding to
the number of heads appearing.
Sample Points HHH HHT HTH THH HTT THT TTH TTT
X 3 2 2 2 1 1 1 0
Such a numerical quantity, whose value is determined by the outcomes of the random
experiment is called a random variable or stochastic variable or chance variable.
OR
“A random variable is a real valued function that takes a defined value for every point in the
sample space.”
Example: Suppose two balls are selected at random from a bag consisting red and green balls.
• If R and G represent red and green balls respectively, then the elements of the
sample space and the corresponding values of the random variable X will be:
Sample Points RR RG GR GG
X 0 1 1 2
A Sample Space, which contains a finite no: of sample points, is called a Discrete Sample
Space. A random variable defined over a discrete sample space is known as
Discrete random variables represent countable data; e.g. no: of patients in a hospital, no: of
bomb blasts in a particular year, no of vehicles crossing Kotri Barrage on a certain day, etc.
If the elements in a Sample Space are uncountable infinite, the sample space is called a
Continuous Sample Space. A random variable defined over a continuous sample space is
known as Continuous Random Variable.
Continuous random variables represent measurable data; e.g. weight, distance, temperature
height, pressure, etc.
A discrete probability distribution may take the form of a table or a formula listing all
possible values that a discrete random variable can take on along with the associated
probabilities.
The probability function for a discrete random variable must possesses following properties:
Sample Points HHH HHT HTH THH HTT THT TTH TTT
X 3 2 2 2 1 1 1 0
- Assume equal probabilities for the possible outcomes in the above table, then
P(HHH) = 1/8 , P(HHT)= 1/8, ………………. P(TTT)=1/8
- For instance, the probability that no head occurs is 1/8 i.e. P(X=0)=P(TTT) = 1/8
The possible values x of X and their probabilities P(x) are shown below:
X 0 1 2 3
P(x) 1/8 3/8 3/8 1/8
Example: Find the probability distribution of the sum of numbers when a pair of fair dice is
rolled.
Solution: Pair of dice can result in (6) (6) =36 outcomes, each of with probability 1/36.
In this random experiment, the sum of numbers are the values 2,3,4, …, 12 assumed by the
variable X, so in the probability distribution, we write the possible values of random variable
X and their corresponding probabilities:
Question: Find a formula for the probability distribution of the number of boys in a family
with four children assuming equal probabilities for boys and girls.
4
In general, x boys and 4 − x girls can occur in ways,
x
Where x can be 0, 1,2,3,4
Suppose the administration of a sugar testing clinic wants to know that how many patients to
come tomorrow for the test.
This is the expected value regarding the patients to come for test, but not sure.
Expected Value is a weighted average of the outcomes someone expects in future. Expected
Value weights each possible outcome by the frequency with which it is expected to occur.
To obtain the expected value of a discrete random variable, multiply each value that a
random variable can assume by the probability of occurrence of that value and then sum these
products.
Probability that the
Number of
random variable will
Patients Tested (1) (2)
take on these values
(1)
(2)
100 0.01 1.00
101 0.02 2.02
102 0.03 3.06
103 0.05 5.15
104 0.06 6.24
105 0.07 7.35
106 0.09 9.54
107 0.10 10.70
108 0.12 12.96
109 0.11 11.99
110 0.09 9.90
111 0.08 8.88
112 0.06 6.72
113 0.05 5.65
114 0.04 4.56
115 0.02 2.30
Expected Value → 108.02
Expected value of the random variable is 108.02 and noted that it can have significant value
to decision makers.
Suppose a fruit and vegetable wholesaler who sells tomatoes. This product has a very limited
useful life. If not sold on day of delivery, it is worthless. In a case, it costs Rs. 20 per peti and
selling price is Rs. 50 per peti. He has to decide how many peti of tomatoes per day he may
purchase for sale in order to get optimal result.
He can not specify the number of peti customers will call for on any one day, but his analysis
of past records has produced the information in the following table:
Optimal Solution: is that action, which can minimize expected losses or maximize expected
profits.
Obsolescence losses, caused by stocking too much tomato on any one day and having to
throw it (spoiled tomatoes) away the next day
Opportunity losses, caused by being out of tomatoes any time that customer’s call for them.
Examining each possible stock action, we can compute the expected loss.
On the basis of this analysis, he may decide to stock 12 peti of tomatoes each day, because at
this point the expected loss is minimized at Rs. 17.50.
- This problem can be solved by taking an alternative approach, that is, maximizing
expected gain (Rs. 50 – Rs. 20) instead of minimizing expected loss.
- The answer, 12 peti have been the same.
Solution:
The probability of each value of X, the number of persons per household, is computed as the relative frequency.
We divide the frequency for each value of X by the total number of households, producing the following
pprobability distribution.
The probability that a randomly selected household has four or more persons is
P(X ≥ 4) = P(4) + P(5) + P(6) + P(7 or more)
= .140 + .062 + .023 + .012 = .237
Question: Find the mean, variance, and standard deviation for the population of the
number of persons per household:
A mutual fund salesperson has arranged to call on three people tomorrow. Based on past
experience, the salesperson knows there is a 20% chance of closing a sale on each call.
Determine the probability distribution of the number of sales the salesperson will make.
The tree exhibits each of the eight possible outcomes and their probabilities. We see that
there is one outcome that represents no sales, and its probability is P(0) = .512.
There are three outcomes representing one sale, each with probability .128, so we add these
probabilities.
Thus, P(1) = .128 + .128 + .128 = 3(.128) = .384
The probability of two sales is computed similarly: P(X) = 3(.032) = .096
There is one outcome where there are three sales: P(3) = .008
7.4 The mark on a statistics exam that consists of 100 multiple-choice questions is a random variable.
a. What are the possible values of this random variable?
b. Are the values countable? Explain.
c. Is there a finite number of values? Explain.
d. Is the random variable discrete or continuous? Explain.
7.8 Using historical records, the personnel manager of a plant has determined the probability
distribution of X, the number of employees absent per day. It is
7.21 Refer to Exercise 7.20. If the pizzeria makes a profit of $3 per pizza, determine the mean
and variance of the profits per student.
7.26 A survey of Amazon.com shoppers reveals the following probability distribution of the
number of books purchased per hit.
x 0 1 2 3 4 5 6 7
P(x) .35 .25 .20 .08 .06 .03 .02 .01
a. What is the probability that an Amazon.com visitor will buy four books?
b. What is the probability that an Amazon.com visitor will buy eight books?
c. What is the probability that an Amazon.com visitor will not buy any books?
d. What is the probability that an Amazon.com visitor will buy at least one book?
7.34 You have been given the choice of receiving $500 in cash or receiving a gold coin that
has a face value of $100. However, the actual value of the gold coin depends on its gold
content. You are told that the coin has a 40% probability of being worth $400, a 30%
probability of being worth $900, and a 30% probability of being worth its face value.
Basing your decision on expected value, should you choose the coin?
7.38 After an analysis of incoming faxes, the manager of an accounting firm determined the
probability distribution of the number of pages per facsimile as follows:
x 1 2 3 4 5 6 7
P(x) .05 .12 .20 .30 .15 .10 .08
Compute the mean and variance of the number of pages per fax.