CH-5 Stat I
CH-5 Stat I
PROBABILITY DISTRIBUTION
Probability Distribution
A probability distribution shows the possible outcomes of an experiment and the probability of
each of these outcomes. That is, probability distribution is a complete list of all possible of
A formula giving the probability of the different values of the random variable X for:
Discrete variable is the probability massy function (pmf) and is usually denoted by p(x). If X is
a discrete random variable taking at most a countable infinite number of values x1, x2, …, then
P (xi) = P(X = xi): i= 1, 2 …is called the probability mass function of random variable X. The
set of ordered pairs {xi, P (xi)} i= 1, 2 … gives the probability distribution of the random
variable X.
Continuous variable is the probability density function (pdf) and is usually denoted by f(x). A
random variable, X, is said to be a continuous random variable if there is a non–negative
function
The probability distribution for a random variable describes how the probabilities are
distributed over the values of the random variable. For a discrete random variable X, the
probability function is denoted by P(X). The probability function provides the probability for
Definition:
random variable.
Page 1 of 20
Example 1: Construct a probability distribution for the number of heads in tossing two fair
Let the random variable X denotes the ‘number of heads’. We then use the probability function
P(X) to assign probability to each out come consequently; the probability distribution is given
below:
Outcome, X 0 1 2
Probability, P(X) ¼ ½ ¼
The probability distribution shows that the probability that the random variable can assume the
value 0 is ¼, the value 1 is ½ and the value 2 is ¼. Note that the sum of these probabilities is 1.
Example 2: The number of mistakes a typist made in ten days of assessment is shown in the
following table.
No of mistakes 2 3 4 5
No of days 1 4 3 2
Solution:
a) In Constructing the probability distribution, our random variable assumes a value for
the number of mistakes the typist committed. Let the variable X denotes this random
variable. Then, we assign a probability for each of the number of days with respect to
Page 2 of 20
No of mistakes, X 2 3 4 5
coordinate plane. The value of the random variable X is shown on the horizontal axis
(x-axis) and the probability that the random variable X assumes these values is shown
on the vertical axis (y-axis). For the probability distribution in the example, the
random variable X, which is the number of mistakes the typist committed is labeled on
Y axis
0.4
P(X)
Probability 0.3
0.2
0.1
This section discusses discrete probability distribution. Continuous probability distribution will
Page 3 of 20
In the construction of the probability distribution for a discrete random variable, the following
The sum of the probabilities of all the events in the sample space must equal 1.
i.e. P(x) =1
The probability of each event in the sample space must be between or
equal to 0 and 1.
i.e. 0 P(x) 1
For instance, in the above example, these two conditions are satisfied since
P(X) = P(2) + P(3) + P(4) + P(5) = 0.1+ 0.4 + 0.3 + 0.2 = 1 and
each of these probabilities is greater than or equal to 0 and less than or equal to 1.
For some discrete random variables, the probability distribution can be given as a formula that
Solution:
The outcome x assumes the values 0, 2 and 3
Out come, x 0 2 3
Page 4 of 20
EXPECTED VALUE AND VARIANCE OF A PROBABILITY
DISTRIBUTION
Expected Value
The expected value, or mean, of a random variable is a measure of the central location for the
random variable. It is denoted by E(x) or . The mathematical expression for the expected
x . P(x )
n
E (x) = i i
i1
Where x1, x2,-------,xn are the outcomes and P(x1), P(x2)…P(xn) are the
corresponding probabilities.
The above formula shows that in order to compute the expected value of a discrete random
variable, we must multiply each value of the random variable by the corresponding probability
Variance
While the expected value provides the mean value for the random variable, we often need a
measure of dispersion, or variability, for the random variable just as we need variance in block
5 to summarize the dispersion in a data set. The mathematical expression for the variance of a
Page 5 of 20
Variance of a discrete probability distribution, σ 2
ó ó
2
and the standard deviation is
Example 1: If three fair coins are tossed, find the expected number of heads that will occur and
Solution:
Begin by constructing the probability distribution for the number of heads in tossing the three
coins.
No of heads, x 0 1 2 3
Then,
The theoretical mean = 1.5 implies that if the experiment is done as many times as possible,
= (x1 - )2 · P(x1) + (x2 - )2 · P(x2) + (x3 - )2 · P(x3) + (x4 - )2 ·P(x4)
2 = 0.5
Page 6 of 20
Example 2: One thousand tickets are sold at $1 each for a color television valued at $350.
Solution:
When a person purchases one ticket, he has two chances, to lose $1 or gain $349.
Hence,
Or,
i.e. The average loss is $0.65 for each of the 1000 ticket holders.
The Binomial Probability Distribution is a discrete probability distribution that has many
a) Each trial can have only two outcomes or outcomes that can be reduced
d) The probability of a success must remain the same for each trial.
Page 7 of 20
Definition:
A probability distribution showing the outcomes of a Binomial experiment along with the
Px n! . px .qnx
nx!x!
Where x is the number of successes
Note: q = 1 - p and 0 x n
Example 1: Consider the experiment of tossing a coin three times. Show that it is a binomial
Solution:
iii) The probability of success, getting a head, does not change from trial to trial.
iv) The trials or tosses are independent, since the outcome of any trial is not affected by
Now, to find the probability of getting two heads, let p denotes the probability of getting a head
on a single toss.
n = 3, x=2
Page 8 of 20
Px n! . px .qnx
n x!x!
3! . 12 . 1 32 3! . 1. 1
P(2)
32!2! 2 2 1!2! 4 2
=
3 = 0.375
8
Example 2: A new drug is effective 60% of the time. What is the probability that in a random
Solution:
This is a Binomial experiment as the points of the experiment are satisfied. Define ‘effective’
Required p (2) = ?
We recognized that it is tedious to calculate probabilities using the binomial formula when n is
a large number. For such cases, you may use the binomial probability distribution table that is
Let’s demonstrate how to read the table with an illustrative example. Consider the experiment
of tossing a fair coin 4 times. What is the probability of getting three heads?
Under the column `n’ choose the number 4, proceed horizontally and correspond it with x=3,
Page 9 of 20
Mean, and Variance of a Probability Distribution
Definition
The mean, variance and standard deviation of a variable that has the Binomial
Mean =n·p
Variance 2 = n·p·q
Standard deviation = npq
Example1: A coin is tossed four times. Find the mean, variance and SD of the number of
Solution:
=n.p=4.½=2
2 = n . p . q = 4 . 1/2 . 1/2 = 1
= 2 = 1= 1
Example 2: A die is rolled 240 times. Find the mean, variance and standard deviation for the
Solution:
n = 240,P=1/6
= n . p = 240(1/6) = 40
2 = n . p . q = (24)(1/6)(5/6) 33.33
= 33.33 5.77
Page 10 of 20
B. THE POISSON PROBABILITY DISTRIBUTION
A discrete probability distribution that is useful when n is large and p is small and when the
independent variables occur over a period of time is called the Poisson probability distribution.
i) The probability of an occurrence is the same for any two intervals of equal length.
Px; e .
x
x!
Where P(x, λ) is the probability of x occurrences in an interval of time, volume,
area etc for a variable, λ denotes the mean number of occurrences and e 2.7183
Example1: Past police records indicate a mean of five accidents per month while
a) Exactly 3 accidents.
3! 6
= 0.1404
Page 11 of 20
b) Fewer than 2 accidents comprise 0 and 1 accident during any month.
0 1
P0 P1 5 . 2.7183 5 2.7183
5 5
0! 1!
0.0674 + 0.3370
0.4044
Remark:- Although the above probability was determined by evaluating the probability
function, it is often easier to refer to the table for the Poisson probability distribution.
These table provides probabilities for specific values of x and . We have included the
For convenience, in example 1a, = 5 and x = 3. In the first column of the table choose
x = 3 and correspond it with =5, the intersection of these two numbers gives you the
manuscript, find the probability that a given page contains exactly 3 errors.
200 0.4
500
Or, 0.4 error per page.
Since x = 3,
x! 3!
Thus, there is less than a 1% probability that a give page contains less than 3 errors.
Page 12 of 20
SUMMARY
This unit discussed the definitions of random variable as a variable assigned to a random
probability experiment and where the probability distribution of such experiment attains the
summarized table comprising the random variable together with the probability of occurrence
of the events. A random variable can be discrete or continuous, depending on the values it
assumes.
outcomes of a random variable with the respective probabilities .The two widely used discrete
probability distributions are the Binomial and Poisson probability distributions and each of
INTRODUCTION
So far, we have been concerned with discrete probability distributions. We shall turn to cases
in which the variable can take on any value within a given range and in which the probability
distribution is continuous.
mathematician and astronomer Karl Gauss. In honor of his work, the normal probability
There are two basic reasons why the normal distribution occupies such a prominent place in
statistics. First, it has some properties that make it applicable to a great many situations in
which it is necessary to make inferences by taking samples. Second, the normal distribution
comes close to fitting the actual observed frequency distributions of many phenomena,
Page 13 of 20
THE NORMAL PROBABILITY DISTRIBUTION
Many contentious variables such as height and weight have distributions that are bell-shaped
and are called approximately normally distributed variables, deriving the most important
probability distribution used to describe a continuous random variable called the normal
probability distribution.
distribution of a variable.
The form or shape of the normal probability distribution is shown below.
The shape and position of the normal distribution curve depends on two parameters, the mean
and the standard deviation. Each normally distributed variable will have its own normal
distribution curve.
The mean, median and mode are equal and are located at the center of the
distribution.
The total area under the normal distribution curve is equal to 1, or 100%
The area under the normal curve that lies within one SD of the mean is
approximately 0.68 or 68%, within two SD’s about 0.95 or 95% and
Page 14 of 20
34.13% 34.13%
13.59% 13.59%
2.28% 2.28%
about 68%
about 95%
about 99.7%
f x 1 e 2 2 3.14159
2
Where = mean e 2.7183
= Standard deviation
A random variable that has a normal distribution with a mean of 0 and a standard deviation of
Recall that the standard score (z-score) of a value is the number of standard deviations
All normally distributed variables can be transformed into the standard normal distributed
Page 15 of 20
𝑉𝑎𝑙𝑢𝑒 − 𝑚𝑒𝑎𝑛
𝑍=
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
Or,
Z X
The standard normal curve
As with other continuous random variables, probability calculations with any normal
probability distribution are made by computing areas under the graph of the probability density
function. Thus, to find the probability that a normal random variable lies within any specific
interval, we must, compute the area under the normal curve over that interval.
For the standard normal probability distribution, areas under the normal curve have been
computed and are available in tables that can be used in computing probabilities. The normal
For the solution of problems using the normal distribution, the following steps are used.
1. Draw a picture
Page 16 of 20
Example 1: Find the area under the normal curve between z=0 and z=2.34
Solution:
The standard normal curve
Example 2 : Find the area under the normal distribution curve between z = -1.93 and z = 2.35
Area = 0.4732 + 0.4906 = 0.9638 or 96.38%. Note that it is equivalent to say that the
probability of the z-value lying between z = -1.93 and z = 2.35 is 96.38%. This can also be
written as:
Example 3: Find the probability that the z-value of a normally distributed variable lies to the
left of 1.65
Solution
The probability that the z-value
Page 17 of 20
finding the area under the standard
Solution
P(z > 1.91) = area to the right of 0 area between 0 and 1.91.
i.e
P(Z > 1.91) = P(z > 0) - P(0 < z < 1.91) 0 1.91
= 0.5000 - 0.479
= 0.021 or 2.1%
The area under the normal curve is used to solve practical application problems such as finding
probabilities or percentages of values. In order to solve such problems you need only transform
the values of the variable into the z values and read the standard normal distribution table.
Example 1: The scores for an IQ test are normally distributed with a mean of 100 and a
standard deviation of 15. Find the percentage of IQ scores that will fall below 112.
Solution
Step 1: Draw a figure and represent the area
Corresponding to an IQ
Score 112.
𝑥− 112−100
𝑍 = = = 0.80
𝛿 115
Page 18 of 20
Step3: From the table,
P(z < 0.8) = P(z < 0) + P(0 < z < 0.8) = 0.5000 + 0.2881 = 0.7881
Example2: The monthly salaries of 2000 workers are normally distributed with a mean of birr
Z 600550 0.625
80
Approximately 1940 of the workers earn a monthly salary less than birr 700.
Example 3 A college desires to accept only the top 10% of all graduating seniors on the
basis of the results o a national placement test. The test has a mean of 500 and a standard
Solution:
The area is shown.
Let it be denoted by x
Page 19 of 20
From the table, the z – value that corresponds to the area 0.4000 is approximately 1.28.
Then, 1.28
x500 x 628
100
Hence the score 628 should be used as a cut –off score. Any student scoring below 628 should
not be admitted.
Page 20 of 20