Distribution of Probability
Distribution of Probability
n
U
4
4.2 INTRODUCTION
Suhani has two black sweaters and a white sweater in her cupboard. She takes out a sweater
at random, notes the colour and puts it back in the cupboard. She repeated the process once more
before making up her mind.
What shall be the sample space of the situation stated above?
Let us consider the sweaters as B1, B2 and W1
For the selection of two assignments,
the sample space is S = { B1B1, B1 B2, B2B1, B2B2, B1W1, B2W1, W1B1, W1B2, W1W1}
Clearly these draws are of a random experiment with random outcomes that cannot be predicted.
Let X represent the number of white sweaters drawn in this situation, in that case what can you
say about the value of X?
Here, X (B1B1) = X (B1 B2) = X (B2B1) = X( B2B2) = 0 as the sample element does not have any white
sweater.
Also, X(B1W1) = X(B2W1) = X(W1B1) = X(W1B2) = 1 as the sample element has one white sweater
And, X(W1W1) = 2 as the sample element has two white sweaters
X can take values 0, 1 or 2
Here, X is a function whose domain is the set of possible outcomes (or sample space) of a random
experiment. Also, the variable X take any real value, therefore, its co-domain is the set of real
numbers
In such a case X is considered as a random variable
Definition: A random variable is a real valued function whose domain is the sample space of
a random experiment
Let us consider an experiment of tossing a coin two times in succession.
Clearly the sample space of this experiment is S = {HH, HT, TH, TT}.
If X represents the number of heads obtained in this situation,
Example 1
Rajat is playing a game of rolling a die with his friends.
According to the game rules, he will win Rs 5 for
rolling an even number and for getting an odd digit on
the die, he looses 2. If X represents the amount of
money Rajat wins or loses. Show that X is a random
variable and also represent it as a function on the
sample space of the game play.
Here, the random variable is taking two distinct and countable (measurable) values.
Hence X in this case has distinct and countable outcomes with no number in between these
values, therefore it is a discrete random variable.
A wrist watch with only hour and minutes display shows time as 12:00, then
12:01, 12:02, and so on and there is no time shown in between. In this case the
random time change is distinct and countable. Therefore the change in time in this
case is discrete random variable
Each possible value of the discrete random variable can be associated with a non-
zero probability.
Probability Distribution 4.3
Whereas a wrist watch displaying the seconds count as well shows time
22:31:25 pm and 22:32:17 pm and the elapsed time in between as well. A random
variable whose value is obtained by measuring and it takes many values between
two values, is called a continuous random variable.
In other words, a continuous random variable is a random variable with a set
of possible infinite and uncountable values (known as the range).
+ 1 2 3 4 5 6
1 2 3 4 5 6 7
2 3 4 5 6 7 8
3 4 5 6 7 8 9
4 5 6 7 8 9 10
5 6 7 8 9 10 11
6 7 8 9 10 11 12
Clearly, X will take values 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 and 12 which are distinct and countable,
hence X is a discrete random variable in this case.
Now let us find the probability for each random outcome
X 2 3 4 5 6 7 8 9 10 11 12
1 2 3 4 5 6 5 4 3 2 1
P(X)
36 36 36 36 36 36 36 36 36 36 36
Table (i)
Observe that in table (i); for all possible values of the discrete random variable X, all elements of
the sample space are covered.
This table of possible outcomes and their respective probability is called Probability distribution
table for the given random variable X. A probability distribution table links every possible outcome
of the random experiment with the probability of the event to occur.
In a probability distribution table, the sum of all the probabilities is one. (Refer table (i) )
Example 2
A coin is tossed thrice and outcomes are recorded. Prepare the probability distribution table for the
number of heads.
Solution: Let a random variable X denote the number of heads in In a s
ingle
three throws of a die toss
P (he of a f
ad) = 1 air co
Here the sample space S = {HHH, HHT, HTH, THH, HTT, , in:
2 and P (ta
THT, TTH, TTT} il) = 1
2
P(X = 0) = =
P(X = 1) = 3 =
probability of obtaining two head and one tail i.e. P(HHT, HTH, THH) is denoted by
P(X = 2) = 3 =
P(X = 3) = =
the
ribution Therefore, the probability distribution table is:
st
ity di ilities
probabil e probab
In a of all
th
one xi 0 1 2 3
sum s always
i
1 3 3 1
P(xi) = pi
8 8 8 8
In a nutshell, the mathematical expectation, also known as expected value for a random variable X
is the summation of product of all possible values for the given random variable X and their
respective probabilities.
Example 3
A coin is tossed twice and outcomes are recorded. Prepare the probability distribution table for
random variable X which represents the number of heads in the experiment. Also calculate the
mathematical expectation of X.
Solution: Let a random variable X denote the number of heads in two throws of a die
the sample space S = { HH, HT, TH,TT}
xi 0 1 2
1 1 1 1
P(xi) = pi 2
4 4 2 4
1 1 1 1 1
xi p i 0 0 1 2
4 2 2 4 2
Note that = 1
n
Therefore, E(X) = i=1
( xi pi ) = 0 + + = 1
Example 4
In a manufacturing unit inspection, from a lot of 20 baskets which include 6 defectives, a sample
of 2 baskets is drawn at random without replacement. Prepare the probability distribution of the
number of defective baskets. Also calculate E(X) for the random variable X.
Solution:
As X denotes the number of defective baskets in a draw of 2 without replacement
X = 0, 1 and 2
Therefore, in a draw of two baskets;
X 0 1 2
14 13 182 14 6 168 6 5 30
P(xi) = pi 2
20 19 380 20 19 380 20 19 380
Note that = 1
n
Therefore, E(X) = i=1
( xi pi ) = 0 + =
E(X) of a random variable X, is the theoretical mean of X. It is not based on sample data
but on the distribution of it
So, the mean expectation value is a parameter and not a statistic.
Sometimes it is also represented by use of Greek letter mu () as well.
X 1 2 3 4
1 3 2 1
P(xi) = pi
7 7 7 7
1 1 3 6 2 6 1 4
xi pi 1× 2 3 4
7 7 7 7 7 7 7 7
n 1 6 6 4 17
Here, E(X) = i=1
( xi pi ) =
7 7 7 7 7
2.71
Y -1 0 4 5
1 2 3 1
P(yi) = pi
7 7 7 7
1 1 2 3 12 1 5
yi pi –1 0 0 4 5
7 7 7 7 7 7 7
n
Here, E(Y) = i=1
( xi pi ) =
Clearly the random variables X and Y with different probability distributions can have equal
means. In such cases, we need a technique to check variability and extent to which the values of
random variable are spread out.
Variance enables us to study the variability of random variable from the mean expectation
When there is a narrower range among the sample elements in a given sample space; that means
that the value of the random variable is close to mean expectation and hence the variance is less
And, when there is wide range among the sample elements, it means that the value of the random
variable is far from the mean expectation and thus the variance is high.
Basically, the variance measures the average degree to which each sample element differs from
the mean of the sample space
In a probability distribution of a discrete random variable X, the variance denoted by Var (X); is the
summation of the product of the squared deviations of xi from the mean E(X) and the corresponding
probabilities pi.
Definition: Let X be a discrete random variable whose possible finite values x1, x2, x3,..., xn occur
with probabilities p1, p2, p3, …,pn respectively.
n
2
In other words, Var (X) = E(X2) – [E(X)]2, where E (X2) = x
i 1
i pi –
x Var (X )
Example 5
A class XII has 20 students whose marks (out of 30) are 14, 17, 25, 14, 21, 17, 17, 19, 18, 26, 18,
17, 17, 26, 19, 21, 21, 25, 14 and 19 years. If random variable X denotes the marks of a selected
student given that the probability of each student to be selected is equally likely.
a) Prepare the probability distribution of the random variable X.
b) Find mean, variance and standard deviation of X.
Solution: Based on the given data, let us prepare a table
Marks 14 17 18 19 21 25 26
frequency 3 5 2 3 3 2 2
Marks = xi 14 17 18 19 21 25 26
Frequency = fi 3 5 2 3 3 2 2
3 5 2 3 3 2 2
P(xi) = pi
20 20 20 20 20 20 20
42 85 36 57 63 50 52
xi pi
20 20 20 20 20 20 20
Here, E(X) = =
And, =
= 13.9
Example 6
Let X denote the number of hours a person watches television during a randomly selected day. The
probability that X can take the values , has the following form, where k is some unknown constant.
0.2, if xi 0
kx , if xi 1 or 2
P(X = xi) = i
k(5 xi ), if xi 3
0 otherwise
xi 0 1 2 3
P(xi) = pi 0.2 k 2k 2k
a) As = 1
0.2 + k + 2k + 2k = 1
5k = 0.8
k =
d) Probability that the person will watch at most two hours of television
= P (xi = 0, 1, 2)
e) xi 0 1 2 3
1 4 8 8
P(xi) = pi 0.2 =
5 25 25 25
4 16 24
xi p i 0
25 25 25
4 32 72
xi 2 p i 0
25 25 25
n
E(X) = i 1
( xi pi ) = 0 +
n 2
f) x
i 1 i
pi = 0+
n 2 2
n 2
108 44 108 1936
Var(X)
i 1
xi pi –
i 1
xi p i
–
25 25 25
–
625
here p = and q = 1 p=1 , Since probability of success remains same in all the trials,
hence we can say these are binomial trials.
When the drawing is done without replacement, the probability of success (i.e., drawing a defective
Clearly, the probability of success is not same for all trials, hence the trials in example 4 are not
Bernoulli trials.
Probability of ‘r’ successes in ‘n’ Bernoulli trials is given by:
−
! −
P ( ‘r’ successes) = =
! ( − )!
Where n = number of trials
r = number of successful trials = 0, 1, 2, 3, …, n
p = probability of a success in a trial
q = probability of a failure in a trial
And, p + q = 1
Clearly, P (‘r’ successes), is the (r + 1)th term in the binomial expansion of (q + p)n .
The probability distribution of number of successes for a random variable X can be written as:
X = ri 0 1 2 3 … r … n
0 −0 1 −1 2 −2 3 −3 − −
P(ri) = pi 0 1 2 3 … …
As p + q = 1 q = 1 =
X = ri 0 1 2 3 4
0 4–0 1 4–1 2 4– 2 3 4– 3 4 4–4
2 1 2 1 2 1 2 1 2 1
P(ri) = pi C04 C14 C24 C34 C 44
3 3 3 3 3 3 3 3 3 3
1 8 24 32 16
= = = = =
81 81 81 81 81
Now let us calculate the mean expectation, Variance and Standard Deviation for example 7
n
Recall that E(X) = i 1
( xi pi ) = 0 = 2.67
n 2
2
n
Var(X)
i 1
xi pi –
i 1
xi pi
= 0.89
Example 8
If a fair coin is tossed 9 times, find the probability of
a) exactly five tails
b) At least five tails
c) At most five tails
Here, n = 9, p = and q = 1 – p =
As P ( ‘r’ successes) =
= 1 -
131 382
= 1
512 512
Example 9
In a manufacturing unit inspection, from a lot of 20 baskets which include 6 defectives, a sample
of 2 baskets is drawn at random with replacement. Prepare the binomial distribution of the number
of defective baskets. Also find E(X) and Var(X) for the random variable X
Solution: Here, X denotes the number of defective baskets in a draw of 2 baskets with replacement
Clearly, the trials are Bernoulli trials
And X = 0, 1 and 2
Also number of trials = n = 2
If drawing a defective basket is considered a success,
then p = q =
X 0 1 2
ri No defective baskets One defective basket Two defective baskets
0 2–0 1 2–1 2 2–2
3 7 3 7 3 7
P(ri) = pi C02 C12 C22
10 10 10 10 10 10
49 42 9
= = =
100 100 100
The probability that Rohit will hit a shooting target is . While preparing
for an international shooting competition, Rohit aims to achieve the
probability of hitting the target at least once to be 0.99. What is the
minimum number of chances must he shoot to attain this probability?
Solution: Let the number of chances Rohit shoots the target be n
Here, the trials are Bernoulli with p be the probability of success to
hit the target = and q be the probability of failure to hit the target be 1 – p =
As Rohit wants to hit the target at least once with the probability of 0.99
P(r = 1,2,3,…) 0.99
1 – P(r = 0) 0.99
1 – 0.99
1 – 0.99
0.01
100
As 35 100 n 5
Rohit should hit the target at least 5 times to achieve his target.
Example 11
Sonal and Anannya are playing a game by throwing a die alternatively till one of them gets a ‘1’
and wins the game. Find their respective probabilities of winning, if Sonal starts first
Solution: Clearly the trials are Bernoulli’s with n
Getting a 1 on a single throw of the die is considered a success
p = and q = 1 – p =
As ,
Example 12
A die is thrown again and again until three 5s’ are obtained. Find the probability of obtaining the
third 5 in the seventh throw of the die
Solution: Clearly the trials are Bernoulli with n = 6
For finding the probability of third six in the seventh throw of the die, we know that there must
have been two 5s’ on previous six throws
P (third 5 on seventh throw of die) = P (two 5s on six throws) P (a 5 on the next single throw
of die)
−
( = )= ( )=
!
Where e is Euler’s number (e = 2.71828…)
‘k’ is the number of occurrences of the event such that k = 0, 1,2, …
And = E(X) = Var(X), is a positive real number
With existence conditions:
−
1. ∑∞=0 =1
!
2. for k = 0, 1, 2, …
A restaurant is doing booming business. It was recorded that during their peak business time,
an average of 30 customers per hour arrive at the restaurant. Can we develop a Poisson probability
distribution model for the arrivals of customers, if 30 customers arrive in an interval of 1 hour on
an average?
You might say that arrival average is 1 customer every 2 minutes
But the thing to remember here is that arrival time of each customer is random and hence this
approach is inappropriate
Let us try another approach and divide each one-minute interval along an interval of an hour
so that each customer arrival is equally-likely
During each minute, let us consider one customer’s arrives in the middle of that time interval.
As probability for a customer to arrive is this is going to be a binomial distribution B (60, )
But then again, we cannot assume that the customers are arriving at uniform pace and at regular
time intervals
What if we divide the time interval in seconds and consider that probability for a customer to
(3600, )
c) With the width = of an hour with arrival rate 30 customer per hour:
One
of th
histo e fam
rical ous
an a stud
pplic y sho
distri a ti w
butio on of Po s
num n, to isson
bers estim
of at
soldie Prussian e the
rs kil cava
hors led d lry
e-kic ue to
ks in
a ye
ar!
Example 13
As the story goes, the Prussian soldiers monitored 10 cavalry corps over a period of 20 years. The
annual number of recorded deaths due to horse-kick ‘k’ observations is as shown in the table:
k 0 1 2 3 4 Total
Number
of deaths 109 65 22 3 1 200
P(k = 0) =
P(k = 1) =
P(k = 2) =
P(k = 3) =
P(k = 4) =
k 0 1 2 3 4
Number 109 65 22 3 1
of deaths
Poisson 200×0.54 200×0.33 200×0.1 200×0.02 200×0.003
predicted =108 =66 =20 =4 =0.6~1
deaths
Yes, the Poisson predictions are adequate for the given data
Example 14
A traffic engineer records the number of bicycle riders that use a particular cycle track. He records
that an average of 3.2 bicycle riders use the cycle track every hour. Given that the number of bicycles
that use the cycle track follow a Poisson distribution, what is the probability that:
a) 2 or less bicycle riders will use the cycle track within an hour?
b) 3 or more bicycle riders will approach the intersection within an hour?
Also write the mean expectation and variance for the random variable X
Solution:
For this problem, E(X) = Var(X) = = 3.2
a) The goal is to find P(X 2)
As
P(X = 0) = =
P(X = 1) = =
P(X = 2) = =
Example 15
A particular river near a small-town floods and overflows twice in every 10-years on an average.
Assuming that the Poisson distribution is appropriate, what is the mean expectation. Also calculate
the probability of 3 or less overflow floods in a 10-year interval.
Solution: As the average event of flood overflow, in every 50-years is two
In the given Poisson distribution, = 2
As
P (X = 0) = = 0.14
P (X = 1) = = 0.27
P (X = 2) = = 0.27
P (X = 3) = = 0.18
Example 16:
For a Poisson distribution model, if arrival rate of passengers at
an airport is recorded as 30 per hour on a given day. Find:
a) The expected number of arrivals in the first 10 minutes
of an hour
b) The probability of exactly 4 arrivals in the first 10 minutes
of an hour
c) The probability of 4 or fewer arrivals in the first 10 minutes
of an hour
Picture courtesy: https://
d) The probability of 10 or more arrivals in an hour given
englishlive.ef.com/blog/career-
that there are 8 arrivals in the first 10 minutes of that
english/travel-english-key-words-
hour
for-the-airport/
a) As 10 minutes = th of an hour
In the given Poisson distribution, X is defined as number of arrivals in the first 10 minutes
and P( X = k) = , where k = 0, 1, 2, 3, …
= +
= 0.007+0.03+0.08+0.14+0.18 0.44
d) We are given that there have been 8 arrivals in the first 10 minutes (= hour)
(= hour)
probability of 10 or more arrivals in an hour given that there are 8 arrivals in the first 10
minutes of that hour = P(k = 2, 3, 4, 5, …
= 1 – P (k = 0, 1)
= + )
( ) ≥ 0, ∀ ( −∞, ∞)
2
1 −
1 −
such that ( )= . 2
—————— (i)
√2
w here, (–, ) is the mean of normal distribution
> 0 is the standard deviation
When a random variable can take on any value within a given range where the probability
distribution is continuous ( refer 4.1.1), it is called a normal distribution or Gaussian distribution. A
random variable with a Gaussian distribution is said to be normally distributed, and is called a normal
deviate
In the normal distribution function given by (i), the curve known as probability curve is bell-
shaped with one peak point as shown below:
5. The distribution can be described by two values: the mean and the standard deviation
l-life
d is a rea al
r rm
n boa w no
Galto ple of ho show the
exam tion can ibution
u tr
distrib ability dis sisfun.com
prob w.math .html
://ww unx
https ata/quinc
/d
But in a real-life situation, there can be a data set with a mean as 50 and standard deviation of
3 while there can be another data set with a mean of 100 and a standard deviation of 5. How do
we compare such different normally distributed data sets?
Example 17:
Calculate Z-Score for a normal distribution of length of 7 rare species of Indian
butterfly that you have in your garden
Butterfly 1 2 3 4 5 6 7
Length
(in cm) 2 2 3 2 5 1 6
and [as ]
Butterfly 1 2 3 4 5 6 7
Length (in cm) 2 2 3 2 5 1 6
where
5. 95.45% probability of randomly selecting a score between -2 and +2 standard deviations from
the mean.
where
6. 99.73% probability of randomly selecting a score between -3 and +3 standard deviations from
the mean.
where
Z =
b) As and and Z = 5
Then Z =
c) As and and Z = -2
Then Z =
Example 19
In a district, exam scores of 300 student of class XII are recorded at the end of the session.
a) Ramesh scored 800 marks in total out of 1000. The average score for the batch was 700 and
the standard deviation was calculated to be 180. Find out how has Ramesh scored compared
to his batch mates in the whole district.
Z =
Once you have the Z-Score, the next step is choosing between the two Z- Tables. (Refer Appendix
at the end)
In the Z-table, go vertically down on the leftmost column to find the value of the first two digits
of your Z Score (0.5 in this case) and then go alongside on the topmost row to find the value of the
digits at the second decimal position (.06 in this case). Once you have mapped these two values, the
intersection of the row of the first two digits and column of the second decimal point in the table
gives the value 0.7123 i.e. the area on the left of ordinate corresponding to Z = 0.56. This area also
represents the probability of scoring < 800 marks.
Lastly, to get this as a percentage we multiply that number with 100 i.e. 0.7123 x 100 = 71.23%.
Hence, we can say that Ramesh did better than 71.23% of students in the district.
b) In the case of Sudha, and and = 420
Z =
Looking at the Z-Table we can say that it maps to 0.0594 and hence we can say that Sudha did
better than 100 5.94% of students in the district.
c) If Abhay has done 44.83% better than his batchmates, then his score on Z-Table is 44.83 ÷
100 = 0.4483 which corresponds to Z-Score = - 0.13
Here and and Z = -0.13
Therefore Z =
Which means that Abhay has scored approximately 677 marks out of 1000
Example 20:
Given that the scores of a set of candidates on an IQ test are normally distributed. If the IQ test has
a mean of 100 and a standard deviation of 10, what is the probability that a candidate who takes
the test will score between 90 and 110?
Solution: P(90 < X < 110) = P( X < 110 ) - P( X < 90 )
P( 90 < X < 110 ) = P (–1 < Z < 1) = P (Z < 1) – P(Z < –1)
= 0.8413 – 0.1587 = 0.6826
a. X -1 0 1 2
P(X) 0.1 0.8 0.001 0.2
b. X 1 2 3 4 5
P(X) 0.1 0.4 0.05 -0.2 0.2
c. X -2 2 5
P(X) 0.5 0.2 0.3
2. A lady’s bag contains 2 black and 1 red pens. One pen is drawn at random and then put
back in the box after noting its colour. The process is repeated again. If X denotes the number
of red pens recorded in the two draws. Describe X.
3. What is the mean of the numbers obtained on throwing a die having written 1 on three faces,
2 on two faces and 5 on one face?
4. Raheem tossed a fair coin 10 times, find the probability of (i) exactly six heads (ii) at least six
heads (iii) at most six heads.
5. Find the probability of getting 5 exactly twice in 7 throws of a fair die.
6. Let X denote the number of hours a class XII student studies during a randomly selected
school day. The probability that X can take the values xi, for an unknown constant ‘k’
0.1, if xi 0
kx , if xi 1 or 2
P(X = xi) = i
k(5 x ), if xi 3, 4
i
n 2
2 2
n
by x Var(X)
i 1
xi pi –
i 1
xi pi
n
2
2
In other words, Var (X) = E(X ) – [E(X)] , where E(X ) = 2 2
x
i 1
i pi
P (‘r’ successes) =
i.
ii. for k = 0, 1, 2, …
18. A continuous random variable X is defined in terms of its probability density function
also known as PDF as well
19. A continuous random variable X is designed to follow normal distribution with constant
parameters = E(X) and Var(x) = and written as X N ( , )
e. The distribution can be described by two values: the mean and the standard deviation
22. When mean = 0 and standard deviation = 1 for a data set, then the normal distribution
is called as standard normal distribution
24. When the Z-score is positive if the data point lies above the mean, and negative if it lies below
the mean
25. When the data values in a normal distribution are converted into Z-scores in a standard
normal distribution, then the percentage of the data that fall within specific numbers of
standard deviations () from the mean () for bell-shaped curve is constant.
i. Data points are symmetrical along the mean (
ii. Z-score describes the position of each data point in terms of its distance from the mean,
when measured in standard deviation units.
iii. The Z-score is positive if the data point lies above the mean, and negative if it lies below
the mean.
iv. There is a 68.27% probability of randomly selecting a Z-score between -1 and +1 standard
deviations from the mean.
b. where
b. where
vi. 99.73% probability of randomly selecting a score between -3 and +3 standard deviations
from the mean.
b. where
1. (a) no, p 1 (b) no, p < 0 (c) yes 2. X = 0,1,2 3. 2 4. 105/512, 193/512, 53/64 5.
6. k = 0.15, 0.75, 0.3, 0.55 7. 4 or more times 8. 7 9. 0.24 10. 32/81 11. 0.5 12. 7 13. 0.217 14. 99.89%
15. 2 16. 0.05 17. 0.235 18. 0.099 19. (a) 0.11507 , (b) 0.88493, (c) 0.5328, (d) 0.6826 20. better than
43 other volunteers 21. 90% 23. (i) 1781, (ii). 424 24. Mean = 50, SD = 10