Topic 4B Common Continuous Distributions
Topic 4B Common Continuous Distributions
Probability Distributions
Solution
0 < x < 10
( 1
f (x) = 10
0 otherwise.
R3
(a) P(X < 3) = 1
0 10
dx = 10 .
3
R 10
(b) P(X > 6) = 6
1
10 dx = 25 .
R8
(c) P(3 < X < 8) = 1
3 10
dx = 12 .
1
2 Elements of Probability & Statistics– MTH1202
Example 2 Let X be a uniform continuous random variable that denotes the current
measured in a thin copper wire in milliamperes. Assume that the range of X is [0, 20]
mA.
Solution
(a) f (x) = 1
20 = 0.05 for 0 ≤ x ≤ 20.
R 10
Figure 6.1: The shaded area is the needed probability 5
0.05dx.
(c) Substituting into the formulae for mean and variance seen above with
a = 0 and b = 20, we obtain E(X) = 10 mA and√Var(X) = 202 /12 = 33.33
mA2 . Consequently, the standard deviation is 33.33 = 5.77 mA.
Note: We could have used definitions and pdf to obtain the mean, variance and
standard deviation.
Exercise
1. Suppose X has a continuous uniform distribution over the interval
[2.5, 6.5].
2. Suppose Y has a continuous uniform distribution over the interval [−2, 2].
(a) Determine the mean, variance and standard deviation of the weight
for the babies.
(b) Determine the cumulative distribution function of the weight of the
babies.
(c) Determine P(X < 3.2).
5. Suppose the time it takes for a bank teller to serve a customer is uniformly
distributed between 1.5 and 2.5 minutes.
(a) What is the mean and variance of the time it takes a bank teller to
serve a customer.
(b) What is the probability that it takes less than 2 minutes for a customer
to be served?
(c) Of the next 50 customers, how many will be served with in 2.2
minutes each?
(d) Determine the cumulative distribution function of the time it takes
for a bank teller to serve a customer.
6. The probability density function of the time it takes hematology cell counter
to complete a test on a blood sample is uniform on [50, 75] seconds.
2 ∞ −λx
∞ Z ∞
2 −2 2
E(X2 ) = −x2 e−λx − xe−λx + e dx = 2 e−λx = 2 .
λ 0 λ 0 λ 0 λ
Hence
2 1 1
Var(X) = E(X2 ) − [E(X)]2 = − 2 = 2.
λ 2 λ λ
Solution
(a) P(X > x) = e−λx ⇒ P(X > 1300) = e−0.001∗1300 = e−1.3 = 0.273.
(b) P[(X > 1500)|(X > 1300)] = P(X > 200) = e−0.001∗200 = e−0.2 = 0.819.
q
(c) Standard deviation = λ12 = λ1 = 1000.
Example 5 The length of time for a person to be served at a cafeteria is a random variable
having an exponential distribution with mean 4 minutes. What is the probability that a
person is served in less than 3 minutes on at least 4 of the next 6 days?
Solution
First obtain the probability of being served in less than 3 minutes on any day.
Let X represent the length of time it takes to serve a person. Therefore, with
R3 1
mean=4, λ = 14 this implies that, P(X < 3) = 14 0 e− 4 x dx = 1 − e−0.75 = 0.5276.
Next we treat a day as a Bernoulli trial with the event "served in less than 3
minutes" a success with a probability p = 0.5276. Hence,
6 !
X 6
P(at least 4 successes in 6 days) = P(X ≥ 4) = px q6−x = 0.3968.
x
x=4
Example 7 On a busy road, accidents occur at random at the rate of 3 accidents per
day. Find the probability that after a particular accident has occurred, at least one day
goes without another accident.
Solution
8 Elements of Probability & Statistics– MTH1202
( βα α−1 −βx
Γ(α) x e for x > 0,
f (x) =
0 otherwise.
Z ∞
f (x)dx = 1.
0
Now,
∞
βα ∞
Z Z
f (x)dx = xα−1 e−βx dx,
0 Γ(α) 0
Z ∞ !α−1
∞
βα
Z
u du
f (x)dx = e−u
0 Γ(α) 0 β β
α Z ∞ α−1
β u
= e−u du
Γ(α) 0 βα
βα 1
Z ∞
= uα−1 e−u du
Γ(α) βα 0
βα 1
= Γ(α) = 1.
Γ(α) βα
α α
E(X) = and Var(X) = 2 .
β β
10 Elements of Probability & Statistics– MTH1202
Proof:
∞ ∞
βα α −βx
Z Z
E(X) = x f (x)dx = x e dx
0 0 Γ(α)
βα
Z ∞
= xα e−βx dx
Γ(α) 0
Z ∞ !α
βα u 1
= e−u du
Γ(α) 0 β β
!α+1 Z ∞
βα 1
= uα e−u du
Γ(α) β 0
!α+1 Z ∞
βα 1
= u(α+1)−1 e−u du
Γ(α) β 0
!α+1
βα 1
= Γ(α + 1)
Γ(α) β
βα α
= .αΓ(α) = .
Γ(α)βα+1 β
α(α+1)
Similarly E(X2 ) = β2
. Check it out. So,
α(α + 1) α2 α
Var(X) = − 2 = 2.
β 2 β β
Recall that:
R∞
(i) If α = 1 then f (x) = 0
βe−βx for β > 0, x > 0, this is equivalent to an
exp(β).
(ii) If β = 12 and α = n2 , then the gamma distribution is called the chi-square
(χ2 ) distribution with n degrees of freedom. The mean of a chi-square is n
and the variance is given by 2n. The pdf of a chi-square can be written as:
n−2
−x
1
2 n2 Γ( n ) x 2 e 2
for x > 0,
f (x) =
2
0 elsewhere.
• compute the mean µ and the variance σ2 of the normal random variable.
• learn the standard normal Z.
• find probabilities from Normal tables.
1 1 x−µ 2
f (x) = √ e− 2 ( σ ) , for − ∞ < x < ∞.
σ 2π
The distribution defined by a density function above is a normal distribution
and we say that X is normally distributed with the mean µ and variance σ2 ,
written as X ∼ N(µ, σ2 ).
(h) The total area under the curve and above the horizontal axis equals 1.
The normal distribution was introduced by the French mathematician
Abraham de Moivre in 1733 and was used by him to approximate probabilities
associated with binomial random variables when the binomial parameter n
is large. This result was later extended by Laplace and others and is now
encompassed in a probability theorem known as the central limit theorem,
which gives a theoretical base to the often noted empirical observation that,
in practice, many random phenomena obey, at least approximately, a normal
probability distribution. Some examples of this behaviour are the height of a
person, the velocity in any direction of a molecule in gas, and the error made in
measuring a physical quantity.
Exercise: Show that f (x) is indeed a probability density function ( that is the
area under the normal curve equals 1).
Hint: you need to show that:
Z ∞ Z ∞
1 1 x−µ 2
f (x)dx = √ e− 2 ( σ ) dx = 1.
−∞ σ 2π −∞
x−µ
By making the substitution y = σ you observe that:
Z ∞ Z ∞
1 1 x−µ 2 1 1 2
√ e− 2 ( σ ) dx = √ e− 2 y dy.
σ 2π −∞ 2π −∞
Z ∞
1 2 √
It then suffices to show that, e− 2 y dy = 2π.
−∞
B.K. Nannyonga, H.W. Kayondo, N. Muyinda, & B.N. Kirenga c 13
R∞ 1 2
Let I = −∞
e− 2 y dy, then,
Z ∞ Z ∞ Z ∞ Z ∞
e− 2 (y +x ) dydx.
1 2 1 2 1 2 2
I2 = e− 2 y dy · e− 2 x dx =
−∞ −∞ −∞ −∞
Thus,
Z ∞ Z 2π Z ∞
1 2 1 2 1 2
I2 = re− 2 r dθdr = 2π 0 = 2π.
re− 2 r = | − 2πe− 2 r |∞
0 0 0
√ √
R∞ x−µ
− 21 ( σ )2
Hence I = 2π → √1 −∞ e dx = √1 · 2π = 1. Hence the area under
σ 2π 2π
the normal curve is equal to 1, which proves that f (x) is a probability density
function (pdf).
The mean of a normal distribution is µ and the variance is σ2 . The Proof
of this will be done in a second year course unit (Probability Theory) using
Moment Generating Function technique.
The normal distribution arises in many statistical problems and forms the
foundation for the theory of statistical inference.
X−µ
Let X ∼ Normal(µ, σ2 ). Consider a random variable Z = . Then,
σ
X µ µ µ
E(Z) = E( ) − E( ) = − = 0
σ σ σ σ
and
1 µ σ2
Var(X) − Var( ) = 2 − 0 = 1.
Var(Z) =
σ2 σ σ
In this case, Z is called a standard normal random variable or Z is said to have a
standard (unit) normal distribution. Thus if X ∼ N(µ, σ2 ), then Z ∼ N(0, 1). The
Z− score indicates how many standard deviations an element is from the mean.
If a Z− score is equal to zero, (0), it is on the mean. If it is +1, it is 1 standard
deviation from the mean. Negative values indicate that the raw scores were
below the mean.
x−µ
value of Z is given by z = . Therefore if X falls between two values, X = x1
σ
and X = x2 then the standard normal Z falls between corresponding values:
x1 − µ x2 − µ
z1 = and z2 = .
σ σ
Thus, we have the following:
Z x2
1 1 x−µ 2
P(x1 < X < x2 ) = √ e− 2 ( σ ) dx,
σ 2π x1
Z z2
1 1 2
= √ e− 2 z dz,
2π z1
= P(z1 < Z < z2 ).
and
a−µ X−µ b−µ
!
P(a < X < b) = P < < ,
σ σ σ
a−µ b−µ
!
= P <Z< .
σ σ
The probabilities of N(0, 1) are tabulated and published in statistical tables. The
following examples will guide us through on how to read normal tables as
well as applying normal distribution when solving mathematical and statistical
problems.
Solution
x−µ
(a) P(X < 11) = P( σ < 11−3
4 ) = P(z < 2). The shaded area under the graph
corresponds to P(z < 2). From the tables, it follows that P(z < 2) = 0.9772.
x−µ
(b) P(X > −1) = P( σ > −1−3
4 ) = P(z > −1). The shaded area under the graph
corresponds to P(z > −1). From the tables, it follows that,
P(z > −1) = 0.8413.
x−µ
(c) P(2 < X < 7) = P( 2−3
4 < σ < 4 ) = P(−0.25 < z < 1). The shaded area
4−3
under the graph corresponds to P(−0.25 < z < 1). From the tables, it
follows that P(−0.25 < z < 1) = φ(1) − φ(−0.25) = 0.8413 − 0.4013 = 0.44.
B.K. Nannyonga, H.W. Kayondo, N. Muyinda, & B.N. Kirenga c 15
Example 10 Suppose that the distribution of monthly earnings for all people who
possess a bachelor’s degree is known to be bell-shaped and symmetric with a mean
of $2116 and a standard deviation of $455. What percentage of individuals with a
bachelor’s degree earn:
(a) less than $1661 per month?
(b) more than $1206 per month?
(c) between $3026 and $3481 per month?
Solution
Let X be the random variable representing monthly earnings for all people
who possess a bachelor’s degree. Then X ∼ Normal($2116,$4552 ).
(a) P(X < $1661) = P(z < 1661−2116
455 ) = P(z < −1).
16 Elements of Probability & Statistics– MTH1202
The shaded area under the graph corresponds to P(z < −1). From the
tables, it follows that P(z < −1) = 0.1587. Therefore, approximately 16% of
the people who possess bachelor’s degree earn less than $1661 per month.
The shaded area under the graph corresponds to P(z > −2). From the tables,
it follows that P(z > −2) = 1 − 0.0228 = 0.9772. Therefore, approximately
98% of the people who possess bachelor’s degree earn more than $1206
per month.
The shaded area under the graph corresponds to P(2 < z < 3). From the
tables, it follows that P(2 < z < 3) = φ(3) − φ(2) = 0.9987 − 0.9772 = 0.0215.
B.K. Nannyonga, H.W. Kayondo, N. Muyinda, & B.N. Kirenga c 17
The shaded area under the graph corresponds to P(−1 < z < 3). From the
tables, it follows that P(−1 < z < 3) = φ(3) − φ(−1) = 0.9987 − 0.1587 = 0.84.
Therefore, approximately 84% of American women aged 18 to 24 are
between 60 and 70 inches tall.
(b) P(Y < 65) = P(z < 65−62.5
2.5 ) = P(z < 1). The shaded area under the graph
corresponds to P(z < 1). From the tables, it follows that P(z < 1) = 0.8413.
Therefore, approximately 84% of American women aged 18 to 24 are less
than 65 inches tall.
18 Elements of Probability & Statistics– MTH1202
The shaded area under the graph corresponds to P(−3 < z < −1). From the
tables, it follows that P(−3 < z < −1) = φ(−1) − φ(−3) = 0.1587 − 0.0013 =
0.1574. Therefore, approximately 16% of American women aged 18 to 24
are between 55 and 60 inches tall.
Remark: It is good practise to always draw the normal curve and shade the
underlying area you are looking for, as done in the previous examples.
Example 12 The I.Q scores of all teenagers in a certain country are known to follow a
normal distribution with a mean of 100 and a standard deviation of 10.5.
(a) What percentage of teenagers in that country have I.Q scores between 89.5 and
121?
(b) What is the probability of finding a teenager in that country with an I.Q score of
less than 79?
(c) How many out of 1000 teenagers in that country have I.Q scores between 110.5
and 131.5?
Solution
Let X be a random variable representing I.Q scores for teenagers in that
country. Then, X ∼ Normal(100,10.5).
(a) P(89.5 < X < 121) = P( 89.5−100
10.5 < z < 121−100
10.5 ) = P(−1 < z < 2) = φ(2) −
φ(−1). From the tables, P(−1 < z < 2) = 0.9772−0.1587 = 0.8185. Therefore,
approximately 82% of teenagers in that country have I.Q scores between
89.5 and 121.
(b) P(X < 79) = P(z < 79−100
10.5 ) = P(z < −2). From the tables, P(z < −2) = 0.0228.
Therefore the probability that a teenager in that country has an I.Q score
of less than 79 is 0.0228.
(c) P(110.5 < X < 131.5) = P( 110.5−100
10.5 < z < 131.5−100
10.5 ) = P(1 < z < 3) =
φ(3) − φ(1). From the tables, P(1 < z < 3) = 0.9987 − 0.8413 = 0.1574.
Therefore, approximately 157 teenagers out of 1000 in that country have
I.Q scores between 110.5 and 131.5.
B.K. Nannyonga, H.W. Kayondo, N. Muyinda, & B.N. Kirenga c 19
Example 13 Suppose that the test scores for a particular college entrance exam are
distributed according to a bell-shaped, symmetric distribution with a mean of 450 and a
variance of 10000.
(a) What percentage of the students who take the entrance exam score between 350
and 650?
(b) Any student who scores higher than 550 is automatically admitted to the college.
What is the probability that a student who takes the entrance exam will be
automatically admitted to the college?
(c) What percentage of the students who take the entrance exam score between 250
and 350?
Solution
Example 14 For one of the examinations done by students offering a Bachelor’s degree
at the University, the average score was 72% and the standard deviation was 8%. If top
10% performers obtained A’s and the scores were normally distributed, what was the
lowest possible A and highest possible B?
Solution
From the figure, we need to first find a z-value that leaves an area of 0.1 to the
right and 0.9 to the left. From the tables, the z value is 1.28. It follows that
0.08 = 1.28. This gives x = 0.8224. Therefore the highest possible B is 82% and
x−0.72
Example 15 During measles outbreak in one village, the probability that a baby
recovered from it was 0.6. If 120 babies were known to have contracted measles, what is
the probability that less than one-half survived?
B.K. Nannyonga, H.W. Kayondo, N. Muyinda, & B.N. Kirenga c 21
Solution
Let X be the binomial random variable that represent the number of babies that
survived. Since n = 120, we can obtain fairly accurate results using the normal
approximation with:
µ = np = 120 × 0.6 = 72,
√ √
σ = npq = 120 × 0.6 × 0.4 = 5.3666.
The required probability is given by:
59
X 59.5 − 72
P(X < 60) = bin(120, 0.6) ≈ P z < = P(z < −2.33) = 0.0107.
x=0
5.3666
Solution
Let X represent the number of right answers due to guesswork, then X ∼ bin(75,
0.25). We need to work out:
30
X
P(20 ≤ X ≤ 30) = bin(75, 0.25).
20
µ = np = 75 × 0.25 = 18.75,
√ √
σ = npq = 75 × 0.25 × 0.75 = 3.75.
The required probability is given by:
30 19.5 − 18.75
X 30.5 − 18.75
P(20 ≤ X ≤ 30) = bin(75, 0.25) ≈ P ≤z≤ = P(0.2 ≤ z ≤ 3.13) = 0.4198.
20
3.75 3.75
x=4
P
Example 17 Find the percentage error is approximating bin(20, 0.1) by the normal
x=1
approximation.
Solution
x=4
P
The exact solution for bin(20, 0.1) is given by:
x=1
x=4 k=4 !
X X 20
bin(20, 0.1) = (0.1)k (0.9)20−k = 0.8353.
k
x=1 k=1
22 Elements of Probability & Statistics– MTH1202
x=4
bin(20, 0.1), we use µ = np = 20 × 0.1 = 2
P
For normal approximation of
√ x=1
√
and σ = npq = 20 × 0.1 × 0.9 = 1.3416. The normal approximation is thus
given as:
x=4 0.5 − 2
X 4.5 − 2
bin(20, 0.1) ≈ P ≤z≤ = P(−1.12 ≤ z ≤ 1.86) = 0.8372.
1.4316 1.4316
x=1
This results in an error of |0.8372 − 0.8353| = 0.0019. The percentage error is then
given as:
0.0019
%error = × 100% = 0.2%.
0.8353
Remark: We subtract or add 0.5 to the original binomial values since we are
approximating a discrete distribution by a continuous distribution. The 0.5 in
this case is called continuity correction factor.
B.K. Nannyonga, H.W. Kayondo, N. Muyinda, & B.N. Kirenga c 23
Exercise
(a) what fraction of the bottles will contain more than 285 millilitres of
soda?
(b) what is the probability that a packed bottle contains between 275 and
307 millilitres of soda?
(c) How many bottles will likely overflow if 290-millilitre bottles are
used for a special order made for 500 bottles?
(d) below what value is the smallest 18% of the packed soda bottles?
(a) What is the probability that a journey will take at least quarter an
hour?
(b) If lectures begin at 8:00 A.M and she leaves her hostel at 7:40 A.M
every day, how many days is she late for lectures given that she
attends lectures for 5 days in a week?
24 Elements of Probability & Statistics– MTH1202
(c) Find the probability that 3 of the next 5 journeys will take at least 20
minutes?
5. For a Psychology exam done last year at a certain University, grades were
approximately normally distributed with a mean of 72% and a standard
deviation of 8.5%, find:
(a) the pass mark if the lowest 14% of the students failed the exam,
(b) the highest mark for a B if the top 8% of the students obtained A’s.
6. In a certain continuous assessment test (CAT) done and marked out of 30
at the department of mathematics, the average was 24.6, with a standard
deviation of 1.5. Given that 8 students scored between 26.4 and 28.2, and
that the CAT scores were approximately normally distributed, how many
students sat for the CAT?
7. The average life of a certain type of car batteries is 2 years , with a standard
deviation of 3 months. The company replaces free all car batteries that
fail while under guarantee. If the company is willing to replace only 5%
of car batteries that fail, how long a guarantee should the company offer
assuming that the lives for car batteries follow a normal distribution?
8. Given that Y represents a binomial distribution with parameters n = 15
and p = 0.35. Find the error in approximating P(2 ≤ Y < 5) by the normal
approximation.