ISE1204 Lecture4-1
ISE1204 Lecture4-1
K. Mutangi
May 5, 2021
Denition
For a discrete random variable, its probability distribution is any
table, graph, or formula that gives each possible value and the
probability of that value.
Example
Consider the experiment of tossing a coin twice and noting the
outcome. The probability distribution for the number of heads is as
follows:
Note that S = {HH , HT , TH , TT }.
Heads (X) 0 1 2
Probability 0.25 0.5 0.25
Example
A fair six sided die is tossed. You win $2 if the result is a 1, you
win $1 if the result is a 6 but otherwise you lose $1. Construct the
probability distribution for your gains or loses. In this case we let
the r.v X= the amount won or lost, then the probability distribution
of X is:
X +2 +1 -1
Probability 1/6 1/6 4/6
Denition
Mean of X: For a discrete random variable X, the mean is denoted
n
E (X ) and is given by E (X ) = xi P (xi )
X
i =1
Where P (xi ) is the probability at xi .
K. Mutangi ISE1204: APPLIED STATISTICS
Discrete Probability Distributions
Discrete Probability Distributions
Denition
Variance: The variance for a r.v is given by
Var (X ) =X E (x − µ)2 = E (x 2 ) − µ2 . For a discrete r.v,
E (x 2 ) = x 2 P (x ), hence the variance is
n
Var (x ) = x12 P (xi ) − µ2 .
X
i =1
Note that µ = E (x ).
Example
Calculate the mean and variance for the r.v X in the game
described in the previous example.
Mean = 2( 61 ) + 1( 16 ) − 1( 46 ) = − 16
E (x 2 ) = 4( 16 ) + 16 + 46 = 96 Hence var (x ) = 9
6 − 1
36 = 53
36
K. Mutangi ISE1204: APPLIED STATISTICS
Bernoulli Distribution
Discrete Probability distributions
Denition
Bernoulli distribution: A random variable X which has two possible
outcomes (say TRUE or FALSE, Head or Tail, success or failure,
etc) is called a Bernoulli random variable. The probability
distribution function (pdf) of X is given by:
fX (x , θ) = θx (1 − θ)1−x , x = 0, 1
n
= x !(nn−! x )!
x
Mean is E (x ) = nθ and variance is var (x ) = nθ(1 − θ).
K. Mutangi ISE1204: APPLIED STATISTICS
Binomial distribution
Discrete Probability Distributions
Example
A particular mobile phone link is known to transmit 6% of bits of
information in error. Find the probability that two bits out of the
next 10 transmitted are in error.
Geometric distribution:
It's the probability that the rst occurrence of success requires
k number of independent trials, each with success probability
θ. If the probability of success on each trial is θ, then the
probability that the kth trial (out of k trials) is the rst success
is P (X = k ) = θ(1 − θ)k −1 , k = 1, 2, 3, ...
The above form of geometric distribution is used for modeling
the number of trials up to and including the rst success.
The geometric distribution is an appropriate model if the
following assumptions are true.
The phenomenon being modelled is a sequence of independent
trials.
There are only two possible outcomes for each trial, often
designated success or failure.
The probability of success, θ is the same for every trial.
Example
Products produced by a machine has a 3% defective rate. What is
the probability that the rst defective occurs in the fth item
inspected?
P [X = 5] = θ[1 − θ]k −1 = 0.03[0.97]4
Example
Let X be a geometric random variable with θ = 0 : 25. What is the
probability that the rst success occurs on the 4th trial?
P [X = 4] = 0.25[0.75]3
Example
Computer malfunctions in a big company occur at an average rate
of 1.5 per hour. What is the probability of observing
[b] P [X > 2] = 1 − P [X ≤ 2] = 1 − [P [X = 0] + P [X =
1] + P [X = 2]]
[c] Here, the mean changes to λ = 1.5 × 2 = 3 since the time
is now 2 hours. At most 2 means up to and including 2
suggesting that we have
e −3 3 0 e −3 3 2
P [X ≤ 2 ] = + e −3 × 3 +
0! 2!
K. Mutangi ISE1204: APPLIED STATISTICS
Continuous Probability distributions
Denition
Let X1 , X2 , . . . , Xn be a random sample from a population that is
normally distributed with mean µ and variance σ 2 ., the probability
density function of X is given by
1 (x − µ)2
fX (x , µ, σ 2 ) = √ exp{−[ i 2 ]}, −∞ < x < ∞
2π 2σ
Example
Let X be a normally distributed random variable with µ = 10 and
standard deviation σ = 2. Find the probability that X lies between
11 and 13.6
Solution to Example
11 − 10 13.6 − 10
P (11 < x < 13.6) = P <z <
2 2
= P (0.5 < z < 1.80)
= P (z < 1.80) − P (z < 0.5)
= 0.9641 − 0.6915 = 0.2726
Example
The grade point averages of a large population of college students
are approximately normally distributed with mean equal to 2.4 and
standard deviation equal to 0.8. What fraction of the students will
possess a grade point average in excess of 3.0?
Example
Solution:
3 − 2.4
P (x > 3.0) = P Z >
0.8
= P (z > 0.75) = 1 − P (z ≤ 0.75)
= 1 − 0.7734 = 0.2266
Theorem
Central limit theorem: If random samples of n observations are
drawn from a population with nite mean, µ, and standard
deviation, σ , then, when n is large, the sample mean x̄ will be
approximately normally distributed with mean µ and standard
deviation √σn . The approximation becomes more and more accurate
as n becomes large.
K. Mutangi ISE1204: APPLIED STATISTICS
Normal Probability Distribution
Theorem
Normal Approximation to the binomial distribution.
When n is suciently large, the binomial variable X will be
approximately normally distributed with mean µ = nθ, and variance
σ 2 = nθ(1 − θ). We may then use areas under a tted normal
curve to approximate binomial probabilities:z = √ x −nθ
nθ(1−θ)
Example
A salesman has found that, on average, the probability of a sale on
a single contact is 0.3. If the salesman contacts 50 customers, what
is the probability that at least 10 will buy?
9.5 − 15
P (x > 9.5) = P (z > √ )
10.5
= P (z > −1.70) = 1 − P (z ≤ −1.70)
= 1 − 0.0446 = 0.9554
Exponential Distribution.
the exponential distribution (negative exponential distribution)
is the probability distribution that describes the time between
events in a Poisson process, i.e. a process in which events
occur continuously and independently at a constant average
rate.
Denition
A random variable X is described as an exponential r.v if its pdf is
given by λe −λx , λ > 0, 0 < x < ∞
E (x ) = λ1 , var (x ) = λ12
The exponential distribution is used to model the time
between the occurrence of events in an interval of time, or the
distance between events in space.
Theorem
Let Z1 , Z2 , . . . , Zn be independent standard normal random
n
variables. Then U = Zi2 is chisquared with n degrees of
X
i =1
freedom.
Theorem
Student's-t distribution
Let Z ∼ N (0, 1) and U ∼ χ2 (r ) be independent variables, then
T = qZU follows a t-distribution with r degrees of freedom.
r