Lec 3
Lec 3
Lec 3
Descriptive Statistics
Discrete Probability Distributions
Continuous Probability Distributions
Stochastic Processes
0.08
0.06
0.04
0.02
0.00
-300 -200 -100 0 100 200 300 400 500 600 700 800
Profit x
OR
BOTH!!!!
3
Random Variables and Probability Density Functions
A random variable is a quantity whose value is not known exactly but its probability distribution is known. The
value of the random variable will vary from trial to trial as the experiment is repeated. The variables
probability density function (PDF) describes how these values are distributed (i.e. it gives the probability that
the variable value falls within a particular interval).
Continuous PDFs
f(x) All values between 0 f(x)
and 1 are equally likely Smallest values
are most likely
0 1 0
Uniform distribution x Exponential distribution x
(e.g. soil texture) (e.g. event rainfall)
0.3
f (x)
Expected Returns
Expected returns are based on the probabilities of possible outcomes
n
E ( R ) = pi Ri
i =1
In this context, expected means average if the process is repeated
many times
The expected return does not even have to be a possible return
6
State Probability C T
n
2 = pi ( Ri - E ( R))2
i =1
8
Stock C
s2 = .3(.15-.099)2 + .5(.1-.099)2 + .2(.02-.099)2 = .002029
s = .045
Stock T
s2 = .3(.25-.177)2 + .5(.2-.177)2 + .2(.01-.177)2 = .007441
s = .0863
9
Another Example
Consider the following information:
Another Example
Consider the following information:
State Probability ABC, Inc.
Binomial Distribution
In many Geographic studies, we often face a situation where we deal with a random
variable that only takes two values, zero-one, yes-no, presence-absence, over a given
period of time. Since there are only two possible outcomes, knowing the probability of one
knows the probability of the other.
P(1)=p
P(0)=1-p=q
If the random experiment is conducted n times, then the probability for the event to happen
x times follow binomial distribution:
n x n - x n!
P( x) = p q = p x q n - x
x x!(n - x)!
P(1)=4/25=0.16, so P(0)=1-0.16=0.84.
5!
P(4) = 0.16 4 0.845-4 = 0.003
4!(5 - 4)!
5!
P(5) = 0.165 0.845-5 = 0.000
5!(5 - 5)!
14
5!
P(4) = 0.16 4 0.845-4 = 0.003
4!(5 - 4)!
5!
P(5) = 0.165 0.845-5 = 0.000
5!(5 - 5)!
15
Poisson Probability Distribution
The Poisson distribution is
e-l l x
f ( x) = x = 0,1, 2,...
x!
Where the parameter l>0 is the mean number of successes in the interval.
The mean and variance of the Poisson distribution are
= l and s 2 = l
16
e -1.21.20 e -1.21.21
P(0) = = 0.301 P(1) = = 0.361
0! 1!
e1.21.22 e1.21.23
P(2) = = 0.217 P(3) = = 0.087
2! 3!
e1.21.24
P(4) = = 0.026
4!
17
e -1.21.20 e -1.21.21
P(0) = = 0.301 P(1) = = 0.361
0! 1!
e1.21.22 e1.21.23
P(2) = = 0.217 P(3) = = 0.087
2! 3!
e1.21.24
P(4) = = 0.026
4!
18
f ( x) = e 2s 2
- < x <
a continuous random variable. s 2p
It has been used in a wide variety of with mean and variance s 2
applications: The normal distribution is: X : N ( , s 2 )
Heights and weights of people
The visual appearance of the normal
Test scores distribution is a symmetric, unimodal or
Scientific measurements bell-shaped curve as shown in the figure.
Amounts of rainfall
It is widely used in statistical inference
We can use the following function to convert any normal random variable to a
standard normal random variable
0
22
Examples:
F (0.76) = 0.776373
F (1.3) = ?
F (-3) = 1 - F (3) = ?
F (3.86) = ?
23
Lognormal Distribution Probability Density Function
f ( x) = e 2s , for x >0
s 2p
, for x 0
f(x) 0
0 x
24
Lognormal Distribution - Probability Distribution Function
If X ~ LN(,s),
ln x -
F ( x ) = P ( X x ) = F
s
where F(z) is the cumulative probability distribution function of N(0,1)
25
Lognormal Distribution
Mean or Expected Value of X
1 2
+ s
X = E(X ) = e 2
Median of X
m edian = e
1
Standard Deviation of X
2
2 + 2 2
s X = e e - 1
26
Lognormal Distribution - Example
A theoretical justification based on a certain material failure mechanism
underlies the assumption that ductile strength X of a material has a
lognormal distribution.
If the parameters are =5 and =0.1 ,
Find:
(a) x and x
(b) P(X >120)
(c) P(110 X 130)
(d) The median ductile strength
(e) The expected number having strength at least 120, if ten different
samples of an alloy steel of this type were subjected to a strength test.
(f) The minimum acceptable strength, If the smallest 5% of strength
values were unacceptable.
27
Lognormal Distribution Example Solution
s 2
(a) +
X = E( X ) = e 2
=e 5.005
= 149.16
2 +s 2 s 2
s X = e (e - 1)
= 223
= 14.933
28
Lognormal Distribution Example Solution
(b)
P( X > 120) = 1 - P( X 120)
ln120 - 5.0
= 1 - P( Z )
0.1
= 1 - F (-2.13)
= 1 - 0.0166
= 0.9834
29
Lognormal Distribution Example Solution
(C)
ln110 - 5.0 ln130 - 5.0
P(110 X 130) = P( Z )
0.1 0.1
= P ( -2.99 Z -1.32)
= F ( -1.32) - F ( -2.99)
= 0.0934 - 0.0014
= 0.092
(d)
X 0.5 = median = e = e = 148 .41 5
30
Lognormal Distribution Example Solution
(e) Let Y=number of items tested that have strength of at least 120
y=0,1,2,,10
p = P( X > 120)
= 1 - P( X 120)
ln120 - 5.0
= 1 - P( Z )
0.1
= 1 - F (-2.12)
= 1 - 0.0170
= 0.983
31
Lognormal Distribution Example Solution
(f) The value of x, say xms, for which P( X < xms ) = 0.05 is
determined as follows:
ln xms - 5.0
and P( Z < ) = 0.05 ,
0.1
,
P( Z < -1.64) = 0.05
so that
ln xms - 5.0
= -1.64 ,
0.1
therefore xms = 125.964
32
Exponential Distribution
A random variable X is defined to be exponential random variable (or
say X is exponentially distributed) with positive parameter if its
probability density function is given by:
l e- l x if x 0, l > 0
f ( x) =
0 if x < 0
- l x
f ( x) dx = l e - l x
Note: dx = - e =1
- 0 0
Integration by part:
- l x
1 1
E[ X ] = - xe - (-e - l x ) dx = e - l x dx = - e - l x =
0 0 0 l 0 l
Variance:
E[ X 2 ] = x 2 f ( x)dx = x 2 l e - l x dx = - x 2 de - l x
- 0 0
Integration by part:
2 2 1 2
- (-e - l x )2 x dx = 2 xe - l dx =
l 0
E[ X 2 ] = - x 2e - l x xl e - l dx = =
0 0 0 l l l 2
2
1 1 2
Var [ X ] = E[ X ] - ( E[ X ]) = 2 - = 2
2 2
l l l
34
Exponential Distribution: Example
The lifetime of an alkaline battery (measured in hours) is exponentially
distributed with l = 0.05. Find the probability a battery will last between 10 & 15
hours
P (10 X 15)
= F (15) - F (10)
P(10 X 15) = e - (0.05)(10) - e - (0.05)(15)
= e -0.5 - e -0.75
= 0.1341
G (a ) = x a -1 - x
e dx
0
Properties of the gamma function:
0.8
0.6 a = 1, b = 1
0.4 a = 2, b = 2
a = 2, b = 1
0.2
0 x
0 2 4 6 8
If X~G(, ), then
Mean or Expected Value: = E ( X ) = a b
Standard Deviation: s = b a
38
Stochastic Process - Introduction
In general, the value Xt might depend on the quantity Xt-1 at time t-1,
or even the value Xs for other times s < t.
Counting process
A stochastic process {N(t) : t 0} is a counting process if N(t) represents
the total number of events that occur by time t.
eg, # of persons entering a store before time t, # of people who were
born by time t, # of goals a soccer player scores by time t.
1 2 3 4
time
t=0 S1 = X i S 2 = X i S 3 = X i S 4 = X i
i =1 i =1 i =1 i =1
The counting process { N(t), t>0 } is called a Poisson process if the inter-
occurrence times X1, X2, follow the exponential distribution
44
time
t=0 S1 = 5 min S2 = 9 min S3 = 16 min S4 = 18 min
45
time
t=0 S1 = 1 min S2 = 3 min S3 = 7 min S4 = 9 min S5 = 15 min
46
time
t=0 S1 = 10 min S2 = 16 min
47