6-A. Joint Probability Distribution
6-A. Joint Probability Distribution
Joint Probability
Distributions
王泰裕教授
成功大學工業與資訊管理學系
1/78
Learning Objectives
1. Define two discrete random variables
2. Describe two continuous random variables
3. Discuss covariance and correlation
4. Define bivariate normal Distribution
5. Describe linear combination of random variables
6. Discuss Chebyshev’s inequality
Why Two Random Variables ?
◼ To find out the simultaneous behavior of two
random variables.
◼ Example:
3. f XY (x,y) = P ( X = x, Y = y )
Two Discrete Random Variables -
--Example
◼ A financial company uses X=1, 2,3 to
represent low, medium, and high income
customers respectively. Also they use
Y=1,2,3,4 to represent mutual funds, bonds,
stocks, and options respectively.
◼ Then the joint probability of X and Y could be
represented by following table.
Two Discrete Random Variables -
--Example
Y=1 Y=2 Y=3 Y=4
X=1 0.1 0 0 0
(1) f XY ( x , y ) 0
(2) f XY ( x , y ) = 1
x y
Two Discrete Random Variables -
--Example
0.2
0.15
0.1
0.05
Y 0
1 3
2 2
3
4 1
Marginal Probability
Distribution (1/2)
f X ( x ) = P( X = x ) = f XY ( x, y )
Rx
fY ( y ) = P(Y = y ) = f XY ( x, y )
Ry
Marginal Probability
Distribution (2/2)
where
Rx denotes the set of all points in the range of
(X, Y) for which X=x and
Ry denotes the set of all points in the range of
(X, Y) for which Y=y
Mean of Marginal
Probability Distribution
◼ If the marginal probability distribution of X
has the probability mass function fX(x), then
Example(1/3)
◼ A financial company use X=1,2,3 to
represent low, medium, and high income
customers respectively. Also they use
Y=1,2,3,4 to represent mutual funds,
bonds, stocks, and options respectively.
◼ Then the joint probability of X and Y could
be represented by following table.
Example(2/3)
X=1 0.1 0 0 0
X=1 0.1 0 0 0
◼ Computed results
Y=1 Y=2 Y=3 Y=4 fX(x)
P(X=1, Y=1)=0.1
fX(Y=1)=0.4
fY(X=1)=0.1
P(X=1, Y=1) ≠ fX(Y=1)*fY(X=1)
Thus X and Y are dependent!
Two Continuous
Random Variables
Two Continuous
Random Variables
◼ A joint probability density function for the
continuous random variables X and Y, denoted
as fXY(x, y), satisfied the following properties.
Two Continuous
Random Variables
Example
∞ ∞ ∞ ∞
න න 𝑓𝑋𝑌 𝑥, 𝑦 𝑑𝑦𝑑𝑥 = න න 6 × 10−6 𝑒 −0.001𝑥−0.002𝑦 𝑑𝑦 𝑑𝑥
−∞ −∞ 0 𝑥
∞ ∞
= 6 × 10−6 0 𝑒 𝑥−0.002𝑦 𝑑𝑦 𝑒 −0.001𝑥 𝑑𝑥
∞ 𝑒 −0.002𝑥
=6× 10−6 0 𝑒 −0.001𝑥 𝑑𝑥
0.002
1000 2000
𝑃 𝑋 ≤ 1000, 𝑌 ≤ 2000 = න න 𝑓𝑋𝑌 𝑥, 𝑦 𝑑𝑦𝑑𝑥
0 𝑥
−6 1000 𝑒 −0.002𝑥 −𝑒 −4
= 6 × 10 0 𝑒 −0.001𝑥 𝑑𝑥
0.002
Rx denotes the set of all points in the range of (X,Y) for which X=x
and
Ry denotes the set of all points in the range of (X,Y) for which Y=y
Example
0.05 Why?
Conditional Probability
Distribution
Given continuous random variable X and Y
with joint probability density function
fXY(x, y) the conditional probability density
function of Y given X = x is
f XY ( x , y )
f Y |x (x, y) =
fX (x)
for all fX (x) > 0
Properties of Conditional
Probability Distribution
Example
p.225
Properties of Conditional
Probability Distribution
Conditional mean of Y given X = x
XY = E[( X − X )(Y − Y )] = E ( XY ) − X Y
Example(1/3)
X=1 0.1 0 0 0
= 0.41
Correlation
cov( X , Y ) σ XY
ρ XY = =
V ( X )V (Y ) σ Xσ Y
where - 1 ρ XY 1
Correlation
σ XY = ρ XY =0
Example
cov( X , Y ) σ XY
ρ XY = =
V ( X )V (Y ) σ Xσ Y
0.41
=
(0.41)(1.41)
Covariance Matrix for Multi-
Dimensional Data
◼ For multi-dimensional data, X=[x1, x2, …xn],
the covariance matrix is the diagonal values
of the covariance matrix and can be shown
as:
𝑛
1
𝐶𝑂𝑉 = 𝑋𝑖 − 𝑋ത 𝑋𝑖 − 𝑋ത 𝑇
𝑛−1
𝑖=1
Bivariate Normal Distribution
Bivariate Normal
Distribution
σ X = 0.01 / n 2
Solution to Example(2/2)
P(|X- μ| ≧ c σ) ≦ 1/c 2
So P (| X − | c (0.012 / n)1/ 2 ) 1 / c 2
let c = 3
P (| X − | 3(0.012 / n)1/ 2 ) 1 / 9
i.e. P (| X − | 3(0.012 / n)1/ 2 ) 8 / 9
So 3(0.012 / n)1/ 2 = 0.005
and n = 36