0% found this document useful (0 votes)
63 views31 pages

Distributions Based On Sampling From A Normal Distribution

(i) The Student's t-distribution and Snedecor's F-distribution arise as probability distributions of certain statistics when sampling from a normal distribution. (ii) The Student's t-distribution describes the distribution of the ratio of a normal random variable and the square root of a chi-squared random variable. The Snedecor's F-distribution describes the ratio of two independent chi-squared random variables. (iii) Closed-form expressions are provided for the probability density functions of the Student's t and Snedecor's F distributions. Properties such as expectations and moments are also derived.

Uploaded by

pankaj kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
63 views31 pages

Distributions Based On Sampling From A Normal Distribution

(i) The Student's t-distribution and Snedecor's F-distribution arise as probability distributions of certain statistics when sampling from a normal distribution. (ii) The Student's t-distribution describes the distribution of the ratio of a normal random variable and the square root of a chi-squared random variable. The Snedecor's F-distribution describes the ratio of two independent chi-squared random variables. (iii) Closed-form expressions are provided for the probability density functions of the Student's t and Snedecor's F distributions. Properties such as expectations and moments are also derived.

Uploaded by

pankaj kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

Module 30

DISTRIBUTIONS BASED ON SAMPLING FROM A


NORMAL DISTRIBUTION

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


1 / 30
We will introduce two new probability distributions, called the
Student t-distribution and the Snedecor F -distribution, which arise as
probability distributions of various statistics based on a random
sample from normal distribution.

Definition 1
(i) For a given positive integer m, a random variable X is said to have
the Student t-distribution with m degrees of freedom (written as
X ∼ tm ) if the p.d.f. of X is given by
 
m+1
Γ 2 1
fX (x) = √  m+1 , −∞ < x < ∞.
mπΓ( m2 )  2 2
1 + xm

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


1 / 30
(ii) The Student t-distribution with 1 degree of freedom is also called
the standard Cauchy distribution.

(iiii) For positive integers n1 and n2 , a random variable X is said to


have the Snedecor F -distribution with (n1 , n2 ) degrees of freedom
(written as X ∼ F (n1 , n2 )) if the p.d.f. of X is given by
n1
( nn12 x) 2 −1
( nn12 )
fX (x) =    n1 +n2 I(0,∞) (x).
n1 n2 2
β 2, 2 1 + nn12 x

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


2 / 30
Remark 1

The following observations are obvious:


d
(i) X ∼ tm ⇒ X = −X (since fX (x) = fX (−x), ∀ x ∈ R),
i.e., the distribution of X ∼ tm is symmetric about 0. Moreover the
distribution of tm is unimodal with mode at 0.;
(ii) The p.d.f. of Cauchy distribution is given by

1 1
f (y ) = · , −∞ < y < ∞.
π 1 + y2

If a random variable X has the Cauchy distribution (i.e., if X ∼ t1 )


then E (X ) does not exist;
n1
X
(iii) X ∼ Fn1 ,n2 , ⇒ Y = n2
n1
1+ n X
∼ Beta( n21 , n22 ), the beta distribution with
2
n1 n2
shape parameter ( 2 , 2 ).

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


3 / 30
Result 1:
(i) Let Z ∼ N(0, 1) and Y ∼ χ2m (where m ∈ {1, 2, . . .}) be independent
random variables. Then
Z
T = q ∼ tm .
Y
m

(ii) For positive integers n1 and n2 , let X1 ∼ χ2n1 and X2 ∼ χ2n2 be


independent random variables. Then
X1 /n1
U= ∼ Fn1 ,n2 .
X2 /n2
(iii) Let m and r be positive integers and let X ∼ tm . Then E (X r ) is not
finite if r ∈ {m, m + 1, . . .}. For r ∈ {1, 2, . . . , m − 1} and m ≥ r + 1

0, if r is odd
r
E (X ) = r
m 2 r !Γ( m−r ) .
2 ( )!Γ( m ) , if r is even
 r r 2
2 2

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


4 / 30
(iv) If X ∼ tm , then
0
µ1 = E (X ) = 0, for m ∈ {2, 3, . . .}
m
µ2 = Var(X ) = , for m ∈ {3, 4, . . .}
m−2
β1 = coefficient of skewness = 0, for m ∈ {4, 5, . . .}
3(m − 2)
and γ1 = kurtosis = , for m ∈ {5, 6, . . .}.
m−4
(v) Let n1 , n2 and r be positive integers and let X ∼ Fn1 ,n2 . Then, for
n2 ∈ {1, 2, . . . , 2r } and r ≥ n22 , E (X r ) is not finite. For
n2 ∈ {2r + 1, 2r + 2, . . .}
 r 
r Y 
n2 n1 + 2(i − 1)
E (X r ) = .
n1 n2 − 2i
i=1

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


5 / 30
(vi) If X ∼ Fn1 ,n2 then
0 n2
µ1 = E (X ) = , if n2 ∈ {3, 4, . . .}
n2 − 2
2n22 (n1 + n2 − 2)
µ2 = Var(X ) = , if n2 ∈ {5, 6, . . .}
n1 (n2 − 2)2 (n2 − 4)
β1 = coefficient of skewness
s
2(2n1 + n2 − 2) 2(n2 − 4)
= , if n2 ∈ {7, 8, . . .}
n2 − 6 n1 (n1 + n2 − 2)
and γ1 = kurtosis
12[(n2 − 2)2 (n2 − 4) + n1 (n1 + n2 − 2)n1 (5n2 − 22)]
= + 3,
n1 (n2 − 6)(n2 − 8)(n1 + n2 − 2)
if n2 ∈ {9, 10, . . .}.

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


6 / 30
Proof :

(i) The joint p.d.f. of (Y , Z ) is given by

fY ,Z (y , z) = fY (y )fZ (z)
 2
 m+1 1 − y +z m
−1
m √ e 2 y 2 , if (y , z) ∈ (0, ∞) × R
= 2 2 Γ( 2 ) π .
0, otherwise

Clearly SY ,Z = (0, ∞] × R. Consider the transformation q


y
h = (h1 , h2 ) : SY ,Z → R2 defined by h1 (y , z) = √z y and h2 (y , z) = m .
m
q
Then T = h1 (Y , Z ) = qZY . Let U = h2 (Y , Z ) = Y m.
m

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


7 / 30
Clearly the transformation h = (h1 , h2 ) : SY ,Z → R2 is one-to-one with
inverse transformation h−1 = (h1−1 , h2−1 ), where for (t, u) ∈ h(SY ,Z ),

h1−1 (t, u) = mu 2 and h2−1 (t, u) = tu.

The Jacobian determinant is


−1
∂h1−1

∂h1
0 2mu
= −2mu 2 .
∂t ∂u
J = ∂h−1 ∂h−1 =
2 2 u t
∂t ∂u

Also

h(SY ,Z ) = {(t, u) ∈ R2 : (h1−1 (t, u), h2−1 (t, u)) ∈ SY ,Z }


= {(t, u) ∈ R2 : mu 2 ∈ [0, ∞), u > 0, tu ∈ R}
= {(t, u) ∈ R2 : t ∈ R, u > 0}
= R × (0, ∞)
= A, say.

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


8 / 30
Therefore the joint p.d.f. of (T , U) is given by

fT ,U (t, u) = fY ,Z (h1−1 (t, u), h2−1 (t, u))|J|Ih(SY ,Z ) (t, u)


= fY ,Z (mu 2 , tu)| − 2mu 2 |IA (t, u)
m (m+t 2 )u 2

m2 m − , if (t, u) ∈ R × (0, ∞)
√ m−1 m u e 2

= π2 2 Γ( 2 ) .
0, otherwise

Consequently the p.d.f. of T is given by

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


9 / 30
Z∞
fT (t) = fT ,U (t, u)du
−∞
m Z∞
m2 (m+t 2 )u 2
= √ m−1 um e − 2 du, t ∈ R
π2 2 Γ( m2 )
0
Z∞
1 m−1
= √  m+1 y 2 e −y dy
t2
mπΓ( m2 )
1+ m 0
2

 
Γ m+1
2 1
= √ m · , t ∈ R,
mπΓ( 2 ) 1 + t 2  m+1
2
m

which is the p.d.f. of Student’s t-distribution with m degrees of freedom.

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


10 / 30
(ii) The joint p.d.f. of X = (X1 , X2 ) is given by

fX1 ,X2 (x1 , x2 ) = fX1 (x1 )fX2 (x2 )


n1 n2
1 x +x
− 1 2 2 2 −1 2 −1
= n1 +n2 e x1 x2 I(0,∞)2 (x1 , x2 ).
2 2 Γ( n21 )Γ( n22 )

We have SX1 ,X2 = [0, ∞)2 . Consider the one-to-one transformation


h = (h1 , h2 ) : SX1 ,X2 → R2 given by
n2 x1 x2
h1 (x1 , x2 ) = and h2 (x1 , x2 ) = .
n1 x2 n2
X1 /n1 X2
Define U = h1 (X1 , X2 ) = X2 /n2 and V = h2 (X1 , X2 ) = n2 . Then the
inverse of transformation h = (h1 , h2 ) : SX1 ,X2 → R is h −1
= (h1−1 , h2−1 ),
where for (u, v ) ∈ h(SX1 ,X2 ),

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


11 / 30
h1−1 (u, v ) = n1 uv and h2−1 (u, v ) = n2 v .
The Jacobian determinant is
−1
∂h1−1

∂h1
n v n1 u
J = ∂h∂u−1 ∂h∂v−1 = 1 = n1 n2 v .

2 2 0 n2
∂u ∂v

Also

h(SX1 ,X2 ) = {(u, v ) ∈ R2 : (h1−1 (u, v ), h2−1 (u, v )) ∈ SX1 ,X2 }


= {(u, v ) ∈ R2 : n1 uv > 0, n2 v > 0}
= {(t, u) ∈ R2 : t > 0, u > 0}
= (0, ∞)2 ,

and therefore, the joint p.d.f. of (U, V ) is given by

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


12 / 30
fU,V (u, v ) = fX1 ,X2 (h1−1 (u, v ), h2−1 (u, v ))|J|Ih(SX ) (u, v )
1 ,X2

= fX1 ,X2 (n1 uv , n2 v )|n1 n2 v |I(0,∞]2 (u, v )


n1 n2
n12 n22 n1 n +n (n +n u)v
−1 1 2 2 −1 − 2 2 1
= ( n1 +n2 n1 n2 u
2 v e I(0,∞)2 (u, v ).
2 2 )Γ( 2 )Γ( 2 )

Consequently the p.d.f. of U is given by


Z∞
fU (u) = fU,V (u, v )dv .
−∞

Clearly fU (u) = 0, if u ≤ 0. For u > 0

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


13 / 30
n1 n2
Z∞
n12 n22 n1
−1
n1 +n2
−1
(n2 +n1 u)v
fU (u) = n1 +n2 n n u 2 v 2 e− 2 dv
Γ( 21 )Γ( 22 )
2 2
0
n1
n1 +n2 −1
n
Γ 1
( nn12 u) 2
= n1
2 n2
n2
 n1 +n2 , 0 < u < ∞.
Γ 2 Γ (1 + n1
n2 u)
2 2

Therefore
X1 /n1
U= ∼ Fn1 ,n2 .
X2 /n2

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


14 / 30
(iii) For m ∈ {1, 2, . . .}, by (i),
d Z
X =q ,
Y
m

where Z ∼ N(0, 1) and Y ∼ χ2m are independent random variables. Thus,


for m ∈ {1, 2, . . .} and r > 0,
r r r r
E (X r ) = m 2 E (Z r Y − 2 ) = m 2 E (Z r )E (Y − 2 ),

(since Y and Z are independent)


provided the expectations are finite. We have,

0, if r is odd
E (Z r ) = r!  .
 2r r , if r is even
2 2
!

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


15 / 30
Moreover, for r ∈ {1, 2, . . .},
Z∞
− 2r 1 m−r
−1 − y2
E (Y )= m
m
 y 2 e dy ,
2 Γ
2
2 0

which is finite if, and only if, m > r . Also, for m > r
m−r
m−r m−r
 
− 2r 2 2 Γ 2 Γ 2
E (Y )= m
m
= r .
2 2 Γ m2

22Γ 2
Thus E (X r ) is finite if r ∈ {1, 2, . . . , m − 1}. For r ∈ {1, 2, . . . , m − 1}
and m ≥ r + 1

0,

 if r is odd
r r
E (X ) = m 2 r !Γ m−r2
.
 2r r !Γ m  , if r is even

2 2

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


16 / 30
(iv) Using (iii), we have

0
µ1 = E (X ) = 0, if m ∈ {2, 3, . . .}
0 m
µ2 = µ2 = E (X 2 ) = , if m ∈ {3, 4, . . .}
m−2
0
µ3 = µ3 = E (X 3 ) = 0 if m ∈ {4, 5, . . .}
0 3m2
and µ4 = µ4 = E (X 4 ) = , if m ∈ {5, 6, . . .}.
(m − 2)(m − 4)

Consequently
µ3
β1 = 3/2
= 0, if m ∈ {4, 5, . . .}
µ2
and
µ4 3(m − 2)
γ1 = 2
= , if m ∈ {5, 6, . . .}.
µ2 m−4

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


17 / 30
(v) Using (ii), we have

d n2 X1
X = ,
n1 X2
where X1 ∼ χ2n1 and X2 ∼ χ2n2 are independent random variables. Fix
r ∈ {1, 2, . . .}. Then
 r  r
n2 r −r n2
r
E (X ) = E (X1 X2 ) = E (X1r )E (X2−r ),
n1 n1
(since X1 and X2 are independent)
provided the expectations are finite. Since X1 ∼ χ2n1 , E (X1r ) is finite for
any r > 0 and
Z∞
1 n1 x
r
E (X1 ) = n1 x 2 +r −1 e − 2 dx
2 2 Γ n21

0
n1
+r n

2 2 Γ 21 + r
= n1
2 2 Γ n21


() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


18 / 30
n1  n1 n1
= 2r

+r −1 + r − 2 ...
2 2 2
= (n1 + 2(r − 1))(n1 + 2(r − 2)) . . . n1
Yr
= (n1 + 2(i − 1)), r ∈ {1, 2, . . .}.
i=1

Since X2 ∼ χ2n2 , E (X2−r )


is finite if, and only if, n2 > 2r . For n2 > 2r
n2
2 2 −r Γ n22 − r

−r 1
E (X2 ) = n2 = r .
n

22Γ 2 2 Q
(n2 − 2i)
i=1
It follows that, for n2 ∈ {1, 2, . . . , 2r } and r ≥ n22 , E (X r ) is not finite. For
n2 ∈ {2r + 1, 2r + 2, . . .}
 r Y r  
n2 n1 + 2(i − 1)
E (X r ) = .
n1 n2 − 2i
i=1

(vi) Follows on using (v) after some tedious calculations.


() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO
19 / 30
Corollary 1:

Let X1 , . . . , Xn (n ≥ 2) be a random sample from N(µ, σ 2 ) distribution,


n
where µ ∈ (−∞, ∞) and σ > 0. Let X̄ = n1
P
Xi and
i=1
n
1
S2 = (Xi − X̄ )2 denote the sample mean and the sample variance
P
n−1
i=1
respectively. Then

n(X̄ − µ)
∼ N(0, 1)
σ
and √
n(X̄ − µ)
∼ tn−1 .
S

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


20 / 30
Proof.

2 2
X̄ ∼ N(µ, σn ) and (n−1)S
σ2
∼ χ2n−1 are independent random variables.

n(X̄ − µ) (n − 1)S 2
⇒ ∼ N(0, 1) and ∼ χ2n−1
σ σ2
are independent random variables.

n(X̄ −µ)
σ
⇒q ∼ tn−1 ,
(n−1)S 2 /σ 2
n−1

i.e., √
n(X̄ − µ)
∼ tn−1 .
S

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


21 / 30
Corollary 2:

Let X1 , . . . , Xm (m ≥ 2) be a random sample from N(µ1 , σ12 ) distribution


and let Y1 , . . . , Yn (n ≥ 2) be a random sample from N(µ2 , σ22 )
distribution, where µi ∈ (−∞, ∞) and σi > 0, i = 1, 2. Further suppose
that X = (X1 , . . . , Xm ) and Y = (Y1 , . . . , Yn ) are independent. Let
m n
1 P 1 P
S12 = m−1 (Xi − X̄ )2 and S22 = n−1 (Yi − Ȳ )2 be the sample
i=1 i=1
variances based on random samples X = (X1 , . . . , Xm ) and
m n
Y = (Y1 , . . . , Yn ), respectively; here X̄ = m1 1
P P
Xi and Ȳ = n Yi are
i=1 i=1
the sample means based on two random samples. Then

σ22 S12
∼ Fm−1,n−1 .
σ12 S22

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


22 / 30
Proof :

We have
(m − 1)S12 2 (n − 1)S22
∼ χ m−1 and ∼ χ2n−1 .
σ12 σ22
(m−1)S12
Also the independence of X and Y implies that σ12
(a function of X
(n−1)S22
alone) and σ22
(a function of Y alone) are independent. Thus

S12 /σ12
∼ Fm−1,n−1 ,
S22 /σ22

i.e.,
σ22 S12
∼ Fm−1,n−1 .
σ12 S22

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


23 / 30
Remark 2 :

(i) Suppose that X ∼ tm . Then

d Z
X =q ,
Y
m

where Z ∼ N(0, 1) and Y ∼ χ2m are independent random variables.


Therefore
d Z2 χ2 /1 E
X2 = = 21 independent
Y /m χm /m
Thus
X ∼ tm , ⇒ X 2 ∼ F1,m .

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


24 / 30
(ii) Suppose that X ∼ Fn1 ,n2 . Then,

d χ2n1 /n1 E
X = independent
χ2n2 /n2

1 d χ2n2 /n2 E
⇒ = 2 independent ∼ Fn2 ,n1
X χn1 /n1
Thus,
1
X ∼ Fn1 ,n2 ⇒ ∼ Fn2 ,n1 .
X

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


25 / 30
Note that if X ∼ tm then, the distribution of X is symmetric about 0
and its kurtosis is
3(m − 2)
γ1 = > 3, m > 4.
m−4
Thus a t-distribution with m (> 4) degrees of freedom is symmetric
and leptokurtic (i.e., it has shaper peak and longer fatter tails).
Note that the kurtosis ν1 decreases as m increases and γ1 → 3, as
m → ∞. This suggests that, for large degrees of freedom, Studentś
t-distribution behaves like N(0, 1). distribution. This is infact true.

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


26 / 30
Suppose that X ∼ tm and, for a fixed α ∈ (0, 1), let tm,α be the
(1 − α)-th quantile of X , i.e.,

FX (tm,α ) = P(X ≤ tm,α ) = 1 − α.

Then
d
FX (−tm,α ) = 1 − FX (tm,α ) = α (since X = −X ).
Now suppose that X ∼ Fn1 ,n2 and, for a fixed α ∈ (0, 1), let fn1 ,n2 ,α
be the (1 − α)-th quantile of X , i.e.,

FX (fn1 ,n2 ,α ) = P({X ≤ fn1 ,n2 ,α }) = 1 − α.

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


27 / 30
1
Since X ∼ Fn2 ,n1 and P({X > 0}) = 1, follows that
 
1 1
P ≥ =1−α
X fn1 ,n2 ,α
 
1 1
⇒P ≤ = α = 1 − (1 − α)
X fn1 ,n2 ,α
1
⇒ fn2 ,n1 ,1−α = .
fn1 ,n2 ,α
i.e.,
fn1 ,n2 ,α × fn2 ,n1 ,1−α = 1.

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


28 / 30
Take Home Problems

(1) Let Z1 and Z2 be i.i.d. N(0, 1) r.v.s. Show that

Z1 d Z1
= .
Z2 |Z2 |
Z1
Hence show that Z = Z2 follows Cauchy distribution (i.e., Z ∼ t1 ).
(2) Let X1 and X2 be i.i.d. N(µ, σ 2 ) r.v.s. Show that X1 + X2 and
X1 − X2 are independent. Find the p.d.f. of
X1 + X2 − 2µ
Y = √ .
2|X1 − X2 |

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


29 / 30
Thank you for your patience

() Module 30 DISTRIBUTIONS BASED ON SAMPLING FROM A NORMAL DISTRIBUTIO


30 / 30

You might also like