Lecture 1 (Chapter 3) - Common Families of Distributions
Lecture 1 (Chapter 3) - Common Families of Distributions
Chapter 3
where h(x) ≥ 0 and t1 (x), . . . , tk (x) are real-valued functions of the obser-
vation x (they cannot depend on θ), and c(θ) ≥ 0 and w1 (θ), . . . , wk (θ)
are real-valued functions of the possibly vector-valued parameter θ (they
cannot depend on x).
Then,
d d p 1
w(p) = log = ,
dp dp 1−p p(1 − p)
d2 1 1 2p − 1
2
w(p) = − 2 + 2
= 2 ,
dp p (1 − p) p (1 − p)2
d d n
log c(p) = n log(1 − p) = − ,
dp dp 1−p
d2 n
2
log c(p) =− .
dp (1 − p)2
1 x
f (x|θ) = exp 1 − , for θ < x < ∞ and θ > 0.
θ θ
Show that this is NOT an exponential family. The pdf above can be
written using an indicator function:
1 x
f (x|θ) = exp 1 − I[θ,∞) (x).
θ θ
then
n n p
h(x) = I{0,1,...,n} (x) , c(p) = (1 − p) , t(x) = x and w(p) = log .
x 1−p
Note: 0 < p < 1, and f (x|p) is different for p = 0, 0 < p < 1 and p = 1.
The above formula must match all x. Therefore, f (x|p) is an exponential
family only if 0 < p < 1.
λx e−λ 1
= I{0,1,... } (x) e−λ exp x log(λ)
f (x|λ) = I{0,1,... } (x)
x! x!
then
1
h(x) = I{0,1,... } (x) , c(λ) = e−λ , t(x) = x and w(λ) = log(λ).
x!
then
1 1
h(x) = I[0,∞) (x), c(β) = , t(x) = x and w(β) = − .
β β
STA3020 - Statistical Inference 6
(x − µ)2 x2 µ2
2 1 1 xµ
f (x|µ, σ ) = √ exp − =√ exp − 2 + 2 − 2
2πσ 2σ 2 2πσ 2σ σ 2σ
then
µ2
1
h(x) = 1, c(µ, σ) = √ exp − 2 ,
2πσ 2σ
2
x 1 µ
t1 (x) = − , w1 (µ, σ) = 2 , t2 (x) = x and w2 (µ, σ) = 2 .
2 σ σ
is called the natural parameter space for the family. (The integral is re-
placed by a sum over the values of x for which h(x) > 0 if X is discrete.)
Since the original f (x|θ) in (3.4.1) is a pdf or pmf, it must hold that
n o
η = w1 (θ), . . . , wk (θ) : θ ∈ Θ ⊂ H.
STA3020 - Statistical Inference 7
(x − µ)2 x2
1 1 1 x
f (x|µ) = √ exp − =√ exp − exp − 2 +
2πµ 2µ2 2πµ 2 2µ µ
Let η1 = 1/µ2 and η2 = 1/µ. The Normal distribution n(µ, µ2 ) can
reparameterized as:
√
η1 1 η
1
f (x|η1 , η2 ) = √ exp − exp − x2 + η2 x .
2π 2 2
Since d = 1 and k = 2, it is a curved exponential family.
STA3020 - Statistical Inference 8
Theorem)
X̄ ∼ n(λ, λ/n),
Remark:
1. Theorem 3.4.2 also applied to curved exponential families.
2. Exponential families have nice properties that are very useful in
statistical inference.
STA3020 - Statistical Inference 9
Note:
1. Each of these families is constructed from a single pdf (or pmf)
known as the standard pdf (pmf) for the family;
2. All other pdfs (or pmfs) in the family are obtained by transforming
the standard pdf (or pmf) in a prescribed way.
Theorem 3.5.1: Let f (x) be any pdf and let µ and σ > 0 be any given
constants. Then the function
1 x−µ
g(x|µ, σ) = f
σ σ
is a valid pdf.
Proof.
1 x−µ
g(x|µ, σ) = f ≥0
σ σ
∞ ∞ ∞
(y= x−µ
x−µ σ )
Z Z Z
1
g(x|µ, σ)dx = f dx ======= f (y)dy = 1.
−∞ −∞ σ σ −∞
STA3020 - Statistical Inference 10
Definition 3.5.2: Let f (x) be any pdf. Then the family of pdfs f (x−µ),
indexed by the parameter µ (−∞ < µ < ∞), is called the location family
with standard pdf f (x) and µ is called the location parameter for the
family.
Remark:
1. The effect of location parameters shifts the density to the left or
right but the shape remains unchanged.
2. If Z has a pdf f (z), then X = Z + µ has density f (x − µ).
Definition 3.5.4: Let f (x) be any pdf. Then for any σ > 0, the family
1 x
of pdfs f , indexed by the parameter σ, is called the scale family
σ σ
with standard pdf f (x) and σ is called the scale parameter of the family.
x2
1
f (x|σ) = √ exp − 2 , −∞ < x < ∞, σ > 0,
2πσ 2σ
where σ is the scale parameter of the scale family with standard pdf below
2
1 x
√ exp − , −∞ < x < ∞.
2π 2
Definition 3.5.5: Let f (x) be any pdf. Then for any µ (−∞ < µ < ∞),
1 x−µ
and any σ > 0, the family of pdfs f , indexed by the parameter
σ σ
(µ, σ), is called the location-scale family with standard pdf f (x); µ is called
the location parameter and σ is called the scale parameter.
(x − µ)2
1
f (x|σ) = √ exp − , −∞ < x < ∞, −∞ < µ < ∞, σ > 0.
2πσ 2σ 2
1 |x − µ|
f (x|σ) = exp − , −∞ < x < ∞, −∞ < µ < ∞, σ > 0.
2σ σ
STA3020 - Statistical Inference 12
Theorem 3.5.6: Let f (·) be any pdf. Let µ be any real number, and let
σ be any positive
real number. Then X is a random variable with pdf
1 x−µ
f if and only if there exists a random variable Z with pdf f (z)
σ σ
and X = σZ + µ.
It is similar to prove the “only if” part: define g(x) = (x − µ)/σ and
let Z = g(X).