02 Point Estimators
02 Point Estimators
Advanced Statistics II
2 Finite-sample properties
3 Large-sample properties
4 Up next
Outline
2 Finite-sample properties
3 Large-sample properties
4 Up next
Definition
A statistic, T = t(X ), whose outcomes are used to estimate the value of a
scalar or vector function, q(θ), of the parameter vector, θ, is called a
point estimator.
An observed outcome of an estimator is called a point estimate.
3.0
Density
Density
2.0
2.0
1.0
1.0
0.0
0.0
−1.0 −0.5 0.0 0.5 1.0 −1.0 −0.5 0.0 0.5 1.0
Estimator 1 Estimator 2
3.0
Density
2.0
1.0
0.0
Minimize risk
To evaluate decisions, we need to known what the average loss (risk) is,
R (δ, θ) = Eθ (L (δ)) ,
Specific details (focussing on MSE loss) in due time. Until then, ...
Statistics and Econometrics (CAU Kiel) Summer 2021 8 / 33
Estimators and their risk
Recall, this is the expected (MSE) loss, where the expectation is taken
w.r.t. the sampling distribution.
Outline
2 Finite-sample properties
3 Large-sample properties
4 Up next
Finite-sample risk
We agree that
Some implications
The MSE can be decomposed into the variance and the bias of the
estimator, as
Eθ (T − q(θ))2 = Eθ (T − Eθ (T ) + Eθ (T ) − q(θ))2
= Eθ (T − Eθ (T ))2 + Eθ (Eθ (T ) − q(θ))2
+2 Eθ ((T − Eθ (T ))(Eθ (T ) − q(θ)))
= Varθ (T ) + (Eθ (T ) − q(θ))2 ≥ 0
| {z }
Bias
Large MSE of T
is implied by large variance, large bias, or both;
and both variance and bias depend on θ!
Estimators with smaller MSEs are preferred (are “more efficient”).
Definition
Let T and T ∗ be two estimators of a scalar q(θ). The relative efficiency of
T w.r.t. T ∗ is given by
MSEθ (T ∗ )
REθ (T, T ∗ ) = , ∀ θ ∈ Ω.
MSEθ (T )
Bernoulli
Example
Suppose (X1 , ..., Xn ) is a random sample from a Bernoulli distribution
with P(Xi = 1) = p and n = 25.
Unbiasedness
But
Definition
An estimator T is said to be an unbiased estimator of q(θ) iff
Eθ (T ) = q(θ) ∀ θ ∈ Ω.
1 2
= (1 − )σ 6= σ 2
n
1 1 Pn
An unbiased estimator is S 2 = 1 T
(1− n )
= n−1 i=1 (Xi − X̄)2 .
Definition
An estimator T is said to be a minimum-variance unbiased estimator of
q(θ) iff
1. Eθ (T ) = q(θ) ∀ θ ∈ Ω, that is, T is unbiased, and
2. Varθ (T ) ≤ Varθ (T ∗ ) ∀θ ∈ Ω ∀ other unbiased estimator T ∗ .
The Minimum Variance Unbiased Estimator has the smallest MSE within
the class of unbiased estimators.
Definition
An estimator T is said to be a BLUE of q(θ) iff
1. T is a linear function of the random sample X = (X1 , ..., Xn )0 , i.e.,
T = a0 X = a1 X1 + · · · + an Xn ,
BLUEs
Example
Let (X1 , ..., Xn ) be a random sample from a population with E(X) = µ
and Var(X) = σ 2 .
Outline
2 Finite-sample properties
3 Large-sample properties
4 Up next
Asymptotic properties
Consistent estimators
Definition
An estimator T n is said to be a consistent estimator of q(θ) iff
plimθ T n = q(θ) ∀ θ ∈ Ω.
Non-uniqueness
Example
Let (X1 , ..., Xn ) be a random sample from a population
Pn with E(X) = µ and
Var(X) = σ 2 . Then the sample mean Tn = X̄n = n1 i=1 Xi is a consistent
estimator for µ since
σ2
E(X̄n ) = µ, and Var(X̄n ) = n →0 as n → ∞.
Pn
Now, consider as an alternative estimator Tn∗ = n−k1
i=1 Xi for a fixed value of
k. Even though Tn∗ is a biased estimator for µ, it is consistent, since
nσ 2
E(Tn∗ ) = nµ
n−k → µ, and Var(Tn∗ ) = (n−k)2 →0 as n → ∞.
Definition
An estimator T n is said to be a CAN estimator of q(θ) iff
√ d
n(T n − q(θ)) → N (0, Σ),
Example
Let (X1 , ..., Xn ) be a random sample from a population with
P E(X) = µ
and Var(X) = σ 2 . Then the sample mean Tn = X̄n = n1 ni=1 Xi is a
CAN estimator for µ since by Lindeberg-Lévy’s CLT
√ d a
n(X̄n − µ) → N (0, σ 2 ) X̄n ∼ N µ, n1 σ 2 .
and
The asymptotic MSE for a CAN estimator Tn for the scalar q(θ) with
a
Tn ∼ N (q(θ), n1 σ 2 ) is
EA (Tn − q(θ))2
AMSEθ (Tn ) = (EA : Expect. w.r.t. asym. distribution)
= AVar(Tn ) + [EA (Tn ) − q(θ)]2 (Avar : Var of the asy. distribution)
| {z }
Asymtotic Bias = 0
= Avar(Tn ),
1 2
= nσ .
Definition
Let Tn and Tn∗ be CAN estimators of q(θ) such that
d
n1/2 (Tn − q(θ)) N 0, σT2
→
d
n1/2 (Tn∗ − q(θ)) N 0, σT2 ∗ .
and →
3
In the scalar case; see see Mittelhammer (1996, Def. 7.16) for the vector case.
Statistics and Econometrics (CAU Kiel) Summer 2021 30 / 33
Large-sample properties
Unless...
Definition
If Tn is a CAN estimator of q(θ) having the smallest asymptotic variance
among all CAN estimators ∀ θ ∈ Ω, except on a set of Lebesque measure
zero, Tn is said to be asymptotically efficient.
Outline
2 Finite-sample properties
3 Large-sample properties
4 Up next
Coming up