Chap 2 Parameter Estimation
Chap 2 Parameter Estimation
Parameter Estimation
In many cases, there are natural candidate for particular parameters. For example,
X̄ for µ and p̂ = X/n for the true proportion of a binomial experiment. However,
there are complex cases where we need some specialised techniques to give us reason-
able candidate estimators for consideration. There are four main methods of finding
estimators, though these won’t be elaborated in this course.
i. Methods of moments
2-1
2.1. Point estimation 2. Parameter Estimation
All these techniques provide reasonable estimators which can be different. Therefore,
the next question would then be how do we choose one estimator as the best for a
given parameter? Well, there is no obvious answer to this question. However, there is
list of desirable attributes that determines the quality of a good estimator.
E(θ̂) = θ (2.1)
Pn
Solution 2.1.1. Recall from Section 1.2.4 that we can write i=1 (Xi − X̄)2 =
Pn 2 2
i=1 (Xi − µ) − n(X̄ − µ) . Therefore,
n
!
1 X
E(S 2 ) = E (Xi − µ)2 − nE (X̄ − µ)2
n−1 i=1
2-2
2. Parameter Estimation 2.1. Point estimation
By the definition of variance, i.e V(X) = E[(X − u)2 ] and the sampling distribu-
tion of X̄ being N (µ, σ 2 /n), we get
n
!
1 X σ2
= V (Xi ) − n
n−1 i=1
n
n
!
1 X
= σ2 − σ2
n−1 i=1
1
= (n − 1)σ 2 = σ 2 .
n−1
2. Efficiency: Suppose θ̂1 and θ̂2 are two unbiased estimators of the parameter, θ.
Then we would want to choose an estimator which does not vary a lot; that is,
an estimator with smaller variance. We say θ̂1 is efficient than θ̂2 if
2-3
2.2. Interval estimation 2. Parameter Estimation
It is very important to remember that point estimates are almost never equal to the
true value of the parameter. In order for the point estimate to be useful, it should
reported with their standard deviation (uncertainty) which measures how precise the
point estimator is. In this section, we show how to obtain more information about its
precision by computing its confidence intervals. The confidence intervals provide an
interval within which we would expect to find the value of the parameter. Such an
interval is called an interval estimate.
An interval estimate of parameter θ is an interval of the form θ̂L < θ < θ̂U , where θ̂L
and θ̂U depend on the value of the statistic θ̂ from a particular sample and its sampling
distribution. From the sampling distribution of θ̂ we shall be able to determine θ̂L and
θ̂U such that
for 0 < α < 1, so that we have a probability of 1 − α of selecting a random sample that
will produce an interval containing θ. The interval θ̂L < θ < θ̂U is called a 100(1 − α)%
confidence interval, α is called the significance level and the endpoints, θ̂L and
θ̂U , are called the lower and upper confidence limits.
2-4
2. Parameter Estimation 2.2. Interval estimation
Let us now consider the interval estimate of µ. If our sample is selected from a normal
population or, failing this, if n is sufficiently large, we can establish a confidence interval
for µ by considering the sampling distribution of X̄. According to the CLT, X̄ ∼
N (µ, σ 2 /n) and for standard normal variates, we have that
x̄ − µ
Since √ is standard normal then we have
σ/ n
X̄ − µ
P −zα/2 < √ < zα/2 = 1 − α
σ/ n
√ √
P X̄ − zα/2 σ/ n < µ < X̄ + zα/2 σ/ n = 1 − α
√ √
x̄ − zα/2 σ/ n < µ < x̄ + zα/2 σ/ n (2.4)
For small samples selected from non-normal populations, we cannot expect our
degree of confidence to be accurate. However, for samples of size n ≥ 30, with the
2-5
2.2. Interval estimation 2. Parameter Estimation
shape of the distributions not too skewed, sampling theory guarantees good results.
Example 2.2.1. The average zinc concentration recovered from a sample of measure-
ments taken in 36 different locations in a river is found to be 2.6 grams per milliliter.
Find the 95% and 99% confidence intervals for the mean zinc concentration in the river.
Assume that the population standard deviation is 0.3 gram per milliliter.
Variance unknown
n n
1X 2 1 X 2
X̄ = Xi and S = Xi − X̄ .
n i=1 n − 1 i=1
X̄ − µ
T = √ (2.5)
S/ n
2-6
2. Parameter Estimation 2.2. Interval estimation
procedure is the same as that with σ known except that here is replaced by S and the
standard normal distribution is replaced by the t-distribution.
Let x̄ and s be the mean and standard deviation of a random sample from a
normal population with unknown variance σ 2 , respectively. Then a
100(1 − α)% confidence interval for µ is
s
x̄ ± tα/2 √ (2.6)
n
where tα/2 is the t-value with n − 1 degrees of freedom leaving an area of α/2
to the right.
Example 2.2.2. The contents of seven similar containers of sulfuric acid are 9.8, 10.2,
10.4, 9.8, 10.0, 10.2, and 9.6 liters. Find a 95% confidence interval for the mean contents
of all such containers, assuming an approximately normal distribution
of specified width
It follows from (2.4) that the width of the confidence interval for a population mean
based on a sample of size n drawn from a population with standard deviation, σ, is
√
±zα/2 σ/ n. If the confidence level is specified, we can look up the value zα/2 and the
standard deviation is specified, then we can compute the value of n needed to produce
a specified width.
If x̄ is used as an estimate of µ, we can be 100(1 − α)% confident that the error will
2-7
2.2. Interval estimation 2. Parameter Estimation
z
α/2 σ
2
n= (2.7)
e
Example 2.2.3. The sugar content in a one-cup serving of a certain breakfast cereal
was measured for a sample of 140 servings. The average was 11.9g and the standard
deviation was 1.1g
(a) Find a 99% confidence interval for the mean sugar content.
(c) How large a sample is needed so that a 95% confidence interval specifies the mean
to within ±0.1?
r
p̂(1 − p̂)
p̂ ± zα/2 (2.8)
n
2-8
2. Parameter Estimation 2.2. Interval estimation
(a) Find a 95% confidence interval for the proportion of cans in the shipment that
meet the specification.
(b) Find the sample size needed for a 90% confidence interval to specify the propor-
tion to within ±0.05.
r
2
σX σ2
X̄ − Ȳ ± zα/2 + Y (2.9)
n m
when the values of σX and σY are unknown, they can be replaced with the
sample standard deviations, sX and sY .
Example 2.2.5. A random sample of size n1 = 25, taken from a normal population
with a standard deviation σ1 = 5, has a mean x̄ = 80. A second random sample of size
n2 = 36, taken from a different normal population with a standard deviation σ2 = 3,
has a mean x̄2 = 75. Find a 95% confidence interval for µ1 − µ2 .
Example 2.2.6. A study was conducted to determine if a certain treatment has any
effect on the amount of metal removed in a pickling operation. A random sample of
100 pieces was immersed in a bath for 24 hours without the treatment, yielding an
average of 12.2 millimeters of metal removed and a sample standard deviation of 1.1
2-9
2.2. Interval estimation 2. Parameter Estimation
millimeters. A second sample of 200 pieces was exposed to the treatment, followed by
the 24-hour immersion in the bath, resulting in an average removal of 9.1 millimeters of
metal with a sample standard deviation of 0.9 millimeter. Compute a 99% confidence
interval estimate for the difference between the population means. Does the treatment
appear to reduce the mean amount of metal removed?
Consider the case where σ12 and σ22 are unknown. If σ12 and σ22 are equal, we obtain a
standard normal variable of the form
We have also seen from Chapter 1 that the two random variables
2-10
2. Parameter Estimation 2.2. Interval estimation
p
T = Z/ V /(n1 + n2 − 2) (2.12)
where Z and V are given by (2.10) and (2.11), has t-distribution with n1 + n2 − 2
degrees of freedom.
Definition 2.2.1. A point estimate of the unknown common variance σ 2 can be ob-
tained by pooling the sample variances defined as follows
We can now substitute for the unknown variance with Sp2 then use the T statistic
in (2.12) to construct the 100(1 − α)% confidence interval for the difference of means
when the variance is unknown.
Let x̄1 and x̄2 be the means of two independent random samples of sizes n1
and n2 respectively, from approximately normal populations with unknown
but equal variances. Then a 100(1 − α)% confidence interval for µ1 − µ2 is
given by
r
1 1
(x̄1 − x̄2 ) ± tα/2 sp + (2.14)
n1 n2
where s2p is the pooled variance estimate and tα/2 is the t-value with
n1 + n2 − 2 degrees of freedom leaving an area of α/2 to the right.
2-11
2.2. Interval estimation 2. Parameter Estimation
Paired observations
We shall consider estimation procedures for the difference of two means when the sam-
ples are not independent and the variances of the two populations are not necessarily
equal. The situation considered here deals with a very special experimental condi-
tion, namely that of paired observations. Unlike in the two independent samples, each
experimental unit has a pair of observations, one for each population.
For example, if we run a test on a new diet using 15 individuals, the weights be-
fore and after going on the diet form the information for our two samples. The two
populations are ‘before’ and ‘after’, and the experimental unit is the individual. To
determine if the diet is effective, we consider the differences d1 , d2 , . . . , dn in the paired
observations. These differences are the values of a random sample D1 , D2 , . . . , Dn from
a population of differences that we shall assume to be normally distributed with mean
2
µD = µ1 − µ2 and variance σD .
2-12
2. Parameter Estimation 2.2. Interval estimation
If d¯ and sd are the sample mean and sample standard deviation, respectively,
of the normally distributed differences of n random pairs of measurements, a
100(1 − α)% confidence interval for µD = µ1 − µ2 is
sd
d¯ ± tα/2 √ (2.15)
n
Example 2.2.8. Fortune magazine (March 1997) reported the total returns to in-
vestors for the 10 years prior to 1996 and also for 1996 for 431 companies. The total
returns for 10 of the companies are listed below. Find a 95% confidence interval for
the mean change in percent return to investors.
2-13
2.2. Interval estimation 2. Parameter Estimation
portions
Let p̂1 and p̂2 be the proportions of successes in random samples of sizes n1
and n2 respectively. Then the 100(1 − α)% confidence interval for the
difference of two binomial parameters, p1 − p2 , is given by
s
p̂1 (1 − p̂1 ) p̂2 (1 − p̂2 )
(p̂1 − p̂2 ) ± zα/2 + (2.16)
n1 n2
2-14