7 Estimation
7 Estimation
E ( )=µ
Properties of good estimates
A. Unbiased Estimator
♣ A statistic is said to be an unbiased estimator
of the corresponding population parameter
if, taken over its sampling distribution, is
equal to the population parameter value.
♣ The "long run" average of the statistic is
equal to the population parameter value.
♣ The sample mean and median are unbiased
estimators of the population mean .
B. Minimum Variance
Estimating the Sampling Error
• Any estimates derived from samples are
subject to the sampling error.
• This comes from the fact that only a part of
the population was observed, instead of the
whole.
• A different samples could have come up with
different results. The amount of variation
that exists among the estimates from the
different possible samples is the sampling
error.
• The set of sample means in repeated random
samples of size n from a given population has
variance .
n sem
1 10.3
20 2.3
100 1.0
99%
Finding the Critical Value
Margin of Error
(Precision of the estimate)
Factors Affecting Margin of Error
• Although the true mean may or may not be in this interval, 95% of the
intervals formed in this manner will contain the true mean.
0.4
Z distribution
0.3
density
0.1
0.0
-5 0 5
Value
As the df gets larger, the student’s t-distribution looks more and more
like the SND with mean=0 and variance=1.
What happens to CI as
sample gets larger?
s For large samples:
x Z
n Z and t values
become almost
identical, so CIs are
s almost identical.
x t
n
Degrees of Freedom (df)
df = Number of observations that are free to vary after
sample mean has been calculated
df = n-1
Student’s t Table
t distribution values
• With comparison to the Z value
Example
Example
• Standard error =
• t-value at 90% CL at 19 df =1.729
Exercise
• Compute a 95% CI for the mean birth
weight based on n = 10, sample mean =
116.9 oz and s =21.70.
• From the t Table, t9, 0.975 = 2.262
• Ans: (101.4, 132.4)
2. CIs for single population
proportion, p
Hence,
•
Example 3
• Suppose that among 10,000 female operating-room
nurses, 60 women have developed breast cancer over
five years. Find the 95% for p based on point estimate.
• Point estimate = 60/10,000 = 0.006
• The 95% CI for p is given by the interval:
• SE of the difference =
• 95% CI
– Lower = ( point estimate ) - (Zα/2) (SE)
= 0.38 – (1.96)(0.0925) = 0.20
– Upper = ( point estimate ) + (Zα/2) (SE)
= 0.38 + (1.96)(0.0925) = 0.56
• 95% CI = (0.20, 0.56)
CI for the true OR
• The odds ratio is defined as the odds of
disease among exposed individuals
divided by the odds of disease among the
unexposed
• It is estimated by:
• Upper and lower confidence limits for the natural log of the odds ratio are
calculated with the formula:
ˆ Z 1 1 1 1
ln OR
a b c d
1 1 1 1 1 1 1 1
ln ORˆ Z
ln ORˆ Z
a b c limits
d a b c d
e
ntiate the upper and lower confidence for ,e
• where
log of the
– Z OR:
is the standard normal value for the level of confidence desired
– a-d are the cells of a 2x2 table