8 Statistical Estimation
8 Statistical Estimation
Introduction
There are 2 types of statistical inferences:-
(1) Statistical estimation (2) Hypothesis testing
Hypothesis testing involves the setting up of a hypothesis (or theory) about the
population and then sampling in order to see if the hypothesis is supported or
rejected.
Statistical estimation
Because of time and cost factors, the population parameters ( , , p ) are
frequently estimated by using sample statistics ( x , s, p ).
Point estimate
An estimate of a population parameter given by a single value and
calculated from sample data is called a point estimate of the population
parameter.
(a) x is a point estimate for .
(b) s is a point estimate for where
s = x n (for ungrouped data)
2
X 2
X
n n n 1
s1
Sample 2 of size n2 , sample mean X and sample standard deviation
2
s2
1
Then the point estimate for the population mean is
n1 X 1 n 2 X 2
X
n1 n 2
Determine a point estimate for (i) the population mean, . (ii) the
population standard deviation .
6.36 40.4496
6.32 39.9424
6.37 40.5769
6.33 40.0689
6.37 40.5769____
31.75 201.6147
2
Confidence interval estimates or confidence limits
Model I :
which contains 99% or 95% of all sample means can be obtained i.e. C. I.
estimate of can be obtained.
1%
The 99% C. I. estimate for is X 2.5758 X where P( Z > 2.5758) = 2
.
3
5%
Similarly, the 95% C. I. for is X 1.96 X where P( Z > 1.96) = 2
.
In general, the 100(1- )% C. I. for is X Z X
2
where P( Z > Z
2
)= 2
.
E.g. Suppose that a random sample of 5 observations was taken from a normal
population whose variance is 25. The results are 8, 15, 12, 6, 7. Find the 99%
C.I. estimate of the population mean.
E.g. A normal population has unknown mean and standard deviation 15. A
random sample of size 25 drawn from this population was found to have a mean
of 950.
(i) Construct (a) a 90% C.I. for ; (b) a 95% C.I. for ; (c) a 99% C.I. for
Solution: Given:
4
cm. What must be the size of the sample that he should examine if he wants to
be 95% certain?
Model II :
E.g. Fifty bags of sugar were randomly selected and carefully weighed. The
mean weight was 1.04 kg. and the standard deviation 0.002 kg. Construct a 99%
confidence interval for the mean weight of all bags.
5
E.g. The management of a company making a certain type of car component
wishes to ascertain the average number of components per hour produced by the
workers. The company employs a very large number of workers and it is decided
to use a sample of the output of 400 workers. After checking the output of this
sample, it was found that the average output produced by each worker every hour
is 100 with a standard deviation of 20.
(a) Calculate a 95% confidence interval for the average output produced by each
worker per hour for the whole factory.
(b) How large a sample is needed if the management wishes to be 95% confident
that the sample mean will be within one unit of the true mean?
Model III :If is unknown then it is estimated by s, and the sample size n is
small (n<30) and the population is normal or approximately normal.
s s N n
The 100(1- ) % C.I. for is X t
, n 1
SX where S X
n
or
n
N 1
2
6
t
and 2
, n 1 is obtained from t distribution table.
30
40
60
120
1.282 1.645 1.960 … 3.090 3.291
7
(b) Obtain a 95% C.I. for .
Population I Population II
8
Pop. Mean 1 2
Sample size n1 n2
Sample mean X1 X2
1 1 N 1 n1
X1 ~ N [ X , X 2
] where X 1 & X or
1 1 1 1
n1 n1 N1 1
2 2 N 2 n2
X2 ~ N [ X , X 2
] where X 2 & X or
2 2 2 2
n2 n2 N2 1
then X1 X 2 ~ N [ X X2
, X X2
2
] where X X2
X X 1 2
1 1 1 1 2
2 2 2 2
1 1 N 1 n1 N n2
and X
1 X2
n1
2
n2
or
n1 N 1 1
2 2
n2 N2 1
2 2 2 2
s1 s s1 N1 n1 s N n2
SX 2 or 2 2
1 X2
n1 n2 n1 N 1 1 n2 N2 1
9
In general, the 100(1- ) % C. I. for 1 2 is ( X 1 X 2 ) Z S X
2
1 X2
Z
where P ( Z > 2
) = 2
10
p Z s p (1 p ) p (1 p ) N n
p where s or .
2 p n n N 1
E.g. A producer of steel pipes selected a simple random sample of 300 pipes
from the production process to estimate the proportion of defective pipes.
There were 15 defective pipes in the sample.
(a) What is the point estimate of the proportion of defective pipes in the
population?
(b) Construct a 95% confidence interval estimate of the proportion of the
defective pipes in the population.
(c) How large a sample would be needed if the probability is to be 0.95 that
the error of estimate will not exceed 0.02 unit?
11
Point estimate for difference of 2 pop. proportions ( p1 - p2) is p1 - p2
2 p 2 (1 p 2 )
p2 ~ N [ , ) where p2 and
p2 p2 p2 p2 n2
p1 p 2 ~ N [ ,
2
] where
p1 p2 p1 p 2 p1 p2 p1 p2
p1 (1 p1 ) p 2 (1 p 2 )
and which is estimated by
2 2
p1 p 2 p1 p2 n1 n2
p1 (1 p1 ) p 2 (1 p 2 )
S
p1 p2 n1 n2
Z
P(Z > 2
) = 2
12