Chapter Two
Chapter Two
Chapter Two
Estimation
.
1
Objectives
After completing this chapter you will be able to:
compute point and confidence interval estimate of the
population mean and population proportion
Explain properties of best estimator
Determine sample size necessary in estimating population
parameter
2
Statistical Estimation
Estimation is the process of approximate or estimate various unknown
population parameters from sample statistics.
Inference is the process of making interpretations or conclusions from sample
data for the totality of the population.
In statistics, inference can be made in two ways .
i. Statistical estimation
ii. Statistical hypothesis testing.
count…
Population Analyzed
Inference Data
Sample Numerical
data
Data analysis is the process of extracting relevant information from the summarized
data.
4
Statistical Estimation
•It is way of making inference about the population parameter where the investigator
does not have any prior notion about values or characteristics of the population
parameter.
• There are two ways estimation.
1. Point Estimation
• It is a procedure that results in a single value as an estimate for a parameter.
2. Interval estimation
• It is the procedure that results in the interval of values as an estimate for
a parameter.
• It deals with identifying the upper and lower limits of a parameter. The limits by
themselves are random variable.
5
Definition of terms
• Estimator: is a sample statistic which is used to estimate a
population parameter.
It must be unbiased, consistent, and relatively efficient.
i. Unbiased Estimator: is an estimator whose expected value is
the value of the parameter being estimated.
ii. Consistent Estimator: is an estimator which gets closer to the
value of the parameter as the sample size increases.
iii. Relatively Efficient Estimator: The estimator with the smallest
6
Count…
7
Count…
Confidence level is the probability that the value of the parameter falls within the
specified range by the confidence interval.
• A point estimate is a single number that is used as an estimate of population parameter, and
is derived from a random sample taken from the population.
• point estimator is the mathematical way to compute the point estimate.
• Some of the most important point estimators are given below.
Parameter (population values) Estimator (statistic)
Population Mean, 𝜇 ത σ 𝑛𝑖 =1 𝑥 𝑖
𝑋= 𝑛
2
𝑖 =1(𝑋 𝑖 −𝑋)
σ𝑛
Population variance, 𝜎 2 𝑆2 = 𝑛 −1
Population S.D, 𝜎 S = 𝑆2
Population proportion, P 𝑥
11
𝑃ത 𝑛
Confidence Interval Estimation
• The statistic is "close to" the parameter. That leads to the obvious question,
what is "close"? Or How confident can we be that the value of the statistic
falls within a certain "distance" of the parameter?
The confident that the value of the statistic can falls within a certain distance
/range of the parameter is the confidence interval.
There are different cases to be considered to construct confidence intervals.
Case 1: If sample size is large or if the population is normal with known variance
• Recall the Central Limit Theorem, which applies to the sampling distribution of the
mean of a sample.
𝜎
The sampling distribution of 𝑋ሜ will have a mean𝑥lj𝜇 𝑥lj = 𝑛,
𝜇, standard deviation
& approaches a normal𝜎 distribution as n gets large. This
= allows us to use the normal
⇒ 𝜀 Taking
= 𝑍 𝜎ΤZ small
- To obtain the value
𝑛 of Z, we have to attach this to a theory of chance. That is,
is an area of size 1 − 𝛼 such that
there
𝑃(−𝑍𝛼Τ2 < 𝑍 < 𝑍𝛼Τ2) = 1 − 𝛼
𝑊ℎ𝑒𝑟𝑒 𝛼 = 𝑖𝑠 𝑡ℎ𝑒 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑡ℎ𝑎𝑡 𝑡ℎ𝑒 𝑝𝑎𝑟𝑎𝑚𝑒𝑡𝑒𝑟 𝑙𝑖𝑒𝑠 𝑜𝑢𝑡𝑠𝑖𝑑𝑒 𝑡ℎ 𝑒 14
𝑍𝛼Τ2 = 𝑠𝑡𝑎𝑛𝑑𝑠 𝑓𝑜𝑟 𝑡ℎ𝑒 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑒𝑑 𝑛𝑜𝑟𝑚𝑎𝑙 𝑣𝑎𝑟𝑖𝑎𝑏𝑙𝑒 𝑡𝑜 𝑡ℎ𝑒 𝑟𝑖𝑔ℎ𝑡 𝑜𝑓 𝑤ℎ𝑖𝑐ℎ
𝟏𝟎𝟎(𝟏 − 𝑎) % 𝑎 𝑎Τ 𝟐 𝒁𝑎Τ 𝟐
Case 2: If sample size is small and the population variance, 𝝈𝟐is not known.
The unit of measurement of the confidence interval is the standard error. This is just the
standard deviation of the sampling distribution of the statistic.
16
Examples
1. From a normal sample of size 25 a mean of 32 was found. Given that the
population standard deviation is 4.2. Find
2. A drug company is testing a new drug which is supposed to reduce blood pressure.
From the six people who are used as subjects, it is found that the average drop in
blood pressure is 2.28 points, with a standard deviation of .95 points. What is the
95% confidence interval for the mean change in pressure? 17
Solution
= 32 ± 1.96 ∗ 4.2Τ 25 = 32 ±
1.65 =
(30.35,33.65)
18
2. Solution
95% confident that the mean decrease in blood pressure is between 1.28 and
3.28 points.
19
Interval Estimation of the Population Proportion
Sample proportion, 𝑝, is an unbiased estimator of a population proportion P and if the
sample size is large then, the sampling distribution of 𝑝 is normal with 𝑍 = 𝑃−𝑃 = 𝑃−𝑃
𝑃�� .
𝜎𝑝
𝑛
ൗ
• However, p is unknown, it estimate by 𝑝 & 𝜎 𝑝 substituted by 𝑆 𝑝 and Z
becomes
𝑝��ൗ 𝑃−𝑃
𝑍= 𝑛
, Solving for P
𝑃 =𝑝+𝑍 𝑝��ൗ
𝑛 and Z can assume both
positive and negative values,
𝑃 =𝑝±𝑍 𝑝��ൗ
𝑛 . Z represents the confidence
level
𝑃 = 𝑝 ± 𝑍𝛼/2 𝑝��ൗ
𝑛 = 𝑝 ± 𝑍 𝛼/2 𝑆 𝑝
Example
1. Recently, a study of 87 randomly selected companies with
telemarketing operation was completed. The study revealed that
39% of the sampled companies had used telemarketing to assist
them in order processing. Estimate the population proportion of
telemarketing companies who use their telemarketing operation to
assist them in order processing taking a 95% confidence level.
21
Solution:
=
𝑃 = 𝑝 ±𝑛 𝑍𝛼/2 𝑆𝑝 0.39 ± 1.96(0.0523)
𝛿2 Solving for n,
𝑒2 = 𝑍2
𝛼/2 𝑛
𝑍2 𝜎2
𝛼/2
𝑛=
𝑒2 2
𝑛𝜇 = 𝑍 𝛼/2 if 𝜎 is known
𝑒 𝜎
2
𝑛𝜇 = 𝑍 𝛼/2 𝑠 if 𝜎 is not known
𝑒 24
Examples
1. A gasoline service station shows a standard deviation of Birr 6.25 for the changes
made by the credit card customers. Assume that the station’s management would
like to estimate the population mean gasoline bill for its credit card customers to
be with in ± Birr 1.00. For a 95% confidence level, how large a sample would be
necessary?
2. The National Travel and Tour Organization (NTO) would like to estimate the
mean amount of money spent by a tourist to be with in Birr 100 with 95%
confidence. If the amount of money spent by tourist is considered to be normally
distributed with a standard deviation of Br 200, what sample size would be
necessary for the NTO to meet their objective in estimating this mea n amount?
2 5
Solution
1. e = Birr 1.00, σ = Birr 6.25, C = 0.95, 𝑍𝛼/2 = 𝑍0.025 = 1.96
2
𝑛𝜇 = 𝑍 𝛼/2 𝜎
𝑒
1.96∗6.25 2
𝑛𝜇 = = 150.06 ≈ 151
1
26
Sample size for estimating population proportion, p.
𝑝𝑞 𝑝𝑞
• The confidence interval for p is 𝑃 = 𝑝 ± 𝑍𝛼 /2 . The expression 𝑍 𝛼 /2 is
𝑛 𝑛
𝑝𝑞
𝑒 2 = 𝑍𝛼2 /2 𝑛
, solving for n
𝑍2 𝑝 𝑞
𝑛𝑝 = 𝛼/2
𝑒2
2
• Since if 𝑝 𝑎𝑛𝑑 𝑞 not given, we use p and q 𝑝 becomes, 𝑛𝑝 = 𝑍 𝛼/2 𝑝𝑞
𝑒2
Examples
1. Suppose that a production facility purchases a particular component parts in large
lots from a supplier. The production manager wants to estimate the proportion of
defective parts received from this supplier. She believes that the proportion of
defects is no more than 0.2 and wants to be with in 0.02 of the true proportion of
defects with a 90% level of confidence. How large a sample should she take?
2. What is the largest sample size that would be needed in estimating a population
proportion to with in ± 0.02, with a confidence coefficient of 0.95?
28
Solution
1. e = p = 0.2, q =0.8, C = 0.90 𝑍𝛼/2 = 𝑍0.05 = 1.64
2
0.02, and 𝑛𝑝 = 𝑍 𝛼/2 𝑝𝑞
𝑒
2
1.64
𝑛𝑝 = 0.2 ∗ 0.8 = 1075.84 ≈ 1076
0.02