QEM 2004 - Module 2 (Confidence Interval Estimation)
QEM 2004 - Module 2 (Confidence Interval Estimation)
Estimation
Agenda of the Day
Lower Upper
Confidence Confidence
Point Estimate Limit
Limit
Width of
confidence interval
Point Estimates
Mean μ X
Proportion p ps
Confidence Intervals
(continued)
Sample
General Formula
Confidence Level
Confidence in which the interval
will contain the unknown
population parameter
A percentage (less than 100%)
Confidence Level, (1-)
(continued)
Suppose confidence level = 95%
Also written (1 - ) = .95
A relative frequency interpretation:
In the long run, 95% of all the confidence
intervals that can be constructed will contain the
unknown true parameter
A specific interval either will contain or will
not contain the true parameter
Confidence Intervals(continued)
Confidence
Intervals
Population Population
Mean Proportion
σ Known σ Unknown
Confidence Interval for μ
(σ Known)
Assumptions
Population standard deviation σ is known
Population is normally distributed
If population is not normal, use large sample
σ
(where Z is the normal distribution X Z
critical value for a probability of α/2 in each tail) n
Finding the Critical Value, Z
α α
.025 .025
2 2
/2 1 /2
x
Intervals μx μ
extend from x1
σ x2 (1-α)x100%
XZ of intervals
n
to constructed
σ contain μ;
X Z
n (α)x100% do
not.
Confidence Intervals
Example
Confidence
Intervals
Population Population
Mean Proportion
σ Known σ Unknown
Confidence Interval for μ
(σ Unknown)
If the population standard deviation σ is
unknown, we can substitute the sample
standard deviation, S
This introduces extra uncertainty, since
S is variable from sample to sample
So we use the t distribution instead of the
normal distribution Z
Confidence Interval for μ
(σ Unknown)-continued
Assumptions
Population standard deviation is unknown
Population is normally distributed
If population is not normal, use large sample
Use Student’s t Distribution
Confidence Interval Estimate:
S
X t n-1
(where t is the critical value of the t n
distribution with n-1 d.f. and an area of α/2 in each tail)
Student’s t Distribution
d.f. = n - 1
Degrees of Freedom (df)
Idea: Number of observations that are free to vary
after sample mean has been calculated
Standard
Normal
(t with df = 50)
t (df = 13)
t-distributions are bell-
shaped and symmetric, but
have ‘fatter’ tails than the t (df = 5)
normal
0 t
Student’s t Table
Confidence t t t Z
Level (10 d.f.) (20 d.f.) (30 d.f.) ____
Note: t Z as n increases
Example- t distribution
A random sample of n = 25 has X = 50 and
S = 8. Form a 95% confidence interval for μ
Confidence
Intervals
Population Population
Mean Proportion
σ Known σ Unknown
Confidence Intervals for the
Population Proportion, p
p(1 p)
σp
n
We will estimate this with sample data:
ps(1 ps )
n
Confidence Interval Endpoints
Upper and lower confidence limits for the
population proportion are calculated with the
formula
ps(1 ps )
ps Z
n
where
Z is the standard normal value for the level of confidence desired
ps is the sample proportion
n is the sample size
Example-Proportions
ps Z ps(1 ps )/n
Determining
Sample Size
For the
Mean Sampling error
(margin of error)
σ σ
X Z e Z
n n
Determining Sample Size for
mean(continued)
Determining
Sample Size
For the
Mean
σ 2
Z σ 2
e Z Now solve
for n to get n 2
n e
Determining Sample
Size(continued)
2 2 2 2
Z σ (1.645) (45)
n 2 2
219.19
e 5
Determining
Sample Size
For the
Proportion
ps(1 ps ) p(1 p)
ps Z e Z
n n
Sampling error
(margin of error)
proportions(continued)
Determining
Sample Size
For the
Proportion
2 2
Z p (1 p) (1.96) (.12)(1 .12)
n 2
2
450.74
e (.03)
So use n = 451
Applications in Auditing
Six advantages of statistical sampling in
auditing
S N n
NX N ( t n 1 )
n N 1
(This is sampling without replacement, so use the finite population
correction in the confidence interval formula)
Confidence Interval for
Population Total: Example
SD N n
ND N ( t n 1 )
n N 1
where n
i
(D D ) 2
SD i1
n 1
One Sided Confidence Intervals
Application: find the upper bound for the
proportion of items that do not conform with
internal controls
ps(1 ps ) N n
Upper bound ps Z
n N 1
where
Z is the standard normal value for the level of confidence desired
ps is the sample proportion of items that do not conform
n is the sample size
N is the population size
Ethical Issues