Lecture 5
◼ Introduction:
◼ Statistical inference is the procedure by which we
reach a conclusion about a population on the basis
of the information contained in a sample drawn from
that population.
◼ Suppose that:
◼ An administrator of a large hospital is interested in
the mean age of patients admitted to his hospital
during a given year.
1. It would be too expensive to go through the records of
all patients admitted during that particular year.
2. He consequently elects to examine a sample of the
records, from which he can compute an estimate of
the mean age of patients admitted to his hospital that year.
• For any parameter, we can compute two types of
estimate: a point estimate and an interval estimate.
◼ A point estimate is a single numerical value used to
estimate the corresponding population parameter.
◼ An interval estimate consists of two numerical values
defining a range of values that, with a specified degree
of confidence, we feel includes the parameter being
estimated.
◼ The Estimate and the Estimator:
◼ The estimate is a single computed value, whereas the
estimator is the rule that tells us how to compute this
value.
For example,
◼ $$\bar{x} = \frac{\sum_{i} x_i}{n}$$
◼ is an estimator of the population mean, μ. The
single numerical value that results from
evaluating this formula is called an estimate of
the parameter μ.
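The distinction between estimator and estimate can be sketched in code. In this minimal illustration the function is the estimator (the rule), and the number it returns for one particular sample is the estimate. The function name `sample_mean` and the ages are hypothetical, chosen only for illustration.

```python
def sample_mean(values):
    """Estimator: the rule x-bar = (sum of x_i) / n."""
    return sum(values) / len(values)

# Hypothetical ages from a sample of patient records (illustrative data only).
ages = [34, 51, 47, 62, 29, 45]

# The estimate: the single number the rule produces for this sample.
estimate = sample_mean(ages)
print(f"point estimate of the mean age: {estimate:.2f}")
```

A different sample of records would give a different estimate, while the estimator stays the same.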
6.2 Confidence Interval for
a Population Mean: (C.I)
Suppose researchers wish to estimate the mean μ
of some normally distributed population.
◼ They draw a random sample of size n from the
population and compute x̄, which they use as a
point estimate of μ.
◼ Because random sampling involves chance, x̄
cannot be expected to equal μ exactly.
◼ The value of x̄ may be greater than or less
than μ.
◼ It is therefore more meaningful to estimate μ
by an interval.
The 100(1 − α) percent confidence
interval (C.I.) for μ, with lower limit L and upper limit U:
$$P(L \le \mu \le U) = 1 - \alpha$$
For example:
◼ When α = 0.01, then 1 − α = 0.99
◼ When α = 0.05, then 1 − α = 0.95
◼ When α = 0.10, then 1 − α = 0.90
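We have the following cases
a) When the population is normal
1) When the variance is known and the sample size is large
or small, the C.I. has the form:
◼ $$P\left(\bar{X} - Z_{1-\frac{\alpha}{2}}\frac{\sigma}{\sqrt{n}} < \mu < \bar{X} + Z_{1-\frac{\alpha}{2}}\frac{\sigma}{\sqrt{n}}\right) = 1 - \alpha$$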
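The quantile Z(1 − α/2) in this formula is usually read from a normal table, but it can also be computed. The following is a minimal sketch, assuming Python's standard-library `statistics.NormalDist`; the helper name `z_critical` is illustrative and not part of the lecture.

```python
from statistics import NormalDist

def z_critical(alpha):
    """Return Z_{1 - alpha/2}, the standard normal quantile used in the C.I."""
    return NormalDist().inv_cdf(1 - alpha / 2)

# The three significance levels listed earlier in the lecture.
for alpha in (0.01, 0.05, 0.10):
    print(f"alpha = {alpha:.2f}  ->  Z = {z_critical(alpha):.4f}")
```

Rounded to two decimals, these agree with the usual table values 2.58, 1.96 and 1.645 (the last often quoted as 1.64 or 1.65).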
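b) When the population is not
normal and n is large (n > 30)
1) When the variance is known, the C.I. has the
form:
◼ $$P\left(\bar{X} - Z_{1-\frac{\alpha}{2}}\frac{\sigma}{\sqrt{n}} < \mu < \bar{X} + Z_{1-\frac{\alpha}{2}}\frac{\sigma}{\sqrt{n}}\right) = 1 - \alpha$$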
Solution:
◼ 1 − α = 0.95 → α = 0.05 → α/2 = 0.025, x̄ = 22
◼ variance = σ² = 45 → σ = √45, n = 10
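With these values, the known-variance interval x̄ ± Z(1 − α/2) σ/√n can be evaluated directly. The sketch below assumes that form applies here (normal population, known variance) and uses Python's standard library; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

x_bar = 22        # sample mean from the solution
variance = 45     # known population variance
n = 10            # sample size
alpha = 0.05      # for a 95% C.I.

z = NormalDist().inv_cdf(1 - alpha / 2)   # Z_{0.975}, about 1.96
margin = z * sqrt(variance) / sqrt(n)     # Z * sigma / sqrt(n)

lower = x_bar - margin
upper = x_bar + margin
print(f"95% C.I. for the mean: ({lower:.2f}, {upper:.2f})")
```

The limits come out at approximately 17.84 and 26.16.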
Example
The activity values of a certain enzyme measured in
normal gastric tissue of 35 patients with gastric
carcinoma have a mean of 0.718 and a standard
deviation of 0.511. We want to construct a 90%
confidence interval for the population mean.
◼ Solution:
Then the 90% confidence interval for μ is given
by:
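A minimal sketch of this computation, assuming the large-sample form x̄ ± Z(1 − α/2) s/√n, in which the sample standard deviation s stands in for σ because n = 35 > 30. The variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

x_bar = 0.718   # sample mean of the enzyme activity values
s = 0.511       # sample standard deviation (used in place of sigma, n > 30)
n = 35          # number of patients
alpha = 0.10    # for a 90% C.I.

z = NormalDist().inv_cdf(1 - alpha / 2)   # Z_{0.95}, about 1.645
margin = z * s / sqrt(n)

lower = x_bar - margin
upper = x_bar + margin
print(f"90% C.I. for the mean: ({lower:.3f}, {upper:.3f})")
```

Under this assumption the interval is roughly (0.576, 0.860).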
Example 6.3.1 Page 174:
◼ Suppose researchers studied the effectiveness of
early weight bearing and ankle therapies following
acute repair of a ruptured Achilles tendon. One of the
variables they measured following treatment was
muscle strength. In 19 subjects, the mean
strength was 250.8 with a standard deviation of 130.9.
We assume that the sample was taken from an
approximately normally distributed population.
Calculate the 95% confidence interval for the mean
strength.
Solution:
◼ 1 − α = 0.95 → α = 0.05 → α/2 = 0.025, x̄ = 250.8
◼ Standard deviation = S = 130.9, n = 19
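Because the population variance is unknown and the sample is small, a t-based interval x̄ ± t S/√n with n − 1 = 18 degrees of freedom is the natural choice here. The sketch below assumes that form; the critical value 2.1009 is taken from a standard t table rather than from the lecture, and the variable names are illustrative.

```python
from math import sqrt

x_bar = 250.8   # sample mean of muscle strength
s = 130.9       # sample standard deviation
n = 19          # number of subjects

# Assumed table value of t_{0.975} with n - 1 = 18 degrees of freedom.
t_crit = 2.1009

margin = t_crit * s / sqrt(n)
lower = x_bar - margin
upper = x_bar + margin
print(f"95% C.I. for the mean strength: ({lower:.1f}, {upper:.1f})")
```

With these inputs the interval is roughly (187.7, 313.9).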
6.3 Confidence Interval for
the difference between two
Population Means: (C.I)
If we draw two samples from two independent populations
and we want the confidence interval for the
difference between the two population means, then we have
the following cases:
a) When the populations are normal
1) When the variances are known and the sample sizes
are large or small, the C.I. has the form:
$$(\bar{x}_1 - \bar{x}_2) - Z_{1-\frac{\alpha}{2}}\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} < \mu_1 - \mu_2 < (\bar{x}_1 - \bar{x}_2) + Z_{1-\frac{\alpha}{2}}\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$
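2) When the variances are unknown but equal, and the
sample sizes are small, the C.I. has the form:
$$(\bar{x}_1 - \bar{x}_2) - t_{1-\frac{\alpha}{2},\,(n_1+n_2-2)}\, S_p\sqrt{\frac{1}{n_1} + \frac{1}{n_2}} < \mu_1 - \mu_2 < (\bar{x}_1 - \bar{x}_2) + t_{1-\frac{\alpha}{2},\,(n_1+n_2-2)}\, S_p\sqrt{\frac{1}{n_1} + \frac{1}{n_2}}$$
where
$$S_p^2 = \frac{(n_1 - 1)S_1^2 + (n_2 - 1)S_2^2}{n_1 + n_2 - 2}$$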
Example 6.4.1 P174:
A research team is interested in the difference between serum uric
acid levels in patients with and without Down's syndrome. In a
large hospital for the treatment of the mentally retarded, a sample of
12 individuals with Down's syndrome yielded a mean of x̄1 = 4.5
mg/100 ml. In a general hospital, a sample of 15 normal individuals of
the same age and sex was found to have a mean value of x̄2 = 3.4.
If it is reasonable to assume that the two populations of values are
normally distributed with variances equal to 1 and 1.5, find the 95%
C.I. for μ1 − μ2.
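Solution:
1 − α = 0.95 → α = 0.05 → α/2 = 0.025 → Z(1 − α/2) = Z0.975 = 1.96
$$(\bar{x}_1 - \bar{x}_2) \pm Z_{1-\frac{\alpha}{2}}\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}} = (4.5 - 3.4) \pm 1.96\sqrt{\frac{1}{12} + \frac{1.5}{15}}$$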
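The arithmetic in this solution can be carried through as follows. This is a minimal sketch using the values stated in the example and the table value Z0.975 = 1.96 quoted above; the variable names are illustrative.

```python
from math import sqrt

x1_bar, x2_bar = 4.5, 3.4   # sample means (with and without Down's syndrome)
var1, var2 = 1.0, 1.5       # known population variances
n1, n2 = 12, 15             # sample sizes
z = 1.96                    # Z_{0.975}, as quoted in the solution

diff = x1_bar - x2_bar
margin = z * sqrt(var1 / n1 + var2 / n2)

lower = diff - margin
upper = diff + margin
print(f"95% C.I. for mu1 - mu2: ({lower:.2f}, {upper:.2f})")
```

The limits are approximately 0.26 and 1.94. Since the interval does not contain 0, the data are consistent with a real difference between the two population means.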
Example 6.4.1 P178:
The purpose of the study was to determine the effectiveness of an
integrated outpatient dual-diagnosis treatment program for mentally ill
subjects. The authors were addressing the problem of substance abuse
among people with severe mental disorders. A retrospective chart
review was carried out on 50 patients. The researchers were interested in the
number of inpatient treatment days for psychiatric disorder during the year
following the end of the program. Among 18 patients with schizophrenia,
the mean number of treatment days was 4.7 with a standard deviation of
9.3. For 10 subjects with bipolar disorder, the mean number of treatment
days was 8.8 with a standard deviation of 11.5. We wish to construct a 99%
C.I. for the difference between the means of the populations represented
by the two samples.
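Solution:
◼ 1 − α = 0.99 → α = 0.01 → α/2 = 0.005 → 1 − α/2 = 0.995
◼ n1 + n2 − 2 = 18 + 10 − 2 = 26
◼ t(1 − α/2),(n1+n2−2) = t0.995,26 = 2.7787, then the 99% C.I. for μ1 − μ2 is
$$(\bar{x}_1 - \bar{x}_2) \pm t_{1-\frac{\alpha}{2},\,(n_1+n_2-2)}\, S_p\sqrt{\frac{1}{n_1} + \frac{1}{n_2}}$$
◼ where
$$S_p^2 = \frac{(n_1 - 1)S_1^2 + (n_2 - 1)S_2^2}{n_1 + n_2 - 2} = \frac{(17 \times 9.3^2) + (9 \times 11.5^2)}{18 + 10 - 2} = 102.33$$
◼ then the limits follow by substituting x̄1 = 4.7, x̄2 = 8.8, n1 = 18, n2 = 10 and Sp = √102.33 into this formula.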
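The pooled-variance computation can be checked numerically. This is a minimal sketch using the sample values from the example and the table value t0.995,26 = 2.7787 quoted in the solution; the variable names are illustrative.

```python
from math import sqrt

x1_bar, s1, n1 = 4.7, 9.3, 18    # schizophrenia group
x2_bar, s2, n2 = 8.8, 11.5, 10   # bipolar disorder group
t_crit = 2.7787                  # t_{0.995, 26}, as quoted in the solution

# Pooled variance: weighted average of the two sample variances.
sp2 = ((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2)

diff = x1_bar - x2_bar
margin = t_crit * sqrt(sp2) * sqrt(1 / n1 + 1 / n2)

lower = diff - margin
upper = diff + margin
print(f"pooled variance: {sp2:.2f}")
print(f"99% C.I. for mu1 - mu2: ({lower:.2f}, {upper:.2f})")
```

The pooled variance reproduces 102.33, and the limits are approximately −15.19 and 6.99. Because the interval contains 0, the data do not rule out equal population means.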
6.5 Confidence Interval for a
Population proportion (P):
A sample is drawn from the population of interest, and
the sample proportion p̂ is computed as
$$\hat{p} = \frac{\text{no. of elements in the sample with some characteristic}}{\text{total no. of elements in the sample}} = \frac{a}{n}$$
Example 6.5.1
The Pew Internet and American Life Project reported in 2003 that 18%
of internet users have used the internet to search for
information regarding experimental treatments or
medicine. The sample consisted of 1220 adult internet
users, and information was collected from telephone
interviews. We wish to construct a 98% C.I. for the
proportion of internet users who have searched for
information about experimental treatments or medicine.
Solution:
1 − α = 0.98 → α = 0.02 → α/2 = 0.01 → 1 − α/2 = 0.99
Z(1 − α/2) = Z0.99 = 2.33, n = 1220, p̂ = 18/100 = 0.18
The 98% C.I. is
$$\hat{P} \pm Z_{1-\frac{\alpha}{2}}\sqrt{\frac{\hat{P}(1 - \hat{P})}{n}} = 0.18 \pm 2.33\sqrt{\frac{0.18(1 - 0.18)}{1220}}$$
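Evaluating this expression gives the interval limits. This is a minimal sketch using the values from the solution, including the table value Z0.99 = 2.33; the variable names are illustrative.

```python
from math import sqrt

p_hat = 0.18   # sample proportion
n = 1220       # sample size
z = 2.33       # Z_{0.99}, as quoted in the solution

# Estimated standard error of the sample proportion.
std_err = sqrt(p_hat * (1 - p_hat) / n)
margin = z * std_err

lower = p_hat - margin
upper = p_hat + margin
print(f"98% C.I. for the proportion: ({lower:.4f}, {upper:.4f})")
```

The limits are approximately 0.1544 and 0.2056.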
6.6 Confidence Interval for the
difference between two Population
proportions :
Two samples are drawn from two independent populations
of interest, and the sample proportion of the characteristic
of interest is computed for each sample. An unbiased
point estimator for the difference between the two population
proportions is $\hat{P}_1 - \hat{P}_2$.
Example 6.6.1
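Solution:
1 − α = 0.99 → α = 0.01 → α/2 = 0.005 → 1 − α/2 = 0.995
Z(1 − α/2) = Z0.995 = 2.58, nF = 68, nM = 255,
$$\hat{p}_F = \frac{a_F}{n_F} = \frac{31}{68} = 0.4559, \qquad \hat{p}_M = \frac{a_M}{n_M} = \frac{53}{255} = 0.2078$$
The 99% C.I. is
$$(\hat{P}_F - \hat{P}_M) \pm Z_{1-\frac{\alpha}{2}}\sqrt{\frac{\hat{P}_F(1 - \hat{P}_F)}{n_F} + \frac{\hat{P}_M(1 - \hat{P}_M)}{n_M}}$$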
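Substituting the counts from the solution into this formula gives the interval limits. This is a minimal sketch using the table value Z0.995 = 2.58 quoted above; the variable names are illustrative.

```python
from math import sqrt

a_f, n_f = 31, 68     # count with the characteristic and sample size, group F
a_m, n_m = 53, 255    # count with the characteristic and sample size, group M
z = 2.58              # Z_{0.995}, as quoted in the solution

p_f = a_f / n_f       # about 0.4559
p_m = a_m / n_m       # about 0.2078

# Estimated standard error of the difference between the two proportions.
std_err = sqrt(p_f * (1 - p_f) / n_f + p_m * (1 - p_m) / n_m)

diff = p_f - p_m
lower = diff - z * std_err
upper = diff + z * std_err
print(f"99% C.I. for P_F - P_M: ({lower:.4f}, {upper:.4f})")
```

The limits are approximately 0.0790 and 0.4171. The interval excludes 0, which points to a difference between the two population proportions.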
◼ Exercises:
◼ Questions:
◼ 6.2.1, 6.2.2, 6.2.5, 6.3.2, 6.3.5, 6.4.2
◼ 6.5.3, 6.5.4, 6.6.1