BSDS Slides Module 8 9 11

The document discusses hypothesis testing for binomial proportions and normal means, detailing methods such as the UMP test and Student's t-test. It provides examples of testing a new drug's cure rate against a standard and assessing university students' sleep hours, including critical values and power functions. Additionally, it outlines large sample tests and confidence intervals associated with these tests.

Uploaded by

priyamsahoojnvkp

Statistics II : Introduction to Inference

Modules 8,9,11: Some Important Hypothesis Testing and Associated Confidence Intervals

1 Testing of binomial proportions

Example 1. A pharmaceutical company develops a new drug intended to cure a certain disease. They want
to test whether the cure rate (proportion of patients cured) is better than that of the current standard drug, which
has a known cure rate of 60%. They conduct a clinical trial where 30 patients are given the new drug.
Among those, 22 are cured. Is the cure rate of the new drug significantly higher than 60%?
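(i) Setup: $X_1, \dots, X_n \overset{iid}{\sim} \text{Bernoulli}(p)$. Consider the problem of testing $H_0 : p = p_0$ against $H_{1,1} : p > p_0$ (or, $H_{1,2} : p < p_0$, or, $H_{1,3} : p \neq p_0$). The UMP test under $H_{1,1}$ (or, $H_{1,2}$) at level $\alpha$ is

\[
\phi(X) =
\begin{cases}
1 & \text{if } n\bar{X}_n > c_\alpha \ (\text{or, } n\bar{X}_n < k_\alpha), \\
\gamma & \text{if } n\bar{X}_n = c_\alpha \ (\text{or, } n\bar{X}_n = k_\alpha), \\
0 & \text{otherwise.}
\end{cases}
\]

Under $H_0$,
\[
n\bar{X}_n = \sum_{i=1}^{n} X_i \sim \text{Binomial}(n, p_0).
\]
Find the critical value $c_\alpha$ (or, $k_\alpha$) and $\gamma$ from the equation $E_{p_0}(\phi(X)) = \alpha$.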
Further, under the alternative $H_{1,3}$, the above test can be generalized to

\[
\phi'(X) =
\begin{cases}
1 & \text{if } n\bar{X}_n > c'_\alpha \text{ or } n\bar{X}_n < k'_\alpha, \\
\gamma_1 & \text{if } n\bar{X}_n = c'_\alpha, \\
\gamma_2 & \text{if } n\bar{X}_n = k'_\alpha, \\
0 & \text{otherwise.}
\end{cases}
\]

The critical values $c'_\alpha$, $k'_\alpha$ and $(\gamma_1, \gamma_2)$ should satisfy $E_{p_0}(\phi'(X)) = \alpha$. However, note that it is not
possible to obtain four unknown quantities from one equation. Therefore, one imposes additional
restrictions to get unique solutions. One possible restriction can be $\gamma_1 = \gamma_2$ and $c'_\alpha = n - k'_\alpha$
(symmetric critical region).
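
The one-sided critical value and randomization probability can be found numerically. The following is a minimal sketch, assuming the setting of Example 1 ($n = 30$, $p_0 = 0.6$) and a level $\alpha = 0.05$ that the notes do not fix; the function names are illustrative and not part of the notes.

```python
from math import comb

def binom_pmf(k, n, p):
    """P(S = k) for S ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def upper_tail_test(n, p0, alpha):
    """Return (c, gamma) with P(S > c) + gamma * P(S = c) = alpha under H0."""
    tail = 0.0  # running value of P(S > c) under H0
    for c in range(n, -1, -1):
        pmf_c = binom_pmf(c, n, p0)
        if tail + pmf_c > alpha:
            # c is the largest value with P(S >= c) > alpha; randomize on S = c
            return c, (alpha - tail) / pmf_c
        tail += pmf_c
    raise ValueError("alpha must lie in (0, 1)")

# Assumed inputs: n and p0 from Example 1, alpha = 0.05 chosen for illustration.
n, p0, alpha = 30, 0.6, 0.05
c, gamma = upper_tail_test(n, p0, alpha)

# Size of the randomized test; it should reproduce alpha up to rounding error.
size = sum(binom_pmf(k, n, p0) for k in range(c + 1, n + 1)) + gamma * binom_pmf(c, n, p0)
print(f"c = {c}, gamma = {gamma:.4f}, size = {size:.6f}")
```

With these values, $H_0$ is rejected outright when the number of cured patients exceeds `c`, and with probability `gamma` when it equals `c`.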
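(ii) Large sample test. By CLT,
\[
\frac{n\bar{X}_n - np_0}{\sqrt{np_0(1 - p_0)}} \xrightarrow[\text{under } H_0]{D} Z,
\]
where $Z \sim N(0, 1)$.
Using this result, one can construct a large sample test for testing $H_0$ against the possible alternatives
as follows:

\[
\phi_{LS}(X) =
\begin{cases}
1 & \text{if }
\begin{cases}
n\bar{X}_n > c_{\alpha,LS} & (\text{against } H_{1,1}), \\
n\bar{X}_n < k_{\alpha,LS} & (\text{against } H_{1,2}), \\
n\bar{X}_n > c'_{\alpha,LS} \text{ or } n\bar{X}_n < k'_{\alpha,LS} & (\text{against } H_{1,3}),
\end{cases} \\
0 & \text{otherwise.}
\end{cases}
\]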
Fig. 1: Power curve for the exact test $\phi$ against $H_{1,1} : p > p_0$ with $p_0 = 1/4$ (left), $p_0 = 1/2$ (middle) and $p_0 = 3/4$ (right).

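The critical points are now determined by solving the equation $E_{p_0}(\phi_{LS}(X)) = \alpha$, under the large
sample distribution of $n\bar{X}_n$. For example, $c_{\alpha,LS}$ can be obtained to satisfy
\[
E_{p_0}(\phi_{LS}(X)) = P_{p_0}\left(n\bar{X}_n > c_{\alpha,LS}\right)
= P_{p_0}\left(\frac{n\bar{X}_n - np_0}{\sqrt{np_0(1 - p_0)}} > \frac{c_{\alpha,LS} - np_0}{\sqrt{np_0(1 - p_0)}}\right) = \alpha,
\]
which provides $c_{\alpha,LS} = np_0 + \tau_\alpha \sqrt{np_0(1 - p_0)}$. Here $\tau_\alpha$ is the upper-$\alpha$ point of the $N(0, 1)$ distribution,
i.e., $\tau_\alpha$ is such that $P(X \geq \tau_\alpha \mid X \sim N(0, 1)) = \alpha$.
Similarly, $k_{\alpha,LS}$ can be obtained to satisfy
\[
E_{p_0}(\phi_{LS}(X)) = P_{p_0}\left(n\bar{X}_n < k_{\alpha,LS}\right)
= P_{p_0}\left(\frac{n\bar{X}_n - np_0}{\sqrt{np_0(1 - p_0)}} < \frac{k_{\alpha,LS} - np_0}{\sqrt{np_0(1 - p_0)}}\right) = \alpha,
\]
which provides $k_{\alpha,LS} = np_0 - \tau_\alpha \sqrt{np_0(1 - p_0)}$, as $\tau_{1-\alpha} = -\tau_\alpha$.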
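Finally, $(c'_{\alpha,LS}, k'_{\alpha,LS})$ are such that they satisfy
\[
E_{p_0}(\phi_{LS}(X)) = P_{p_0}\left(n\bar{X}_n < k'_{\alpha,LS}\right) + P_{p_0}\left(n\bar{X}_n > c'_{\alpha,LS}\right) = \alpha.
\]
As it is not possible to uniquely solve for $(c'_{\alpha,LS}, k'_{\alpha,LS})$ from the above equation, we put the additional
restriction that
\[
P_{p_0}\left(n\bar{X}_n < k'_{\alpha,LS}\right) = \alpha/2, \quad \text{and} \quad P_{p_0}\left(n\bar{X}_n > c'_{\alpha,LS}\right) = \alpha/2,
\]
and obtain the solutions
\[
k'_{\alpha,LS} = np_0 - \tau_{\alpha/2} \sqrt{np_0(1 - p_0)}, \quad \text{and} \quad c'_{\alpha,LS} = np_0 + \tau_{\alpha/2} \sqrt{np_0(1 - p_0)}.
\]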
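
As an illustration, the large sample critical values can be evaluated for Example 1. This is a sketch under the assumption $\alpha = 0.05$ (not fixed in the notes); the helper name and dictionary keys are hypothetical. Since $n = 30$ is moderate, the normal approximation is only rough here.

```python
from math import sqrt
from statistics import NormalDist

def large_sample_critical_values(n, p0, alpha):
    """Critical values for n * Xbar_n based on the N(0, 1) approximation."""
    std_normal = NormalDist()
    sd = sqrt(n * p0 * (1 - p0))
    tau_a = std_normal.inv_cdf(1 - alpha)       # upper-alpha point
    tau_a2 = std_normal.inv_cdf(1 - alpha / 2)  # upper-alpha/2 point
    return {
        "c": n * p0 + tau_a * sd,        # against H_{1,1}
        "k": n * p0 - tau_a * sd,        # against H_{1,2}
        "c_two": n * p0 + tau_a2 * sd,   # against H_{1,3}, upper
        "k_two": n * p0 - tau_a2 * sd,   # against H_{1,3}, lower
    }

# Example 1: n = 30 patients, p0 = 0.6, 22 cured; alpha = 0.05 is an assumption.
crit = large_sample_critical_values(30, 0.6, 0.05)
cured = 22
reject_upper = cured > crit["c"]
print(crit, reject_upper)
```

Under these assumptions the one-sided critical value is about 22.41, so the observed count of 22 does not lead to rejection by the large sample test at level 0.05.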

Fig. 2: Power curve for the exact test $\phi$ against $H_{1,2} : p < p_0$ with $p_0 = 1/4$ (left), $p_0 = 1/2$ (middle) and $p_0 = 3/4$ (right).

(iii) Power function. The power curve for the exact test $\phi$ (under the alternative $H_{1,1}$) for three choices
of $p_0 \in \{1/4, 1/2, 3/4\}$ is given in Figure 1.

The power curve for the exact test $\phi$ (under the alternative $H_{1,2}$) for three choices of $p_0 \in \{1/4, 1/2, 3/4\}$
is given in Figure 2.
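
A power curve of this kind can be tabulated directly from the binomial distribution, since $\beta_\phi(p) = P_p(n\bar{X}_n > c_\alpha) + \gamma P_p(n\bar{X}_n = c_\alpha)$ against $H_{1,1}$. The sketch below assumes $n = 30$ and $\alpha = 0.05$ (the notes do not state the values used for the figures) and takes $p_0 = 1/2$; the function names are illustrative.

```python
from math import comb

def binom_pmf(k, n, p):
    """P(S = k) for S ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def upper_tail_test(n, p0, alpha):
    """Return (c, gamma) with P(S > c) + gamma * P(S = c) = alpha under H0."""
    tail = 0.0
    for c in range(n, -1, -1):
        pmf_c = binom_pmf(c, n, p0)
        if tail + pmf_c > alpha:
            return c, (alpha - tail) / pmf_c
        tail += pmf_c
    raise ValueError("alpha must lie in (0, 1)")

def power(p, n, c, gamma):
    """Power of the randomized upper-tail test at the true proportion p."""
    return sum(binom_pmf(k, n, p) for k in range(c + 1, n + 1)) + gamma * binom_pmf(c, n, p)

# Assumed values for illustration only.
n, p0, alpha = 30, 0.5, 0.05
c, gamma = upper_tail_test(n, p0, alpha)
grid = [0.5, 0.6, 0.7, 0.8, 0.9]
powers = [power(p, n, c, gamma) for p in grid]
for p, b in zip(grid, powers):
    print(f"p = {p:.1f}  power = {b:.4f}")
```

The power equals $\alpha$ at $p = p_0$ and increases towards 1 as $p$ moves away from $p_0$ into the alternative.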

(iv) Large sample confidence interval associated with the test. The large sample test ϕLS under
the alternative H1,3 leads to a possible (large sample) confidence interval for binomial proportion p.
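Under $H_0 : p = p_0$,
\[
\frac{n\bar{X}_n - np_0}{\sqrt{np_0(1 - p_0)}} \xrightarrow[\text{under } H_0]{D} Z,
\]
where $Z \sim N(0, 1)$. Note that $\bar{X}_n \xrightarrow{p} p_0$ (by WLLN), and so by continuous mapping
\[
\sqrt{\frac{\bar{X}_n(1 - \bar{X}_n)}{p_0(1 - p_0)}} \xrightarrow{p} 1.
\]

Therefore, by Slutsky's theorem
\[
\frac{\sqrt{n}(\bar{X}_n - p_0)}{\sqrt{\bar{X}_n(1 - \bar{X}_n)}} \xrightarrow[\text{under } H_0]{D} Z,
\]
where $Z \sim N(0, 1)$. Based on this result one can obtain that
\[
P_{p_0}\left(-\tau_{\alpha/2} \leq \frac{\sqrt{n}(\bar{X}_n - p_0)}{\sqrt{\bar{X}_n(1 - \bar{X}_n)}} \leq \tau_{\alpha/2}\right) \approx 1 - \alpha,
\]
which, since $p_0$ is arbitrary, is equivalent to saying
\begin{align*}
& P_p\left(-\tau_{\alpha/2} \leq \frac{\sqrt{n}(\bar{X}_n - p)}{\sqrt{\bar{X}_n(1 - \bar{X}_n)}} \leq \tau_{\alpha/2}\right) \approx 1 - \alpha, \\
\Leftrightarrow\ & P_p\left(\bar{X}_n - \tau_{\alpha/2}\sqrt{\frac{\bar{X}_n(1 - \bar{X}_n)}{n}} \leq p \leq \bar{X}_n + \tau_{\alpha/2}\sqrt{\frac{\bar{X}_n(1 - \bar{X}_n)}{n}}\right) \approx 1 - \alpha,
\end{align*}
for sufficiently large $n$.

Therefore, $\left[\bar{X}_n - \tau_{\alpha/2}\sqrt{\bar{X}_n(1 - \bar{X}_n)/n},\ \bar{X}_n + \tau_{\alpha/2}\sqrt{\bar{X}_n(1 - \bar{X}_n)/n}\right]$ is a large sample confidence interval for $p$
with confidence coefficient approximately $1 - \alpha$.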
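
For the data of Example 1 (22 cured out of 30), this interval can be computed as follows. It is a small sketch assuming a 95% confidence coefficient; `wald_interval` is an illustrative name, not one used in the notes.

```python
from math import sqrt
from statistics import NormalDist

def wald_interval(successes, n, alpha):
    """Large sample interval Xbar +/- tau_{alpha/2} * sqrt(Xbar * (1 - Xbar) / n)."""
    p_hat = successes / n
    tau = NormalDist().inv_cdf(1 - alpha / 2)  # upper-alpha/2 point of N(0, 1)
    half_width = tau * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width

# Example 1 data; alpha = 0.05 is an assumption made for illustration.
lower, upper = wald_interval(22, 30, 0.05)
print(f"[{lower:.4f}, {upper:.4f}]")
```

The resulting interval is roughly [0.575, 0.892]; with $n = 30$ the approximation behind it should be treated with some caution.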

2 Testing of normal mean (Student t-test)

Example 2. A researcher wants to find out if university students are getting less than the recommended
8 hours of sleep per night. She randomly samples 25 students and records their sleep hours. The sample
mean turns out to be 7.5 hours, and the sample standard deviation is 1.2 hours. Is there enough evidence
to conclude that the students sleep less than 8 hours on average?
(i) Setup: $X_1, \dots, X_n \overset{iid}{\sim} N(\mu, \sigma^2)$, where $\sigma^2$ is unknown. We want to test $H_0 : \mu = \mu_0$ against
$H_{1,1} : \mu > \mu_0$ (or, $H_{1,2} : \mu < \mu_0$, or, $H_{1,3} : \mu \neq \mu_0$).

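• We obtained UMP test against $H_{1,1}$ and $H_{1,2}$ when $\sigma^2$ is known with the test statistic $\bar{X}_n$.
• In the same spirit, let us construct the following test for $H_0$ against $H_{1,1}$ (or, $H_{1,2}$) as
\[
\phi(X) =
\begin{cases}
1 & \text{if } \bar{X}_n \geq c_\alpha, \\
0 & \text{otherwise.}
\end{cases}
\]

However, the problem arises in finding the critical value as

\[
P_{H_0}(\bar{X}_n \geq c_\alpha) = \alpha \ \Leftrightarrow\ 1 - \Phi\left(\frac{c_\alpha - \mu_0}{\sigma/\sqrt{n}}\right) = \alpha,
\]

which cannot be solved when $\sigma$ is unknown.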

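• Therefore, we need a test statistic which is a function of $\bar{X}_n$, but whose distribution is
completely known under $H_0$.
• Define
\[
T_n = \frac{\bar{X}_n - \mu_0}{\hat{\sigma}_n/\sqrt{n}}, \tag{1}
\]
where $\hat{\sigma}_n^2 = (n - 1)^{-1} \sum_{i=1}^{n} (X_i - \bar{X}_n)^2$.
As $Z_n = \sqrt{n}(\bar{X}_n - \mu_0)/\sigma \overset{H_0}{\sim} N(0, 1)$, $W_n = (n - 1)\hat{\sigma}_n^2/\sigma^2 \sim \chi^2_{n-1}$ and $Z_n$ is independent of $W_n$,
we have
\[
\frac{Z_n}{\sqrt{W_n/(n - 1)}} = T_n \overset{H_0}{\sim} t_{n-1} \text{ distribution.}
\]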

• Test procedure. Based on the proposed t-statistic the test function for testing $H_0$ against
the possible alternatives is as follows:

\[
\phi(X) =
\begin{cases}
1 & \text{if }
\begin{cases}
T_n > c_\alpha & (\text{against } H_{1,1}), \\
T_n < k_\alpha & (\text{against } H_{1,2}), \\
T_n > c'_\alpha \text{ or } T_n < k'_\alpha & (\text{against } H_{1,3}),
\end{cases} \\
0 & \text{otherwise.}
\end{cases}
\]

The critical points are now determined by solving the equation $E_{\mu_0}(\phi(X)) = \alpha$, under the
distribution of $T_n$ under $H_0$. In particular, $c_\alpha = t_{\alpha,n-1}$ and $k_\alpha = t_{1-\alpha,n-1} = -t_{\alpha,n-1}$, where
$t_{\alpha,n-1}$ is the upper-$\alpha$ point of the $t_{n-1}$ distribution, i.e., $t_{\alpha,n-1}$ is such that $P(T \geq t_{\alpha,n-1} \mid T \sim t_{n-1}) = \alpha$. Further, under the additional assumption of symmetric critical region, one obtains
$c'_\alpha = t_{\alpha/2,n-1}$ and $k'_\alpha = t_{1-\alpha/2,n-1} = -t_{\alpha/2,n-1}$.
• The test statistic in (1) is called the Student t statistic, and the corresponding test is called the
Student t-test.
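
Applied to Example 2, the statistic in (1) and the one-sided decision against $H_{1,2} : \mu < 8$ can be sketched as below. The level $\alpha = 0.05$ is an assumption, and the critical value 1.711 is the standard table value of $t_{0.05,24}$, hard-coded because the Python standard library has no t quantile function.

```python
from math import sqrt

def t_statistic(xbar, mu0, sigma_hat, n):
    """Student t statistic (Xbar - mu0) / (sigma_hat / sqrt(n))."""
    return (xbar - mu0) / (sigma_hat / sqrt(n))

# Example 2: n = 25 students, sample mean 7.5 h, sample standard deviation 1.2 h.
t_stat = t_statistic(7.5, 8.0, 1.2, 25)

# Assumed level alpha = 0.05; 1.711 is the tabulated upper 5% point of t_24.
T_CRIT_05_24 = 1.711
reject = t_stat < -T_CRIT_05_24  # lower-tail test against H_{1,2}: mu < mu0
print(f"T = {t_stat:.4f}, reject H0: {reject}")
```

Here $T_n \approx -2.08$ falls below $-t_{0.05,24}$, so under these assumptions the data support the claim that students sleep less than 8 hours on average.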
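(ii) Power function. Define $\Delta = \mu - \mu_0$. Under $H_0$, $\Delta = 0$, and under the alternatives it can take
any value in $\mathbb{R} \setminus \{0\}$. When the true mean is $\mu$, we have $T_n - \sqrt{n}\Delta/\hat{\sigma}_n = \sqrt{n}(\bar{X}_n - \mu)/\hat{\sigma}_n \sim t_{n-1}$. Treating $\hat{\sigma}_n$ as fixed, the power function of the test $\phi$ under the alternative $H_{1,1}$ can be approximated as
follows:
\begin{align*}
\beta_\phi(\mu) &= P_\mu\left(T_n > t_{\alpha,n-1}\right) \\
&= P_\mu\left(T_n - \frac{\sqrt{n}\Delta}{\hat{\sigma}_n} > t_{\alpha,n-1} - \frac{\sqrt{n}\Delta}{\hat{\sigma}_n}\right) \\
&\approx 1 - G\left(t_{\alpha,n-1} - \frac{\sqrt{n}\Delta}{\hat{\sigma}_n}\right),
\end{align*}
where $G$ is the CDF of the $t_{n-1}$ distribution. As $\hat{\sigma}_n$ is random, this is only a heuristic; the exact power is obtained from the noncentral t distribution with $n - 1$ degrees of freedom and noncentrality parameter $\sqrt{n}\Delta/\sigma$.
Similarly, one can derive the power function of $\phi$ under the alternatives $H_{1,2}$ and $H_{1,3}$. The power
curve of $\phi$ under the alternative $H_{1,3}$ for $\mu_0 = 10$ is given in Figure 3.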
(iii) Large sample test. Let $T_n \sim t_n$ distribution. Then as $n \to \infty$, $T_n \xrightarrow{D} Z$ where $Z \sim N(0, 1)$.
To see this, let $X, X_1, \dots, X_n \overset{iid}{\sim} N(0, 1)$. Then
\[
T_n \overset{D}{=} \frac{X}{\sqrt{n^{-1}(X_1^2 + \dots + X_n^2)}} \sim t_n,
\]
where $U \overset{D}{=} V$ implies $U, V$ have the same distribution ($U$ and $V$ are not necessarily the same variables).
By WLLN and continuous mapping $\sqrt{n^{-1}(X_1^2 + \dots + X_n^2)} \xrightarrow{p} 1$ (verify), and therefore by Slutsky's
lemma, $T_n \xrightarrow{D} Z$ where $Z \sim N(0, 1)$.
Based on the above results a large sample version of the above t-test can be obtained as follows:

\[
\phi_{LS}(X) =
\begin{cases}
1 & \text{if }
\begin{cases}
T_n > c_{\alpha,LS} & (\text{against } H_{1,1}), \\
T_n < k_{\alpha,LS} & (\text{against } H_{1,2}), \\
T_n > c'_{\alpha,LS} \text{ or } T_n < k'_{\alpha,LS} & (\text{against } H_{1,3}),
\end{cases} \\
0 & \text{otherwise.}
\end{cases}
\]

The critical points are now determined by solving the equation $E_{\mu_0}(\phi_{LS}(X)) = \alpha$, under the large
sample distribution of $T_n$ under $H_0$. In particular, $c_{\alpha,LS} = \tau_\alpha$, $k_{\alpha,LS} = \tau_{1-\alpha} = -\tau_\alpha$, and $c'_{\alpha,LS} = \tau_{\alpha/2}$,
$k'_{\alpha,LS} = \tau_{1-\alpha/2} = -\tau_{\alpha/2}$ (the last two obtained under the additional restriction of symmetric critical
region), where $\tau_\alpha$ is the upper-$\alpha$ point of the $N(0, 1)$ distribution.

Fig. 3: Power curve for H1,3 : µ ̸= µ0 with µ0 = 10.

(iv) Confidence interval. By inverting the test statistic of the exact or large sample test of H0 : µ = µ0
against H1,3 : µ ̸= µ0 , one can obtain a confidence interval for µ.

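From the exact test above, we obtain

\begin{align*}
& P_{\mu_0}\left(k'_\alpha \leq T_n \leq c'_\alpha\right) = P_{\mu_0}\left(-t_{\alpha/2,n-1} \leq \frac{\bar{X}_n - \mu_0}{\hat{\sigma}_n/\sqrt{n}} \leq t_{\alpha/2,n-1}\right) = 1 - \alpha, \\
\Leftrightarrow\ & P_\mu\left(-t_{\alpha/2,n-1} \leq \frac{\bar{X}_n - \mu}{\hat{\sigma}_n/\sqrt{n}} \leq t_{\alpha/2,n-1}\right) = 1 - \alpha, \\
\Leftrightarrow\ & P_\mu\left(\bar{X}_n - \frac{\hat{\sigma}_n}{\sqrt{n}} t_{\alpha/2,n-1} \leq \mu \leq \bar{X}_n + \frac{\hat{\sigma}_n}{\sqrt{n}} t_{\alpha/2,n-1}\right) = 1 - \alpha.
\end{align*}

Therefore, $\left[\bar{X}_n - \hat{\sigma}_n t_{\alpha/2,n-1}/\sqrt{n},\ \bar{X}_n + \hat{\sigma}_n t_{\alpha/2,n-1}/\sqrt{n}\right]$ is a confidence interval for $\mu$ with confidence coefficient
$1 - \alpha$. Similarly, one can obtain a large sample $(1 - \alpha)$ confidence interval for $\mu$ by replacing
$t_{\alpha/2,n-1}$ in the above confidence interval by $\tau_{\alpha/2}$.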
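
For the sleep data of Example 2, both the exact and the large sample intervals can be evaluated as in the sketch below. A 95% confidence coefficient is assumed; the value 2.064 is the standard table value of $t_{0.025,24}$, hard-coded because the standard library provides only normal quantiles. The function name is illustrative.

```python
from math import sqrt
from statistics import NormalDist

def mean_interval(xbar, sigma_hat, n, quantile):
    """Interval Xbar +/- quantile * sigma_hat / sqrt(n)."""
    half_width = quantile * sigma_hat / sqrt(n)
    return xbar - half_width, xbar + half_width

# Example 2 summary statistics.
xbar, sigma_hat, n = 7.5, 1.2, 25

# Exact interval: tabulated upper 2.5% point of t_24 (assumed alpha = 0.05).
T_CRIT_025_24 = 2.064
exact = mean_interval(xbar, sigma_hat, n, T_CRIT_025_24)

# Large sample interval: replace the t quantile by the N(0, 1) quantile.
tau = NormalDist().inv_cdf(0.975)
large_sample = mean_interval(xbar, sigma_hat, n, tau)

print("exact:       ", exact)
print("large sample:", large_sample)
```

The large sample interval is slightly narrower than the exact one, because $\tau_{\alpha/2} < t_{\alpha/2,n-1}$ for every finite $n$.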

2.1 Paired-t test

Example 3. The marks obtained by 10 students in a mock test and an actual test are given below. Based
on this data, is there sufficient evidence to conclude that students score significantly lower in the mock
test compared to the actual test?

Student ID 1 2 3 4 5 6 7 8 9 10
Marks in Mock Test 45 38 40 36 41 39 35 37 42 44
Marks in Actual Test 50 42 47 38 44 43 40 42 46 47

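(i) Setup. Let $Z_1, \dots, Z_n \overset{iid}{\sim} N_2(\mu_X, \mu_Y, \sigma_X^2, \sigma_Y^2, \sigma_{XY})$ (i.e., bivariate normal distribution) and $Z_i = (X_i, Y_i)^\top$, for $i = 1, 2, \dots, n$, which implies
\[
X_i \overset{iid}{\sim} N(\mu_X, \sigma_X^2), \quad Y_i \overset{iid}{\sim} N(\mu_Y, \sigma_Y^2) \quad \text{and} \quad \text{Cov}(X_i, Y_i) = \sigma_{XY}.
\]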

To test

H0 : µX = µY (i.e., µX − µY = 0)
against
H1,1 : µX − µY > 0 (or, H1,2 : µX − µY < 0, or, H1,3 : µX − µY ̸= 0).

(ii) Test statistic. Consider the difference of the two variables $W_i = X_i - Y_i$. By properties of the bivariate
normal distribution $W_i \sim N(\mu_W, \sigma_W^2)$, where $\mu_W = \mu_X - \mu_Y$ and $\sigma_W^2 = \sigma_X^2 + \sigma_Y^2 - 2\sigma_{XY}$.

Therefore, the testing problem can be regarded as a special case of the testing of normal mean with
variance unknown, i.e., $H_0 : \mu_W = 0$ against possible alternatives.
The related test statistic is
\[
T_n = \frac{\bar{X}_n - \bar{Y}_n}{\hat{\sigma}_{W,n}/\sqrt{n}}, \tag{2}
\]
where $\hat{\sigma}_{W,n}^2 = (n - 1)^{-1} \sum_{i=1}^{n} (X_i - Y_i - \bar{X}_n + \bar{Y}_n)^2$.

(iii) The test statistic in (2) is called the paired t-test statistic, and the corresponding test is called the
paired t-test. Further, the assumption of same variances for the two normal populations, i.e.,
var(X1 ) = var(Y1 ) = σ 2 is called the homoscedasticity assumption.
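
The statistic in (2) can be computed for the marks of Example 3 as sketched below, testing $H_0 : \mu_X = \mu_Y$ against $H_{1,2} : \mu_X - \mu_Y < 0$ with $X$ the mock test mark. The level $\alpha = 0.05$ is an assumption, and 1.833 is the standard table value of $t_{0.05,9}$, hard-coded since the standard library has no t quantile function.

```python
from math import sqrt
from statistics import mean, stdev

# Example 3 data: marks of the same 10 students in the two tests.
mock = [45, 38, 40, 36, 41, 39, 35, 37, 42, 44]
actual = [50, 42, 47, 38, 44, 43, 40, 42, 46, 47]

# Differences W_i = X_i - Y_i; the paired test is a one-sample t-test on W.
w = [x - y for x, y in zip(mock, actual)]
n = len(w)

# stdev uses the (n - 1) divisor, matching sigma_hat_{W,n} in the notes.
t_stat = mean(w) / (stdev(w) / sqrt(n))

# Assumed level alpha = 0.05; tabulated upper 5% point of t_9.
T_CRIT_05_9 = 1.833
reject = t_stat < -T_CRIT_05_9
print(f"mean difference = {mean(w):.2f}, T = {t_stat:.4f}, reject H0: {reject}")
```

With a mean difference of $-4.2$ marks and $T_n \approx -9.50$, the null hypothesis is rejected under these assumptions, in line with students scoring lower in the mock test.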

3 Testing of normal variance

Example 4. A factory produces ball bearings with a target diameter of 10 mm. The manufacturing
process is designed to keep the variation (standard deviation) in the diameters small, say with a target
variance of 0.01 mm². The quality control (QC) team suspects that a recent change in materials or
machine calibration may have increased the variability of the product. They collect a random sample of
25 ball bearings and measure their diameters. Is there sufficient evidence to conclude that the variance in
ball bearing diameters has increased from the target value of 0.01 mm²?
(i) Setup: $X_1, \dots, X_n \overset{iid}{\sim} N(\mu, \sigma^2)$, where $\mu$ is unknown. We want to test $H_0 : \sigma^2 = \sigma_0^2$ against
$H_{1,1} : \sigma^2 > \sigma_0^2$ (or, $H_{1,2} : \sigma^2 < \sigma_0^2$, or, $H_{1,3} : \sigma^2 \neq \sigma_0^2$).

• We obtained UMP test against $H_{1,1}$ and $H_{1,2}$ with the test statistic $nS_n^2 = \sum_{i=1}^{n} (X_i - \bar{X}_n)^2$
(verify starting from Neyman Pearson's lemma, and then by generalizing).
• In the same spirit, we can propose the following test for testing $H_0$ against $H_{1,3}$.
\[
\phi(X) =
\begin{cases}
1 & \text{if } nS_n^2 \geq c_\alpha, \text{ or, } nS_n^2 \leq k_\alpha, \\
0 & \text{otherwise.}
\end{cases}
\]

The critical points are now determined by solving the equation $E_{\sigma_0^2}(\phi(X)) = \alpha$, under the
distribution of $nS_n^2$ under $H_0$. As
\[
\frac{nS_n^2}{\sigma_0^2} \overset{H_0}{\sim} \chi^2_{n-1},
\]

Fig. 4: The power curve of the test function ϕ for testing H0 : σ 2 = 25 against H13 : σ 2 ̸= 25.

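the above equation provides

\[
P_{\sigma_0^2}\left(\frac{nS_n^2}{\sigma_0^2} \geq \frac{c_\alpha}{\sigma_0^2}\right) + P_{\sigma_0^2}\left(\frac{nS_n^2}{\sigma_0^2} \leq \frac{k_\alpha}{\sigma_0^2}\right) = \alpha.
\]
As two unknowns $(c_\alpha, k_\alpha)$ cannot be solved from one equation, we additionally impose the
restriction of symmetry as follows:
\[
P_{\sigma_0^2}\left(\frac{nS_n^2}{\sigma_0^2} \geq \frac{c_\alpha}{\sigma_0^2}\right) = \frac{\alpha}{2}, \quad P_{\sigma_0^2}\left(\frac{nS_n^2}{\sigma_0^2} \leq \frac{k_\alpha}{\sigma_0^2}\right) = \frac{\alpha}{2},
\]

and obtain the solutions $c_\alpha = \sigma_0^2 \chi^2_{\alpha/2,n-1}$, $k_\alpha = \sigma_0^2 \chi^2_{1-\alpha/2,n-1}$, where $\chi^2_{\alpha,n-1}$ is the upper-$\alpha$
point of the $\chi^2_{n-1}$ distribution, i.e., $\chi^2_{\alpha,n-1}$ is such that $P(T \geq \chi^2_{\alpha,n-1} \mid T \sim \chi^2_{n-1}) = \alpha$.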
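(ii) Power function. The power curve of the above test can be derived as
\begin{align*}
\beta_\phi(\sigma^2) &= P_{\sigma^2}\left(\frac{nS_n^2}{\sigma^2} \geq \frac{\sigma_0^2}{\sigma^2} \chi^2_{\alpha/2,n-1}\right) + P_{\sigma^2}\left(\frac{nS_n^2}{\sigma^2} \leq \frac{\sigma_0^2}{\sigma^2} \chi^2_{1-\alpha/2,n-1}\right) \\
&= 1 - G\left(\frac{\sigma_0^2}{\sigma^2} \chi^2_{\alpha/2,n-1}\right) + G\left(\frac{\sigma_0^2}{\sigma^2} \chi^2_{1-\alpha/2,n-1}\right),
\end{align*}
where $G$ is the CDF of the $\chi^2_{n-1}$ distribution. The power curve of the test function for testing $H_0 : \sigma^2 = 25$ against $H_{1,3} : \sigma^2 \neq 25$ is given in Figure 4.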
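
The power formula above can be evaluated numerically. The sketch below uses only the Python standard library: the $\chi^2$ CDF is computed from the series expansion of the regularized lower incomplete gamma function, and upper-$\alpha$ points are found by bisection. The values $n = 25$ and $\alpha = 0.05$ are assumptions (the notes do not state those behind Figure 4), and all function names are illustrative.

```python
from math import exp, lgamma, log

def chi2_cdf(x, df):
    """CDF of chi-square(df) via the series for the regularized lower incomplete gamma."""
    if x <= 0:
        return 0.0
    a, z = df / 2.0, x / 2.0
    term = 1.0 / a
    total = term
    k = 1
    while term > 1e-16 * total:
        term *= z / (a + k)
        total += term
        k += 1
    return total * exp(-z + a * log(z) - lgamma(a))

def chi2_upper_point(alpha, df):
    """Upper-alpha point: the x with P(T >= x) = alpha for T ~ chi-square(df)."""
    lo, hi = 0.0, df + 20.0 * (2.0 * df) ** 0.5 + 20.0
    for _ in range(200):
        mid = (lo + hi) / 2.0
        if 1.0 - chi2_cdf(mid, df) > alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def power(sigma2, sigma0_2, n, alpha):
    """Power of the symmetric two-sided test of H0: sigma^2 = sigma0^2."""
    df = n - 1
    ratio = sigma0_2 / sigma2
    upper = chi2_upper_point(alpha / 2.0, df)
    lower = chi2_upper_point(1.0 - alpha / 2.0, df)
    return 1.0 - chi2_cdf(ratio * upper, df) + chi2_cdf(ratio * lower, df)

# sigma0^2 = 25 as in Figure 4; n = 25 and alpha = 0.05 are assumed.
for sigma2 in (10.0, 25.0, 50.0):
    print(f"sigma^2 = {sigma2:5.1f}  power = {power(sigma2, 25.0, 25, 0.05):.4f}")
```

At $\sigma^2 = \sigma_0^2$ the two terms contribute $\alpha/2$ each, so the power equals $\alpha$, which provides a quick sanity check on the implementation.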
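(iii) Confidence interval. By inverting the test statistic of the test of $H_0 : \sigma^2 = \sigma_0^2$ against $H_{1,3} : \sigma^2 \neq \sigma_0^2$, one can obtain a confidence interval for $\sigma^2$.
From the exact test above, we obtain

\begin{align*}
& P_{\sigma_0^2}\left(k_\alpha \leq nS_n^2 \leq c_\alpha\right) = P_{\sigma_0^2}\left(\sigma_0^2 \chi^2_{1-\alpha/2,n-1} \leq nS_n^2 \leq \sigma_0^2 \chi^2_{\alpha/2,n-1}\right) = 1 - \alpha, \\
\Leftrightarrow\ & P_{\sigma^2}\left(\sigma^2 \chi^2_{1-\alpha/2,n-1} \leq nS_n^2 \leq \sigma^2 \chi^2_{\alpha/2,n-1}\right) = 1 - \alpha, \\
\Leftrightarrow\ & P_{\sigma^2}\left(\frac{nS_n^2}{\chi^2_{\alpha/2,n-1}} \leq \sigma^2 \leq \frac{nS_n^2}{\chi^2_{1-\alpha/2,n-1}}\right) = 1 - \alpha.
\end{align*}

Therefore, $\left[\dfrac{nS_n^2}{\chi^2_{\alpha/2,n-1}},\ \dfrac{nS_n^2}{\chi^2_{1-\alpha/2,n-1}}\right]$ is a confidence interval for $\sigma^2$ with confidence coefficient $1 - \alpha$.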

4 Two-sample normal mean test

Example 5. A school wants to know if a new teaching method improves student test scores compared to
the traditional method.
• Group 1: 30 students taught with the new method. Sample mean score = 78, sample standard
deviation = 10.
• Group 2: 28 students taught with the traditional method. Sample mean score = 74, sample standard
deviation = 11.
Is there a significant difference in mean test scores between the two methods?
(i) Setup: $X_1, \dots, X_n \overset{iid}{\sim} N(\mu_X, \sigma^2)$ and $Y_1, \dots, Y_m \overset{iid}{\sim} N(\mu_Y, \sigma^2)$. Further, $\{X_1, \dots, X_n, Y_1, \dots, Y_m\}$
are mutually independent, and $\sigma^2$ is unknown.
We want to test $H_0 : \mu_X = \mu_Y$ against $H_{1,1} : \mu_X > \mu_Y$ (or, $H_{1,2} : \mu_X < \mu_Y$, or, $H_{1,3} : \mu_X \neq \mu_Y$).
(ii) Test statistic:
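• As before, we need a test statistic based on $(\bar{X}_n - \bar{Y}_m)$. [However, it does not make sense to
consider $(X_i - Y_i)$ as the samples are not paired.]
• Further, we need a test statistic whose distribution, under $H_0$, is completely known.
• Observe that
\[
\bar{X}_n \sim N\left(\mu_X, \frac{\sigma^2}{n}\right), \quad \text{and} \quad \bar{Y}_m \sim N\left(\mu_Y, \frac{\sigma^2}{m}\right), \quad \text{independently,}
\]
which implies
\[
\bar{X}_n - \bar{Y}_m \sim N\left(\mu_X - \mu_Y, \sigma^2\left(\frac{1}{m} + \frac{1}{n}\right)\right), \quad \text{i.e.,} \quad \frac{\bar{X}_n - \bar{Y}_m - (\mu_X - \mu_Y)}{\sigma\sqrt{\frac{1}{m} + \frac{1}{n}}} \sim N(0, 1).
\]

• Also,
\[
\frac{nS_X^2}{\sigma^2} \sim \chi^2_{n-1}, \quad \text{and} \quad \frac{mS_Y^2}{\sigma^2} \sim \chi^2_{m-1}, \quad \text{independently, implying,} \quad \frac{nS_X^2 + mS_Y^2}{\sigma^2} \sim \chi^2_{n+m-2},
\]
where $S_X^2$ and $S_Y^2$ are sample variances of $\{X_1, \dots, X_n\}$ and $\{Y_1, \dots, Y_m\}$, respectively, i.e.,
$nS_X^2 = \sum_{i=1}^{n} (X_i - \bar{X}_n)^2$, $mS_Y^2 = \sum_{j=1}^{m} (Y_j - \bar{Y}_m)^2$.
• Finally, note that all the four quantities $\bar{X}_n$, $\bar{Y}_m$, $S_X^2$, $S_Y^2$ are mutually independent (WHY?), which implies that $\bar{X}_n - \bar{Y}_m$ is independent of $nS_X^2 + mS_Y^2$.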

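• Combining the last three points, we get

\[
\frac{\bar{X}_n - \bar{Y}_m - (\mu_X - \mu_Y)}{\sigma\sqrt{\frac{1}{m} + \frac{1}{n}}} \Bigg/ \sqrt{\frac{nS_X^2 + mS_Y^2}{\sigma^2 (n + m - 2)}}
= \frac{\bar{X}_n - \bar{Y}_m - (\mu_X - \mu_Y)}{\hat{\sigma}_{n,m}\sqrt{\frac{1}{n} + \frac{1}{m}}} \sim t_{n+m-2},
\]
where $\hat{\sigma}_{n,m}^2 = (n + m - 2)^{-1}(nS_X^2 + mS_Y^2)$.
Therefore, under $H_0$ (where $\mu_X - \mu_Y = 0$),
\[
T_{n,m} = \frac{\bar{X}_n - \bar{Y}_m}{\hat{\sigma}_{n,m}\sqrt{\frac{1}{n} + \frac{1}{m}}}
= \frac{\bar{X}_n - \bar{Y}_m}{\sigma\sqrt{\frac{1}{m} + \frac{1}{n}}} \Bigg/ \sqrt{\frac{nS_X^2 + mS_Y^2}{\sigma^2 (n + m - 2)}} \overset{H_0}{\sim} t_{m+n-2}. \tag{3}
\]
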
(iii) Derive the complete test function, the power function and corresponding confidence interval based
on the test based on the above test statistic.
(iv) The test statistic in (3) is called the two-sample student t-test statistic, and the corresponding test
is called the two-sample student t-test.
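
For Example 5, the statistic in (3) can be evaluated from the summary statistics as sketched below. Two assumptions are made: the reported sample standard deviations use the $(n - 1)$ divisor, so that $nS_X^2 = (n - 1) s_X^2$ in the notation of the notes, and the level is $\alpha = 0.05$. The critical value 2.003 is an approximate table value of $t_{0.025,56}$, hard-coded because the standard library has no t quantile function.

```python
from math import sqrt

def pooled_t(xbar, s_x, n, ybar, s_y, m):
    """Two-sample Student t statistic with pooled variance; returns (T, df)."""
    ss_x = (n - 1) * s_x**2  # sum of squares, n * S_X^2 in the notes
    ss_y = (m - 1) * s_y**2  # sum of squares, m * S_Y^2 in the notes
    sigma_hat = sqrt((ss_x + ss_y) / (n + m - 2))
    t_stat = (xbar - ybar) / (sigma_hat * sqrt(1 / n + 1 / m))
    return t_stat, n + m - 2

# Example 5: new method (n = 30, mean 78, sd 10), traditional (m = 28, mean 74, sd 11).
t_stat, df = pooled_t(78, 10, 30, 74, 11, 28)

# Assumed alpha = 0.05, two-sided; approximate upper 2.5% point of t_56.
T_CRIT_025_56 = 2.003
reject = abs(t_stat) > T_CRIT_025_56
print(f"T = {t_stat:.4f} on {df} degrees of freedom, reject H0: {reject}")
```

Under these assumptions $T_{n,m} \approx 1.45$, which lies inside $(-2.003, 2.003)$, so the two-sided test does not reject $H_0$ at the 5% level.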
(v) Finally when min{m, n} → ∞, an asymptotic version of the above test can be formed, where the
critical values are obtained from the normal distribution.

5 Testing equality of two normal variances

Example 6. A factory uses two different machines (Machine A and Machine B) to produce metal rods.
The management wants to check if the consistency (variation in rod length) is the same for both machines.
• From Machine A, they take a sample of 20 rods and find a sample variance of 0.04 cm².
• From Machine B, they take a sample of 25 rods and find a sample variance of 0.06 cm².
Is there a significant difference between the variances of the two machines?
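(i) Setup: $X_1, \dots, X_n \overset{iid}{\sim} N(\mu_X, \sigma_X^2)$, $Y_1, \dots, Y_m \overset{iid}{\sim} N(\mu_Y, \sigma_Y^2)$, and $\{X_1, \dots, X_n, Y_1, \dots, Y_m\}$ are
mutually independent, where $\mu_X$, $\mu_Y$, $\sigma_X^2$ and $\sigma_Y^2$ are unknown.
We want to test
\[
H_0 : \sigma_X^2 = \sigma_Y^2 \Leftrightarrow \frac{\sigma_X^2}{\sigma_Y^2} = 1 \quad \text{against} \quad H_{1,1} : \frac{\sigma_X^2}{\sigma_Y^2} > 1 \ \left(\text{or, } H_{1,2} : \frac{\sigma_X^2}{\sigma_Y^2} < 1, \text{ or, } H_{1,3} : \frac{\sigma_X^2}{\sigma_Y^2} \neq 1\right).
\]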
(ii) Test statistic.
• As the test concerns the ratio of the population variances, it is natural to consider a test statistic
based on the ratio of the sample variances $S_X^2 / S_Y^2$.
• Recall that
\[
\frac{nS_X^2}{\sigma_X^2} \sim \chi^2_{n-1}, \quad \text{and} \quad \frac{mS_Y^2}{\sigma_Y^2} \sim \chi^2_{m-1}, \quad \text{independently, implying,} \quad \frac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)} \cdot \frac{\sigma_Y^2}{\sigma_X^2} \sim F_{n-1,m-1}.
\]
Therefore, under $H_0$, we have
\[
T_{n,m} = \frac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)} \overset{H_0}{\sim} F_{n-1,m-1}.
\]
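(iii) Based on the above test statistic, the following test function can be derived:

\[
\phi(X) =
\begin{cases}
1 & \text{if }
\begin{cases}
T_{n,m} > c_\alpha & (\text{against } H_{1,1}), \\
T_{n,m} < k_\alpha & (\text{against } H_{1,2}), \\
T_{n,m} > c'_\alpha \text{ or } T_{n,m} < k'_\alpha & (\text{against } H_{1,3}),
\end{cases} \\
0 & \text{otherwise.}
\end{cases}
\]

The critical points are now determined by solving the equation $E_{H_0}(\phi(X)) = \alpha$, under the distribution
of $T_{n,m}$ under $H_0$. In particular, $c_\alpha = F_{\alpha,n-1,m-1}$, $k_\alpha = F_{1-\alpha,n-1,m-1}$, and $c'_\alpha = F_{\alpha/2,n-1,m-1}$,
$k'_\alpha = F_{1-\alpha/2,n-1,m-1}$, where $F_{\alpha,n-1,m-1}$ is the upper-$\alpha$ point of the $F_{n-1,m-1}$ distribution, i.e., $F_{\alpha,n-1,m-1}$
is such that $P(F \geq F_{\alpha,n-1,m-1} \mid F \sim F_{n-1,m-1}) = \alpha$.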
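
The two-sided F-test can be illustrated on Example 6. The sketch below is self-contained and uses only the standard library: the F CDF is obtained by composite Simpson integration of the density (adequate when the numerator degrees of freedom exceed 2) and upper-$\alpha$ points by bisection. It assumes that the reported sample variances use the $(n - 1)$ divisor, so that $T_{n,m}$ is their ratio, and that $\alpha = 0.05$; the function names are illustrative.

```python
from math import exp, lgamma, log

def f_pdf(x, d1, d2):
    """Density of the F distribution with (d1, d2) degrees of freedom."""
    if x <= 0:
        return 0.0
    log_beta = lgamma(d1 / 2) + lgamma(d2 / 2) - lgamma((d1 + d2) / 2)
    log_f = ((d1 / 2) * log(d1 / d2) + (d1 / 2 - 1) * log(x)
             - ((d1 + d2) / 2) * log(1 + d1 * x / d2) - log_beta)
    return exp(log_f)

def f_cdf(x, d1, d2, steps=2000):
    """CDF by composite Simpson's rule on [0, x]; intended for d1 > 2."""
    if x <= 0:
        return 0.0
    h = x / steps
    total = f_pdf(0.0, d1, d2) + f_pdf(x, d1, d2)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f_pdf(i * h, d1, d2)
    return total * h / 3

def f_upper_point(alpha, d1, d2):
    """Upper-alpha point: the x with P(F >= x) = alpha."""
    lo, hi = 0.0, 1.0
    while 1 - f_cdf(hi, d1, d2) > alpha:
        hi *= 2
    for _ in range(50):
        mid = (lo + hi) / 2
        if 1 - f_cdf(mid, d1, d2) > alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# Example 6: Machine A (n = 20, variance 0.04), Machine B (m = 25, variance 0.06).
n, m, alpha = 20, 25, 0.05
t_stat = 0.04 / 0.06
c_crit = f_upper_point(alpha / 2, n - 1, m - 1)      # c'_alpha
k_crit = f_upper_point(1 - alpha / 2, n - 1, m - 1)  # k'_alpha
reject = t_stat > c_crit or t_stat < k_crit
print(f"T = {t_stat:.4f}, k' = {k_crit:.4f}, c' = {c_crit:.4f}, reject H0: {reject}")
```

Numerical integration is used here only to avoid external dependencies; a statistical library's F quantile function would normally be preferred where available.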

Fig. 5: Power function for the F-test for $H_0 : \sigma_X^2 = \sigma_Y^2$ against $H_{1,1} : \sigma_X^2 > \sigma_Y^2$ (top), $H_{1,2} : \sigma_X^2 < \sigma_Y^2$ (middle),
and $H_{1,3} : \sigma_X^2 \neq \sigma_Y^2$ (bottom).

(iv) Power function. Define $\sigma^2 = \sigma_X^2 / \sigma_Y^2$. Under $H_0$, $\sigma^2 = 1$, and under the alternatives it can take any
positive value other than 1. The power function of the test $\phi$ under the alternative $H_{1,1}$ can be derived as follows:

\begin{align*}
\beta_\phi(\sigma^2) &= P_{\sigma^2}\left(T_{n,m} > F_{\alpha,n-1,m-1} \mid T_{n,m}/\sigma^2 \sim F_{n-1,m-1}\right) \\
&= P_{\sigma^2}\left(T_{n,m}/\sigma^2 > F_{\alpha,n-1,m-1}/\sigma^2 \mid T_{n,m}/\sigma^2 \sim F_{n-1,m-1}\right) \\
&= 1 - G\left(F_{\alpha,n-1,m-1}/\sigma^2\right),
\end{align*}

where $G$ is the CDF of the $F_{n-1,m-1}$ distribution.

Similarly, one can derive the power function of $\phi$ under the alternatives $H_{1,2}$ and $H_{1,3}$. The graphs
of the power function of $\phi$ under the possible alternatives are given in Figure 5.

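(v) Confidence interval. Based on the above test one can form a confidence interval for the ratio of
variances of two normal populations. From the test we get

\[
P_{\sigma_X^2/\sigma_Y^2 = 1}\left(k'_\alpha \leq T_{n,m} \leq c'_\alpha\right) = P_{\sigma_X^2/\sigma_Y^2 = 1}\left(F_{1-\alpha/2,n-1,m-1} \leq \frac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)} \leq F_{\alpha/2,n-1,m-1}\right) = 1 - \alpha.
\]
For a general $\sigma_X^2/\sigma_Y^2$ (not necessarily 1), we have

\begin{align*}
& P_{\sigma_X^2/\sigma_Y^2}\left(F_{1-\alpha/2,n-1,m-1} \leq \frac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)} \cdot \frac{\sigma_Y^2}{\sigma_X^2} \leq F_{\alpha/2,n-1,m-1}\right) = 1 - \alpha \\
\Leftrightarrow\ & P_{\sigma_X^2/\sigma_Y^2}\left(\frac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)\, F_{\alpha/2,n-1,m-1}} \leq \frac{\sigma_X^2}{\sigma_Y^2} \leq \frac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)\, F_{1-\alpha/2,n-1,m-1}}\right) = 1 - \alpha.
\end{align*}
Therefore, $\left[\dfrac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)\, F_{\alpha/2,n-1,m-1}},\ \dfrac{nS_X^2/(n - 1)}{mS_Y^2/(m - 1)\, F_{1-\alpha/2,n-1,m-1}}\right]$ is a confidence interval for
$\sigma_X^2/\sigma_Y^2$ with confidence coefficient $1 - \alpha$.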

6 Exercises
1. Suppose that the proportion p of defective items in a large population of items is unknown, and
that it is desired to test the following hypotheses:

H0 : p = 0.2 vs H1 : p ̸= 0.2.

Suppose also that a random sample of 20 items is drawn from the population. Let Y denote the
number of defective items in the sample, and consider a test procedure ϕ such that the critical
region contains all the outcomes for which either Y ≥ 7 or Y ≤ 1.

(a) Determine the value of the power function βϕ (p) at the points p = 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6,
0.7, 0.8, 0.9 and 1; and sketch the power function.
(b) Determine the size of the test.

2. An auto manufacturer gives a warranty for 3 years for its new vehicles. In a random sample of 60
of its vehicles, 20 of them needed five or more major repairs within the warranty period. Find
the (large sample) confidence interval for the true proportion of vehicles from this manufacturer
that need five or more major repairs during the warranty period, with confidence coefficient 0.95.
Interpret the result.

3. Suppose that a random sample of 10,000 observations is taken from the normal distribution with
unknown mean µ and known variance 1, and it is desired to test the following hypotheses at the
level of significance 0.05:
H0 : µ = 0 vs H1 : µ ̸= 0.
Suppose also that the test procedure specifies rejecting H0 when |X̄n | ≥ c, where the constant c is
chosen so that P (|X̄n | ≥ c | µ = 0) = 0.05. Find the probability that the test will reject H0 if (a)
the actual value of µ is 0.01, and (b) the actual value of µ is 0.02.

4. A random sample of size 50 from a particular brand of tea packets produced a mean weight of
15.65 ounces and standard deviation of 0.59 ounce. Assume that the weights of these brands of tea
packets are normally distributed. Find a 95% confidence interval for the true mean µ.

5. Fifteen vehicles were observed at random for their speeds (in mph) on a highway with speed limit
posted as 70 mph, and it was found that their average speed was 73.3 mph. Suppose that from past
experience we can assume that vehicle speeds are normally distributed with σ = 3.2. Construct a
90% confidence interval for the true mean speed µ, of the vehicles on this highway. Interpret the
result.

6. Studies have shown that the risk of developing coronary disease increases with the level of obesity,
or accumulation of body fat. A study was conducted on the effect of exercise on losing weight. Fifty
men who exercised lost an average of 11.4 lb, with a standard deviation of 4.5 lb. Construct a 95%
confidence interval for the mean weight loss through exercise. Interpret the result and state any
assumptions you have made.

7. Two statistics professors want to estimate average scores for an elementary statistics course that has
two sections. Each professor teaches one section and each section has a large number of students.
A random sample of 50 scores from each section produced the following results:

(a) Section I: x̄1 = 77.01, s1 = 10.32

(b) Section II: x̄2 = 72.22, s2 = 11.02

Calculate 95% confidence intervals for the mean score of each of these two sections.

8. Suppose that $X_1, \dots, X_m$ form a random sample from the normal distribution with unknown mean
$\mu_1$ and unknown variance $\sigma_1^2$, and that $Y_1, \dots, Y_n$ form an independent random sample from the
normal distribution with unknown mean $\mu_2$ and unknown variance $\sigma_2^2$. Suppose also that it is
desired to test the following hypotheses with the usual F-test at the level of significance $\alpha = 0.05$:

$H_0 : \sigma_1^2 = \sigma_2^2$ vs $H_1 : \sigma_1^2 > \sigma_2^2$.

Assuming that $m = 16$ and $n = 21$, show that the power of the test when $\sigma_1^2 = 2\sigma_2^2$ is given by
$P(V \geq 1.1)$, where $V$ is a random variable having the F-distribution with 15 and 20 degrees of
freedom.

9. The scores of a random sample of 16 people who took the TOEFL (Test of English as a Foreign
Language) had a mean of 540 and a standard deviation of 50. Construct a 95% confidence interval
for the population mean µ of the TOEFL score, assuming that the scores are normally distributed.

10. The following data represent the rates (micrometers per hour) at which a razor cut made in the
skin of anesthetized newts is closed by new cells.

28, 20, 21, 39, 32, 23, 18, 31, 14, 23, 18, 22, 28, 24, 33, 12, 23, 21, 25, and 25

(a) Find the 95% confidence interval for population mean rate µ for the new cells to close a razor
cut made in the skin of anesthetized newts.
(b) Find a 99% confidence interval for µ. Is the 95% CI wider or narrower than the 99% CI? Briefly
explain why.
(c) Find the 95% confidence interval for the population variance $\sigma^2$.

11. A study of two kinds of machine failures shows that 58 failures of the first kind took on the average
79.7 minutes to repair with a standard deviation of 18.4 minutes, whereas 71 failures of the second
kind took on average 87.3 minutes to repair with a standard deviation of 19.5 minutes. Find a 99%
confidence interval for the difference between the true average amounts of time it takes to repair
failures of the two kinds of machines.

12. The management of a supermarket wanted to study the spending habits of its male and female
customers. A random sample of 16 male customers who shopped at this supermarket showed that
they spent an average of $55 with a standard deviation of $12. Another random sample of 25
female customers showed that they spent an average of $85 with a standard deviation of $20.50. Assuming that
the amounts spent at this supermarket by all its male and female customers were approximately
normally distributed, construct a 90% confidence interval for the ratio of the variances in spending for
males and females, $\sigma_1^2/\sigma_2^2$.

13. The following information was obtained from two independent samples selected from two normally
distributed populations with unknown but equal variances.

• Sample I: 14, 15, 12, 13, 6, 14, 11, 12, 17, 19, 23.
• Sample II: 16, 18, 12, 20, 15, 19, 15, 22, 20, 18, 23, 12, 20.

Test whether the difference of the population means is equal to zero or not. Construct a 95%
confidence interval for the difference between the population means and interpret.
