Chapter 2 Design of Experiment
Chapter 2 Design of Experiment
Chapter - Two
Contents
This is one way of making inference about the population parameter where the investigator does
not have any prior notion about values or characteristics of the population parameter.
Estimation: is a process by which we estimate the unknown population parameter from sample statistic.
[email protected]
10
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Point Estimation
Confidence Level: The percent of the time the true value will lie in the interval estimate given.
Degrees of Freedom: The number of data values which are allowed to vary once a statistic has been
determined.
Hypothesis Testing
- This is also one way of making inference about population parameter, where the investigator has prior
notion about the value of the parameter.
Definitions:
[email protected]
11
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
- Test statistic: is a statistics whose value serves to determine whether to reject or accept the
hypothesis to be tested.
- Statistic test: is a test or procedure used to evaluate a statistical hypothesis and its value depends on
sample data.
There are two types of Hypothesis:
Null hypothesis:
- The following table gives a summary of possible results of any hypothesis test:
Decision
[email protected]
12
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
1. First specify the null hypothesis (H0) and the alternative hypothesis (H1).
6. Making decision.
7. Making conclusion.
When we deal with the inference about the difference in means, two things will come into our
mind. The first is independent sample and the second is paired sample. In the case of paired
sample, the sample is selected from a single population and these sample elements may be
treated at two different circumstances at different period of time such as pre and post, before and
after and the like. Moreover, the sample size at two different situations and the subjects are the
same.
In the case of independent samples, the samples are selected from different or the same
population independently. That is, the subjects are different and the sample sizes may or may not
be the same.
2.1.1 independent samples
Comparative studies are designed to discover and evaluate difference between treatments
(groups). In situations where we are making inferences about µ1 - µ2 based on random samples
independently selected from two populations to make inference on the difference between two
populations mean. The population and sample size may not be equal.
Case One: Consider the situation in which we are independently selecting random samples from
two populations that have normal distributions with different means µ1 and µ2 and population
standard deviations δ1 and δ2, are known (If the two populations variance are known), whatever
[email protected]
13
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
the sample size (i.e. it may be less than 30, or greater than or equal to 30), then the appropriate
test statistics is.
(̅ ̅ ) ( )
Zcal = , If we have the same standard deviation on the two population, then the
√
(̅ ̅ ) ( )
test statistics becomes: Zcal =
√
The hypothesis test statistics about the difference between two population means. As with any
test procedure, we begin by specifying a hypothesis for the difference in population means.
Then one can formulate the hypothesis as follows
Hypothesis Test Statistics Criteria for Rejection
Reject H0 if | |> Zα/2
(̅ ̅ ) ( )
Zcal = Reject H0 if Zcal > Zα
√
A 100 (1 - α) % confidence interval for the difference in mean for the two independent
samples when the population variances are known whatever the sample size is:
(̅ ̅ ) Zα/2 √ (̅ ̅ ) Zα/2 √ , Or
(̅ ̅ ) ± Zα/2 √
Example: To compare average score of urban and rural students’ independent samples
of n1 10 for urban, n2 10 for rural applicants to a college were selected. The sample
mean scores were calculated to be 16.015 and 16.005 for urban and rural respectively.
Past appearance shows that the admission test scores for both urban and rural students
B, Find a 95 percent confidence interval on the difference in the mean of the urban and
rural students.
[email protected]
14
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Solution:-
Given
Urban Rural
A,
Step 1: versus
Step 2: 0.05
Step 3: Identify the test statistic
Z – Test Because, The Population standard deviation is Known.
Step 4. Determine the test Statistic
(̅ ̅ ) ( ) ( ) ( )
Zcal =
√ √
[email protected]
15
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
B,
(̅ ̅ ) Zα/2 √ (̅ ̅ ) Zα/2 √
( ) 1.96 √ ( ) 1.96 √
Conclusion: This implies at 5% level of significance, we have enough evidence to conclude that
there is no difference (equal) between scores mean of urban and rural students.
Or we have 95% confident that the difference scores mean of urban and rural students are found
or lie between -0.0045 and 0.0245.
Case Two: Consider the situation in which we are independently selecting random samples from
two populations that have normal distributions with different means µ1 and µ2 and standard
deviations δ1 and δ2 are unknown (Population variance are unknown) that means sample variance
is known but large sample size (n ), the appropriate test statistics is Z - test statistics.
(̅ ̅ ) ( )
Zcal =
√
[email protected]
16
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
A 100 (1-α) % confidence interval for the difference in mean for the two independent
samples when the sample variances are known with large sample size is:
(̅ ̅ ) Zα/2 √ (̅ ̅ ) Zα/2 √
Or
̅ ̅ ± Zα/2 √
located throughout a country for the purpose of comparing the retail prices per pound
of coffee of brands A and B. The results of the investigation are summarized below.
n1 = 75 n2 = 64
̅ =3 ̅ = 2.95
S1 = 0.11 S2 = 0.09
A, Test the hypothesis that the mean retail price per pound of brand A coffee is
significantly higher than the mean retail price per pound of brand B coffee? Use a level
of significance α = 0.01.
B, Find a 99 percent confidence interval on the difference in the mean retail price per
pound of brand A coffee and in the mean retail price per pound of brand B coffee.
Solution:-
Given
n1 = 75 n2 = 64
̅ =3 ̅ = 2.95
S1 = 0.11 S2 = 0.09
[email protected]
17
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
A,
Step 1: H0: µ1 - µ2 = 0 (µ1 = µ2) (i.e., no difference between mean retail prices for brand A and B)
H1: µ1 - µ2 > 0 (µ1 > µ2) (i.e., mean retail price per pound of brand A is higher than that of
brand B).
Where: µ1= Mean retail price per pound of brand A coffee at all super-markets.
µ2= Mean retail price per pound of brand B coffee at all super-markets.
NB: Zα/2 = =
(̅ ̅ ) Zα/2 √ (̅ ̅ ) Zα/2 √
[email protected]
18
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Or we have 99% confident that the mean difference for retail price per pound of brand A coffee
and the mean retail price per pound of brand B coffee is lie between 0.0062 and 0.0938.
Case Three: If the two population variance is assumed to be equal ( 12 22 ), but unknown and
(̅ ̅ ) ( )
If two population variances are assumed to be equal but, the common variance 2 is unknown
we have to estimate 2 by pooled sample variance S p2 .
( ) ( )
= Pooled Variance,
Where are the variance of the first group and the Variance of the second group
respectively.
(̅ ̅ ) ( ) ( )
Reject H0 if
√ ( )
Reject H0 if
[email protected]
19
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
A 100(1 − )% confidence interval for the difference in mean for the two independent
samples if is :-
( ) ( )
(̅ ̅ ) ⁄ √ (̅ ̅ ) ⁄ √
( ) ( )
(( ̅ ̅ ) ⁄ √ (̅ ̅ ) ⁄ √ )
Example, A new filtering device is installed in a chemical unit; Before its installation
a random sample yielded the following information about the percentage of impurity:
Assume the population variance are assumed to be equal ( ) but unknown then
A, Test the hypothesis at 0.05, is there a significant difference in the filtering device
percentage of impurity.
B, Make the 95% confidence interval in the mean difference of filtering device percentage of
impurity.
Solution:-
Given
[email protected]
20
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
A, Step1: versus
Step 2: 0.05
Step 3: Identify the test statistic
t – Test, because, the population variance is assumed to be equal and sample size is small.
Step 4. Determine the test Statistic
(̅ ̅ ) ( ) ( ) ( )
√ √
( )
Step 5: Identify the critical region, ⁄ = 2.131
( )
Step 6: Make Decision, Reject H0 if | | ⁄
( ) ( )
(̅ ̅ ) ⁄ √ (̅ ̅ ) ⁄ √
( ) ( ) √ ( ) ( ) √
Or we have 95% confident that that the difference mean of new filtering device before
installation and after installation is lie between -7.9409 and 12.5409.
[email protected]
21
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Case Four: If the two population variance is assumed to be unequal ( 12 22 ), and unknown
and n1 and n2 are small (( )), then we estimate 12 and 22 by S12 and S 22 . In
this case: The test statistics becomes: -
2
S12 S 22
(̅ ̅ ) ( ) n1 n2
( ) , Where U 2 2
S12 S 22
√
n1 n2
n1 1 n2 1
Then one can formulate the hypothesis as follows
(̅ ̅ ) ( ) ( )
Reject H0 if
√ ( )
Reject H0 if
A 100(1 − )% confidence interval for the difference in mean for the two independent
samples if is :-
{( ̅ ̅ ) ( )√ (̅ ̅ ) ( )√ }
Example: An experimenter was an interested in dieting and weight loss among men and women.
It was believed that in the first two weeks of standard dieting program, men would tend to loss
less weight than women. As check on this notion, a random sample of 15 women’s and 15 men
were put on the same diet and their mean weight losses are ̅ pound for men and
̅ pounds for women with variances of 2.56 and 0.31 respectively.
A, Did men weight losses significantly less than women at 5% level of significance?
Assume that the two populations are normally distributed with unequal variances
( ).
B, Make the 95% confidence interval in the mean difference of weight losses.
[email protected]
22
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Solution:
Given
n1 n2 15 ,
̅ , ̅
S12 2.56 , S 22 0.31
A,
Step1: 1 2 0 versus H 1 : 1 2 0
Step2: 0.05
Step 3: Identify the test statistic
t – Test, because the population variance is assumed to be unequal and small sample size.
Step 4: Determine the test Statistic
(̅ ̅ ) ( ) ( ) ( )
√ √
[email protected]
23
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
(̅ ̅ ) ( )√ (̅ ̅ ) ( )√
2
S12 S 22 2.56 0.31
2
Where, U n1 n2
15 15
17
2 2 2 2
S12 S 22 2.56 0.31
n1 n2 15 15
n1 1 n2 1 15 1 15 1
( ) √ ( ) √
In this section more emphasis is given to selecting similar units. Select similar units, form pairs
and then apply the treatments or apply the same treatment on the same units but before and after
some condition.
The aim of paring is to make more accurate comparison by having members in pair as like as
possible except difference due to treatments that the investigator deliberately introduces.
o In paired sample the sample size in each treatment must be equal but the sample size is
small (i.e. n ).
The actual analysis of paired data requires us to compute the differences in the n pairs of
measurements, = , and, obtain ̅ , SD, the mean and standard deviations in the
Dis. Also, we must formulate the hypotheses about the mean of the differences, µD = 0.
[email protected]
24
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
The conditions required developing at procedure for testing hypotheses and constructing
confidence intervals for µD are:
The test Statistics is as follows:-
̅ ∑ ∑( ̅) ∑ ̅
tcal = , where ̅ √
√
A 100(1-α) % confidence interval for the difference in mean for the paired samples is:
Example: The following are physiological measures taken of patients before and after the
administration of some medication. Assume that the data are from normally distributed
populations.
Patient 1 2 3 4 5 6 7 8 9 10
Before 101 96 98 102 95 99 107 103 102 104
After 111 110 107 101 121 115 122 118 123 105
Di -10 -14 -9 1 -26 -16 -15 -15 -21 -1
a) Test for significant difference between the mean measures before and after the
medication at 5% level of significance.
b) Construct a 95% confidence interval for the true mean change.
[email protected]
25
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Solution:
n
D i
126
DD i 1
12.6
n 10
D D
n
2
i
S D2 i 1
68.27 S D 68.27 8.26
n 1
S D2 S 8.26
S .E ( D ) Var( D ) D 2.61
n n 10
H0 : D 0
a), Step 1: versus H 1 : D 0
Step 2: 0.05
Step 3: Identify the test statistic.
The test statistic is t – test because the sample size is small.
Step 4: Determine the test statistic.
D D 12.6 0
= 4.83
S D2 n 2.61
[email protected]
26
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Note: To test whether the population variance 2 , is equal to some value o2 or not, we
use the sampling distribution of sample variance. If sample size n is taken from normally
distributed population , then
( )
, Then one can formulate the hypothesis as follows
Reject H0 if ( )
( )
Reject H0 if ( )
A 100 1 % Confidence Interval (CI) for true value of population variance 2 is:
( ) ( )
( ) ( )
( ) ( )
Or ( )
( ) ( )
Note: If you get once the CI for 2 , then a 100 1 % Confidence Interval (CI) for can be
[email protected]
27
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Example 1: Based on past data, the variance of a normal population is hypothesized to be 48. If
a sample of 15 observations yields a variance of 56.
A, Test the hypothesis that the variance has increased? Use 1% level of significance.
B, Construct 99% confidence interval for population variance 2 .
A,
H 0 : 2 48
.
Step 1:
H 1 : 2 48
Step 2: 0.01
Step 3: Identify the test statistic. The test statistic is .
(n 1) S 2 14 * 56
Step 4: Determine the test statistic, cal
2
16.33
2
0 48
Step 5: Identify the critical value, the critical value is 2 (n 1) 02.01 (14 ) 29 .141
n 1S 2 n 1S 2 15 156 15 156
2 , 2 = 2 , 2
, (n 1) 1 , (n 1) 0.01 , (15 1) 1 0.01 , (15 1)
2 2 2 2
[email protected]
28
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Example 2: A company manufacturing radio tubes for the last 10 years found that the life of
their tubes has variance of 0.6 year2. As a result of some qualitative improvement, the company
claims that the variance of life of their tubes has decreased. If the sample variance of 9 randomly
selected tubes is found to be 0.45 year2
A, Using 0.05 level of significance, test the claim made by the company.
B, Make the 95% confidence interval for population variance, .
Solution: Given: 02 0.6, n 9 and S 2 0.45
Step 1: versus
Step 2: 0.05
Step 5: Identify the critical value, the critical value is 12 (n 1) 120.05 (8) 02.95 (8) 2.733
n 1S 2 n 1S 2 9 10.45 9 10.45
2 , 2 = 2 , 2
, (n 1) 1 , (n 1) 0.05 , (9 1) 1 0.05 , (9 1)
2 2 2 2
9 10.45 9 10.45
= , = (0.2053, 1.6514)
17.535 2.180
Conclusion: we have 95% confident that the population variance is lie between 0.2053 and
1.6514.
[email protected]
29
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
B. Hypothesis Testing and Interval Estimation for Comparing Two Population Variances
Another major application of a test for the equality of two population variances is for
A statistical test comparing 1 and 2 utilizes the test statistic s12 / s22 . When 12 =
2 2
Reject H0 if > ( )
NB: ( ) ( )
[email protected]
30
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
When comparing population variances, 12 / 22 the appropriate measure is the ratio of the population
A 100 1 % confidence interval for the ratio the two population variance ( 12 / 22 ) is given by.
( ) ( )
( ( ) ( ))
Note: If you get once the CI for , then a 100 1 % Confidence Interval (CI) for the ratio of
population standard deviation can be obtained by taking a positive square root of confidence
limits to .
Example: Suppose that independent random samples, one consisting 13 cases and the other
consisting 9 cases were drawn from two normal populations. Sample standard deviations are
found to be 48.1 and 89.2 respectively.
A, Test whether the two populations have equal variances at 5% level of significance.
B, Make 95% confidence interval for the ratio of the two population variance ( 1 / 2 ).
2 2
A,
12
. H0 : 1
22
Step 1:
12
H1 : 2 1
2
Step 2: 0.05
Step 3: Identify the test statistic:
[email protected]
31
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
S12 48.12
Step 4: Determine the test statistic, Fcal 2 0.29
S 2 89.2 2
Step 5: Identify the critical value, the critical value is
F (n1 1, n2 1) F0.025 (12, 8) 4.20
2
, so don’t reject .
Step 7: Conclusion
Therefore, at 5% level of significance we have enough evidence to conclude that , there is no
significance difference between the two variances (the two variances are equal). Or we have
found that insufficient statistical evidence to conclude that the variance of the two population
variance is different.
( ) ( )
( ) ( ) ( )
,
( ) ( ) ( ) ( )
( )
Conclusion: we have 95% confident that the ratio of population variance ( ) is lie between
[email protected]
32
Lecture notes for Design and analysis of experiments (Stat 2043) Chapter - 2
Exercise: A chemical engineer is investigating the inherent variability of two types of test
equipment that can be used to monitor the output of a production process. He suspects that the old
equipment , type 1 , has a larger variance than the new one .
B, Make 95% confidence interval for the ratio of the two population variance ( 1 / 2 ).
2 2