Chap13 2012
Design of Experiments
Introduction
• “Listening” or passive statistical tools: control
charts.
• “Conversational” or active tools: Experimental
design.
– Planning of experiments
– A sequence of experiments
13.1 A Simple Example of
Experimental Design Principles
• The objective is to compare 4 different brands of tires for
tread wear using 16 tires (4 of each brand) and 4 cars in an
experiment.
• Illogical Design:
– Randomly assign the 16 tires to the four cars
– Give each car all 4 tires of a single brand (brand effect confounded
with differences between cars, drivers, and driving conditions)
– Give each car one tire of each brand, arranged as in the table below
(poor design because brands A and B would be used only on the front of
each car, and brands C and D would be used only on the rear positions.
Brand effect would be confounded with the position effect.)
Wheel Car
Position 1 2 3 4
LF A B A B
RF B A B A
LR D C D C
RR C D C D
• Logical Design:
– Each brand is used once at each position, as well as once with
each car.
Wheel Car
Position 1 2 3 4
LF A B C D
RF B A D C
LR C D A B
RR D C B A
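The difference between the two layouts can be checked mechanically. A minimal sketch, assuming each design is stored as a list of rows (wheel positions) by columns (cars); the function name `is_latin_square` is only illustrative:

```python
# Rows are wheel positions (LF, RF, LR, RR); columns are cars 1-4.
latin_square = [
    ["A", "B", "C", "D"],
    ["B", "A", "D", "C"],
    ["C", "D", "A", "B"],
    ["D", "C", "B", "A"],
]
poor_design = [
    ["A", "B", "A", "B"],
    ["B", "A", "B", "A"],
    ["D", "C", "D", "C"],
    ["C", "D", "C", "D"],
]


def is_latin_square(design):
    """True if every brand occurs exactly once in each row and each column."""
    brands = {brand for row in design for brand in row}
    square = len(design) == len(brands) and all(len(row) == len(brands) for row in design)
    rows_ok = all(set(row) == brands for row in design)
    cols_ok = all(set(col) == brands for col in zip(*design))
    return square and rows_ok and cols_ok


print(is_latin_square(latin_square))  # True
print(is_latin_square(poor_design))   # False: A and B never reach the rear wheels
```

In the poor design every car (column) still carries all four brands, so only the row check fails, which is exactly the brand/position confounding.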
13.2 Principles of
Experimental Design
• Processes should be in a state of statistical control
when designed experiments are carried out.
• It is desirable to use experimental design and
statistical process control methods together.
• General guidelines on the design of experiments:
1. Recognition of and statement of the problem
2. Choice of factors and levels
3. Selection of the response variable(s)
4. Choice of experimental design
5. Conducting the experiment
6. Data analysis
7. Conclusions and recommendations
• The levels of each factor used in an experimental run
should be reset before the next experimental run.
13.3 Statistical Concepts in
Experimental Design: Example
• Assume that the objective is to determine the effect of two
different levels of temperature on process yield, where the
current temperature is 250F and the experimental setting
is 300F.
• Assume that temperature is the only factor that is to be
varied.
Day 250F 300F
M 2.4 2.6
Tu 2.7 2.4
W 2.2 2.8
Th 2.5 2.5
F 2 2.2
M 2.5 2.7
Tu 2.8 2.3
W 2.9 3.1
Th 2.4 2.9
F 2.1 2.2
Observations:
• Neither temperature setting is uniformly superior to the
other over the entire test period.
• The fact that the lines are fairly close together would
suggest that increasing temperature may not have a
perceptible effect on the process yield.
• The yield at each temperature setting is the lowest on
Friday of each week.
• There is considerable variability within each temperature
setting.
13.4 t-Tests
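• The t statistic is of the general form
$t = \dfrac{\hat{\theta} - \theta}{s_{\hat{\theta}}}$ (13.1)
where $\hat{\theta}$ is the estimator of the parameter $\theta$ and $s_{\hat{\theta}}$ is its estimated standard deviation.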
13.4.1 Exact t-Test
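• The exact t-test is of the form
$t = \dfrac{(\bar{x}_1 - \bar{x}_2) - 0}{s_p\sqrt{1/n_1 + 1/n_2}}$ (13.2)
where $s_p$ is the square root of the estimate of the (assumed)
common variance ($\sigma_1^2 = \sigma_2^2 = \sigma^2$)
• For the temperature data, t = -0.893 with n1 + n2 - 2 = 18 degrees of freedom, and Prob(t_18 < -0.893) ≈ 0.192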
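The pooled statistic can be reproduced from the yield table. A minimal sketch using only the Python standard library; the helper name `pooled_t` is illustrative, and no p-value is computed because the standard library has no t distribution:

```python
from math import sqrt
from statistics import mean, variance

# Yields from the temperature example (two weeks, Monday to Friday).
yield_250 = [2.4, 2.7, 2.2, 2.5, 2.0, 2.5, 2.8, 2.9, 2.4, 2.1]
yield_300 = [2.6, 2.4, 2.8, 2.5, 2.2, 2.7, 2.3, 3.1, 2.9, 2.2]


def pooled_t(x1, x2):
    """Exact (pooled-variance) two-sample t statistic and its degrees of freedom."""
    n1, n2 = len(x1), len(x2)
    # Pooled estimate of the assumed common variance.
    sp2 = ((n1 - 1) * variance(x1) + (n2 - 1) * variance(x2)) / (n1 + n2 - 2)
    t = (mean(x1) - mean(x2)) / sqrt(sp2 * (1 / n1 + 1 / n2))
    return t, n1 + n2 - 2


t_stat, df = pooled_t(yield_250, yield_300)
print(round(t_stat, 3), df)  # -0.893 18
```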
13.4.1 Assumptions for
Exact t-Test
• $\sigma_1^2 = \sigma_2^2$ should be checked. (This assumption is not crucial when
n1=n2.)
• The two samples are independent.
• The observations are independent within each sample.
13.4.2 Approximate t-Test
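• If n1 and n2 differ considerably and it is not known whether $\sigma_1^2 = \sigma_2^2$, an
approximate t-test is used
$t^* = \dfrac{(\bar{x}_1 - \bar{x}_2) - 0}{\sqrt{s_1^2/n_1 + s_2^2/n_2}}$ (13.3)
where the degrees of freedom is commonly calculated with the Welch-Satterthwaite approximation as
$df = \dfrac{(s_1^2/n_1 + s_2^2/n_2)^2}{\dfrac{(s_1^2/n_1)^2}{n_1 - 1} + \dfrac{(s_2^2/n_2)^2}{n_2 - 1}}$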
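A minimal sketch of the approximate test, applied to the temperature data for illustration. The degrees-of-freedom formula here is the usual Welch-Satterthwaite form, which is an assumption because the formula on the original slide did not survive; `welch_t` is an illustrative name.

```python
from math import sqrt
from statistics import mean, variance

yield_250 = [2.4, 2.7, 2.2, 2.5, 2.0, 2.5, 2.8, 2.9, 2.4, 2.1]
yield_300 = [2.6, 2.4, 2.8, 2.5, 2.2, 2.7, 2.3, 3.1, 2.9, 2.2]


def welch_t(x1, x2):
    """Approximate t statistic with Welch-Satterthwaite degrees of freedom (assumed form)."""
    v1 = variance(x1) / len(x1)  # s1^2 / n1
    v2 = variance(x2) / len(x2)  # s2^2 / n2
    t = (mean(x1) - mean(x2)) / sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (len(x1) - 1) + v2 ** 2 / (len(x2) - 1))
    return t, df


t_approx, df_approx = welch_t(yield_250, yield_300)
print(round(t_approx, 3))  # -0.893, same as the exact test because n1 = n2
print(df_approx)           # slightly below 18
```

With equal sample sizes the statistic coincides with the exact t; only the degrees of freedom change, and only slightly because the two sample variances are close.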
13.4.3 Confidence Intervals for
Differences
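• 100(1-α)% confidence bounds for $\mu_1 - \mu_2$: $(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2,\,n_1+n_2-2}\; s_p\sqrt{1/n_1 + 1/n_2}$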
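A minimal sketch of a 95% interval for the difference in mean yield, working from the summary statistics of the example. It assumes the pooled-variance form of the interval, and it takes the t critical value as the square root of the tabled F crit (4.4139 for 1 and 18 df) rather than from a t table.

```python
from math import sqrt

# Summary statistics from the SUMMARY table (250F vs 300F).
n1, mean1, var1 = 10, 2.45, 0.087222222
n2, mean2, var2 = 10, 2.57, 0.093444444

# Assumption: two-sided 5% critical value of t with 18 df, obtained as sqrt(F crit).
t_crit = sqrt(4.4139)

# Pooled standard deviation and the half-width of the interval.
sp = sqrt(((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2))
half_width = t_crit * sp * sqrt(1 / n1 + 1 / n2)

diff = mean1 - mean2
lower, upper = diff - half_width, diff + half_width
print(round(lower, 3), round(upper, 3))  # -0.402 0.162
```

The interval contains 0, which agrees with the non-significant t and F tests for these data.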
SUMMARY
Groups Count Sum Average Variance
250F 10 24.5 2.45 0.087222222
300F 10 25.7 2.57 0.093444444
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 0.072 1 0.072 0.797 0.3838 4.4139
Within Groups 1.626 18 0.0903
Total 1.698 19
13.5 Analysis of Variance (ANOVA)
for One Factor: Example
Output from Minitab
Source DF SS MS F P
Temp 1 0.0720 0.0720 0.80 0.384
Error 18 1.6260 0.0903
Total 19 1.6980
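• The sum of squares for the factor is
$SS_{\text{factor}} = \sum_{i=1}^{k} \dfrac{T_i^2}{n_i} - \dfrac{T^2}{N}$ (13.4)
where $T_i$ represents the total of the obs for the ith level,
$k$ is the number of levels of the factor,
$n_i$ represents the number of obs for the ith level,
$T$ denotes the grand total of all obs,
$N$ is the total number of obs.
• For the example, $SS_{\text{Temp}} = \dfrac{24.5^2 + 25.7^2}{10} - \dfrac{50.2^2}{20} = 126.074 - 126.002 = 0.072$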
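The one-factor ANOVA table for the example can be rebuilt from the raw yields. A minimal sketch, assuming the usual computing formulas (level totals squared over level counts, minus the correction term); `one_way_anova` is an illustrative name.

```python
groups = {
    "250F": [2.4, 2.7, 2.2, 2.5, 2.0, 2.5, 2.8, 2.9, 2.4, 2.1],
    "300F": [2.6, 2.4, 2.8, 2.5, 2.2, 2.7, 2.3, 3.1, 2.9, 2.2],
}


def one_way_anova(groups):
    """Return (SS between, SS within, SS total, F) for a one-factor layout."""
    all_obs = [y for obs in groups.values() for y in obs]
    n_total, k = len(all_obs), len(groups)
    correction = sum(all_obs) ** 2 / n_total  # (grand total)^2 / N
    ss_between = sum(sum(obs) ** 2 / len(obs) for obs in groups.values()) - correction
    ss_total = sum(y * y for y in all_obs) - correction
    ss_within = ss_total - ss_between
    f_ratio = (ss_between / (k - 1)) / (ss_within / (n_total - k))
    return ss_between, ss_within, ss_total, f_ratio


ss_b, ss_w, ss_t, f_ratio = one_way_anova(groups)
print(round(ss_b, 3), round(ss_w, 3), round(ss_t, 3), round(f_ratio, 3))
# 0.072 1.626 1.698 0.797
```

The F ratio is the square of the exact t statistic (-0.893), as expected for a factor with two levels.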
13.5.1 ANOVA for a Single Factor
with More than Two Levels
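• Total sum of squares: $SS_{\text{total}} = \sum_{i}\sum_{j} y_{ij}^2 - \dfrac{\left(\sum_{i}\sum_{j} y_{ij}\right)^2}{N}$, with the sums taken over all N obs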
SUMMARY
Groups Count Sum Average Variance
250F 10 24.5 2.45 0.087222
300F 10 25.7 2.57 0.093444
350F 10 29.8 2.98 0.079556
ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 1.544667 2 0.772333 8.903928 0.001072 3.354131
Within Groups 2.342 27 0.086741
Total 3.886667 29
13.5.1 ANOVA for a Single Factor
with More than Two Levels: Example
Output from Minitab
One-way ANOVA: Yield versus Temp
Source DF SS MS F P
Temp 2 1.5447 0.7723 8.90 0.001
Error 27 2.3420 0.0867
Total 29 3.8867
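The raw 350F yields are not listed, but the three-level table can still be checked from the SUMMARY rows alone. A minimal sketch under that assumption, using each level's count, sum, and variance; the F crit value is the one quoted in the spreadsheet output.

```python
# (count, sum, variance) for each level, copied from the SUMMARY table.
summary = {
    "250F": (10, 24.5, 0.087222),
    "300F": (10, 25.7, 0.093444),
    "350F": (10, 29.8, 0.079556),
}

n_total = sum(n for n, _, _ in summary.values())
grand_total = sum(total for _, total, _ in summary.values())
k = len(summary)

# Between-levels SS from the level totals; within-levels SS from the level variances.
ss_between = sum(total ** 2 / n for n, total, _ in summary.values()) - grand_total ** 2 / n_total
ss_within = sum((n - 1) * var for n, _, var in summary.values())
f_ratio = (ss_between / (k - 1)) / (ss_within / (n_total - k))

print(f"{ss_between:.4f} {ss_within:.4f} {f_ratio:.2f}")  # 1.5447 2.3420 8.90
print(f_ratio > 3.354131)  # True: exceeds the tabled F crit for 2 and 27 df
```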
Randomized block design
LF A B A B
RF B A B A
LR D C D C
RR C D C D
Latin square design
LF A B C D
RF B A D C
LR C D A B
RR D C B A
13.5.4 Additional Terms and Concepts
in One-Factor ANOVA
• Regression model for one-factor ANOVA:
$y_{ij} = \mu + \tau_j + \varepsilon_{ij}$ (13.6)
where j denotes the jth level of the single factor,
$y_{ij}$ represents the ith obs for the jth level,
$\tau_j$ represents the effect of the jth level,
$\mu$ is a constant,
$\varepsilon_{ij}$ represents the error term
• If the effects were all the same,
$y_{ij} = \mu + \varepsilon_{ij}$ (13.7)
• The F-test determines whether the appropriate model is
(13.6) or (13.7)
• Factors are generally classified as fixed (only the levels used, e.g.
250F, 300F, 350F, are of interest) or random (the levels are a random
sample from a population of possible levels)
• Data in one-factor ANOVA are analyzed in the same way
regardless of whether the factor is fixed or random, but
the interpretation does differ.
• The level effect $\tau_j$ is a constant if the factor is fixed, and a random variable
if the factor is random.
• The error term is NID(0, $\sigma^2$) in both cases.
• The obs $y_{ij}$ are assumed to be normally distributed in both cases
• The obs $y_{ij}$ are not independent in the random-factor case.
• The data in the temperature example were “balanced” in
that there was the same number of obs for each level of
the factor.
13.6 Regression Analysis of Data from
Designed Experiments
• Regression and ANOVA both could be used as methods
of analysis.
• Regression provides the tools for residual analysis, and
the estimation of parameters.
• For fixed factors, ANOVA should be supplemented or
supplanted by regression analysis.
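• The least squares estimators in regression analysis
result from minimizing the sum of squared errors,
$\sum_j\sum_i (y_{ij} - \mu - \tau_j)^2$, where $y_{ij}$ is the ith obs at level j and $\tau_j$ is the effect of level j.
With the constraint $\sum_j \tau_j = 0$ and balanced data, $\hat{\mu} = \bar{y}_{..}$ and $\hat{\tau}_j = \bar{y}_{.j} - \bar{y}_{..}$,
so that the estimates are simple functions of the means of the data.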
• The effect $\tau_j$ of the jth level can be thought of as a deviation from the
overall mean $\mu$.
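For the three temperature levels, the deviation view can be illustrated with the level averages from the SUMMARY table. A minimal sketch, assuming balanced data and the usual constraint that the estimated effects sum to zero; the variable names are illustrative.

```python
# Level averages from the SUMMARY table (10 obs per level, so the data are balanced).
level_means = {"250F": 2.45, "300F": 2.57, "350F": 2.98}

# With balanced data the overall mean is the plain average of the level means.
overall_mean = sum(level_means.values()) / len(level_means)

# Each estimated effect is the level mean minus the overall mean.
effects = {level: m - overall_mean for level, m in level_means.items()}

print(round(overall_mean, 4))  # 2.6667
for level, effect in effects.items():
    print(level, round(effect, 4))
# 250F -0.2167
# 300F -0.0967
# 350F 0.3133
```

The three estimated effects sum to zero (up to rounding), which is what the constraint requires.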
[Figure: plot of the response against temperature (T1, T2), with one line for each pressure level (P1, P2)]
13.7.1.1 Conditional Effects
• Factor effects are generally called main effects.
• Conditional effects (simple effects): the effects of one
factor at each level of another factor.
13.7.2 Effect Estimates
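• Temperature effect: average of the effects of temperature at P1
and P2.
• Pressure effect: average of the effects of pressure at T1 and T2.
[Figure: plot of the response against temperature (T1, T2), with one line for each pressure level (P1, P2)]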
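• Interaction effect: half the difference between the effect of temperature at P2 and the effect of temperature at P1.
[Figure: plot of the response against temperature (T1, T2), with one line for each pressure level (P1, P2)]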
13.7.2 Effect Estimates
• Temperature effect: T = 0
• Pressure effect: P = 0
• Interaction effect: TP = -10
[Figure: plot of the response against temperature (T1, T2), with one line for each pressure level (P1, P2)]
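The cell values behind these estimates did not survive in the slides, so the sketch below uses HYPOTHETICAL responses chosen only because they give T = P = 0 and TP = -10. It shows how conditional effects, main effects, and the interaction effect relate in an unreplicated two-factor, two-level design.

```python
# HYPOTHETICAL cell responses (not from the source), chosen to give T = P = 0, TP = -10.
y = {
    ("T1", "P1"): 10, ("T2", "P1"): 20,
    ("T1", "P2"): 20, ("T2", "P2"): 10,
}

# Conditional (simple) effects: the effect of one factor at each level of the other.
t_at_p1 = y[("T2", "P1")] - y[("T1", "P1")]
t_at_p2 = y[("T2", "P2")] - y[("T1", "P2")]
p_at_t1 = y[("T1", "P2")] - y[("T1", "P1")]
p_at_t2 = y[("T2", "P2")] - y[("T2", "P1")]

# Main effects are averages of the conditional effects.
T = (t_at_p1 + t_at_p2) / 2
P = (p_at_t1 + p_at_t2) / 2
# Interaction: half the difference between the conditional effects.
TP = (t_at_p2 - t_at_p1) / 2

# For an unreplicated 2x2 design each sum of squares equals the squared effect estimate.
ss = {"T": T ** 2, "P": P ** 2, "TP": TP ** 2}

print(t_at_p1, t_at_p2)  # 10 -10
print(T, P, TP)          # 0.0 0.0 -10.0
print(ss)                # {'T': 0.0, 'P': 0.0, 'TP': 100.0}
```

Both main effects are zero even though temperature changes the response by 10 units at each pressure level, which is why conditional effects should be examined when the interaction is large.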
13.7.3 ANOVA Table for
Unreplicated Two-Factor Design
ANOVA
Source of Variation SS df MS F
T 0 1 0
P 0 1 0
TP (residual) 100 1 100
Total 100 3
• When both factors are fixed, the main effects and the
interaction are tested against the residual.
• When both factors are random, the main effects are tested
against the interaction effect, and the interaction effect is
tested against the residual.
• When one factor is fixed and the other random, the fixed
factor is tested against the interaction, the random factor is
tested against the residual, and the interaction is tested
against the residual.
13.7.4 Yates’s Algorithm
• For any $2^k$ design, where k is the number of factors and 2
is the number of levels of each factor, any treatment
combination can be represented by the presence or
absence of each of k lowercase letters, where presence
would denote the high level, and absence the low level.
• For example, if k = 2: ab = (A high, B high); a = (A high, B low); b = (A
low, B high); (1) = (A low, B low)
A
Low High
B Low 10, 12, 16 8, 10, 13
High 14, 12, 15 12, 15, 16
• The procedure is initiated by writing down the treatment
combinations in standard order:
– (1) is always written first
– The other combinations are listed relative to the natural
ordering, including combinations of letters (for two factors:
(1), a, b, ab)
Treatment Combination Total (1) (2) SS
(1) 38
a 31
b 41
ab 43
• The columns designated by (1) and (2) are columns in
which addition and subtraction are performed for each
ordered pair of numbers. (In general, there will be k such
columns for k factors.)
• Specifically, the numbers in each pair are first added,
and then the first number in each pair is subtracted
from the second number.
Treatment Combination Total (1) (2) SS
(1) 38 69=38+31
a 31 84=41+43
b 41 -7=31-38
ab 43 2=43-41
Treatment Combination Total (1) (2) SS
(1) 38 69 153=69+84
a 31 84 -5=-7+2
b 41 -7 15=84-69
ab 43 2 9=2-(-7)
• The process is continued on each new column that is
created until the number of such columns is equal to
the number of factors.
• The last column that is created by these operations is
used to compute the sum of squares for each effect.
• Specifically, each number (except the first) is squared
and divided by the number of replicates times $2^k$.
Treatment Combination Total (1) (2) SS
(1) 38 69 153
a 31 84 -5 (-5)^2/(3*2^2)=2.08 (A)
b 41 -7 15 (15)^2/(3*2^2)=18.75 (B)
ab 43 2 9 (9)^2/(3*2^2)=6.75 (AB)
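The add-then-subtract procedure can be written as a short routine. A minimal sketch, assuming the treatment totals are supplied in standard order and that there are two levels per factor; the function name `yates` is illustrative.

```python
def yates(totals, replicates):
    """Yates's algorithm for a two-level factorial.

    totals: treatment totals in standard order, e.g. (1), a, b, ab.
    Returns the final column and the sums of squares for the effects.
    """
    n = len(totals)
    k = n.bit_length() - 1  # number of factors
    if n < 2 or 2 ** k != n:
        raise ValueError("number of totals must be a power of 2")
    column = list(totals)
    for _ in range(k):  # one new column per factor
        pairs = [(column[i], column[i + 1]) for i in range(0, n, 2)]
        sums = [first + second for first, second in pairs]
        diffs = [second - first for first, second in pairs]
        column = sums + diffs
    # Each entry except the first is squared and divided by replicates * 2^k.
    ss = [value ** 2 / (replicates * n) for value in column[1:]]
    return column, ss


column, ss = yates([38, 31, 41, 43], replicates=3)
print(column)                     # [153, -5, 15, 9]
print([round(s, 2) for s in ss])  # [2.08, 18.75, 6.75]  (A, B, AB)
```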
• The first number in the last column is actually the sum
of all of the obs (153 in this example).
ANOVA
Source of Variation SS df MS F
A 2.08 1 2.08 <1
B 18.75 1 18.75 3.36
AB 6.75 1 6.75 1.21
Residual 44.67 8 5.58
Total 72.25 11
$F_{1,8,.95} = 5.32$
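The full ANOVA table can be rebuilt from the twelve observations. A minimal sketch, assuming the effect sums of squares come from the usual contrasts of the treatment totals (equivalent to the last Yates column) and the residual is obtained by subtraction; the critical value 5.32 is the tabled one quoted above.

```python
# Observations for each treatment combination (3 replicates each).
cells = {
    "(1)": [10, 12, 16],
    "a": [8, 10, 13],
    "b": [14, 12, 15],
    "ab": [12, 15, 16],
}
totals = {name: sum(obs) for name, obs in cells.items()}
all_obs = [y for obs in cells.values() for y in obs]
n_total, replicates = len(all_obs), 3

# Contrasts of the treatment totals for each effect.
contrasts = {
    "A": totals["a"] + totals["ab"] - totals["(1)"] - totals["b"],
    "B": totals["b"] + totals["ab"] - totals["(1)"] - totals["a"],
    "AB": totals["(1)"] + totals["ab"] - totals["a"] - totals["b"],
}
ss = {name: c ** 2 / (replicates * 4) for name, c in contrasts.items()}

ss_total = sum(y * y for y in all_obs) - sum(all_obs) ** 2 / n_total
ss_resid = ss_total - sum(ss.values())
ms_resid = ss_resid / (n_total - 4)  # 8 residual degrees of freedom

f_crit = 5.32  # tabled F(1, 8) at the .95 level, as quoted in the slide
for name in ("A", "B", "AB"):
    f_ratio = ss[name] / ms_resid
    print(name, round(ss[name], 2), round(f_ratio, 2), f_ratio > f_crit)
# A 2.08 0.37 False
# B 18.75 3.36 False
# AB 6.75 1.21 False
print(round(ss_resid, 2), round(ss_total, 2))  # 44.67 72.25
```

None of the three F ratios exceeds 5.32, so neither main effect nor the interaction is significant at the .05 level for these data.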
Source DF SS MS F P
B 1 18.7500 18.7500 3.36 0.104
A 1 2.0833 2.0833 0.37 0.558
Interaction 1 6.7500 6.7500 1.21 0.304
Error 8 44.6667 5.5833
Total 11 72.2500