Anova Zoom
Dung Nguyen
Probability and Statistics
Anova
Outline I
1 Anova
2 Post-hoc analysis
3 Further discussion
1 Anova
Example 1
A manufacturer of paper used for making
grocery bags is interested in improving
the product’s tensile strength. Product
engineering believes that tensile strength
is a function of the hardwood
concentration in the pulp and that the
range of hardwood concentrations of
practical interest is between 5% and 20%.
A team of engineers responsible for the
study decides to investigate four levels
of hardwood concentration (5%, 10%, 15%,
and 20%), with six observations per level.
Dung Nguyen Probability and Statistics 6/50
Target
Effect of “Hardwood concentration” on
“Tensile strength”.
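Four levels: $H_0: \mu_1 = \mu_2 = \mu_3 = \mu_4$ versus $H_1: \exists\, i \neq k,\ \mu_i \neq \mu_k$.
In general: $H_0: \mu_1 = \mu_2 = \cdots = \mu_I$ versus $H_1: \exists\, i \neq k,\ \mu_i \neq \mu_k$.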
[Figure: two groups of observations with sample means $\bar{x}_1 = 0$ and $\bar{x}_2 = 10$.]
Are $\mu_1$ and $\mu_2$ the same or different?
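The error SOS
$$SSE = \sum_{i=1}^{I} \sum_{j=1}^{N_i} (x_{ij} - \bar{x}_i)^2$$
[Figure: observations by treatment, with treatment means 10.00, 15.67, 17.00, 21.17 and grand mean 383/24 ≈ 15.96.]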
$$SST = 7^2 + 8^2 + \cdots + 20^2 - \frac{383^2}{24} = 512.9583.$$
$$SSTr = \frac{1}{6}\left(60^2 + 94^2 + 102^2 + 127^2\right) - \frac{383^2}{24} = 382.7917.$$
$$SSE = 512.9583 - 382.7917 = 130.1667.$$
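The three sums of squares can be checked numerically. The sketch below assumes the 24 tensile-strength observations from the hardwood-concentration data in this deck (six per level); the function and variable names are illustrative only.

```python
# Tensile strength by hardwood concentration (six observations per level).
data = {
    "5%":  [7, 8, 15, 11, 9, 10],
    "10%": [12, 17, 13, 18, 19, 15],
    "15%": [14, 18, 19, 17, 16, 18],
    "20%": [19, 25, 22, 23, 18, 20],
}

def sums_of_squares(groups):
    """Return (SST, SSTr, SSE) for a one-way layout."""
    all_obs = [x for g in groups for x in g]
    n_total = len(all_obs)
    correction = sum(all_obs) ** 2 / n_total          # (grand total)^2 / N
    sst = sum(x * x for x in all_obs) - correction
    sstr = sum(sum(g) ** 2 / len(g) for g in groups) - correction
    return sst, sstr, sst - sstr                      # SSE = SST - SSTr

sst, sstr, sse = sums_of_squares(list(data.values()))
print(f"SST = {sst:.4f}, SSTr = {sstr:.4f}, SSE = {sse:.4f}")
# SST = 512.9583, SSTr = 382.7917, SSE = 130.1667
```

The shortcut form (sum of squares minus a correction term) mirrors the hand computation; the error term is obtained by subtraction, as on the slide.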
The statistic
The mean square for treatment:
MSTr = SSTr/ df(SSTr) = SSTr/(I − 1).
The mean square for error:
MSE = SSE/ df(SSE) = SSE/(N − I).
Consider the following statistic
$$F = \frac{MSTr}{MSE} = \frac{SSTr/(I-1)}{SSE/(N-I)}.$$
If $H_0$ is true then $F \sim F(I-1, N-I)$.
ANOVA Table
Source of variation   Df   Sum of squares   Mean square   F
Treatment              3       382.79          127.60     19.60
Error                 20       130.17            6.51
Total                 23       512.96
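The mean squares and the F ratio in this table follow directly from the sums of squares and their degrees of freedom. A minimal sketch, assuming I = 4 levels and N = 24 observations as in Example 1 (the helper name `f_statistic` is hypothetical):

```python
def f_statistic(sstr, sse, num_levels, num_obs):
    """Return (MSTr, MSE, F) for a one-way ANOVA."""
    mstr = sstr / (num_levels - 1)        # df(SSTr) = I - 1
    mse = sse / (num_obs - num_levels)    # df(SSE) = N - I
    return mstr, mse, mstr / mse

mstr, mse, f = f_statistic(382.7917, 130.1667, num_levels=4, num_obs=24)
print(f"MSTr = {mstr:.4f}, MSE = {mse:.4f}, F = {f:.4f}")
```

With unrounded mean squares the ratio is about 19.605; the 19.60 in the table is what results from dividing the rounded values 127.60 and 6.51.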
Example 2
Consider the following computer output for
a balanced design.
Source of variation   Df   Sum of squares   Mean square   F
Treatment              ?        ?               39.1       ?
Error                  ?      396.8              ?
Total                 19      514.2
Fill in the missing information in the
ANOVA table and make a conclusion about
differences in the factor-level means.
Solution
Source of variation   Df   Sum of squares   Mean square   F
Treatment              3      117.4             39.1      1.5766
Error                 16      396.8             24.8
Total                 19      514.2
$F = 1.5766 < F_{0.05,3,16} \approx 3.24 \implies$ fail to reject $H_0$.
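The missing entries in Example 2 can be recovered in a fixed order: SSTr by subtraction, then the treatment degrees of freedom from MSTr, then the error row. The sketch below is one way to do this; the function name and dictionary keys are hypothetical, and the critical value 3.24 is the table value quoted in the solution.

```python
def fill_anova_table(df_total, sst, sse, mstr):
    """Complete a one-way ANOVA table from Total df, SST, SSE and MSTr."""
    sstr = sst - sse                  # SST = SSTr + SSE
    df_tr = round(sstr / mstr)        # MSTr = SSTr / df(SSTr)
    df_err = df_total - df_tr         # degrees of freedom add up
    mse = sse / df_err
    return {"df_tr": df_tr, "df_err": df_err, "sstr": sstr,
            "mse": mse, "f": mstr / mse}

table = fill_anova_table(df_total=19, sst=514.2, sse=396.8, mstr=39.1)
f_crit = 3.24   # F_{0.05,3,16}, taken from the solution slide
decision = "reject H0" if table["f"] > f_crit else "fail to reject H0"
print(table, decision)
```

Rounding `sstr / mstr` to an integer is needed because the printed mean square 39.1 is itself rounded.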
Example 3
Consider the following computer output for
a balanced experiment. The factor was
tested over four levels. Fill in the
missing information in the ANOVA table and
make a conclusion.
Source of variation   Df   Sum of squares   Mean square   F
Treatment              ?        ?             330.4716    4.42
Error                  ?        ?                ?
Total                 31        ?
Solution
Source of variation   Df   Sum of squares   Mean square   F
Treatment              3      991.4148        330.4716    4.42
Error                 28     2093.485          74.76733
Total                 31     3084.9
$F = 4.42 > F_{0.05,3,28} \approx 2.95 \implies$ reject $H_0$.
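The entries of the Example 3 table can be derived step by step from the three given quantities (total Df = 31, MSTr = 330.4716, F = 4.42) and the fact that the factor has four levels. The derivation below is a worked sketch using the notation I (number of levels), N (total observations) and n (observations per level) from earlier slides.

```latex
\begin{align*}
N &= 31 + 1 = 32, \qquad I = 4, \qquad n = N/I = 8,\\
\mathrm{df}(SSTr) &= I - 1 = 3, \qquad \mathrm{df}(SSE) = N - I = 28,\\
SSTr &= \mathrm{df}(SSTr) \cdot MSTr = 3 \times 330.4716 = 991.4148,\\
MSE &= MSTr / F = 330.4716 / 4.42 \approx 74.76733,\\
SSE &= \mathrm{df}(SSE) \cdot MSE = 28 \times 74.76733 \approx 2093.485,\\
SST &= SSTr + SSE \approx 991.415 + 2093.485 = 3084.9.
\end{align*}
```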
2 Post-hoc analysis
Confidence Intervals
$\sigma^2 \approx MSE$
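With MSE standing in for the error variance, a 95% confidence interval on a treatment mean in a balanced design takes the usual form: treatment average plus or minus $t_{\alpha/2,\,N-I}\sqrt{MSE/n}$. The sketch below applies this to Example 1 (MSE = 6.51, n = 6, 20 error degrees of freedom). The critical value 2.086 is a standard t-table entry and is an assumption here, since it does not appear on the slides.

```python
import math

mse, n = 6.51, 6      # MSE and per-level sample size from Example 1
t_crit = 2.086        # t_{0.025,20}: table value, an assumption not on the slides
means = {"5%": 10.00, "10%": 15.67, "15%": 17.00, "20%": 21.17}

# Balanced design: the half-width is the same for every level.
half_width = t_crit * math.sqrt(mse / n)
intervals = {level: (m - half_width, m + half_width)
             for level, m in means.items()}

for level, (low, high) in intervals.items():
    print(f"{level:>3}: ({low:.2f}, {high:.2f})")
```

For the 20% level this gives an interval of roughly 19.0 to 23.3.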
Multiple Comparisons
The comparisons among the observed
treatment averages are as follows
(LSD=3.07):
4 vs. 1 = 21.17 - 10.00 = 11.17 > 3.07
4 vs. 2 = 21.17 - 15.67 = 5.50 > 3.07
4 vs. 3 = 21.17 - 17.00 = 4.17 > 3.07
3 vs. 1 = 17.00 - 10.00 = 7.00 > 3.07
3 vs. 2 = 17.00 - 15.67 = 1.33 < 3.07
2 vs. 1 = 15.67 - 10.00 = 5.67 > 3.07
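The LSD value and the six comparisons above can be reproduced as follows. This is a sketch of Fisher's least significant difference for a balanced design, $LSD = t_{\alpha/2,\,N-I}\sqrt{2\,MSE/n}$, using MSE = 6.51 and n = 6 from Example 1. The critical value 2.086 is a standard t-table entry and is an assumption here, since the slides state only the resulting LSD.

```python
import math
from itertools import combinations

mse, n = 6.51, 6      # from the Example 1 ANOVA table
t_crit = 2.086        # t_{0.025,20}: table value, an assumption not on the slides
means = {1: 10.00, 2: 15.67, 3: 17.00, 4: 21.17}

lsd = t_crit * math.sqrt(2 * mse / n)   # least significant difference

different = {}
for a, b in combinations(sorted(means, reverse=True), 2):
    diff = abs(means[a] - means[b])
    different[(a, b)] = diff > lsd      # True if the pair differs significantly
    sign = ">" if diff > lsd else "<"
    print(f"{a} vs. {b} = {diff:.2f} {sign} {lsd:.2f}")
```

Only the pair (3, 2) falls below the LSD, so treatments 2 and 3 are not distinguished while every other pair is.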
Further discussion
3 Further discussion
Anova vs t-test
Random-effect models
RCBD
Anova vs t-test
Can Anova replace t-test?
Can multiple t-test replace Anova?
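Two standard facts bear on these questions, sketched here rather than taken from the slides. With only two groups, the one-way ANOVA F statistic equals the square of the pooled two-sample t statistic. Running every pairwise t-test among four levels inflates the overall Type I error. The example uses the 5% and 10% hardwood samples; the familywise figure treats the six tests as independent, which is only an approximation.

```python
import math

def pooled_t(x, y):
    """Two-sample t statistic with pooled variance."""
    nx, ny = len(x), len(y)
    mx, my = sum(x) / nx, sum(y) / ny
    ssx = sum((v - mx) ** 2 for v in x)
    ssy = sum((v - my) ** 2 for v in y)
    sp2 = (ssx + ssy) / (nx + ny - 2)              # pooled variance
    return (mx - my) / math.sqrt(sp2 * (1 / nx + 1 / ny))

def anova_f(groups):
    """One-way ANOVA F statistic."""
    all_obs = [v for g in groups for v in g]
    grand = sum(all_obs) / len(all_obs)
    sstr = sum(len(g) * (sum(g) / len(g) - grand) ** 2 for g in groups)
    sse = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    return (sstr / (len(groups) - 1)) / (sse / (len(all_obs) - len(groups)))

x = [7, 8, 15, 11, 9, 10]        # 5% hardwood
y = [12, 17, 13, 18, 19, 15]     # 10% hardwood
t = pooled_t(x, y)
f = anova_f([x, y])
print(f"t^2 = {t**2:.4f}, F = {f:.4f}")

# Rough familywise error for 6 pairwise tests at level 0.05 each,
# treating the tests as independent (an approximation).
familywise = 1 - (1 - 0.05) ** 6
print(f"familywise error ~ {familywise:.3f}")
```

So for two groups the two procedures agree, while for more than two groups a single F test keeps the overall error rate at the nominal level.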
Random-Effect Models
Montgomery’s book describes a
single-factor experiment under the
random-effects model: a textile
manufacturing company weaves a fabric on a
large number of looms and is interested in
loom-to-loom variability in tensile
strength. Four looms are sampled, with
four strength measurements on each.
Solution
ANOVA table:
Source of variation   Df   Sum of squares   Mean square   F
Loom                   3       89.188          29.729     15.681 (> 5.953)
Error                 12       22.750           1.896
Total                 15      111.938
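The table can be recomputed from the loom data (four looms, four strength measurements each), with the error sum of squares obtained as the total minus the loom sum of squares. The sketch below also estimates the two variance components using the usual balanced random-effects estimators, $\hat\sigma^2 = MSE$ and $\hat\sigma_\tau^2 = (MS_{loom} - MSE)/n$. These estimators are not shown on the slides and are included here as an assumption about what the analysis is after.

```python
looms = {
    1: [98, 97, 99, 96],
    2: [91, 90, 93, 92],
    3: [96, 95, 97, 95],
    4: [95, 96, 99, 98],
}

groups = list(looms.values())
n = len(groups[0])                               # observations per loom
all_obs = [x for g in groups for x in g]
correction = sum(all_obs) ** 2 / len(all_obs)    # (grand total)^2 / N

sst = sum(x * x for x in all_obs) - correction
ss_loom = sum(sum(g) ** 2 for g in groups) / n - correction
sse = sst - ss_loom

ms_loom = ss_loom / (len(groups) - 1)
mse = sse / (len(all_obs) - len(groups))
f = ms_loom / mse

sigma2_error = mse                    # within-loom variance estimate
sigma2_loom = (ms_loom - mse) / n     # loom-to-loom variance estimate

print(f"SS_loom = {ss_loom:.3f}, SSE = {sse:.3f}, SST = {sst:.3f}")
print(f"F = {f:.3f}, sigma2_loom = {sigma2_loom:.3f}, "
      f"sigma2_error = {sigma2_error:.3f}")
```

On these data the loom-to-loom component (about 6.96) is much larger than the within-loom component (about 1.90), which matches the large F ratio.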
Fixed-Effect vs Random-Effect
Fixed effect (hardwood concentration):
H.C.   Tensile strength
5%      7   8  15  11   9  10
10%    12  17  13  18  19  15
15%    14  18  19  17  16  18
20%    19  25  22  23  18  20

Random effect (loom):
Loom   Tensile strength
1      98  97  99  96
2      91  90  93  92
3      96  95  97  95
4      95  96  99  98
Solution
Chemical    Fabric Sample
Type          1      2      3      4      5     Sum    Av.
1            1.3    1.6    0.5    1.2    1.1    5.7   1.14
2            2.2    2.4    0.4    2.0    1.8    8.8   1.76
3            1.8    1.7    0.6    1.5    1.3    6.9   1.38
4            3.9    4.4    2.0    4.1    3.4   17.8   3.56
Total        9.2   10.1    3.5    8.8    7.6
Average     2.30   2.53   0.88   2.20   1.90
RCBD
Solution
Anova table:
Source of variation   Df   Sum of squares   Mean square   F
Chemical types         3       18.04            6.01      75.13 (> 5.95)
Fabric samples         4        6.69            1.67
Error                 12        0.96            0.08
Total                 19       25.69
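The RCBD decomposition splits the total sum of squares into treatment (chemical type), block (fabric sample) and error parts. The sketch below recomputes these from the data table, with rows as chemical types and columns as fabric samples; the variable names are illustrative.

```python
# Rows: chemical types 1-4 (treatments); columns: fabric samples 1-5 (blocks).
y = [
    [1.3, 1.6, 0.5, 1.2, 1.1],
    [2.2, 2.4, 0.4, 2.0, 1.8],
    [1.8, 1.7, 0.6, 1.5, 1.3],
    [3.9, 4.4, 2.0, 4.1, 3.4],
]
a, b = len(y), len(y[0])                  # number of treatments, blocks
grand_total = sum(map(sum, y))
correction = grand_total ** 2 / (a * b)

sst = sum(v * v for row in y for v in row) - correction
ss_treat = sum(sum(row) ** 2 for row in y) / b - correction
ss_block = sum(sum(col) ** 2 for col in zip(*y)) / a - correction
sse = sst - ss_treat - ss_block           # what is left after both effects

ms_treat = ss_treat / (a - 1)
mse = sse / ((a - 1) * (b - 1))
f = ms_treat / mse
print(f"SS_treat = {ss_treat:.3f}, SS_block = {ss_block:.3f}, "
      f"SSE = {sse:.3f}, SST = {sst:.3f}, F = {f:.2f}")
```

Unrounded, the treatment sum of squares is about 18.044 and the error sum of squares about 0.951, giving F near 75.9. The table's 75.13 comes from dividing the rounded mean squares 6.01 and 0.08. Either way the ratio far exceeds 5.95.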
Summary
1 Anova
2 Post-hoc analysis
    Confidence interval
    Multiple comparison
3 Further discussion
    Anova vs t-test
    Random effect
    RCBD