Group 2 - Chap 11 Analysis of Variance
Group 2 - Chap 11 Analysis of Variance
Baranda, Dulay,
Escobillo, Estares, Liboa
The Completely Randomized Design:
One-Way ANOVA
||Erica Dulay and April Escobillo||
Assumptions:
● Populations are normally distributed
● Populations have equal variances
● Samples are randomly and independently drawn
Hypothesis of One-Way ANOVA
● Null Hypothesis of no difference
● All population means are equal
● i.e., no factor effect (no variation in means among groups)
● Alternative
● At least one population mean is different
● i.e., there is a factor effect
● Does not mean that all population means are different (some pairs may
be the same)
One-Way ANOVA
Partitioning the Variation
● Total variation can be split into two parts:
One-Way ANOVA
1. Compute Sample Means
2. Compute Grand Mean
3. Compute SSA
4. Compute SSW
5. Compute SST
6. Compute Mean Squares: MSA and MSW
7. Compute Fstat
The situation/problem
You are the production manager at the Perfect Parachutes Company.
Parachutes are woven in your factory using a synthetic fiber purchased from
one of four different suppliers. Strength of these fibers is an important
characteristic that ensures quality parachutes.You need to decide whether
the synthetic fibers from each of your four suppliers result in parachutes
of equal strength. Furthermore, your factory uses two types of looms to
produce parachutes, the Jetta and the Turk. You need to establish that the
parachutes woven on both types of looms are equally strong.You also want to
know if any differences in the strength of the parachute that can be attributed
to the four suppliers are dependent on the type of loom used. How would you
go about finding this information?
Step 1: Compute sample means
Step 2: Compute the Grand Mean
Sum all values and divide the sum
by the total number of values:
Step 3: Compute SSA
AMONG GROUP VARIATION ( SUM
OF SQUARES AMONG GROUP)
● Follows an F distribution
Degrees of Freedom
○ df1 = c – 1 (numerator)
○ df2 = n – c (denominator)
● The ratio must always be positive
● df1 = c – 1 will typically be small
● df2 = n – c will typically be large
Because is Fstat 3.4616 is greater
than the upper-tail critical value of
3.24, you reject the null hypothesis
c= number of groups
n= sum of the sample sizes from all groups
df= degrees of freedom
MULTIPLE COMPARISONS:
The Tukey-Kramer Procedure
where:
Qα = Upper Tail Critical Value from Studentized
MSW = Mean Square Within
Range Distribution with c and n - c degrees
nj and nj’ = Sample sizes from groups j and j’
of freedom (see appendix E.7 table)
NOTE: Because 4.74 > 4.4712, there is a significant difference between the
means of Suppliers 1 and 2. All other pairwise differences are less than 4.4712.
Step 4: Interpret the results
With 95% confidence, you can conclude that parachutes woven using fiber
from Supplier 1 have a lower mean tensile strength than those from Supplier
2, but there no statistically significant differences between Suppliers 1 and 3,
Suppliers 1 and 4, Suppliers 2 and 3, Suppliers 2 and 4, and Suppliers 3 and 4.
ANOVA Assumptions
● Randomness and Independence
○ Select random samples from the c groups (or randomly assign the levels)
● Normality
○ The sample values for each group are from a normal population
● Homogeneity of Variance
○ All populations sampled from have the same variance
○ Can be tested with Levene’s Test
Levene Test for Homogeneity of Variance
You conclude that there is no evidence of significant difference among the four variances.
In other words, it is reasonable to assume that the materials from the four suppliers
produce parachutes with an equal amount of variability. Therefore, the homogeneity-of-
variance assumption for the ANOVA procedure is justified.
The Randomized Block Design
Randomized Block Design
● Evaluates differences among more than two groups that contain matched
samples or repeated measures that have been placed in blocks.
SSA 1,787.46
SST 2,295.63
SSBL 283.375
_____
Testing for Block Effect
Blocking Test:
df1 = r – 1 = 6 – 1 = 5
df2 = (r – 1)(c – 1) = (6-1)(4-1) = 15
______
Estimated Relative Efficiency
r= 6
c= 4
MSBL = 56.675
MSE = 14.986
A relative efficiency of 1.6 means that it would take 1.6 times as many observations in one
ANOVA design as compared to the randomized block design in order to have the same
precision in comparing the restaurant,
Multiple Comparisons: The Tukey Procedure
FACTOR A VARIATION
FACTOR B VARIATION
INTERACTION VARIATION
= 6.9723
Step 2.2: Compute SSB
= 134.3488
Step 2.3: Compute SSAB
= 0.2867
Step 2.4: Compute SSE
=275.5920
Step 3: Compute Degree of Freedom
Degrees of Freedom
SSA =r–1
SSB =c-1
SSAB = (r–1)(c-1)
SSE = rc(n’ – 1)
Total =n-1
Step 4: Compute Mean Squares
Step 5: Compute Fstat
Step 6: Interpret Results
Testing for Factor and Interaction Effects
F Test for Interaction Effect
Reject H0 if FSTAT > Fα
where Q is the upper-tail critical value from a Studentized range distribution having r and
rc(n’ – 12) degrees of freedom.
Qα = 3.84
MULTIPLE COMPARISONS: THE TUKEY PROCEDURE
2. Find the Qα value from the table in appendix 1. Compute the absolute mean differences:
E.7 [for a = 0.05, c = 4, and rc(n’-1) = 32 ] Because there are four suppliers, there are
degrees of freedom. 4(4-1)/2= 6 pairwise comparisons.
Qα = 3.84
4. Compare:
MULTIPLE COMPARISONS: THE TUKEY PROCEDURE
4. Compare: 1. Compute the absolute mean differences:
Because there are four suppliers, there are
4(4-1)/2= 6 pairwise comparisons.
Critical Range = 3.56
And there are no statistically significant differences between Suppliers 1 and 3, Suppliers 1 and 4, Suppliers 2 and 3, Suppliers 2
and 4, and Suppliers 3 and 4. Note that by using you are able to make all six comparisons with an overall error rate of only 5%.
VISUALIZING & INTERPRETING INTERACTION
EFFECTS: The Cell Means Plot
Cell means are obtained and plotted when more than two
groups are compared at a time. Cell means are plotted to
check whether there is any difference between the means of
the different groups of the data or not.
As production manager, the business problem you decide to
examine involved not just the different suppliers but also
whether parachutes woven on the Jetta looms are as strong
as those woven on the Turk looms. In addition, you need to
determine whether any differences among the four suppliers
in the strength of the parachutes are dependent on the type
of loom being used.
Tensile Strengths of Parachutes Woven by Two
Types of Loom
This cell means plot
shows two parallel
series, although they
are slightly shifted
from one another this
is a clear indication
that there is no
interaction between
Factor A and B.
Since the p-value = 0.9984 > 0.05, we
can conclude that there is insufficient
evidence of an interaction effect
between loom and supplier.
A2
B1 B2 B3 B4
If there is interaction, the line
would be nonparallel. Some
levels of factor A would
respond better with certain
levels of factor B.