Single Factor Design: Analysis of Variance (ANOVA)
Single Factor Design: Analysis of Variance (ANOVA)
1
5/4/2011
ANOVA (Cont’d)
Factorial ANOVA for Repeated Measures
There are two or more independent variables
The same participants are used in all experiment conditions (within-subject
factorial design)
Factorial Mixed-Design ANOVA
There are two or more independent variables
Both between-subject and within-subject factors are present
2
5/4/2011
Suppose you have designed a procedure that you think can help people learn and
remember things. You believe that when people use your procedure, they can learn
them faster and remember them longer.
To test your hypothesis, you compare two groups of people, an experimental group in
which individuals study your memory procedure, and a control group in which
individuals are given the same amount of time studying something irrelevant to the
task. Following the study period, all individuals are given the same set of 5 facts to
learn and then are tested the next day. You recruit 10 people, 5 of whom are randomly
assigned to the experiment group, and the rest the control group.
3
5/4/2011
One-Way ANOVA
One factor with a (a ≥ 2) levels; Compare the means of a treatments
Observations
A Totals
1 2 … n
1 y11 y12 … y1n y1.
2 y21 y22 … y2n y2.
. . . … . .
Factor A
. . . … . .
. . . … . .
4
5/4/2011
Where µ is the overall mean, a parameter common to all treatments, and τi a parameter
unique to the ith treatment, called ith treatment effect
a
∑τ i = 0 ε ij independent N (0, σ 2 )
i =1
ANOVA Decomposition
Total sum of squares, SST, is the sum of total sum of squares due to treatments (i.e. between
treatments), SSTrt , and the sum of squares dues to error (i.e. within treatments), SSErr
10
5
5/4/2011
ANOVA Table
Sources of Sum of
df Mean squares F p-value
variation squares
Treatment a-1 SSTrt MSTrt=SSTrt/(a-1) MSTrr/MSErr
Error a(n-1) SSErr MSErr=SSErr/(a(n-1))
Total a·n-1 SSTrt
11
Self-Disclosure Example
An experiment is conducted to study the effect of physical distance on self-discourse.
Each participant is seated in a room; the interviewer comes into the room and sits at
one of three distances from the participant: close (2 ft), medium (4ft), or far (6ft).
Participants are randomly assigned to one of the three distance conditions, with 5
participants for each condition. The dependent variable is the number of personal,
revealing statements made by the participant during the interview.
6
5/4/2011
a n 2 2
SST = ∑∑ yij2 − yN.. = (332 + 242 + ... + 142 ) − 341
15
= 792.93
i =1 j =1
a 2 2
SSTreatment = 1n ∑ yi2. − yN.. = 15 (1512 + 1182 + 722 ) − 341
15 = 629.73
i =1
ANOVA Table
Sources
of Sum of Mean
variation squares df squares F p-value
Distance 629.73 2 314.87 23.15 <0.0001
Error 163.2 12 13.6
Total 792.93 14
There seem to be significant differences among the three treatment means, but
which treatment means differ significantly?
Looking at the three treatment means, we can tell that “close” and “far” would be
significantly different because they have the highest and lowest means, but we do
not know whether “close” and “medium” differ significantly or whether
“medium” and “far” differ significantly.
14
7
5/4/2011
15
2 MSE
LSD = tα / 2 (a(n - 1)) n
where MSE is the mean square error obtained in the original ANOVA, n is the number
of samples per treatment, and a is the number of treatments
If | yi. − y j . | > LSD, we conclude that the population means µi and µj differ
significantly.
16
8
5/4/2011
Pair Difference
close vs. medium 6.6 > LSD Conclusion: All of the three
levels of distance are
close vs. far 15.8 > LSD significantly different between
each other
medium vs. far 9.2 > LSD
17
For any particular pair of means among a treatments, if the larger and smaller
means are denoted as ML and MS, respectively, then Studentized range statistic can
be calculated for any particular pair as
M L −MS
q= MSE / n
~ q(a, a(n − 1))
where MSE is the mean square error obtained in the original ANOVA, n is the
number of samples per treatment
9
5/4/2011
Pair Difference
19
Dunnett’s Test
Dunnett’s Test
Specifically designed for comparing each of the experimental groups against the
control group (not between different experimental groups)
Based on d statistics
For each pair of an experimental group i and the control group c, whose means
are denoted as Mi and Mc, respectively
|M i - M c |
d= MSE / n
~ dα ((a - 1), a(n - 1))
where MSE is the mean square error obtained in the original ANOVA, n
is the number of samples per treatment, and a is the number of groups
(including the control group)
10
5/4/2011
DD = dα (a − 1, a(n − 1)) MS E
n
α = 0.05, a = 3, n = 5, MSE = 13.6
Pair Difference
Conclusion: Both medium and
close vs. medium 6.6 > DD far distances are significantly
different from the close distance
close vs. far 15.8 >DD
21
2
H0: σ τ = 0
H1: σ τ2 > 0
22
11
5/4/2011
23
24
12
5/4/2011
13
5/4/2011
Y* =Yλ
It can be shown that σ Y ∝ µ λ +α −1
*
27
28
14
5/4/2011
29
30
15
5/4/2011
Inverse
31
The total variability in the one-way within-subject A×S design can be divided
into SSA, SSS, and SSA×S.
Participants in the A×S design are a random sample of people from a larger
population of interest; therefore, the participants represent a random effect.
32
16
5/4/2011
F = SSASS A /( a −1)
×S /( a −1)( n −1)
MSA
= MSA×S
~ Fa −1,( a −1)(n−1)
Participant (S)
1 2 3 4 5 6 7 8 yi.
A1 6 5 5 7 4 3 5 4 4.875 y.. = 5.688
Interface A)
A2 8 6 9 6 6 5 5 7 6.5
y. j 7 5.5 7 6.5 5 4 5 5.5
34
17
5/4/2011
F0.05,1,7 = 5.59
F0.05,1,17 < FA, so we can reject the null hypothesis, i.e. there is evidence
that users’ satisfaction with the two visual interfaces differ significantly
35
18
5/4/2011
38
19
5/4/2011
2
H0: σ τ = 0
H1: σ τ2 > 0
39
40
20