CH 10
ANALYSIS OF VARIANCE
Analysis of variance (ANOVA)
• A one-way ANOVA is used to compare two or more
treatment means
• Analysis of “VARIANCE”: a hypothesis test where the
variance of group means is compared to variance within
those groups; produces an F-ratio
• Instead of comparing means one pair at a time, it requires
calculating a single number to tell how much all of those
means vary from each other.
• Research questions for the one-way ANOVA should ask if
there are any differences between the groups
• Example: Do any of the groups have different scores than the other
groups?
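The F-ratio described above can be sketched in a few lines of Python. This is a minimal illustration rather than a prescribed procedure: the function name `one_way_anova` and the three small groups of scores are made up for the example and do not come from the slides.

```python
from statistics import mean

def one_way_anova(groups):
    """Return (ss_between, ss_within, f_ratio) for a list of groups of scores."""
    all_scores = [x for g in groups for x in g]
    grand_mean = mean(all_scores)
    k = len(groups)        # number of groups
    n = len(all_scores)    # total number of observations

    # Variation of the group means around the grand mean
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of each score around its own group mean
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ss_between, ss_within, ms_between / ms_within

# Hypothetical scores, chosen only so the arithmetic is easy to follow
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
ss_b, ss_w, f_ratio = one_way_anova(groups)
print(ss_b, ss_w, f_ratio)
```

A large F means the group means vary much more than the scores vary inside each group, which is the single number the test relies on.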
ANOVA Example
Joyce Kuhlman manages a regional financial center. She wishes to
compare the productivity, as measured by the number of customers served,
among three employees. Four days are randomly selected and the number
of customers served by each employee is recorded. Is there a difference in
the mean number of customers served?
One-way ANOVA
55 115 86 71
71 86 108 62
72 98 66 48
62 120 37 69
67 115 90 55
103 57
Filling in the ANOVA table
SS df MS F
Total 11770.59 21
• α = .05
• Fcrit (2, 12) = 3.89
           SS         df            MS                       F
Between    0.533333   3 - 1 = 2     0.533333/2 = 0.2666665   0.2666665/1.033333 = 0.258064
Within     12.40      15 - 3 = 12   12.40/12 = 1.033333
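The arithmetic behind this table can be checked with a short script. It is only a sketch: the helper name `anova_table` is hypothetical, and the inputs (the two sums of squares, 3 groups, 15 observations, and the critical value 3.89) are taken from the slides.

```python
def anova_table(ss_between, ss_within, k, n):
    """Fill in df, MS and F from the between and within sums of squares."""
    df_between = k - 1
    df_within = n - k
    ms_between = ss_between / df_between
    ms_within = ss_within / df_within
    return {
        "df_between": df_between,
        "df_within": df_within,
        "MS_between": ms_between,
        "MS_within": ms_within,
        "F": ms_between / ms_within,
    }

table = anova_table(0.533333, 12.40, k=3, n=15)
f_crit = 3.89  # F(2, 12) at alpha = .05, as given on the slide

# Decision rule: reject H0 only when the computed F exceeds the critical value
reject_h0 = table["F"] > f_crit
print(round(table["F"], 6), reject_h0)
```

Because the computed F is far below 3.89, this comparison does not reject the null hypothesis of equal means.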
Learning check
• Beta Inc. wants to compare the number of sales made by
employees in three teams of employees. The company
wants to know whether there are differences in sales
between the three teams. Data is provided below:
Learning check
• RQ: Are there differences in the number of sales between
the three teams?
• Hypotheses:
‒ H0: μ1 = μ2 = μ3
‒ H1: At least two means differ
• α = .05
• Fcrit (2, 6) = 5.14
• SSBetween = 732.666709
• SSTotal = 1032
• SSWithin = 299.333291
Learning check
           SS           df          MS                          F
Between    732.666709   3 - 1 = 2   732.666709/2 = 366.333355   366.333355/49.888882 = 7.342986
Within     299.333291   9 - 3 = 6   299.333291/6 = 49.888882
Total      1032         9 - 1 = 8
Learning check
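• η² = SSBetween/SSTotal = 732.666709/1032 = .709948
• Critical difference for comparing two team means:
$\bar{x}D_{crit} = \sqrt{(3-1)(5.14)(49.888882)\left(\frac{1}{3}+\frac{1}{3}\right)} \approx 18.4907$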
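The effect size and the critical difference can be reproduced as follows. This is a sketch under stated assumptions: each team is assumed to have 3 observations (9 in total across 3 teams), and the helper `differs` is hypothetical. The team means are not listed in the slides, so they are passed in as arguments rather than filled in.

```python
from math import sqrt

# Values from the learning-check ANOVA table
ss_between = 732.666709
ss_total = 1032
ms_within = 49.888882
f_crit = 5.14      # F(2, 6) at alpha = .05
k = 3              # number of teams
n_i = n_j = 3      # assumed observations per team

# Share of total variance explained by team membership
eta_squared = ss_between / ss_total

# Smallest difference between two team means that counts as significant
critical_difference = sqrt((k - 1) * f_crit * ms_within * (1 / n_i + 1 / n_j))

def differs(mean_i, mean_j):
    """True when two team means are further apart than the critical difference."""
    return abs(mean_i - mean_j) > critical_difference

print(round(eta_squared, 6), round(critical_difference, 4))
```

Any pair of teams whose means differ by more than about 18.49 sales would be flagged as significantly different.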
Learning check
• Reject the null and accept the alternative. The difference
was statistically significant. There are differences in the
number of sales between the three teams. There is a
significant difference between the number of sales for
team 2 and team 3. 71% of the variance in number of
sales can be explained by the differences between the
teams.
A Two-Way ANOVA
• Example: study time (amount of studying) and difficulty. When difficulty is low, scores fall as study time increases; when difficulty is high, scores rise as study time increases.
• In a two-way ANOVA, we consider a second treatment variable
• This reduces the amount of error variance
• The second treatment variable is called the blocking variable
• It is determined using the equation below
• SSB: SSBlock (SSB1, SSB2)
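The equation itself did not survive in this text. A common textbook form is sketched here, under the assumption that there are k treatments and b blocks, that $\bar{x}_b$ is the mean of block b, and that $\bar{x}_G$ is the grand mean. The subscripted names are chosen for this sketch to avoid confusion with SSBetween and SSTotal used earlier.

```latex
SS_{\text{block}} = k \sum_{b} \left( \bar{x}_b - \bar{x}_G \right)^2,
\qquad
SS_{\text{error}} = SS_{\text{total}} - SS_{\text{treatment}} - SS_{\text{block}}
```

Moving the block variation out of the error term is what reduces the error variance mentioned above.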
At the .05 significance level, is there a difference in the mean travel time along
the four routes? If we remove the effects of the drivers, is there a difference in
the mean travel time?
ANOVA Test Example Continued
Step 1: State the null and alternate hypotheses.
H0: μ1 = μ2 = μ3 = μ4
H1: Not all treatment means are the same
Step 2: Select the level of significance. We use .05.
Step 3: Select the test statistic. We use F.
Step 4: State the decision rule. Reject H0 if F > 3.24.
Step 5: Make a decision. F = 2.483, so we do not reject the null hypothesis.
Step 6: Interpret. There is no reason to conclude that any one of the routes is
faster than any other.
The Blocking Variable
• In the WARTA example, we only considered the variation
due to routes and took all other variables to be random
• Now, we’ll include the variance due to the drivers by
letting the drivers be the blocking variable (Route, Driver)
Next, we test whether the mean travel times for the various drivers are equal.
One-way critical value: F(k-1, n-k). Two-way critical value for treatments:
F(k-1, (k-1)(b-1)) = F(4-1, (4-1)(5-1)) = F(3, 12)
Hypothesis Test of Equal Block Means Continued
State the decision rule for the second set of hypotheses: reject H0 if F > 3.26,
the critical value of F(4, 12).
Make a decision: the computed F ratio is 9.78, so we reject the null hypothesis.
$F = \frac{MSB}{MSE} = \frac{29.93}{3.06} = 9.78$
Interpret: we conclude that at least one driver's mean travel time is different from
the others. WARTA management can conclude, based on the sample results,
that there is a difference in the mean travel times of drivers.
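The degrees of freedom and the block test above can be verified with a brief sketch. The variable names are illustrative; the numbers (4 routes, 5 drivers, MSB = 29.93, MSE = 3.06, critical value 3.26) come from the slides.

```python
k = 4  # treatments (routes)
b = 5  # blocks (drivers)

# Error degrees of freedom in a two-way ANOVA without replication
df_error = (k - 1) * (b - 1)
df_treatment_test = (k - 1, df_error)  # used for the route hypothesis
df_block_test = (b - 1, df_error)      # used for the driver hypothesis

ms_block = 29.93
ms_error = 3.06
f_block = ms_block / ms_error

f_crit_block = 3.26  # F(4, 12) at alpha = .05, as given on the slide
reject_equal_driver_means = f_block > f_crit_block

print(df_treatment_test, df_block_test, round(f_block, 2), reject_equal_driver_means)
```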
Excel has a two-factor ANOVA procedure. The output for the WARTA example just completed is shown.
Interaction Plot
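• Interaction: the effect of IV1 on the DV changes depending on IV2.
• The IV1-DV relationship may be significant at level A of IV2 (levels A, B, C) but not significant at B or C.
• Or the IV1-DV relationship may be significant at all of A, B, and C but take a different form at each.
• For example, at IV2 = A the IV1-DV relationship is positive and significant, at B negative and significant, and at C positive and significant (the slopes may differ).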
• An interaction plot illustrates the interaction of the two factors, route and
driver; Travel time is the response variable
INTERACTION The effect of one factor on a response variable
differs depending on the value of another factor.
                        Routes
Drivers     U.S. 6   West End   Hickory   Rte. 59
Deans       18       17         21        22
Snaverly    16       22.33      23        22
Ormson      18       23         26        22
Zollaco     23       22         29        23.67
Filbeck     25       24         28        28
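One way to see what the interaction plot shows is to check how far each driver-by-route value departs from a purely additive pattern. The sketch below is an illustration added here, not a method from the slides. The helper `interaction_residuals` is hypothetical, and the values are those in the table above. If the lines in the plot were perfectly parallel, every residual would be zero.

```python
routes = ["U.S. 6", "West End", "Hickory", "Rte. 59"]
cell_means = {
    "Deans":    [18, 17, 21, 22],
    "Snaverly": [16, 22.33, 23, 22],
    "Ormson":   [18, 23, 26, 22],
    "Zollaco":  [23, 22, 29, 23.67],
    "Filbeck":  [25, 24, 28, 28],
}

def interaction_residuals(cells):
    """Cell value minus driver mean minus route mean plus grand mean."""
    drivers = list(cells)
    n_cols = len(next(iter(cells.values())))
    row_means = {d: sum(cells[d]) / n_cols for d in drivers}
    col_means = [sum(cells[d][j] for d in drivers) / len(drivers)
                 for j in range(n_cols)]
    grand_mean = sum(row_means.values()) / len(drivers)
    return {
        d: [cells[d][j] - row_means[d] - col_means[j] + grand_mean
            for j in range(n_cols)]
        for d in drivers
    }

residuals = interaction_residuals(cell_means)
for driver, row in residuals.items():
    print(driver, [round(r, 2) for r in row])
```

Nonzero residuals only describe the pattern. Whether the interaction is statistically significant still has to be decided with the F test in the ANOVA table.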
Hypothesis Tests for Interaction
• The next step is to investigate the interaction effects
• Is there an interaction between drivers and routes?
• Are the mean travel times for drivers the same?
• Are the mean travel times for the routes the same?
• Test three sets of hypotheses
• H0: There is no interaction between drivers and routes
• H1: There is interaction between drivers and routes
• H0: The driver means are equal
• H1: At least one driver travel time mean is different
• H0: The route means are equal
• H1: At least one route travel time mean is different
ANOVA Table including Interactions
• The complete ANOVA table including interactions
A One-Way ANOVA to Test a Hypothesis
• We will continue the analysis by conducting a one-way ANOVA for
each route, testing the hypothesis H0: Driver times are equal
(for each route, do travel times differ between drivers?)