Analysis of Variance and Covariance: Chapter 16 Marketing Research
i. Identification of Dependent and Independent Variables
ii. Decomposition of the Total Variation
iii. Measurement of Effects
iv. Significance Testing
v. Interpretation of Results
• Illustrative Data
• Illustrative Applications of One-Way Analysis of Variance
• Assumptions in Analysis of Variance
• N-Way Analysis of Variance
• Analysis of Covariance
• Issues in Interpretation
i. Interactions
ii. Relative Importance of Factors
iii. Multiple Comparisons
Relationship Among Techniques
• Analysis of variance (ANOVA) is used as a test of
means for two or more populations. The null
hypothesis, typically, is that all means are equal.
• Analysis of variance must have a dependent
variable that is metric (measured using an interval
or ratio scale).
• There must also be one or more independent
variables that are all categorical (nonmetric).
Categorical independent variables are also called
factors.
• A particular combination of factor levels, or
categories, is called a treatment.
• One-way analysis of variance involves only one
categorical variable, or a single factor. In one-way
analysis of variance, a treatment is the same as a
factor level.
• If two or more factors are involved, the analysis is
termed n-way analysis of variance.
• If the set of independent variables consists of both
categorical and metric variables, the technique is
called analysis of covariance (ANCOVA). In this
case, the categorical independent variables are still
referred to as factors, whereas the metric independent
variables are referred to as covariates.
Relationship Among t Test, Analysis of Variance, Analysis of Covariance, and Regression
One-way Analysis of Variance
• Marketing researchers are often interested in examining
the differences in the mean values of the dependent
variable for several categories of a single independent
variable or factor.
• To test the null hypothesis, the variance of Y is estimated
in two ways: from the variation between categories and
from the variation within categories (error):
$$S_y^2 = \frac{SS_x}{c - 1} = \text{mean square due to } X = MS_x$$

or

$$S_y^2 = \frac{SS_{error}}{N - c} = \text{mean square due to error} = MS_{error}$$
• The null hypothesis may be tested by the F
statistic based on the ratio between these two
estimates:
$$F = \frac{SS_x/(c - 1)}{SS_{error}/(N - c)} = \frac{MS_x}{MS_{error}}$$
• This statistic follows the F distribution, with (c -
1) and (N - c) degrees of freedom (df).
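As a hedged illustration (not from the text), a short Python function that computes this F statistic and its p-value from the sums of squares; the function name and arguments are illustrative, and SciPy is assumed to be available.

```python
# Minimal sketch (not from the text): the one-way ANOVA F test built from the
# sums of squares, with (c - 1) and (N - c) degrees of freedom as defined above.
from scipy.stats import f

def one_way_anova_f(ss_x, ss_error, c, n):
    """Return (F, p) for a one-way ANOVA with c categories and N observations."""
    ms_x = ss_x / (c - 1)                  # mean square due to X
    ms_error = ss_error / (n - c)          # mean square due to error
    f_stat = ms_x / ms_error
    p_value = f.sf(f_stat, c - 1, n - c)   # upper-tail probability of F(c-1, N-c)
    return f_stat, p_value
```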
Interpret the Results
• If the null hypothesis of equal category means is
not rejected, then the independent variable does
not have a significant effect on the dependent
variable.
• On the other hand, if the null hypothesis is
rejected, then the effect of the independent
variable is significant.
• A comparison of the category mean values will
indicate the nature of the effect of the independent
variable.
Illustrative Applications of One-Way Analysis of Variance
• We illustrate the concepts discussed in this chapter
using the data presented in Table 16.2.
• The department store is attempting to determine the
effect of in-store promotion (X) on sales (Y).
• For the purpose of illustrating hand calculations, the
data of Table 16.2 are transformed in Table 16.3 to
show the store sales (Yij) for each level of promotion.
• The null hypothesis is that the category means are
equal:
H0: µ1 = µ2 = µ3
Table 16.2
Store Number   Coupon Level   In-Store Promotion   Sales   Clientele Rating
1 1.00 1.00 10.00 9.00
2 1.00 1.00 9.00 10.00
3 1.00 1.00 10.00 8.00
4 1.00 1.00 8.00 4.00
5 1.00 1.00 9.00 6.00
6 1.00 2.00 8.00 8.00
7 1.00 2.00 8.00 4.00
8 1.00 2.00 7.00 10.00
9 1.00 2.00 9.00 6.00
10 1.00 2.00 6.00 9.00
11 1.00 3.00 5.00 8.00
12 1.00 3.00 7.00 9.00
13 1.00 3.00 6.00 6.00
14 1.00 3.00 4.00 10.00
15 1.00 3.00 5.00 4.00
16 2.00 1.00 8.00 10.00
17 2.00 1.00 9.00 6.00
18 2.00 1.00 7.00 8.00
19 2.00 1.00 7.00 4.00
20 2.00 1.00 6.00 9.00
21 2.00 2.00 4.00 6.00
22 2.00 2.00 5.00 8.00
23 2.00 2.00 5.00 10.00
24 2.00 2.00 6.00 4.00
25 2.00 2.00 4.00 9.00
26 2.00 3.00 2.00 4.00
27 2.00 3.00 3.00 6.00
28 2.00 3.00 2.00 10.00
29 2.00 3.00 1.00 9.00
30 2.00 3.00 2.00 8.00
TABLE 16.3
EFFECT OF IN-STORE PROMOTION ON SALES

Store              Level of In-Store Promotion
No.             High       Medium       Low
                   (Normalized Sales)
 1               10           8           5
 2                9           8           7
 3               10           7           6
 4                8           9           4
 5                9           6           5
 6                8           4           2
 7                9           5           3
 8                7           5           2
 9                7           6           1
10                6           4           2
_____________________________________________________
Column totals    83          62          37
Category means (ȳj):  83/10 = 8.3   62/10 = 6.2   37/10 = 3.7
Grand mean: ȳ = (83 + 62 + 37)/30 = 6.067
• It can be verified that
SSy = SSx + SSerror
as follows:
185.867 = 106.067 +79.80
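A minimal Python sketch, not part of the original text, that recomputes these sums of squares from the Table 16.3 data; the variable names are illustrative.

```python
# Minimal sketch (not from the text): verifying SSy = SSx + SSerror
# from the Table 16.3 data.
high   = [10, 9, 10, 8, 9, 8, 9, 7, 7, 6]    # high in-store promotion
medium = [8, 8, 7, 9, 6, 4, 5, 5, 6, 4]      # medium in-store promotion
low    = [5, 7, 6, 4, 5, 2, 3, 2, 1, 2]      # low in-store promotion

groups = [high, medium, low]
all_y = [y for g in groups for y in g]
grand_mean = sum(all_y) / len(all_y)                      # 6.067

ss_y = sum((y - grand_mean) ** 2 for y in all_y)          # total variation, 185.867
ss_x = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2
           for g in groups)                               # between categories, 106.067
ss_error = sum((y - sum(g) / len(g)) ** 2
               for g in groups for y in g)                # within categories, 79.800

print(ss_y, ss_x + ss_error)                              # both ≈ 185.867
```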
• The strength of the effect of X on Y is measured as follows:

$$\eta^2 = \frac{SS_x}{SS_y} = \frac{106.067}{185.867} = 0.571$$
• In other words, 57.1% of the variation in sales (Y)
is accounted for by in-store promotion (X),
indicating a modest effect. The null hypothesis
may now be tested.
$$F = \frac{SS_x/(c - 1)}{SS_{error}/(N - c)} = \frac{MS_x}{MS_{error}}$$

$$F = \frac{106.067/(3 - 1)}{79.800/(30 - 3)} = 17.944$$
• In the Statistical Appendix we see that for 2 and 27
degrees of freedom, the critical value of F is 3.35
for α = 0.05. Because the calculated value of F is
greater than the critical value, we reject the null
hypothesis.
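As a cross-check, a short Python sketch (not part of the original text) that recovers the critical value and p-value with SciPy:

```python
# Minimal sketch (not from the text): checking the hand-calculated F test.
from scipy.stats import f

f_stat = (106.067 / 2) / (79.800 / 27)   # MSx / MSerror ≈ 17.944
critical = f.ppf(0.95, 2, 27)            # ≈ 3.35 for alpha = 0.05
p_value = f.sf(f_stat, 2, 27)            # well below 0.05

print(f_stat, critical, p_value)         # reject H0: equal category means
```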
• We now illustrate the analysis of variance
procedure using a computer program.
• The results of conducting the same analysis by
computer are presented in Table 16.4.
N-Way Analysis of Variance
• The significance of the overall effect may be tested by an F test:
$$F = \frac{(SS_{x_1} + SS_{x_2} + SS_{x_1 x_2})/df_n}{SS_{error}/df_d} = \frac{SS_{x_1, x_2, x_1 x_2}/df_n}{SS_{error}/df_d} = \frac{MS_{x_1, x_2, x_1 x_2}}{MS_{error}}$$

where

$$df_n = (c_1 - 1) + (c_2 - 1) + (c_1 - 1)(c_2 - 1) = c_1 c_2 - 1 \quad \text{(degrees of freedom for the numerator)}$$
$$df_d = N - c_1 c_2 \quad \text{(degrees of freedom for the denominator)}$$
$$MS = \text{mean square}$$
• If the overall effect is significant, the next step is to examine
the significance of the interaction effect. Under the null
hypothesis of no interaction, the appropriate F test is:
$$F = \frac{SS_{x_1 x_2}/df_n}{SS_{error}/df_d} = \frac{MS_{x_1 x_2}}{MS_{error}}$$

where

$$df_n = (c_1 - 1)(c_2 - 1)$$
$$df_d = N - c_1 c_2$$
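A minimal Python sketch, not part of the original text, that runs this two-factor analysis on the Table 16.2 data; it assumes pandas and statsmodels are available, and the column names are illustrative.

```python
# Minimal sketch (not from the text): two-way ANOVA on the Table 16.2 data,
# with coupon level and in-store promotion as factors plus their interaction.
import pandas as pd
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

data = pd.DataFrame({
    "coupon":    [1] * 15 + [2] * 15,
    "promotion": ([1] * 5 + [2] * 5 + [3] * 5) * 2,
    "sales":     [10, 9, 10, 8, 9, 8, 8, 7, 9, 6, 5, 7, 6, 4, 5,
                  8, 9, 7, 7, 6, 4, 5, 5, 6, 4, 2, 3, 2, 1, 2],
})

# Sales modeled on the two categorical factors and their interaction.
model = ols("sales ~ C(coupon) * C(promotion)", data=data).fit()
print(anova_lm(model))   # sums of squares, F statistics, and p-values
```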
Analysis of Covariance
• When examining the differences in the mean values
of the dependent variable related to the effect of the
controlled independent variables, it is often
necessary to take into account the influence of
uncontrolled independent variables.
• For example:
• In determining how different groups exposed to
different commercials evaluate a brand, it may be
necessary to control for prior knowledge.
• In determining how different price levels will affect
a household's cereal consumption, it may be
essential to take household size into account.
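A minimal Python sketch, not part of the original text, of an analysis of covariance on the Table 16.2 data; it assumes the clientele rating is treated as the covariate, and the column names are illustrative.

```python
# Minimal sketch (not from the text): ANCOVA on the Table 16.2 data, treating
# clientele rating as a metric covariate alongside the two categorical factors.
import pandas as pd
from statsmodels.formula.api import ols
from statsmodels.stats.anova import anova_lm

data = pd.DataFrame({
    "coupon":    [1] * 15 + [2] * 15,
    "promotion": ([1] * 5 + [2] * 5 + [3] * 5) * 2,
    "sales":     [10, 9, 10, 8, 9, 8, 8, 7, 9, 6, 5, 7, 6, 4, 5,
                  8, 9, 7, 7, 6, 4, 5, 5, 6, 4, 2, 3, 2, 1, 2],
    "clientele": [9, 10, 8, 4, 6, 8, 4, 10, 6, 9, 8, 9, 6, 10, 4,
                  10, 6, 8, 4, 9, 6, 8, 10, 4, 9, 4, 6, 10, 9, 8],
})

# The covariate enters as a metric term; the factors stay categorical.
model = ols("sales ~ clientele + C(coupon) * C(promotion)", data=data).fit()
print(anova_lm(model))   # covariate-adjusted sums of squares and F tests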
Issues in Interpretation
• Important issues involved in the interpretation of ANOVA
results include interactions, relative importance of factors, and
multiple comparisons.
Interactions
• An interaction occurs when the effect of one factor on the
dependent variable depends on the level of the other factor(s).
• In ordinal interaction, the rank order of the effects
attributable to one factor does not change across
the levels of the second factor.
• In disordinal interaction, the rank order of the effects
of one factor changes across the levels of another.
Patterns of Interaction
[Figure: Y plotted against the levels X11, X12, X13 of one factor, with separate lines for levels X21 and X22 of a second factor, in four panels:
Case 1: No Interaction; Case 2: Ordinal Interaction; Case 3: Disordinal Interaction (Noncrossover); Case 4: Disordinal Interaction (Crossover).]
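A small matplotlib sketch, not part of the original text and using made-up cell means, that illustrates these four patterns:

```python
# Minimal sketch (not from the text): illustrative cell means for the four
# interaction patterns; the numbers are invented purely for the picture.
import matplotlib.pyplot as plt

x1_levels = ["X11", "X12", "X13"]
cases = {
    "Case 1: No Interaction":            ([2, 4, 6], [4, 6, 8]),
    "Case 2: Ordinal Interaction":       ([2, 3, 4], [4, 6, 9]),
    "Case 3: Disordinal (Noncrossover)": ([2, 3, 6], [7, 8, 7]),
    "Case 4: Disordinal (Crossover)":    ([2, 5, 8], [7, 5, 3]),
}

fig, axes = plt.subplots(2, 2, figsize=(8, 6), sharey=True)
for ax, (title, (y_x21, y_x22)) in zip(axes.flat, cases.items()):
    ax.plot(x1_levels, y_x21, marker="o", label="X21")   # factor X2, level 1
    ax.plot(x1_levels, y_x22, marker="o", label="X22")   # factor X2, level 2
    ax.set_title(title)
    ax.set_ylabel("Y")
ax.legend()
plt.tight_layout()
plt.show()
```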
Relative Importance of Factors
• Experimental designs are usually balanced, in that
each cell contains the same number of
respondents.
• This results in an orthogonal design in which the
factors are uncorrelated.
• Hence, it is possible to determine unambiguously
the relative importance of each factor in
explaining the variation in the dependent variable.
• The most commonly used measure in ANOVA is
omega squared, ω².
• This measure indicates what proportion of the
variation in the dependent variable is related to a
particular independent variable or factor.
• The relative contribution of a factor X is calculated
as follows:

$$\omega_x^2 = \frac{SS_x - (df_x \times MS_{error})}{SS_{total} + MS_{error}}$$

For in-store promotion:

$$\omega_p^2 = \frac{106.067 - (2 \times 0.967)}{185.867 + 0.967} = \frac{104.133}{186.834} = 0.557$$
Note, in Table 16.5, that
SStotal = 106.067 + 53.333 + 3.267 + 23.2
= 185.867
Likewise, the ω² associated with couponing is:

$$\omega_c^2 = \frac{53.333 - (1 \times 0.967)}{185.867 + 0.967} = \frac{52.366}{186.834} = 0.280$$
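A minimal Python sketch, not part of the original text, that recomputes both ω² values from the sums of squares reported above; the error mean square of 0.967 is taken as 23.2 divided by its 24 degrees of freedom (N - c1c2 = 30 - 6).

```python
# Minimal sketch (not from the text): omega squared for each factor, using the
# sums of squares and error mean square reported from Table 16.5.
def omega_squared(ss_factor, df_factor, ms_error, ss_total):
    return (ss_factor - df_factor * ms_error) / (ss_total + ms_error)

ss_total = 106.067 + 53.333 + 3.267 + 23.2   # 185.867
ms_error = 23.2 / 24                          # ≈ 0.967

print(omega_squared(106.067, 2, ms_error, ss_total))  # promotion ≈ 0.557
print(omega_squared(53.333, 1, ms_error, ss_total))   # coupon    ≈ 0.280
```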