Nutrition & Food Science: Article Information
Nutrition & Food Science: Article Information
Access to this document was granted through an Emerald subscription provided by emerald-srm:126741 []
For Authors
If you would like to write for this, or any other Emerald publication, then please use our Emerald for Authors service
information about how to choose which publication to write for and submission guidelines are available for all. Please
visit www.emeraldinsight.com/authors for more information.
About Emerald www.emeraldinsight.com
Emerald is a global publisher linking research and practice to the benefit of society. The company manages a portfolio of
more than 290 journals and over 2,350 books and book series volumes, as well as providing an extensive range of online
products and additional customer resources and services.
Emerald is both COUNTER 4 and TRANSFER compliant. The organization is a partner of the Committee on Publication
Ethics (COPE) and also works with Portico and the LOCKSS initiative for digital archive preservation.
Department of Applied Consumer Studies, Queen paired with each other one. A series of the
Margaret College, Edinburgh, UK.
appropriate independent or related sample
tests can then be used, but from a statistical
Abstract
standpoint this approach is not recommend-
Describes statistical methods applied to three or more
ed, and is usually condemned. There is an
sample groups. Discusses analysis of variance in
increased risk of committing a Type I error
parametric forms and the requirement for experimental
(SFS IIIa) and rejecting a null hypothesis
design control before its application.
which is in fact true, i.e. accepting a result as
significant when it occurred by chance. This
arises because of the increased number of
comparisons being performed and applies to
any “multiple testing” circumstance. A statis-
tical test done with a single pair, at the 5 per
cent level involves a 5 per cent risk of commit-
ting a Type I error. This condition applies to
randomly selected samples and subsequent
randomly selected pairs which are indepen-
dent of any other pair. For experiments which
have three or more sample groups, pairing
would result in many more comparisons, e.g.
five treatment groups demand ten inter-
comparisons, not all independent of one
another, and these would be at an increased
risk level; for seven groups 21 tests would be
required and one of these would commit the
error. The procedure may be justified if the
experiment was organized in a paired or two-
sample manner, but this constitutes a differ-
ent type of design which would usually require
a different form of analysis. The separate
treatment “global” approach has organiza-
tional advantages. It is simpler and easier to
present a series of single treatments for analy-
sis or assessment than to prepare many sets of
Nutrition & Food Science
Number 2 · March/April 1997 · pp. 78–84 pairs, especially for sensory evaluation experi-
© MCB University Press · ISSN 0034-6659 mentation in food science.
78
Statistics for food science V: comparison of many groups (part A) Nutrition & Food Science
John A. Bower Number 2 · March/April 1997 · 78–84
Which analysis method is suitable for which depends on the scale of the experiment,
comparing more than two sample ranging from laboratory scale to full produc-
groups? tion scale. These experimental units are
For such experiments the recommended randomly allocated to the treatments which
statistical design and analysis approach is by are the specific settings or combinations of
analysis of variance (ANOVA). Irrespective of factor level(s). Any source of variation can be
treatment numbers the risk level is main- examined as an experimental factor. A
tained at the specified significance level. Some process condition such as temperature, or an
of the simpler designs where ANOVA is ingredient quantity in a food formulation
applied are given in Table I. Terminology of would be examples of quantitative factors
these designs and tests can vary slightly, where the variable is continuous. The levels of
depending on individual texts. Other impor- these factors would be the individual temper-
tant tests include tests based on chi-square atures or the ingredient weights respectively,
which can be applied to analysis of categorical i.e. different magnitudes of the factor. Factors
data from three or more groups. can also be qualitative, and set at two or more
Although experiments using these tech- levels where magnitude is not considered, e.g.
niques can appear to be straightforward in the type or identity of an ingredient such as
operation, the design features become the type of salt in a formulation. Here the
levels could be sodium chloride and potassi-
increasingly important and form the founda-
Downloaded by Auburn University At 06:58 22 February 2016 (PT)
Table I Some experimental designs analysed by ANOVA (comparison of three or more groups of data)
Some assumptions
Sample Measurement Population Parameter tested/
Design/test nature scale distribution(s) compared
Parametric ANOVA:
Completely randomized Independent Interval Normal Means of groups
design (CRD) 1-way
Randomized block design Related Interval Normal Treatment effect
(RBD) 2-way
Non-parametric:
Kruskal-Wallis 1-way Independent Ordinal Identical except Distribution; median
ANOVA (CRD) for location
Friedman’s 2-way ANOVA Related Ordinal None Distribution; central
(RBD) tendency (median)
79
Statistics for food science V: comparison of many groups (part A) Nutrition & Food Science
John A. Bower Number 2 · March/April 1997 · 78–84
becomes. Consideration must be given to all As with other statistical tests there are both
other sources of variation and uncontrolled parametric and non-parametric forms of
factors which might affect validity. If possible, ANOVA (Table I). Much of the explanation
these must be controlled or limited using below applies to parametric analysis of vari-
design techniques such as randomization, ance. Parametric ANOVA can also be applied
blocking and replication. Alternatively, there to two data set comparisons in which case it is
are statistical procedures which can circum- equivalent to a t-test, but the commonest
vent these problems or allow some quantifica- application is for three or more groups.
tion of them so that they can be taken into
account in eventual analysis. The importance
What does ANOVA do? – the ANOVA
of replication has been discussed several times
in this series and blocking is dealt with below
method
where design types are examined. Random- The design details above indicate that sources
ization has been proven to reduce error and of variation are assigned under various factors
bias caused by taking a systematic, non- and levels. The technique of ANOVA parti-
random approach to organization of experi- tions this source of total variation into its
mental stages. The technique involves ran- component parts, enabling an assessment of
domization of the order in which stages and the magnitude of effect of each source on the
individual treatments are performed or allo- response(s). Essentially ANOVA compares
cated, e. g. in sensory experiments designs to variability between treatments with that with-
Downloaded by Auburn University At 06:58 22 February 2016 (PT)
counteract order and other bias effects such as in treatments. The between treatment varia-
carry-over should be employed, etc. tion is caused by the nature of the treatments,
Adequate controls or base line measures e.g. the difference in an analytical, instrumen-
should be included so that the test results can tal or sensory measure caused by different
be gauged relative to these. A control treat- food processes or formulations. The within
ment is often represented by an experimental treatment variation is due to the variation in
unit which is actually “untreated”. In an the values for the replicates within each treat-
experiment studying the effect of processing
ment group, caused by several error effects.
conditions the control could be a unit consist-
This error effect is also referred to as the
ing of raw unprocessed food. Space precludes
residual variation, i.e. the “ left over” variation
giving a fuller account of experimental design
which cannot be accounted for by treatment
at present. Some features of an experiment in
food science research[1] which employed a or factor effects. The variance (SFS I) from
number of these design features is illustrated these two sources is used to calculate two
in Table II. different estimates of population variance.
The control for this experiment was a These measures, referred to in ANOVA ter-
formulation which excluded the dairy ingredi- minology as mean squares (MS), are calculat-
ents (the main factor of interest). A large ed for these data groups and their relationship
number of experimental runs were required to is expressed as a variance ratio (F) :
accommodate the many factors and levels
causing the researchers to chose partial MS of treatments
F=
replication, albeit of randomly allocated full MS of error
treatments.
The F ratio for the experiment is compared set there is likely to be more than one source
with tabular F values, calculated for different of variation: one is due to random error at the
levels of significance over a range of experi- measurement stage – end measurement varia-
mental conditions, i.e. the numbers of treat- tion or analytical error; the other is caused by
ments and replicates, expressed as the degrees sampling variation due to inherent variation
of freedom. If the experimental F ratio
within the food material. The magnitude of
exceeds the critical F value (usually at the 5
per cent level or less) then a statistically signif- this latter variation will depend on the food,
icant result is obtained for the data as a whole. e.g. a series of experimental units drawn from
If the null hypothesis is true, i.e. there is no freshly mixed whole milk will be more homo-
evidence to suggest that the population means geneous in chemical composition than those
are not equal, then within the context of the drawn from a day’s harvest of root vegetables.
experiment, the treatments are not demon- Whether or not both these sources of variation
strating a measurable effect. In this case the are measured and hence make an impact on
two estimates of population variance (via the the statistical analysis, depends on procedure.
treatment measures and the background If a treatment is applied once and several
error) will be similar and the F ratio will be
samples are taken for response measurement,
low – theoretically equal to unity.
then the variability of this set of results usually
If the null hypothesis is false and there is a
reflects the analytical variance. Any inherent
real and detectable effect due to the treat-
variation effect will depend on the form of the
Downloaded by Auburn University At 06:58 22 February 2016 (PT)
problems the experimenter must ensure What if the ANOVA assumptions are
adequate sample size and some degree of broken and how can the data be
full treatment repetition within the assessed?
experiment.
As stated previously (SFS IIIa) such assump-
In sensory experiments this procedure is
tions need only be approximate, and proce-
often not followed and it is common practice
dures such as transformation can be applied
to draw multiple servings from single product to allow data to “qualify” in this respect[4] .
batches for panellist assessment. Although The experimenter is recommended to estab-
more work and planning would be involved, it lish some awareness of the status of the data,
is recommended[3] that full replication of especially as they may reveal additional infor-
each production run is included in the design. mation – spurious data, data entry errors,
hidden trends, etc. Visual examination can
often show trends if the data are of small size,
What assumptions does ANOVA
e.g. inconsistency between replicates, obvious
make? differences in treatment means and even
The assumptions for this analysis method are suspicion of non-additivity. Suggested checks
similar to those of the t-test (SFS IV) in terms would include inspection by graphical means,
of normality of distribution and equality of to allow an impression of distribution shape
and characteristics (if sample size permits), as
variance between groups, etc. If the same
Downloaded by Auburn University At 06:58 22 February 2016 (PT)
Table III Potassium content (mg/100g wet basis) of raw potato for replicate end analyses (atomic absorption
spectroscopy) and replicate treatments (raw controls)
ANOVA can only be viewed as indicative of give more focused attention on the main
differences, and non-parametric ANOVA treatment factor. Thus, on analysis the proce-
should be used if an appropriate design is dure for “2-way ANOVA” is applicable. The
available, otherwise the parametric form block effect can be any factor which is likely to
(which is more powerful) is applicable. Other have a potentially uncontrolled effect on the
important points include the general design response – e.g. if the experiments require to
features of the experiment in respect of be done over several days, then each day could
whether the samples are related or indepen- constitute a block. The blocks could also be a
dent (SFS IV). This depends on the experi- series of different process machines, different
mental objectives and any limits which are laboratories, analysts or indeed, as above,
imposed and needs to be decided at the plan- panellists. In all these cases the variation due
ning stage. to different days, machines, or panellists, etc.
If independent sample groups are warrant- would be calculated and removed from the
ed then a completely randomized design error term. An improved estimate of treat-
(CRD) is applicable with each experimental ment effects is obtained and the RCB design
unit being randomly allocated to a group. In is one of the commonest design types, possi-
its simplest form this design deals with varia- bly because of its relatively simple manner of
tion of one factor and the analysis method is increasing experimental precision.
referred to as “1-way ANOVA”. It is com- Beyond these simple designs the number of
Downloaded by Auburn University At 06:58 22 February 2016 (PT)
monly applied to simple experiments in food factors which can be examined is theoretically
science, involving chemical, physical or unlimited but practical considerations limit
instrumental measure changes caused by most experiments to a selected few. Thus
factors such as processing, etc. For effective there are 3-way, 4-way designs, etc., the com-
use the experimental units must be homoge- plexity of calculation increasing with the
neous or “equally variable” within limits of number of factors. There are distinct advan-
inherent variation. Food experiments where tages in examining the effect of several factors
the response measurement is by objective or together rather than one at a time. This
instrumental techniques can often meet this approach is ultimately more economical of
requirement. In the case of sensory experi- experimental time and resources. In addition
ments, the CRD would require separate to an assessment of main effects, such designs
groups of panellists for each treatment and which include replication allow detection of
they would be assumed to be uniform in their interaction between factors. The presence of
ratings[6]. This is less likely and limits the use significant interaction indicates that factors
of the CRD for much sensory work. Such a are not operating independently but depend
design could apply in a consumer study where on the levels of other factors. More complex
each consumer was only able to be tested designs are available to cover a wide variety of
once, or possibly in storage studies where the experimental demands[7,8]. For non-para-
original panellists were not available for each metric analysis the options are more limited,
assessment stage during storage. The problem and for the more complex designs there may
is avoided by use of a related group design be no readily available non-parametric equiv-
where the panellist effect is viewed as a second alent.
factor. If a CRD is used in sensory studies it is The analysis method and the information
recommended[6] that in order to counteract obtained in all these designs also depends on
panellist variation a large number(> 100) of the exact nature of the treatments. If these are
consumers is recruited. The CRD does not under direct control and are selected by the
require equal numbers within each treatment experimenter they are described as fixed
group which could be a useful feature in the effects and subsequent inferences only apply
above sensory studies. to those factors and levels. A random-effects
The related groups circumstance of a model, with random selection of factors and
single sensory panel assessing several treat- levels, allows inference to a wider population
ments requires a block design such as the of possible factors and levels. The ANOVA
randomized complete block (RCB), with data calculation differs for these two cases and
items in each treatment group having corre- looks at means and variances respectively . A
sponding members in each other group. The mixed-effects model is also possible with both
variation in the blocks could be considered as fixed and random factors. The fixed-effect
another factor but is usually removed so as to model is the more common. Random effects
83
Statistics for food science V: comparison of many groups (part A) Nutrition & Food Science
John A. Bower Number 2 · March/April 1997 · 78–84
would apply to a sensory study where a 3 Meilgaard, M., Civille, G. V. and Carr, B.T., Sensory
randomly selected panel of consumers was Evaluation Techniques, 2nd ed., CRC Press, Boca
enlisted to measure product preference. The Raton, FL, 1991, pp. 257-62.
random-effects model would allow inferences 4 Steel, R. G. D. and Torrie, J. H., Principles and
Procedures of Statistics – a Biometrical Approach, 2nd
to be made about the consumer population’s
ed. McGraw-Hill International, Singapore, 1981,
preference attitudes. pp.167-71.
Irrespective of the number of factors, 5 Huck, S. W. and Cormier, W. H., Reading Statistics and
replicates, etc. one or more F ratio values are Research, 2nd ed., HarperCollins College Publishers,
obtained, each stemming from a source of New York, NY, 1996.
variation (factor effects). If the F ratio is 6 Gacula, M. C., Design and Analysis of Sensory Opti-
sufficiently large and its P value is equal to or mization, Food & Nutrition Press, Trumbull, CT, 1993,
less than the stated α, then a significant effect pp. 29-34.
has been detected for the experiment as a 7 Box, G. E. P., Hunter, W. G. and Hunter, J.S., Statistics
whole. for Experimenters, John Wiley & Sons, New York, NY,
1978.
8 Cochran, W. G. and Cox, G. M., Experimental Designs,
Does ANOVA identify individual 2nd ed., John Wiley & Sons, New York, NY, 1957.
treatment differences?
Downloaded by Auburn University At 06:58 22 February 2016 (PT)
84
This article has been cited by: