0% found this document useful (0 votes)
43 views14 pages

6.5 Analysis of Variance (ANOVA)

The document discusses Analysis of Variance (ANOVA), a statistical technique used to partition total variation in data into components to analyze means. It outlines the null and alternative hypotheses, assumptions for One Way ANOVA, and the calculation of Between and Within Groups Variation. Additionally, it explains the F-statistic, its interpretation, and the use of post-HOC tests for identifying differences in means.

Uploaded by

sergekouassi065
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views14 pages

6.5 Analysis of Variance (ANOVA)

The document discusses Analysis of Variance (ANOVA), a statistical technique used to partition total variation in data into components to analyze means. It outlines the null and alternative hypotheses, assumptions for One Way ANOVA, and the calculation of Between and Within Groups Variation. Additionally, it explains the F-statistic, its interpretation, and the use of post-HOC tests for identifying differences in means.

Uploaded by

sergekouassi065
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Biostatistics & Research Methodology

2021/22

Block 6 – Lecture 5

Analysis of Variance
(ANOVA)
Aktham Osama Abdulazeez, MBChB
Analysis of Variance (ANOVA)
o It is a technique where a total variation present in a set of
data is partitioned into several components, each component
has its specific variation.

o In ANOVA, inferences about means are made by analyzing


variance.

o ANOVA can ascertain the magnitude of the contribution of


each of these sources to the total variation.
Analysis of Variance (ANOVA)
o The null hypothesis is that the population means are all equal.
i.e. the observed samples belong to a single population.

o The alternative hypothesis is that at least one of the


population means is different. They don’t all have to be
different, just one of them.
One Way ANOVA - Assumptions
o The populations have approximately normal distributions.
o The populations have the same variance.
o The samples are simple random samples.
o The samples are independent of each other.
o The different samples are from populations which are
categorized in only one way.
Analysis of Variance (ANOVA)
o Variation is the mean sum of the squares of the deviations
between data values and the mean.

o ANOVA measures two sources of variation in the data and


compares their relative sizes; Between groups and Within
groups variation.
ANOVA
Between Groups
Variation
o Sum of squares: computing
for each group the squared
deviation of the group mean
from the grand mean and to
multiply the result by the size
of the group.

o df : count of groups – 1
Within Groups Variation
o Sum of squares:
computing within each
group the sum of the
squared deviations of the
individual observation
from their mean.

o df : total sample size for


all groups – count of
groups
F Statistic (ANOVA)
o The ANOVA (F-statistic) is a ratio of the Between Group Variation (mean
sum of squares) divided by the Within Group Variation.

o Numerator df = k – 1
o Denominator df = N – k

o A large F is evidence against H0, since it indicates that there is more


difference between groups than within groups.
F Distribution
o The F-distribution is not symmetric;
it is skewed to the right.

o The values of F can be 0 or positive,


they cannot be negative.

o There is a different F-distribution for


each pair of degrees of freedom for
the numerator and denominator.

o Critical values of F are given in F


Tables.
ANOVA - Interpretation
o Rejection of the null hypothesis means there is a statistically significant
difference in mean score between at least one group compared to others.

o It is not possible to identify where exactly is the difference in mean.

o Repetitive use of Student’s t-test to localize the difference is imprudent, as


repeat testing on the same group inflates the total alpha error beyond 5%.

o For this purpose, post-HOC tests can be used to localize the difference in
means:
- Bonferroni t‐test
- Tukey’s HSD Test
Example: ANOVA
o The lecture classroom is divided into three rows: front,
middle, and back. The instructor noticed that the further the
students were from him, the more likely they were to miss
class or use an instant messenger during class.
o He wanted to see if the students further away did worse on
the exams.
o The scores of a quiz test were tabulated for students sitting in
each row.
Thank You
Questions? Ask in the group or note it down for our next
live session!
Hope to see you next time ☺

You might also like