Chapter Five
Chapter Five
Types of ANOVA
ANOVA assumptions
ANOVA analysis
ANOVA Terminology
Types of ANOVA
One-way ANOVA
Two-way ANOVA
ANOVA assumptions
ANOVA relies on the main assumptions that must be met for the
test results to be valid.
Normality
The first assumption is that the groups each fall into what is called
a normal distribution. This means that the groups should have a
bell-curve distribution with few or no outliers.
Homogeneity of variance
Independence
Why is that useful? Because when you understand how the means
of each group in your independent variable differ, you can begin to
understand which of them has a connection to your dependent
variable (such as landing page clicks) and begin to learn what is
driving that behavior.
You could also repeat this test multiple times to see whether or not
a single independent variable (such as temperature) affects multiple
dependent variables (such as purchase rates of sun cream,
attendance at outdoor venues and likelihood to hold a cook-out)
and if so, which ones.
Using resources
ANOVA doesn’t just tell you that differences exist between groups –
it can also reveal the interaction between different variables. This
can help businesses better understand complex relationships and
dynamics, leading to more effective interventions and strategies.
ANOVA application
Does the time of year and type of product have an effect on the
sales of a company?
Anova Formula
Analysis of variance, or ANOVA, is a strong statistical technique
that is used to show the difference between two or more means or
components through significance tests. It also shows us a way to
make multiple comparisons of several populations means. The
Anova test is performed by comparing two types of variation, the
variation between the sample means, as well as the variation within
each of the samples.
F = MST/MSE
1. Where,
F = Anova Coefficient.
Limitations of ANOVA
ANOVA can only tell if there is a significant difference between the
means of at least two groups, but it can’t explain which pair differs
in their means. If there is a requirement for granular data,
deploying further follow up statistical processes will assist in
finding out which groups differ in mean value. Typically, ANOVA is
used in combination with other statistical methods.
ANOVA also makes assumptions that the dataset is uniformly
distributed, as it compares means only.
Chapter 6
Correlation and Simple Linear Regression
Introduction
CORRELATION
(r = 1) (r = 0) (0 < r < 1)
d) negative correlation
Types of Correlations
If the data points assume an oval pattern, the r value is somewhere
between 0 and 1, and a moderate relationship is said to exist. A
positive correlation (Figure c) occurs when the dependent variable
increases as the independent variable increases. A negative
correlation (Figure d) occurs when the dependent variable increases
as the independent variable decreases or vice versa.
REGRESSION
Y = a + bX
where Y is the dependent variable, X is the single independent
variable, a is the Y-intercept of the regression line, and b is the
slope of the line (also known as the regression coefficient).
Once the equation has been derived, it can be used to predict the
change in Y for any change in X. It can therefore be used to
extrapolate between the existing data points as well as predict
results which have not been previously observed or tested.