Unit - 5 Multivariate Analysis
Unit - 5 Multivariate Analysis
MULTIVARIATE &
TIME SERIES
ANALYSIS
Introducing a Third Variable - Causal
Explanations - Three-Variable Contingency
Tables and Beyond - Longitudinal Data –
Fundamentals of TSA – Characteristics of
time series data – Data Cleaning – Time-
based indexing – Visualizing – Grouping –
Resampling.
MULTIVARIATE ANALYSIS
Multivariate analysis (MVA) involves
evaluating multiple variables (more than
two) to identify any possible association
among them.
It allows more than two variables at a
time.
WHY DO COMPANIES USE
MULTIVARIATE ANALYSIS?
Conducting multivariate analyses can help companies
forecast future opportunities, risks, and demand for
products.
This helps with investment strategies, business decisions,
and setting expectations.
A company may also use MVA to gain new insights.
This could include uncovering new customer targets or
identifying market patterns that exist during certain times of
the year or hours of the day.
MULTIVARIATE DATA
ANALYSIS
There are many different techniques for
multivariate analysis, and they can be divided
into two categories:
1. Dependence techniques
2. Interdependence techniques
DEPENDENCE METHODS
Dependence methods are used when one or some of
the variables are dependent on others.
Dependence looks at cause and effect; in other
words, can the values of two or more independent
variables be used to explain, describe, or predict the
value of another, dependent variable?
Example, the dependent variable of “weight” might
be predicted by independent variables such as
“height” and “age.
INTERDEPENDENCE METHODS
Interdependence methods are used to understand
the structural makeup and underlying patterns
within a dataset.
In this case, no variables are dependent on others, so
you’re not looking for causal relationships.
Rather, interdependence methods seek to give
meaning to a set of variables or to group them
together in meaningful ways.
One is about the effect of certain variables on others,
while the other is all about the structure of the
dataset.
PRINCIPLE OF CASUAL EXPLANATION
Causal explanation is valuable because it explains how and
why an effect occurs, and consequently, provides
information regarding when and where the relationship can
be replicated.
To discover whether X causes Y, or whether the independent
variable causes changes in the dependent variable.
An explanation is an answer to a why-question, and so a
causal explanation is an answer to ‘Why X?’ that says
something about the causes of X.
For example, ‘Because it rained’ as an answer to ‘Why is the
ground wet?’ is a causal explanation.
CORRELATION AND CAUSATION
CORRELATION AND CAUSATION
Causation means one thing causes another—in
other words, action A causes outcome B.
On the other hand, correlation is simply a
relationship where action A relates to action B—
but one event doesn’t necessarily cause the
other event to happen.
THREE VARIABLE CONTINGENCY
TABLES & BEYOND
Three-variable contingency tables involve three binary or
categorical variables.
We described two-way contingency tables earlier as
having I rows and J columns, forming an I × J matrix.
If a third variable is included, the three-way contingency
table is described as I × J × K, where I is the number of
rows, J is the number of columns, and K is the number of
superordinate columns each level containing a 2 × 2
matrix
EXPLANATORY VARIABLE VS.
RESPONSE VARIABLE