Correlation
Correlation
Analysis
Unit 3
Correlation analysis
• Correlation analysis is a statistical method used to
evaluate the strength of relationship between two
quantitative variables. A high correlation means that
two or more variables have a strong relationship with
each other, while a weak correlation means that the
variables are hardly related.
• Scatter Diagram
• Coefficient of Correlation
Scatter Plot
• A scatter plot is a type of plot or mathematical diagram using
Cartesian coordinates to display values for typically two
variables for a set of data. If the points are coded, one
additional variable can be displayed.
Meaning of
Correlation
X Y
3 11
7 16
4 9
2 4
1 7
4 6
1 3
2 8
Regression
Unit 3
Regression Analysis
• Regression analysis is a set of statistical
methods used for the estimation of
relationships between a dependent variable
and one or more independent variables. It
can be utilized to assess the strength of the
relationship between variables and for
modeling the future relationship between
them.
Regression Lines
• The Regression Line is
the line that best fits the
data, such that the overall
distance from the line to
the points (variable
values) plotted on a graph
is the smallest. In other
words, a line used to
minimize the squared
deviations of predictions is
called as the regression
line.
The functional relation developed
between the two correlated
variables are called regression
equations.
n Analysis
The value of the residual (error) is zero.
– Linear
model The value of the residual (error) is constant across all
observations.
assumptio The value of the residual (error) is not correlated across
ns all observations.
Regression
using the following equation:
Y = a + bX + ϵ
Analysis – Where:
Regression
used in the model. The mathematical
representation of multiple linear
regression is:
Multiple Where: