Correlation Analysis1
Correlation Analysis1
• Introduction
• Significance of the study of Correlation
• Types of Correlation
• Methods of Studying Correlation
Correlation
• “Correlation” means relationship between the
values of two or more variables of the same
phenomenon or individual series. Correlation
indicates the strength of the relationship
between two variables.
• For ex: If we measure the heights and weights
of ‘n’ individuals, we assume two values – one
relating to height and the other relating to
weights.
• The statistical tool with the help of which the
relationship between two or more variables
are studied is called Correlation. The measure
of correlation is called the Correlation
Coefficient, and it tells us in one figure the
direction and degree of correlation.
Over
Number of Formal Clothing Outfits
28
26
24
22
20
18
16
14
12
10
8
6
4
2
Over
10
15
20
25
30
35
40
45
50
55
60
65
70
5
Over
30
28
Number of CDs in house
26
24
22
20
18
16
14
12
10
8
6
4
Over
20
10
15
20
25
30
35
40
45
50
55
60
65
70
5
Over
30
Paper plates used monthly
28
26
24
22
20
18
16
14
12
10
8
6
4
20
Over
10
15
20
25
30
35
40
45
50
55
60
65
70
5
Positive Correlation
X Y
90 40
80 30
70 20
60 10
50 5
Negative Correlation
X Y
10 40
20 30
30 20
40 10
50 5
Negative Correlation
X Y
100 10
90 20
80 30
70 40
60 50
• Simple, Partial and Multiple correlation.
The distinction between Simple, Partial and
Multiple correlation is based upon the
number of variables studied. When only two
variables are studied it is simple correlation.
When three or more variables are studied it is
either multiple or partial correlation.
• Linear and Non-linear (Curvilinear) Correlation
The distinction between linear and non-
linear correlation is based upon the constancy
of the ratio of change between variables. If
the amount of change in one variable tends to
bear a constant ratio to the amount of change
in the other variable, then the correlation is
said to be Linear.
If the change between the two variables is the
same, then such variables if plotted on a
graph paper, all the plotted points would fall
on a straight line.
• Non Linear or Curvilinear Correlation
Correlation would be called non-linear or curvi
linear if the amount of change in one variable
does not bear a constant ratio to the amount
of change in the other variable.
Properties of Coefficient of
Correlation
• The value of the Karl Pearson’s Coefficient of
Correlation lies between +1 or -1. It cannote
be greater than 1 in any case.
• The Pearson’s Coefficient of Correlation is
independent of change of scale and origin of X
and Y variables.
• Pearson’s Coefficient of Correlation is the
geometric mean of regression coefficients ie.,
Correlation and Causation
Merits of Karl Pearson’s Coefficient
of Correlation
• Among the mathematical methods used for
measuring the degree of relationship , Karl
Pearson’s method is the most popular
method.
• The Pearson’s Coefficient of Correlation
summarises in one figure the degree of
correlation, but also it’s direction that is,
whether correlation is positive or negative.
Limitations of Pearson’s Coefficient
of Correlation
• The Pearson’s Coefficient of Correlation
always assumes linear relationship, regardless
of the fact whether assumption is correct or
not.
• The value of Pearson’s Coefficient of
Correlation is unduly affected by extreme
values.
• This method takes more time to compute the
value when compared to the other methods.