0% found this document useful (0 votes)
16 views32 pages

CH 7 Correlation 2

Uploaded by

Harmanjeet Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views32 pages

CH 7 Correlation 2

Uploaded by

Harmanjeet Singh
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 32

CORRELATION

ANALYSIS
It is a statistical
measure which shows
relationship between two or
morevariable
moving in the same or
in opposite direction
08/01/12 1
Visual Displays and
Correlation • Correlation
Analysis
Analysis
• The sample correlation coefficient (r) measures the
degree of linearity in the relationship between X
and Y.
-1 < r < +1
Strong negative relationship Strong positive relationship

• r = 0 indicates no linear relationship


• In Excel, use =CORREL(array1,array2),
where array1 is the range for X and array2 is
the range for Y.
Types of correlation

correlation

Simple ,
positive Linear
& negative & non-linear
multiple
& 3

08/01/12
partial
Methods of correlation

• Scatter diagram
• Product moment or covariance
• Rank correlation

08/01/12
Scatter diagram

• Perfectly
+ve

08/01/12
Less-degree +ve

Weak Positive
Correlation

08/01/12
High degree +ve

Strong Positive
Correlation

08/01/12
Perfectly -ve

08/01/12
High degree -ve

Strong Negative Correlation

08/01/12
Less degree -ve

Weak Negative
Correlation
10

08/01/12
Zero degree

11

08/01/12
Karl Pearson correlation
coefficient

ο .
cov(x,
x σ y)
y
r=

r x.y

=
x2 . y2
∑ ∑
12

08/01/12
Where

x=X −X
and
y =Y −Y

13

08/01/12
problem

From the following data find the


coefficient of correlation by Karl
Pearson method
X:6, 2, 10, 4,8.
Y:9, 11, 5, 8, 7.
Sol.

X Y X-6 Y-8 x. y
x2 y2

6 9 0 1 0 1 0
2 11 -4 3 16 9 -12
10 5 4 -3 16 9 -12
8 8 -2 0 4 0 0
4 7 2 -1 4 1 -2
30 40 0 0 40 20 -26
Sol.cont.

∑X
X =
= 30 = 6
N 5
N 5
∑Y 40
Yr = x.y = − 26 = − 26
==8
∑ =

∑ x .∑ y
2 2
≈ −0.92
40.20 800
Direct method

N. XY − X. Y
∑ ∑ ∑
r
= [N ∑ X − ( ∑ X ) ].
2 2

[N.∑ Y − (∑ Y ) ]
2 2

17

08/01/12
Short-cut method

N dx .d y − dx .
r ∑ ∑ ∑
2. N2 d
y − (
N ∑ d dx ∑ ∑ ∑ y
2 2
= x y
−( d
d
) )
18

08/01/12
Where

dx =X−A
&
dy =Y−A
A = assume
mean

19

08/01/12
Product moment method

r b .b
xy yx
=
where

b xy = xy

∑ y 2

b yx =∑ xy
∑x 2 20

08/01/12
spearman’s Rank correlation

R= 6 D2
1− N ∑
(N 2

where −1)
D = Rx − Ry
Rx =
rank.of .X Ry
=
21

08/01/12
problem

Calculate spearman’s rank correlation


coefficient between advt.cost & sales from
the following data
Advt.cost : 39 65 62 90 82 75 25 98 36 78
Sales(lakhs): 47 53 58 86 62 68 60 91 51 84
X Y R-x R-y D
D2
Sol. 39 47 8 10 -2 4
65 53 6 8 -2 4
62 58 7 7 0 0
90 86 2 2 0 0
82 62 3 5 -2 4
75 68 5 4 1 1
25 60 10 6 4 16
98 91 1 1 0 0
36 51 9 9 0 0
78 84 4 3 1 1
30
Sol.cont.

R = 6 D2
1− N∑3

N 6.30
⇒R=
1− 10 3
−10
⇒ R = 1−
2
11
9
⇒ R = 11
= 0.82
In case of equal rank

6
⎨ D 2 1 m3(  (
1 3
m−


⎩ 12 + ........ ⎭
R=
1−

∑ + 12 − mN N)(
2 m )
where −1)
m = no.of repeated
items

08/01/12 25
problem
A psychologist wanted to compare two methods
A & B of teaching. He selected a random
sample of 22 students. He grouped them into
11 pairs so that the students in a pair have
approximately equal scores in an intelligence
test. In each pair one student was taught by
method A and the other by method B and
examined after the course. The marks
obtained by them as follows
Pair:1 2 3 4 5 6 7 8
9 10 11
A: 24 29 19 14 30 19 27 30 20 28 11
B: 37 35 16 26 23 27 19 20 16 11 21
A B R-A R-B D D2
Sol. 24 37 6 1 5 25
29 35 3 2 1 1
19 16 8.5 9.5 -1 1
14 26 10 4 6 36
30 23 1.5 5 -3.5 12.25
19 27 8.5 3 5.5 30.25
27 19 5 8 -3 9
30 20 1.5 7 -5.5 30.25
20 16 7 9.5 -2.5 6.25
28 11 4 11 -7 49
11 21 11 6 5 25
225
Sol.cont.

in A series the items19 & 30 are repeated twice and in B


series16 is repeated twice∴

+ 2(4 −1) + 2(4 −1) +


6⎡⎢ D2 12 12 12 ⎥
R= ∑ 2(4 −1) ⎤
1- ⎣ 11(121−1) ⎦
⇒R=
−0.0225
Properties of correlation coefficient

• r always lies between +1 & -1


i.e. -1<r<+1
• Two independent variables are
uncorrelated but converse is not true
• r is independent of change in origin and
scale
• r is the G.M. of two regression coefficients
• r is symmetrical 29

08/01/12
Probable error
1−r 2
SE(r )
= n
PE(r ) = 0.6745×SE(r )
or
1−r 2
PE(r ) = 0.6745×
n
30
m
08/01/12 o
The coefficient of determination

► It is the primary way we can measure the extent or


strength of the association that exists between two
variables x & y . Because we have used a sample
of points to develop regression lines .

2
It is denoted by r
Thank
you

08/01/12 32

You might also like