Correlation Regression L2
Correlation Regression L2
• Rectangular coordinate
• Two quantitative variables
• One variable is called independent (X) and the
second is called dependent (Y)
• Points are not joined Y
* *
• No frequency table *
X
Example
Wt. 67 69 85 83 74 81 97 92 114 85
(kg)
SBP 120 125 140 160 130 180 150 140 200 130
mmHg)
Wt. 67 69 85 83 74 81 97 92 114 85
(kg)
SBP 120 125 140 160 130 180 150 140 200 130
(mmHg)
➢ negative relationship
➢ no relationship
POSITIVE RELATIONSHIP
18
16
14
12
Height in CM
10
0
0 10 20 30 40 50 60 70 80 90
Age in Weeks
NEGATIVE RELATIONSHIP
Reliability
Age of Car
NO RELATION
CORRELATION COEFFICIENT
➢ If r = l = perfect correlation.
How to compute the simple correlation coefficient (r)
x y
xy −
n
r=
( x) 2
( y) 2
x −
2 . y −
2
n n
EXAMPLE:
A sample of 6 children was selected, data about their age in
years and weight in kilograms was recorded as shown in the
following table . It is required to find the correlation
between age and weight.
xy − x y
r = n
x2 −
( x) 2
. y 2 −
( y) 2
n n
Age Weight
Serial
(years) (Kg) xy X2 Y2
n.
(x) (y)
1 7 12 84 49 144
2 6 8 48 36 64
3 8 12 96 64 144
4 5 10 50 25 100
5 6 11 66 36 121
6 9 13 117 81 169
Total ∑x= ∑y= ∑xy= ∑x2= ∑y2=
41 66 461 291 742
41 66
461 −
r= 6
(41)2 (66)2
291 − .742 −
6 6
r = 0.759
strong direct correlation
EXAMPLE: RELATIONSHIP BETWEEN ANXIETY
AND TEST SCORES
Anxiety Test score X2 Y2 XY
(X) (Y)
10 2 100 4 20
8 3 64 9 24
2 9 4 81 18
1 7 1 49 7
5 6 25 36 30
6 5 36 25 30
∑X = 32 ∑Y = 32 ∑X2 = 230 ∑Y2 = 204 ∑XY=129
CALCULATING CORRELATION COEFFICIENT
r = - 0.94
➢ Square each di and compute ∑di2 which is the sum of the squared
values.
5. Apply the following formula
6 (di) 2
rs = 1 −
n(n 2 − 1)
∑ di2=64
6 64
rs = 1 − = −0.1
7(48)
Comment:
There is an indirect weak correlation between level of education
and income.
EXERCISE
REGRESSION ANALYSES
➢ Correlation
describes the strength of a linear relationship
between two variables
ŷ = a + bX
x y
xy −
ŷ = y + b(x − x) bb1 = n
( x) 2
x 2
−
n
REGRESSION EQUATION
200
180
160
140
120
100
80
Wt (kg)
60 70 80 90 100 110 120
LINEAR EQUATIONS
ŷ = a + bX
Y = bX + a
Change
b = Slope in Y
Change in X
a = Y-intercept
X
HOURS STUDYING AND GRADES
REGRESSING GRADES ON HOURS
Linear Regression
90.00 Final grade in c ourse = 59.95 + 3 .17 * study
R-Squa re = 0.88
Final grade in course
80.00
70.00
41 66
461 −
b= 6 = 0.92
2
(41)
291 −
6
Regression equation
−
x 2
n
41678 −
20
ŷ =112.13 + 0.4547 x
for age 25
B.P = 112.13 + 0.4547 * 25=123.49 = 123.5 mm hg
MULTIPLE REGRESSION
Mean
VARIANCE
2
-
VARIANCE
2
-
2
-
VARIANCE
1
---------------- ……… 2 2
+ - + - + ………
No. of Data
Points
VARIANCE FORMULA
STANDARD DEVIATION
Positive
Relation
COVARIANCE
COVARIANCE
COVARIANCE
COVARIANCE
COVARIANCE
Negative
Relation
COVARIANCE
COVARIANCE
No
Relation
COVARIANCE MATRIX
COVARIANCE MATRIX
COVARIANCE MATRIX