0% found this document useful (0 votes)
79 views97 pages

18MAB303T - Regression

This document discusses regression analysis and the key concepts of regression lines. It introduces regression lines where Y is dependent on X (Y=a+bX) and where X is dependent on Y (X=a+bY). It defines the independent and dependent variables, intercept, slope, and regression coefficients (bxy and byx) in regression lines. Formulas for calculating the regression coefficients are also provided.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
79 views97 pages

18MAB303T - Regression

This document discusses regression analysis and the key concepts of regression lines. It introduces regression lines where Y is dependent on X (Y=a+bX) and where X is dependent on Y (X=a+bY). It defines the independent and dependent variables, intercept, slope, and regression coefficients (bxy and byx) in regression lines. Formulas for calculating the regression coefficients are also provided.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 97

18MAB303T - BioStatistics for Biotechnologists

Unit - V
Regression Analysis

Dr. E. Suresh,
Assistant Professor, Department of Mathematics,
SRM Institute of Science and Technology,
Kattankulathur - 603203.
Regression Analysis

Regression

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Analysis

Regression

Mathematical Relationship

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Analysis

Regression

Mathematical Relationship
Estimation

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Analysis

Regression

Mathematical Relationship
Estimation

Regression analysis is a set of statistical processes for


estimating the relationships between a dependent
variable (often called the ’outcome’ or ’response’) and one
or more independent variables (often called ’predictors’,
’covariates’, ’explanatory variables’ or ’features’).
Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists
Regression Line Y on X is

Y = a + bX

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line Y on X is

Y = a + bX

1 X is indepedent variable

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line Y on X is

Y = a + bX

1 X is indepedent variable
2 Y is depedent variable

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line Y on X is

Y = a + bX

1 X is indepedent variable
2 Y is depedent variable
3 a is intercept

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line Y on X is

Y = a + bX

1 X is indepedent variable
2 Y is depedent variable
3 a is intercept
4 b is slope

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line Y on X is

Y = a + bX

1 X is indepedent variable
2 Y is depedent variable
3 a is intercept
4 b is slope
5 b is regression coefficient of Y on X . i.e., byx

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is
 σy 
Y −Y =r X −X
σx

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is
 σy 
Y −Y =r X −X
σx
where
X and Y are means of X and Y series.

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is
 σy 
Y −Y =r X −X
σx
where
X and Y are means of X and Y series.
σy
r is known as the regression coefficient of Y on X .
σx

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is
 σy 
Y −Y =r X −X
σx
where
X and Y are means of X and Y series.
σy
r is known as the regression coefficient of Y on X .
σx
σy
It is denoted by denoted by byx , i.e byx = r .
σx

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is
 σy 
Y −Y =r X −X
σx
where
X and Y are means of X and Y series.
σy
r is known as the regression coefficient of Y on X .
σx
σy
It is denoted by denoted by byx , i.e byx = r .
σx

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is
 σy 
Y −Y =r X −X
σx
where
X and Y are means of X and Y series.
σy
r is known as the regression coefficient of Y on X .
σx
σy
It is denoted by denoted by byx , i.e byx = r .
σx
 
Y − Y = byx X − X

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Formulae
(i) Regression equation of Y on X is
 σy 
Y −Y =r X −X
σx
where
X and Y are means of X and Y series.
σy
r is known as the regression coefficient of Y on X .
σx
σy
It is denoted by denoted by byx , i.e byx = r .
σx
 
Y − Y = byx X − X

Regression coefficient
P P P
n XY − ( X ) ( Y )
byx =
n X 2 − ( X )2
P P

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is

X = a + bY

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is

X = a + bY

1 Y is indepedent variable

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is

X = a + bY

1 Y is indepedent variable
2 X is depedent variable

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is

X = a + bY

1 Y is indepedent variable
2 X is depedent variable
3 a is intercept

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is

X = a + bY

1 Y is indepedent variable
2 X is depedent variable
3 a is intercept
4 b is slope
5 b is regression coefficient of X on Y . i.e., bxy

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is
(ii) Regression equation of X on Y is

 σx 
X −X =r Y −Y
σy

X and Y are means of X and Y series.

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is
(ii) Regression equation of X on Y is

 σx 
X −X =r Y −Y
σy

X and Y are means of X and Y series.


σx
r is known as the regression coefficient of X on Y .
σy

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is
(ii) Regression equation of X on Y is

 σx 
X −X =r Y −Y
σy

X and Y are means of X and Y series.


σx
r is known as the regression coefficient of X on Y .
σy
σx
It is denoted by denoted by bxy , i.e bxy = r .
σy

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is
(ii) Regression equation of X on Y is

 σx 
X −X =r Y −Y
σy

X and Y are means of X and Y series.


σx
r is known as the regression coefficient of X on Y .
σy
σx
It is denoted by denoted by bxy , i.e bxy = r .
σy


X - X = bxy Y − Y

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Regression Line X on Y is
(ii) Regression equation of X on Y is

 σx 
X −X =r Y −Y
σy

X and Y are means of X and Y series.


σx
r is known as the regression coefficient of X on Y .
σy
σx
It is denoted by denoted by bxy , i.e bxy = r .
σy


X - X = bxy Y − Y

P P P
n XY − ( X ) ( Y )
Regression coefficient bxy =
n Y 2 − ( Y )2
P P

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties


1 The regression lines were passes through the point X,Y

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties


1 The regression lines were passes through the point X,Y

2 Either all byx , bxy , r are Positive or


all byx , bxy , r are Negative.

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties


1 The regression lines were passes through the point X,Y

2 Either all byx , bxy , r are Positive or


all byx , bxy , r are Negative.

3 It is never possible that


byx is positive and bxy is negative and versa.

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties


1 The regression lines were passes through the point X,Y

2 Either all byx , bxy , r are Positive or


all byx , bxy , r are Negative.

3 It is never possible that


byx is positive and bxy is negative and versa.
4 Both bxy and byx cannot be greater than one.

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties

1 Relation between correlation and regression coefficient


between X and Y

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties

1 Relation between correlation and regression coefficient


between X and Y
p
r =± bxy × byx

p
2 If bxy , byx are positive then r = + bxy × byx

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Properties

1 Relation between correlation and regression coefficient


between X and Y
p
r =± bxy × byx

p
2 If bxy , byx are positive then r = + bxy × byx

p
3 If bxy , byx are negative then r = − bxy × byx

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

Problem No. 1
Obtain the two regression lines and correlation coefficient from the
following data

X : 1 2 3 4 5 6 7 8 9
Y : 9 8 10 12 11 13 14 16 15

Also estimate the value of y when x = 6.2.

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

X Y X2 Y2 XY
1 9
2 8
3 10
4 12
5 11
6 13
7 14
8 16
9 15
X2 Y2
P P P P P
X Y XY

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
X Y X2 Y2 XY
1 9 1 81 9
2 8 4 64 16
3 10 9 100 30
4 12 16 144 48
5 11 25 121 55
6 13 36 169 78
7 14 49 196 98
8 16 64 256 128
9 15 81 225 135
P P P 2 P 2 P
X Y X Y XY
45 108 285 1356 597

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
X Y X2 Y2 XY
1 9 1 81 9
2 8 4 64 16
3 10 9 100 30
4 12 16 144 48
5 11 25 121 55
6 13 36 169 78
7 14 49 196 98
8 16 64 256 128
9 15 81 225 135
P P P 2 P 2 P
X Y X Y XY
45 108 285 1356 597
X X X
n = 9, X = 45, Y = 108, X 2 = 285,

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
X Y X2 Y2 XY
1 9 1 81 9
2 8 4 64 16
3 10 9 100 30
4 12 16 144 48
5 11 25 121 55
6 13 36 169 78
7 14 49 196 98
8 16 64 256 128
9 15 81 225 135
P P P 2 P 2 P
X Y X Y XY
45 108 285 1356 597
X X X
n = 9, X = 45, Y = 108, X 2 = 285,
X X
Y 2 = 1356, XY = 597, .
Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists
Problem No. 1
X Y X2 Y2 XY
1 9 1 81 9
2 8 4 64 16
3 10 9 100 30
4 12 16 144 48
5 11 25 121 55
6 13 36 169 78
7 14 49 196 98
8 16 64 256 128
9 15 81 225 135
P P P 2 P 2 P
X Y X Y XY
45 108 285 1356 597
X X X
n = 9, X = 45, Y = 108, X 2 = 285,
X X
Y 2 = 1356, XY = 597, .
Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists
Problem No. 1
P P
X 45 Y 108
X = = = 5, Y = = = 12
n 9 n 9

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
P P
X 45 Y 108
X = = = 5, Y = = = 12
n 9 n 9
(i) Regression equation of Y on X is

Y - Y = byx X − X

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
P P
X 45 Y 108
X = = = 5, Y = = = 12
n 9 n 9
(i) Regression equation of Y on X is

Y - Y = byx X − X

Here

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
P P
X 45 Y 108
X = = = 5, Y = = = 12
n 9 n 9
(i) Regression equation of Y on X is

Y - Y = byx X − X

Here
P P P
n XY − ( X ) ( Y )
byx =
n X 2 − ( X )2
P P

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
P P
X 45 Y 108
X = = = 5, Y = = = 12
n 9 n 9
(i) Regression equation of Y on X is

Y - Y = byx X − X

Here
P P P
n XY − ( X ) ( Y ) 9(597) − (45)(108)
byx = P 2 = = 0.95
9(285) − (45)2
P 2
n X − ( X)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
P P
X 45 Y 108
X = = = 5, Y = = = 12
n 9 n 9
(i) Regression equation of Y on X is

Y - Y = byx X − X

Here
P P P
n XY − ( X ) ( Y ) 9(597) − (45)(108)
byx = P 2 = = 0.95
9(285) − (45)2
P 2
n X − ( X)

Y − 12 = (0.95)(X − 5)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
P P
X 45 Y 108
X = = = 5, Y = = = 12
n 9 n 9
(i) Regression equation of Y on X is

Y - Y = byx X − X

Here
P P P
n XY − ( X ) ( Y ) 9(597) − (45)(108)
byx = P 2 = = 0.95
9(285) − (45)2
P 2
n X − ( X)

Y − 12 = (0.95)(X − 5)
Simplifying we get
Y = 0.95X + 7.25

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

(ii) Regression equation of X on Y is

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

(ii) Regression equation of X on Y is



X - X = bxy Y − Y

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

(ii) Regression equation of X on Y is



X - X = bxy Y − Y

Here

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

(ii) Regression equation of X on Y is



X - X = bxy Y − Y

Here
P P P
n XY − ( X ) ( Y )
bxy =
n Y 2 − ( Y )2
P P

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

(ii) Regression equation of X on Y is



X - X = bxy Y − Y

Here
P P P
n XY − ( X ) ( Y ) 9(597) − (45)(108)
bxy = P 2 = = 0.95
9(1356) − (108)2
P 2
n Y −( Y)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

(ii) Regression equation of X on Y is



X - X = bxy Y − Y

Here
P P P
n XY − ( X ) ( Y ) 9(597) − (45)(108)
bxy = P 2 = = 0.95
9(1356) − (108)2
P 2
n Y −( Y)

X − 5 = 0.95(Y − 12)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1

(ii) Regression equation of X on Y is



X - X = bxy Y − Y

Here
P P P
n XY − ( X ) ( Y ) 9(597) − (45)(108)
bxy = P 2 = = 0.95
9(1356) − (108)2
P 2
n Y −( Y)

X − 5 = 0.95(Y − 12)
Simplifying we get
X = 0.95Y − 6.4

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
(iii) Compute the correlation coefficient between X and Y is r .

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
(iii) Compute the correlation coefficient between X and Y is r .

byx = 0.95, bxy = 0.95

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
(iii) Compute the correlation coefficient between X and Y is r .

byx = 0.95, bxy = 0.95

p
r = ± byx × bxy

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
(iii) Compute the correlation coefficient between X and Y is r .

byx = 0.95, bxy = 0.95

p
r = ± byx × bxy

Since both regression coefficients are positive then the correlation


coefficient is positive.

p
r= byx × bxy

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
(iii) Compute the correlation coefficient between X and Y is r .

byx = 0.95, bxy = 0.95

p
r = ± byx × bxy

Since both regression coefficients are positive then the correlation


coefficient is positive.

p √
r= byx × bxy = 0.95 × 0.95

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 1
(iii) Compute the correlation coefficient between X and Y is r .

byx = 0.95, bxy = 0.95

p
r = ± byx × bxy

Since both regression coefficients are positive then the correlation


coefficient is positive.

p √
r= byx × bxy = 0.95 × 0.95 = 0.95

When X = 6.2
Y = 0.95X + 7.25 = (0.95 × 6.2) + 7.25 = 13.14.
Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists
Simple Shortcut

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Simple Shortcut

For two given lines


a1 x + b1 y + c1 = 0 and a2 x + b2 y + c2 = 0

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Simple Shortcut

For two given lines


a1 x + b1 y + c1 = 0 and a2 x + b2 y + c2 = 0
coefficient of x
byx = −
coefficient of y

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Simple Shortcut

For two given lines


a1 x + b1 y + c1 = 0 and a2 x + b2 y + c2 = 0
coefficient of x
byx = −
coefficient of y

coefficient of y
bxy = −
coefficient of x

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Simple Shortcut

For two given lines


a1 x + b1 y + c1 = 0 and a2 x + b2 y + c2 = 0
coefficient of x
byx = −
coefficient of y

coefficient of y
bxy = −
coefficient of x

r 2 = bxy · byx ≤ 1
Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists
Simple Shortcut

For two given lines


a1 x + b1 y + c1 = 0 and a2 x + b2 y + c2 = 0
coefficient of x
byx = −
coefficient of y

coefficient of y
bxy = −
coefficient of x

r 2 = bxy · byx ≤ 1
Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists
Problem No. 3
Problem No. 3
In a partially destroyed laboratory record of an analysis of
correlation data, following result only legible. Variance of X = 9.
The regression equations are 8X − 10Y + 66 = 0 and
40X − 18Y = 214. What are
(i) The means value of X and Y
(ii) the correlation coefficient bet X and Y
(iii) the S.D of Y .

Solution:

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
Problem No. 3
In a partially destroyed laboratory record of an analysis of
correlation data, following result only legible. Variance of X = 9.
The regression equations are 8X − 10Y + 66 = 0 and
40X − 18Y = 214. What are
(i) The means value of X and Y
(ii) the correlation coefficient bet X and Y
(iii) the S.D of Y .

Solution: (i) To find the means value of X and Y

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
Problem No. 3
In a partially destroyed laboratory record of an analysis of
correlation data, following result only legible. Variance of X = 9.
The regression equations are 8X − 10Y + 66 = 0 and
40X − 18Y = 214. What are
(i) The means value of X and Y
(ii) the correlation coefficient bet X and Y
(iii) the S.D of Y .

Solution: (i) To find the means value of X and Y


The equations of the regression lines are

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
Problem No. 3
In a partially destroyed laboratory record of an analysis of
correlation data, following result only legible. Variance of X = 9.
The regression equations are 8X − 10Y + 66 = 0 and
40X − 18Y = 214. What are
(i) The means value of X and Y
(ii) the correlation coefficient bet X and Y
(iii) the S.D of Y .

Solution: (i) To find the means value of X and Y


The equations of the regression lines are
8X − 10Y = −66 (1)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
Problem No. 3
In a partially destroyed laboratory record of an analysis of
correlation data, following result only legible. Variance of X = 9.
The regression equations are 8X − 10Y + 66 = 0 and
40X − 18Y = 214. What are
(i) The means value of X and Y
(ii) the correlation coefficient bet X and Y
(iii) the S.D of Y .

Solution: (i) To find the means value of X and Y


The equations of the regression lines are
8X − 10Y = −66 (1)
and

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
Problem No. 3
In a partially destroyed laboratory record of an analysis of
correlation data, following result only legible. Variance of X = 9.
The regression equations are 8X − 10Y + 66 = 0 and
40X − 18Y = 214. What are
(i) The means value of X and Y
(ii) the correlation coefficient bet X and Y
(iii) the S.D of Y .

Solution: (i) To find the means value of X and Y


The equations of the regression lines are
8X − 10Y = −66 (1)
and
40X − 18Y = 214 (2)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 10


Since the regression lines were passes through the point X , Y .
Hence equations (1) and (2) becomes

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 10


Since the regression lines were passes through the point X , Y .
Hence equations (1) and (2) becomes

(1) ⇒ 8X − 10Y = −66

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 10


Since the regression lines were passes through the point X , Y .
Hence equations (1) and (2) becomes

(1) ⇒ 8X − 10Y = −66 (3)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 10


Since the regression lines were passes through the point X , Y .
Hence equations (1) and (2) becomes

(1) ⇒ 8X − 10Y = −66 (3)

(2) ⇒ 40X − 18Y = 214

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 10


Since the regression lines were passes through the point X , Y .
Hence equations (1) and (2) becomes

(1) ⇒ 8X − 10Y = −66 (3)

(2) ⇒ 40X − 18Y = 214 (4)


Solve the equations (3) and (4), we get

X = 13, Y = 17

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(ii) To find the correlation coefficient bet X and Y

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(ii) To find the correlation coefficient bet X and Y
Let us consider equation (1)

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(ii) To find the correlation coefficient bet X and Y
Let us consider equation (1)

(1) ⇒ 8X − 10Y = −66

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(ii) To find the correlation coefficient bet X and Y
Let us consider equation (1)

(1) ⇒ 8X − 10Y = −66

⇒ 10y = 8x + 66

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(ii) To find the correlation coefficient bet X and Y
Let us consider equation (1)

(1) ⇒ 8X − 10Y = −66

⇒ 10y = 8x + 66

8 66
⇒y = x+
10 10
y = 0.8x + 6.6

The regression coefficient of y on x is

8
byx = = 0.8
10
Problem No. 3
(ii) To find the correlation coefficient bet X and Y
Let us consider equation (1)

(1) ⇒ 8X − 10Y = −66

⇒ 10y = 8x + 66

8 66
⇒y = x+
10 10
y = 0.8x + 6.6

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(ii) To find the correlation coefficient bet X and Y
Let us consider equation (1)

(1) ⇒ 8X − 10Y = −66

⇒ 10y = 8x + 66

8 66
⇒y = x+
10 10
y = 0.8x + 6.6

The regression coefficient of y on x is

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(ii) To find the correlation coefficient bet X and Y
Let us consider equation (1)

(1) ⇒ 8X − 10Y = −66

⇒ 10y = 8x + 66

8 66
⇒y = x+
10 10
y = 0.8x + 6.6

The regression coefficient of y on x is

8
byx = = 0.8
10

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(2) can be written as

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(2) can be written as
(2) ⇒ 40X − 18Y = 214

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(2) can be written as
(2) ⇒ 40X − 18Y = 214

⇒ 40x = 18y + 214

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(2) can be written as
(2) ⇒ 40X − 18Y = 214

⇒ 40x = 18y + 214

18 214
⇒x = y+
40 40
x = 0.45x + 5.35

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(2) can be written as
(2) ⇒ 40X − 18Y = 214

⇒ 40x = 18y + 214

18 214
⇒x = y+
40 40
x = 0.45x + 5.35
The regression coefficient of x on y is

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(2) can be written as
(2) ⇒ 40X − 18Y = 214

⇒ 40x = 18y + 214

18 214
⇒x = y+
40 40
x = 0.45x + 5.35
The regression coefficient of x on y is

18
byx = = 0.45
40
The correlation coefficient between X and Y is

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists


Problem No. 3
(2) can be written as
(2) ⇒ 40X − 18Y = 214

⇒ 40x = 18y + 214

18 214
⇒x = y+
40 40
x = 0.45x + 5.35
The regression coefficient of x on y is

18
byx = = 0.45
40
The correlation coefficient between X and Y is
p √
r = byx × bxy = 0.8 × 0.45 = 0.60.
Problem No. 3
(2) can be written as
(2) ⇒ 40X − 18Y = 214

⇒ 40x = 18y + 214

18 214
⇒x = y+
40 40
x = 0.45x + 5.35
The regression coefficient of x on y is

18
byx = = 0.45
40
The correlation coefficient between X and Y is
p √
r = byx × bxy = 0.8 × 0.45 = 0.60.
Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists
Problem No. 3

(iii) To find the S.D of Y .


We know that
σy
byx = r
σx

8 r σy 8
byx = ⇒ =
10 σx 10

r σy 8 0.6 × σy
⇒ = ⇒ = 0.8
σx 10 3

0.8 × 3
⇒ σy = = 4 ⇒ σy = 4
0.6

Dr. E. Suresh 18MAB303T - BioStatistics for Biotechnologists

You might also like