0% found this document useful (0 votes)
8 views52 pages

BIO203 Lecture 11 (Correlation) SHF 2024

The document discusses regression and correlation in biostatistics, focusing on the relationship between two variables, specifically in the context of protein assays. It outlines key questions regarding the mathematical relationship, correlation, and the quality of that correlation, along with the mechanics of determining regression coefficients from data. Additionally, it highlights the use of regression lines for making predictions and the differences between regression and correlation.

Uploaded by

theofix301
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views52 pages

BIO203 Lecture 11 (Correlation) SHF 2024

The document discusses regression and correlation in biostatistics, focusing on the relationship between two variables, specifically in the context of protein assays. It outlines key questions regarding the mathematical relationship, correlation, and the quality of that correlation, along with the mechanics of determining regression coefficients from data. Additionally, it highlights the use of regression lines for making predictions and the differences between regression and correlation.

Uploaded by

theofix301
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 52

BIO203 - Biostatistics: Lecture 12 – Correlation and Regression

I didn’t
break it!

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

ctr drug A

protein extraction

sample A sample B

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

Bradford protein assay


ctr drug A

protein extraction

protein concentration
absorbance @ 595 nm
sample A sample B

Abs: 10 Abs: 15

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

Bradford protein assay


BSA Abs
sample conc. 595 nm

1 1 2

2 2 5

3 3 5

4 4 9

5 5 11

6 6 12

7 7 14

8 8 15

9 9 17
protein concentration
absorbance @ 595 nm 10 10 21

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

Bradford protein assay


BSA Abs
sample conc. 595 nm

1 1 2

2 2 5

3 3 5

4 4 9

5 5 11

6 6 12

7 7 14

8 8 15

9 9 17
protein concentration
absorbance @ 595 nm 10 10 21

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

Bradford protein assay


BSA Abs
sample conc. 595 nm

1 1 2

2 2 5

3 3 5

4 4 9

5 5 11

6 6 12

7 7 14

8 8 15

9 9 17
protein concentration
absorbance @ 595 nm 10 10 21

test 1 ??? 10
test 2 ??? 15

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

Bradford protein assay


BSA Abs
sample conc. 595 nm

1 1 2

2 2 5

3 3 5

4 4 9

5 5 11

6 6 12

7 7 14

8 8 15

9 9 17
protein concentration
absorbance @ 595 nm 10 10 21

test 1 ??? 10
test 2 ??? 15

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

3 questions that can be asked

1. what is the mathematical relationship between the variables?

2. is there a correlation (= association) between the variables ?

3. how good is the correlation between the variables ?

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

3 questions that can be asked

1. what is the mathematical relationship between the variables?


→ regression line (y = a + bx)
→ prediction of values for the Y variable

2. is there a correlation (= association) between the variables ?


→ regression coefficient r

3. how good is the correlation between the variables ?


→ coefficient of determination r2
→ p-value (difference from slope 0)

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables
dependent variable

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

regression line
→ mathematical relationship between x and y
dependent variable

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

line: y = a + b x
dependent variable

regression coefficients: a, b

slope (b)

y intercept (a)

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

line: y = a + b x
dependent variable

regression coefficients: a, b

y 1x

slope (b) = how many y for 1 x?

y intercept (a)

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

which one is the correct regression line ?

dependent variable

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

which one is the correct regression line ?

dependent variable

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

how to determine the regression coefficients from the measured data points ?

dependent variable

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

line = a + b x
dependent variable

regression coefficients: a, b

slope (b)
y intercept (a)

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

meanx = 5.5

line = a + b x
dependent variable

regression coefficients: a, b

y
meany = 11.1

slope (b)
y intercept (a)

independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: mechanics of the procedure

how to determine the regression coefficients from the measured data points ?

sample x y
1 1 2
2 2 5
3 3 5
4 4 9
5 5 11
6 6 12
7 7 14 y
8 8 15
9 9 17
10 10 21

Σ 55 111

mean 5.5 11.1


x

σ 𝒙−𝒙ഥ (𝒚 − 𝒚 ഥ)
𝒃= ഥ − 𝒃ഥ
𝒂= 𝒚 𝒙
ഥ )𝟐
σ(𝒙 − 𝒙

BIO203.01 - Biostatistics shf - 2024


Regression: mechanics of the procedure

how to determine the regression coefficients from the measured data points ?

sample x y ഥ
𝒙 − 𝒙 ഥ
𝒚 − 𝒚 produc t ഥ)𝟐
(𝒙 − 𝒙

1 1 2 -4.5 -9.1 40.95 20.25


2 2 5 -3.5 -6.1 21.35 12.25
3 3 5 -2.5 -6.1 15.25 6.25
4 4 9 -1.5 -2.1 3.15 2.25
5 5 11 -0.5 -0.1 0.05 0.25
6 6 12 0.5 0.9 0.45 0.25
7 7 14 1.5 2.9 4.35 2.25 y
8 8 15 2.5 3.9 9.75 6.26
9 9 17 3.5 5.9 20.65 12.25
10 10 21 4.5 9.9 44.55 20.25

Σ 55 111 0 0 160.5 / 82.5

mean 5.5 11.1 b= 1.945


x

σ 𝒙−𝒙ഥ (𝒚 − 𝒚 ഥ)
𝒃= = 𝟏. 𝟗𝟒𝟓 ഥ − 𝒃ഥ
𝒂= 𝒚 𝒙 = 𝟎. 𝟒
ഥ )𝟐
σ(𝒙 − 𝒙

BIO203.01 - Biostatistics shf - 2024


Regression: mechanics of the procedure

how to determine the regression coefficients from the measured data points ?

sample x y ഥ
𝒙 − 𝒙 ഥ
𝒚 − 𝒚 produc t ഥ)𝟐
(𝒙 − 𝒙 y’
1 1 2 -4.5 -9.1 40.95 20.25 2.3
2 2 5 -3.5 -6.1 21.35 12.25 4.3
3 3 5 -2.5 -6.1 15.25 6.25 6.2
4 4 9 -1.5 -2.1 3.15 2.25 8.2
5 5 11 -0.5 -0.1 0.05 0.25 10.1
6 6 12 0.5 0.9 0.45 0.25 12.1
7 7 14 1.5 2.9 4.35 2.25 14.0 y
8 8 15 2.5 3.9 9.75 6.26 16.0
9 9 17 3.5 5.9 20.65 12.25 17.9
10 10 21 4.5 9.9 44.55 20.25 19.9

Σ 55 111 0 0 160.5 / 82.5 111

mean 5.5 11.1 b= 1.945 11.1


x

σ 𝒙−𝒙ഥ (𝒚 − 𝒚 ഥ)
𝒃= = 𝟏. 𝟗𝟒𝟓 ഥ − 𝒃ഥ
𝒂= 𝒚 𝒙 = 𝟎. 𝟒 𝒚′ = 𝟏. 𝟗𝟒𝟓𝒙 + 𝟎. 𝟒
ഥ )𝟐
σ(𝒙 − 𝒙

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

the regression line allows you to make predictions

𝒚′ = 𝟏. 𝟗𝟒𝟓𝒙 + 𝟎. 𝟒

predicted absorbance for BSA concentration of 6.5

𝑨𝒃𝒔 = 𝟏. 𝟗𝟒𝟓 𝒙 𝟔. 𝟓 + 𝟎. 𝟒 = 𝟏𝟑. 𝟎𝟒

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

the regression line allows you to make predictions

𝒚′ = 𝟏. 𝟗𝟒𝟓𝒙 + 𝟎. 𝟒

𝒚 − 𝟎. 𝟒
𝒙′ =
𝟏. 𝟗𝟒𝟓

sample with unknown BSA concentration

Abs: 15

𝟏𝟓 − 𝟎. 𝟒
𝑩𝑺𝑨 = = 𝟕. 𝟓
𝟏. 𝟗𝟒𝟓

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

the regression line allows you to make predictions

extrapolation interpolation extrapolation

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

the regression line allows you to make predictions

regression on x regression on y

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

the regression line allows you to make predictions

regression on y

regression on x

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

the regression line allows you to make predictions (but be careful !!!)

regression on y regression on y

regression on x
regression on x

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

the regression line allows you to make predictions (but be careful !!!)

regression on y regression on y

regression on x
regression on x

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

BIO203.01 - Biostatistics shf - 2024


Regression: describes the relationship between two variables

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

regression - vs - correlation
dependent variable

dependent variable
measured
independent variable in response independent variable

random
sampling pre-determined
chosen set

the math / statistics is identical

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

two relevant measures

goodness of fit
of the regression line
dependent variable

→ correlation coefficient r
→ coefficient of determination r2
y

deviation from slope = 0


→ p-value

x
independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

H0: no relationship between variables – H1: not H0


p-value - deviation from slope = 0

deviation from slope = 0 goodness of fit of the regression line


dependent variable

dependent variable

independent variable independent variable

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

sample x y ഥ
𝒚 − 𝒚 ഥ)𝟐
(𝒚 − 𝒚

1 1 2 -8 64
2 3 9 -1 1
3 5 9 -1 1
4 6 11 1 1
5 7 14 4 16
6 9 15 5 25

Σ 31 60 0 108
total
sum of squares
mean 5.2 10

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

sample x y ഥ
𝒚 − 𝒚 ഥ)𝟐
(𝒚 − 𝒚

1 1 2 -8 64
2 3 9 -1 1
3 5 9 -1 1
4 6 11 1 1
5 7 14 4 16
6 9 15 5 25

Σ 31 60 0 108
total
sum of squares
mean 5.2 10

sample x y 𝒚′ 𝒚 − 𝒚′ (𝒚 − 𝒚′)𝟐

1 1 2 3.57 -1.57 2.46


2 3 9 6.66 2.34 5.48
3 5 9 9.74 -0.74 0.55
4 6 11 11.29 -0.29 0.08
5 7 14 12.83 1.17 1.37
6 9 15 15.91 -0.91 0.83

Σ 31 60 60 0 10.8
residuals
mean 5.2 10 10 sum of squares

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

sample x y ഥ
𝒚 − 𝒚 ഥ)𝟐
(𝒚 − 𝒚

1 1 2 -8 64
2 3 9 -1 1
3 5 9 -1 1
4 6 11 1 1
5 7 14 4 16
6 9 15 5 25

Σ 31 60 0 108
total
r2 = 0.9
sum of squares
mean 5.2 10

sample x y 𝒚′ 𝒚 − 𝒚′ (𝒚 − 𝒚′)𝟐

1 1 2 3.57 -1.57 2.46


2 3 9 6.66 2.34 5.48
3 5 9 9.74 -0.74 0.55
4 6 11 11.29 -0.29 0.08
5 7 14 12.83 1.17 1.37
6 9 15 15.91 -0.91 0.83

Σ 31 60 60 0 10.8
residuals
mean 5.2 10 10 sum of squares

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

r2 = coefficient of determination
→ a measure of how much the X variable explains the variation of the Y variable
→ range: 0 - 1

𝑺𝑺𝒓𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔 𝑺𝑺𝒓𝒆𝒈𝒓𝒆𝒔𝒔𝒊𝒐𝒏
𝒓𝟐 = 𝟏 − =
𝑺𝑺𝒕𝒐𝒕𝒂𝒍) 𝑺𝑺𝒕𝒐𝒕𝒂𝒍)

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

r2 = coefficient of determination
→ a measure of how much the X variable explains the variation of the Y variable
→ range: 0 - 1

𝑺𝑺𝒓𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔 𝑺𝑺𝒓𝒆𝒈𝒓𝒆𝒔𝒔𝒊𝒐𝒏
𝒓𝟐 = 𝟏 − =
𝑺𝑺𝒕𝒐𝒕𝒂𝒍) 𝑺𝑺𝒕𝒐𝒕𝒂𝒍)

r = correlation coefficient (Pearson product moment correlation coefficient)

→ strength of the association between two variables


→ negative for negative slope, positive for positive slope of the regression line
→ range: -1 (anticorrelated) and +1 (positively correlated)

σ 𝒙−𝒙
ഥ (𝒚 − 𝒚
ഥ)
𝒓=
σ 𝒙−𝒙
ഥ 𝟐 ഥ
𝒚−𝒚 𝟐

BIO203.01 - Biostatistics shf - 2024


Regression / Correlation: another look

total residuals
sample x y ഥ
𝒚 − 𝒚 ഥ )𝟐
(𝒚 − 𝒚 𝒚′ 𝒚 − 𝒚′ (𝒚 − 𝒚′)𝟐

1 1 2 -9.1 40.95 2.35 -0.35 0.12


2 2 5 -6.1 21.35 4.29 0.71 0.50
3 3 5 -6.1 15.25 6.24 -1.24 1.53
4 4 9 -2.1 3.15 8.18 0.82 0.67
5 5 11 -0.1 0.05 10.13 0.87 0.76
6 6 12 0.9 0.45 12.07 -0.07 0.01
7 7 14 2.9 4.35 14.02 -0.02 0.00
8 8 15 3.9 9.75 15.96 -0.96 0.93
9 9 17 5.9 20.65 17.91 -0.91 0.83
10 10 21 9.9 44.55 19.85 1.15 1.31

Σ 55 111 0 318.9 111 0 6.65

mean 5.5 11.1 11.1

𝑺𝑺𝒓𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔 𝟔. 𝟔𝟓
𝒓𝟐 = 𝟏 − =𝟏− = 𝟎. 𝟗𝟕𝟗
𝑺𝑺𝒕𝒐𝒕𝒂𝒍) 𝟑𝟏𝟖. 𝟗

BIO203.01 - Biostatistics shf - 2024


Regression / Correlation: another look

total residuals regression


sample x y ഥ
𝒚 − 𝒚 ഥ )𝟐
(𝒚 − 𝒚 𝒚′ 𝒚 − 𝒚′ (𝒚 − 𝒚′)𝟐 ഥ
𝒚′ − 𝒚 ഥ )𝟐
(𝒚′ − 𝒚

1 1 2 -9.1 40.95 2.35 -0.35 0.12 -8.75 76.64


2 2 5 -6.1 21.35 4.29 0.71 0.50 -6.81 46.36
3 3 5 -6.1 15.25 6.24 -1.24 1.53 -4.86 23.65
4 4 9 -2.1 3.15 8.18 0.82 0.67 -2.92 8.25
5 5 11 -0.1 0.05 10.13 0.87 0.76 -0.97 0.95
6 6 12 0.9 0.45 12.07 -0.07 0.01 0.97 0.95
7 7 14 2.9 4.35 14.02 -0.02 0.00 2.92 8.52
8 8 15 3.9 9.75 15.96 -0.96 0.93 4.86 23.65
9 9 17 5.9 20.65 17.91 -0.91 0.83 6.81 46.36
10 10 21 9.9 44.55 19.85 1.15 1.31 8.75 76.64

Σ 55 111 0 318.9 111 0 6.65 0 312.2

mean 5.5 11.1 11.1

𝑺𝑺𝒓𝒆𝒔𝒊𝒅𝒖𝒂𝒍𝒔 𝟔. 𝟔𝟓 𝑺𝑺𝒓𝒆𝒈𝒓𝒆𝒔𝒔𝒊𝒐𝒏 𝟑𝟏𝟐. 𝟐


𝒓𝟐 = 𝟏 − =𝟏− = 𝟎. 𝟗𝟕𝟗 𝒓𝟐 = = = 𝟎. 𝟗𝟕𝟗
𝑺𝑺𝒕𝒐𝒕𝒂𝒍) 𝟑𝟏𝟖. 𝟗 𝑺𝑺𝒕𝒐𝒕𝒂𝒍) 𝟑𝟏𝟖. 𝟗

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

calculate the p value for H0: slope of regression line = 0

(𝒏 − 𝟐) ∙ 𝒓𝟐
𝒕=
𝟏 − 𝒓𝟐

𝟏𝟎 − 𝟐 ∙ 𝟎. 𝟗𝟕𝟗𝟏
𝒕= = 𝟏𝟗. 𝟒
𝟏 − 𝟎. 𝟗𝟕𝟗𝟏

𝒑 = 𝟓. 𝟐𝟑 𝟏𝟎−𝟖

𝒅𝒇 = # 𝒐𝒇 𝒑𝒂𝒊𝒓𝒔 − 𝟐

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

BIO203.01 - Biostatistics shf - 2024


Regression / Correlation: another look

total residuals regression


sample x y ഥ
𝒚 − 𝒚 ഥ )𝟐
(𝒚 − 𝒚 𝒚′ 𝒚 − 𝒚′ (𝒚 − 𝒚′)𝟐 ഥ
𝒚′ − 𝒚 ഥ )𝟐
(𝒚′ − 𝒚

1 1 2 -9.1 40.95 2.35 -0.35 0.12 -8.75 76.64


2 2 5 -6.1 21.35 4.29 0.71 0.50 -6.81 46.36
3 3 5 -6.1 15.25 6.24 -1.24 1.53 -4.86 23.65
4 4 9 -2.1 3.15 8.18 0.82 0.67 -2.92 8.25
5 5 11 -0.1 0.05 10.13 0.87 0.76 -0.97 0.95
6 6 12 0.9 0.45 12.07 -0.07 0.01 0.97 0.95
7 7 14 2.9 4.35 14.02 -0.02 0.00 2.92 8.52
8 8 15 3.9 9.75 15.96 -0.96 0.93 4.86 23.65
9 9 17 5.9 20.65 17.91 -0.91 0.83 6.81 46.36
10 10 21 9.9 44.55 19.85 1.15 1.31 8.75 76.64

Σ 55 111 0 318.9 111 0 6.65 0 312.2

mean 5.5 11.1 11.1

sum of squares sum of squares sum of squares


total residuals regression

BIO203.01 - Biostatistics shf - 2024


Regression / Correlation: another look

sum of squares residual sum of squares regression

𝑺𝑺𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍 = ෍(𝒚 − 𝒚′ )𝟐 ഥ)𝟐


𝑺𝑺𝑹𝒆𝒈𝒓𝒆𝒔𝒔𝒊𝒐𝒏 = ෍(𝒚′ − 𝒚

residual variance regression variance

𝒔𝟐𝒚𝒙 =
σ(𝒚−𝒚′ )𝟐
= 𝑴𝑺𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍 ഥ)𝟐
σ(𝒚′ − 𝒚
𝒏−𝟐 𝒔𝟐𝒚𝒙 = = 𝑴𝑺𝑹𝒆𝒈𝒓𝒆𝒔𝒔𝒊𝒐𝒏
𝟏

standard error of estimate

σ(𝒚 − 𝒚′ )𝟐
𝒔𝒚𝒙 = 𝒔𝟐𝒚𝒙 =
𝒏−𝟐

BIO203.01 - Biostatistics shf - 2024


ANOVA: steps - calculation of mean squares

n A B C
1 30 51 46 𝑺𝑺𝑻
2 40 54 54 𝑴𝑺𝑻 = 𝒏𝑻 − 𝟏
3 41 50 43
4 41 56 56
5 42 57 50
6 43 49 32
7 46 45 56
8 48 60 46
9 55 52 56
10 60 54 65

A B C ഥ𝟏 − 𝑿
ഥ ഥ𝒙)𝟐 + 𝒏𝟐 − 𝟏 (𝑿
ഥ𝟐 − 𝑿
ഥ 𝒙ഥ)𝟐 + 𝒏𝟑 − 𝟏 (𝑿
ഥ𝟑 − 𝑿
ഥ 𝒙ഥ)𝟐
𝒏𝟏 − 𝟏 (𝑿
mean 44.6 52.8 50.4 𝑴𝑺𝑩 = 𝒏𝒖𝒎𝒃𝒆𝒓 𝒐𝒇 𝒔𝒂𝒎𝒑𝒍𝒆𝒔 − 𝟏
n 10 10 10
stdev 8.4 4.3 9.1
var 69.8 18.8 83.6 𝒏𝟏 − 𝟏 𝒔𝟐𝟏 + 𝒏𝟐 − 𝟏 𝒔𝟐𝟐 + 𝒏𝟑 − 𝟏 𝒔𝟐𝟑
𝑴𝑺𝑾 = 𝒏𝑻 − 𝒏𝒖𝒎𝒃𝒆𝒓 𝒐𝒇 𝒔𝒂𝒎𝒑𝒍𝒆𝒔

BIO203.01 - Biostatistics shf - 2024


ANOVA: steps - calculation of mean squares

F statistic - describes the relationship between variances

larger s2
𝑭= smaller s2

MSB
𝑭= MSW
density

177.7
𝑭= 57.4
= 3.10

p value
𝑴𝑺𝑻 = 65.7
𝑴𝑺𝑩 = 177.7
value of F
𝑴𝑺𝑾 = 57.4

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

total residuals regression


sample x y ഥ
𝒚 − 𝒚 ഥ )𝟐
(𝒚 − 𝒚 𝒚′ 𝒚 − 𝒚′ (𝒚 − 𝒚′)𝟐 ഥ
𝒚′ − 𝒚 ഥ )𝟐
(𝒚′ − 𝒚

1 1 2 -9.1 40.95 2.35 -0.35 0.12 -8.75 76.64


2 2 5 -6.1 21.35 4.29 0.71 0.50 -6.81 46.36
3 3 5 -6.1 15.25 6.24 -1.24 1.53 -4.86 23.65
4 4 9 -2.1 3.15 8.18 0.82 0.67 -2.92 8.25
5 5 11 -0.1 0.05 10.13 0.87 0.76 -0.97 0.95
6 6 12 0.9 0.45 12.07 -0.07 0.01 0.97 0.95
7 7 14 2.9 4.35 14.02 -0.02 0.00 2.92 8.52
8 8 15 3.9 9.75 15.96 -0.96 0.93 4.86 23.65
9 9 17 5.9 20.65 17.91 -0.91 0.83 6.81 46.36
10 10 21 9.9 44.55 19.85 1.15 1.31 8.75 76.64

Σ 55 111 0 318.9 111 0 6.65 0 312.2

mean 5.5 11.1 11.1

MSreg / MSres = F → p-value


SSreg / SStotal = r2

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

how to determine the regression coefficients from the measured data points ?

regression variance
ഥ )𝟐
σ(𝒚 − 𝒚 𝑴𝑺𝑹𝒆𝒈𝒓𝒆𝒔𝒔𝒊𝒐𝒏 𝟑𝟏𝟐. 𝟐
𝑴𝑺𝑹𝒆𝒈𝒓𝒆𝒔𝒔𝒊𝒐𝒏 = 𝑭= = = 𝟑𝟕𝟓. 𝟓
𝟏 𝑴𝑺𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍 (𝟔. 𝟔𝟓/𝟖)

𝒅𝒇 = 𝟏, 𝟖
residual variance
σ(𝒚 − 𝒚′ )𝟐 𝒑 = 𝟓. 𝟐𝟑 𝟏𝟎−𝟖
𝑴𝑺𝑹𝒆𝒔𝒊𝒅𝒖𝒂𝒍 =
𝒏−𝟐

BIO203.01 - Biostatistics shf - 2024


Regression: and correlation

BIO203.01 - Biostatistics shf - 2024

You might also like