Linear Regression
Abstract
An exploration of linear regression and several methods of deriving its coefficient estimates:
ordinary least squares, the method of moments, and the method of maximum likelihood.
These estimation methods are staples in the tool belt of every quantitative analyst
and warrant a deep mathematical understanding.
y = α + βx (1)
The goal of linear regression is to fit a line of form (1) to our data. In the real world this
line will never fit our data perfectly, so we must introduce an error variable, ϵ, into equation (1):
y = α + βx + ϵ (2)
The methods we will discuss focus on minimizing the error term, yielding the most
accurate linear representation of our data.
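As a concrete illustration of equation (2), here is a minimal sketch that simulates data from the model. The parameter values (α = 1.0, β = 2.5, σ = 0.5), the sample size, and the use of Python/NumPy are assumptions made for this example only.

```python
import numpy as np

# Hypothetical parameters, chosen only for illustration
alpha_true, beta_true = 1.0, 2.5   # intercept and slope
sigma = 0.5                        # standard deviation of the error term
N = 200                            # sample size

rng = np.random.default_rng(0)
x = rng.uniform(0.0, 10.0, size=N)       # predictor values
eps = rng.normal(0.0, sigma, size=N)     # error term, eps ~ N(0, sigma^2)
y = alpha_true + beta_true * x + eps     # equation (2): y = alpha + beta*x + eps
```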
2 Ordinary Least Squares
In ordinary least squares we choose the estimates α̂ and β̂ that minimize the residual sum of
squares, Σᵢ(yᵢ − α̂ − β̂xᵢ)². To minimize this residual sum of squares, we take the first derivatives
with respect to α̂ and β̂ and set them equal to 0 to find the minimizing values.
$$\frac{\partial}{\partial \hat{\alpha}} \sum_{i=1}^{N}(y_i - \hat{\alpha} - \hat{\beta}x_i)^2 = -2\sum_{i=1}^{N}(y_i - \hat{\alpha} - \hat{\beta}x_i) = 0 \qquad (5)$$

$$\frac{\partial}{\partial \hat{\beta}} \sum_{i=1}^{N}(y_i - \hat{\alpha} - \hat{\beta}x_i)^2 = -2\sum_{i=1}^{N}(y_i - \hat{\alpha} - \hat{\beta}x_i)\,x_i = 0 \qquad (6)$$
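To make the first-order conditions (5) and (6) concrete, the sketch below fits a line by least squares (using numpy.polyfit as a stand-in for the minimization, an assumption for illustration) and checks numerically that the fitted residuals sum to zero and are orthogonal to the x values.

```python
import numpy as np

# Hypothetical data for illustration only
rng = np.random.default_rng(1)
x = rng.uniform(0.0, 10.0, size=100)
y = 1.0 + 2.5 * x + rng.normal(0.0, 0.5, size=100)

# Fit y = alpha_hat + beta_hat * x by least squares
# (polyfit returns coefficients from highest degree down)
beta_hat, alpha_hat = np.polyfit(x, y, deg=1)

resid = y - alpha_hat - beta_hat * x

# First-order conditions (5) and (6): both sums vanish at the minimizer
print(np.isclose(resid.sum(), 0.0, atol=1e-6))          # residuals sum to ~0
print(np.isclose((resid * x).sum(), 0.0, atol=1e-6))     # residuals orthogonal to x
```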
Expanding equation (5) and dividing through by N gives ȳ − α̂ − β̂x̄ = 0, where the bar notation
indicates the mean of all y or x data values. Rearranging,
α̂ = ȳ − β̂x̄ (7)
Substituting this into equation (6), and dropping the constant factor of −2 since the right-hand
side is zero:
$$
\begin{aligned}
\sum_{i=1}^{N}(y_i - \hat{\alpha} - \hat{\beta}x_i)\,x_i &= \sum_{i=1}^{N}(y_i - \bar{y} + \hat{\beta}\bar{x} - \hat{\beta}x_i)\,x_i \\
&= \sum_{i=1}^{N}x_i y_i - \bar{y}\sum_{i=1}^{N}x_i + \hat{\beta}\bar{x}\sum_{i=1}^{N}x_i - \hat{\beta}\sum_{i=1}^{N}x_i^2 \\
&= \sum_{i=1}^{N}x_i y_i - N\bar{x}\bar{y} + \hat{\beta}N\bar{x}^2 - \hat{\beta}\sum_{i=1}^{N}x_i^2 = 0
\end{aligned} \qquad (8)
$$
Solving the final expression in (8) for β̂ gives
$$\hat{\beta} = \frac{\sum_{i=1}^{N}x_i y_i - N\bar{x}\bar{y}}{\sum_{i=1}^{N}x_i^2 - N\bar{x}^2} \qquad (9)$$
And there we have it! Equations (7) and (9) give us the values of α̂ and β̂, that is, the
equation of the line which minimizes the residual sum of squares, in terms of simple sums and
means of the x and y data.
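As a sanity check on equations (7) and (9), the following sketch computes α̂ and β̂ directly from the sums and means of simulated data and compares them with NumPy's built-in least-squares fit; the data-generating values are assumptions for the example.

```python
import numpy as np

# Hypothetical data for illustration only
rng = np.random.default_rng(2)
x = rng.uniform(0.0, 10.0, size=500)
y = 1.0 + 2.5 * x + rng.normal(0.0, 0.5, size=500)

N = len(x)
x_bar, y_bar = x.mean(), y.mean()

# Equation (9): beta_hat from sums of products and squares
beta_hat = (np.sum(x * y) - N * x_bar * y_bar) / (np.sum(x**2) - N * x_bar**2)
# Equation (7): alpha_hat from the means
alpha_hat = y_bar - beta_hat * x_bar

# Compare with NumPy's least-squares polynomial fit
beta_np, alpha_np = np.polyfit(x, y, deg=1)
print(alpha_hat, beta_hat)
print(np.allclose([alpha_hat, beta_hat], [alpha_np, beta_np]))
```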
3 Method of Moments
This same result can be achieved by using moments, with one additional condition. In
statistics, a moment is a quantitative summary of a distribution; the first few moments
correspond to the mean, variance, skewness, and kurtosis. In this case, we will be working
with means.
Return to the problem set up in section one:
y = α + βx + ϵ (10)
Our assumption for the method of moments will be that the error is normally distributed
with a mean of 0, i.e.:
ϵ ∼ N(0, σ²) (11)
Given this assumption, the expected value of each error ϵᵢ is 0:
E(ϵᵢ) = 0 (12)
Rearranging equation (10), we see that y − α − βx = ϵ, so by taking the expected value
and substituting equation (12) we see
E(yᵢ − α − βxᵢ) = E(ϵᵢ) = 0 (13)
Since the error is assumed to be independent of x and has mean zero, we also have
E(ϵᵢxᵢ) = 0 (14)
Then, substituting ϵᵢ = yᵢ − α − βxᵢ into equation (14),
E((yᵢ − α − βxᵢ)xᵢ) = 0 (15)
Lastly, notice that since ϵᵢ has mean zero, the expected value of ϵᵢ² is, by definition,
the variance σ².
E(ϵᵢ²) = σ² (16)
Now, by replacing the expected values in equations (13), (15), and (16) with the corresponding
sample means of our data, we get the following. Notice the introduction of hat notation to
distinguish the coefficients estimated from sample moments from the population values.
$$
\begin{aligned}
\frac{1}{N}\sum_{i=1}^{N}(y_i - \hat{\alpha} - \hat{\beta}x_i) &= 0 \\
\frac{1}{N}\sum_{i=1}^{N}x_i\,(y_i - \hat{\alpha} - \hat{\beta}x_i) &= 0 \qquad (17) \\
\frac{1}{N}\sum_{i=1}^{N}\hat{\epsilon}_i^{\,2} &= \hat{\sigma}^2
\end{aligned}
$$
Here ϵ̂ᵢ = yᵢ − α̂ − β̂xᵢ denotes the fitted residual. Notice the similarity between (17) and
equations (5) and (6) derived in the least squares method. By following the results of the
previous section, we see that this special case of normally distributed error yields the same
coefficient estimates as ordinary least squares, although the variance estimate σ̂², which
divides by N rather than N − 1, is biased.
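The sketch below works through the sample moment conditions in (17): the first two conditions form a small linear system in α̂ and β̂ (reproducing the least-squares estimates), and the third gives the biased variance estimate σ̂² that divides by N. The simulated data are again an assumption for illustration.

```python
import numpy as np

# Hypothetical data for illustration only
rng = np.random.default_rng(3)
x = rng.uniform(0.0, 10.0, size=500)
y = 1.0 + 2.5 * x + rng.normal(0.0, 0.5, size=500)
N = len(x)

# First two moment conditions in (17), linear in (alpha_hat, beta_hat):
#   mean(y - a - b*x)     = 0
#   mean(x*(y - a - b*x)) = 0
A = np.array([[1.0, x.mean()],
              [x.mean(), np.mean(x**2)]])
b = np.array([y.mean(), np.mean(x * y)])
alpha_hat, beta_hat = np.linalg.solve(A, b)

# Third condition: biased variance estimate (divides by N, not N - 1)
resid = y - alpha_hat - beta_hat * x
sigma2_hat = np.mean(resid**2)

print(alpha_hat, beta_hat, sigma2_hat)
```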
4 Method of Maximum Likelihood
Finally, the same estimates can be derived by maximum likelihood. Under the assumption that
the errors are independent draws from N(0, σ²), the likelihood of observing our data is the
product of the normal densities of the errors:
$$L(\alpha, \beta, \sigma) = \prod_{i=1}^{N}\frac{1}{\sqrt{2\pi\sigma^2}}\,e^{-\frac{\epsilon_i^2}{2\sigma^2}} = \frac{1}{(2\pi)^{N/2}\,\sigma^{N}}\,e^{-\frac{1}{2\sigma^2}\sum_{i=1}^{N}\epsilon_i^2} = \frac{1}{(2\pi)^{N/2}\,\sigma^{N}}\,e^{-\frac{1}{2\sigma^2}\sum_{i=1}^{N}(y_i - \alpha - \beta x_i)^2} \qquad (18)$$
To maximize this likelihood function with respect to α and β, we want the sum in the exponent,
Σᵢ(yᵢ − α − βxᵢ)², to be as small as possible. To do this we take the partial derivatives of this
sum with respect to α and β and set them equal to 0.
$$
\begin{aligned}
\frac{\partial}{\partial \alpha}\sum_{i=1}^{N}(y_i - \alpha - \beta x_i)^2 &= 0 \\
\frac{\partial}{\partial \beta}\sum_{i=1}^{N}(y_i - \alpha - \beta x_i)^2 &= 0
\end{aligned} \qquad (19)
$$
Notice again the similarity to the least squares equations (5) and (6). By solving (19), we
obtain the same linear regression coefficients α̂ and β̂ as before.
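As a numerical cross-check of the maximum likelihood derivation, the sketch below minimizes the negative log of the likelihood in (18) with scipy.optimize.minimize and confirms that the resulting α̂ and β̂ agree with the least-squares fit. The data, starting values, and choice of optimizer are assumptions for the example.

```python
import numpy as np
from scipy.optimize import minimize

# Hypothetical data for illustration only
rng = np.random.default_rng(4)
x = rng.uniform(0.0, 10.0, size=500)
y = 1.0 + 2.5 * x + rng.normal(0.0, 0.5, size=500)
N = len(x)

def neg_log_likelihood(params):
    """Negative log of the likelihood in (18), parameterised by (alpha, beta, log sigma)."""
    alpha, beta, log_sigma = params
    sigma = np.exp(log_sigma)              # keep sigma positive
    resid = y - alpha - beta * x
    return N * np.log(sigma) + 0.5 * N * np.log(2 * np.pi) + np.sum(resid**2) / (2 * sigma**2)

result = minimize(neg_log_likelihood, x0=np.zeros(3), method="BFGS")
alpha_mle, beta_mle, log_sigma_mle = result.x

# The MLE coefficients coincide with the least-squares solution
beta_ols, alpha_ols = np.polyfit(x, y, deg=1)
print(alpha_mle, beta_mle, np.exp(log_sigma_mle))
print(np.allclose([alpha_mle, beta_mle], [alpha_ols, beta_ols], atol=1e-3))
```

Reparameterising with log σ keeps the standard deviation positive without needing a constrained optimizer.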