MFIN 305 - Lecture2
MFIN 305 - Lecture2
Methods of Finance
Further Development of the Classical Linear Regression
Model
1. Generalizing the Simple Model to Multiple
Linear Regression
𝑦𝑡 = 𝛼 + 𝛽𝑥𝑡 + 𝑢𝑡 𝑡 = 1,2, … , 𝑇
• What if the dependent variable is influenced by more than one x?
• Arbitrage pricing theory, for example, is not a single factor model like the CAPM
2.5
1.5
0.5
0
1. The US Yield Curve: Movie
2008-07-22
6
0
1. Generalizing the Simple Model to Multiple
Linear Regression
Generalize the simple model to one with 𝑘 regressors (interdependent
variables)
• i.e., each coefficient measures the average change in 𝑦 per unit change
in one of the 𝑥s holding all other independent variables constant at
their average values
2. The Constant Term
The constant term is represented by a column of ones of length T
1
1
𝑥1 =
⋮
1
• 𝛽1 is still referred to as the intercept
• 𝑘 = number of explanatory variables including the constant term
= number of estimated parameters in the regression equation
2. The Constant Term
Matrix form of 𝑦𝑡 = 𝛽1 + 𝛽2 𝑥2𝑡 + 𝛽3 𝑥3𝑡 + ⋯ + 𝛽𝑘 𝑥𝑘𝑡 + 𝑢𝑡
𝑦 = 𝑋𝛽 + 𝑢
𝛽መ is unbiased
• Since 𝑦 = 𝑋𝛽 + 𝑢:
𝛽መ = 𝑋 ′ 𝑋 −1
𝑋 ′ (𝑋𝛽 + 𝑢)
𝛽መ = 𝛽 + (𝑋 ′ 𝑋)−1 𝑋 ′ 𝑢
5. Derivation of the OLS Standard Error in
the Multiple Regression Context
Thus, it is possible to express the variance of β as:
𝐸 (𝛽መ − 𝛽)(𝛽መ − 𝛽)′
= 𝐸 (𝛽 + 𝑋 ′ 𝑋 −1 𝑋 ′ 𝑢 − 𝛽)(𝛽 + 𝑋 ′ 𝑋 −1 𝑋 ′ 𝑢 − 𝛽)′
(𝑅𝑅𝑆𝑆−𝑈𝑅𝑆𝑆)ൗ
𝑚
= 𝑈𝑅𝑆𝑆 ~𝐹(𝑚, 𝑇 − 𝑘)
ൗ 𝑇−𝑘
• 𝑚 = df of the numerator
• 𝑇 − 𝑘 = df of the denominator
6. Testing Multiple Hypotheses: the 𝐹-test
𝑅𝑅𝑆𝑆 − 𝑈𝑅𝑆𝑆 𝑇 − 𝑘
𝑡𝑒𝑠𝑡 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 = ⨯
𝑈𝑅𝑆𝑆 𝑚
Where:
• RRSS: RSS from restricted regression
• URSS: RSS from unrestricted regression
• 𝑚: number of restrictions
• 𝑇: number of observations
• 𝑘: number of regressors in unrestricted regression including a constant in
the unrestricted regression (or the total number of parameters to be
estimated)
6. Testing Multiple Hypotheses: the 𝐹-test
• Generally 𝑅𝑅𝑆𝑆 ≥ 𝑈𝑅𝑆𝑆
• Intuition: if imposing the restrictions does not result in a much larger
RRSS, then the restrictions are supported by the data
Example
• 𝑦 = 𝛽1 + 𝛽2 𝑥2 + 𝛽3 𝑥3 + 𝛽4 𝑥4 + 𝑢 (unrestricted regression)
𝐻0 : 𝛽3 + 𝛽4 = 1
• The restriction 𝛽3 + 𝛽4 = 1 is under test ⇒ ቊ
𝐻1 : 𝛽3 + 𝛽4 ≠ 1
6. Testing Multiple Hypotheses: the 𝐹-test
• Restricted regression:
𝑦 = 𝛽1 + 𝛽2 𝑥2 + 𝛽3 𝑥3 + 𝛽4 𝑥4 + 𝑢 subject to 𝛽3 + 𝛽4 = 1
• Substitute the restriction back into the regression:
𝛽3 + 𝛽4 = 1 ⇒ 𝛽4 = 1 − 𝛽3
• The restricted regression is given by:
𝑦 = 𝛽1 + 𝛽2 𝑥2 + 𝛽3 𝑥3 + 1 − 𝛽3 𝑥4 + 𝑢
• After some algebra:
𝑦 − 𝑥4 = 𝛽1 + 𝛽2 𝑥2 + 𝛽3 𝑥3 − 𝑥4 + 𝑢
𝑃 = 𝛽1 + 𝛽2 𝑥2 + 𝛽3 𝑄 + 𝑢
7. The Relationship Between the t and the 𝐹-
Distributions
• Any hypothesis that could be tested with a t-test could also have been
tested using an 𝐹-test, but not the other way around
• Single hypothesis: t-test or 𝐹-test
• Multiple hypothesis: 𝐹-test
7. The Relationship Between the t and the 𝐹-
Distributions
Example:
H0 : 𝛽2 = 0.5
H1 : 𝛽2 ≠ 0.5
𝐹-critical
8. Determining the Number of Restrictions,
𝑚
• The number of restrictions is the number of = under the null
hypothesis
𝑦𝑡 − 𝑦ത 2
= 𝑦ො𝑡 − 𝑦ത 2
+ 𝑢ො 𝑡2
𝑡=1 𝑡=1 𝑡=1
𝑇 𝑇 𝑇
2 2 2
𝑦𝑡 − 𝑦ത = 𝑦ො𝑡 − 𝑦ത + 𝑦𝑡 − 𝑦ො𝑡
𝑡=1 𝑡=1 𝑡=1
• The goodness of fit statistic is given by:
𝐸𝑆𝑆 𝑇𝑆𝑆 − 𝑅𝑆𝑆
2
𝑅𝑆𝑆
𝑅 = = =1−
𝑇𝑆𝑆 𝑇𝑆𝑆 𝑇𝑆𝑆
2
10. Goodness of Fit Statistics: 𝑅
Extreme cases
𝑦𝑡 𝑦𝑡
𝑥𝑡 𝑥𝑡
𝑅𝑆𝑆 = 𝑇𝑆𝑆 ⇒ 𝑅2 =0
E𝑆𝑆 = 𝑇𝑆𝑆 ⇒ 𝑅2 = 1 (𝑅𝑆𝑆 = 0)
11. Problems with 𝑅 2 as a Goodness of Fit
Measure
(1) 𝑅2 cannot be compared across models with different 𝑦
(2) 𝑅2 never falls if more variables are added to the regression
Regression 1: 𝑦𝑡 = 𝛽1 + 𝛽2 𝑥2 + 𝛽3 𝑥3 + 𝑢
Regression 2: 𝑦𝑡 = 𝛽1 + 𝛽2 𝑥2 + 𝛽3 𝑥3 + 𝛽4 𝑥4 + 𝑢
𝑅2 regression 2 ≥ 𝑅2 regression 1
(3) 𝑅2 can be as high as 0.9 for times-series regressions
2
12. Adjusted 𝑅
2 𝑇−1
𝑅 =1− (1 − 𝑅2 )
𝑇−𝑘
• 𝑘 ↗ as more regressors are added => T-k decreases
• The net effect depends on whether the increase in R2 resulting from the addition of new
variables outweighs the increase in R2
𝑇−1
• In other words, the term increases with the addition of variables while the term (1-R2)
𝑇−𝑘
decreases
• The net effect on R2 depends on which of the two effects dominates
2
• 𝑅2 has to increase by more than off-setting amount for 𝑅 to increase
2
• If 𝑅 rises, one should include the variable or the group of variables in the regression
2
13. Relationship between 𝐹-statistic and 𝑅
• Unrestricted: 𝑦𝑡 = 𝛽1 + 𝛽2 𝑥2𝑡 + ⋯ + 𝛽𝑘 𝑥𝑘𝑡 + 𝑢 ⇒
𝑈𝑅𝑆𝑆 = 𝑅𝑆𝑆
• Restricted: 𝑦𝑡 = 𝛽1 + 𝑢𝑡
1. Number of restrictions is 𝑚 = 𝑘 − 1
2. For the restricted regression, there is no ESS ⇒ 𝑇𝑆𝑆 = 𝑅𝑅𝑆𝑆