Quantative Methods
Quantative Methods
Model Misspecification 11
Time-Series Analysis 23
Machine Learning 32
Review 50
M.M134813896.
This document should be used in conjunction with the corresponding learning modules in the 2024 Level 2 CFA® Program
curriculum. Some of the graphs, charts, tables, examples, and figures are copyright 2023, CFA Institute. Reproduced and
republished with permission from CFA Institute. All rights reserved.
Required disclaimer: CFA Institute does not endorse, promote, or warrant accuracy or quality of the products or services
offered by MarkMeldrum.com. CFA Institute, CFA®, and Chartered Financial Analyst® are trademarks owned by CFA
Institute.
1
Last Revised: 07/25/2023
b. formulate a multiple linear regression model, describe the relation between the
dependent variable and several independent variables, and interpret estimated
regression coefficients
M.M134813896.
2
Last Revised: 07/25/2023
- model:
𝐘𝐢 = 𝐛𝟎 + 𝐛𝟏 𝐗 𝟏𝐢 + 𝐛𝟐 𝐗 𝟐𝐢 + ⋯ + 𝐛𝐊 𝐗 𝐊 𝐢 + 𝛆𝐢 𝐢 = 1 ➞ n
deterministic part n > 𝐤
intercept
𝐤 IVs or slope Stochastic
coefficients part
- partial slope coefficients
%𝐛 ➞ estimated
* Describe the types of investment problems addressed by multiple linear regression and the regression process
Page 2/
partial slope coefficient: measures ∆DV for a 1 unit ∆IV holding
all other IVs constant
e.g./ RET = .0023 - 5.0585 BY - 2.1901 CS
* Formulate a multiple linear regression model, describe the relation between the dependent variable and several independent variables, and interpret estimated regression coefficients
3
Last Revised: 07/25/2023
Page 3/
Assumptions/
3/ Independence of errors - the observations are independent of
one another
∴ regression residuals are uncorrelated across observations
4/ Normality - regression residuals are normally distributed
5/ Independence of IVs
1/ IVs are not random (i.e. - they have a specific value)
2/ no exact linear relationship between 2 or more IVs
Scatterplot matrix (pairs plot)
- uses simple linear regression: DV vs. each IV
+ each IV vs. the other IVs
what to see
don’t want to see
linear relationships
linear relationships
* Explain the assumptions underlying a multiple linear regression model and interpret residual plots indicating potential violations of these assumptions
Page 4/
Scatterplot Matrix
since we can, and will, interpret
slight pos. relationship output ➞ this is not a very useful
step
- any violations will be identified
statistically, not visually
DV
IVs
pos. linear neg. linear
M.M134813896.
almost no ‘apparent’
➞ 𝐛𝐒𝐌𝐁 is sig. in the output however
linear relationship
* Explain the assumptions underlying a multiple linear regression model and interpret residual plots indicating potential violations of these assumptions
4
Last Revised: 07/25/2023
Page 5/
- helps identify outliers
* Explain the assumptions underlying a multiple linear regression model and interpret residual plots indicating potential violations of these assumptions
Page 6/
standardized residuals vs. normal distribution
- outliers
affect outlier
parameter value of 𝛆
values 5%
𝛆𝐢 − 𝛆1
directional 𝛔𝛆
relationship
= misspecified
model
5% Q-Q plot
corr(IV, 𝛆)
outliers
M.M134813896.
-1.65 +1.65
Z score
- normally distributed 𝛆 should
fall on the vertical line
* Explain the assumptions underlying a multiple linear regression model and interpret residual plots indicating potential violations of these assumptions
5
Last Revised: 07/25/2023
a. evaluate how well a multiple regression model explains the dependent variable by
analyzing ANOVA table results and measures of goodness of fit
c. calculate and interpret a predicted value for the dependent variable, given the
estimated regression model and assumed values for the independent variable
M.M134813896.
6
Last Revised: 07/25/2023
$ 𝟐 = 1 - 𝐒𝐒𝐄/𝐧 − 𝐤 − 𝟏
Adjusted 𝐑𝟐 ➞ 𝐑 𝐧−𝟏
= 𝟏 − IJ K (𝟏 − 𝐑𝟐 )L
𝐒𝐒𝐓/𝐧 − 𝟏 𝐧−𝐤−𝟏
* Evaluate how well a multiple regression model explains the dependent variable by analyzing ANOVA table results and measures of goodness of fit
Page 2/
Adjusted 𝐑 ➞ 𝐑 𝟐 $𝟐 = 1 - 𝐒𝐒𝐄/𝐧 − 𝐤 − 𝟏 𝐧−𝟏
= 𝟏 − IJ K (𝟏 − 𝐑𝟐 )L
𝐒𝐒𝐓/𝐧 − 𝟏 𝐧−𝐤−𝟏
FYI 𝐒𝐒𝐄 𝐧−𝟏
× 𝐒𝐒𝐄(𝐧 − 𝟏) 𝐧−𝟏 𝐒𝐒𝐄
𝐧−𝐤−𝟏 𝐒𝐒𝐓
= =- .- .
𝐒𝐒𝐓 𝐧−𝟏 𝐒𝐒𝐓(𝐧 − 𝐤 − 𝟏) 𝐧 − 𝐤 − 𝟏 𝐒𝐒𝐓
×
𝐧−𝟏 𝐒𝐒𝐓
𝐑𝟐 + 𝐒𝐒𝐄D𝐒𝐒𝐓 = 1
M.M134813896.
∴ 𝐒𝐒𝐄D𝐒𝐒𝐓 = 1 - 𝐑𝟐
" 𝟐 : if 𝐤 ≥ 𝟏 , 𝐑
- for 𝐑 " 𝟐 < 𝐑𝟐
" 𝟐 ↑ , else 𝐑
& if coefficient’s |𝐭 − 𝐬𝐭𝐚𝐭| > 𝟏 , 𝐑 "𝟐 ↓
* Evaluate how well a multiple regression model explains the dependent variable by analyzing ANOVA table results and measures of goodness of fit
7
Last Revised: 07/25/2023
Page 3/
application/
𝐒𝐒𝐑 𝟗𝟎. 𝟔𝟐𝟑𝟒
𝐑𝟐 = = = . 𝟔𝟏𝟓𝟓
𝐒𝐒𝐓 𝟏𝟒𝟕. 𝟐𝟒𝟏𝟔
𝟓𝟎 − 𝟏
@ 𝟐 = 𝟏 − IJ
𝐑 K (𝟏 − . 𝟔𝟏𝟓𝟓)L
𝟓𝟎 − 𝟓 − 𝟏
= . 𝟓𝟕𝟏𝟖
$ 𝟐 ↑ with Factor 1, 3, 4
𝐑
$ 𝟐 ↓ with Factor 2 & 5
𝐑
also
insignificant
F1 + F2 : 𝐑
@ 𝟐 ↓ , add F3 : 𝐑
@ 𝟐 ↑ , Add F4 : 𝐑
@ 𝟐 ↑ , Add F5 : 𝐑
@𝟐 ↓
* Evaluate how well a multiple regression model explains the dependent variable by analyzing ANOVA table results and measures of goodness of fit
Page 4/
$𝟐
𝐑 ➞ no intuitive explanation, re: %’age of variance explained
➞ no information on coefficient significance or potential
coefficient bias
➞ not a ‘goodness of fit’ measure
* Evaluate how well a multiple regression model explains the dependent variable by analyzing ANOVA table results and measures of goodness of fit