Lecture 2
Lecture 2
Chikumbe, SE
21st June, 2023
Econometrics II
Department of Economics, Kwame Nkrumah University
1 / 35
Recap of last week
• OLS assumptions
• Zero conditional mean
• No serial correlation
• Static models → yt = β 0 + β 1 xt + ut
2 / 35
Roadmap for this week
3 / 35
Lag selection
4 / 35
Putting the p in AR(p)
5 / 35
AR order and the zero conditional mean assumption
6 / 35
Choosing the p in AR(p)
• Three approaches
1. F-statistic approach
7 / 35
F-statistic approach
• Do this until the last lag becomes significant at the 95% confidence level
• Advantage: intuitive
• A test using the t-statistic will incorrectly reject the null 5% of the time
• → when the true value of p is five, this method will estimate p to be six 5% of the time.
8 / 35
Autoregressive Model of % ∆ ZAR/USD
% ∆ZAR/USD t −3 0.023
(0.055)
9 / 35
Bayes Information Criterion (BIC)
SSR (p ) ln(T )
BIC (p ) = ln + (p + 1)
T T
ln(T )
• T → log of the number of observations / number of observations
10 / 35
Bayes Information Criterion (BIC)
SSR (p ) ln(T )
BIC (p ) = ln + (p + 1)
T T
• We are interested in finding p̂, the value of p which minimizes BIC (p ) among possible
choices of p
11 / 35
Bayes Information Criterion (BIC)
12 / 35
Akaike Information Criterion (AIC)
SSR (p ) 2
AIC (p ) = ln + (p + 1)
T T
• ln(T ) is replaced by 2, in the second term → second term is smaller in the AIC than
the BIC
• Still useful, if you are concerned there are too few lags suggested by the BIC
13 / 35
Comparing the 3 approaches
Dependent Variable: % ∆ZAR/USD
AR(1) AR(2) AR(3) • F-Statistic approach → AR(2)
% ∆ZAR/USD t −1 0.307∗∗∗ 0.343∗∗∗ 0.344∗∗∗
(0.051) (0.054) (0.054) • AIC → AR(2)
% ∆ZAR/USD t −2 −0.109∗∗ −0.116∗∗
(0.054) (0.057) • BIC → AR(1)
% ∆ZAR/USD t −3 0.023
(0.055)
14 / 35
Autoregressive Distributed Lag (ARDL)
models
15 / 35
Types of time series models
• Static models
• yt = β 0 + β 1 xt + ut
16 / 35
ARDL models
yt = β 0 + β 1 yt −1 + β 2 yt −2 + σ1 xt −1 + σ2 xt −2 + ut
17 / 35
ARDL models
yt = β 0 + β 1 yt −1 + β 2 yt −2 + σ1 xt −1 + σ2 xt −2 + ut
• Why? In time series, the focus is often on forecasting, so values in time t are never known
18 / 35
ARDL example: ARDL Model of the %∆ ZAR/USD
%∆ ZAR/USDt −2 −0.132∗
(0.079)
%∆ ALSIt −2 −0.072
(0.070)
R2 0.155 0.175
19 / 35
ARDL models and lag selection
SSR (k ) ln(T ) SSR (k ) 2
BIC (k ) = ln + (k + 1) AIC (k ) = ln + (k + 1)
T T T T
• k = p+q
• k = 4, with p = 2 and q = 2
20 / 35
ARDL lag selection example: ARDL Model of the %∆ ZAR/USD
ARDL(1,1) ARDL(2,2)
%∆ ZAR/USDt −1 0.211∗∗∗ 0.221∗∗∗
(0.074) (0.081)
%∆ ZAR/USDt −2 −0.132∗
(0.079)
%∆ ALSIt −2 −0.072
(0.070)
R2 0.155 0.175
AIC -6.681 -6.676
BIC -6.624 -6.582
21 / 35
Forecasting
22 / 35
Forecasting is a core part of time series econometrics
• Given dependence in time series data, AR models are the workhorse model for
forecasting
23 / 35
Forecasting is a core part of time series econometrics
24 / 35
Forecasting is a core part of time series econometrics
• Time series models can be used to forecast, even if none of the coefficients have a
causal interpretation
25 / 35
Forecast Error
• We want to forecast yt +1 based off an AR(1) model
• yt +1 = β 0 + β 1 yt
• Since t + 1 is into the future, the true values of β 0 and β 1 are unknown
• ŷt +1|t = β̂ 0 + β̂ 1 yt
26 / 35
A forecast is not a predicted value
• Predicted values are calculated for for observations in the sample used to estimate the
regression
• Forecasts are made for a date that exists outside of the sample
27 / 35
The forecast error is not an OLS residual
• From lecture 2: the residual of a regression is the difference the actual value for y and
the predicted value for y , ŷ for observations in the sample
• The forecast error is the difference between the future value of y, which is not
contained in the sample, and the forecast of that future value
28 / 35
Forecasting with an AR(1) model
• %∆ERt = −0.023
29 / 35
Forecasting with an AR(1) model
30 / 35
Forecast uncertainty
• We’ll introduce the Root Mean Squared Forecast Error (RMSFE) that incorporates
both (1) and (2)
31 / 35
Root Mean Squared Forecast Error (RMSFE)
MSFE = E yt +1 − ŷt +1|t
= σu2 + var ( βˆ0 − β 0 ) + ( βˆ1 − β 1 )yt + ( βˆ2 − β 2 )xt
√
• The RMSFE is then simply MSFE
32 / 35
Root Mean Squared Forecast Error (RMSFE)
• We’ll introduce the Root Mean Squared Forecast Error (RMSFE) that incorporates
both (1) and (2)
33 / 35
Forecast interval
• Confidence interval is justified by CLT and therefore holds for a wide range of
distributions of ut
1 If
this sounds a bit tricky, revisit your Chapter 4 lectures with Safia to refresh your grasp of confidence
intervals, hypothesis tests and the central limit theorem
34 / 35
Forecasting interest rates
•
Figure: Fan chart of interest rate forecast from the SARB
35 / 35