Econometrics LL CH 2
Econometrics LL CH 2
Tadele Bayu
April 3, 2023
Contents
1 2.1. The nature of Time Series Data
2 2.2. Stationary and non-stationary stochastic Processes
3 2.3. Trend Stationary and Difference Stationary Stochastic
Processes
2.3.1. Pure random walk (without drift)
2.3.2. Pure random walk with Drift
2.3.3. Deterministic Trend stationary
4 2.4. Integrated Stochastic Process
5 2.5. Tests of Stationarity: The Unit Root Test
Transforming Non-stationary Time Series
6 Spurious regression and Co-integration
7 Co-integration:The Engle and Granger approach
Testing for Cointegration
Long run and Short run results
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 2 / 64
2.1. The nature of Time Series Data
2
A set of random variables indexed by time (such as Yt ) is
called a stochastic process or random process or time series
process.
2
A random or stochastic process is a collection of random vari-
ables ordered in time.
2
Hence, a sequence of random variables is called stochastic
process.
2
A particular realization of the stochastic process is called a
time series.
Realization in time series similar to sample in cross-section.
2
The set of all possible realizations in time series is akin to
population in cross-section.
Sample size in time series is the number of time periods over
which we observe the variable.
Mean :E(Yt ) = µ
Variance :Var (Yt ) = E(Yt − µ)2 = σ 2
Autocovariance function:
CoV (Yt , Yt+k ) = E [(Yt − µ)E(Yt+k − µ)] = γk
General Model
Yt = β1 + β2 t + β3 Yt−1 + Ut
2
Where Ut is a white noise error term and where t is time mea-
sured chronologically.
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 8 / 64
2.3. Trend Stationary and Difference Stationary Stochastic
Processes 2.3.1. Pure random walk (without drift)
Yt = Yt−1 + β1 + Ut
y1 = β + y0 + U1
y2 = β + y1 + U2 = β + β + y0 + U1 + U2
y3 = β + y2 + U3 = β + β + β + y0 + U1 + U2 + U3
..
.
yT = β + β + β + · · · + β + y0 + U1 + U2 + U3 + · · · + UT
T
X T
X T
X
yT == y0 + β+ Ut = y0 + tβ + Ut
t=1 t=1 t=1
∆Yt = Yt − Yt−1 = β1 + Ut
Trend stationary
β1 6= 0 β2 6= 0 β3 = 0
Yt = β1 + β2 t + Ut
β1 6= 0 β2 6= 0 β3 = 1
Yt = β1 + β2 t + β3 Yt−1 + Ut (5)
Yt = β1 + β2 t + Yt−1 + Ut (6)
∆Yt = β1 + β2 t + Ut
2
The random walk model is a specific case of a more gen-
eral class of stochastic processes known as integrated pro-
cesses.
2
The RWM without drift is non-stationary, but its first differ-
ence is stationary.
Hence, it is integrated of order 1,denoted as I(1).
2
If a time series has to be differenced twice (i.e., take the first
difference of the first differences) to make it stationary, we call
such a time series integrated of order 2.
A time series Yt integrated of order d is denoted as Yt ∼ I(d)
2
Example
Suppose we have 88 observations decide whether the cor-
relation coefficient of 0.638 at lag 10 (quarters) is statistically
significant or not at 95% CI?
Solution
r
1
ρ̂k ± 1.96
88
ρ̂k ± 1.96 × (0.1066)
0.638 ± 0.2089 ⇒ (0.4291, 0.8469)
ρ̂10 0.638
Z =p = = 5.98
1/n 0.1066
Group Test
Instead of testing the statistical significance of any individual
autocorrelation coefficient, we can test the joint hypothesis
that all the ρk up to certain lags are simultaneously equal to
zero.
This can be done by using the Q statistic developed by Box
and Pierce, which is defined as
X m
Q=n ρ̂2k
k =1
where n = sample size and m = lag length
In large samples, it is approximately distributed as the chi-
square distribution with m df.
If the computed Q exceeds the critical Q value from the
chi-square distribution , reject the null hypothesis that all the
(true) ρk are zero; at least some of them must be nonzero.
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 32 / 64
2.4. Integrated Stochastic Process
Alternative to Q statistics
∆Yt = δYt−1 + Ut
Dickey and Fuller have shown that under the null hypothesis
that δ = 0 the estimated t value of the coefficient of Yt−1
follows the τ (tau) statistic.
This test is known as Dickey–Fuller (DF) test, in honour of
its discoverers.
David Alan Dickey and Wayne Arthur Fuller (1979)
We noted that a random walk process may have no drift,
or it may have drift, or it may have both deterministic and
stochastic trends.
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 37 / 64
2.5. Tests of Stationarity: The Unit Root Test
Yt is a random walk:
∆Yt = δYt−1 + Ut
∆Yt = β1 + δYt−1 + Ut
∆Yt = β1 + β2 t + δYt−1 + Ut
H0 : δ = 0 H1 : δ < 0
tsline Lgdp
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 42 / 64
2.5. Tests of Stationarity: The Unit Root Test Transforming Non-stationary Time Series
2
If a time series is I(2), it will contain two unit roots, in which case we
will have to difference it twice.
2
If it is I(d), it has to be differenced d times, where dis any integer.
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 43 / 64
2.5. Tests of Stationarity: The Unit Root Test Transforming Non-stationary Time Series
Graph:tsline D.Lgdp
Trend-Stationary Processes
How do you check stationarity using Engel-Granger approach
using Dicky Fuller?
Yt = β0 + β1 t + ut (8)
ubt = Yt − βb0 − βb1 t
ubt is known as a (linearly) detrended time series
The figure tells us that the first differences of real log GDP is
stationary but the residuals from the trend line (uhat) are not.
If a time series is DSP but we treat it as TSP, this is called
under-differencing.
On the other hand, if a time series is TSP but we treat it as
DSP, this is called over-differencing.
If a time series is non-stationary, we can study its behavior
only for the time period under consideration.
Each set of time series data will therefore be for a particular
episode.
As a consequence, it is not possible to generalize it to other
time periods.
Therefore, for the purpose of forecasting, such
(non-stationary) time series may have little practical value.
Spurious regression
Given two independent time series Yt and Xt generated by
random walk processes:
Yt = Yt−1 + uyt and Xt = Xt−1 + uxt (10)
If we then regress:
Yt = β0 + β1 Xt + ut where ut ≡ I(0) (11)
Example
LPCEt = β0 + β1 LDPIt + ut
Where
LPCE = Log of Personal Consumption Expenditure
LDPI = Log of Gross Private Domestic Investment
ubt = LPCEt − βb0 − βb1 LDPIt
Cointegration means that despite being individually nonstationary, a linear combination of two or
more time series can be stationary.
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 51 / 64
Co-integration:The Engle and Granger approach Testing for Cointegration
Steps
If LPCE and LDPI are individually non stationary, there is the
possibility that this regression is spurious.
Check unit root on both LPCE and LDPI
Predict the residual and generate
Regress the first difference of the residual on
the first lag of the residual without a constant.
If the computed τ value is greater than the
given interpolated Dickey- Fuller critical value
LPCE and LDPI are cointegrated.
Estimate the error correction model.
Check diagnostic tests as usual (normality,
serial correlation, heteroscedasticity etc.)
Tadele Bayu (Bahir Dar Univ.) Chapter 2 April 3, 2023 52 / 64
Co-integration:The Engle and Granger approach Testing for Cointegration
Level
First Difference
It seems that both LPCE and LPDI series have a unit root,
or stochastic trend.
Therefore, if we regress LPCE on LPDI, we might get a
spurious regression.
if we interpret the Durbin–Watson on its own, it would
suggest that the error term in this regression suffers from
first-order autocorrelation.
Where:
LPCEt = β0 + β1 LDPIt + β2 t + Ut (16)
Ut = LPCEt − β0 − β1 LDPIt − β2 t (17)