Baum 2013
Baum 2013
Christopher F Baum
1 / 44
2 / 44
3 / 44
4 / 44
X
k =0
(k d)Lk
(d)(k + 1)
5 / 44
6 / 44
7 / 44
8 / 44
9 / 44
k
k
X
X
1
Max1k n
Qn =
xj xn Min1k n
xj xn
sn
j=1
j=1
10 / 44
The first bracketed term is the maximum of the partial sums of the first
k deviations of xj from the full-sample mean, which is nonnegative.
The second bracketed term is the corresponding minimum, which is
nonpositive. The difference of these two quantities is thus nonnegative,
so that Qn > 0. Empirical studies have demonstrated that the R/S
statistic has the ability to detect long-range dependence in the data.
11 / 44
12 / 44
13 / 44
The Stata command gphudak (Baum and Wiggins, Stata Tech. Bull.,
2000) performs the Geweke and Porter-Hudak (JTSA, 1983)
semiparametric log periodogram regression, often described as the
GPH test, for long memory (fractional integration) in a timeseries.
The GPH method uses nonparametric methodsa spectral regression
estimatorto evaluate d without explicit specification of the short
memory (ARMA) parameters of the series. The series is usually
differenced so that the resulting d estimate will fall in the [-0.5, 0.5]
interval.
14 / 44
15 / 44
X
e
s = n , s = 1, ..., m, m < n . We define x (s ) =
t
t=1
2n
as the discrete Fourier transform (dft) of the timeseries Xt ,
i
s
Ix (s ) = x (s ) x (s ) as the periodogram, and xs = log 1 e .
Ordinary least squares on (7) yields
Pm
s=1 xs log Ix (s )
d = 0.5
.
Pm
2
s=1 xs
16 / 44
17 / 44
The Stata routine modlpr (Baum and Wiggins, Stata Tech. Bull.,
2000) computes a modified form of the GPH estimate of the long
memory parameter, d, of a timeseries, proposed by Phillips (Cowles,
1999a, 1999b). Phillips (1999a) points out that the prior literature on
this semiparametric approach does not address the case of d = 1, or a
unit root, in (6), despite the broad interest in determining whether a
series exhibits unit-root behavior or long memory behavior, and his
work showing that the d estimate of (7) is inconsistent when d > 1,
with d exhibiting asymptotic bias toward unity.
18 / 44
19 / 44
x (s ) =
i
i
s
s
1e
1e
2n
and the modified dft as
Xn
eis
x (s ) = x (s ) +
1 eis 2n
with associated periodogram ordinates Iv (s ) = x (s ) x (s )
(1999b, p.9). He notes that both x (s ) and, thus, Iv (s ) are
observable functions of the data.
20 / 44
The log-periodogram
regression is now
the regression of log Iv (s ) on
P
= m1 m
.
d = 0.5
Pm
2
s=1 xs
21 / 44
m d d d N 0,
,
24
so that d has the same limiting distribution at d = 1 as does the GPH
estimator in the stationary case so that d is consistent for values of d
around unity. A semiparametric test statistic for a unit root against a
fractional alternative is then based upon the statistic (1999a, p.10):
m d 1
zd =
/ 24
with critical values from the standard normal distribution. This test is
consistent against both d < 1 and d > 1 fractional alternatives.
Christopher F Baum (BC / DIW)
22 / 44
The Stata routine roblpr (Baum and Wiggins, Stata Tech. Bull.,
2000) computes the Robinson (Ann. Stat.,1995) multivariate
semiparametric estimate of the long memory (fractional integration)
parameters, d(g), of a set of G timeseries, y (g), g = 1, G with G 1.
When applied to a set of timeseries, the d(g) parameter for each
series is estimated from a single log-periodogram regression which
allows the intercept and slope to differ for each series.
23 / 44
24 / 44
25 / 44
26 / 44
d
where Z = (Z1 , ...Zm )0 , Zk = (1, 2 log k )0 , Y = (Y1 , ...YG ) , and
0
Yg = Yg,1 , ...Yg,m for m periodogram ordinates.
27 / 44
Standard errors for dg and for a test of the restriction that two or more
of the dg are equal may be derived from the estimated covariance
matrix of the least squares coefficients. The standard errors for the
estimated parameters are derived from a pooled estimate of the
variance in the multivariate case, so that their interval estimates differ
from those of their univariate counterparts. Modifications to this
derivation when the frequency-averaging (j) or omission of initial
frequencies (l) options are selected may be found in Robinson (1995).
28 / 44
29 / 44
30 / 44
Applications
.781
N = 124
31 / 44
Applications
1.29
N = 50
32 / 44
Applications
For the full sample, the null of stationarity may be rejected at 95%
using either the Lo modified R/S statistic or the classic
HurstMandelbrot statistic. For the postwar data, the null may not be
rejected at any level of significance. Long-range dependence, if
present in this series, seems to be contributed by pre-World War II
behavior of the stock price series.
33 / 44
Applications
34 / 44
Applications
. roblpr ftaret
Robinson estimates of fractional differencing parameter for ftaret
------------------------------------------------------Power
Ords
Est d
Std Err t(H0: d=0)
P>|t|
------------------------------------------------------.9
205
.1253645
.0446745
2.8062
0.005
-------------------------------------------------------
35 / 44
Applications
36 / 44
Applications
The GPH test, applied to the stock returns series, generates estimates
of the long memory parameter that cannot reject the null at the ten
percent level using the t-test.
Phillips modified LPR, applied to this series, finds that d = 1 can be
rejected for all powers tested, while d = 0 (stationarity) may be
rejected at the ten percent level for powers 0.6, 0.7, and 0.75.
Robinsons estimate for the returns series alone is quite precise.
Robinsons multivariate test, applied to the price and dividends series,
finds that each series has d > 0. The test that they share the same d
cannot be rejected. Accordingly, the test is applied to all three series
subject to the constraint that price and dividends series have a
common d, yielding a more precise estimate of the difference in d
parameters between those series and the stock returns series.
37 / 44
Applications
38 / 44
Applications
. webuse mloa
. arima S12.log, ar(1) ma(2) vsquish nolog
ARIMA regression
Sample: 1960m1 - 1990m12
Log likelihood =
Number of obs
Wald chi2(2)
Prob > chi2
2001.564
OPG
Std. Err.
S12.log
Coef.
_cons
.0036754
.0002475
.7354346
=
=
=
372
500.41
0.0000
P>|z|
14.85
0.000
.0031903
.0041605
.0357715
20.56
0.000
.6653237
.8055456
.1353086
.0513156
2.64
0.008
.0347319
.2358853
.0011129
.0000401
27.77
0.000
.0010344
.0011914
log
ARMA
ar
L1.
ma
L2.
/sigma
Note: The test of the variance against zero is one sided, and the two-sided
confidence interval is truncated at zero.
. psdensity d_arma omega1
39 / 44
Applications
S12.log
Number of obs
Wald chi2(3)
Prob > chi2
2006.0805
=
=
=
372
248.87
0.0000
OIM
Std. Err.
P>|z|
.003616
.0012968
2.79
0.005
.0010743
.0061578
.2160894
.1015596
2.13
0.033
.0170362
.4151426
.1633916
.4042573
.051691
.080546
3.16
5.02
0.002
0.000
.062079
.2463899
.2647041
.5621246
1.20e-06
8.84e-08
13.63
0.000
1.03e-06
1.38e-06
Coef.
S12.log
_cons
ARFIMA
ar
L1.
ma
L2.
d
/sigma2
Note: The test of the variance against zero is one sided, and the two-sided
confidence interval is truncated at zero.
40 / 44
Applications
Here, too, all parameters are significant at the five percent level. The
estimate of d, 0.404, is far from zero, indicating the presence of
long-range dependence. We can compare the models ability to
capture the dynamics of the series by computing their implied spectral
densities over (0, ).
For a stationary time series, the spectral density describes the relative
importance of components at different frequencies. The integral of the
spectral density over (, ) is the variance of the time series.
We can also compute the implied spectral density of the ARFIMA
model, setting d to zero to compute the short-run estimates. The
long-run estimates have infinite density at frequency zero.
41 / 44
Applications
42 / 44
Applications
Frequency
ARMA spectral density
1.5
1
.5
0
0
Frequency
ARMA spectral density
43 / 44
Applications
The two models imply different spectral densities for frequencies close
to zero when d > 0. The spectral density of the ARMA model remains
finite, but is pulled upward by the models attempt to capture
long-range dependence. The short-run ARFIMA parameters can
capture both low-frequency and high-frequency components in the
spectral density.
In contrast, the ARMA model confounds the long-run and short-run
effects. The added flexibility of the ARFIMA model, with a separate
parameter to capture long-run dependence, is evident in these
estimates.
Although we have not illustrated it here, arfima may also fit
ARFIMA-X models with additional exogenous regressors.
44 / 44