Box InterventionAnalysisApplications 1975
Box InterventionAnalysisApplications 1975
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected].
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
Taylor & Francis, Ltd. and American Statistical Association are collaborating with JSTOR to
digitize, preserve and extend access to Journal of the American Statistical Association
This article discusses the effect of interventions on a given response A. Monthly Average of Hourly Readings of 03 (pphm)
variable in the presence of dependent noise structure. Difference in Downtown Los Angeles (1955-1972)a
equation models are employed to represent the possible dynamic
WEIGHTS
characteristics of both the interventions and the noise. Some proper-
ties of the maximum likelihood estimators of parameters measuring
level changes are discussed. Two applications, one dealing with the
photochemical smog data in Los Angeles and the other with changes pphm
8.0
Intervening events
7.0
1. INTRODUCTION
6.0-
70
2. Work out the appropriate data analysis based on that model; where the polynomials X1(B), 42(BS), 01(B), 02(BS) are
3. If diagnostic checks show no inadequacy in the model, make degrees pt, P2., ql, q2, respectively.
appropriate inferences; if serious deficiencies are uncovered,
make appropriate model modification, repeat the analysis, etc.
3. A DYNAMIC MODEL FOR INTERVENTION
Suppose the data . . . Yte1, Yt, Yt?, . . . are avail-
Frequently the effects of exogenous variables t can be
able as a series obtained at equal time intervals. Follow-
represented by a dynamic model of the form
ing, e.g., [Ell we will employ models of the general form
k k
(8 < 1). It is readily shown that the time constant of If it were believed that the full impact of intervention
this system is estimated by { -log,b }-' and the steady might not be felt until the second month, after which
state gain is w/ (1 - 8). When 6 approaches the value there would be a decay and possibly a residual effect as
unity, we have the transfer function model in the previous case, the model
St = {I (B/(1 (- B)) + (W2B/(1 - B))}Pt(T) no intervention. Fitting the model can make it possible to
distinguish between what can and what cannot be explained by
could be used in which the possibility is entertained that the noise.
a residual gain (or loss) in sales W2 persists. (ii) Intervention extending over several time intervals can be
represented by a series of pulses. A three month advertising
campaign might be represented, for example, by three pulses
B. Responses to a Step and a Pulse Input a whose magnitude might represent expenditure in the three
months.
Input
stT) I,,4 @ @ , I _ _ L _ _ PT
4. CALCULATIONS BASED ON THE LIKELIHOOD
STEP PULSE
p(jy) a{S(y)}-(n/2) (4.6) I,: In 1960 the opening of the Golden State Freeway and the
coming into effect of a new law (Rule 63) reducing the allow-
so that the least square estimates correspond with the
able proportion of reactive hydrocarbons in locally sold
point of maximum posterior density. gasoline.
Now if, over the region where the density is ap- I2: From 1966 onwards regulations required engine design
preciable, S(5) is approximately quadratic (and in anychanges in new cars which would be expected to reduce the
given case it is easy to check this numerically), then the production of 03.
The theory developed here is illustrated in this section rl,={i "summer" months June-October beginning 1966
by two examples, one employing the Los Angeles oxidant
data and the other, the rate of change in the United (t=l0,
f ,otherwise.
"winter" months November-May beginning 1966
This allows for a step change in the level of 03 be- controls in August 1971. As indicated in the figure, in the
ginning in 1960 of size woo associated with I, and for three months beginning September 1971, Phase I control
progressive yearly increments in the 03 level beginning was applied; and after that to the end of the recorded
1966 of (002 and C003 units, respectively, for the summer
period, Phase II was in effect.
and the winter months. This representation is admittedly Inspection of the autocorrelation functions of the first
somewhat crude, and we hope to improve on it as more 218 observations and their differences prior to Phase I
data become available. suggests a noise model of the form
Estimation Results. The maximum likelihood estimates
(1 - B)Nt = (1 - OB)at . (5.3)
and the associated standard errors are as follows:
Thus, there is evidence that (ii) that the form of the noise model remains essentially the
same.
(i) associated with I, is a step change of approximately L30
= -1.09 units in the level of 03; On these assumptions, the approximate model (ignoring
(ii) associated with I2 there is a progressive reduction in 03. estimation errors in the noise structure) is
Over the period studied, there is a yearly increment of
approximately -O2 = -.25 in the summer months, but the Yt =01itl + W02it2 + {(1 - .84B)/(1 - B)}at (5.4)
increment (if any) in the winter is slight.
where
5.2 Example 2: The Rate of Change in the U.S. Consumer
11, t = September, October and November 1971
Price Index O0, otherwise
1, t > December 1971
A second example supplies further intuitive apprecia- ~t2 y 7
=0, otherwise
tion for the kind of calculations being performed.
Figure C shows the latter part of a record of the which may be written
monthly rate of change in the consumer price index
Zt= Coo1xt1 + C002Xt2 + at * (5.5)
(CPI) given more completely in [4]. The complete (July
1953 to December 1972) data include 234 successive The sequences {ztj, {xtI}, {xt2} may be readily calcu-
values, 218 of which occurred prior to the institution of lated from the equations
9.0 _ x Phase I in effect using, e.g., the initial approximation zi =X11 = X12 = 0.
o Phase 11 in effect
8.0 _
Also, since
7.0
(1 - B)/(1 - OB)
= 1-B(1-0)(1 + OB + 02B2 +*),
5.0 6 we have
4.0 T
Zt = ye - yt-i, Xtl = 61- it-1 l, Xt2 = t2- (t-1,2
3.0
where gt-e, kt-i,l and et-1i2 are exponentially
2.0
moving averages of values prior to time t, e.g.,
1.0
of yt from its average is related to the deviations of iti so that the maximum likelihood estimator of 3 is
and it2 from their averages. Notice, however, that the n n
model copes with nonstationarity by using not the usual = E Ztxt/E (Xt)2
arithmetic averages, but local exponentially weighted t=1 t=1
with (6.4)
averages which change as the series progresses.
Using (5.5), the constants cool and CO02 may now be Var (A) = (a) 2(E (xt) 2)1
estimated by ordinary linear least squares as t=1
Parameter MLE SE For large n, we apply the results (A.6) and (A.7) in
the appendix to obtain
cool -0.0022 0.0010
002 -0.0007 0.0009 00 00 00
The analysis suggests that a real drop in the rate of and for a given k
increase of the CPI is associated with Phase I, but the
effect of Phase II is less certain.
B'Ca,(k) = Ca,(k - 1) , 1 = 0, 41, 42,
Thus,
6. NATURE OF THE MAXIMUM LIKELIHOOD = R (F) Q (B) C V (0)/R (F)R (B) C (0) (6.5)
ESTIMATORS FOR SOME LEVEL and
CHANGE PARAMETERS Var (A) = (cTa)2/R(F)R(B)Ctt(O)
The maximum likelihood estimnators of parameters Making use of (A.10) in the appendix, we can write
such as col, C02 and WO3 in (5.2) and (5.4) which measure R (B)R (F) as
level changes are functions of the data. It is instructive
to consider the nature of these functions. Several results R(B)R(F) = ro + E ri(Bl + Ft) . (6.6)
1=1
in the summation of series useful in the following discus-
sion are given in the appendix. Suppose that it = Pt(T) is a pulse at time T, and a
large number of observations are available before and
6.1 One Parameter "Linear" Dynamic Model after T. In this case
where Q(B) = qp(B)/0(B), even though in practice the A = (ro)-'R(F)Q(B)yT and Var (A) = (a)2(ro)-1 (6.8)
Yt are only available for t = 1, , n. Since the roots
where it is understood that B is operating on T.
of @(B) all lie outside the unit circle, Q(B) can be ex-
Now, nonstationarity in time series data can often be
pressed as a power series in B which converges for
removed by differencing. In what follows we suppose that
BI = 1.
the polynomial (p(B) in (6.1) is divisible by (1 - B). We
Here we discuss the situation where
consider two special cases of interest.
( (B)10 (B)) (o(B)6f(B)) = lR(B) (6.2)
Case (i). cw(B)/6(B) = fiB , (6.9)
and investigate the nature of the maximum likelihood
that is, the pulse input Pt (T) gives rise to a response at
estimator of f, assuming that (i) the coefficients in Q(B)
time (T + 1) measured by : which dissipates completely
and R (B) are known and (ii) the power series R (B)
after the (T + 1)th period. It should be noted that with
converges for IB = 1. any number of periods of pure delay, the response will
Letting
follow the same pattern but be appropriately shifted. In
Zt = Q(B)yt and Xt = R(B)%t, this case, Q(B) = R(B)F so that, from (6.6) and (6.8),
we can write (6.1) in the form of the usual linear model
f3 = YT?1 - a E2 XI (yT?1?l + YT?1l-) , (6.10)
zt = f3Xt + at (6.3) 1=1
where Xi = - 2ri/ro. Also, since q(B) is assumed di- interventions after 1966 and discuss the step change
visible by (1 - B), rO + 2E,1l ri =0 , and hence
(fB/(1 - B))Pt(T) = wo,i I T = December 1959
1 -=1 x\1 = 1.
As an example, consider the integrated moving average in the level of the series due to the intervening events
model of order one for the noise term N, for which around that time. In this case, the noise model is such
that
q'(B) = 1-B and 0(B) = 1-B . (6.11)
(B) = (1 - B12)
Since
and
sudden heat wave in the summer or the sale of beer in (1 - 61B)(1 6-02B12) E rjBi = E B.
j=O j_o
Wisconsin should the Packers win the Super Bowl.
Essentially, we are comparing the observation YT+1 with By writing urj = 12n + m, we
the neighboring ones to determine if YT+1 is an "aberrant"
or "outlying" observation. The results in (6.10) and lrl2n+m - (1 - 01) 1( - 02)-l[(01)m+l (1 -
(6.12) are appealing since, in forming the comparison, - ( 1 - 02) (02) } n+ (4 - 02) (02)n]
more weight is given to observations close to the interven- m = O, ., 11; n = O, ,oo (6.18)
ing event and less and less weight to observations remote
where c = (01)12.
from the time of the event.
From (6.18) and after some algebraic reduction, we
Case (ii). c(B)/6(B) = B/(1 - B) (6.13) obtain, on setting 1 = 12k + s,
Here, the response to the pulse P,(T) is a step change rl2k+s (1 - 1)-2(1 - (02)2))1
in the level of the observations measured by A. Thus
*[12 -s(1 -02) + l (16;):
Q(B) = (1 - B)R(B)F (6.14)
and, from (6.6), (6.8) and (A.11), we have that *( _ - ] ( 2)k + (1 _ 01)-2
where, as before in (6.1), Q(B) = 4p(B)/0(B). Assuming Some Properties of 3 and 02*
that all the coefficients in Q(B) and Rj(B) are known and
(i) Both bi and b2 are linear functions of the observa-
these k + 1 power series converge for B = 1, the
tions yt. By setting B = F = 1, the sum of the co-
model is then linear in the k parameters 5 = (f1, , Ok)'.
efficients associated with yt is zero for both of these
It readily follows that, for large n, the maximum likeli-
functions. Thus, A and A2 are linear contrasts in yt.
hood estimator satisfies the normal equations
(ii) The estimator A2 can be expressed in the form
A = b (6.22) 00 00
32 =E allYT+l+l - E a2lYT-l
where A is a k X k matrix and b a k X 1 vector such that
where ?=0 1=0 (6.25)
00 00
and 132 represents the eventual change in the level of the it follows from (A.11) that EO= _ di = a22.
observations induced by the pulse input Pt(T) (see Further, making use of (A.12) and (A.13), we see that
Figure B(e) for the special case r1(B) = (1 -B)-1). a12 in (6.24) is also the coefficient of BO in R(B)R(F)
When fl = 0, the model reduces to that considered in * (1 - F)r(F). If we now set
(6.13). It is, therefore, of particular interest to know to
what extent the nature and precision of the estimator of G1(B) = R(B)R(F)(1 - F)>(F) , H1(B) = 1 - B
12 is affected by the presence of 01. We again suppose and
that the noise term is nonstationary so that s (B) is
b= E
l=00 lyrll
divisible by (1 - B).
To facilitate comparison with the model (6.13) we
again define a quantity R(B) such that we then have E0= _ di* = a12. The desired result
follows since
Q(B) = (1 - B)R(B)F, 00 0 0
R1 (B) = Q (B) >i(B)B = R (B) >q (B) (1 -B) This property is similar to that of A in (6.15) for the
and model (6.13), except that the weight functions are no
R2(B) = R (B) longer symmetrical. From least squares theory, we have
S3 = E: E VkYuXu+k,
k=O ui1
0' I 2 3 4 5 6 7 LAG 00 00
a so that
-.5 2.00 2.80 4.00 6.00 10.00 The quantity S3 in (6.1) can be expressed as
-.25 1.56 2.00 2.67 3.78 6.00
0 1.33 1.60 2.00 2.67 4.00 S3 = E VkCz1y(-k)
k -0
.25 1.20 1.36 1.60 2.00 2.80
.5 1.11 1.20 1.33 1.56 2.00 and, by letting C,( (-k) = BkCzv (0), we have
Statistical methods have, in a word, "lacked memory." Lemma 3: Let GJ(B) = 2? q, gBi and H(B) = ,^??_o0 hk?Bk
The dynamic characteristics of both the transfer function be two power series in B and converge for lB l 1, and let D (B)
so that x-:*=I di = -go and ?=_-o di = go; [5] Glass, G.V., "Estimating the Effects of Int
Nonstationary Time Series," American Educat
(iv) if gi = g- and hj = Oj- <-1, then Journal, 9, No. 3 (1972), 463-77.
[6] Tiao, G.C., Box, G.E.P. and Hamming, W.J., "Analysis of Los
do= F hjgj; (A.12) Angeles Photochemical Smog Data: A Statistical Overview,"
Journal of Air Pollution Control Association, 25 (March 1975).