Notes 10
Notes 10
real numbers, X (t ) is a continuous time process. If T is the set of integers (or some other set of
equally spaced numbers) then X (t ) is a discrete time process. The process X (t ) is strictly stationary
P (X (t 1 + τ) ∈ S 1 , . . . , X (tn + τ) ∈ Sn ) = P (X (t 1) ∈ S 1 , . . . , X (tn ) ∈ Sn )
for all real τ, where t 1 , . . . , tn are arbitrary points in T , and S 1 , . . . , Sn are arbitrary events. Here,
the finite- dimensional distributions depend on the relative time separations of the indices, but not on
their absolute time locations. If the X (t ) have finite means and variances, stationarity implies that
The process is said to be weakly stationary if the above conditions are met. All stationary
processes with finite variance are weakly stationary. Weak stationarity does not in general imply strict
stationarity. For Gaussian processes, however, any weakly stationary process is also strictly stationary.
In the rest of this section, we will assume that X (t ) is a continuous time weakly stationary process with
zero mean.
n n
Σ Σ a j adk C (t j − tk ) ≥ 0
j =1 k =1
∞
i λτ
C (τ) = ∫e
−∞
F (d λ) .
-2-
The function F (λ) is called the spectral distribution function. F (λ) determines a distribution, denoted
(with an abuse of notation) by F (A ) for measurable sets A . F (A ) is called the spectral distribution.
Hilbert Spaces
A vector space is said to be an inner product space if there exists an inner product (x , y ) which
(x , y ) = (y
ddddd
,x) ,
(x , x ) ≥ 0 , (x , x ) = 0 <==
> x =0 .
and form a basis for the space, they are said to be an orthonormal basis. Any vector z can be
represented as
z = β 1x 1 + . . . + β n x n ,
where βi = (z , xi ). The βi are called the Fourier coefficients of z . The Parseval relation is
n
e ez e e2 = Σ
i =1
e (z , xi ) e 2 .
A sequence {xn } converges to a vector x in the space (and we write xn → x ) if lim ||xn − x || = 0. A
n →∞
sequence for which lim ||xm − xn || = 0 is called a Cauchy Sequence. It can be shown that every con-
m ,n →∞
vergent sequence is a Cauchy sequence. If, conversely, every Cauchy sequence converges to an ele-
ment of the space, the space is said to be complete. A complete inner product space is called a Hil-
bert Space. All finite-dimensional inner product spaces are Hilbert spaces. Some specific Hilbert
spaces of importance to time series analysis are discussed in Koopmans, pp. 17-25. A Hilbert space
needed for the spectral representation is L 2(P ): the space of zero mean complex valued random vari-
variables Yn is said to converge to Y in mean square if ||Yn − Y ||2 → 0. Since L 2(P ) is complete, a
mean square Cauchy sequence will always have a limit in L 2(P ). The other Hilbert space needed for
the spectral representation is L 2(F ) (where F is the spectral distribution of the weakly stationary sto-
chastic process {Xt }), the space of complex-valued functions g (λ) of a real variable such that
∞
Xt = ∫ exp(i λt )dZ (λ)
−∞
,
where the increments dZ (λ) are uncorrelated random variables with mean zero and variance dF (λ). We
Let MX denote the subspace of L 2(P ) generated by the linear combinations of the components of
generate the space L 2(F ). Thus, we can define a correspondence between MX and L 2(F ) by the rule
Xt ←
→ exp(i λt ) ,
and then extending by linearity to include all elements in the two spaces. This correspondence
∞
(Xt , Xu )P = E [Xt Xdud] = C (t − u ) = ∫ exp(i λ(t − u )dF (λ)
−∞
∞
= dddddddd
∫ exp(i λt )exp(i λu ) dF (λ) = (exp(i λt ) , exp(i λu ))F .
−∞
-4-
It follows that for any elements g (λ) , h (λ) in L 2(F ), if G and H are the random variables in MX such
that G ←
→ g (λ) and H ←
→ h (λ), then
∞
dd ] =
E [G H ∫ g (λ)hdddd
(λ) dF (λ) .
−∞
If A is a Borel set, its characteristic (indicator) function IA (λ) is a member of L 2(F ). Thus, there must
Z (A ) ←
→ IA (λ) .
Note that Z (A ) is a set function whose values are random variables. Z (A ) is called the (ran-
dom) spectral measure of the process, and it indeed has the properties of a measure since Z (∅) = 0
∞ ∞
(almost surely), and Z ( ∪ Ai ) =
i =1
ΣZ (Ai )
i =1
(almost surely) for disjoint measurable sets Ai . Note that
E [Z (A )] = 0 and since the process is real we have F (−A ) = F (A ) and hence Z (−A ) = Zd(A
ddd). Further-
more,
∞
E [Z (A ) Zd(B
ddd)] = ∫ IA (λ)IdBddd
(λ) dF (λ) = F (A ∩B ) .
−∞
Any function g (λ) in L 2(F ) can be written as the limit of simple functions of the form Σk ak IA (λ)
k
∫ g (λ)dZ (λ)
−∞
.
The notation is analogous to the notation used for the Lebesgue integral. The increments dZ (λ) are
ddddd
E [dZ (λ)dZ (µ)] = dF (λ) if µ = λ ,
ddddd
E [dZ (λ)dZ (µ)] = 0 if µ ≠ λ .
-5-
∞
We have proved that g (λ) ←
→ ∫ g (λ) dZ (λ).
−∞
As a special case, then,
∞
exp(i λt ) ←
→ ∫ exp(i λt )dZ (λ)
−∞
.
∞
Xt = ∫ exp(i λt )dZ (λ)
−∞
,
To interpret the spectral representation, approximate the function exp(i λt ) by the simple function
tion says that Xt can be approximated arbitrarily well in mean square by the sum
Σk exp(i λk t )Z (Ak ) .
This sum is a superposition of complex exponentials with random (complex) amplitudes Z (Ak ) which
are uncorrelated with mean zero and variances E e Z (Ak ) e 2 = F (Ak ). Note that F (Ak ) is the contribution
to the total variance of Xt from the frequency interval (λk −1 , λk ]. Passing to the limit (n → ∞), we
obtain
∞
Xt = ∫ exp(i λt )dZ (λ)
−∞
,
so that Xt is a (generalized) sum of complex exponentials with uncorrelated random amplitudes dZ (λ).
The key interpretation of the spectrum is that dF (λ), the increment in the spectrum at λ, is the
contribution to the total power (variance) of Xt due to the component at frequency λ. (A more rigorous
interpretation of this statement follows.) Thus, the spectrum provides a decomposition of the total vari-
The spectral distribution F may be decomposed (by the Lebesgue decomposition) into discrete
-6-
and continuous components. If d λ is a small interval containing the frequency λ, we can write
F (d λ) = p (λ) + f (λ)d λ .
p (λ) = F ({λ}) is called the spectral function and represents the discrete power (if any) at λ, and f (λ)
is the spectral density. Thus, the total power in d λ consists of the discrete power at λ plus the continu-
ous power f (λ)d λ. The random spectral measure Z (A ) may also be decomposed into discrete and con-
X (t ) = Xd (t ) + Xc (t ) ,
where
∞
Xd (t ) = ∫ exp(i λt )Zd (d λ)
−∞
,
and
∞
Xc (t ) =
−∞
∫ exp(i λt )Zc (d λ)
are the discrete and continuous components of the process. It can be shown that
E e Zc (d λ) e 2 = f (λ)d λ ,
so that the contribution to the total power of Xc (t ) from the interval d λ is f (λ)d λ. Strictly speaking,
Xc (t ) does not have a component at λ, and it is only meaningful to talk about the contribution to the
total power from sets (e.g., intervals) of frequencies. The total amount of continuous power or continu-
ous spectral mass in a set of frequencies A is ∫ f (µ)d µ. On the other hand, the discrete component can
A
be written as
Xd (t ) = Σj exp(i λ j t )Z j ,
where the λ j are the points for which the spectral function p (λ) is positive (there can only be a count-
able number of such points) and Z j = Z ({λ j }). Thus, Xd (t ) is (exactly) a sum of complex exponentials
-7-
at frequencies λ j . Here, there really is a component at the exact frequency λ j and the contribution to
the total power from λ j is given by the discrete power p (λ j ). Further, the contribution to the power in
E e Zd (A ) e 2 = Σ p (λ j )
λ j ∈A
.