Multivariate Characteristic and Correlation Functions
Multivariate Characteristic and Correlation Functions
Editors
Carsten Carstensen, Berlin, Germany
Nicola Fusco, Napoli, Italy
Fritz Gesztesy, Columbia, Missouri, USA
Niels Jacob, Swansea, United Kingdom
Karl-Hermann Neeb, Erlangen, Germany
Zoltán Sasvári
Multivariate Characteristic
and Correlation Functions
De Gruyter
Mathematical Subject Classification 2010: 42B10, 43A35, 60F05, 60G10, 60G50, 60G51,
60E07, 60E10.
ISBN 978-3-11-022398-9
e-ISBN 978-3-11-022399-6
Set-ISBN 978-3-11-174044-7
ISSN 0179-0986
Printed in Germany
www.degruyter.com
Preface
Our goal in writing this book is to make the basic concepts and results on multivari-
ate characteristic and correlation functions easily accessible to both students and re-
searchers in a comprehensive manner. Especially in the interest of students, every
attempt has been made to make the book as self-contained as possible. This led to
a relatively large appendix containing topics like infinite products, functional equa-
tions, special functions or compact operators. We hope that this appendix will make
the usage of the book much easier, for example in seminars. Chapters 1, 2 and parts of
Chapter 3 have been ‘tested’ in lectures and seminars given at the Technical University
Dresden, Wroclaw University of Technology, and Swansea University.
In a certain sense, characteristic functions and correlation functions are the same,
the common underlying concept is positive definiteness. Many results in probability
theory, mathematical statistics and stochastic processes can be derived by using these
functions. While there are books on characteristic functions of one variable, books de-
voting some sections to the multivariate case, and books treating the general case of
locally compact groups, interestingly there is no book devoted entirely to the multidi-
mensional case which is extremely important for applications. This book is intended to
fill this gap at least partially. Due to the abundance of results and of their applications
a single book cannot treat all of them.
Most of the presented results are fairly well known. For this reason only few refer-
ences are included, rather we cite papers and monographs where such references can be
found. A brief outline of the book is as follows. The first chapter presents basic results
and should be read carefully since it is essential for the understanding of the subsequent
chapters. The second chapter is devoted to correlation functions, their applications to
stationary processes and some connections to harmonic analysis. In Chapter 3 we deal
with several special properties, Chapter 4 is devoted to the extension problem while
Chapter 5 contains a few applications. Many results in the Appendix are presented
with proofs. The reader might find some sections of the Appendix interesting on their
own, e.g., the section Solutions of certain functional equations or the section Linear
independence of exponential functions.
The numbering of chapters, sections, theorems, etc. is traditional. Equations are
numbered consecutively within each paragraph, theorem, definition, and so forth.
Equation (i) stands for the i-th equation within the current paragraph while equa-
tion (l.k.j.i) stands for the i-th equation within the Paragraph (l.k.j).
My warmest thanks go to Georg Berschneider who read the whole manuscript, gave
valuable remarks, helped to find literature to special topics and solved several TeX
vi Preface
problems. Björn Böttcher read substantial parts of the manuscript, I am grateful for his
comments. I thank my colleagues at the Institut für Stochastik, Technische Universität
Dresden, for many helpful discussions, in particular Christiane Weber for her help in
preparing the figures, the index, and the bibliography, and Willi Schenk for the proof
idea of Lemma B.1.2. I record my gratitude to the publishers, especially to the series
editor Niels Jacob for his interest and encouragement.
Last, but not least, I would like to thank my wife, Evelyne Sasvári, for her support
and understanding.
Preface v
1 Characteristic functions 1
1.1 Basic properties . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Differentiability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.3 Inversion theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.4 Basic properties of positive definite functions . . . . . . . . . . . . . . . . . . . . 25
1.5 Further properties of positive definite functions on Rd . . . . . . . . . . . . 32
1.6 Lévy’s continuity theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
1.7 The theorems of Bochner and Herglotz . . . . . . . . . . . . . . . . . . . . . . . . . 41
1.8 Fourier transformation on Rd . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
1.9 Fourier transformation on discrete commutative groups . . . . . . . . . . . . 55
1.10 Basic properties of Gaussian distributions . . . . . . . . . . . . . . . . . . . . . . 57
1.11 Some inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
2 Correlation functions 67
2.1 Random fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67
2.2 Correlation functions of second order random fields . . . . . . . . . . . . . . 70
2.3 Continuity and differentiability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
2.4 Integration with respect to complex measures . . . . . . . . . . . . . . . . . . . . 77
2.5 The Karhunen–Loève decomposition . . . . . . . . . . . . . . . . . . . . . . . . . . 86
2.6 Integration with respect to orthogonal random measures . . . . . . . . . . . 92
2.7 The theorem of Karhunen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
2.8 Stationary fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
2.9 Spectral representation of stationary fields . . . . . . . . . . . . . . . . . . . . . . 109
2.10 Unitary representations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
2.11 Unitary representations and positive definite functions . . . . . . . . . . . . . 125
3 Special properties 132
3.1 Strict positive definiteness . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
3.2 Infinitely differentiable and rapidly decreasing functions . . . . . . . . . . . 134
viii Contents
Characteristic functions
In this very first chapter we prove several classical results due to, among others,
Herglotz, Bochner, Lévy, and Plancherel. We start with basic properties of charac-
teristic functions and continue with the investigation of positive definite functions.
As a byproduct, we obtain basic results on the Fourier transformation which will be
needed in subsequent chapters.
Before starting we suggest to have a short look at Appendix A which contains basic
notation used throughout the book.
The existence of the expectation above is guaranteed by the fact that the complex-
valued random variable ei.t;X/ is bounded for each t 2 Rd . Sometimes we call f the
characteristic function of the distribution , in which case we also write f . As we
will see in Section 1.3, every distribution is uniquely determined by its characteristic
function.
If X has a density p, then
Z
f .t / D ei.t;x/ p.x/ dx; t 2 Rd :
Rd
K D Œa1 ; b1 Œad ; bd Rd ;
d Z bj
Y Y
d
1 eibj tj eiaj tj
D eitj xj dxj D
.K/ aj i.bj aj /tj
j D1 j D1
where the factors of the last product are equal to 1 (by definition) if tj D 0.
In the last example, the characteristic function fX is real-valued whenever aj D
bj for all j . We will show later (cf. Theorem 1.3.13) that fX is real-valued if and
only if X and X have the same distribution, i.e., if is symmetric: D 4 . The
proof of the “if part” is very simple, but for the “only if part” we need the fact that
every distribution is uniquely determined by its characteristic function.
(ii) jf .t /j 1; t 2 Rd ;
(iii) f .t / D f .t /; t 2 Rd ;
Proof. The first three properties follow immediately from Definition 1.1.1. To prove
(iv) assume that f is the characteristic function of a random vector X . Then
where the upper bound is independent of t and, by the dominated convergence theo-
rem, tends to zero as h ! 0. This shows that f is uniformly continuous on Rd .
Section 1.1 Basic properties 3
Theorem 1.1.3. If X and Y are independent d-dimensional random vectors, then the
characteristic function of their sum is
fXCY D fX fY :
1 Note that the distribution of X C Y is so that the equation in Theorem 1.1.3 can also be written
as f D f f .
4 Chapter 1 Characteristic functions
Proof. We have
T t;X/
fAXCb .t / D EŒei.t;AXCb/ D E Œei.t;b/ ei.A D ei.t;b/ fX .AT t /:
Applying the same argument as in the proof of Corollary 1.1.4, we obtain the fol-
lowing corollary.
In the rest of this section we present some simple examples of characteristic func-
tions of one variable. These functions are shown in Figures 1.1–1.5.
Example 1.1.13.
(a) The exponential distribution with parameter ˛ > 0 has the density function
x 7! ˛1Œ0;1/ .x/ e˛x ; x 2 R , and its characteristic function is given by
Z 1
˛ e.it˛/x ˇˇxD1 ˛
˛e˛x eitx dx D ˇ D
0 it ˛ xD0 ˛ it
(see Figure 1.1).
(b) The Laplace distribution with parameter ˇ > 0 has the density function x 7!
jxj
1
2ˇ
e ˇ ; x 2 R. Its characteristic function is given by
Z 1 Z 1
1 jxj 1 1
e ˇ eitx dx D e. ˇ Cit/x dx
1 2ˇ 0 2ˇ
Z 0
1 1
C e. ˇ Cit/x dx
2ˇ
1
1 1 1 1
D C D
2ˇ it ˇ 1
it C ˇ 1 1 C ˇ2t 2
(see Figure 1.2). Using Corollary 1.1.5 and the last two examples we see that
the difference of two independent identically distributed exponential random
variables is Laplace distributed.
Im
0.5
1 Re
0:5
0.5
3 2 1 1 2 3
with parameter 0 is
1
X 1
X
k itk .eit /k
e e D e D e.e 1/
it
kŠ kŠ
kD0 kD0
Im
1
0:5 1 Re
1
Figure 1.3. The function t 7! Œ1 p C p eit n from Example 1.1.13(c) with p D 1=2 and
n D 4:
Im
1
0:8 1 Re
1
Figure 1.4. The function t 7! Œ1 p C p eit n from Example 1.1.13(c) with p D 0:7 and
n D 4.
Section 1.2 Differentiability 9
Im
0.5
1 Re
0:5
1.2 Differentiability
The characteristic functions in Example 1.1.13 are infinitely often differentiable (even
analytic) on R. We will show in Example 1.3.7 that the characteristic function of the
Cauchy distribution with parameter ˛ D 1 is given by f .t / D ejtj . This function
is not differentiable at 0. Example B.1.19 shows that characteristic functions may be
nowhere differentiable.
Throughout this section, denotes a probability distribution on Rd with charac-
teristic function f . We will see that differentiability of f is closely related to the
existence of moments of .
Theorem 1.2.1. Let ˛ 2 Nd0 be such that the moment M˛ of exists. Then the partial
derivative D ˛ f exists and we have
Z
(i) D ˛ f .t / D ij˛j x ˛ ei.t;x/ d.x/; t 2 Rd ;
Rd
(ii) D ˛ f .0/ D ij˛j M˛ ;
(iii) D˛ f is uniformly continuous and jD ˛ f j is bounded by the absolute moment
A˛ of .
Proof. Suppose that ˛j > 0 for some j and let e1 ; : : : ; ed be the standard basis of
Rd . For h 2 R n f0g we have
Z
f .t C hej / f .t / eihxj 1 i.t;x/
D xj e d.x/:
h Rd hxj
10 Chapter 1 Characteristic functions
Theorem B.1.5 with n D 0 shows that the integrand is dominated by jxj j. Taking the
limit h ! 0 and applying Lebesgue’s theorem on dominated convergence we obtain
Z
@
f .t / D i xj ei.t;x/ d.x/; t 2 Rd :
@tj Rd
Repeating this argument we obtain (i). Equation (ii) is obtained from (i) by setting
t D 0.
From (i) we infer that
Z
˛
jD f .t /j jx ˛ j d.x/ D A˛ :
Rd
by dominated convergence.
@2 @ @
cj k D f .0/ C f .0/ f .0/:
@tj @tk @tj @tk
Corollary 1.2.3. Assume that ¤ ı0 and that for some ˛ 2 Nd0 the moment M2˛ of
exists. Then the function g defined by
.1/j˛j
g.t / D D 2˛ f .t /; t 2 Rd
M2˛
is a characteristic function. The corresponding distribution is given by
1
d.x/ D x 2˛ d.x/:
M2˛
Example 1.2.4. As an application of Theorem 1.2.1 we compute the mean and the
variance of the distributions from Example 1.1.13. Let f denote the characteristic
function of a random variable X with finite variance. Then, in view of (1.2.1.ii),
and hence
Var.X / D E. X 2 / .E X /2 D f 00 .0/ f 0 .0/2 :
(a) If X is exponentially distributed with parameter ˛ > 0 then we have
d ˛ ˛i 2˛
f 0 .t / D D ; f 00 .t / D
dt ˛ it .˛ it /2 .˛ it /3
d 1 2ˇ 2 t 2ˇ 2 .3ˇ 2 t 2 1/
f 0 .t / D D ; f 00 .t / D :
dt 1 C ˇ 2 t 2 .1 C ˇ 2 t 2 /2 .1 C ˇ 2 t 2 /3
The next theorem follows immediately from Theorem 1.2.1 and Theorem B.6.1 with
a D 0.
Theorem 1.2.5. If for some positive integer n all moments M˛ ; ˛ 2 Nd0 ; j˛j n, of
exist, then
X M˛ X
f .t / D .it /˛ C R˛ .t / t ˛ ; t 2 Rd
˛Š
j˛jn j˛jDn
where Z 1
n
R˛ .t / D .D ˛ f .xt / D ˛ f .0//.1 x/n1 dx:
˛Š 0
2A˛
Moreover, jR˛ .t /j ˛Š and limt!0 R˛ .t / D 0.
Theorem 1.2.6. Under the conditions of the previous theorem there exist ; 2 .0; 1/
depending on t such that
X M˛ X Re D ˛ f . t / C i Im D ˛ f .t /
f .t / D .it /˛ C t ˛:
˛Š ˛Š
j˛jn1 j˛jDn
Corollary 1.2.7. Let X D .X1 ; : : : ; Xd / be a random vector such that E.Xj2 / < 1
for all j . For the characteristic function f of X we then have
X
d
1 X
d
f .t / D 1 C i E.Xj /tj ŒE.Xi Xj / C Ri;j .t / ti tj ; t 2 Rd
2
j D1 i;j D1
In particular, if (1) holds for all t 2 Rd , then all moments of second order of are
finite and hence all partial derivatives of order two of f exist.4
Proof. Let s 2 R n f0g; t 2 Rd and T > 0 be arbitrary. We then have
Z
1 Re f .s t / 1 cos .s.t; x//
Ct 2
D d.x/
s Rd s2
Z T Z T
1 cos .s.t; x//
d.x/:
T T s2
Using the inequality (B.1.6.2) and Lebesgue’s theorem on dominated convergence we
see that the right-hand side tends to
Z Z T
1 T
.t; x/2 d.x/
2 T T
as s ! 0. Since T > 0 is arbitrary, we find that (2) holds.
4 Compare this result with Theorem 1.11.4.
Section 1.2 Differentiability 13
The existence of D ˛ f does not imply the existence of the moment M˛ in general.5
However, the following is true.
Theorem 1.2.9. If for some ˛ 2 Nd0 n f0g and for g D Re f all partial derivatives
D ˇ g; ˇ < 2˛, exist in an open neighborhood of zero and if D 2˛ g exists at zero,
then the moment M2˛ of exists. Moreover, the partial derivative D 2˛ f exists on all
of Rd .
Proof. The second statement follows from the first one using Theorem 1.2.1. To prove
the first statement assume first that j˛j D 1. By our assumption the function
Definition 1.2.10. Let X0 be a random variable with finite expectation and let X D
.X1 ; : : : ; Xd / be a random vector. The random variable X0 is said to have polynomial
regression of order k on X if there exists a polynomial
X
Q.t / D b˛ t ˛ ; t 2 Rd (1)
˛2Nd
0 ; j˛jk
If t and s a have the same sign, then jt C s jt jk . Using this inequality and
ajk
the fact that the integrand above is zero if s D a and t D 0, we obtain
Z Z Z Z
Ak .a/ jt jk d1 .t / d2 .s/ C jt jk d1 .t / d2 .s/
Œa;1/ Œ0;1/ .1;a .1;0/
Z Z
k
D 2 Œa; 1/ jt j d1 .t / C 2 .1; a jt jk d1 .t /:
Œ0;1/ .1;0/
Theorem 1.2.13. Let ı > 0 and f be a 2n-times differentiable function on the inter-
val .ı; ı/ such that f .t / D f .t /; jt j < ı. Further, let f1 ; : : : ; fr be characteristic
functions on R and ˛1 ; : : : ; ˛r be positive real numbers such that
Y
r
jfj .t /j˛j D jf .t /j; ı < t < ı:
j D1
1 gj .t /
C; t ¤0
t2
for some constant C . Lemma 1.2.8 shows that gj is twice differentiable.
Suppose now that gj is 2k-times differentiable for some k < n. Then
.2l1/
gj .0/ D 0; l D 1; : : : ; kI j D 1; : : : ; r:
X
r
1
˛j log D h.t /; ı0 < t < ı0 (2)
gj .t /
j D1
where h D log g. This equation and Faà di Bruno’s formula (B.1.14) with m D
2k; g D log and f D gj show that for t 2 .ı0 ; ı0 / we have
!b1 !b2 .2k/ !b2k
.2k/
X
r X gj0 .t / gj00 .t / gj .t /
h .t / D ˛j c.b/ (3)
gj .t / gj .t / gj .t /
j D1
.2k/ !
X
r
gj .t / .2k/
˛j gj .0/ D
gj .t /
j D1
X
r X
.2k/ .2k/
h .t / h .0/ ˛j c.b/ ŒSj;b .t / Sj;b .0/ (4)
j D1
Section 1.2 Differentiability 17
where the last sum is over all b 2 N2k0 satisfying b2k D 0; b1 C 2b2 C C
.2k 1/b2k1 D 2k and
!b !b .2k1/ !b
gj0 .t / 1 gj00 .t / 2 gj .t / 2k1
Sj;b .t / D :
gj .t / gj .t / gj .t /
.2l1/
Using that gj is 2k-times continuously differentiable and gj .0/ D 1; gj .0/ D 0,
1 l k, we obtain
1
D 1 C O.t 2 /; t ! 0 (5)
gj .t /
and
.2l1/ .2l2/ .2l2/
gj .t / D O.t /; gj .t / D gj .0/ C O.t 2 /; t ! 0:
These relations show that Sj;b .t / Sj;b .0/ D O.t 2 / if b ¤ .0; 0; : : : ; 1/. Since h is
2k-times differentiable we conclude that the left-hand side of (4) is O.t 2 /. Applying
this and (5) we get
X
r
.2k/ .2k/
˛j gj .t / gj .0/ D O.t 2 /:
j D1
.2k/ .2k/
Since gj =gj .0/ is a characteristic function (cf. Corollary 1.2.3), inequality
(1.1.2.ii) shows that the terms in this sum have the same sign and hence they are O.t 2 /.
.2k/
By Lemma 1.2.8 the function gj is twice differentiable, completing the induction
on k.
Remark 1.2.14. If all the numbers ˛k are integers or rational numbers, then Theo-
rem 1.2.13 follows easily from Theorem 1.2.12 using the fact that fj˛ is a characteris-
tic function for all ˛ 2 N. If g is a nonnegative characteristic function and ˛ is not an
integer, then g ˛ is not necessarily a characteristic function. To see a simple example,
let g.t / D cos2 t; t 2 R. Then g 1=2 .t / D j cos t j and g 1=2 is not a characteristic
function since it is infinitely differentiable on .=2; =2/ but it is not differentiable
at =2 (cf. Theorem 1.2.9).
Remark 1.2.15. The set of all points where a given characteristic function is differen-
tiable can be quite complicated. To see examples, let b 2 be an integer and f"n g1
1 be
a sequence of nonnegative numbers tending to zero. Denote by D the set of all t 2 R
where the characteristic function g defined by
1
X
g.t / D C "n b n cos b n t; t 2R
nD1
(i) For every open interval I R the set D \ I has the same cardinality as R.
P
(ii) If 1 0
1 "n < 1 then D D R and g is continuous.
P1 2
(iii) If 1 "n D 1 then .D/ D 0.
This result is due to A. Zygmund, a proof can be found in Section 1.10 of [6].
where 8
< 1 if xj 2 .aj ; bj /
j .x/ D
1
if xj D aj or xj D bj
: 2
0 if xj … Œaj ; bj :
Proof. For T > 0 write
Z Z Y
d
1 T T
eiaj tj eibj tj
I.T / WD ::: f .t / dt
.2/d T T j D1 itj
Z Z Y
d Z
1 T T
eiaj tj eibj tj
D ::: ei.x;t/ d.x/ dt:
.2/d T T j D1
itj R d
It follows from Corollary B.1.8 that limT !1 Kj .T; x/ D j .x/. By Theorem B.1.9,
we have jKj .T; x/j < 2. Hence, the statement of the theorem follows by dominated
convergence.
The next lemma deals with a special case of Gaussian distributions which will be
investigated in more detail in Sections 1.10 and 3.5.
Proof. By definition of f
Z
e i.t;y/
f .t / D ei.t;xCy/ d.x/:
Rd
Integrating both sides with respect to and using Fubini’s theorem we obtain the
desired equation.
Throughout the rest of this section f denotes the characteristic function of a prob-
ability distribution on Rd.
The most frequently applied inversion formula is presented in the next theorem.
Lemma 1.3.5 shows that the integral on the right-hand side is equal to
Z
ei.x;y/ f .x/'n .x/ dx
Rd
and therefore Z
1 1
ktk2
pn .y/ D ei.t;y/ f .t /e 2n2 dt: (1)
.2/d Rd
The sequence fSn g1 1 converges pointwise to X . Therefore, the corresponding se-
quence of distributions converges weakly to . By Theorem E.1.7,
Z Z
lim h.y/pn .y/ dy D h.y/ d.y/ (2)
n!1 Rd Rd
22 Chapter 1 Characteristic functions
for every bounded continuous function h on Rd . Since the modulus of the integrand
on the right-hand side of (1) is bounded by jf j, Lebesgue’s dominated convergence
theorem can be applied to show that pn .y/ tends to
Z
1
p.y/ WD f .t / ei.t;y/ dt; y 2 Rd
.2/d Rd
for all y 2 Rd . From (1) we also see that the sequence fpn g1
nD1 is uniformly bounded.
Applying Lebesgue’s theorem once more and using (2) we obtain
Z Z
h.y/p.y/ dy D lim h.y/pn .y/ dy
Rd Rd n!1
Z Z
D lim h.y/pn .y/ dy D h.y/ d.y/:
n!1 Rd Rd
From this we conclude that is absolutely continuous with density p.
Z Z T !
Y d
1
D eitj .xj aj / dtj d.x/
Rd 2T T
j D1
Z !
Yd
DW Kj .xj ; T / d.x/
Rd j D1
Consequently, is continuous if and only if the limit on the left-hand side is equal
to zero.
We close this section with some simple applications of the uniqueness Theo-
rem 1.3.3.
Y
d
fX .t / D fj .tj /; t 2 Rd (1)
j D1
Thus, the values .Hy;r /; r 2 R, determine the distribution (and hence the charac-
teristic function) of the random variable ! 7! .y; X.!// for all y 2 Rd . On the other
hand, the characteristic functions of these random variables determine the character-
istic function of X (cf. Remark 1.1.10). This fact together with Theorem 1.3.3 proves
the result.
The next result can be proved in the same way as the previous one. We omit the
proof.
Theorem 1.3.13. For a random vector X the following conditions are equivalent.
(i) The distribution of X is symmetric.
(ii) The characteristic function f of X is real-valued.
R
(iii) f .t / D Rd cos..t; x// d.x/; t 2 Rd :
Proof. If is symmetric then X and X have the same distribution and hence
f .t / D E ei.t;X/ D E ei.t;X/ D f .t / D f .t /; t 2 Rd :
Thus, f is real-valued.
Section 1.4 Basic properties of positive definite functions 25
f .t / D E Œcos.t; X / C iE Œsin.t; X / :
holds for every positive integer n, for all t1 ; : : : ; tn 2 G, and for all c1 ; : : : ; cn 2 C.
We denote by P .G/ the set of all positive definite functions on G while P c .Rd / and
P m .Rd / denote the set of all continuous and Lebesgue measurable positive definite
functions on Rd , respectively.
Remark 1.4.2. By the definition above, f is positive definite if and only if the matrix
n
A D f .ti tj / i;j D1
12 Note that most of the results on positive definite functions presented in this book have their analogues
on locally compact groups (see for example [49]). The restriction of considering only commutative
groups without topology makes the treatment simpler. On the other hand, the results obtained are
sufficient for many applications in probability theory.
13 Section D.2 contains some basic results on positive semidefinite matrices.
26 Chapter 1 Characteristic functions
ure by
X
n
D cj ıtj :
j D1
Pn
Then Q D j D1 cj ıtj and
Z X
n
Q f .t / D f .t x/ d. /.x/
Q D f .t ti C tj /ci cj ; t 2 G:
G i;j D1
Setting here t D 0 and replacing ti by ti we obtain the sum in Definition 1.4.1. We
have thus proved the following lemma.
X
k1 X
k
nj < i nj
j D1 j D1
.t / D ei.x;t/ ; t 2 Rd
.n/ D z n ; n 2 Zd :
Proof. The first statement follows immediately from Theorem C.9.2 using the fact that
j j D 1.
To prove the second one let e1 ; : : : ; ed be the standard basis of Zd . If is a character
of Zd and n D .n1 ; : : : ; nd / 2 Zd then
where zi D .ei / 2 T.
Next we prove several simple properties of positive definite functions. From Theo-
rem D.2.3 we obtain the following lemma:
Lemma 1.4.8. Let f 2 P .G/. Then f is Hermitian, i.e., f .x/ D f .x/ holds for
all x 2 G.
Theorem 1.4.9. Let f1 ; f2 2 P .G/. Then the functions f1 , Ref1 , jf1 j2 , and f1 f2
are positive definite. Moreover, p1 f1 C p2 f2 is positive definite for all p1 ; p2 0.
Theorem 1.4.10. If a sequence (or net) of positive definite functions converges point-
wise to a function f then f is positive definite.
As a consequence of Theorems 1.4.9 and 1.4.10 we obtain the following corollary:
Corollary 1.4.11. The set P .G/ is a convex cone closed in the topology of pointwise
convergence.
Next we prove some frequently used inequalities.14
It is easy to check that this definition is correct, i.e., .g; h/ does not depend on the
particular representations of g and h. As f 2 P .G/, this inner product is nonnegative
definite. Thus, the Cauchy–Schwarz inequality j.g; h/j2 .g; g/ .h; h/ holds, and
this is the same as (v).
Setting m D 1, y1 D 0, and b1 D 1 in (v), we get (iv).
The inequality (ii) follows readily from (iv) by setting n D 2, x1 D x, x2 D y,
a1 D 1, a2 D 1. Putting n D 1, x1 D x, and a1 D 1 in (iv), we get jf .x/j2 f .0/2
and, since f .0/ D .1f0g ; 1f0g / 0, this gives jf .x/j f .0/.
or equivalently,
jc abj2 .1 jaj2 /.1 jbj2 /: (1)
Assume first that f .0/ D 1: Applying (1) to the (Hermitian) matrix
0 1
3 1 f .x/ f .y/
f .xi xj / i;j D1 D @ f .x/ 1 f .x C y/A
f .y/ f .x y/ 1
G0 WD f x 2 G W jf .x/j D 1 g:
f .x C y/ D f .x/f .y/
Proof. This follows immediately from inequality (1.4.12.iii) and Lemma 1.4.8.
Corollary 1.4.14. Let f 2 P .R/ be such that f .0/ D 1. If there exist n 2 N and
h > 0 such that f .h/n D 1 then f is periodic with period nh.
Proof. The case n D 1 follows readily from Corollary 1.4.13. Using this, the general
case is obtained by noting that jf .h/j D 1 and therefore, by Corollary 1.4.13, f .nh/ D
f .h/n D 1.
The functions
t 7! ektk ;
2
t 7! ei.t;x/ ; t 7! cos..t; x// and t; x 2 Rd
30 Chapter 1 Characteristic functions
provide examples for the subgroup G0 in the previous corollary. Example 1.4.4
shows that for an arbitrary subgroup G0 of G there exists f 2 P .G/ such that
G0 D f x 2 G W jf .x/j D 1 g.
Corollary 1.4.15. Let fn 2 P .G/ be such that fn .0/ D 1; n 2 N. Then the set of all
x 2 G with fn .x/ ! 1; n ! 1, is a subgroup of G. In particular, if G D Rd and
fn ! 1 on a neighborhood of 0 then fn ! 1 on Rd .
Next we list some simple results that can be used to construct positive definite func-
tions.
Proof. The lemma follows immediately from the fact that the entrywise prod-
uct of positive semidefinite matrices is positive semidefinite (see Schur’s Theo-
rem D.2.12).
Proof. By (1.4.12.i), we have jg.t /j g.0/, and hence the definition of f is correct.
The positive definiteness of f follows from Theorems 1.4.9 and 1.4.10.
For example, the functions eg and 1= .2g.0/ g/, are positive definite if g 2 P .G/
and g.0/ ¤ 0.
X
n
D cj ıtj ; n 2 N; cj 2 C; tj 2 G
j D1
Section 1.4 Basic properties of positive definite functions 31
y
is positive definite. In particular, the function fc defined by
Proof. Let be an arbitrary finitely supported complex measure on G. The first state-
ment of the lemma follows from the equation
is convex. Using the previous lemma we are now able to identify the extreme points
(cf. D.4.1) of this set. First we prove a simple but useful statement.
Proof. If f1 .0/ D 0, then f1 D 0 (cf. (1.4.12.i)) and hence f2 D f . The case f2 .0/ D
0 can be treated in the same way. If f1 .0/ and f2 .0/ are different from zero then
fi =fi .0/ 2 P0 .G/ and
f1 f2
f D f1 .0/ C f2 .0/ :
f1 .0/ f2 .0/
Since f is extremal and 1 D f .0/ D f1 .0/ C f2 .0/ we conclude that fi =fi .0/ is
equal to f .
Lemma 1.5.1. Let f 2 P c .Rd / such that jf .t /j D 1 holds for all t from a neigh-
borhood of 0 2 Rd . Then
f .t / D ei.x;t/ ; t 2 Rd
for some x 2 Rd .
The next two results can be applied to construct positive definite functions.
Proof. By Lemma 1.4.18, the statement is true if has finite support. Let now be
arbitrary, and let f˛ g be a net of finitely supported complex measures converging
weakly to (cf. Theorem E.1.8). By Theorem E.2.5, the net f˛ Q ˛ g converges
weakly to ,Q and hence f˛ Q ˛ f g converges pointwise to Q f . Thus,
the first assertion follows from Theorem 1.4.10. The second assertion follows from
the first one by setting d.x/ D h.x/ dx.
Z ˇ n ˇ2
ˇX ˇ
ˇ ˇ
D ˇ g.t i C y/ci ˇ dy 0
ˇ
Rd iD1 ˇ
i.e., f is positive definite. The remaining statements follow from Theorem E.2.4.
34 Chapter 1 Characteristic functions
0.5
3 2 1 1 2 3
1
hn WD p 1Sn
jSn j
where j j denotes the counting measure. For all r 2 Œ0; n we then have
The next lemma follows immediately from the definition of positive definiteness,
we omit the proof.
Lemma 1.5.7. Let f 2 P .Rd / be Borel measurable and let be a finite, nonnegative
measure on B1 . Then the function g defined by
Z
g.t / D f .xt / d.x/; t 2 Rd
R
Proof. The ‘only if part’ follows from Lemma 1.5.3 with t D 0 and d.x/ D h.x/ dx,
using the fact that positive definite functions are nonnegative at 0.
Q
The assumption of the ‘if part’ means that h hf .0/ 0 for all h 2 C00 .Rd /. Let
be an arbitrary finitely supported complex measure. It follows from Theorems E.1.9
and E.2.5 that there exists a sequence fhn g1
1 of functions hn 2 C00 .R / such that n
d
Q n converges weakly to , Q where dn .x/ D hn .x/ dx. From this we conclude that
Q f .0/ 0. Thus, the positive definiteness of f follows from Lemma 1.4.3.
Proof. Since the integrand is positive definite for all fixed t , it suffices to prove that
Z
f .x/ dx 0
Rd
for all f 2 P c .Rd / \ L1 .Rd /. To show this inequality we choose a sequence fgn g
as in Lemma 1.5.6. Then jfgn j jf j and Lebesgue’s theorem on dominated conver-
gence shows that
Z Z
f .x/ dx D lim f .x/gn .x/ dx 0:
Rd n!1 Rd
Proof. We may assume that .K/ > 0; g D 0 and jgn j 1. Suppose, on the contrary,
that for some compact set C Rd the sequence f'n g does not converge uniformly to 0
on C . Then there exist a positive number ı, a sequence ftn g1 1 C , and a subsequence
f'kn g11 such that j'k n
.t n /j ı. We claim that the Lebesgue measure of the set
ı
Sn D x 2 K W jgkn .tn C x/j
2.K/
ı
is at least 2 . Indeed, the inequality .Sn / < 2ı would imply that
Z Z
j'kn .tn /j jgkn .tn C x/j dx C jgkn .tn C x/j dx
Sn KnSn
ı
.Sn / C .K/ <ı
2.K/
ı
contradicting our choice of tn and kn . Thus, jgkn .y/j 2.K/ > 0 for all y 2 SQn WD
Sn C tn K C C and .SQn / D .Sn / ı . Consequently,
2
1 [
\ 1
ı
.lim sup SQn / D SQm
mDn
2
nD1
which is a contradiction since the sequence fgn g11 does not converge to 0 on the set
lim sup SQn which is contained in the compact set K C C .
We close this section with an important result concerning the convergence of posi-
tive definite functions.
That the convergence is uniform on compact sets follows from Theorem 1.5.11.
.Ka / 1 7 sup Œ1 Re f .t /
t2K1=a
Proof. Using Fubini’s theorem and the example after Definition 1.1.1 we obtain
d Z
2
sup Œ1 Re f .t / Œ1 Re f .t / dt
a t2K1=a K1=a
Z Z
D Œ1 cos..t; y// d.y/ dt
K1=a Rd
Z Z
D Œ1 cos..t; y// dt d.y/
Rd K1=a
Z " #
2 d Y d
sin yj =a
D 1 d.y/
a Rd yj =a
j D1
Z " #
2 d Y
d
sin yj =a
1 d.y/
a Rd nKa yj =a
j D1
holds for all k and m 2 N. Taking lim sup with respect to k and using Theorem E.1.11
we obtain
Now letting m ! 1 and noting that f is continuous and f .0/ D 1, we see that
.Rd / 1. Thus, .Rd / D 1 and hence fnk g1 kD1
converges weakly to (cf.
Theorem E.1.12).
From Theorem 1.6.1 we conclude that f is the characteristic function of . Since
is uniquely determined by f , we see that every weakly convergent subsequence of
fn g11 has the same limit . This shows that the whole sequence converges weakly
to .
Remark 1.6.4. In view of Theorem 1.4.10 and Corollary 1.5.2, it suffices to assume
continuity of Ref at 0 in the previous theorem. This assumption cannot be dropped.
Indeed, with the notation introduced in Remark 1.5.5, limn!1 1=n D 1f0g . The
limit is not a characteristic function though it is continuous at every point different
from zero.
The next result will be used in the proof of Schoenberg’s Theorem 3.8.5.
Proof. We have
ˇZ Z ˇ
ˇ ˇ
ˇ
jfn .rt / f .rt /j D ˇ ei.rt;x/
pn .x/ dx ei.rt;x/
p.x/ dx ˇˇ
ZR Rd
d
By Scheffe’s Lemma E.1.19 the integral above tends to zero as n ! 1. This com-
pletes the proof.
Lemma 1.7.1. Let be a probability measure on Rd and for n 2 N define the prob-
ability measure n by n .B/ D .nB/; B 2 Bd . Then
Proof. Let > 0 and t 2 Rd be arbitrary and choose a bounded, open neighborhood
U of 0 such that
jg.t / g.t x/j < ; x 2 U:
42 Chapter 1 Characteristic functions
We have
ˇZ ˇ
ˇ ˇ
jg.t / n g.t /j D ˇˇ g.t / g.t x/ dn .x/ˇˇ
Rd
n .U / C 2kgk1 n .Rd n U /
C 2kgk1 .Rd n nU /:
The lemma follows now from the fact that limn!1 .Rd n nU / D 0.
Applying the previous lemma to an absolutely continuous measure we immedi-
ately obtain the following corollary:
pn .t / D nd p.nt /; t 2 Rd :
Then
lim pn g.t / D g.t /; t 2 Rd
n!1
with some nonnegative finite Borel measure on Œ0; 2/d . The measure is uniquely
determined by f .
Remark 1.7.7. If G D Zd then we can identify GO with the set Td (see Lemma 1.4.7).
Moreover, the topology of pointwise convergence on GO is the same as the metric to-
pology of Td . Since every z 2 T can uniquely be written as z D eit with some
t 2 Œ0; 2/, we can identify GO also with Œ0; 2/d . This is sometimes more conve-
nient. However, in this case the topology of pointwise convergence is not the same as
the metric topology (note that Œ0; 2/d is not compact). Nevertheless, both topologies
generate the same Borel sets.
X
n Z ˇˇXn
ˇ2
ˇ
ˇ ˇ
f .xi xj /ci cj D ˇ ci .xi /ˇ d. / 0;
O
G ˇ ˇ
i;j D1 iD1
showing that f is positive definite. To prove the other direction we may assume that
f 2 P0 .G/. By Theorem D.4.6 (see also Remark 1.7.6), there exists a probability
measure D f on GO such that
Z
l.f / D l. / d. /
O
G
holds for every continuous linear functional l on L.G/. Taking here the linear func-
tionals lx defined by lx .h/ D h.x/; h 2 L.G/, we obtain the desired representation
of f .
To prove uniqueness, assume that is a finite Radon measure on GO such that
Z Z
.x/ d. / D .x/ d. /; x 2 G:
O
G O
G
definite function 1f0g we see that there exists a probability measure on GO such that
Z
0 D 1f0g .x y/ D .x y/ d. /:
O
G
This implies the existence of 2 GO such that .x y/ ¤ 1 from which .x/ ¤ .y/
follows.
Thus, A satisfies the conditions of the Stone–Weierstraß theorem (cf. Theo-
O can be uniformly approx-
rem B.2.4) and therefore an arbitrary functionR g 2 C.G/
R
imated by functions from A. This shows that g d D g d for all g 2 C.G/, O and
hence D .
The next statement has been show in the proof of Theorem 1.7.8. We state it explic-
itly because of its importance.
Corollary 1.7.10. If x and y are distinct elements of G, then there exists a character
of G such that .x/ ¤ .y/.
Another nice corollary of Theorem 1.7.8 is the existence and uniqueness of a
O which is called the normalized Haar
translation invariant probability measure on G,
measure.
Corollary 1.7.11. There exists a unique Radon probability measure on GO such that
Z Z
f . / d. / D f . / d. /; f 2 C.G/; O O
2 G:
O
G O
G
Proof. Let and A be as in the proof of Theorem 1.7.8. It follows from the definition
of that the equation above holds for all f 2 A, while the density property of A
O
implies that it holds for all f 2 C.G/.
Assume that is a Radon probability measure satisfying
Z Z
f . / d. / D f . / d. /; f 2 C.G/; O O
2 G:
O
G O
G
We have shown in the proof of Theorem 1.7.8 that for all x ¤ 0 there exists 2 GO
such that .x/ ¤ .0/ D 1. Using this and the relation above we conclude that
Z
.x/ d. / D 1f0g .x/; x 2 G
O
G
and hence D .
Example 1.7.12. Identifying the character group of Zd with Œ0; 2/d the restriction
of the Lebesgue measure to Œ0; 2/d is translation invariant in the sense of the previ-
ous corollary. Thus, dividing this restriction by .2/d we obtain the normalized Haar
measure.
then D 0.
Proof of Theorem 1.7.5. We identify the character group of Zd with Œ0; 2/d (see Re-
mark 1.7.7). Since the topology of pointwise convergence generates the same Borel
-algebra as the metric topology of Œ0; 2/d , Theorem 1.7.5 follows immediately from
Theorem 1.7.8.
Alternative proof of Theorem 1.7.5 in the case d D 1. We show here only the exis-
tence of . Definition 1.4.1 with cj D eijx , x 2 R, and tj D j gives
1 XX X
N N N
jmj
0 PN .x/ WD f .j k/ei.j k/x D 1 f .m/eimx
N N
j D1 kD1 mDN
Section 1.8 Fourier transformation on Rd 47
and therefore
Z Z
1 X
N
1 2
jmj 2
inx
e PN .x/ dx D 1 f .m/ ei.nm/x dx
2 0 2 N 0
mDN
jnj
D 1 f .n/
N C
In particular, N .Œ; // D f .0/. By Theorem E.1.13, the sequence fN g contains
a subsequence converging weakly to some nonnegative measure .22 From (1) we
conclude that Z
einx d.x/ D f .n/:
Œ0;2/
Note that if ' is a density, then the corresponding characteristic function is equal
L On the other hand, if is the distribution of a random vector, then the
to .2/d=2 '.
L
corresponding characteristic function is .
To see an example of the Fourier transform, let ' be the Gaussian density as defined
in Lemma 1.3.4. Then, by the same lemma,
1
'O D 'L D d
'1= :
Then
Y
d
eibj tj eiaj tj
O /D
g.t ; t 2 Rd;
itj
j D1
jg.t
O /j D j.g h/ O.t / C h.t O /j < :
O /j kg hk1 C jh.t
does not tend to zero at infinity if and only if WD q1 is a Pisot number23 different
from 2. This is the so-called Zygmund–Salem theorem; we refer to [6, Sect. 1.11] for
more details.
The Fourier transform of an integrable function may not be integrable (cf. Exam-
ple 1.8.14). However, we have the following result:24
L WD f gO W g 2 L1 .Rd / \ L2 .Rd / g
23 A real number > 1 is said to be a Pisot number if it satisfies an equation
n C b1 n1 C C bn D 0
where b1 ; : : : ; bn are integers, and all other roots of this equation have moduli less than 1.
24 See also Theorem 1.8.13.
Section 1.8 Fourier transformation on Rd 51
kgk2 D kgk
O 2; g 2 L1 .Rd / \ L2 .Rd /:
holds for all x 2 Rd , i.e., . /O D 0. The first part of Theorem 1.8.2 implies that D
0 and hence also D 0. Thus, 0 is the only element in L2 .Rd / that is orthogonal to L;
therefore L is dense in L2 .Rd /.
It follows from (1) and (1.8.9.i) that kk2 D kkL 2 . Thus, the mapping 7! L maps
L1 .Rd / \ L2 .Rd /, which is dense in L2 .Rd /, isometrically into L2 .Rd /, and there-
Section 1.8 Fourier transformation on Rd 53
fore it extends to a unique isometric mapping from L2 .Rd / onto L2 .R/. From (1) we
conclude that (i) holds while (ii) is obtained in the same way as (1.8.9.ii). Taking the
inverse Fourier transform of both sides of (1.8.9.ii) we obtain (iii).
Proof. First we note that f 2 L2 .Rd /. Let ' be the Gaussian density as defined
in Lemma 1.3.4. Applying (1.8.9.i) with g D '1=n and using that 'O1=n D nd 'n we
obtain
Z 2
Z
1 O kt k2
f .t /e 2n dt D fO.t /'O1=n .t / dt
.2/d=2 Rd Rd
Z
D f .x/'1=n .x/ dx kf k1 :
Rd
Thus, fO is integrable.
Next we give two simple examples where the Fourier transform is square integrable
but not integrable.
Example 1.8.14. By Example 1.1.13(a), the Fourier transform of the function p.x/ D
1Œ0;1/ .x/ex ; x 2 R, is equal to q.t / D p1 1Cit
1
. This function is square integrable
2
but not integrable. By (1.8.11.i), the inverse L2 Fourier transform of q is p.
A similar example is given by the indicator function p of the set Œ1; 1d . The
computations at the end of Definition 1.1.1 show that
d=2 Y
d
2 sin tj
q.t / WD p.t
O / D p.t
L /D ; t 2 Rd :
tj
j D1
Theorem 1.8.15. Assume that f 2 P c .Rd / admits the representing measure and
let 2 Mb .Rd / be arbitrary. Then the function Q f is positive definite and its
representing measure is given by d .s/ D j.s/j
O 2 d .s/.
Proof. Using Z
f .t / D ei.t;s/ d .s/
Proof. The uniqueness follows from Corollary 1.7.13, the remaining properties are
simple consequences of the definitions.
Note that the function 7! .x/ is bounded and continuous on GO for all x in G,
so the integral exists. If d. / D f . / d. / with f 2 L1 .G/ O where denotes the
normalized Haar measure on G, O then we write fL for L and call fL the inverse Fourier
transform of f .
56 Chapter 1 Characteristic functions
Proof. The uniqueness can be proved by the same arguments used in the proof of
Theorem 1.7.8. The remaining properties are simple consequences of the defini-
tions.
The next theorem is the analogue of Theorem 1.8.15 and can be proved in the same
way. We omit the proof.
Theorem 1.9.6.
(i) The relation f D .fO /L holds for all f 2 L1 .G/.
(ii) If f 2 P .G/ \ L1 .G/, then fO 0.
(iii) If f 2 L1 .G/ and fO 0, then f 2 P .G/.
(ii) By Theorem 1.7.8 there exists a unique nonnegative measure 2 MbC .G/ O
such that f D . O
L By (i) we also have f D .f /L. Since is unique we must have
D fO d. The nonnegativity of implies that fO 0. (iii) This statement follows
immediately from (i).
The extended Fourier transform 1.9.7. If 2 Mf .Zd / then we can extend the
Fourier–Stieltjes transform of from Td to Cd n f0g by setting
Z
O
.z/ WD z n d.n/; z 2 Cd n f0g:
Zd
Suppose that a random vector X has characteristic function g and consider the random
vector Y D m C BX where m 2 Rd and B is a d d real matrix. The covariance
matrix of Y is C D BB T (see page 349). By Theorem 1.1.7, the characteristic function
fY of Y is given by
1 T t;B T t/ 1
fY .t / D ei.m;t/ g.B T t / D ei.m;t/ 2 .B D ei.m;t/ 2 .C t;t/ :
For such Y we will write Y N.m; C /. It follows from Theorem 1.2.1 that m D
E.Y / and C D cov.Y /.
The next lemma follows immediately from the definition of Gaussian random vec-
tors, we omit the proof.
The next theorem gives a very useful characterization of Gaussian random vectors.
Proof. The necessity of the condition has already been established in Lemma 1.10.3.
To prove the sufficiency suppose that Xa D .a; X / is Gaussian for any a 2 Rd .
Setting m D E.X / and C D cov.X / we have E.Xa / D .m; a/ and cov.Xa / D
.C a; a/. Thus, the characteristic function fa of Xa is given by
1 2
fa .t / D E eit.a;X/ D eit.m;a/ 2 t .C a;a/ ; t 2 R:
Y D E.Y / C BX:
Proof. We may assume that E.Y / D 0. Let Y D .Y1 ; : : : ; Yd / and denote by L the
linear subspace of L2r . ; A; P / spanned by the random variables Yj . The elements
of L are Gaussian random variables with zero mean (cf. Theorem 1.10.4). By Theo-
rem D.2.16, the dimension of L is equal to the rank of C . Let X1 ; : : : ; Xk be an arbi-
trary orthonormal basis of L. Then the random vector X D .X1 ; : : : ; Xk / is standard
Gaussian. The existence of B follows from the fact that each Yj is a linear combination
of the Xi ’s. Moreover, rank.B/ D dim L D k.
Using the previous theorem we can now express the density of absolutely continu-
ous Gaussian vectors by means of the standard Gaussian density.
Y D m C BX
1
lim fn .t / D ei.m;t/ 2 .C t;t/ ; t 2 Rd
n!1
and convergence in law of the sequence fXn g1 1 follows from Theorem 1.6.3.
Assume now that fXn g1 converges in law. By Theorem 1.6.1, the sequence ffn g1
1
1
converges uniformly on every compact set. The same holds for the sequence fjfn jg1
1
hence the limit
lim .Cn t; t /; t 2 Rd
n!1
exists. By Lemma D.2.17, the sequence fCn g1 1 converges entrywise to some positive
semidefinite matrix C . Considering now the sequence ffn =jfn jg1 1 we see that e
i.mn ;t/
converges to some continuous positive definite function. The modulus of this function
is 1 and therefore it has the form ei.m;t/ with some m (cf. Lemma 1.5.1). Thus, the one
point measures ımn converge weakly to ım from which mn ! m follows.
Y
d
1 2 2
eitj EXj 2 j tj ;
1
fX .t / D ei.EX;t/ 2 .C t;t/ D t 2 Rd
j D1
Remark 1.10.9. There exist uncorrelated Gaussian random variables which are not
independent. To see an example let
p 1 2
h.x/ D 2 e 2 x ex ; x 2 R
2
and
1 h i
h.x/ ey C h.y/ ex ; .x; y/ 2 R2 :
2 2
p.x; y/ D
2
It is easy to check that p is a density and
Z 1 Z 1
1 1 2 1 1 2
p.x; y/ dy D p e 2 x ; p.x; y/ dx D p e 2 y :
1 2 1 2
Since p is an even function in x and in y we have
Z 1Z 1
xyp.x; y/ dx dy D 0:
1 1
Section 1.11 Some inequalities 61
Now let Z D .X; Y / be a two-dimensional random vector with density p. Note that Z
is not Gaussian. By what we have proved, X and Y are standard Gaussian and uncor-
related. Since p is not the product of the marginal densities, they are not independent.
Ya D .a; X / D a1 X1 C C ad Xd
and
Yb D .b; X / D b1 X1 C C bd Xd
are independent if and only if .a; b/ D 0.
(ii) The coordinates of the random vector OX are independent if and only if O is
an orthogonal matrix.
Proof. The second statement follows immediately from the first one. To prove (i)
we consider the 2-dimensional random vector Y D .Ya ; Yb /. It follows from Theo-
rem 1.10.4 that Y is Gaussian. Applying Theorem 1.10.8 we obtain that Ya and Yb are
independent if and only if
X
d
0 D E.Ya Yb / D aj bk E.Xj Xk / D .a; b/:
j;kD1
3
Using (B.1.6.1), (B.1.6.2), and (B.1.6.3) it is easy to check that for jsj < 4
s2 s2
0 cos s 1 and sin s D s C h.s/ where jh.s/j :
4 8
Since E.X / D 0, the inequalities above imply that
1 1 3
0 Re f .t / 1 t 2 E.X 2 / and jImf .t /j t 2 E.X 2 /; jt j <
4 8 4R
and therefore
1
jf .t /j 1 t 2 E.X 2 /: (2)
8
Now let d 2 N and t 2 Rd n f0g be arbitrary. Write t0 D t =kt k and denote by g the
characteristic function of the random variable Y D .t0 ; X /. Then jY j kX k R,
E.Y / D 0,
f .t / D E ei.t;X/ D E eiktkY D g.kt k/;
and E.Y 2 / D .t0 ; cov .X /t0 / D .t; cov.X /t /=kt k2 . Application of equation (2) for Y
completes the proof.
Lemma 1.11.3. Let X be a d-dimensional random vector and define the random vec-
tor XR .R > 0/ by
(
X.!/ if kX.!/k R
XR .!/ D !2 :
0 if kX.!/k > R
jRe f .t /j Re fR .t /
for all t 2 Rd with kt k 2R .
B D fx 2 Rd W kxk Rg:
Note that
R D B C .1 .B// ı0
Section 1.11 Some inequalities 63
where B is given by B .A/ D .B \ A/; A 2 Bd . If kt k 2R and kxk R, then
cos ..t; x// 0. Using this we obtain:
ˇZ ˇ
ˇ ˇ
jRe f .t /j D ˇˇ cos ..t; x// d.x/ˇˇ
ˇZR
d
ˇ ˇZ ˇ
ˇ ˇ ˇ ˇ
ˇ ˇ
ˇ cos ..t; x// d.x/ˇ C ˇ ˇ cos ..t; x// d.x/ˇˇ
B Rd nB
Z ˇZ ˇ
ˇ ˇ
D cos ..t; x// d.x/ C ˇ ˇ cos ..t; x// d.x/ˇˇ
B Rd nB
Z
cos ..t; x// d.x/ C .Rd n B/
ZB
D cos ..t; x// dB .x/ D Re fR .t /:
Rd
Theorem 1.11.4. Let X be a d-dimensional random vector such that the support of
its distribution is not contained in some hyperplane of Rd. Then there exist positive
numbers ı and such that the inequality
jfX .t /j 1 kt k2 (1)
jf .t /j 1 kt k2 ; kt k ı:
2
We choose R > 0 such that the random variable XR from Lemma 1.11.3 is not concen-
trated on a hyperplane.29 Then the covariance matrix of XR is strictly positive definite
and hence
.t; cov.XR /t / kt k2 ; t 2 Rd
where .> 0/ is the smallest eigenvalue of cov.XR /. The theorem now follows from
Corollary 1.11.2 and Lemma 1.11.3, using the fact that f and fR are real valued.
Since f is an even function, all derivatives of odd order are zero at 0. Applying the
n
upper estimate to f .4b 2 c/ . t / in the Taylor expansion
2b n
2 c1
X M2k 2k t 4b 2 c
n
n
f .t / D .1/ k
t C n f .4b 2 c/ . t /; t 2 R; 2 Œ0; 1;
.2k/Š .4b 2 c/Š
kD0
we obtain the upper estimate for f .t /. The lower estimate can be obtained in the same
way.
Inequality (1) is then equivalent to the relation .Re f .1/; Re f .n// 2 Kn . The set Kn
is convex for all n 2 N. Using this and Corollary 1.7.9 we see that it is sufficient to
show (1) for the positive definite functions n 7! z n ; z 2 T. Thus, it remains to prove
that
1 Re .z n / nŒ1 .Re z/n ; z 2 T;
Section 1.11 Some inequalities 65
2
2
4
Figure 1.7. The function hn from the proof of Theorem 1.11.6 with n D 5.
or equivalently,
(see Figure 1.7). The function h is periodic and hence it attains its maximum at some
y 2 R. From
h0 .y/ D n2 cosn1 y sin y C n sin ny D 0
we see that either sin y D 0 or
sin ny
sin y 6D 0 and n cosn1 y D : (4)
sin y
If sin y D 0, then (3) holds. If y satisfies (4), then
cos y sin ny cos y sin ny sin y cos ny
h.y/ D cos ny D
sin y sin y
sin.n 1/y
D :
sin y
Using induction on n we see that sin.n 1/y= sin y n 1; n 2 N, and therefore
h.y/ n 1.
To prove inequality (2), let z 2 T be such that Re .zf .1// D jf .1/j. Since the
function n 7! z n f .n/ is positive definite on Z, inequality (1) shows that
1 r n n.1 r/; 1 r 1;
Correlation functions
In the first section of this chapter we present a few basic results on processes with
independent increments which can be obtained easily by using characteristic functions.
Then we concentrate on those properties of second order fields which are related to
the fact that their correlation functions represent positive semidefinite kernels. In the
second part of the chapter we investigate the special case of stationary fields. At the
end of the chapter we give an introduction to the harmonic analysis of stationary fields
using unitary representations built from the correlation function.
Throughout this chapter the symbol T denotes a nonempty subset of Rd with some
d 2 N and . ; A; P / is a probability space.
the so-called increments of Z, are independent for every n 3 and every t 2 Rn such
that t1 < < tn and tj 2 T .
Next we prove the existence of processes having special properties. The proof uses
Kolmogorov’s existence theorem formulated in terms of characteristic functions (cf.
Remark F.3.4). We need the following simple lemma.
These equations show that (1) holds whenever the random variables j are independ-
ent. Assume now that (1) holds. The equations above show that
is a characteristic function for all t 2 Œ0; 1/d such that t1 td . For arbitrary
t 2 Œ0; 1/d we choose a permutation such that t.1/ t.d / and define
ft by the consistency condition (F.3.4.i) Then condition (F.3.4.ii) is satisfied as well.
Consequently, there exists a process Z on Œ0; 1/ such that the finite-dimensional dis-
tributions of Z have the ft ’s as characteristic functions. Obviously, Z.0/ D 0. The
equation above with d D 2 and x2 D x1 shows that the characteristic function of
Z.t / Z.s/ is e.ts/h if t s (cf. Theorem 1.1.7). By Lemma 2.1.2, the process Z
has independent increments.
Corollary 2.1.4 (Wiener process). There exists a process W on Œ0; 1/ satisfying the
conditions
(i) W .0/ D 0;
(ii) W has independent increments;
(iii) W .t / W .s/ has a Gaussian distribution with mean zero and variance t s
for all t s.
Corollary 2.1.5 (Poisson process). For each 0 there exists a process N on Œ0; 1/
satisfying the conditions
(i) N.0/ D 0;
(ii) N has independent increments;
(iii) N.t / N.s/ has a Poisson distribution with mean .t s/ for all t s.
2 Usually one also imposes conditions on the trajectories t 7! Zt .!/ of these processes but we do not
consider these questions in this book.
70 Chapter 2 Correlation functions
where
M.t / D E.Z.t //
is the mean of the field.
If the mean is zero, the correlation function and the covariance function are equal.
Note that all finite-dimensional distributions of a Gaussian field are completely de-
termined if we know its mean and correlation function.
Z.t / D f .t / X; t 2T
and hence
.s; t / D Var.Zs /; s t:
Section 2.2 Correlation functions of second order random fields 71
In particular, the covariance function of the Wiener process and of the Poisson process
with D 1 is given by3
Proof. Assume that K is the covariance function of a complex field Z. The case where
K is the correlation function can be treated in the same way. We have
X
n
K.ti ; tj /ci cj
i;j D1
X
n h i
D E .Z.ti / M.ti // .Z.tj / M.tj // ci cj
i;j D1
ˇ n ˇ2
ˇ X ˇ
D Eˇˇ cj .Z.tj / M.tj //ˇˇ 0
j D1
If M and K are real-valued and K is symmetric, then there exists a real Gaussian
field Z such that (2) and (3) hold.4
Proof. Assume first that M and K are real-valued and let ft1 ; : : : ; tn g T be arbi-
trary. Since K is positive semidefinite there exists a unique Gaussian distribution on
Rn with mean vector .M.t1 /; : : : ; M.tn // and covariance matrix (1). The collection
of all these distributions satisfies the conditions of Kolmogorov’s consistency Theo-
rem F.3.3. Hence there exists a real Gaussian field satisfying (2) and (3).
Now assume that M and K are complex-valued and let TQ WD T f1; 2g. Further,
let MQ .t; 1/ D Re M.t /; MQ .t; 2/ D Im M.t / and define the real kernel KQ on TQ TQ
by setting
Q Q 1
K..s; 1/; .t; 1// D K..s; 2/; .t; 2// D Re K.s; t /
2
and
Q Q 1
K..s; 1/; .t; 2// D K..s; 2/; .t; 1// D Im K.s; t /; s; t 2 T:
2
It follows from Lemma D.2.14 that KQ is positive semidefinite. By the first part of the
proof there exists a real field ZQ on TQ such that the analogues of (2) and (3) are satisfied
for MQ ; KQ and Z.Q Setting now
Q 1/ C iZ.t;
Z.t / WD Z.t; Q 2/; t 2T
Example 2.2.5. By Lemma D.2.10 and Theorem 2.2.4, there exists a real Gaussian
process X on T D Œ0; 1/ such that E.X.t // D 0 and
i.e., the increments X.t2 / X.t1 / and X.s2 / X.s1 / are uncorrelated. Since the in-
crements of a Gaussian process are Gaussian, we infer that they are independent (cf.
Theorem 1.10.8). Thus, we obtain the Wiener process constructed in Corollary 2.1.4.
Now let h 2 Œ0; 1/ be arbitrary and define the process X h by
4 If a real field satisfies (2) and (4), then E.ŒZ.t / M.t /2 / D 0 and hence Z.t / D M.t /.
Section 2.2 Correlation functions of second order random fields 73
Denote by C h the correlation function of this process. The mean of X h is zero and for
s t we have
In the same way we see that C h .s; t / D max.0; h C t s/; s t , and hence
Therefore C h .s; t / depends only on s t , a property that we will call stationarity (cf.
Definition 2.8.1).
is a positive semidefinite inner product space. The inner product space L2r .P / of all
square integrable real-valued random variables on . ; A; P / is introduced in the same
way. We denote the corresponding Hilbert spaces by L2 .P / and L2r .P /, the elements
of which are equivalence classes of random variables. In all these cases we will use
the notation k k for the norm:
p
kX k D .X; X / :
jE.X Y /j kX k kY k:
Note that the expectation can be expressed using the inner product:
E .X / D .X; 1 /:
Two square integrable random variables with zero mean are orthogonal if and only if
they are uncorrelated.
For a field Z of second order on T we define the mapping Z W T ! L2 .P / by
Zt D Zt C N where N is the class of random variables which are zero P -almost
everywhere. We call Z the L2 -valued random field corresponding to Z. The notions
mean, covariance function, correlation function and Gaussian field are defined for Z
74 Chapter 2 Correlation functions
in the same way as for Z. By H.Z/ and H.Z/ we denote the closed linear subspaces
generated by all Zt and Zt ; t 2 T , respectively.
Let fe˛ g˛2I be an orthonormal basis of H.Z/. The index set I may be finite, count-
able infinite or uncountable.5 In any case
X
Zt D f˛ .t / e˛ < 1; t 2 T (1)
˛2I
where f˛ .t / D .Zt ; e˛ / (for each t only countable many summands are different from
zero). Moreover, X
kZt k2 D kZt k2 D jf˛ .t /j2 : (2)
˛2I
Proof. We consider only correlation functions and the complex case, the rest is similar.
Assume that (1) and (2) hold. Then K is positive semidefinite:
ˇ n ˇ2
Xn X ˇˇ X ˇ
ˇ
K.tj ; tk /cj ck D ˇ f˛ .tj /cj ˇ 0:
ˇ ˇ
j;kD1 ˛2I j D1
In the same way one can define partial derivatives of a field defined on an open set
T Rd .
Next we show that strong differentiability of a process is closely related to the dif-
ferentiability of its correlation function. For a function f of two variables we write
Theorem 2.3.4. Let Z be a process of second order on .a; b/ with correlation func-
tion C . The process Z is strongly differentiable if and only if the limit
2 C.t; t; ; h/
lim
;h!0 h
exists for all t 2 .a; b/.
If Z is strongly differentiable, then the limit
2 C.t; s; ; h/
D.t; s/ WD lim
;h!0 h
.Z 0 .t1 /; : : : ; Z 0 .tk //
as n ! 1 (see Section F.2). Thus, the statement of the Lemma follows from Theo-
rem 1.10.7.
The next lemma follows immediately from the definition of the integral. We omit
the proof.
Lemma 2.4.2. Assuming that all the integrals below exist, we have:
Z Z
(i) .Z.t /; h/ d.t / D Z.t / d.t /; h ; h 2 L2 .P /
T T
Z
(ii) a1 Z1 .t / C a2 Z2 .t / d.t / D
T Z Z
a1 Z1 .t / d.t / C a2 Z2 .t / d.t /; a1 ; a 2 2 C
T T
Z Z Z
(iii) Z.t / d.1 C 2 /.t / D Z.t / d1 .t / C Z.t / d2 .t /
T T T
Section 2.4 Integration with respect to complex measures 79
Z Z Z
(iv) Z.t / d.t / D Z.t / d.t / C Z.t / d.t /
S1 [S2 S1 S2
Moreover, we have
Z 2
X
n X
n X
n
Z d D cj Z.tj /; ck Z.tk / D cj ck C.tj ; tk /: (1)
j D1 kD1 j;kD1
(b) Assume that Z.t / D f .t /X; t 2 T D Œa; b, where f is a continuous function
on T and X 2 L2 .P /. For all h 2 L2 .P / we have
Z b Z b
.Z.t /; h/ d.t / D .f .t /X; h/ d.t /
a a
Z b Z b
D .X; h/ f .t / d.t / D f .t / d.t / X; h
a a
and hence Z Z
b b
Z.t / d.t / D f .t / d.t / X:
a a
Using this and (2.4.2.ii) we obtain the more general relation
Z bX n Xn Z b
fj .t /Xj d.t / D fj .t / d.t / Xj
a j D1 j D1 a
i.e., Z
.Z.t /; h/ d.t / D .I; h/ ; h 2 L2 .P /:
T
Thus, the field Z is -integrable.
Corollary 2.4.5. If Z
kZ.t /k djj.t / < 1 (1)
T
then Z is -integrable and
Z Z
Z.t / d.t / kZ.t /k djj.t /: (2)
T T
Proof. The Cauchy–Schwarz inequality j.Z.t /; h/j kZ.t /k khk and (1) imply that
t 7! .Z.t /; h/ is jj-integrable and hence -integrable as well. Moreover,
ˇZ ˇ Z Z
ˇ ˇ
ˇ .Z.t /; h/ d.t /ˇ j.Z.t /; h/j djj.t / kZ.t /k djj.t / khk :
ˇ ˇ
T T T
Theorem 2.4.6. If Z is integrable with respect to and , then the correlation func-
tion C is -integrable and
Z Z Z
Z d; Z d D C.t; s/ d. /.t; s/: (1)
T T T T
In particular,
Z 2 Z
Z d D C.t; s/ d. /.t; s/: (2)
T T T
R
Proof. Equation (2.4.1.1) with h D Z d yields
Z Z Z Z
Z.t / d.t /; Z.s/ d.s/ D Z.t /; Z.s/ d.s/ d.t /: (3)
Theorem 2.4.7. The field Z is -integrable if and only if the correlation function C
is -integrable.
Zh to S are integrable with respect to any complex measure (see Corollary 2.4.5).
Applying Theorem 2.4.6 we obtain
Z Z Z Z
C.t; s/ d.t / d.s/ D .Zh .t /; Zh .s// d.t / d.s/
S S S S
Z Z
1
C fh .t /fh .s/ d.t / d.s/
khk2 S S
ˇZ ˇ2
1 ˇˇ ˇ
ˇ fh .t / d.t /ˇˇ
khk 2
S
(note that the first integral after the equation sign is nonnegative by (2.4.6.2)). Conse-
quently, ˇZ ˇ Z
ˇ ˇ 1=2
ˇ fh .t / d.t /ˇ jC j d j j khk:
ˇ ˇ
S T T
Using Lemma E.1.18 we conclude8 that fh is -integrable and the inequality above
remains valid if we replace S by T . By Theorem 2.4.4 the field Z is -integrable.
Proof. By Theorems 2.2.3 and 2.3.2 the kernel K is the correlation function of a
strongly continuous random field. This field is -integrable in view of Theorem 2.4.7.
The corollary follows now from (2.4.6.2).
By the first part of the proof, the statement of the theorem is true for the compact set
Tb . The inequality above shows that it is true for T as well.
Using Theorem E.1.8 we obtain the following corollary:
Corollary 2.4.10. For every complex measure there exists a sequence fn g of com-
plex measures with finite support converging weakly to and satisfying the rela-
tion (2.4.9.1).
(ii) Corollary 2.3.5 shows that Z is strongly differentiable and the correlation func-
tion of Z 0 is continuous. Consequently, Z 0 is strongly continuous and hence inte-
grable with respect to the Lebesgue measure on Œt0 ; t . By Theorem 2.3.4, the function
t 7! .Z.t /; h/ is continuously differentiable for all h 2 L2 .P /. Applying the classical
Newton–Leibniz formula to this function we obtain
Z t
d
.Z.t /; h/ D .Z.t0 /; h/ C .Z.s/; h/ ds:
t0 ds
Equation (2.3.4.1) and the definition of the integral show that
Z t
.Z.t /; h/ D .Z.t0 /; h/ C Z 0 .s/ ds; h ; h 2 L2 .P /:
t0
Assume further that there exists a jj-integrable function g on T such that kZn .t /k
g.t / for all n and t . Then Z and Zn are -integrable and
Z Z
lim Zn .t / d.t / D Z.t / d.t /:
n!1 T T
Proof. Relation (1) implies that limn!1 kZn .t /k D kZ.t /k and hence kZ.t /k
g.t /. By Corollary 2.4.5, the fields Z and Zn are -integrable. Using the inequality
kZn .t / Z.t /k 2g.t /, the lemma follows from
Z Z Z
Zn .t / d Z.t / d.t / D ŒZn .t / Z.t / d.t /
T T
Z T
kZn .t / Z.t /k djj.t /
T
and from Lebesgue’s theorem on dominated convergence.
and X
f Z.t / WD Z.t n/f .n/; t 2T
n2Zd
respectively.
Proof. The first three equations follow immediately from the definition of the convo-
lution, while (5) is a consequence of Theorem 2.4.9.
It is easy to check that (4) holds for measures with finite support. In the general case
we approximate and by measures with finite support (see Corollary 2.4.10) and
apply Theorem E.2.5.
We denote by L2 .a; b/ the positive semidefinite inner product space of all complex-
valued functions on Œa; b which are square integrable with respect to the Lebesgue
R
2 .a; b/. We write b instead
measure.
R The corresponding Hilbert space is denoted by L a
of I . Finally, C W I I ! C will be a continuous kernel.
Proof. Using the fact that C is bounded on I I it is easy to see that K is a bounded
linear operator in L2 .a; b/. We have to show that for every bounded sequence fhn g,
the sequence fKhn g has a convergent subsequence. Write gn D Khn . To simplify the
notation, we also consider gn as a function defined pointwise by the integral above.
Applying the Cauchy–Schwarz inequality as in Lemma 2.5.2, we see that the sequence
fgn g is equicontinuous and uniformly bounded. In view of the theorem of Arzelà and
Ascoli (cf. Theorem B.2.9), the sequence fgn g has a subsequence converging uni-
formly on Œa; b to a continuous function. It follows that the operator K is indeed
compact.
where
(i) each 'n is an eigenfunction of C with eigenvalue n 0;
(ii) the series above is absolutely and uniformly convergent;
(iii) f'n C N g1 2
nD1 is an orthonormal basis of L .a; b/, where
N D f' 2 L .a; b/ W .'; '/ D 0g.
2
Proof. To simplify the notation we identify continuous functions on Œa; b with the
corresponding elements of L2 .a; b/. We consider the compact linear operator K
9 To be precise, Kh is the equivalence class represented by the integral on the right-hand side.
88 Chapter 2 Correlation functions
defined in Lemma 2.5.3. Using that C.t; s/ D C.s; t / it is easy to check that
.Kh; g/ D .h; Kg/ , i.e., K is self-adjoint. It follows from Corollary 2.4.8 that K is
nonnegative and hence all eigenvalues of K are nonnegative. By Remark D.3.13, there
exists an orthonormal basis f'n g1 2
1 of L .a; b/ consisting of eigenvectors of K with
1
eigenvalues fn g1 . Every h 2 L .a; b/ admits the decomposition
2
1
X
hD .h; 'n / 'n :
nD1
Thus,
1
X
C.t; / D n 'n .t /'n :
nD1
From here, again using (2) and the orthonormality of the vectors 'n , we obtain
X
n 2
0 D lim
C.t; / '
k k .t /'
k
n!1
kD1
Z b X
n
2
D lim jC.t; s/j ds 2k j'k .t /j2 ; t 2 Œa; b:
n!1 a
kD1
If k ¤ 0, then the function 'k is continuous (cf. Lemma 2.5.2). It is easy to see that
the function
Z b
t 7! jC.t; s/j2 ds
a
P
is continuous as well. By Dini’s Theorem B.2.5, the series 1 kD1 k j'k .t /j con-
2 2
verges uniformly on Œa; b. Using this and the Cauchy–Schwarz inequality
q ˇ
X ˇ X
q 1
X
q 1
ˇ ˇ 2
2
2
2
ˇk 'k .t /'k .s/ˇ k j'k .t /j k j'k .s/j
kDp kDp kDp
Section 2.5 The Karhunen–Loève decomposition 89
we conclude that (ii) holds. To show equation (1) we consider for fixed s the continuous
nonnegative function
ˇ 1
X ˇ2
ˇ ˇ
ˇ
t 7! ˇC.t; s/ n 'n .t /'n .s/ˇˇ :
nD1
Using (2) we see that its integral over I is equal to zero. Thus, the function above is
equal to zero everywhere.
where
(i) 'n and n are as in Mercer’s theorem and S D fn 2 N W n > 0g; 10
Z b
1
(ii) Xn D p Z.s/ 'n .s/ ds;
n a
(iii) the Xn ’s form an orthonormal system in L2 . ; A; P /;
(iv) the series (1) converges uniformly on I .
D ın;m ; n; m 2 S
10 The correlation function C is positive semidefinite, hence we may apply Mercer’s Theorem 2.5.4 to
it. If S D ;, then the sum (1) is zero by definition.
90 Chapter 2 Correlation functions
i.e., (iii) holds. Using property (2.4.2.i) of the integral and the definition of Xn , we
obtain
Z b
1
.Z.t /; Xn / D p Z.t /; Z.s/'n .s/ ds
n a
Z b
1
Dp C.t; s/'n .s/ ds
n a
p
D n 'n .t /; n 2 N:
Remark 2.5.6. If Z is Gaussian, then in view of Corollary 2.4.11 the random vari-
ables Xn in Theorem 2.5.5 are Gaussian. If the mean of Z is zero, then E.Xn / D 0.
This follows from (2.4.2.i) and from E.Xn / D .Xn ; 1/. By orthogonality, the random
variables Xn are then uncorrelated and hence independent (cf. Theorem 1.10.8).
The next theorem is the converse of the previous one and it is easier to prove.
Proof. To simplify the notation we assume that S D N, the general case can be treated
in the same way. It is easy to check that Z is a field of second order with correlation
Section 2.5 The Karhunen–Loève decomposition 91
function
1
X
C.t; s/ D .Z.t /; Z.s// D n 'n .t /'n .s/; t; s 2 I:
nD1
where the series is uniformly convergent on I . From this we conclude that C and
hence Z is continuous. Since f'n gn2N is an orthonormal system, we have
Z b Z 1
b X
C.t; s/'k .s/ ds D n 'n .t /'n .s/'k .s/ ds
a a nD1
1
X Z b
D n 'n .t / 'n .s/'k .s/ ds
nD1 a
D k 'k .t /; t 2 I; k 2 N:
Example 2.5.8. We consider the Wiener process W from Corollary 2.1.4 on a finite
interval I D Œ0; T ; T > 0. The correlation function of W is given by C.t; s/ D
min.t; s/. First we compute the eigenfunctions of C with positive eigenvalues. The
equation Z T
C.t; s/'.s/ ds D '.t /; 0 t T; > 0
0
can be rewritten as
Z t Z T
s'.s/ ds C t '.s/ ds D '.t /:
0 t
From here we see that '.0/ D 0. The left-hand side, hence also the right-hand side, is
differentiable with respect to t .11 We have,
Z T
'.s/ ds D ' 0 .t /; t 2 Œ0; T :
t
'.t / D ' 00 .t /:
The solutions of this equation satisfying the condition '.0/ D 0 are given by
t
'.t / D A sin p ; t 2 Œ0; T :
Taking into account the condition ' 0 .T / D 0 it is easy to check that
T2
n D ; n 2 N0
.n C 12 /2 2
are theqonly possible values of . We denote by 'n the corresponding function and set
2
A WD T, i.e.,
r
2 1 t
'n .t / D sin nC :
T 2 T
The L2 -norm of 'n is equal to 1. From Theorem 2.5.5 we obtain the representation
1
p X sin n C 12 Tt
W .t / D 2T 1
XnT ; 0 t T:
nD1
nC 2
Remark 2.5.6 shows that the random variables XnT are independent and standard
Gaussian.
Proof. Let L denote the linear manifold of all functions f of the form f D
P n
j D1 cj 1Bj . This manifold is dense in L . /. For f 2 L we define I .f / by the
2
equation above. It is easy to check that this definition is correct, i.e., the right-hand
side does not depend on the special representation of f . It is clear that I is a linear
operator from L into H./. By the definition of H./, the range of I is dense in H./.
12 It is possible to define random orthogonal measures where the structure measure is not finite (see for
example [19]). However, we do not need this more general notion in the present book.
94 Chapter 2 Correlation functions
An arbitrary step function f can be written in the form above with mutually disjoint
sets Bj . By orthogonality, we obtain
X
n
.I .f /; I .f // D jcj j2 .Bj / D .f; f /
j D1
(we denote the inner product in both spaces by .; /). Thus, the mapping I is isomet-
ric. By Theorem D.3.6, this mapping can be uniquely extended to an isometric linear
operator from L2 . / onto H./.
In the next theorem we list the basic properties of this integral which follow imme-
diately from Theorem 2.6.3.
Thus,
1
X
D Xj ıtj : (1)
j D1
The Wiener process and the Poisson process on Œ0; 1/ have orthogonal increments
(cf. Theorem 2.1.4 and Theorem 2.1.5). The next theorem shows that for each process
with orthogonal increments there is an associated random orthogonal measure.
This function is continuous and bounded. Since the increments are orthogonal we have
Example 2.6.8. Let Z be the Wiener process or the Poisson process on Œ0; 1/. Denote
by and the corresponding random orthogonal measure and the structure measure,
in the sense of the previous theorem, respectively. For B D Œ0; t / Œ0; 1/ we have
Proof. Denote by the structure measure of the random orthogonal measure corre-
sponding to Z (cf. Theorem 2.6.7). We write
t i
ti;n WD ; 0i n
n
Section 2.6 Integration with respect to orthogonal random measures 97
and define the function fn on Œ0; t by fn .s/ D f .ti;n / if s 2 Œti;n ; tiC1;n / and
fn .t / D f .t /. The sequence ffn g converges uniformly, hence also in L2 . /, to f .
Using Theorem 2.6.4 we obtain
Z t Z t X
n1
f .s/ dZ.s/ D lim fn .s/ dZ.s/ D lim fn .ti;n /ŒZ.tiC1;n / Z.ti;n /:
0 0 n!1 n!1
iD0
kZ.s/ Z.gn .s//k2 D C.s; s/ C.s; gn .s// C.gn .s/; s/ C C.gn .s/; gn .s//
and the uniform continuity of C on Œ0; t Œ0; t we see that for any > 0 the inequality
holds if n is sufficiently large. Applying this and inequality (2.4.5.2) we conclude that
the integral (1) converges to Z t
Z.s/f 0 .s/ ds
0
as n ! 1. The proof is complete.
We end this section with a lemma which will be useful in Section 5.4.
X
n X
n
15 ak .bkC1 bk / D ŒanC1 bnC1 am bm bk .akC1 ak /
kDm kDm
98 Chapter 2 Correlation functions
(ii) the g -measure of the set of zeros of g is equal to 0, the function 1=g is in
L2 . g / and
Z
1
.A/ D dg ; A 2 B:
A g
Proof. By equation (2.6.4.ii),
Z Z
2
.g .A/; g .B// D 1A 1B jgj d D jgj2 d
W A\B
the convergence being in L2 . /. From (2.6.4.iii) we see that (2.6.1.ii) holds, i.e., g
is a random orthogonal measure.
The relation (i) is true for step functions by the definition of g . Let ffn g be a se-
quence of step functions converging in L2 . g / to f . Since d g D jgj2 d , the se-
quence ffn gg converges in L2 . / to fg, from which (i) follows.
The first two statements in (ii) follow from d g D jgj2 d , while the third one
follows from (i) replacing f by 1A g1 .
In the next theorem we consider the converse direction. We assume that the correlation
function admits an integral representation of this kind and deduce from it an integral
representation for the field.
Section 2.7 The theorem of Karhunen 99
where
(i) is a finite nonnegative measure on a measurable space .W; B/;
(ii) g is a complex-valued function on T W with g.t; / 2 L2 . / for all t 2 T ;
(iii) the linear manifold Lg WD span fg.t; /; t 2 T g is dense in L2 . /.
Then there exists a uniquely determined random orthogonal measure on .W; B/ with
values in L2 . ; A; P / and with structure measure such that
Z
Z.t / D g.t; x/ d.x/; t 2 T (2)
W
We have to show that this definition is correct, i.e., the sum above depends only on f
but not on the special choice of the ck ’s and tk ’s. Using equation (1) we obtain
X
n X
n
ck Z.tk /; Z.s/ D ck C.tk ; s/
kD1 kD1
Xn Z
D ck g.tk ; x/g.s; x/ d .x/
W
kD1
Z
D f .x/g.s; x/ d .x/; s 2 T:
W
Using this and theP fact that the linear span of the set fZ.s/ W s 2 T g is dense in H.Z/
we conclude that nkD1 ck Z.tk / depends only on f .
It is clear that I is linear. A similar computation as above shows that
Xn Z
.I.f /; I.f // D cj ck C.tj ; tk / D f .x/f .x/ d .x/ D .f; f /
W
j;kD1
100 Chapter 2 Correlation functions
and hence
.Z.t /; X.s// D .Z.t /; Z.s//; t; s 2 T:
Since span fZ.t / W t 2 T g is dense in H.Z/ we must have X.s/ D Z.s/.
The representation (2) shows that H.Z/ H./. By the definition of we also
have H./ H.Z/, i.e., H./ D H.Z/.
Finally, let be a random orthogonal measure on .W; B/ with structure measure
such that Z Z
g.t; x/ d.x/ D g.t; x/ d.x/; t 2 T:
W W
The assumption (iii) implies that
Z Z
h d D h d; h 2 L2 . /:
W W
Setting h D 1B ; B 2 B, we obtain D .
It is possible to prove a similar result (see [34]) without using the density condition
(2.7.1.iii). In this case, besides losing the equality of the spaces H.Z/ and H./, the
random orthogonal measure has values in a possibly larger space. This is the motiva-
tion for the next theorem where we use the same notation as in Theorem 2.7.1.
Section 2.7 The theorem of Karhunen 101
Theorem 2.7.2. Assume that C admits the integral representation (2.7.1.1) satisfying
the conditions (2.7.1.i) and (2.7.1.ii). Then the integral representation (2.7.1.2) with
a random orthogonal measure W B ! L2 . ; A; P / having structure measure is
possible if and only if dim L?
g dim H.Z/ .
?
We then have
Z Z
.Z.s/; Y / D g.s; x/ d.x/; e .x/ d.x/
W W
Z
D g.s; x/e .x/ d .x/ D 0; s 2 T; 2 I
W
and
Z Z
.Y ; Y| / D e .x/ d.x/; e| .x/ d.x/
W W
Z
D e .x/e| .x/ d .x/ D ı ;| ; ; | 2 I:
W
Remark 2.7.3. For fixed functions C and g the representing measure in (2.7.1.1)
is in general not uniquely determined. To see classical examples consider the case of
W D R equipped with the usual Borel -algebra B.
Let first
g.t; x/ D x t ; n 2 T WD N0 ; x 2 R:
A bounded positive measure on the real line is in general not uniquely determined
by the values of the integrals
Z 1 Z 1
g.t; x/g.s; x/ d .x/ D x tCs d .x/; t; s 2 N0
1 1
(see also Remark 3.3.7). The description of all measures with given moments is the
power moment problem. For more information on the power moment problem we refer
the reader to the monographs [1] and [55].
Another example is given by the trigonometric moment problem. In this case T D
Œa; a; a > 0, and
We will see in Chapter 4 that is in general not uniquely determined by the values of
the integrals
Z 1 Z 1
g.t; x/g.s; x/ d .x/ D ei.ts/x d .x/; t; s 2 Œa; a:
1 1
The paper [5] contains more details on the connection between Karhunen’s theorem,
moment problems and quadrature formulae.
Section 2.8 Stationary fields 103
With a little abuse of terminology we call the function C , like in Definition 2.1.1,
the correlation function of Z. If Z is stationary, then the covariance
E .Z.t / M / .Z.s/ M /
K.h/ D C.h/ jM j2 ; h 2 T T:
The notions above are defined for Z (cf. Notation 2.2.6) in the same way as for Z.
Definition 2.8.2. A field Z on T is said to be strongly stationary or stationary in
the strong sense when all its finite-dimensional distributions remain the same when
shifted. That is, for all n 2 N and all t1 ; : : : tn ; h 2 Rd the random vectors
Proof. It is easy to check that the lemma holds for measures with finite support. In
the general case we approximate and by measures with finite support (see Corol-
lary 2.4.10) and apply Theorem E.2.5.
Definition 2.8.5. A second order field Z is called white noise with expectation
and variance 2 ; 0, if the random variables X.t /; t 2 T , are uncorrelated and
E.Z.t // D and Var.Z.t // D 2 for all t 2 T .
We will use the notation Z W N.; 2 / to indicate that Z is white noise with
expectation and variance 2 . White noise is stationary with covariance function
2; hD0
K.h/ D
0; h ¤ 0:
If the random variables Z.t /; t 2 T , are not identically distributed, then Z is not
strongly stationary.
Example 2.8.6.
(1) Let X 2 L2 . ; A; P /; T D Rd and x 2 Rd . The field
is stationary since
assume that .X; X / D 1 and f .0/ D 1. The field Z is stationary if and only if
the relation
(2) The example given by equation (1) can be generalized as follows. Let
X1 ; : : : ; Xn 2 L2 . ; A; P / be an orthonormal system such that E.Xj / D m
and let x1 ; : : : ; xn 2 Rd . Then the field Z defined by
X
n
Z.t / D ei.t;xj / Xj ; t 2 Rd
j D1
where we write t instead of ..; t //, for brevity. Then E.Z.t // D 0. Using
trigonometric identities we obtain
.Z.t C h/; Z.t // D .A cos .t C h/ C B sin .t C h/; A cos t C B sin t /
D cos .t C h/ cos t C sin .t C h/ sin t
D cos h:
Example 2.8.7. (1) In this example we generalize the field defined in equation
(2.8.6.1) by making the exponent random. Let X be a real random variable and Y be a
d-dimensional real random vector. Assume that X and Y are independent, E.X / D 0
and Var.X / D 2 < 1. We define the field Z by
Z.t / D ei.t;Y / X; t 2 Rd :
106 Chapter 2 Correlation functions
and
.Z.t C h/; Z.t // D ei.tCh;Y / X; ei.t;Y / X
D E ei.h;Y / X 2
2
D fY .h/; h 2 Rd
By Theorem F.2.3 this series converges with probability one. Now we show that the
time series X D fXn gn2N is stationary. Indeed,
1
X
.Xn ; 1/ D aj .Znj ; 1/ D 0
j D0
Example 2.8.9. Let Z be the Wiener process or the Poisson process constructed in
Corollary 2.1.4 and Corollary 2.1.5, respectively. Further, let h 2 Œ0; 1/ be arbitrary
and define the process X h by
The computations in Example 2.2.5 show that X h is stationary and its correlation
function C h is given by
holds for every positive integer n, for all t1 ; : : : ; tn 2 T and for all c1 ; : : : ; cn 2 C.
Now we reformulate some of our previous results in terms of stationary fields. The
next theorem follows from Theorem 2.2.3.
Example 2.9.7. Let Z W N.0; 1/ be a white noise on Rd . Then C D 1f0g and the
measure in the previous theorem is equal to the normalized Haar measure on d
(see Corollary 1.7.11 and its proof).
We are now able to prove integral representations for stationary fields using
Karhunen’s Theorem 2.7.1.
Proof. We have
Z Z
C.t s/ D i.ts;x/
e d .x/ D ei.t;x/ ei.s;x/ d .x/; t; s 2 Rd :
Rd Rd
The linear span L of the functions ei.t;/ ; t 2 Rd , is dense in L2 . /. To see this assume
that g 2 L? . Then
Z
ei.t;x/ g.x/ d .x/ D 0; t 2 Rd :
Rd
then
1
X
C.t / D pj ei.t;xj / ; t 2 Rd
j D1
and
1
X
Z.t / D ei.t;xj / Xj ; t 2 Rd
j D1
Conversely, let be a random orthogonal measure on .Œ0; 2/d ; B.Œ0; 2/d // with
structure measure . Then the equation above defines a stationary field Z with spec-
tral measure .
and H.Z/ D H./. Conversely, if is a random orthogonal measure on .d ; B.d //,
then the equation above defines a stationary field Z.
We will call the random orthogonal measure occurring in Theorems 2.9.8, 2.9.10
and 2.9.11, the representing measure of the field Z.
This equation, for example, is important for the prediction of stationary processes.22
In this case the problem is to find those cj ’s which minimize the left-hand side.
Another problem is to determine .Œa; b//; a < b, where is the random orthogonal
measure in the spectral representation of X . For this we choose a sequence
X
pn .x/ D cj;n eikj;n x ; n 2 N; kj;n 2 Z; x 2 Œ0; 2/
j
We will use this method in Theorems 2.9.16 and 2.9.17 to prove inversion formulae.
In Section 2.10 we will investigate the interconnection between stationary fields and
unitary representations which will provide another point of view for the operator IZ .
D C d. / Q
Rd
and Z
Z d D IZ ./
L (1)
Rd
follow from Theorem 2.4.6 and from the definition of the convolution (cf. Section E.2).
To prove the remaining two equations it suffices to consider nonnegative (finite)
measures. Using the definition of IZ and the fact that it is isometric we see that the
equations hold for measures with finite support. By Corollary 2.4.10 there exist se-
quences fn g and fn g of nonnegative measures with finite support converging weakly
to and , respectively, such that
Z Z Z Z
Z d D lim Z dn and Z d D lim Z dn :
Rd n!1 Rd Rd n!1 Rd
Theorem 1.6.1 shows that the sequences fL n g and fL n g converge uniformly on com-
pact sets (and hence also in L2 . /) to L and ,
L respectively. Now the equations in
question follow by taking the limit n ! 1.
The next theorem can be proved in the same way as the previous one. We omit the
proof.
114 Chapter 2 Correlation functions
and Z
Z d D IZ ./:
L (1)
Zd
Let t 2 R be fixed and consider the function ft W x 7! eitx on the interval Œa; a. Its
Fourier series is given by
X1
sin.at n/ ix n
e a ; x 2 Œa; a:
nD1
at n
sin 0
23 We use the convention 0 WD 1.
Section 2.9 Spectral representation of stationary fields 115
is equal to 0, then24
Z Z Y
d
1 T T
eiaj tj eibj tj
.J / D lim ::: X.t / dt:
T !1 .2/d T T j D1 itj
Proof. We have
Z Z Y
d
1 T T
eiaj tj eibj tj i.x;t/
KT .x/ WD ::: e dt
.2/d T T j D1 itj
Z Z Y
d
1 T T
ei.xj aj /tj ei.xj bj /tj
D ::: dt
.2/d T T j D1 itj
Y
d Z
1 T
sin.xj aj /tj sin.xj bj /tj
D dtj
2 T tj tj
j D1
Y
d
DW Kj .T; x/:
j D1
By Corollary B.1.8, 8
< 1; xj 2 .aj ; bj /
lim Kj .T; x/ D 1
; xj D aj or xj D bj :
T !1 : 2
0; xj … Œaj ; bj
and hence
lim KT D 1J ; -almost everywhere:
T !1
Theorem B.1.9 shows that jKj .T; x/j < 2 and therefore we also have conver-
gence in L2 . /. The isometric operator IZ being continuous we conclude that
limT !1 IZ .KT / D .J /. Now the theorem follows immediately from equa-
tion (2.9.13.1).
The next theorem can be proved in the same way by using Corollary C.3.2.
Proof. Noting that L2 . / is finite dimensional if and only if supp is finite, the equiv-
2
Pn from the fact that H.Z/ and L . / are isomorphic.
alence of (i) and (ii) follows
Assume that D j D1 pj ıxj , where the xj ’s are mutually distinct. Then the
functions 1fxj g build an orthogonal basis of L2 . /. For all functions f we have
X
n
f D f .xj /1fxj g
j D1
Proof. The statement on stationarity and on the correlation function follows from
Lemma 2.8.4 with D while equation (1) follows from Theorem 1.8.15 and Theo-
rem 1.9.5, respectively.
Section 2.10 Unitary representations 117
The statement on the representation of Z can be easily checked when the support
of is finite. In the general case we approximate by measures with finite support
using (2.4.15.5)25 and (2.6.4.iii).
To see examples, let H D L2 .d /. Since the Lebesgue measure is invariant under
translations, the equation26
Ut h.x/ WD h.x t /; h 2 H; x 2 Rd
U h.x/ WD g.x/h.x/; h 2 H; x 2 Rd
t 7! .Ut v; w/
If .Ut / is continuous, then the functions t 7! .Ut v; w/ are obviously continuous for
all v; w 2 H . Assume that these functions are continuous for all v; w 2 L. Let h 2 H
and w 2 L be arbitrary and choose a sequence fvn g in L converging to h. Using the
Cauchy–Schwarz inequality we get
Thus, the function t 7! .Ut h; w/ is continuous since it is the uniform limit of the
sequence t 7! .Ut vn ; w/ of continuous functions. Applying the same argument once
more, we see that t 7! .Ut h; g/ is continuous for all h; g 2 H . The continuity of .Ut /
follows from the identity
.t C s/ D .t / .s/; t; s 2 T:
spanfUt W t 2 T g
Example 2.10.7. The constant function 1 2 H D L2 .Œ0; 2// is a cyclic vector for
the unitary representation
of Z since the trigonometric polynomials build a dense linear space in H . This follows
immediately from the Stone–Weierstraß Theorem B.2.4.
It is easy to check that H.Z/ is .Ut /-invariant. Moreover, the restriction of .Ut / to
H.Z/ is a unitary representation with cyclic vector X .
120 Chapter 2 Correlation functions
f .t / D .h; Ut h/; t 2T
X
n X
n
f .tj ti /ci cj D .ci h; cj Utj ti h/
i;j D1 i;j D1
X n
D .ci Uti h; cj Utj h/
i;j D1
X 2
n
D ci Uti h
0;
iD1
Lemma 2.10.12. Let .Ut / and .Vt / be cyclic unitary representations of T in H1 and
H2 with cyclic vectors h1 and h2 , respectively. If
.h1 ; Ut h1 / D .h2 ; Vt h2 /; t 2 T;
X
n X
n
M ci Uti h1 WD ci Vti h2 :
iD1 iD1
That this definition is correct, i.e., the left-hand side does not depend on the special
choice of ci and ti , follows from the relation
X
n X
m X
n X
m
ci Uti h1 ; dj Usj h1 D ci dj h1 ; Usj ti h1
iD1 j D1 iD1 j D1
Xn X
m
D ci Vti h2 ; dj Vsj h2
iD1 j D1
and from the fact that h1 and h2 are cyclic vectors. This mapping is linear and
M h1 D h2 ;
.g; h/ D .M g; M h/;
M Ut g D Vt M g; t 2 T; g 2 L:
Lemma 2.10.13. Let and be complex measures with finite support on T . If the
equation Z.s/ D Z.s/ holds for some s 2 T , then
Z.t / D Z.t /
for all t 2 T .
The next theorem establishes a close relation between unitary representations and
stationary fields.
Theorem 2.10.14. For every (continuous) stationary field Z on T there exists a (con-
tinuous) cyclic unitary representation .Ut / of T in the Hilbert space H.Z/ such that
Proof. Denote by L.Z/ the linear space of all vectors v 2 H.Z/ of the form
X
v D Z.0/ D cj Z.xj /
j
P
where D j cj ıxj is a complex measure with finite support. For v as above we
define Ut v by
Ut v WD ıt Z.0/ D Z.t /; t 2 T:
It follows from Lemma 2.10.13 that this definition is correct, i.e., Ut v does not depend
on the special representation of v. It is clear that Ut is a linear operator from T .Z/ onto
T .Z/. Moreover,
Ut Us v D UtCs v; t; s 2 T;
in particular, Ut Ut v D v. It follows from Lemma 2.8.4 that Ut is isometric:
f .t / D .v0 ; Vt v0 /; t 2T
is positive definite in view of Lemma 2.10.9. By Theorem 1.7.8 there exists a unique
measure 2 MC ./ such that
Z
f .t / D .t / d. /; t 2 T;
gt . / D .t /; 2
this field is stationary. Denoting by .Ut / the canonical unitary representation in H.Z/
we have
Ut Z.s/ D Z.s t / D gst D gt gs D gt Z.s/:
From this we see that the operator Ut acts as multiplication with the function gt . By
the uniqueness of , the constant function 1 is a cyclic vector for .Ut /. Moreover,
Z
.1; Ut 1/ D .t / d. / D f .t / D .v0 ; Vt v0 /:
Lemma 2.10.12 shows that the representations .Ut / and .Vt / are equivalent. If .Vt / is
continuous, then .Ut / is continuous as well (cf. Lemma 2.10.11). The continuity of Z
follows from Ut Z.0/ D Z.t /.
Remark 2.10.15. Let Z be a continuous stationary field on Rd . The proof of the sec-
ond part of the previous theorem can be carried out by using Bochner’s Theorem 1.7.3
instead of Theorem 1.7.8, as follows.
Let be the spectral measure of Z and consider the isometric mapping
IZ W L2 . / ! H.Z/ with IZ ei.t;/ D Z.t / (cf. Remark 2.9.12). We define
the unitary representation .Vt / by
and from Lemma 2.10.12.29 We call .Vt / the canonical unitary representation of Z in
L2 . /. In Section 2.11 we will encounter another unitary representation for Z which
is constructed using the correlation function of Z.
28 This follows from the fact that is uniquely determined by its inverse Fourier–Stieltjes transform.
29 Note that .Z.0/; Z.t // D .Z.s/; Z.s t // by stationarity, so that Z.0/ can be replaced by Z.s/
where s 2 Rd is arbitrary.
124 Chapter 2 Correlation functions
Proof. The equivalence of (i) and (iii) follows from the relations
H0 is .Ut / invariant ” Ut h 2 H0 ; t 2 T; h 2 H0
” Ut h ? g; h 2 H0 ; g 2 H0?
” h ? Ut g; t; h 2 H0 ; g 2 H0?
” Ut g 2 H0? ; t; g 2 H0?
” H0? is .Ut / invariant:
h D h0 C h?
0; h 0 2 H0 ; h ? ?
0 2 H0 :
P 0 Ut h D P 0 Ut h 0 C P 0 Ut h ?
0 D Ut h 0 :
H.Z/ D H1 ˚ H2
Z D Z1 C Z2
H.Z/ D H1 ˚ ˚ Hn :
Moreover,
Z.t / D 1 .t /X1 C C n .t /Xn ; t 2 T;
where Xj 2 Hj and j is a character of T .
Proof. Since the unitary operators Ut commute, they have a common eigenvector.
Denoting by H1 the subspace spanned by this eigenvector X1 we have
H.Z/ D H1 ˚ H1? :
By Lemma 2.10.16, the subspace H1? is .Ut /-invariant. Using this, the lemma follows
easily by induction on n (see also the discussion in the first part of Example 2.8.6).
Et g.x/ WD g.x t /; t; x 2 T:
T .g/ D f g W 2 Mf .T /g:
X
n X
m
.g; h/ WD Q f .0/ D f .yj xi /ai bj :
iD1 j D1
Since
X
m X
n
.g; h/ D g.yj /bj D h.xi /ai ;
j D1 iD1
the definition of .g; h/ does not depend on the particular choice of and . Using
Lemma 1.4.3 and Lemma 1.4.8 we see that .; / is a positive semidefinite inner product
on T .f /. It follows from the definition of .; / that
from (3) and from the fact that Et is isometric on T .f / that H.f / is invariant under
Et and .Et g; Et g/ D .g; g/ holds for all g 2 H.f /. Denote by Ut the restriction of
Et to H.f /. Since Ut Ut D Ut Ut D I (the identity operator), the operator Ut is
invertible and hence unitary. It is easy to check that .Ut / is a unitary representation of
T in H.f /. The linear space spanned by the set fUt f W t 2 T g is equal to T .f / and
hence it is dense in H.f /.
Since g 7! .g; Ut f / is continuous on H.f / and T .f / is dense in H.f /, equa-
tion (1) holds for all g 2 H.f /. Using the Cauchy–Schwarz inequality again, we
obtain
jgn .t / g.t /j2 D j.gn g; Ut f /j2 kgn gk2 kf k2 :
Thus, gn ! g uniformly on T whenever fgn g converges to g in H.f /.
Property (vi) follows immediately from the equation g˛ .t / D .g˛ ; Ut f /.
To prove (vii), let g; h 2 H.f / be arbitrary and choose n ; n 2 Mf .T / such
that limn n f D g and limn n f D h, the convergence being in H.f /. Then the
function t 7! .g; Ut h/ is the uniform limit of the functions
t 7! .n f; Ut n f / D Q n n f .t /
which are continuous. Thus, the representation .Ut / is continuous. The continuity of
the functions in H.f / now follows from (iv). The last statement can be proved using
the same arguments.
Throughout the rest of this section .Ut / denotes the unitary representation of T in
H.f /, constructed in the previous theorem. If f D C is the correlation function of a
stationary process Z, then .Ut / is called the canonical unitary representation of Z in
H.C /.
gi .t / D .gi ; Ut fi /
Proof. Using that H1 ? H2 and the fact that Hi is .Ut /-invariant, we obtain
QUt D Ut Q (1)
f1 .t / D .Qf; Ut f / (2)
f2 .t / D ..I Q/f; Ut f / (3)
Proof. We define in T .f / a new positive semidefinite inner product .; /1 by setting
. f; f /1 WD Q f1 .0/; ; 2 Mf .T /: (4)
From Theorem D.3.5 we conclude that the inner product .; /1 can be extended to a
positive semidefinite inner product on H.f /, which we also denote by .; /1 . By the
second part of Theorem D.3.5 there exists a nonnegative linear operator Q on H.f /
such that kQk 1 and
From equation (4) we see that .Ut g; h/1 D .g; Ut h/1 holds for all g; h 2 H.f / and
t 2 T . Using this, we obtain
.Ut Qg; h/ D .Qg; Ut h/ D .g; Ut h/1 D .Ut g; h/1 D .QUt g; h/:
This being true for all g; h 2 H.f /, we must have Ut Q D QUt ; t 2 T . Taking into
account the definition of .; /1 , we obtain
and
f2 .t / D f .t / f1 .t / D ..I Q/f; Ut f / D ..I Q/f /.t /:
Therefore, f1 D Qf 2 H.f / and f2 D .I Q/f 2 H.f /.
Suppose now that Q is a nonnegative operator on H.f / with kQk 1 and QUpt D
Ut Q; t 2 Tp. Then the operator I Q is nonnegative as well. Setting Q1 WD Q
and Q2 WD I Q, the operators Q1 and Q2 commute with Ut for every t 2 T (cf.
Theorem D.3.7). Thus, we have
f1 .t / D .Qf; Ut f / D .Q1 f; Ut Q1 f /
Section 2.11 Unitary representations and positive definite functions 129
and
f2 .t / D ..I Q/f; Ut f / D .Q2 f; Ut Q2 f /:
It is obvious that f D f1 C f2 . It follows from Lemma 2.10.9 that f1 and f2 are
positive definite.
Proof. By the previous theorem f1 ; f2 2 H.f / and therefore the corollary follows
from the last two statements of Theorem 2.11.1.
Consequently, by the second part of Theorem D.3.5, there exists a bounded linear
operator Ch W H.f / ! H.f / for which Bh .g1 ; g2 / D .Ch g1 ; g2 / for all g1 ; g2 2
H.f /. Using the definition of Bh we obtain
Z Z
.Ch g/.t / D .Ch g; Ut f / D .Us g; Ut f /h.s/ ds D g.t s/h.s/ ds D h g.t /
G G
31 Note that h g is a continuous function because g is bounded and measurable by Theorem 2.11.1.
130 Chapter 2 Correlation functions
We are now able to prove a decomposition theorem for measurable positive definite
functions.
f D fc C f 0
Proof. Denote by Hc the set of all continuous functions in H.f / and by H0 the or-
thogonal complement of Hc . Then Hc and H0 are closed .Ut /-invariant subspaces of
H.f / and
H.f / D Hc ˚ H0 :
f D fc C f0 ; fc 2 P c .Rd / \ Hc ; f0 2 P .Rd / \ H0 :
Proof. Let B be an open ball with center 0 such that B CB V and denote by HB the
subspace of H.f / spanned by the functions Ut f; t 2 B. As f is measurable on V and
B C B V , the function Ut f is measurable on B for all t 2 B. Since convergence
in H.f / implies uniform convergence (cf. Theorem 2.11.1) every function in HB is
measurable on B. If g ? 2 HB? , then g ? .t / D .g ? ; Ut f / D 0; t 2 B. Applying the
decomposition
H.f / D HB ˚ HB?
we see that every function h 2 H.f / can be written as
h D g C g?
Special properties
holds for every positive integer n, for all pairwise distinct t1 ; : : : ; tn 2 Rd , and for all
c 2 Cn n f0g.
Proof. Setting n D 1 and c D 1 in Lemma 3.1.2 shows that f .0/ > 0 while setting
n D 2; x1 D x; x2 D 0; c1 D 1, and c2 D 1 yields
whenever x ¤ 0.
(iv) is a consequence of Schur’s Theorem D.2.12 while the remaining statements
follow immediately from the definition of strictly positive definiteness.
where n 2 N; c 2 Cn and x 2 Rd .
Theorem 3.1.4. Let f 2 P c .Rd / and let be the corresponding spectral measure.
Then f 2 SP c .Rd / if and only if there is no trigonometric polynomial P ¤ 0 on Rd
which vanishes on the support of .
where c 2 Cn nf0g and the xj ’s are pairwise distinct. The proof of inequality (1.1.2.v)
shows that Z
Xn
f .ti tj /ci cj D jP .t /j2 d.t /:
i;j D1 Rd
Since jP j2 is continuous and nonnegative, the integral on the right-hand side is zero
if and only if P vanishes on the support of . The theorem follows immediately from
this observation.
The next two theorems provide simple sufficient conditions to ensure strict positive
definiteness on Rd .
Proof. By Lemma 3.6.3 the spectral measure of f is radial. Since f is not constant
the support of contains a sphere of positive radius centered at the origin. Therefore,
Theorem C.10.3 shows that there is no trigonometric polynomial P ¤ 0 vanishing on
the support of . From Theorem 3.1.4 we infer that f is strictly positive definite.
134 Chapter 3 Special properties
Theorem 3.1.6. Let f 2 P c .Rd / and let be the corresponding spectral measure.
If the support of contains a nonempty open set then f is strictly positive definite.
Proof. The statement follows by the same argument as in the proof of Theorem 3.1.5.
Instead of Theorem C.10.3 we apply Theorem C.10.2.
Proposition 3.1.7. Let f 2 P c .R/ and let be the corresponding spectral measure.
If the support of has accumulation points, then f is strictly positive definite.
Corollary 3.1.8. If the support of is not countable, then f is strictly positive defi-
nite. In particular, if is not purely discrete, then f 2 SP c .R/.
Proof. Theorem 1.8.5 with s D 0 shows that the spectral measure of f cannot be
purely discrete so we can apply Corollary 3.1.8.
3.2.1. Recall that C1 .Rd / denotes the set of infinitely differentiable functions
on Rd while C1 d 1 d
00 .R / stands for the set of functions in C .R / which have compact
support. It is easy to check that the function
e1=t if t > 0;
g.t / WD (1)
0 if t 0
0.5
1 2 3 4
S WD fx 2 Rd W f .x/ 6D 0g
0.4
1 1
Assume that S Rd is such that (i)–(iii) hold and let s1 ; s2 ; : : : be all points of S
having rational coordinates. For each j 2 N choose rj > 0 such that
B c .rj / [ ŒB c .rj / C sj [ ŒB c .rj / sj S: (1)
Let h 2 \
P c .Rd / C1 .Rd / be such that h 0 and supp h D B c .1/. We define the
function gj ; j 2 N, by
gj .x/ D 2h.x=rj / C h..x C sj /=rj / C h..x sj /=rj /; x 2 Rd : (2)
By Lemma 1.4.18, the function gj is positive definite. In addition, gj is nonnegative,
gj 2 C1 .Rd / and
supp gj D B c .rj / [ ŒB c .rj / C sj [ ŒB c .rj / sj S:
Using equation (2) and Theorem 1.2.1 we see that
4A˛
jD ˛ gj j j˛j ; ˛ 2 Nd0
rj
where A˛ denotes the absolute moment corresponding to h. Using this, Lemma B.1.16
shows the existence of a sequence fpk g1
1 of positive real numbers such that the series
1
X
pj jD ˛ gj j
j D1
fx 2 Rd W f .x/ 6D 0g D S
follows from equation (1) and from the fact that for each s 2 S there exists a subse-
quence fskj g converging to s.
The next three lemmata are often useful as technical tools in proofs.
Lemma 3.2.3. There exists a sequence fgn g1 1 of radial positive definite functions
gn 2 C1 d / of the form g D h hQ hn 2 C1
00 .R / such that gn .0/ D 1
.R where d
00 n n n
1
and fgn g1 tends to 1 uniformly on compact sets.
Proof. This lemma can be proved in the same way as Lemma 1.5.6 by using the func-
tion f from (3.2.1.3).
Lemma 3.2.4. There exists a sequence ffn g of radial positive definite functions fn 2
C1 d
00 .R / such that
Lemma 3.2.5. For all ; ı 2 .0; 1/ there exists a radial positive definite function
p 2 C1 d
00 .R / such that p is a density and
0 p.t
O / < ı; t 2 Rd n B o . /:
Proof. Let f 2 C1 d
00 .R / be an arbitrary radial positive definite function which is a
density (cf. the proof of Lemma 3.2.4). Then 0 fO.t / 1 D f .0/ for all t . By the
Riemann–Lebesgue Lemma 1.8.4, the function fO is in C0 .Rd /. From Corollary 1.4.13
we conclude that 0 fO.t / < 1; t ¤ 0. Consequently,
for all ˛; ˇ 2 N0 .
M˛ g.x/ WD x ˛ g.x/; x 2 Rd
so that
kf k˛;ˇ D kM˛ D ˇ f k1 :
It is clear that C1
00 .R / S.R /. Another example of a rapidly decreasing function
d d
is given by
f .x/ D ekxk ; x 2 Rd :
2
Theorem 3.2.8. The Fourier transformation maps S.Rd / onto S.Rd / and
If g 2 S.Rd / we have
Z Z
d=2 jgQj
.2/ jgj
O jgj d D d kgQk1 < 1:
Rd Rd Q
Replacing here g by Mˇ D ˛ fO and using equation (1) we see that fO 2 S.Rd / and that
the second statement of the theorem is true.
In the same way, we see that fL 2 S.Rd / whenever f 2 S.Rd /. Using this, the fact
that the Fourier transformation maps S.Rd / onto S.Rd / follows from the relation
f D .fL /O ; f 2 S.Rd /:
Characteristic functions that are not differentiable at 0, like ejtj or max .1 jt j; 0/,
are not analytic. The characteristic functions et and cos t are entire, while 1it
2 1
and
1
1Ct 2
are analytic but not entire.
3.3.2. We have seen in Section 1.2 that the existence of derivatives of a characteristic
function f is closely related to the existence of moments of the corresponding distri-
bution .RWe will prove that analyticity is closely related to finiteness of integrals of
the form ers d.s/ for certain r 2 R. We introduce the numbers
Z 1
C C
s D sf D sup r 2 R W erx d.x/ < 1
1
Z 1
s D sf D inf r 2 R W erx d.x/ < 1
1
and the set
C
S D Sf D fz 2 C W s < Im z < s g:
0 s C and hence S is empty if and only if s D s C D 0.
Note that s
140 Chapter 3 Special properties
exists for all z in the strip S and it represents there a holomorphic function. More-
over, Z 1
.n/
f .z/ D .ix/n eizx d.x/; n 2 N (1)
1
for all z 2 S .7
Proof. The fact that f .z/ exists follows from jeizx j D eIm zx . We now proceed in
a similar way as in the proof of (1.2.1.i). Let z 2 S be fixed and choose a positive ı
such that z C h 2 S for all h 2 C with jhj < ı. If, in addition, h ¤ 0 then we have
Z 1
f .z C h/ f .z/ eihx 1 izx
D x e d.x/: (2)
h 1 hx
ı
Theorem B.1.5 with n D 0 shows that the integrand times 2 is dominated by
ıjxj jhxj Im zx
e e :
2
Using that r < er ; r 2 R, we see that this expression is less than
whenever jhj < 2ı . This function is -integrable with respect to x by our choice of
ı. Taking the limit h ! 0 in (2) and applying Lebesgue’s theorem on dominated
convergence we obtain (1) for n D 1. The general case is obtained by repeating the
arguments above.
Proof. (i) It follows from Theorem 3.3.3 that f and g have holomorphic extensions
to the strip S WD Sf \ Sg . Since S is connected and f D g on .a; a/, the extensions
are equal on S (cf. Theorem C.1.8). This implies that the corresponding distributions
are equal, hence Sf D Sg .
(ii) We extend f to the strip Sf and consider g on the whole complex plane. Since
Sf is simply connected and f D g on .a; a/, the extensions are equal on Sf .
(iii) By Theorem 1.2.1, the corresponding characteristic functions have the same Tay-
lor expansion at zero. Hence (iii) follows from (i).
0.7
1 2
Figure 3.3. The function p from Remark 3.3.7.
Theorem 3.3.8. If the random variables X and Y have analytic characteristic func-
tions and the equation
E.X k Y l / D E.X k / E.Y l /
holds for all k and l in N, then X and Y are independent.10
9 See, e.g., [1, 35, 55] or Chapter 2 in [6] for more information on this topic.
10 We refer to [7] for some related results.
Section 3.3 Analytic characteristic functions of one variable 143
Using Theorem 1.2.1 we see that the last sum is the Taylor series of the function g
at zero. Now fix s and t . The display above shows that r 7! g.rs; rt /; r 2 R, is an
analytic characteristic function which is equal to the analytic characteristic function
r 7! fX .rs/ fY .rt / in a neighborhood of zero. By (3.3.6.ii), the equality holds on
R. In particular,
g.s; t / D fX .s/ fY .t /:
Thus, X and Y are independent (cf. Theorem 1.3.10).
whenever
s <r < C.
s
Next we prove some simple but useful properties of analytic characteristic functions.
Proof. The first statement is obtained from Lemma 3.3.11 with zj D xj C iy=2 while
(ii) and (iv) follow from (i) (see Lemma 1.4.8 and Theorem 1.4.12).12 Let be the
distribution corresponding to f . Then
Z 1
f .iy/ D eyx d.x/ > 0:
1
12 Actually, these properties also follow easily from the integral representation of f .
Section 3.3 Analytic characteristic functions of one variable 145
n
By Lemma 3.3.11, the matrix A D f .zj z k / j;kD1 is positive semidefinite and
hence det A 0. In the special case where n D 2, zj D iyj , this inequality gives
From (iii) we see that f is strictly positive on the imaginary axis. Inequality (1) and
the continuity of f imply that the function y 7! log f .iy/ is convex.
Proof. By (3.3.12.v), the function y 7! log f .iy/ is convex and hence the inequality
g.iy/ max g.ir/; g.ir/ (1)
and g.z/ D f .z/f .z/ D jf .z/j2 ; z 2 D.0; ı/. Then g is an even holomorphic
function on D.0; ı/, the gk ’s are characteristic functions and g; gk are nonnegative
on the interval .ı; ı/. Moreover,
Y
r
gk˛k .t / D g.t /; ı < t < ı: (1)
kD1
We will prove the statement of the theorem for the functions gk . The statement for fk
follows then from Theorem 3.3.9. We choose d 2 .0; ı such that g.z/ ¤ 0 if jzj < d .
From (1) we obtain
X
r
1
˛k log D h.t /; d < t < d (2)
gk .t /
kD1
To compute the derivatives on the right-hand side we apply Faà di Bruno’s formula
(B.1.14) which yields
X
c.b; m/ gk0 .0/b1 gk00 .0/b2 : : : gk .0/bm
.m/ .m/
q;k
.0/ D (4)
.2q/
the right-hand side of (3) contains the terms 2q˛k gk we obtain
ˇX ˇ
ˇ r ˇ
ˇ 2q˛k gk .0/ˇˇ j
.2q/ .2q/
.0/j
ˇ q
kD1
and hence ˇ ˇ
ˇ .2q/ ˇ 1
ˇgk .0/ˇ j .2q/ .0/j; k D 1; : : : ; r: (5)
2q˛k q
Applying the inequality (C.1.13) to the Taylor expansion of q D g 2q with r replaced
by ı0 D 2ı we get
j
.2q/
.0/j 2 ˇ ˇ
sup ˇRe g 2q .z/ ˇ 2M 2q
q
2q
(6)
.2q/Š ı0 jzjDı0
1
where M D ı0
supjzjDı0 jg.z/j. The inequalities (5) and (6) show that
.2q/
! 2q
1
gk .0/
lim sup M
q!1 .2q/Š
define functions whose Taylor expansions have ık2 and ı 2 as radii of convergence,
respectively. We have Gk .t 2 / D gk .it /; G.t 2 / D g.it / and therefore
Y
r
Gk˛k .t / D G.t /; t 2 .ı12 ; ı12 /: (7)
kD1
Relation (1.2.1.ii) shows that
.l/
Gk .0/ 0; l 2 N0 : (8)
Let ı0 2 .0; ı12 / be such that G.ı0 / ¤ 0 and define the functions Hk by
Hk .t / D Gk .ı0 C t /; jı0 C t j < ı12 ; k D 1; : : : ; r:
148 Chapter 3 Special properties
By (7) we have
Y
r
Hk˛k .t / D G.ı0 C t /; jı0 C t j < ı12 ;
kD1
hence
Y
r
Hk .t / ˛k
G.ı0 C t /
D :
Hk .0/ G.ı0 /
kD1
We raise this equation to the power n, differentiate the resulting equation n times and
.n/ .l/
set t D 0. The left-hand side contains the term n˛1 H1 .0/=H1 .0/. Since Hk .0/ 0
for all l (cf. (8)) we conclude that
ˇ
.n/ dn G n .ı0 C t / ˇˇ
n˛1 H1 .0/ H1 .0/ n :
dt G n .ı / ˇ 0 tD0
We now choose arbitrary real numbers r1 and r2 such that ı12 < r1 < r2 < ı 2 .
Applying the inequality (C.1.13) to the Taylor expansion of G n .ı0 Ct / with r replaced
by r1 ı0 we get
ˇ
1 dn n ˇ 2 ˇ n ˇ
G .ı C t /ˇ sup ˇRe G .ı0 C z/ ˇ 2M n ;
nŠ dt n
0 ˇ .r1 ı0 /n jzjDr1 ı0
tD0
1
where M D r1 ı1
supjzjr2 jG.z/j. Consequently,
.n/
! n1
H1 .0/
lim sup M
n!1 nŠ
and therefore H1 is holomorphic on the disk D.ı0 ; 1=M /. Since M does not depend
on ı0 , we can choose ı0 such that this disk contains ı12 . This shows that G1 is bounded
on .ı12 ; ı12 C / for some positive . On the other hand, the Taylor series of
G1 at 0 has nonnegative coefficients (cf. inequality (8)) and hence this series cannot
converge for values larger than ı12 (see Lemma B.1.15). This contradiction shows that
ı1 D ı.
CC
im WD fz 2 C W Im z > 0g; C
im WD fz 2 C W Im z < 0g
and
CC
re WD fz 2 C W Re z > 0g; C
re WD fz 2 C W Re z < 0g:
is holomorphic. Moreover,
Z 1 Z 1
2
jf .x C iy/j dx 2 jF .t /j2 dt; y > 0:
1 0
Proof. If z D x C iy where y > 0, then jeitz j D ety showing that the integral above
exists.
Assume that y > ı > 0; Im zn > ı, and zn ! z. Using the Cauchy–Schwarz
inequality we obtain
Z 1 Z 1
ˇ itz ˇ
2
jf .z/ f .zn /j 2
jF .t /j dt ˇe eitzn ˇ2 dt:
0 0
The second integrand on the right is bounded by the integrable function t 7! 4e2ıt
and tends to zero as n ! 1. The dominated convergence theorem shows that f is
continuous.
Let be a closed path in CC im . By Fubini’s theorem we have
Z Z 1 Z
f .z/ dz D F .t / eitz dz dt:
0
Thus, the integral of f over is zero and hence, by Morera’s Theorem C.1.6, f is
holomorphic on CC im .
To prove the last statement let y > 0 be fixed. By the definition of f , we have
Z 1 Z 1Z 1
f .x C iy/ dx D F .t /ety eitx dt dx; y > 0:
1 1 0
Lemma 3.4.2. Let 0 < a < 1 and F 2 L2 .a; a/. The function
Z a
f .z/ D F .t /eitz dt; z 2 C
a
150 Chapter 3 Special properties
is entire and
jf .z/j C eajzj ; z2C
with some constant C 0.
Proof. The fact that f is entire can be proved by the same arguments as in the proof
of Lemma 3.4.1 while the inequality follows from
Z a Z a
jf .z/j jF .t /jety dt eajyj jF .t /j dt
a a
Definition 3.4.3. For a > 0 we denote by E.a/ the set of all entire functions of
exponential type a, i.e., F 2 E.a/ provided that F is entire and
for all z 2 C.
f .x/ WD f .x/ejxj ; x 2 R:
lim kf f k2 D 0:
!0
Plancherel’s theorem (cf. Theorem 1.8.9) implies that the Fourier transforms of the
f ’s converge in L2 .R/ to the L2 Fourier transform F of f . By equation (3) the
Section 3.4 Holomorphic L2 Fourier transforms 151
function F vanishes outside Œa; a. Since f is equal to the inverse Fourier transform
of F we see that equation (2) holds for almost every z 2 R. As each side of (2) is an
entire function (see Corollary 3.3.4), we conclude that (2) holds for all z 2 C.
To prove (3) let ˛ be the path defined by
Write
…˛ D fw 2 C W Re .wei˛ / > ag;
and for w 2 …˛ , define the function ‰˛ by16
Z Z 1
wz
f .sei˛ /ewse ds:
i˛
‰˛ .w/ D f .z/e dz D e i˛
˛ 0
Using the fact that f 2 E.a/ we see that the modulus of the integrand is at most
C e.Re .we
i˛ /a/s
:
The same arguments as in the proof of Lemma 3.4.1 show that ‰˛ is holomorphic in
the half plane …˛ . In the cases ˛ D 0 and ˛ D , we extend the definition of ‰˛ to
larger half planes by
Z 1
‰0 .w/ D f .x/ewx dx; w 2 CC re
0
and Z 0
‰ .w/ D f .x/ewx dx; w 2 C
re :
1
Inequality (1) and Lemma 3.4.1 show that ‰0 and ‰ are holomorphic in the indicated
half planes. Moreover, it is easy to check that
Z 1
f .x/eitx dx D ‰0 . C it / ‰ . C it /; t 2 R:
1
Hence, it remains to prove that the right-hand side of the equation above tends to zero
as ! 0 if jt j > a.
Next we show that any two of the functions ‰˛ are equal in the intersection of their
domains of definition.
Let 0 < ˇ ˛ < and write
˛Cˇ ˇ˛
D ; D cos :
2 2
Note that is positive. If w D rei , where r 2 .a=; 1/, then w 2 …˛ \ …ˇ . This
follows from
Re wei˛ D r D Re weiˇ :
16 See Section C.1 for the definition of line integrals.
152 Chapter 3 Special properties
is at most
C e.ar/R :
Since r 2 .a=; 1/, we conclude that the integral IR tends to zero as R ! 1.
By Cauchy’s Theorem C.1.5
Z Z
f .z/ewz dz D IR C f .z/ewz dz:
Œ0;Reiˇ Œ0;Rei˛
Theorem C.1.8 shows that ‰˛ and ‰ˇ coincide in the intersection of their domains
of definition. Thus, the difference on the right of equation (3) does not change if we
replace ‰0 and ‰ by ‰=2 if t < a, and by ‰=2 if t > a. From this we infer
that the difference tends to 0 as ! 0.
Let ˙1 ; ˙2 ; : : : be the zeros of F , counted with their multiplicities. Since F 2
E.2/, Corollary C.1.16 shows that
log C C 2jn j
n
log 2
with some constant C > 0 and hence jn j > cn, for some c > 0. Thus,
1
X 1
< 1:
jn j2
nD1
Using this and the facts that F is even and F 2 E.2/, Hadamard’s factorization Theo-
rem C.1.19 shows that
1
Y z2
F .z/ D 1 ; z 2 C: (2)
2n
nD1
Let t1 t2 be those positive numbers for which itn is a zero of F and write
Y
m
z2
Am .z/ D 1C
tn2
nD1
Qm D F=Am
Q D lim Qm :
m!1
jR.it /j D et jF1 .it /j2 D et Q.it / et C e2t ; t > 0;
with some C > 0, i.e., R is bounded on the upper half of the imaginary axis. Since
F1 is of exponential type, R is of exponential type as well. The Phragmen–Lindelöf
Theorem C.1.21 shows that R is bounded in the first and second quadrant. Applying
the same argument to the lower half plane we conclude that F1 2 E.1/ and F2 2 E.1/.
By the definition of Am and Q we have
1
Y z2
F .z/ D Q.z/ 1C
tn2
nD1
1
X
D F1 .z/F2 .z/ ck2 z 2k
kD0
with some nonnegative numbers ck . If u is real, then F1 .u/F2 .u/ D jF1 .u/j2 and
therefore
1
X
F .u/ D jck uk F1 .u/j2
kD0
X1 1
X
D jc2j u2j F1 .u/j2 C u2 jc2j C1 u2j F1 .u/j2 ; u 2 R:
j D0 j D0
Thus, the functions Gj .z/ WD c2j z 2j F1 .z/ and Hj .z/ WD c2j C1 z 2j F1 .z/ have the
desired properties.
jcn j cn .i Im zj /n ; j D 1; : : : ; n:
Taking modulus and dividing by jcn j shows that jIm zj j 1. Since jzj j D 1, we
conclude that jIm zj j D 1 for all j D 1; : : : ; n. This is only possible if n D 1 or
n D 2. Thus, the degree of P is at most 2. The case where d is arbitrary follows now
from the fact that s 7! f .s t / is a characteristic function of one variable for all t 2 Rd
(see, e.g., Remark 1.1.10) by using Lemma B.6.7.
Next we show that P has the form (1). We already know that
X
d X
d
P .t / D aj tj C bj k tj tk
j D1 j;kD1
f .t / D e˛jtjjtj ;
˛
t 2R
is a characteristic function for all ˛ 2 Œ2; 1/ (see Figure 3.4). To see this we show that
f is convex on the interval Œ0; 1/. The fact that f is a characteristic function follows
then from Pólya’s Theorem 3.9.11. The second derivative of f at t > 0 is
It suffices to prove that the expression in the brackets is positive which is equivalent to
p ˛
t C t ˛ > 1 1=˛ t 2 :
˛ ˛
This inequality holds because t t 2 if 0 t 1 and t ˛ t 2 if t 1.
1 1
Figure 3.4. The function f from Remark 3.5.3 with ˛ D 3.
Section 3.5 Further properties of Gaussian distributions 157
then
2
fj .t / D eiaj tbj t ; t 2R
for some aj 2 R and bj 0; j D 1; 2.
Proof. Without loss of generality we may assume that a D 0 and b D 1. From Corol-
lary 3.3.10 we infer that f1 and f2 are entire characteristic functions. The equation
ez D f1 .z/f2 .z/ holds clearly for every z 2 C. It follows that f1 has no zeros and
2
Proof. Suppose first that d D 1. By Theorem 3.3.14, the functions fk are entire. As
in the proofs of Theorems 3.3.14 and 1.2.13 we write
hence they correspond to Gaussian distributions. The statement of the theorem follows
now from Theorem 1.10.4.
X
d X
d
Ya D .a; X / D ak X k ; Yb D .b; X / D bk Xk
kD1 kD1
Y
d Y
d Y
d
fk .ak t C bk s/ D fk .ak t / fk .bk s/; t; s 2 R: (1)
kD1 kD1 kD1
where fk denotes the characteristic function of Xk . We may assume that ak and bk are
not both zero. Changing the enumeration if necessary we also assume that ak bk ¤ 0
if 1 k m and ak bk D 0 if k > m. If k > m and ak ¤ 0, then the factor fk .ak t /
appears on both sides of (1). The same holds for fk .bk s/ if k > m and bk ¤ 0. We
choose ı > 0 such that none of the factors in (1) is equal to zero if t; s 2 .ı; ı/. Then
we have
Y
m Y
m Y
m
fk .ak t C bk s/ D fk .ak t / fk .bk s/; t; s 2 .ı; ı/: (2)
kD1 kD1 kD1
We show that this equation implies that Xk is Gaussian. By the same argument as in
the proof of Theorem 3.5.6, we may assume that all the factors in (2) are positive.
Setting gk D log fk we write (2) in the form
X
m X
m X
m
gk .ak t C bk s/ D gk .ak t / C gk .bk s/; t; s 2 .ı; ı/:
kD1 kD1 kD1
If the numbers bk =ak are all different, then Theorem C.9.5 shows that gk is a poly-
nomial in a neighborhood of 0 and hence, by Theorem 3.5.2, Xk is Gaussian. Assume
that b1 =a1 D D bn1 =an1 for some n1 and b1 =a1 ¤ bk =ak if k > n1 . We then
write Ya and Yb in the form
a2 an
Ya D a1 X1 C X2 C C 1 Xn1 C
a1 a1
and
b2 bn
Yb D b1 X1 C X2 C C 1 Xn1 C
b1 b1
The expressions in the brackets are equal. Applying the same arguments to the remain-
ing Xj ’s, we obtain that the sums in the brackets are Gaussian. By Theorem 3.5.4 of
Lévy and Cramér the summands are Gaussian as well.
18 We refer to [33] for further reading on results of this type and their applications.
160 Chapter 3 Special properties
Proof. Proceeding in the same way as in the proof of Theorem 3.5.7, we obtain the
functional equation
X
r X
r X
r
gk .Ak t C Bk s/ D gk .Ak t / C gk .Bk s/; t; s 2 B o .ı/
kD1 kD1 kD1
g.t / D g.O t /; t 2 Rd
Setting O D I where I is the identity matrix, we see that radial positive definite
functions are real-valued and even.
X
n
gD cj 1Bj ; n 2 N; cj 2 C; Bj 2 B.Rd /
j D1
and for all O 2 O.d /. Since these functions span a dense linear subspace of L1 ./, we
conclude that equation (1) holds for all continuous bounded functions g. In particular,
Z Z
1
O t / D ei.O t;s/ d.s/ D ei.t;O s/ d.s/ D .t
.O O /:
The converse statement can be proved in the same way by using the fact that functions
of the form t 7! ei.t;s/ ; s 2 Rd , span a dense linear subspace of L1 ./.
Proof. The equivalence of (i) and (ii) is trivial while the equivalence of (ii) and (iii)
follows from Lemma 3.6.3.
Assume that (i) holds and let Oa 2 O.d / be such that Oa a D kake1 where e1 D
.1; 0; : : : ; 0/ 2 Rd . Then
i.e., fX is radial.
Proof. In view of Lemma 1.5.4 it remains to prove that f is radial. Let O 2 O.d /
be an orthogonal matrix. Using the fact that the Lebesgue measure is invariant under
orthogonal transformations we obtain
Z
f .O t / D g.O t C y/g.y/ dy
ZR d
D g.O t C Oy/g.Oy/ dy
ZR
d
D g.t C y/g.y/ dy D f .t /; t 2 Rd
Rd
i.e., f is radial.
Using Bessel functions, the Fourier transform of a radial function can be expressed
as a univariate integral.
Introducing polar coordinates .r; '/ by x1 D r cos ' and u D r sin ', it follows
Z 1 Z
O
f .t / D .S d 2
/ h.r/r d 1
eiktkr cos ' sind 2 ' d' dr:
0 0
From Lemma C.5.3 we know that
1 x Z
J .x/ D p eix cos ' sin2 ' d'; x 0; 0
C 12 2 0
Theorem 3.6.8. Let f W Œ0; 1/ ! C be such that the functions t 7! f .kt k/; t 2
Rd C2 , and t 7! f .kt k/; t 2 Rd , are in L1 .Rd C2 / and L1 .Rd /, respectively. Then
we have
1 1 d
Fd C2 .f /.r/ D Fd .f /.r/; r > 0: (1)
2 r dr
Proof. In view of Theorem 3.6.6
Z 1
Fd .f /.s/ D .2/d=2 JQd=21 .rs/f .r/r d 1 dr; s>0 (2)
0
where JQ .x/ WD x J .x/. By the second equation in Proposition C.5.5 we have20
d Q
J .x/ D x JQC1 .x/: (3)
dx
while Lemma C.5.2 shows that
jJQ .x/j C ; 0
with some positive constant C . Using equation (3) and differentiating both sides of
equation (2) with respect to s we obtain (1). The fact that interchanging differentiation
and integration is permissible follows from
Z 1ˇ ˇ
ˇ d Q ˇ
ˇ Jd=21 .rs/ˇ jf .r/jr d 1 dr
0 ds
Z 1
ˇ ˇ
Ds ˇJQd=2 .rs/ˇ jf .r/jr d C1 dr
0
Z 1
sCd=2 jf .r/jr d C1 dr < 1:
0
That the last integral is finite is a consequence of Corollary B.7.7 since, by assumption,
the function t 7! f .kt k/; t 2 Rd C2 , is in L1 .Rd C2 /.
Corollary 3.6.9. Let f W Œ0; 1/ ! C be such that the functions t 7! f .kt k/; t 2 Rn
are in L1 .Rn / for all 1 n 2d C 2. Then we have
and
1 X .1/j .2d j 1/Š 1 d j
d
F2d C2 .f /.r/ D F2 .f /.r/ :
.2/d j D1 2d j .d j /Š.j 1/Š r 2d j dr
Lemma 3.7.1. Let U be a random vector that is uniformly distributed on the closed
ball B c .R/ Rd . Then its characteristic function fU (see Figure 3.5) is given by
d=2
2
fU .t / D .d=2 C 1/ Jd=2 .Rkt k/; t 2 Rd n f0g:
Rkt k
Proof. The case d D 1 being simple, assume that d > 1. Theorem 3.6.6 and Corol-
lary C.5.5 show that
Z R
.2/d=2 1d=2
fU .t / D kt k r d=2 Jd=21 .rkt k/ dr
d .B c .R// 0
d=2 Z Rktk
.2/
D kt kd x d=2 Jd=21 .x/ dx
d .B c .R// 0
.2/d=2 h iRktk
d d=2
D kt k x J d=2 .x/
d .B c .R// xD0
from which the lemma follows.
Lemma 3.7.2. Let U be a random vector that is uniformly distributed on the d-dimen-
sional sphere of radius R. Then its characteristic function fU is given by
d=21
2
fU .t / D .d=2/ Jd=21 .Rkt k/; t 2 Rd n f0g:
Rkt k
Proof. To simplify the notation we may assume that R D 1. By Lemma 3.6.3 the
function fU is radial and hence fU .t / D fU .kt ke1 / where e1 D .1; 0; : : : ; 0/ 2 Rd .
166 Chapter 3 Special properties
0.75
0.50
0.25
2 4 6 8 10
0:25
Figure 3.5. The characteristic function fU from Lemma 3.7.1 with d D 2 and R D 1 (shown
as a function of the norm).
Z
1
D d 1
eiktku1 d d 1 .u/
d 1 .S / S d 1
Z Z 1
1
D d 1 /
.1 s 2 /.d 3/=2 eiktks ds d d 2 .v/:
d 1 .S S d 2 1
The definition of Jd=21 (cf. Definition C.5.1 and Remark C.5.6) and Proposi-
tion B.7.8 lead immediately to the stated formula.
f .t / D ektk ; t 2 Rd ; d 2 N
for all t where the power series above is absolutely convergent. The definition of the
Gamma function and Legendre’s duplication formula (cf. Proposition C.4.5) show that
the integral above is equal to
1
p 22kCd 1 .k C d=2/.k C .d C 1/=2/
and hence
1
X .1/k .k C .d C 1/=2/
fO.t / D 2d .d 1/=2 kt k2k ; kt k < 1:
kŠ
kD0
Using this and Proposition B.1.3 we see that equation (1) holds whenever kt k < 1
while Theorem 3.3.6 implies that it holds for all t 2 Rd .
The fact that f is a characteristic function follows from fO 0.
Corollary 3.7.4. Let P be a polynomial with real coefficients such that P .0/ D 1
and P .z/ ¤ 0 if Re z 0. Then
t 7! 1=P .kt k/b ; t 2 Rd
is a characteristic function for all b > 0 and all d 2 N.
Proof. Since P has no zero with nonnegative real part, it is the product of polynomials
of the form Q.t / D 1 C pt C qt 2 where p; q 0. Thus, it suffices to show that
t 7! 1=Q.kt k/b is positive definite. We have
Z 1 Z 1
b1 x
.b/ D x e dx D s b
ub1 esu du; b; s > 0:
0 0
Taking the inverse Fourier transform in equation (3.7.3.1), we see that the function
t 7! .1 C kt k2 /.d C1/=2 is the characteristic function of an absolutely continuous
distribution. The next theorem states more.
21b
fO.x/ D kxkbd=2 Kbd=2 .kxk/; x 2 Rd n f0g:
.b/
0.15
0.10
0.05
2 4 6
Figure 3.6. The function fO from Theorem 3.7.5 with b D 3 and d D 4 (shown as a function
of the norm).
21 Recall that K denotes the modified Bessel function (cf. Definition C.5.7). The function fO is shown
in Figure 3.6.
Section 3.7 Radial characteristic functions 169
Consequently,
Z
fO.x/ D .2/d=2 f .t /ei.t;x/ dt
Rd
Z Z 1
1
D .2/d=2 ub1 e.1Cktk /u ei.t;x/ du dt
2
.b/ Rd 0
Z 1 Z
1
D .2/d=2 ub1 eu ektk u ei.t;x/ dt du:
2
.b/ 0 Rd
Using Lemma 1.3.4 we obtain
Z 1
1
fO.x/ D ubd=21 eu ekxk
2 =.4u/
du:
2d=2 .b/ 0
0.08
0.06
0.04
0.02
2 4 6
Figure 3.7. The density pd from Lemma 3.7.6 with d D 2 (shown as a function of the norm).
From Lemma 3.7.1 we conclude that the Fourier transform of 1B c .1=2/ is given by
Jd=2 .kxk=2/
1O B c .1=2/ .x/ D :
.2kxk/d=2
Thus, the last statement is an immediate consequence of Theorem 1.3.6 and equa-
tion (1.8.2.iii).
Remark 3.7.7. Write the function gd in the form gd .t / D hd .kt k/. For 0 s 1
we then have h1 .s/ D 1 s and
2
h2 .s/ D .arccos s s.1 s 2 /1=2 /
(see Figure 3.8).
Partial integration of the integral in Lemma 3.7.6 gives the recursion formula24
.d=2/ .d 1/=2
hd .s/ D hd 2 .s/ p s.1 s 2 /C ; s 2 R; d 3:
..d C 1/=2/
The derivative
d .d=2/ .d 1/=2
h0n .s/ D p .1 s 2 /C
..d C 1/=2/
is nonpositive and nondecreasing, so that hd is convex and nonincreasing for all
d 2 N.25
0.5
h2
h3
h4
0.5 1
Figure 3.8. The functions h2 ; h3 and h4 from Remark 3.7.7.
Proof. If d is even, then 'd .t / D 'd C1 .t; 0/; t 2 Rd . Therefore, it suffices to con-
sider the case where d D 2n C 1; n 2 N. Lemma 3.7.6 shows that the function
gd .t / D c1B c .1/ .t /Pn .kt k/; t 2 Rd
is positive definite, where c is some positive constant and
Z 1
Pn .s/ D .1 x 2 /n dx; s 2 R:
s
By Proposition C.1.10
Pn .s/ D .1 s/nC1 Qn .s/;
where Qn is a polynomial of degree n, having only zeros with negative real part. Using
the definition of Pn we see that
1
.1 kt k/nC1
C D gd .t /; t 2 Rd :
cQn .kt k/
Thus, the statement follows from Corollary 3.7.4.
0.8
0.6
0.4
0.2
5 10 15
0:2
Proof. Since t 7! d .kt ks/ is a positive definite function on Rd for all s, the function
f .t / WD .kt k/, where is defined by (1), is positive definite as well. Thus, f is a
characteristic function by Bochner’s Theorem 1.7.3. The other direction follows from
the fact that f is the characteristic function of the random vector RX where R is a
random variable with distribution and X is a random vector uniformly distributed
on S d 1 and independent of R (cf. Theorem 1.1.6).
As the proof of the preceding theorem shows, we have the following characteriza-
tion of random vectors with a radial distribution.
Proof. Using the definition of the Bessel function Jd=21 (see Definition C.5.1) it is
p
easy to see that r 7! d .r d /; r 2 R, is the characteristic function corresponding
to the density
.d 3/=2
.d=2/ x2
pd .x/ D p 1 1.pd ;pd / ; x 2 R:
d ..d 1/=2/ d
In view of Stirling’s formula (cf. Corollary C.4.10)
.d=2/
lim p D 1:
d !1 d ..d 1/=2/
Using this and the relation
.d 3/=2
x2
D ex
2 =2
lim 1
d !1 d
we see that
1
lim pd .x/ D p1 .x/ WD p ex =2 ;
2
x 2 R:
d !1 2
The characteristic function corresponding to the density p1 is r 7! er
2 =2
. Therefore,
the lemma follows from Proposition 1.6.6.
Proof. Since t 7! ektk s =2 is a characteristic function on Rd for all s and for all
2 2
d (cf. Lemma 1.3.4), the same argument as in the proof of Theorem 3.8.2 shows that
2 ˆ1 .
Next assume that 2 ˆd for all d . Then for each d there exists a probability
measure d on Œ0; 1/ such that
Z 1 p
.r/ D d .rs d / dd .s/; r 2 R: (1)
0
where jd;a j < 2 whenever both a and d are sufficiently large. If a is a continuity
point of , letting d ! 1 we get
Z a
er s =2 d.s/ C ıa .r/
2 2
.r/ D
0
where jıa j < 2 . Finally, letting a ! 1 and then ! 0 we obtain the desired
representation of .26
Definition 3.9.1. For a 2 R and n 2 N0 we denote by Cna the set of all functions
g W Œa; 1/ ! R having the following properties:
(i) g is bounded;
(ii) g is n-times differentiable;27
(iii) g .n/ is monotone.
Note that C0a is the set of functions which are bounded and monotone.
Cna Cn1
a
C0a :
Proof. Assume that g 2 Cna for some n 1. Without loss of generality we may
further assume that g .n/ is decreasing. We prove that g .n/ is nonnegative and g .n1/
is nonpositive. The lemma follows then by induction on n.
26 See also the proof of Bernstein’s Theorem 3.9.6 for a similar argument.
27 We consider differentiability from the right at a.
176 Chapter 3 Special properties
exists. We show that ın D 0. If ın > 0, then there exists An > a such that
g .n/ .x/ > ın =2 whenever x An . Thus, the function g .n1/ is increasing on the
interval ŒAn ; 1/. Let x An . By the mean value theorem there exists 2 Œx; x C 1
such that
This relation shows that limx!1 g .n1/ .x/ D 1. Repeating this argument we obtain
that limx!1 g.x/ D 1, contradicting our assumption. In the same way we see that
ın < 0 is not possible and hence ın D 0. Using this and the fact that g .n/ is decreas-
ing, we conclude that g .n/ must be nonnegative on Œa; 1/. This implies that g .n1/
is increasing. The same argument as above shows that limx!1 g .n1/ .x/ D 0 and
hence g .n1/ is nonpositive.
Lemma 3.9.3. If n 2 N and g 2 Cna , then limx!1 x k g .k/ .x/ D 0 for all k D
1; : : : ; n.
Proof. Lemma 3.9.2 shows that g is monotone and hence the limit ı0 WD
limx!1 g.x/ exists. Replacing g by g ı0 we may suppose that ı0 D 0. With-
out loss of generality we may further assume that g is decreasing. Then g 0; g 0 0
and g 0 is increasing. We consider first the case n D 1. For 0 < c < 1 we have
Z 1 Z x
0
g.cx/ D g .u/ du g 0 .u/ du .1 c/xg 0 .x/ 0 (1)
cx cx
28 This result is essentially due to P. Lévy [40] though his formulation and proof is slightly different
from ours.
Section 3.9 Convex and completely monotone functions 177
Proof. It suffices to consider the case where g .n/ is convex. The same argument as
in the proof of Lemma 3.9.3 shows that we may suppose limu!1 g.u/ D 0. By
Theorem B.4.4 there exists a 0 such that g .n/ is monotone on Œa; 1/. Consequently,
by Lemma 3.9.3, the relation (1) holds if k n.
Since g .n/ is convex it is absolutely continuous and g .nC1/ is increasing (cf. Theo-
rem B.4.7 and Theorem B.4.3). Consequently, we may replace g by g .n/ in rela-
tion (3.9.3.1). This leads to the inequality
g .n/ .cx/ .1 c/xg .nC1/ .x/ 0; cx a
from which (1) with k D n C 1 follows.
Using the relation (1) repeated integration by parts (cf. relation (B.5.6.1)) gives
Z 1 Z 1
.1/n
g.0/ D g 0 .v/ dv D D v nC1 dg .nC1/ .v/:
0 .n C 1/Š 0
This shows that the integral on the right-hand side converges. Thus, the integral in
the definition of F converges as well. It is clear that F is monotone and bounded.
Integrating by parts again and using Theorem B.5.4 we obtain
Z 1 Z 1
.1/n
g.x/ D g 0 .u/ du D .u x/nC1 dg .nC1/ .u/
x .n C 1/Š x
Z 1
1
D .u x/nC1 nC1 dF .u/
u
Zx 1
x nC1
D 1 dF .u/
u
Zx 1
x nC1
D 1 dF .u/:
0 u C
The function x 7! eux is completely monotone for all u 2 Œ0; 1/. The next
theorem shows that this function is the prototype of completely monotone functions.
30 We used the same idea as in the proof of Schoenberg’s Theorem 3.8.5. Here the situation is simpler
since we have uniform convergence of the integrands.
Section 3.9 Convex and completely monotone functions 179
f .t / WD g.kt k/; t 2 Rd
The case d D 1 of the previous theorem has been considered by G. Pólya in [43].
Theorem 3.9.11 (Pólya). Every function of the Pólya type is the characteristic func-
tion of an absolutely continuous distribution.
3.9.12. Below we give a more detailed statement. For its proof we will need the fol-
lowing facts. Every function f of the Pólya type admits the representation
Z 1
f .t / D .1 jt =yj/C d.y/; t 2 R
0
where d.y/ D dŒ1 f .y/ C yf 0 .y/ is a probability distribution on .0; 1/ and f 0
denotes the right-hand derivative. This follows from Theorem 3.9.4 but it can also be
proved directly using the relation
Z 1 Z 1
d f .y/
f .t / D t dy D .1 t =y/ dŒ1 f .y/ C yf 0 .y/; t > 0
t dy y t
d
where dy denotes the right-hand derivative. The fact that the function
F .y/ WD 1 f .y/ C yf 0 .y/ (1)
is increasing is illustrated by Figure 3.10. For y 2 .0; 1/ and x 2 R let
2
2 sin2 yx=2 y sin yx=2
K.x; y/ WD D
yx 2 2 yx=2
F .y/
Remark 3.9.14. The fact that a function f of Pólya-type is positive definite can also
be proved in the following way. For m 2 N let the function gm W R ! Œ0; 1/ be
defined by the relations (see Figure 3.11)
(i) gm .j=m/ D f .j=m/; j D m2 ; : : : ; m2 ;
(ii) gm is linear on the intervals Œj=m; .j C 1/=m; j D m2 ; : : : ; m2 1;
(iii) gm .x/ D f .m/ if jxj > m.
It is not hard to see that gm can be written as
2
X
m
gm D f .m/ C pj j=m
j Dm2
Figure 3.11. The functions f (continuous line) and gm from Remark 3.9.14.
are of the Pólya type if 0 < ˛ 1. Note that g˛ and h˛ are characteristic functions
also for 1 < ˛ 2. To prove this we show that these functions are positive definite.
As to g˛ , we may assume that 0 < ˛ < 2 and write
Z 1
1 cos s
D˛ D ds:
0 s 1C˛
Theorem 3.10.1. Let N 2 N and f 2 P .Z/ be such that supp f Œ2N; 2N . Then
there exists a complex-valued function h on Z such that supp h ŒN; N and
X
N
Q
f .n/ D h h.n/ D h.n C k/h.k/; n 2 Z:
kDN
X
2N
p.z/ D f .n/z n ; z 2 C:
nD2N
where
X
2N
q.z/ D bk z k ; z; bk 2 C:
kD0
X
2N
f .n/ D bnCk bk ; n 2 Z:
kD0
Theorem 3.10.2 (Boas–Kac, Kreı̆n). Let f 2 P c .R/ be such that the support of f is
contained in Œ2r; 2r. Then there is a square integrable function h vanishing outside
Œr; r such that
Z r
Q
f .x/ D h h.x/ D h.x C y/h.y/ dy; x 2 R:
r
X
m
.m/ .m/
f .n=m/ D bnCk bk ; n 2 Z: (1)
kD0
It follows from (1) and from the definition of hm that fm takes the same values as f at
the points n=m and is linear in between. Since f is uniformly continuous, we see that
limm!1 fm .t / D f .t / uniformly on R. As khm k22 D fm .0/ D f .0/, there exists a
subsequence fhmk g converging weakly in L2 .R/ to some square integrable function
h (see Theorem D.5.6). If g 2 L2 .R/ and supp .g/ R n Œ1; 1 then
Z 1 Z 1
g.x/h.x/ dx D lim g.x/hmk .x/ dx D 0
1 k!1 1
w
and hence supp .h/ Œ1; 1. Therefore, h is integrable. Using that hmk ! h and the
fact that the supports of h and hmk are contained in Œ2; 2 we see that
lim hO mk .t / D h.t
O /; t 2 R:
k!1
Consequently,
fO.t / D lim fOmk .t / D jhmk .t /j2 D jh.t /j2 ;
k!1
Q
i.e., f D h h.
33 The first convolution below is defined by considering b .m/ as a complex measure in Mf .R/.
186 Chapter 3 Special properties
where g D f if d D 1 and
Z
g.t1 / D f .t1 ; t2 ; : : : ; td / dt2 dtd ; t1 2 R
Rd 1
34 The paper [14] contains an analogue of Rudin’s result where f is not supposed to be infinitely differ-
entiable and d 3. See also [15] where this analogue is stated without proof.
Section 3.11 Infinitely divisible characteristic functions 187
Since jGj .u/j2 F and jHj .u/j2 F , the functions Gj jR and Hj jR are rapidly
decreasing. Thus, s 7! Gj .ksk/ and s 7! Hj .ksk/ are integrable on Rd . We define
the function gj by Z
gj .t / WD ei.t;s/ Gj .ksk/ ds: (3)
Rd
By Lemma 3.6.3 the function g is radial while Theorem 3.2.8 shows that gj is rapidly
decreasing. Since Gj is even and entire, for fixed t2 ; : : : ; td 2 R there is an entire
function Qj D Qjt2 ;:::;td such that
Using that Gj 2 E.r/ it is easy to check that Qj belongs to E.r/. By the Paley–
Wiener Theorem 3.4.4 the Fourier transform of Qj vanishes outside Œr; r. Hence,
Z Z
gj .t1 ; 0; : : : ; 0/ D eit1 x Qj .x/ dx dt2 dtd D 0
Rd 1 R
whenever jt1 j r (if d D 1, then the relation above does not contain the outer
integral). Since gj is radial we conclude that gj vanishes outside B o .r/.
Next, we have
Xd
juj2 jHj .juj/j2 D juk Hj .juj/j2 :
kD1
Associate the function hj with Hj as the function gj was associated with Gj by equa-
tion (3). Then hj is rapidly decreasing. Moreover, if hkj denotes the partial derivative
of hj with respect to tk , then .2/d=2 tk Hj .kt k/ is the Fourier transform of hkj .
From equation (2) we see that
1
X X 1
d X
fO.t / D jgO j .t /j2 C jhO kj .t /j2
j D1 kD1 j D1
where the series converges in L1 .Rd /. Taking inverse Fourier transforms we obtain
the uniformly convergent representation
1
X X 1
d X
f D gk gQ k C hkj hQ kj
kD1 kD1 j D1
f D .fn /n :
D n n :
„ ƒ‚ …
n times
Lemma 3.11.2. If the characteristic functions f and g are infinitely divisible, then
so are f ; jf j2 and fg.
The next theorem shows that characteristic functions having zeros cannot be in-
finitely divisible.
Proof. For n 2 N let fn be a characteristic function such that f D .fn /n . The function
jf j2=n D jfn j2 is a characteristic function for all n. Define g by
Remark 3.11.4. Let g W Rd ! C n f0g be a continuous function such that g.0/ > 0.
By Theorem C.8.7 there exists a unique continuous function W Rd ! R such that
.0/ D 0 and
g.t / D jg.t /j ei.t/ ; t 2 Rd :
Section 3.12 Conditionally positive definite functions 189
Proof. Since fk is infinitely divisible, jfk j2=n is a characteristic function for all posi-
tive integers k and n. Lévy’s continuity Theorem 1.6.3 and the relation
Proof. If p is rational, then the statement follows easily from Definition 3.11.1. Using
this, the general case is a consequence of Theorem 3.11.5.
In the next section we give an integral representation for the continuous logarithm
of infinitely divisible characteristic functions.
35 Conditionally positive definite functions and their generalizations have many applications, see for
example [9, 58] and also the notes in Remark 3.12.16.
190 Chapter 3 Special properties
holds for all n 2 N; t1 ; : : : ; tn 2 Rd and for all complex numbers c1 ; : : : ; cn such that
c1 C C cn D 0.
has at most one negative eigenvalue36 for all finite systems t1 ; : : : ; tn of elements
of Rd .
36 See also Remark 3.12.16.
Section 3.12 Conditionally positive definite functions 191
X
n X
n
f .ti tj /ci cj D .f .ti tj / f .ti / f .tj / /ci cj 0:
i;j D1 i;j D1
X
n X
n
.f .ti tj / f .ti / f .tj / /ci cj D f .ti tj /ci cj
i;j D1 i;j D1
Xn X
n
C f .ti /ci cnC1 C f .tj /cnC1 cj
iD1 j D1
X
nC1
D f .ti tj /ci cj f .0/jcnC1 j2
i;j D1
0:
Proof. Suppose that epf ./ is positive definite for all p > 0. Then .epf ./ 1/=p is
conditionally positive definite for all p > 0. From the relation
epf .t/ 1
f .t / D lim ; t 2 Rd
p!C0 p
we conclude that the function f is conditionally positive definite.
To prove the converse statement, suppose that f is conditionally positive definite.
Then so is f f .0/ and therefore we may suppose that f .0/ D 0. Since pf is
conditionally positive definite if p > 0, it suffices to consider the case p D 1. If .aij /
is a positive semidefinite matrix, then so is .exp.aij // (cf. Corollary D.2.13). Using
this, it follows from Theorem 3.12.4 that the matrix
n
exp.f .ti tj / f .ti / f .tj / /
i;j D1
t 7! a C i.x; t / .C t; t /; t 2 Rd
Proof. If
X
n
D cj ıtj
j D1
Section 3.12 Conditionally positive definite functions 193
O
then .0/ D c1 C C cn . Using this the statement can be proved in the same way as
Lemma 1.4.3.
g D Q f
O
is conditionally positive definite. If .0/ D 0, then g is positive definite.
In particular, the function
t 7! 2f .t / f .t C y/ f .t y/; t 2 Rd
Proof. Let be an arbitrary finitely supported complex measure on Rd . The first two
statements of the lemma follow from the equation
Q f .0/ D . / . / Q f .0/
@2
f
@tj2
@2 2f .t / f .t C ej / f .t ej /
f .t / D lim ; t 2 Rd :
@tj2 !0 2
where e1 ; : : : ; ed C1 is the standard basis of Rd C1 . Lemma 3.12.7 shows that this func-
tion is positive definite. Its restriction to Rd is given by
Dividing by 2 and letting tend to zero we obtain the statement of the lemma.
1X l
3
D i . C il /
Q . C il /Q
Q
4
lD0
Proof. In view of Lemma 3.2.5 for every n 2 N there exists a radial probability meas-
ure n with compact support satisfying
0 O n .t / 1=2; kt k 1=n:
Setting n WD ı0 n we have O n .0/ D 0 and
1
O n .t / ; kt k 1=n: (1)
2
By Lemma 3.12.7, the function n Q n f is positive definite. Consequently, writing
rn WD n Q n f .0/, we have
jn Q n f .t /j rn ; t 2 Rd : (2)
Since kn k 2, the equation
1
X 1
0 D n Q n (3)
2n max.1; rn /
nD1
defines a finite radial measure. Using inequality (2) we see that the convolution f 0
exists and f 0 2 P c .Rd /.
Property (ii) follows from inequality (1).
Section 3.12 Conditionally positive definite functions 195
Proof. Assume first that f admits the integral representation (1). That f is condition-
ally positive definite follows immediately from the fact that the functions
0 f D L 0
with some nonnegative measure 0 . We define the measure by .f0g/ WD 0 and
1
d .y/ WD d0 .y/; y 6D 0:
O 0 .y/
Then is nonnegative and (3.12.11.ii) implies that
.Rd n U / < 1
f D L :
In particular, the right-hand side represents a finite measure. The special case x WD
ı0 ıx and WD Q x , gives
Z Z
2
jO x .y/j d .y/ D j1 ei.x;y/ j2 d .y/
Rd Rd
Z
D2 Œ1 cos..x; y// d .y/ < 1; x 2 Rd
Rd
from which the relation (2) readily follows.
Now we put
Z
p.t / WD f .t / ei.t;y/ 1 i.t; h.y// d .y/; t 2 Rd :
Rd
That the integral above exists follows from (2) using the inequality in Lemma B.1.5
and the fact that h.y/ D y in some neighborhood of zero. Next we show that p
is a constant for all ; 2 Mf .Rd / with .0/ O D .0/
O D 0. Writing WD for
short, and applying Proposition C.9.6 we obtain
Z
p.t / D f .t / O .y/ei.t;y/ d .y/
Z R
d
D L .t / O .y/ei.t;y/ d .y/; t 2 Rd :
Rd
From equation (3) we see that the last integral is equal to
Z
ei.t;y/ d .y/ D L .t / .f0g/
Rd nf0g
and hence
p.t / D .f0g/; t 2 Rd :
In particular, .ı0 ıx / .ı0 ıs / p is constant for all x; s 2 Rd . Applying
Lemma C.9.4 twice we see that p is a polynomial of degree at most 2. Setting WD ,
Q
we obtain that
Q p.0/ D Q .f0g/ 0
and therefore, by Lemma 3.12.6, the function p is conditionally positive definite. Thus,
ep is positive definite in view of Theorem 3.12.5. The special form of p follows from
(the simple part of) Theorem 3.5.1.
To prove uniqueness of the representation (1) assume that
Z
p1 .t / C ei.t;y/ 1 i.t; h.y// d1 .y/
ZR
d
D p2 .t / C ei.t;y/ 1 i.t; h.y// d2 .y/; t 2 Rd
Rd
where is a nonnegative measure on Œ0; 1/. Let > 0 and define the function h by
h .r/ WD h.r C /; r 2 .; 1/. Then we have
Z r
h .r/ D h .0/ C h0 .t / dt
0
Z rZ 1
D h .0/ e.tC/s d.s/ dt
0 0
Z 1 Z r
s
D h .0/ e ets dt d.s/
0 0
Z 1
ers 1
D h .0/ C es d.s/
0 s
Since the function ekk s 1 is conditionally positive definite on Rd for all s 2 Œ0; 1/
2
and for all d , the function h .k k2 / is conditionally positive definite on Rd for all d .
Letting tend to zero, we conclude that the same is true for h.
37 This result can easily be generalized for conditionally positive definite functions of order m (see [58]).
198 Chapter 3 Special properties
Example 3.12.15. Let a > 0; 0 < b < 1, and write g.r/ WD .a C r/b ; r 0. From
the relation
Remark 3.12.16. M. G. Kreı̆n (see the historical remarks in [49]) generalized the con-
cept of positive definiteness in the following way: A complex-valued Hermitian func-
tion f on Rd is said to have k negative squares if the Hermitian matrix
n
A D f .ti tj / i;j D1
has at most k negative eigenvalues (counted with their multiplicities) for any choice
of n and t1 ; : : : ; tn 2 Rd , and for some choice of n and t1 ; : : : ; tn the matrix A has
exactly k negative eigenvalues. Denote by Pk .Rd / the set of all functions on Rd with
k negative squares. As we have seen in Lemma 3.12.3, conditionally positive definite
functions belong to P0 .Rd / [ P1 .Rd /. Further examples are given by the functions
38 See, among others, [4, 9, 42, 52, 53] and the references therein.
Chapter 4
P0 .V / WD ff W f 2 P .V /; f .0/ 1g:
and put
1 p 1
.1/ WD ˛ WD Œa.1 C / C b.1 /; . 3/ WD ˇ WD Œa.1 C / b.1 /
2 2
and .2/ WD . To see that f is T -positive definite, write
A D .f .ti tj //5j;kD0
is positive semidefinite. Writing down A, we see that it is the Toeplitz matrix corre-
sponding to the function .1; ˛; ˇ; ; ˇ; ˛/ on the group f0; 1; 2; 3; 4; 5g D Z=6Z (factor
2 Below we use additive notation for the group operation in T which is actually multiplication.
3 This example is due to T. M. Bisgaard.
202 Chapter 4 The extension problem
Im
Re
group). By (1.9.6.iii) it suffices to show that the Fourier transform of this function is
nonnegative. So one has to verify
1 C ˛ .z k C z k / C ˇ .z 2k C z 2k / C z 3k 0
Setting v WD Œ1; 1; 1T we have .Bv; v/ D 3.1 C 2 / < 0. Thus, B is not positive
semidefinite.
In the next theorem we list basic properties of positive definite functions on V which
can be proved in the same way as in Section 1.4 and Section 1.5.
Theorem 4.1.5.
(i) If f 2 P .V /, then f is Hermitian.
(ii) Let f1 ; f2 2 P .V /. Then the functions f1 , Ref1 , jf1 j2 , and f1 f2 are positive
definite. Moreover, p1 f1 C p2 f2 is positive definite for all p1 ; p2 0.
(iii) The set P .V / is a convex cone closed in the topology of pointwise convergence,
while P0 .V / is a compact convex set.
(iv) The inequalities of Theorem 1.4.12 hold for a function f 2 P .V / whenever the
used arguments of f lie in V .
(v) Let G D Rd or G D Td , let V be open and f 2 P .V /. If Re f is continuous
at the neutral element, then f is uniformly continuous on V .
(vi) Let V Rd and assume that V is an open ball with center 0. If f 2 P c .V / is
such that jf .t /j D 1 for all t from a neighborhood of 0, then
f .t / D ei.x;t/ ; t 2V
for some x 2 Rd .
The next proposition can be proved in the same way as Lemma 1.5.8. We omit the
proof.
Further, let TV2 denote the set of all q 2 TVC of the form
X
n
qD jqj j2 for some n 2 N
j D1
where
qj D O j 2 TV and supp .j / supp .j / V: (1)
It is clear that
TV2 TVC TVr TV
204 Chapter 4 The extension problem
TV is a complex linear space, TVr is a real linear space and the sets TV2 and TVC are
convex cones. Note that TV2 contains the constant character and hence TV2 6D ;. By
(1.8.3.vi) a function p D O 2 TV belongs to TVr if and only if D .
Q On TV and TVr
we will use the supremum norm
kpk1 D supfjp. /j W O
2 Gg:
Assume that V is finite. Then both of the linear spaces TV and TVr are finite dimen-
sional and we have dim .TV / D dim .TVr / D d , where d is the number of points of
V . For a complex-valued Hermitian function f on V we set
X Z
Lf .p/ WD L
f .x/p.x/ D f d; p D O 2 TVr :
x2V V
Lemma 4.1.8. If V is finite, then the cone TV2 is closed in TVr with respect to the norm
k k1 .
Proof. Let d denote the dimension of TVr . We prove that every q 2 TV2 is a sum of d
squares jqj j2 with some qj satisfying (4.1.7.1). Suppose that
X
m
qD jqj j2 (1)
j D1
where m > d . Since m > dim .TVr /, there exist real numbers rj , not all zero, such
that
Xm
rj jqj j2 D 0:
j D1
We may assume, without loss of generality, that jr1 j jr2 j jrm j and r1 ¤ 0.
Solving the above equation for jq1 j2 and substituting the solution into (1) we obtain
X
m
qD .1 rj =r1 /jqj j2
j D2
which is a sum of m 1 squares. Repeating these arguments we see that every q 2 TV2
is a sum of d squares.
Suppose that a sequence fqn g1 1 TV converges to q 2 TV with respect to the
2 r
norm k k1 . By the first part of the proof there exist functions qj;n satisfying (4.1.7.1)
such that
X d
qn D jqj;n j2 : (2)
j D1
Section 4.1 General results 205
The sequence fqn g1 1 being convergent, there exists a constant K such that
kqn k1 K for all n. It follows from equation (2) that kqj;n k21 K for all j
and n. The normed space TV being finite dimensional, there is a sequence ni ! 1
such that qj;ni converges to some qj0 2 TV ; j D 1; : : : ; d . Since the corresponding
measures j;ni converge weakly, qj0 satisfies the condition on the support in (4.1.7.1).
Obviously we have
Xd
qD jqj0 j2 :
j D1
Lf .p/ 0 f or al l p 2 TVC :
P
Proof. To prove the first statement assume that f 2 P .V / and let D niD1 ci ıxi
be such that xi 2 V and xi xj 2 V for all i; j . Then supp ./ supp ./ V .
Since . /O
Q D jj
O 2 we have
Z X
n
O 2/ D
Lf .jj f d. /
Q D f .xi xj /ci cj :
V i;j D1
This relation shows that f is positive definite if and only if (1) holds.
To prove the second statement. suppose that f can be extended to a positive definite
function 2 P .G/. If p 2 TVC , then pL is a positive definite function on G with
L V . Using the fact that pL 2 P .G/ and applying Theorem 1.9.6 we obtain
supp .p/
X X
Lf .p/ D L
f .x/p.x/ D L
.x/p.x/ D . p/O.0/
L 0:
x2V x2G
To prove the converse statement, assume that Lf .p/ 0 for every p 2 TVC . Without
loss of generality we may suppose that f .0/ D 1. If p q; p; q 2 TVr , then q p 2
TVC and hence Lf .p/ Lf .q/. From this we conclude that
O Since
linear functional L of norm 1 on the linear space Cr .G/.
1 D L.1/ kLk D 1
L is nonnegative (cf. Lemma E.1.5). The Riesz representation theorem (see Theo-
O such that
rem E.1.3) shows the existence of a (nonnegative) measure 2 M.G/
Z
L.p/ D Lf .p/ D p. / d. /; p 2 TVr : (2)
O
G
Theorem 4.1.10. Let V be finite. The following two conditions are equivalent:
(i) every function f 2 P .V / can be extended to a function in P .G/;
(ii) TVC D TV2 .
Proof. Suppose that TVC D TV2 and let f 2 P .V /. By the first statement of
Lemma 4.1.9 the inequality Lf .p/ 0 holds for all p 2 TV2 D TVC . It follows
from the second statement of Lemma 4.1.9 that f admits a positive definite extension.
By Lemma 4.1.8, the convex cone TV2 is closed. If TV2 6D TVC then there exists a real
linear functional L on TVr such that L is nonnegative on TV2 and L.p0 / < 0 for some
p0 2 TVC (cf. Corollary D.4.3 with F D TV2 and C D fp0 g). Let f be the Hermitian
function on V for which L D Lf . Lemma 4.1.9 shows that f is positive definite on
V but has no positive definite extension.
Example 4.1.11. Let Va and r be as in Example 4.1.3 and assume that r can be
extended to a function fr 2 P .T/. By (4.1.5.v), the function fr is continuous. Arguing
as in the proof of Corollary 1.4.13, we see that fr must be a character of T. Thus,
fr .t / D eint with some n 2 Z. Consequently, r can be extended to a positive definite
function on T if and only if r is an integer.
In the rest of this section G1 and G2 are commutative groups, V2 is a symmetric
subset of G2 containing 0 and G denotes the product group G1 G2 . We identify
G1 and G2 with the subgroups G1 f0g and f0g G2 , respectively. Doing so, every
element x 2 G has a unique decomposition x D x1 C x2 with xi 2 Gi ; i D 1; 2.
Section 4.1 General results 207
is positive definite on V .
X
2
fcy .x/ D f .x C hl hk /dl dk ; x 2 V:
k;lD1
X
n X
2 X
n
fcy .xi xj /ci cj D f .xi;l xj;k /ci;l cj;k 0
i;j D1 k;lD1 i;j D1
y
i.e., fc 2 P .V /.
f .x C y/ D f .x/f .y/
Proof. This lemma can be proved by the same argument that was used in the first part
of the proof of Theorem 1.4.20. We omit the details.
f .x C y/ D f .x/f .y/; x 2 G1 ; y 2 V:
.x C y/ WD f .x/ 2 .y/; x 2 G1 ; y 2 G2
208 Chapter 4 The extension problem
Proof. By Theorem 4.1.10 it suffices to show that TVC D TV2 . The character group
of Z is T and each function p 2 TV can be written as
X
N
p.z/ D cn z n ; z2T
nDN
with some complex numbers cn . The equality TVC D TV2 follows now at once from
Theorem B.1.4.
Proof. (i) Let g be a positive definite function on R such that the support of g is a
finite subset of V . We show that the function h, which is equal to fg on V and is
equal to zeronon R n V , is positive definite on R. For this we prove that the matrix
h.xi xj / i;j D1 is positive semidefinite for all x1 ; : : : ; xn 2 R. Without loss of
generality assume that x1 x2 xn . We set
Then S and cij satisfy the conditions of Theorem D.2.19 (see also
the
n beginning of
Remark 4.1.2). Thus, there exists a positive semidefinite matrix aij i;j D1 such that
Section 4.2 The cases Rd and Zd 209
aij D cij for .i; j / 2 S . Schur’s Theorem D.2.12 shows that the matrix
n
aij g.xi xj / i;j D1
Setting7 X
Lf .q/ WD L
f .x/q.x/; q 2 TVr
x2V
f .t / D 1 jt j; t 2R
and its restrictions fa WD f j.2a;2a/ ; a > 0. Using Pólya’s Theorem 3.9.11 we see
that fa is positive definite if a 1. If a > 1, then fa is not positive definite since its
1 2 2
eit.kC 2 / a ; t 2 R:
kD1
.k C 2/
9 is a so-called Nevanlinna function: it is holomorphic in the upper half plane and has a nonnegative
imaginary part there.
Section 4.2 The cases Rd and Zd 211
Proof. We consider only the case d D 2, the general case can be treated in the same
way. Denote by L the real linear space of all polynomials
X
2M
P .s; t / D am;n s m t n ; am;n 2 R
m;nD0
of two real variables. Since the character group of Z2 is T2 , the linear space TVr consists
of functions of the form
X
M
p.z1 ; z2 / D cm;n z1m z2n ; z1 ; z2 2 T
m;nDM
where cm;n 2 C and cm;n D cm;n . For p 2 TVr we define the function Lp by
sCi t Ci
.Lp/.s; t / WD .1 C s 2 /M .1 C t 2 /M p ; ; s; t 2 R:
si t i
Noting that .r C i/=.r i/ 2 T for all r 2 R, we see that Lp 2 L. The mapping L is
obviously linear and Lp ¤ 0 if p ¤ 0. From
dim .L/ D dim .TVr / D .2M C 1/2
Combining Theorem 4.1.10 and Theorem 4.2.5 we obtain the following theorems:
Theorem 4.2.6 (Calderón–Pepinsky). If d and M are greater than 1, then there exists
d
a positive definite function on SM which cannot be extended to a positive definite
function on Z .
d
Proof. Write
U WD ft 2 Rd W jtj j < 1=2 for all j g
R
and choose a continuous function h such that supp .h/ U and Rd jhj2 d D 1. Let
K WD h hQ and define by
X
.t / WD f .n/K.t n/; t 2 Rd :
d
n2SM
Note that K and are continuous. Using that K.0/ D 1 and K.m/ D 0 if m 2 Zd nf0g,
we see that is an extension of f . It remains to prove that is positive definite, i.e.,
X
n
.xi xj /ci cj 0 (1)
i;j D1
Setting f .n/ WD d
0 outside of SM and using the definition of we see that the left-hand
side of (1) is equal to
X Z
f .n/ H.y n/H.y/ dy: (3)
Rd
n2Zd
Section 4.3 Decomposition of locally defined positive definite functions 213
It follows from
Z X Z
H.y n/H.y/ dy D H.y n m/H.y m/ dy
Rd U
m2Zd
Using the definition of H we see that H.y C n/, where y 2 U and n 2 Zd , can be
different from zero only if n 2 SM
d and n has nonnegative coordinates. Thus, (4) is not
d
changed if we restrict m and n to lie in SM and to have nonnegative coordinates. For
such m and n we have m n 2 SM . Since f is positive definite on SM
d d
, the integrand
in (4) is nonnegative from which (1) follows.
In Section 4.4 we return to the extension problem and show that if V is a ball in Rd
and f is a continuous radial positive definite function on V , then there exists a radial
positive definite function on Rd that extends f .
Proof. We may assume that h is real-valued. If h does not vanish -almost every-
where, then there exist x0 2 V and an open symmetric neighborhood W of 0 such
that x0 C W C W V and h does not vanish -almost everywhere on x0 C W . We
choose a function g 2 Cr00 .Rd / vanishing off x0 C W such that
Z
h.y/g.y/ dy D 1:
W Cx0
jg.y/j dy
Rd
for all > 0. This contradiction shows that h vanishes -almost everywhere.
Section 4.3 Decomposition of locally defined positive definite functions 215
f D fc C f0
Let fWn g1
nD1 be a sequence of balls with center zero such that
1
\
W1 C W1 V; WnC1 C WnC1 Wn and Wn D f0g:
nD1
For all t 2 V let N.t / 2 N be the smallest integer such that t C WN.t/ V . Then
N.0/ D 1 and ft C Wn g1 nDN.t/
is a neighborhood basis of t . If n N.t /, then .; /
is a well-defined semidefinite inner product on the linear manifold Mf .t C Wn /.
Therefore, Mf .t C Wn / can be completed to a Hilbert space Hn .t /. The element of
10 The case where V is the whole group has been considered in [11] in a more general setting.
11 Note that if V ¤ Rd , then .; / is not an inner product in Mf .V / since it is not defined for all pairs
of and .
216 Chapter 4 The extension problem
Ut W Mf .s C WN.t;s/ / ! Mf .s t C WN.t;s/ /
hold for all t; s 2 V and for all h; g for which all the expressions above are well-
defined.
Set
\1
H0 WD Hn .0/:
nD1
Then ı0 2 H0 and H0 is a Hilbert space. Since HN.t;0/ .0/ dom.t /, the mapping
Ut is well-defined on H0 .
Next we define two semigroups of operators that will play an important role in the
proof of Theorem 4.3.4. If m > 1, then Wm C Wm Wm1 and hence the relations
t 2 Wm and 2 Mf .Wm / imply that Ut 2 Mf .W1 /. Thus, Ut maps Hm .0/ and
H0 into Hm1 .0/. For t 2 Wm we denote by Ut0 the restriction of Ut to H0 . Then Ut0
12 That this is possible can be easily seen, e.g., by applying the completion procedure which uses Cauchy
sequences.
13 We use the word “mapping” to indicate that domain and range of Ut need not be Hilbert spaces.
Section 4.3 Decomposition of locally defined positive definite functions 217
is an isometric linear operator on H0 with range in Hm1 .0/. For m > 1 set
Um WD fUt0 W t 2 Wm g
the closure of U in the weak operator topology.14 Write further
and denote by Um m
\
S0 WD Um
m>1
and let S be the closure, in the weak operator topology, of the convex hull of S0 .
and S are weakly
The set Um consists of operators of norm 1 and hence the sets Um 0
compact. By Theorem D.5.9, the set S is weakly compact as well. Moreover, in view
of Theorem D.5.5, the norm of the operators in S and S0 is at most 1.
Proof. It suffices to prove the statement concerning S0 . If m0 > m > 1, then we have
Wm0 C Wm0 Wm and
Ut0 H0 Hm .0/; t 2 W m0 :
Consequently,
Um 00 H0 Hm .0/; m00 m0 :
We conclude that S0 H0 Hm .0/ whenever m > 1 and therefore S0 H0 H0 .
In the sequel we will consider the elements of S and S0 as operators from H0 into
H0 .
Lemma 4.3.9. Let m0 < m. The mapping t 7! Ut0 c from Wm0 into Hm .0/ is weakly
continuous at 0.
sP D P; s 2 S0 S
g0 .t / WD .UtCt0 c ; h/
Proof. Since 0 D P 0 2 S0 , the lemma follows from the definitions of S and
S0 and from the fact that the weak and norm closures of the convex hull of S0 0
coincide.
.Ut0 c ; 0 / D .Ut0 0 ; c / D 0:
By the previous lemma we can choose pi and ti such that inequality (4.3.11.1) holds.
We then have
ˇ n ˇ ˇ ˇ
ˇ X ˇ ˇ X
n
ˇ
ˇ ˇ ˇ
pi Ut0 ti c ; 0 ˇ D ˇ Ut0 c ; pi Uti 0 ˇˇ kUt0 c k:
ˇ
iD1 iD1
where tj 2 V and cj 2 C.
By the previous lemma we have
f .t / D .ı0 ; Ut ı0 / D .c C 0 ; Ut .c C 0 // D fc .t / C f0 .t /; t 2 V:
The continuity of fc follows from Lemma 4.3.10. To prove that f0 averages to zero, let
> 0 and m 2 N be arbitrary and choose tj and pj as in Lemma 4.3.11. We then have
ˇ n ˇ ˇ ˇ
ˇX ˇ ˇ X
n
ˇ
ˇ ˇ ˇ
pi f0 .t ti /ˇ D ˇ 0 ; pi Utti 0 ˇˇ (2)
ˇ
iD1 iD1
ˇ n ˇ
ˇ X ˇ
D ˇˇ p i U ti 0 ; U t 0 ; ˇ
ˇ (3)
iD1
X
n
kUt 0 k p i U ti 0
: (4)
iD1
Recall that B o .r/ Rd denotes the open ball with center 0 and radius r > 0.
A function g defined on B o .r/ is called a radial function if g.t / D g.O t / for all
t 2 B o .r/ and for all orthogonal matrices O 2 O.d /. Taking O D I we see that
radial positive definite functions on B o .r/ are real-valued.
The same argument as in Lemma 3.6.2 shows that g W B o .r/ ! C is radial if and
only if there exists a complex-valued function h on Œ0; r/ such that
Proof. We may assume that f0 .0/ D 1. Let k be a positive integer and let 0 < q < r.
Let t 2 Rd be a fixed point of norm q and choose ı > 0 such that Sı Sı B o .r/
where
Sı D fx 2 Rd W kxk D q; kx t k ıg:
Choose independent random vectors x1 ; : : : ; xk on a probability space .; A; P / such
that xj .!/ 2 Sı for all ! and xj is uniformly distributed in Sı for all j . Further,
let x0 be the random variable on which is identically zero. Write ˛0 D 1 and
˛j D f0 .t /; 1 j k. Since f0 is positive definite the inequality
X
k X
k
f0 .xi .!/ xj .!//˛i ˛j D
iD0 j D0 X
1 kf .t /2 C 2f .t /2 f0 .xi .!/ xj .!// 0
1i<j k
holds for all ! 2 . Assume that d D 2. It is easy to see that the random variables
kxi xj k; i ¤ j , are absolutely continuous with respect to the Lebesgue measure on
R.16 Since f0 is radial and vanishes Lebesgue almost everywhere, we conclude that
E .f0 .xi xj // D 0; 1 i < j k. Thus, the inequality 1 kf .t /2 0 holds for
all k from which f .t / D 0 follows. This proves the lemma for d D 2.
In the case d > 2, we consider the function g0 .s/ WD f0 .s; 0; : : : ; 0/; s 2 B2o .r/.
By the first part of the proof, g0 .s/ D 0 for all s ¤ 0. Since f0 is radial we infer that
f0 .t / D 0 for all t 2 B o .r/ n f0g.
Combining Theorem 4.3.4 and the preceding lemmata we obtain the following
result.17
f D fc C p1f0g
Theorem 4.4.1 (Rudin). Every radial function ' 2 P c .Bdo .2r// can be extended to
a radial positive definite function on Rd .
where fk is infinitely differentiable and vanishes outside B o .r/. Note that fO is non-
negative and radial.
Fix f 2 C1 O
r , write Mf WD kf k1 , and choose 2 .0; 1/ such that the support
of f lies in B .2r /. By Lemma 3.2.4 there exists a sequence fgi g of functions
o
gi 2 Pr such that
Letting i ! 1 we obtain
Z
f .x/'.x/ dx .ı C Mf /'.0/:
B o .2r/
and jj.X / '.0/. It is not hard to check that is radial and real. Applying equation
(3) to gi in place of f and letting i ! 1, we obtain '.0/ D .Rd / jj.Rd /.
Section 4.4 Extension of radial positive definite functions 223
This right-hand side of this equation defines a radial positive definite function on the
whole of Rd . This completes the proof of the theorem.
Combining Theorem 4.4.1 with Theorem 4.3.15 we obtain the following result.
Selected applications
A chapter on applications could easily fill several hundred pages so that we had to be
very restrictive in our choice of the topics. We start with the probably ‘most classical’
application of characteristic functions: limit theorems. Even here we had to restrict
ourselves to present only basic methods. In the remaining sections we treat topics
which usually cannot be found in other textbooks in this form.
Theorem 5.1.1. Let be a d-dimensional distribution such that the first order mo-
ments are all equal to zero while the second order moments are finite. If X1 ; X2 ; : : :
are independent random vectors all having the distribution , then the distributions
of the random vectors
X1 C X2 C C Xn
Sn D p
n
tend weakly, as n ! 1, to the normal distribution having the same first and second
moments as .
lim P .kSn mk / D 0
n!1
Proof. Denote by f the characteristic function of the random vectors Xj . Then the
characteristic function fn of Sn is given by fn .t / D f .t =n/n . By Theorem 1.2.6,
1 X
d n
fn .t / D 1 C Œimj C Rj .t =n/ tj ; t 2 Rd
n
j D1
lim fn .t / D ei.m;t/ :
n!1
X1;1
X2;1 ; X2;2
::
:
Xn;1 ; Xn;2 ; : : : ; Xn;n
::
:
be random variables which are independent and identically distributed within each
row of the array above and such that
P .Xn;k D 1/ D pn ; P .Xn;k D 0/ D 1 pn
we have
k
lim P .Sn D k/ D e ; k 2 N0 : (1)
n!1 kŠ
Proof. The characteristic function of Sn is given by
n
fn .t / D 1 C pn eit 1 ; t 2 R:
n!1
Thus, by 1.6.3, the distributions of the Sn ’s converge weakly to the Poisson distribu-
tion with parameter (see (1.1.13.d)) from which (1) follows.
B1 C B2 C
Definition 5.2.2. If fn g1 1 is a sequence of distributions, we say that the infinite con-
volution 1 2 is convergent if there exists a distribution such that
1 n ! :
Proof. The equivalence of (i) and (ii) follows immediately from Theorems 1.6.1
and 1.6.3. Let d W N ! N be arbitrary. Writing n D nC1 nCd.n/ and
n D 1 n we have nCd.n/ D n n . Consequently, if n ! we also
have n n ! , hence n ! ı0 by Lemma 5.2.3. Conversely, assume that n ! ı0
for all d W N ! N and let K Rd be compact. Then
where kk denotes the supremum norm on K. We show that fpn g is a Cauchy sequence
with respect to the supremum metric on K from which (ii) follows. Assume, on the
contrary, that there exists > 0 such that for all n 2 N there exist positive integers
a.n/; b.n/ with n < a.n/ < b.n/ such that the distance of pa.n/ and pb.n/ is at least
. We have
pa.n/ pb.n/
D pa.n/ 1 fa.n/C1 fb.n/
1 fa.n/C1 fb.n/
1 fnC1 fb.n/ C fnC1 fb.n/ fa.n/C1 fb.n/
D 1 fnC1 fb.n/ C fa.n/C1 fb.n/ 1 fnC1 fa.n/
1 fnC1 fb.n/ C 1 fnC1 fa.n/ ! 0
Theorem 5.2.7. If Mj;j .n / is finite for every n 2 N and j D 1; : : : ; d , then the
convergence of the two series
Proof. Assume first the E.n / D 0. Since Mj;j .n / < 1, the characteristic function
gn of n possesses continuous partial derivatives of order at least two and those of the
first order vanish at 0. Using this, Corollary 1.2.7 shows that
1 X
d
gn .t / 1 D ŒRe gn;ti ;tj .n t / C i Im gn;ti ;tj .n t / ti tj
2
i;j D1
1 X d
d
jgn .t / 1j p Var.n / jti tj j p Var.n / kt k2 (1)
2 i;j D1
2
Remark 5.2.8. The converse of the previous theorem is false. To see an example,
let X1 ; X2 ; : : : be independent
P random variables such that P .Xn D 0/ D 1 pn
where 0 pn 1 and 1 1 p n < 1. The Borel–Cantelli Lemma F.2.1 shows that
230 Chapter 5 Selected applications
P1 P
1PXn is a finite sum with probability 1. On the other hand, the series 1 1 E .Xn /
and 1 1 Var .X n / may not converge. We can take for example p n D 1
n2
and choose
1
Xn such that P .Xn D n / D n2 . Then E .Xn / D 1 and Var .Xn / D n 1.
2 2
Theorem 5.2.9. If for some R 0 the supports of all n are contained in the sphere
fx W kxk Rg, then the convergence of the two series
Proof. The sufficiency follows from Theorem 5.2.7. In order to prove the neces-
sity assume that 1 2 converges to some distribution and let gn ; gnc
and g be the characteristic functions of n ; cn and , respectively. Suppose that
the series Var.1 / C Var.2 / C is divergent. Then it follows from Var.n / D
Pd
j D1 Mj;j .n / that for some value of j the series Mj;j .1 / C Mj;j .2 / C is
c c c
3 I would like to thank Christian Berg for making Jessen’s original work [30] available to me.
Section 5.2 Sums of independent random vectors and the Jessen–Wintner purity law 231
Theorem 5.2.10 (Jessen). Let f 2 L1 .PQ / and for each n 2 N define the function fn
by Z
Q D
fn .!/ f .!1 ; : : : ; !n ; !Q n / dPQn .!Q n /; !Q 2 :
Q
Qn
Then
Q D f .!/
lim fn .!/ Q
n!1
Theorem 5.2.11. A necessary and sufficient condition for the convergence of the in-
finite convolution 1 2 is that if X1 ; X2 ; : : : are independent random vectors
on a probability space .; A; P / having the distributions 1 ; 2 ; : : :, then the series
X1 C X2 C is convergent P -almost everywhere. The distribution of the random
vector S D X1 C X2 C is then D 1 2 .
The set of points where an infinite series of random vectors converges is a tail
event. Combining the previous theorem and the zero–one law of Kolmogorov (cf.
Theorem F.2.5) we obtain the following corollary.
5.2.13. The following notation will be used in the next theorem. Let fXn g be a se-
quence of d-dimensional random vectors and, as in Lemma 1.11.3, define the random
vectors Xn;R , where n 2 N and R 2 .0; 1/, by
(
Xn .!/ if kXn .!/k R
Xn;R .!/ D ! 2 :
0 if kXn .!/k > R
Denoting by n and n;R the corresponding distributions we have
n;R D n;R C .1 n .BR // ı0
where BR D fx 2 Rd W kxk Rg and n;R .A/ D n .BR \ A/; A 2 Bd .
so that X1;R C X2;R C is almost surely convergent in view of Lemma F.2.2. From
Theorem 5.2.9 we see that the two series in (2) are convergent.
Assume now that the three series in (1) and (2) are convergent. Theorem 5.2.9
then shows that X1;R C X2;R C is almost surely convergent. Applying again
Lemma F.2.2 we see that X1 C X2 C is almost surely convergent.
1 X ikx
n
sn .x/ D e ; x 2 R; n2N
n
kD1
Im
1
0:5 1 Re
1
Proof. In view of Lemma 5.3.1 the sequence fsn g converges in L2 . / to 1f0g . The
theorem follows by applying the isometric operator IZ (cf. Remark 2.9.12) to sn and
noting that IZ .1f0g / D .f0g/ (cf. equation (2.7.1.3)).
Proof. This theorem can be proved in the same way as the previous one by using the
relations Z
1 T itx eiT x 1
e dt D ; x ¤ 0; T > 0 (1)
T 0 iT x
and Z Z
1 T itx 1 T
IZ e dt D Z.t / dt
T 0 T 0
(see equation (2.9.13.1)).
span fUt g W t 2 T g:
It is easy to check that H.g/ is a .Ut /-invariant subspace of H . The orthogonal pro-
jection from H.Z/ onto H.g/ will be denoted by Pg . It follows from Lemma 2.10.16
that Pg commutes with all operators Ut :
Ut P g D P g Ut ; t 2 T; g 2 H: (1)
Ut h D h; t 2 T:
We denote by H0 the set of all fixed points. This set is a .Ut /-invariant subspace, the
orthogonal projection onto H0 will be denoted by P 0 . We have
P 0 Ut D Ut P 0 D P 0 : (2)
The first equation follows from the .Ut /-invariance of H0 , the second one from
P 0 H D H0 .
Proof. (i) Equations (5.3.6.1) and (5.3.6.2) show that Ut Pg P 0 D Pg P 0 . Thus, for
all h 2 H the vector Pg P 0 h is a fixed point, i.e., Pg P 0 h 2 H0 . Consequently,
P 0 Pg P 0 h D Pg P 0 h; h2H
and hence
P 0 Pg P 0 D Pg P 0 : (1)
Since Pg and P 0 are orthogonal projections, we see that
.P 0 Pg h; h0 / D .h; Pg P 0 h0 / D .h; P 0 Pg P 0 h0 / D .P 0 Pg P 0 h; h0 /
Section 5.3 Ergodic theorems for stationary fields 237
P 0 g D P 0 Pg g D Pg P 0 g 2 H.g/
X
kn
h0 D lim cj;n Utj;n g
n!1
j D1
with some kn 2 N; cj;n 2 C and tj;n 2 T . Using equation (5.3.6.1) we see that
X
kn X
kn
h0 D P 0 h0 D lim cj;n P 0 Utj;n g D lim cj;n P 0 g
n!1 n!1
j D1 j D1
i.e., h0 is proportional to P 0 g.
.f0g/ D P 0 Z.s/
for all s 2 T .
Proof. We consider only the case Rd , the case Zd can be treated in the same way.
(i) If h is a fixed point of .Vt /, then
H0 D C IZ .1f0g / D C .f0g/:
238 Chapter 5 Selected applications
If .f0g/ D 0, then .f0g/ D 0. The equation above shows that H0 D f0g and hence
P 0 D 0. Thus, the equation in (iii) holds whenever .f0g/ D 0.
If .f0g/ ¤ 0, then .f0g/ ¤ 0, the subspace H0 is one-dimensional and P 0 ¤ 0. We
obviously have P 0 Z.s/ 2 H0 . By equation (5.3.6.2),
and hence P 0 Z.s/ does not depend on s. From the definition of H.Z/ and from
the fact that P 0 ¤ 0 we conclude that P 0 Z.s/ ¤ 0. Since H0 is one-dimensional,
P 0 Z.s/ D P 0 Z.0/ D cIZ .1f0g / with some c 2 C n f0g. As P 0 is an orthogonal
projection we have
and
.Z.0/; P 0 Z.0// D .1; c1f0g / D .1f0g ; c1f0g /:
Thus, c D 1.
1X
n
1
D ık or D jŒ0;T :
n T
kD1
Theorem 5.3.10 (ergodic theorem). Let Z be a continuous stationary field on the set
T D Rd or T D Zd and let fn g be a sequence of probability measures such that
lim L n .t / D 0; t 2 T n f0g:
n!1
Then Z
lim Z.t / dn .t / D .f0g/ D P 0 Z.s/; s2T
n!1 T
Z
1 kt k2
lim d=2
Z.t / e 2n dt D .f0g/:
n!1 .2 n/ Rd
(ii) Let n be the uniform distribution on the cube Œ0; nd Rd . The characteristic
function is then
Yd
1 e2itj n
t 7! :
2itj n
j D1
If Z .fx W a.x/
O D 0g/ D 0, then
Z
1
Z.t / D ei.t;x/ dY .x/; t 2 Zd
Œ0;2/d O
a.x/
i.e., the convolution Z 7! a Z is invertible.
The sum on the right-hand side converges in L2 .Z /. Using the isometric operator IZ
we see that the sum on the left-hand side converges in H.Z/. Taking the limit n ! 1
we obtain the spectral representation of Y . The rest follows from Lemma 2.6.11.
Remark 5.4.2. If Z has a constant density, then the series (5.4.1.1) converges in
L2 . / for all a 2 L2 .Zd /, since the functions ei.;n/ have the same norm and build
a (complete) orthogonal system in this Hilbert space.
The next theorem is important in signal processing.
with a white noise X W N.0; 1/ on Zd and a 2 L2 .Zd / if and only if the spectral
measure of Y is absolutely continuous.
Proof. Assume that Y has the representation above. By Example 2.9.5 and Theo-
rem 5.4.1, the spectral measure of Y is given by
1
d D O 2d
jaj d
.2/d
i.e., is absolutely continuous.
Now assume that is absolutely continuous and let C be the correlation function
of Y . By Theorem 1.8.16 there exists a function g 2 L2 .Zd / such that
X
C.t / D g g.t
Q /D g.t C n/g.n/; t 2 Zd
n2Zd
Section 5.4 Filtration of discrete stationary fields 241
where
1
hD 2 L2 . /
.2/d=2 gO
(since d .x/ D jg.x/j
O 2d
d .x/ O In view of
we may ignore the zeros of g).
Z Z
1
.h .A/; h .B// D 1A 1B jhj2 jgj
O 2d d D 1d d
Œ0;2/d .2/d A\B
defines a white noise X W N.0; 1/ (see Example 2.9.5). By Lemma 2.6.11 we have
Z Z
Y .t / D e i.t;x/
d.x/ D ei.t;x/ .2/d=2 g.x/
O dh .x/
Z 1
X
D ei.t;x/ .2/d=2 ei.x;n/ g.n/ dh .x/
nD1
1
X Z
d=2
D .2/ g.n/ ei.tn;x/ dh .x/
nD1
X1
D .2/d=2 g.n/X.t n/
nD1
Basic notation
1 Recall that a continuous function f on S vanishes at infinity if and only if for all > 0 there exists a
compact set K S such that jf .x/j < ; x 2 S n K.
Section A.2 Multidimensional notation 243
and
Bdc .r/ D ft 2 Rd W kt k rg:
Appendix B
Basic analysis
X
k1
k k
z w D .z w/ z s w k1s ; k2N
sD0
we obtain
ˇ n ˇ
ˇ X zk wk ˇ X
n
1 X s k1s
k1
ˇ ˇ
ˇ ˇ jz wj jzj jwj
ˇ kŠ ˇ kŠ
kD1 kD1 sD0
X
n
1
jz wj k max.jzj; jwj/k1
kŠ
kD1
X
n1
1
D jz wj max.jzj; jwj/k
kŠ
kD0
jz wj emax.jzj;jwj/ :
By Lemma B.1.1, the first sum in the last display tends to zero if n ! 1. Denote by
sn the second sum. Applying Bernoulli’s inequality1 we obtain
X
n
jzn jk .k 1/k jzn j2 jzn j
jsn j e ! 0 .n ! 1/:
kŠ 2n 2n
kD1
Proposition B.1.3 (binomial series). For all ˛ > 0 and s 2 .1; 1/ we have
1
X
1 .k C ˛/
D .1/k sk (1)
.1 C s/˛ .k C 1/.˛/
kD0
(cf. Lemma C.4.2) we see that limk!1 jakC1 =ak j D 1. Thus, the radius of con-
vergence of the power series above is equal to 1. Applying again (2), the relation
.k C 1/ D kŠ, and using induction on k it is easy to check that (1) is the Taylor series
1
expansion of s 7! .1Cs/˛ at s D 0. This completes the proof.
X
N
p.z/ D cn z n ; cn 2 C; z 2 C n f0g:
nDN
X
N
q.z/ D bn z n ; z; bn 2 C
nD0
such that
p.z/ D q.z/q.1=z/ ; z 2 C n f0g:
In particular, p.z/ D jq.z/j2 holds for all z 2 T.
Y
N Y
N
g.z/ D cN .z zj / .z 1=z j /
j D1 j D1
and hence
Y
N Y
N
N
p.z/ D z g.z/ D cN .z zj / .1 1=zz j /
j D1 j D1
Y
N Y
N
Dc .z zj / .1=z z j / (2)
j D1 j D1
Section B.1 Miscellaneous results from classical analysis 247
Q 1
N
where c D cN j D1 .z j / . Since p is nonnegative on T the constant c is posi-
p QN
tive. Setting q.z/ WD c j D1 .z zj / and applying (2) we obtain
Proof. Assume first that n D 0 and let z D a C ib; a; b 2 R, be arbitrary. Using the
identity Z 1
eiz 1 D iz eitz dt
0
we obtain
ˇZ ˇ Z
ˇ iz ˇ ˇ 1 ˇ 1ˇ ˇ
ˇe 1ˇ D jzj ˇ eitz dt ˇˇ jzj ˇeitz ˇ dt
ˇ
0 0
Z Z (
1ˇ ˇ 1 1; if b 0;
ˇ itatb ˇ tb
D jzj ˇ e ˇ dt D jzj e dt jzj
0 0 ejzj ; if b 0:
Setting
.iz/n
rn .z/ WD eiz 1 iz ; n 2 N0
nŠ
it is not hard to check that
Z 1
rnC1 .z/ D iz rn .t z/ dt:
0
B.1.6. Here we collect some simple but useful inequalities for trigonometric functions.
Integrating the inequalities
1 1 1 5
x x 3 < sin x < x x 3 C x ; x > 0: (3)
6 6 120
The inequality
sin y
sin x x ; 0<y ; xy (4)
y 2
can be proved easily, e.g., by using geometrical arguments. Setting here y D 1 and
applying (3) we conclude that
sin x sin 1 101
< ; jxj 1: (5)
x 1 120
Next we show that
either sin y D 0 or
sin ny
sin y 6D 0 and n cosn1 y D : (7)
sin y
It is easy to check that sin y D 0 implies (6). If (7) holds, then
cos y sin ny
h.y/ D cos ny
sin y
cos y sin ny sin y cos ny sin.n 1/y
D D :
sin y sin y
A simple induction argument shows that
sin.n 1/y
n 1; n2N
sin y
and therefore h.y/ n 1.
The function Z T
sin t
Si.T / WD dt; T 2 R (8)
0 t
is frequently used (see Figure B.1). We will need its limit at infinity.
=2
5 10 15
Corollary B.1.8.
Z T
1 sin xt
lim dt D sign x; x 2 R:
T !1 T t
1
D 0:3
0.5
D1
3 2 1 1 2 3
Figure B.2. The function '0; from Corollary B.1.11.
where k 2 Nn0 .3 This formula can be proved by induction on n, applying the binomial
formula to the identity
It follows from the multinomial formula that the multinomial coefficients have a com-
binatorial interpretation as the number of partitions4 of a set with m elements in n
blocks, with k1 elements in the first block, k2 elements in the second block, and so on.
R1
2 Differentiating under the integral sign can be avoided by replacing 1t by 0 est ds and then using
Fubini’s theorem.
3 Expressions of the form 00 are defined to be equal to 1.
4 A partition of a set S is a collection of nonempty pairwise disjoint subsets of S whose union is S .
The subsets are also called blocks.
252 Appendix B Basic analysis
called the generalized Leibniz rule. This identity can be proved by induction on n
showing first that !
Xm
m
.fg/.m/ D f .k/ g .mk/ :
k
kD0
Next we prove Faà di Bruno’s formula for the higher chain rule for differentiation.5
Theorem B.1.14 (Faà di Bruno’s formula). If f and g are real functions with a
sufficient number of derivatives, then
dm
g.f .t // D
dt m
X mŠ f 0 .t / b1
f 00 .t / b2
f .m/ .t / bm
g .jbj/ .f .t // (1)
b1 Šb2 Š bm Š 1Š 2Š mŠ
where the sum is over all different solutions in nonnegative integers b1 ; : : : ; bm of
b1 C 2b2 C C mbm D m, and jbj D b1 C C bm .
except that this makes artificial distinctions among the i -blocks for each i . Therefore,
we have to divide this number by b1Š b2 Š bm Š . Thus, (1) follows from equation (2).
We prove (2) by induction on m. The case m D 1 being simple, assume that the
statement is true for some m. Every partition of SmC1 can be obtained in a unique way
by adjoining m C 1 to a partition of Sm .
5 Our proof is taken from the survey paper [32] which contains interesting remarks on the history of
this formula.
Section B.1 Miscellaneous results from classical analysis 253
Lemma B.1.16. Let fak g1 1 be a sequence of nonnegative real numbers. There exists
1
a sequence fpk g1 of positive real numbers such that
1
X
pk akn < 1
kD1
for all n 2 N.
Proof. Setting bk WD max .ak ; k/ we have ak bk and bk ! 1. Let pk D ebk .
Then for each n 2 N there exists N.n/ 2 N such that
pk bkn ebk =2 ek=2
if k N.n/. Thus,
1
X 1
X
pk akn pk bkn < 1
kD1 kD1
for all n 2 N.
for all t . Assume first that the sequence fxn g is bounded. If s is an accumulation point
of this sequence then, by equation (1), eits D 1 for all t and hence s D 0. From this
we conclude that fxn g tends to 0.
Next assume that fxn g is unbounded and choose a subsequence fyn g tending to 1
or to 1. Using equation (1) we see that
Z 1 Z 1
eiyn 1
1D lim eityn dt D lim eityn dt D lim D 0:
0 n!1 n!1 0 n!1 iyn
Thus, the sequence fxn g cannot be unbounded.
0.50
7 Note that h is equal to the function fQ1=2 from Remark 4.2.4. Therefore h, and hence also f , are
positive definite. Bochner’s Theorem 1.7.3 shows that they are characteristic functions.
256 Appendix B Basic analysis
Proof. For each x 2 Œ0; 1 and n 2 N let Ynx be a random variable on a probability
space .; A; P / such that nYnx has binomial distribution with parameters n and x. We
then have E.Ynx / D x; Var.Ynx / D x.1x/
n and
X
n n
E.f .Ynx // D f kn x k .1 x/.nk/ :
k
kD0
The theorem follows now from the fact that f is uniformly continuous.
f B5 B2 B1
1
Figure B.4. The Bernstein polynomials B1 ; B2 and B5 from Remark B.2.3 for the function
f W x 7! x 0:3 .
such that
ˇ ˇ
ˇjt j pn .t /ˇ 1 ; 1 t 1:
n
We already know that pn .g/ belongs to A. The inequalities above show that the se-
quence fpn .g/g converges uniformly to jgj, and hence jgj 2 A.
If f; g 2 A are real-valued, then max .f; g/ and min .f; g/ belong to A, since
f C g C jf gj f C g jf gj
max .f; g/ D ; min .f; g/ D :
2 2
To finish the proof it suffices to show that every real-valued function h 2 C.X / be-
longs to A. It follows at once from (iii) that for all x; y 2 X; x ¤ y, we can choose
a real-valued function fxy 2 A such that fxy .x/ D h.x/ and fxy .y/ D h.y/. Let
> 0 be arbitrary and define the sets Uxy and Vxy by
fy .t / < h.t / C ; t 2 X;
fy .t / > h.t / ; t 2 Vy WD \niD1 Vxi y :
The sets Vy are open and y 2 Vy . Using again the compactness of X we see
that there exist m 2 N and y1 ; : : : ; ym 2 X such that [m
1 Vyi D X . Choosing
f WD max .fy1 ; : : : ; fym / we have f 2 A and
Proof. We assume that the sequence is increasing. For every > 0 and x 2 X there
exists an index n.x/ D n.x; / such that
0 g.x/ fm .x/ ; m n.x/:
3
Section B.2 Uniform convergence of continuous functions 259
By compactness we can choose finitely many points xi 2 X such that the union of the
sets V .xi / is equal to X . Let n0 be the largest of the numbers n.xi /. Since each y 2 X
belongs to some V .xi / we have
Proof. We show first that f is continuous. For all x; y 2 X and for all ˛ we have
jf .x/ f .y/j jf .x/ f˛ .x/j C jf˛ .x/ f˛ .y/j C jf˛ .y/ f .y/j: (1)
For each y 2 Vx we choose an index ˛.y/ D ˛ such that jf .x/ f˛ .x/j < and
jf .y/ f˛ .y/j < . It follows from inequality (1) that jf .x/ f .y/j < 3, y 2 Vx .
Thus, f is continuous in x.
Obviously the set ff˛ ; f g is also equicontinuous. Let K X be compact. For all
x 2 K and for all > 0 there exists an open neighborhood Vx of x such that the
inequalities
jf .x/ f .y/j < and jf˛ .x/ f˛ .y/j <
260 Appendix B Basic analysis
S all ˛ and for all y 2 Vx . Since K is compact there exist x1 ; : : : ; xn such that
hold for
K niD1 Vxi . Choose ˇ such that
Now let y 2 K be arbitrary. Then y 2 Vxi for some i . Using this we see that
jf˛ .y/ f .y/j jf˛ .y/ f˛ .xi /j C jf˛ .xi / f .xi /j C jf .xi / f .y/j < 3
Proof. Let C.x/ be the closure of the set ff .x/ W f 2 F g C. Then C.x/ is bounded
and hence compact. Now F is a subset of the product space
Y
C D C.x/
x2X
Combining Theorem B.2.7 and Theorem B.2.8 we obtain the following result.
for all x 2 X . Then this net contains a subnet converging uniformly on compact sets
to a continuous function.
A proof of the next theorem can be found for example in [47].
Proof. The equivalence of (i) and (iii) is obvious. If (iii) holds then
ˇ ˇ
ˇ pn ˇ
ˇ 1ˇ < < ; n; m N
ˇp ˇ jp j ı
m m
from which (ii) follows. Assume finally that (ii) is true and let 0 < < 1. Then
ˇˇ ˇ ˇ ˇ ˇ
ˇˇ pn ˇ ˇ ˇ ˇ
ˇˇ ˇ 1ˇ < ˇ pn 1ˇ < ; n; m N:
ˇˇ p ˇ ˇ ˇp ˇ
m m
8 Note that we do not introduce the terminology convergent infinite product which is treated differently
in the literature.
262 Appendix B Basic analysis
pn D .1 C a1 / .1 C an / D 1 C a1 C C an C a1 a2 C
Proof. Write
Y
n Y
n
an D jun j; qn D .1 C aj /; pn D .1 C uj /
j D1 j D1
9 Note that Theorem B.3.5 uses only the first part of Theorem B.3.4.
Section B.3 Infinite products 263
and let > 0 arbitrary. It follows from the first part of Theorem B.3.4 that lim qn < 1.
By Lemma B.3.2, there exists N 2 N such that
qnCd
1 < ; n N; d 2 N0 :
qn
We have
ˇ ˇ
ˇ pnCd ˇ
ˇ ˇ
ˇ p 1ˇ D j.1 C unC1 /.1 C unC2 / .1 C unCd / 1j
n
D junC1 C C unCd C unC1 unC2 C j
anC1 C C anCd C anC1 anC2 C
qnCd
D 1 < ; n N; d 2 N0 :
qn
Thus, the theorem follows from Lemma B.3.2.
Proof. We have pnC1 .z/ pn .z/ D unC1 .z/pn .z/ and hence for m > n
X
m1
pm .z/ pn .z/ D pm .z/ pm1 .z/ C C pnC1 .z/ pn .z/ D uj C1 .z/pj .z/:
j Dn
X
m1
juj C1 .z/j eju1 .z/j ejuj .z/j
j Dn
X
m1
juj C1 .z/j eK :
j Dn
264 Appendix B Basic analysis
Geometrically, a convex function f has the property that for any two points x1 < x2
in I the chord joining the points .x1 ; f .x1 // and .x2 ; f .x2 // is above the graph of f .
for any n 2 N and any disjoint collection of intervals Œai ; bi Œa; b satisfying
X
n
j.bi ai /j < ı:
iD1
Theorem B.4.6 (Lebesgue). If 1 < a < b < 1 and F W Œa; b ! C, then the
following statements are equivalent:
(i) F is absolutely continuous;
Rx
(ii) F .x/ F .a/ D a f .t / dt; x 2 Œa; b
for some Lebesgue integrable function f on Œa; b;
(iii) F is Lebesgue almost
R x everywhere differentiable, F 0 is Lebesgue integrable and
0
F .x/ F .a/ D a F .t / dt; x 2 Œa; b.
Theorem B.5.2. If f is continuous and ˛ is of bounded variation in Œa; b, then the
Riemann–Stieltjes integral of f with respect to ˛ from a to b exists.
Theorem B.5.4. If f and ' are continuous and ˛ is of bounded variation in Œa; b,
and if Z x
ˇ.x/ D '.t / d˛.t /; x 2 Œa; b
c
where c 2 Œa; b is fixed, then
Z b Z b
f .x/ dˇ.x/ D f .x/'.x/ d˛.x/:
a a
B.5.6. Using the notation from the previous definition and applying Theorem B.5.3 we
obtain the formula
Z 1 Z 1
f .x/ d˛.x/ D lim Œf .b/˛.b/ f .a/˛.a/ ˛.x/ df .x/ (1)
a b!1 a
where
Z 1
n
R˛ .t / D D ˛ f .a C x.t a// D ˛ f .a/ .1 x/n1 dx:
˛Š 0
268 Appendix B Basic analysis
Proof. Since S is open and convex we can choose an open interval I Œ0; 1 such
that a C s.t a/ 2 S for all s 2 I . The function g of one real variable defined by
holds. Consequently,
X
n Z 1
g .k/ .0/ 1 .n/
f .t / D g.1/ D C g .x/g .n/ .0/ .1x/n1 dx: (2)
kŠ .n 1/Š 0
kD0
Inserting this into (2) we obtain the first statement of the theorem.
The remainders are easily seen to satisfy the given upper estimate.
Theorem B.6.2. Under the conditions of Theorem B.6.1 there exist ; 2 .0; 1/ de-
pending on t such that
X D ˛ f .a/ X
f .t / D .t a/˛ C Q˛ .t / .t a/˛
˛Š
j˛jn1 j˛jDn
where
1
Q˛ .t / D ŒRe D ˛ f .a C .t a// C i Im D ˛ f .a C .t a//:
˛Š
Section B.6 Multivariate calculus 269
Proof. The proof is almost the same as that of Theorem B.6.1, but now we apply the
one-dimensional Taylor formula
X
n1
h.k/ .0/ k 1
h.s/ D s C h.n/ .s/ s n
kŠ nŠ
kD0
2 f .x; y; ; h/
D .1;1/ f .x; y/ D fxy .x; y/ D lim ; t; s 2 .a; b/
;h!0 h
where
Proof. We may assume that f is real-valued. Let x; y 2 .a; b/ and choose ı > 0 such
that x C ; y C h 2 .a; b/ whenever ; h 2 Iı WD .ı; ı/. Setting
Now we apply the mean value theorem to the continuously differentiable function
r.h/ D fx .x C s; y C h/ fxy .x; y/h:
Lemma B.6.4. Let a; b 2 .0; 1/ be such that b > .a=3/3 . Then the polynomial
P .s; t / D s 2 t 2 .s 2 C t 2 a/ C b; s; t 2 R
is strictly positive but it is not a sum of squares of polynomials with real coefficients.
p p
Proof. It is easy to show that P attains its minimum at s D ˙ a=3; t D ˙ a=3
and the minimum is equal to b .a=3/3 > 0.
Assume that
P D P12 C C Pn2 (1)
where Pj is a polynomial with real coefficients. It follows from P .s; 0/ D P .0; t / D b
that Pj .s; 0/ and Pj .0; t / are constant for all j . Hence,
In the rest of this section we prove some simple results about multivariate poly-
nomials.
Proof. The first statement follows from the fact that Qt;s is a linear combination of
terms of the form
Yd
˛
tj C r.sj tj / j
j D1
where not all of the coefficients a˛ are equal to zero and R is a polynomial of degree
less than m. We choose t 2 S such that the sum above is different from zero. Further,
let ı ¤ 0 be a real number such that s D .1 C ı/t 2 S . Then
X
Qt;s .r/ D P .1 C rı/t D .1 C rı/m a˛ t ˛ C R t C r.s t /
j˛jDm
Applying the first statement to R we see that the degree of Qt;s is equal to m.
Proof. To simplify the notation we assume that d D 2 and write the function
f as f .x; y/; .x; y/ 2 S . We choose mutually different aj , mutually different
bj ; 0 j m, such that .ai ; bj / 2 S for all i and j and denote by Ai and Bi
the corresponding Lagrange polynomials:
Y
m
x ak Y
m
y bk
Ai .x/ D ; Bi .y/ D :
ai ak bi bk
kD1; k¤i kD1; k¤i
The function
X
m X
m
P .x; y/ D ŒAi .x/f .ai ; y/ C Bi .y/f .x; bi / Ai .x/Bj .y/f .ai ; bj /
iD1 i;j D1
Lemma B.6.7. Let P be a polynomial on Rd with complex coefficients such that for
all t 2 Rd the polynomial s 7! P .st /; s 2 R, of one variable is of degree at most k.
Then the degree of P is at most k.
Choose t0 such that Pn .t0 / ¤ 0. Using the relation Pj .st0 / D s j Pj .t0 / we see that
the degree of s 7! P .st0 / is equal to n and hence n k.
d
@hi
J h.x/ D .x/
@xj i;j D1
2 @h @h1 3
1
.x/ .x/
6 @x1 @xd 7
6 7
D6
6
::
:
::
:
7;
7 x2U
4 @h @hd 5
d
.x/ .x/
@x1 @xd
is called the Jacobian matrix of h. The real-valued function x 7! det .J h.x// is usu-
ally called the Jacobian of h.
Corollary B.7.3. Let h be as in the previous theorem and write ' D h1 . If X is a
d-dimensional random vector having density pX , then the random vector Y D h.X /
has density
pY .x/ D pX .'.x// j det .J'.x//j; x 2 Rd :
If h.x/ D Ax with a nonsingular matrix A, then
1
pY .x/ D pX .A1 x/; x 2 Rd :
j det Aj
Moreover, is rotation-invariant.
Since the map E 7! E1 takes Borel sets to Borel sets and commutes with unions,
intersections and complements, it is clear that is a Borel measure on S d 1 . The
rotation-invariance of implies that is rotation-invariant as well. Since Ea is the
image of E1 under the map t 7! at , Theorem B.7.2 shows that .Ea / D ad .E1 /.
We denote by the Borel measure on .0; 1/ S d 1 induced by the mapping
1 .B//. Further, define the measure on .0; 1/ by .A/ WD
F
R , i.e., .B/ D .F
d 1 dr. We show that D from which equation (1) follows. We have
Ar
b d ad
..a; b E/ D .Eb n Ea / D .E/
d
Z b
D r d 1 dr .E/ D ..a; b/ .E/; E 2 B.S d 1 /:
a
Using this, a standard uniqueness argument shows that for fixed E 2 B.S d 1 / the
measures and coincide on the -algebra
fA E W A 2 B..0; 1//g:
30
20
10
5 10 15 20
Figure B.5. The function d 7! 2 d=2
=.d=2/; d 2 .0; 1/, from Proposition B.7.8.
2 d=2
.S d 1 / D :
.d=2/
Proof. Using Lemma B.1.10, Corollary B.7.7, and the substitution s D r 2 we obtain
d Z
Y 1 Z
tj2
ektk dt
d=2 2
D e dtj D
j D1 1 Rd
Z 1 Z 1
d 1 d 1 r 2 .S d 1 /
D .S / r e dr D s .d 2/=2 es ds
0 2 0
.S d 1 /
D .d=2/ :
2
X
d
x D r cos 1 e1 C xj ej :
j D2
We have
X
d X
d
r 2 D kxk2 D r 2 cos2 1C xj2 and hence xj2 D r 2 sin2 1:
j D2 j D2
X
d
x D r cos 1 e1 C r sin 1 aj ej :
j D2
Pd
Since sin 1 0 we see that u2 D j D2 aj ej . We have cos 2 D .u2 ; e2 / D a2
and
X
d
u2 D cos 2 e2 C aj ej :
j D3
Moreover,
X
d X
d
1 D ku2 k2 D cos2 2C aj2 and hence aj2 D sin2 2:
j D3 j D3
X
d
u2 D cos 2 e2 C sin 2 bj ej
j D3
Pd
i. e., u3 D j D3 bj ej . Thus,
Remark B.7.11. Note that the Jacobian of the transformation (B.7.10.1) is given by
2
dY
d 1
J.r; 1; : : : ; d 2 ; / D r sink d 1k :
kD1
Section B.7 The Lebesgue integral on Rd 277
Making the substitution t D cos 1 on the right-hand side we obtain the recursive
relation
for d 3.
Appendix C
Advanced analysis
Higher order derivatives are defined recursively in the same way as for real-valued
functions. If f is differentiable at all points of D, then f is said to be holomorphic on
D. The set of all holomorphic functions on D will be denoted by H.D/. Functions
that are holomorphic on C are also called entire.
Unlike the real case, holomorphic functions are infinitely often differentiable.
where x and y are continuous real-valued functions on Œa; b. If x and y are piecewise
continuously differentiable, then is called a path. We define 0 .t / by
0
.t / D x 0 .t / C iy 0 .t /
of f along is defined by
Z Z b
0
f .z/ dz D f . .t // .t / dt:
a
.t / D z0 C .z1 z0 /t; 0 t 1:
Theorem C.1.5 (Cauchy). Suppose D is a convex open set and f 2 H.D/. Then
Z
f .z/ dz D 0
Theorem C.1.8. Suppose that D is connected and let fzn g be a sequence in D having
at least one accumulation point in D. If f and g are holomorphic functions in D and
f .zn / D g.zn / for all n, then f .z/ D g.z/ for all z 2 D.
Next we prove a result on the zeros of certain holomorphic functions.
1 The original Eneström–Kakeya theorem is on the zeros of polynomials. The present formulation and
its simple proof is due to T. Koornwinder.
Section C.1 Functions of a complex variable 281
So
j.1 z/f .z/j 1 jg.z/j 1 jzj > 0; jzj < 1:
where Qn is a polynomial of degree n, the zeros of which lie in the closed disc D.z0 ; r/
where2
1 1 1 1
z0 D nC1C and rD nC1 :
2 nC1 2 nC1
Proof. We consider the polynomial
Z
Pj .z/ D .1 t /nCj .1 C t /nj dt; 0 j n; z 2 C:
Œz;1
Then
1
Pn .z/ D .1 z/2nC1 (1)
2n C 1
and integration by parts gives
1 nj
Pj .z/ D .1 z/nCj C1 .1 C z/nj C Pj C1 .z/ (2)
nCj C1 nCj C1
for all j < n. Setting
1 n n1 nj C1
cj WD
nC1 nC2 nC3 nCj C1
from (1) and (2) we obtain
X
n
P0 .z/ D cj .1 z/nC1Cj .1 C z/nj
j D0
2 This proposition is taken from [39] where the authors state that the zeros of Qn lie in the open disc
D.z0 ; r/. However, the real part of two zeros of Q1 is equal to 12 .
282 Appendix C Advanced analysis
and therefore
X
n
1z j
Qn .z/ D .1 C z/n cj : (3)
1Cz
j D0
Using the notation
nC2 j X
n
aj WD cj ; H.w/ WD aj w j
n
j D0
Then
Z
1 2
cn r Dn
Re g reit eint dt; r 2 R; jrj < R; n 2 N:
0
Corollary C.1.14. Let g be an entire function such that for some k 2 N0 and constant
C > 0; R > 0 the inequality
Proof. By Corollary C.1.13, the inequality jcn j 2C r kn holds for all r R and
n 1. Thus, cn D 0 if n > k.
Theorem C.1.15 (Jensen’s formula). Suppose 0 < r < R, and let f be a holomor-
phic function on D.0; R/ such that f .0/ ¤ 0. Denote by z1 ; : : : ; zN the zeros of f in
the closed disc D.0; r/, listed according to their multiplicities. Then
Y
N Z
r 1
jf .0/j D exp log jf .rei /j d :
jzn j 2
nD1
Corollary C.1.16. Let f be an entire function such that f .0/ D 1 and denote by
n.r/; 0 < r < 1, the number of zeros of f in the closed disc D.0; r/. Then
log M.2r/
n.r/
log 2
where
M.r/ D sup jf .rei /j:
2Œ0;2/
284 Appendix C Advanced analysis
Proof. Let fzk g be the sequence of zeros of f , listed according to their multiplici-
ties and arranged so that the sequence fjzk jg is increasing. By Jensen’s formula and
inequality
Z
1
M.2r/ exp log jf .2rei /j d
2
Y
n.2r/
2r Y 2r
n.r/
D 2n.r/ :
jzk j jzk j
kD1 kD1
jf .z/j C ejzj ;
A
z 2 C:
The infimum of all numbers A for which this is true is called the order of f .
A polynomial is of order zero. The functions ez ; cos z and sin z are of order 1 while
zk z
e ; k 2 N, is of order k. The function ee is of infinite order.
is convergent if ˛ > .
where Q is a polynomial of degree not greater than and p is the smallest integer for
which
X 1
jzn jpC1
n1
is convergent.
Section C.2 Almost periodic functions 285
Remark C.1.20. Since the functions cos and sin are of order 1, the well-known prod-
uct formulae
1
Y z2
sin z D z 1
n2 2
nD1
1
Y z2
cos z D 1
.n 1=2/2 2
nD1
and continuous on its boundary. If jf .z/j M for all z on the boundary of S and
jf .z/j C ejzj ;
A
z2S
where 0 C and 0 A < ˇ ˛
, then jf .z/j M for all z 2 S .
4 See, e.g., § 3.23 in [56] for another proof of these product representations.
5 See Maak [41] for more information on almost periodic functions.
286 Appendix C Advanced analysis
Proof. Assume, without loss of generality, that g is periodic with period 1. Since
g is uniformly continuous on Œ0; 1 for every " > 0, there exists n 2 N such that
jg.t / g.s/j < " whenever t; s 2 Œ0; 1 and jt sj 1=n. Using this and the fact
that g is periodic with period 1 we see that
1
[
Uj WD Œk C .j 1/=n; k C j=n; j D 1; : : : ; n
kD1
is an "-partition for g.
Since
Theorem C.2.6. Let g be a continuous almost periodic function. Then for all " > 0
there exists a positive number L D L."/ such that every closed interval of the form
Section C.3 Fourier series 287
where Z 2
1
fO.n/ D f .x/einx dx; n2Z
2 0
is called the Fourier series of f . If N is a nonnegative integer, the N-th symmetric
partial sum of the Fourier series of f is
X
sN f .x/ D fO.n/einx :
jnjN
6 It can be shown that this property implies almost periodicity; see Maak [41], p. 94.
288 Appendix C Advanced analysis
where the supremum is taken over all m 2 N and all xi 2 Œa; b such that
0 D x0 < x1 < < xm D b . If V .f / is finite, then f is said to be of bounded vari-
ation.
A real-valued function f is of bounded variation if and only if it can be expressed
as the difference of two monotone functions.
A proof of the next theorem can be found, e.g., in Section 10.1 of [13].
Theorem C.3.1. Let f be a Lebesgue integrable function on the interval Œ0; 2/ and
suppose that the total variation of f is finite. Then
1
lim sN f .x/ D Œf .x C 0/ C f .x 0/; x 2 Œ0; 2/
N !1 2
and
jsN f .x/j K
with some constant K 0.7
Applying Lebesgue’s theorem on dominated convergence we obtain the following
corollary.
Corollary C.3.2. Let f be as in the previous theorem and let be a complex Borel
measure on Œ0; 2/. If .fxg/ D 0 for all discontinuity points8 of f , then sN f con-
verges to f in L2 . /.
4
n D 12
2 nD6
1 2 3 4
Figure C.1. The Gamma function (continuous line) and the function x 7! nŠnx =x.x C 1/
.x C n/ from relation (C.4.2.v).
Proof. (i) It is easy to check that .x/ < 1 while the continuity can be proved by
using Lebesgue’s theorem on dominated convergence.
(ii) Integrating by parts we obtain
Z 1 Z 1
x t
ˇ
x t ˇ1
.x C 1/ D t e dt D t e 0 C x t x1 et dt D x.x/:
0 0
Definition C.4.3. The Beta function B (see Figure C.2) is defined by the equation
Z 1
B.a; b/ D .1 t /a1 t b1 dt; a; b > 0:
0
6
y D 0:2
4
y D 0:4
2
y D 2:0
1 2 3
Figure C.2. The functions By W x 7! B.x; y/ where B is the Beta function from Definition
C.4.3.
Proof. It follows from the definition of the Gamma function that the function pa de-
fined by
x a1 ex
pa .x/ WD 1Œ0;1/ .x/ ; x2R
.a/
is a probability density. We have
Z 1 Z x
ex
pa pb .x/ D pa .x y/pb .y/ dy D .x y/a1 y b1 dy
1 .a/.b/ 0
Z 1
x aCb1 ex
D .1 t /a1 t b1 dt
.a/.b/ 0
where we made the substitution y D tx. The relation above shows pa pb D cpaCb
with some constant c. Since pa pb and paCb are probability densities, we conclude
that c D 1 from which the assertion follows.
Proof. Let f1 .x/ D ex ; f2 .x/ D ex x 1=2 and denote by g the multiplicative con-
volution of f1 and f2 , i.e.,
Z 1
1
g.x/ D f1 .x=u/f2 .u/ du; x 2 .0; 1/:
0 u
Then Mf1 .s/ D .s/; Mf2 .s/ D .s C 12 / and, by Lemma C.6.3,
p
Substituting u D t tx we obtain
Z 1 Z 1
p
.u2 C2 x/
p
2 x
p p
eu du D e2
2 x
g.x/ D e du D e :
1 1
p
Finally, set y D 2 x and obtain
Z 1
p p
M g.s/ D e2 x x s1 dy
0
Z 1 2s1 p
p y y .2s/
D e dy D :
0 2 22s1
C.4.7. For the proof of Binet’s formula we define the function ' by the equation
Z 1
'.x/ 1 p 1t x
e Dp x te dt; x > 0
2 0
so that x x p
.x C 1/ D 2x e'.x/ : (1)
e
Binet’s formula is equivalent with .x/ D '.x/. For the proof of this we need that
and ' satisfy a certain difference equation and that . 12 / D '. 12 /. First we show these
facts.
Moreover,11
Z 1
ext e.xC1/t ext C e.xC1/t
0 .x/ 0 .x C 1/ D dt:
0 t 2
Applying Lemma B.1.12 we obtain
1 1 1 1
0 .x/ 0 .x C 1/ D ln 1 C C D g 0 .x/
x 2 x xC1
completing the proof.
Proof of Binet’s formula. We have to show that '.x/ D .x/. By Lemma C.4.8,
11 The footnote to Lemma B.1.12 also holds for the differentiation below.
294 Appendix C Advanced analysis
for all n 1. Noting that 0 t e1t 1; t 0, and using the definition of ', we
conclude that
p p
e'.xCn/ e'.yCn/ . x C n y C n/e'.1/ :
Taking the limit as n ! 1, we obtain eh.x/ eh.y/ 0, i.e., h.x/ h.y/. Since
the function h is also periodic with period 1, it must be constant. Applying (1) and
Lemma C.4.9, we now obtain that h.x/ D h. 12 / D 0 for all x > 0.
In particular,
0 D 1 C 2B1 ;
0 D 1 C 3B1 C 3B2 ;
0 D 1 C 4B1 C 6B2 C 4B3 ;
0 D 1 C 5B1 C 10B2 C 10B3 C 5B4
and hence
1 1 1
B1 D ; B2 D ; B3 D 0; B4 D :
2 6 30
Using that B1 D 12 , relation (1) gives
z z
B2 2 z z z ez C 1 z e 2 C e 2
1C z C D z C D D z : (3)
e 1 2 ez 1
z
2Š 2 2 e 2 e 2
The function on the right-hand side is even, showing that B2kC1 D 0 for all k 2 N.
X
2N X
2N C1
B2j nŠ B2j
1
< log p < :
2j.2j 1/n2j n
.n=e/ 2 n 2j.2j 1/n2j 1
j D1 j D1
By Problem 154 in Part I, Chapter 4 of [44], the series on the right-hand side has the
so-called enveloping property, i.e.,
X
2N X
2N C1
B2j 2j 1 1 1 1 B2j 2j 1
t < t C < t ; 0 < t < :
.2j /Š e 1 t 2 .2j /Š
j D1 j D1
R1
Using the equation j Š D 0 t j et dt and the definition of we immediately obtain
the assertion.
296 Appendix C Advanced analysis
1 x Z 1
J .x/ D p .1 t 2 /1=2 eitx dt; x 0: (1)
C 12 2 1
For x D D 0 the expression x2 is equal to 1 by definition. Using that .1=2/ D
p
we see that
Z
1 1 1 1
J0 .0/ D p dt D arcsin t j11 D 1
1 1 t 2
while J .0/ D 0 if > 0.
If D 1=2, then the integral above can be easily evaluated. We obtain:
r
2
J1=2 .x/ D sin x:
x
Note that .1 t 2 /1=2 is even in t , hence one can replace eitx by cos tx in Defini-
tion C.5.1.
jJ .x/j C x ; ; x 0
where Z 1
1
C D p .1 t 2 /1=2 dt:
C 12 2 1
Proof. The continuity follows from the fact that the integral in Definition C.5.1 rep-
resents a characteristic function, up to a constant factor. The second statement is ob-
vious.
13 We will extend the definition of J for all 2 R in Remark C.5.6. Note that the extension is possible
for all complex but we do not need it in this book. The functions J0 and J1 are shown in Figure C.3.
Section C.5 Bessel functions 297
1
J0
J1
0.5
2 4 6 8 10
0:5
Figure C.3. The Bessel functions J0 and J1 from Definition C.5.1.
Proof. Replacing eitx by its power series expansion in Definition C.5.1 we see that
J .x/ is equal to
x X1 Z 1
1 1
p 1
.ixt /2k .1 t 2 /1=2 dt:
C 2 2
kD0
.2k/Š 1
p
Using .2k/Š D .2k C 1/; D .1=2/, and the duplication formula (cf. Proposi-
tion C.4.5), which implies
.k C 1=2/ 22k
D
.1=2/.2k C 1/ .k C 1/
we obtain the assertion.
Remark C.5.6. We use the first equation in Proposition C.5.5 to define J .x/ recur-
sively for all 2 R and x > 0. Doing so, both equations in Proposition C.5.5 hold for
Section C.5 Bessel functions 299
all such and x. Note that the first equation can also be written in the form
d
C J .x/ D J1 .x/; x > 0: (1)
dx x
q
2
From J1=2 .x/ D x sin x we obtain
r
2
J1=2 .x/ D cos x; x > 0:
x
The second equation in Proposition C.5.5 can also be written in the form
d
J .x/ D JC1 .x/; x > 0: (2)
dx x
Putting together (1) and (2) produces the equation
d 1 d
C J .x/ D J .x/
dx x dx x
which is equivalent to
d2 1 d 2
C C 1 J .x/ D 0
dx 2 x dx x2
known as Bessel’s equation. Adding and subtracting (1) and (2) we obtain the identities
Definition C.5.7. The modified Bessel function of the second kind14 of index 2 C is
defined by
Z 1
K .z/ D ez cosh t cosh.t / dt; z 2 C; Re z > 0:
0
14 This function has also been called by the now-rare name modified Bessel function of the third kind.
The function K1 is shown in Figure C.4.
300 Appendix C Advanced analysis
10
1 2 3
b
e 2 . a C u / ub1 du
2a 0
Z
1 1 r .et Cet / bt
D e 2 e dt
2 1
Z 1
1
D er cosh t ebt dt D Kb .r/:
2 1
More generally,
Z 1 Z 1
f .s/
f .s/ d .s/ D ds; f 2 L1 . /:
0 0 s
Since is invariant under translations we have
Z 1 Z 1
f .rs/ d .s/ D f .s/ d .s/; f 2 L1 . /:
0 0
for all z 2 C for which the Mellin transforms Mf .z/ and M g.z/ exist.
Remark C.6.4. Substituting s D et and setting g.t / WD f .et / in Definition C.6.2
we obtain Z 1
Mf .z/ D g.t /etz dt: (1)
1
302 Appendix C Advanced analysis
In the same way as in Section 3.3 we see that there exists a; b 2 Œ1; 1; a b,
such that the integral in (1) exists if a < Re z < b and Mf is holomorphic in this
strip.
In this case, Mf is holomorphic in the region Re .z/ > 0 and Mf is equal to the
Gamma function.
Let f .s/ D 1=.1 C s/. Using the substitution s D x=.1 x/ we see that
Z 1
Mf .z/ D z s1 .1 x/z dx:
0
Lemma C.7.3. The measure is finite if and only if its Laplace transform is bounded.
i.e., is bounded.
for all k 2 N0 .
Proof. The lemma follows by successive differentiation which can be justified since
where Kk;t is uniformly bounded for all t from a fixed compact subset of .0; 1/.
Proof. Writing d.x/ WD ex d .x/, the measure is finite. Since L.t / D L .tC1/,
it suffices to show that is uniquely determined by its Laplace transform. By
Lemma C.7.4 we have
Z 1
k .k/
.1/ .L/ .t / D x k etx d.x/; t > 0; k 2 N0
0
and hence
X Z 1 X
tk .tx/k
k
.1/ .L/ .k/
.t / D etx d.x/; a; t > 0:
kŠ 0 kŠ
0kta 0kta
The integrand on the right-hand side is at most 1. Using this, dominated convergence
and Lemma B.1.18 show that
X Z 1
tk
lim .1/k .L/.k/ .t / D 1Œ0;a .x/ d.x/ D .Œ0; a/:
t!1 kŠ 0
0kta
z D ei.z/ ; z2T
Proof. From
f .x/ D jf .x/j ei1 .x/ D jf .x/j ei2 .x/
we see that 2 .x/ D 1 .x/ C 2k.x/i with some function k W X ! Z. Since 1
and 2 are continuous, so is k. The continuous image of connected sets is connected,
therefore k.X / consists of a single integer.
is continuous, injective and it maps .˛; ˛ C 2/ onto T n fei˛ g. Denoting by the
inverse mapping we have
˚
z D ei .z/ ; z 2 T n ei˛ :
Lemma C.8.5. Let f1 and f2 be continuous functions on X with values in T such that
f1 .x/ ¤ f2 .x/ for all x 2 X . If f1 has a continuous argument, then so does f2 .
Theorem C.8.7. Let f W Rd ! Cnf0g be a continuous function such that f .0/ > 0.
Then f has a unique continuous argument such that .0/ D 0.16
ft 2 Rd W kt k < ıg:
where fj .tj / D f .tj ej /. We now apply the first part of the proof to the func-
tions fj .
Proof. It is clear that any function of the above form satisfies the equation (1). To prove
the other direction we consider only the case d D 1. The general case can be reduced
to this one in the same way as in the proof of Lemma C.9.1. From '.t C0/ D '.t /'.0/
we see that '.0/ D 1. Thus, 1 D '.t t / D '.t /'.t / and therefore '.t / ¤ 0 for
all t . The function j'j satisfies the same equation as '. Applying Lemma C.9.1 to the
logarithm of j'j, we obtain j'.t /j D ert with some r 2 R. The function D '=j'j sat-
isfies the same equation as '. By continuity, there exists ı0 > 0 such that the real part
of .t / is positive whenever t 2 .ı0 ; ı0 /. Setting f .t / D arctan.Im .t /=Re .t //
we have .t / D eif .t/ ; t 2 .ı0 ; ı0 /. The function f is continuous and satisfies
equation (C.9.1.1). Applying Lemma C.9.1 and using ' D j'j we conclude that
with some c 2 C. To show that this equation holds for arbitrary t 2 B o .ı/, let n 2 N
be such that t =n 2 .ı0 ; ı0 /. Repeated application of (1) shows that
Proof. Let a D g.0/ and b D h.0/. Setting s D 0 in the above equation we get
g.t / D f .t / b. Similarly, h.s/ D f .s/ a. Thus,
f .t C s/ D f .t / C f .s/ a b
and therefore the function f a b satisfies Cauchy’s functional equation. The as-
sertion follows now from Lemma C.9.1.
For fixed s and x these equations hold for all t in a neighborhood of zero. Comparing
the coefficients of t k we obtain
ak .s C x/ D ak .s/ C ak .x/; s; x; s C x 2 .ı; ı/:
Thus, in view of Lemma C.9.1, ak .s/ D c s with some c 2 R. Define the function g
c
by g.t / D f .t / kC1 t kC1 . Then
with some real-valued functions a˛ . For fixed t; s 2 B o .ı/ define the function h by
h.x C y/ h.x/ D f .t C x .s t / C y .s t // f .t C x .s t //
X
D a˛ .y .s t // .t C x .s t //˛ :
j˛jk
The right-hand side is a polynomial in x of degree at most k. By the first part of the
proof, h is a polynomial of degree at most k C 1.17 Application of Theorem B.6.6
completes the proof.
holds, then in some neighborhood of zero the functions g and h are polynomials of
degree at most m. If the matrices Bi A1 1
i Bj Aj are invertible for all i ¤ j , then
all functions fi are polynomials of degree at most m.
Proof. Without loss of generality we may assume that Ai D Id (the identity matrix)
for all i . We prove the assertions by induction on m. The case m D 1 follows from
Lemma C.9.3. Let m > 1 and assume that the assertions are true for 1; : : : ; m 1. For
t; s and x near to the origin we have
X
m X
m
fi .t C.Bm Bi /xCBi s/ D fi .t CBm xCBi .sx// D g.t CBm x/Ch.sx/:
iD1 iD1
X
m1
Œfi .t C.Bm Bi /xCBi s/fi .t CBi s/ D Œg.t CBm x/g.t /CŒh.sx/h.s/:
iD1
17 The first part remains valid if we replace .ı; ı/ by the possibly nonsymmetric interval I .
310 Appendix C Advanced analysis
Fi .t / D fi .t C .Bm Bi /x/ fi .t /
and
G.t / D g.t C Bm x/ g.t /; H.s/ D h.s x/ h.s/:
These functions satisfy the equation
X
m1
Fi .t C Bi s/ D G.t / C H.s/: (2)
iD1
The induction assumption (concerning g and h) and Lemma C.9.4 show that g and h
are polynomials of degree at most m. If Bm Bi is nonsingular, then the induction
assumption (concerning fi ) and Lemma C.9.4 show that fi is a polynomial of degree
at most m.
X
n
D ci ıxi ; c 2 Rn ; x 2 Rn
iD1
Pn
is such that O .0/ D iD1 ci D 0, then f is a polynomial of degree at most k 1.
Proof. It is easy to check that the statement is true for polynomials of the form
Y
d
˛
f .t / D tj j ; t 2 Rd ; ˛ 2 Nd0
j D1
x .t / WD ei.x;t/ ; x; t 2 Rd :
and assume that P .t / D 0 for all t 2 U . For each t0 2 U the function ' defined by
and let e1 ; : : : ; ed be the standard basis of Rd . We may assume that x1 D Re1 . De-
noting by P the orthogonal projection onto spanfe1 ; e2 g we have
P xj D rj Œcos.'j / e1 C sin.'j / e2
For all ' 2 R we have t WD cos.'/ e1 C sin.'/ e2 2 S d 1 . Putting this t into equation
(1) and using that .t; xj / D .P t; xj / D .t; P xj / we obtain
X
n
cj eirj cos.''j / D 0; ' 2 R:
j D1
It follows from Theorem C.1.8 that this equation holds in the whole complex plane:
X
n
cj eirj cos.z'j / D 0; z D x C iy 2 C: (3)
j D1
Since
1 i.z'j /
cos.z 'j / D e C ei.z'j /
2
we have
ˇ ir cos.z' / ˇ rj
ˇe j j ˇ
D exp Im ei.z'j / C ei.z'j /
2
D erj sinh.y/ sin.x'j / :
Let j .z/ 2 Œ0; 2/ be the argument of eirj cos.z'j / . Then, by relation (3),
X
n
cj erj sinh.y/ sin.x'j / ei j .z/
D 0:
j D1
j D1
X
n
C cj esinh.y/Œrj sin.x'j /R ei j .z/
D 0: (4)
j Dn0 C1
Section C.10 Linear independence of exponential functions 313
Functional analysis
X
d
.v; w/ D v j wj
j D1
The two equations above are called polarization identity. A result, attributed to M. R.
Fréchet, J. von Neumann and P. Jordan, states that if a seminorm (norm) k k satisfies
the parallelogram law, then the equations above define a semidefinite (positive definite,
respectively) inner product .; /.
The rank of A is equal to the maximum number of linearly independent column vectors
of A, which is the same as the maximum number of linearly independent row vectors.
If the product AB is defined then
Let A 2 Cnn be a quadratic matrix for the rest of this paragraph. A is diagonaliz-
able if there exists an invertible matrix X 2 Cnn such that XAX 1 is diagonal. A is
symmetric if A D AT and A is Hermitian if A D A . A is called unitary if A D A1 .
If A is real and AT D A1 , then A is called orthogonal. We denote by O.n/ the set
of all n n orthogonal matrices. A is said to be normal if A A D AA . A matrix is
normal if and only if it is diagonalizable by a unitary matrix.
A is positive semidefinite if x Ax 0 for all x 2 Cn and positive definite if
x Ax > 0 for all nonzero x 2 Cn . We show below (cf. Theorem D.2.3) that positive
semidefinite matrices are Hermitian.
The determinant of A is denoted by det.A/. Key properties are
The diagonal matrix D is unique up to the ordering of the eigenvalues, but the matrix
X is not unique. However, f .A/ can be shown to be independent of the particular
representation of A.3
3 There are several equivalent ways to define f .A/ for arbitrary quadratic matrices; a detailed treatment
is given in [29].
318 Appendix D Functional analysis
X
n X
p
cj ck K.xj ; xk / D dj dk K.xlj ; xlk /
j;kD1 j;kD1
where X
dk D ci ; k D 1; : : : ; p:
iW xi Dxlk
Next we give proofs of some results on matrices which are used in the book.
Taking .u; v/ equal to .1; 0/ or .0; 1/, we see that ajj and akk are nonnegative and, in
particular, real. Thus,
uvaj k C vuakj 2 R; u; v 2 C:
Taking
D .1; 1/ we get aj k C akj 2 R. Taking .u;
.u; v/ v/ D .1; i/ we get that
i aj k akj 2 R. Writing a D aj k C akj 2 R and b D i aj k akj 2 R, we infer
aj k D aCib aib
2 and akj D 2 , from which the desired equality is immediate.
h i
1 1
We remark that the matrix .aj k / D 1 1 is not Hermitian but we have
X
2
aj k rj rk D r12 C r22 > 0
j;kD1
Theorem D.2.4. A real matrix A D .aj k /nj;kD1 is positive semidefinite if and only if
it is symmetric and
Xn
aj k rj rk 0
j;kD1
for all real numbers r1 ; : : : ; rn .
Proof. The “only if” part follows immediately from Theorem D.2.3. If A is real and
symmetric, then
Xn X
n
aj k cj ck D aj k Re.cj ck / 2 R
j;kD1 j;kD1
Section D.2 Matrices and kernels 319
X
n X
n X
n
aj k cj ck D aj k .xj xk C yj yk / C i aj k .yj xk xj yk /
j;kD1 j;kD1 j;kD1
Xn X
n
D aj k x j x k C aj k yj yk
j;kD1 j;kD1
Definition D.2.5. Suppose A D .aj k /nj;kD1 is a Hermitian matrix. Then A has n real
eigenvalues, counted with multiplicity, and there exists an orthonormal basis of Cn
consisting of eigenvectors of A. The number of negative squares of A is the num-
ber of negative eigenvalues of A, counted with multiplicity. A negative (respectively
nonnegative, respectively positive) subspace for A is a linear subspace E of Cn such
that
Xn
aj k xj xk D .Ax; x/ < 0
j;kD1
Proof. Let the order of A be n and let k be the number of negative squares of A. In
view of the remark after Definition D.2.5, the matrix A has a negative subspace of
dimension k and A has a nonnegative subspace P of dimension n k. Now let N be
an arbitrary negative subspace for A. Since N cannot intersect P except in 0 we must
have dim .N / k.
f .x1 ; : : : ; xk ; 0; : : : ; 0/ W .x1 ; : : : ; xk / 2 Ck g
320 Appendix D Functional analysis
Noting that positive definite matrices are nonsingular, we obtain the following cor-
ollary.
As the example
0 0
AD
0 1
shows, there exist matrices with det Ak 0 for all k that are not positive semidefinite.
However, the following result holds.
Proof. If A D .aj k / is positive semidefinite, then the eigenvalues and hence the de-
terminant of A are nonnegative.
On the other hand, let us assume that all subdeterminants are nonnegative. For > 0
we define the matrix A D A C In . Then
X
n
l
det .A / D dl
lD0
where dn D 1 and
X
dl D det ..aj k /j;k2Sl / 0; l D 0; 1; : : : ; n 1:
Sl
Here the sum is over all sets Sl f1; : : : ; ng having n l elements. From this we
conclude that det .A / n > 0. Corollary D.2.8 shows that A is positive definite.
Consequently, the pointwise limit A D lim!0 A is positive semidefinite.
and
1
K2 .x; y/ D .jxj C jyj jx yj/; x; y 2 R
2
are positive semidefinite.4
Proof. We consider first the kernel K1 and show that the matrix
n
A D min .xi ; xj / i;j D1
is positive semidefinite for all x1 ; : : : ; xn 2 Œ0; 1/. We may assume that
x1 x2 xn . We then have
ˇ ˇ
ˇ x1 x1 x1 : : : x1 ˇ
ˇ ˇ
ˇ x1 x2 x2 : : : x2 ˇ
ˇ ˇ
ˇ ˇ
Dn .x1 ; : : : ; xn / WD det .A/ D ˇ x1 x2 x3 : : : x3 ˇ :
ˇ :: : : :: ˇˇ
:
ˇ : :: ::
ˇ ˇ
ˇ x 1 x2 x3 : : : xn ˇ
Subtracting the first row from all other rows and expanding the determinant along the
first column we see that
Dn .x1 ; x2 ; : : : ; xn / D x1 Dn1 .x2 x1 ; : : : ; xn x1 /:
Induction on n leads to
Dn .x1 ; x2 ; : : : ; xn / D x1 .x2 x1 /.x3 x2 / .xn xn1 /
from which det A 0 follows. Since all subdeterminants of A have the same structure,
they are also nonnegative. Theorem D.2.9 shows that A is positive semidefinite.
We now consider the kernel K2 . If xy 0 then K2 .x; y/ D 0, otherwise
K2 .x; y/ D K1 .jxj; jyj/. Thus, for x1 xn ,
n
K2 .xi xj / i;j D1 D diag .A1 ; A2 /
where A1 and A2 are, by the first part of the lemma, positive semidefinite matrices.
This completes the proof.
Alternative proof. We present an alternative proof for the positive semidefiniteness of
the matrix A D .min .xi ; xj //ni;j D1 , where 0 x1 x2 xn . By continuity,
it suffices to consider positive rational xj ’s. Multiplying A by a positive integer we
may even assume that the xj ’s are positive integers. Then for some m the matrix A is
a submatrix of .min.i; j //m
i;j D1 which is positive definite since it is the square of the
symmetric matrix B D .bij / where bij D 0 if i C j m and bij D 1 if i C j > m.
For example, 2 3 2 32
1 1 1 1 0 0 0 1
6 1 2 2 2 7 6 0 0 1 1 7
6 7 6 7
4 1 2 3 3 5D4 0 1 1 1 5 :
1 2 3 4 1 1 1 1
4 Note that K1 .x; y/ D K2 .x; y/ if x; y 0.
322 Appendix D Functional analysis
and hence A D Q diag .r1 ; : : : ; rn / Q . It is now easy to check that the square of the
Hermitian matrix
p p
B D Q diag . r1 ; : : : ; rn / Q
is equal to A.
Theorem D.2.12 (Schur). Let A D .aij / and B D .bij / be nn positive semidefinite
matrices. Then the matrix C D .aij bij /ni;j D1 is positive semidefinite. If A ¤ 0 and B
is positive definite, then C is positive definite, as well.
Proof. Suppose first that A and B are positive semidefinite. We assume that A ¤ 0.
By Theorem D.2.11 there exists an n n Hermitian matrix D such that A D D 2 .
Writing D D .dij /, we have for x D .xi / 2 Cn
X
n
.C x; x/ D aij bij xj xi
i;j D1
X n X n
D dli dlj bij xj xi
i;j D1 lD1
X n X n
D bij ylj yli
lD1 i;j D1
Xn
D .Byl ; yl / 0
lD1
Using the relation cj k D ckj we see that S1 is real, while S2 is purely imaginary.
Consequently,
X
n X
n
S1 iS2 D aj k .rj rk C rnCj rnCk / C bj k .rnCj rk rj rnCk /
j;kD1 j;kD1
X
2n
D dj k rj rk
j;kD1
Definition D.2.15. Let V be a positive definite inner product space over K where
K D C or K D R and let v1 ; : : : ; vn 2 V . The Gram matrix G associated with
v1 ; : : : ; vn is defined by
n
G D G.v1 ; : : : ; vn / D .vi ; vj /
i;j D1
Choosing an orthonormal basis in the linear span of the vectors vj and building a
matrix A by writing the coordinates of vj as the j -th row of A we have G D AA .
then
G.v1 ; : : : ; vn / D T G.w1 ; : : : ; wm /T (1)
where T D .tij /.
(iii) The dimension of the linear space L spanned by the vectors v1 ; : : : ; vn is equal
to the rank of G.v1 ; : : : ; vn /.
Proof. The first statement follows from5
!
X
n X
n X
n
.vi ; vj /ci cj D vi ci ; vi ci 0; ci 2 K
i;j D1 iD1 iD1
To prove the last statement choose arbitrary vectors w1 ; : : : ; wn which span the same
linear space L. Since the rank of the product of matrices is not greater than that of any
factor, equation (1) shows that
rank G.v1 ; : : : ; vn / rank G.w1 ; : : : ; wn /:
Exchanging the role of the v’s and w’s we conclude that we have equality in the above
display. The result now follows by choosing the w’s such that w1 ; : : : ; wk form an
orthonormal basis of L and wj D 0 for j > k D dim L.
5 The positive semidefiniteness also follows from the representation G D AA after Definition D.2.15.
Section D.2 Matrices and kernels 325
Proof. It is clear that the above limit also exists for all t 2 p
Cd , we denote it in this case
also with q.t /. Since Cn is positive semidefinite, kt kn D .Cn t; t / is a seminorm on
p
Cd satisfying the parallelogram law (cf. (D.1.v)). The same is true for the limit q .
By the theorem of Fréchet, von Neumann and Jordan, q is generated by a positive
semidefinite inner product h; i on Cd :
q.t / D ht; t i; t 2 Cd :
The first statement of the lemma follows from the fact that ht; t i D .C t; t / with some
positive semidefinite matrix C 2 Cd d . Let e1 ; : : : ; ed be the standard basis of Cd .
Using the polarization identity (D.1.2) with v D ej and w D ek , we obtain the second
statement.
Proof. The matrix A is obviously positive semidefinite. If it was not positive definite,
then it would have an eigenvector with eigenvalue 0. This vector would be an eigen-
vector of A with eigenvalue , contradicting the positive semidefiniteness of A.
Sn WD f.i; j / 2 Z2 W 1 i; j ng
such that
(i) .i; i / 2 S whenever 1 i n;
(ii) if .l; k/ 2 S then f.i; j / W min .l; k/ i; j max .l; k/g S .
Suppose that for each .i; j / 2 S a complex number cij is given such that
(iii) for all .l; k/ 2 S with l k the matrix .cij /ki;j Dl is positive semidefinite.
Then there exists a positive semidefinite matrix A D .aij /ni;j such that aij D cij for
.i; j / 2 S .
We consider first the special case where S D Sn n f.1; n/; .n; 1/g. For z 2 C we
define the matrix A.z/ by
2 3
c11 c12 : : : c1;n1 z
6 c21 c22 : : : c2;n1 c2n 7
6 7
6 : 7
A.z/ WD 6 :: 7:
6 7
4 cn1;1 cn1;2 : : : cn1;n1 cn1;n 5
z cn2 : : : cn;n1 cnn
Set
A1 WD .cij /n1
i;j D1 ; A3 WD .cij /ni;j D2 ; A5 WD .cij /n1
i;j D2
and let A2 .z/ .A4 .z// denote the matrix obtained by canceling the first (last) column
and the last (first) row of A.z/. Assume that all matrices in (iii) are positive definite.
By Corollary D.2.8 it suffices to prove the existence of a complex number z for which
the determinant det .A.z// is positive. We have
det .A1 / det .A3 / det .A2 .z// det .A4 .z//
det .A.z// D :
det .A5 /
Since A.z/ is Hermitian det .A4 .z// D det .A2 .z// and hence the inequality
det .A.z// > 0 holds if and only if
j det .A2 .z//j2 < det .A1 / det .A3 /: (1)
By assumption, the right-hand side of this inequality is positive. Expanding the deter-
minant det .A2 .z// along the first row we see that
det .A2 .z// D det .A5 /z C c
where det .A5 / > 0 and c is some complex number not depending on z. We conclude
that there exist infinitely many z 2 C satisfying (1).
If the matrices in (iii) are not all positive definite, then we replace A.z/ by the
matrices A .z/ WD A.z/ C I where I denotes the n n identity matrix and > 0.
By Lemma D.2.18 the corresponding matrices are positive definite. For m 2 N we
choose zm 2 C such that A1=m .zm / is positive semidefinite. Then
1
c11 C m zm
det 1 0
zm cnn C m
and hence the sequence fzm g is bounded. Consequently, there exists a subsequence
converging to some z 2 C. The matrix A.z/ is then positive semidefinite.
We now turn to the general case. We may suppose that S 6D Sn . Let k be the greatest
integer with the property that .1; 1/; : : : ; .1; k 1/ 2 S and let l be the greatest integer
with .1; k/; : : : ; .l; k/ … S . Using (i) and (ii) we see that 1 l < k n and
S \ f.i; j / W l i; j kg D f.i; j / W l i; j kg n f.l; k/; .k; l/g:
Section D.3 Hilbert spaces and linear operators 327
The first part of the proof shows the existence of a complex number z such that the
matrix .cij /ki;j Dl is positive semidefinite, where ckl WD z; clk WD z. Replacing S by
S [f.l; k/; .k; l/g, this new index set and the corresponding complex numbers cij also
satisfy the conditions of the theorem. Thus, repeating the arguments above we obtain
the desired matrix A.
lim kv vn k D 0:
n!1
where .; / denotes the inner product of H. Examples of Hilbert spaces are Rd ; Cd
and L2 . ; A; P /. Throughout the rest of this section the symbol H denotes a Hilbert
space.
We call a nonempty set L H a linear manifold if for any two vectors g; h 2 L
and any a; b 2 K we have af C bg 2 L. A linear manifold is called a subspace if it is
closed. The orthogonal complement M ? of a nonempty subset M H is a subspace
of H. If L is a subspace of H, then
H D L ˚ L? :
h D hL C hL? ; hL 2 L; hL? 2 L? :
The element hL is called the orthogonal projection of h onto L. The orthogonal pro-
jection hL is the (unique) element of L having minimal distance to h:
kh hL k D inf kh vk:
v2L
Theorem D.3.5. Let h; iL be a sesquilinear form on a dense linear manifold L H
such that
jhg; hiL j C kgk khk; h 2 L
with some C 2 Œ0; 1/. This form can be uniquely extended to a sesquilinear form
h; i on H such that
jhg; hij C kgk khk; h 2 H:
Moreover, there exists a linear operator A on H such that kAk C and
The next lemma states basic properties of compact operators which follow imme-
diately from the definition of compactness.
330 Appendix D Functional analysis
Lemma D.3.10.
(i) If K1 and K2 are compact operators, then so is a1 K1 Ca2 K2 for all a1 ; a2 2 K.
(ii) If K is a compact and A is a bounded linear operator, then AK and KA are
compact.
Proof. The case K D 0 being trivial, we may assume that kKk ¤ 0. We choose a
sequence fhn g in H such that khn k D 1 and kKhn k converges to kKk. Since K is
self-adjoint we have .K 2 hn ; hn / D .Khn ; Khn / D kKhn k2 . Using this we obtain
0 kK 2 hn kKhn k2 hn k2
D kK 2 hn k2 2kKhn k2 .K 2 hn ; hn / C kKhn k4
D kK 2 hn k2 kKhn k4
kKk2 kKhn k2 kKhn k4 :
K 2 g D kKk2 g:
.K kKk/.K C kKk/g D 0:
where
(i) S D f1; : : : ; N g with some N 2 N or S D N;
(ii) fn W n 2 S g R is the set of all nonzero eigenvalues of K;
(iii) f'n W n 2 S g is an orthonormal system in H and 'n is an eigenvector of K
with eigenvalue n .
(iv) If S D N, then limn!1 n D 0.
where N is a K-invariant subspace. By our procedure, kKjN k jn j for all n, where
KjN denotes the restriction of K to N . Thus K.N / D f0g. The decompositions (1)
and (2) follow as in the finite case.
Remark D.3.13. If the Hilbert space H in the previous theorem is separable and
infinite-dimensional, then there exists an orthonormal basis f'n g1
1 of H consisting
of eigenvectors 'n of K with (possibly zero) eigenvalues n . This follows from the
previous proof by selecting an orthonormal basis of the subspace N .
332 Appendix D Functional analysis
Theorem D.4.2. Let E be a locally convex space over K. Further, let F and C be
disjoint nonempty convex subsets of E such that F is closed and C is compact. Then
there exists a continuous linear functional l W E ! K such that
Since a locally convex space over C can also be considered as a locally convex
space over R, we obtain:
Corollary D.4.3. Under the assumptions of the preceding theorem there exists a con-
tinuous R-linear functional l W E ! R such that
In the proof of the next theorem we will repeatedly use the following simple fact:
If A is an extreme subset of B and B is an extreme subset of C then A is an extreme
subset of C .
Proof. First we prove that an arbitrary non-void compact set C L has at least
one extreme point. Applying Zorn’s lemma we see that C contains a closed extreme
subset M which is minimal with respect to inclusion. We claim that M has only one
element. This element is then an extreme point of C . Assume, on the contrary, that M
has more than one element. Then, by Corollary D.4.3, there exists a continuous real
linear functional l on L such that l is not constant on M . It is easy to check that the set
N D fx 2 M W l.x/ D sup l.M /g is a proper, closed and extreme subset of M . This
implies that N is a closed extreme subset of C contradicting the minimality of M .
Denote by K0 the closed convex hull of the extreme points of K. Assume that there
exists a y 2 K n K0 . Corollary D.4.3 with C D K0 and F D fyg shows the existence
of a continuous real linear functional l on L such that
sup l.K/ l.y/ > sup l.K0 /: (1)
By the first part of the proof, the compact convex set
N D fx 2 K W l.x/ D sup l.K/g
which is obviously not empty, contains at least one extreme point x0 . As N is an
extreme subset of K, x0 is an extreme point of K as well and hence x0 2 K0 . Since
this contradicts (1), we must have K D K0 .
Proof. By the previous theorem there exists a net fx˛ g of points of K of the form
X
n˛
x˛ D p˛j x˛j ; n˛ < 1
j D1
j j P j
where x˛ 2 ex.K/; p˛ 0; j p˛ D 1, such that lim˛ x˛ D x. We define the
probability measures ˛ on ex.K/ by
X
n˛
˛ D p˛j ıx j :
˛
j D1
334 Appendix D Functional analysis
Since ex.K/ is compact, Theorem E.1.13 shows the existence of a subnet f ˇ g con-
verging weakly to a probability measure x on ex.K/. For an arbitrary continuous
linear functional l we then have
Z Z X j j
l.y/ d x .y/ D lim l.y/ d ˇ .y/ D lim pˇ l.xˇ / D l.x/
ˇ ˇ
j
completing the proof.
In H any orthonormal sequence fen g1nD1 converges weakly to 0. This follows from
Bessel’s inequality
1
X
j.en ; h/j2 khk2 ; h 2 H:
nD1
Since ken k D 1, the sequence fen g does not converge strongly to 0.
Let H1 and H2 be Hilbert spaces. The weak operator topology on the set B.H1 ; H2 /
is the coarsest topology rendering all of the maps8
A 7! .Ag; h/; g 2 H1 ; h 2 H2
8 Recall that B.H1 ; H2 / denotes the set of all bounded linear operators from H1 to H2 (cf. D.3.3).
Section D.5 Weak topologies 335
continuous, where .; / denotes the inner product of H2 . A net fA˛ g of operators A˛ 2
B.H1 ; H2 / converges in the weak operator topology to an operator A 2 B.H1 ; H2 /
if and only if
lim .A˛ g; h/ D .Ag; h/; g 2 H1 ; h 2 H2 :
˛
The relation above is equivalent to
w
A˛ g ! Ag; g 2 H1 :
wo
In case of convergence in the weak operator topology we write A˛ ! A.
wo
Replacing g by Bg we see that A˛ B ! AB. The second statement follows from
Remark D.5.3. Multiplication of operators is not jointly continuous in the weak op-
erator topology. To see an example, let H be a Hilbert space with orthonormal basis
fen g1
nD1 and write Uen WD enC1 . It is easy to check that U can be extended to a
unitary operator in H. Using that any orthonormal sequence in H converges weakly
wo wo
to 0 (cf. D.5.1) we see that U n ! 0 and U n ! 0; n ! 1. On the other hand,
U n U n is the identity operator for all n.
Theorem D.5.4. Convergence of a net in the weak, weak- or weak operator topology
implies that the net is bounded.
w
Proof. Let V be a normed linear space and let fl˛ g be a net in V with l˛ ! l. Then for
each v 2 V the net fl˛ .v/g converges to l.v/ and hence it is bounded. Theorem D.3.8
shows that the net fl˛ g is bounded.
Assume that fv˛ g is a weakly convergent net in V and define the linear functional
L˛ on V by L˛ .l/ WD l.v˛ /. Then kL˛ k D kv˛ k and the net fL˛ g converges in the
weak- topology. By the first part of the proof fL˛ g, and hence also fv˛ g, is bounded.
wo
Let fA˛ g be a net in B.H1 ; H2 / where H1 and H2 are Hilbert spaces. If A˛ ! A,
w
then A˛ g ! Ag for all g 2 H1 . It follows that fA˛ gg is bounded for all g 2 H1 .
Applying again Theorem D.3.8, we conclude that fA˛ g is bounded.
336 Appendix D Functional analysis
Proof. (i) Using the linear functionals L˛ from Theorem D.5.4 we see that (i) follows
from (ii).
(ii) Let flˇ g be a subnet of fl˛ g such that limˇ klˇ k D lim inf˛ kl˛ k. For all v 2 V
we have
jl.v/j D lim jlˇ .v/j lim klˇ k kvk D lim inf kl˛ k kvk
ˇ ˇ ˛
from which (ii) follows.
(iii) The proof is similar to that of (ii). We choose a subnet fAˇ g of fA˛ g such that
limˇ kAˇ k D lim inf˛ kA˛ k. For all g 2 H1 and h 2 H2 we have
j.Ag; h/j D lim j.Aˇ g; h/j lim kAˇ k kgk khk D lim inf kA˛ k kgk khk
ˇ ˇ ˛
Corollary D.5.7. Let H1 and H2 be Hilbert spaces and let S B.H1 ; H2 / be closed
in the weak operator topology. Then S is compact in the weak operator topology if
and only if Sg is bounded for all g 2 H1 .
Proof. The “if part” follows by the same argument as in the proof of the previous theo-
rem. Here we use the fact that the weak operator topology is the topology of pointwise
convergence, considering the weak- topology on H2 .
To show the “only if part”, assume that Sg is unbounded for some g 2 H1 . Then
for all n 2 N there exists sn 2 S such that ksn gk n. By compactness we can choose
a subsequence fsnk g which is convergent in the weak operator topology. In view of
9 For a net fr˛ g of real numbers lim inf˛ r˛ is defined by lim inf˛ r˛ D lim˛ inf frˇ W ˇ ˛g.
Section D.5 Weak topologies 337
Theorem D.5.4 this subsequence is bounded and hence fsnk gg is bounded, as well.
This contradiction completes the proof.
Theorem D.5.8. Let V be a normed linear space and let K V be convex. Then the
weak and strong closures of K coincide.
Proof. It suffices to prove that every strongly closed convex set E V is weakly
closed. By Corollary D.4.3, for all x … E there exists a continuous real linear func-
tional lx on V such that
Then \
ED fy 2 V W lx .y/ mx g:
x…E
Since each set fy 2 V W lx .y/ mx g is weakly closed, the set E is weakly closed as
well.
Proof. Denote by H0 the set of common fixed points of the operators in S . Then
0 2 H0 and H0 is an S -invariant subspace of H. Write
S WD fs W s 2 S g:
For all h 2 H the set S h is convex. Using the fact that S is compact we see that S h
is weakly compact. By Lemma D.5.10 there exists a unique h0 2 S h having minimal
norm. Since S is a semigroup of contractions, s h0 is in S h and ks h0 k h0 for
all s 2 S . By the uniqueness of h0 we must have s h0 D h0 .
We prove that h0 D 0 if h 2 H0? . Let g 2 H be arbitrary and consider the weakly
compact convex set Sg. The same arguments as above show the existence of g0 2 Sg
such that sg0 D g0 for all s 2 S . Thus, g0 2 H0 and hence .g0 ; h0 / D 0. Choose
s0 2 S such that g0 D s0 g. Then
and hence h0 D 0. We have thus shown that for all h0 2 H0? the set S h0 contains 0.
Now let h1 ; : : : ; hn 2 H0? be arbitrary and choose s1 2 S such that s1 h1 D 0.
Next we choose s2 2 S such that s2 .s1 h2 / D 0. Continuing this process we obtain an
operator s 2 S with s hj D 0 for all j D 1; : : : ; n. A simple compactness argument
shows the existence of an operator s0 2 S such that s0 h D 0 for all h 2 H0? . It follows
that s0 H H0 and hence ss0 D s0 .
It remains to prove that s0 s D s0 . Note first that the equations s0 ss0 D s0 and
s0 ss0 s D s0 s hold because of ss0 D s0 . We have
Using this and the relation s0 s s0 D s0 we see that ks s0 hk D ks0 hk. If
s0 h ¤ s s0 h for some h 2 H, then
1 1
ks0 hk D ks s0 hk D ks0 .s s0 h C s0 h/k ks s0 h C s0 hk < ks0 hk:
2 2
This contradiction shows that s0 D s s0 and therefore s0 s D s0 . The uniqueness of
s0 follows at once from ss0 D s0 s D s0 .
Appendix E
Measure theory
Throughout this section X will denote a Hausdorff topological space. The measures
considered will be defined on the -algebra B.X / of all Borel subsets of X , i.e., the
-algebra generated by the open subsets of X . A measure (nonnegative, real or com-
plex) defined on B.X / will be called a Borel measure. In our terminology a measure
need not be nonnegative. If and are finite Borel measures and c 2 C, then the meas-
ures c and C are defined by .c /.B/ D c .B/ and . C/.B/ D .B/C.B/,
B 2 B.X /.
Definition E.1.1. A nonnegative Radon measure on X is a nonnegative Borel meas-
ure such that
(i) .K/ < 1 for each compact set K X ;
(ii) .B/ D sup f .K/ W K B; K compactg for each Borel set B X .
D. 1 2/ Ci. 3 4/
then D .
Let be a complex Radon measure on a locally compact space X . Then there exist
a finite nonnegative Borel measure j j and a Borel measurable function g on X such
that2 jg.x/j D 1 for all x 2 X and
Z Z
f d D fg dj j (1)
Theorem E.1.3 (Riesz representation theorem). Let X be locally compact. Then there
is a bijection between all nonnegative linear functionals3 L on C00 .X / and all Radon
measures on X given by
Z
L.f / D f d ; f 2 C00 .X /:
Theorem E.1.4 (Riesz–Markov theorem). Let X be locally compact. For any con-
tinuous4 linear functional L on C0 .X /, there is a unique complex Radon measure
on X such that Z
L.f / D f d ; f 2 C0 .X /:
The norm of L as a linear functional is j j.X /.
Lemma E.1.5. Let F be a linear space of real-valued bounded functions on some set
X such that 1 WD 1X 2 F. Further, let L be a real linear functional on F such that
L.1/ D 1 and
jL.f /j sup jf .x/j; f 2 F:
x2X
Then L is nonnegative, i.e., L.p/ 0 whenever p 0.
2 See Theorem 14.14 and Corollary 11.41 in [27].
3 Nonnegativity means that L.f / 0 whenever f 0.
4 We consider the supremum norm on C0 .X /
Section E.1 Borel measures, weak and vague convergence 341
Proof. If L was not nonnegative, then we could find a function p 2 F such that
0 p 1 and L.p/ < 0. From 1 D L.1/ D L.p/ C L.1 p/ it follows that
L.1 p/ > 1. This contradicts the assumptions since 0 1 p 1.
The space MbC .X / is a Hausdorff space in the weak topology. Convergence in the
weak topology is characterized by the next theorem.
Theorem E.1.7. For 2 MbC .X / and a net f ˛g in MbC .X / the following prop-
erties are equivalent:
(i) ˛ ! in the weak topology;
(ii) lim inf ˛ .G/ .G/ for all open G X and lim ˛ .X / D .X /;
(iii) lim sup ˛ .F / .F / for all closed F X and lim ˛ .X / D .X /;
R R
(iv) lim inf f d ˛ f d for all bounded lower semicontinuous
f W X ! R.
R R
(v) lim sup f d ˛ f d for all bounded upper semicontinuous
f W X ! R.
If (i)–(v) are fulfilled, then
R R
(vi) lim f d ˛ D f d for all bounded continuous f W X 7! R.
Finally, if X is a completely regular space,6 then property (vi) implies (i)–(v).
We say that a net f ˛ g of complex Radon measures on a completely regular space
X converges weakly to some complex measure if the relation (E.1.7.vi) holds.
5 Note that a real-valued function f on X is called lower (upper) semicontinuous if the set
fx 2 X W f .x/ > rg (the set fx 2 X W f .x/ < rg, respectively) is open for every r 2 R.
6 A topological space X is completely regular if, given any closed set F and any point x 2 X n F , there
is a continuous real-valued function f on X such that f .x/ D 0 and f .y/ D 1 for every y 2 F .
Locally compact Hausdorff spaces and metric spaces are completely regular.
342 Appendix E Measure theory
The next simple result shows that finitely supported measures on Rd can be approx-
imated by absolutely continuous measures with compact support.
Lemma E.1.9. For every finitely supported nonnegative measure on Rd there exists
a sequence fhn g of nonnegative functions hn 2 C00 .Rd / such that the sequence fn g
where dn .t / D hn .t / d.t / converges weakly to .
Proof. Consider first the special case D ıx ; x 2 Rd , and denote by Bn .x/ the
open ball with center Rx and radius 1=n; n 2 N. For each n we choose a nonnegative
function hn such that hn d D 1 and hn .t / D 0; t … Bn .x/, and define n as in the
statement of the lemma. Now let f be an arbitrary continuous real-valued function on
Rd . For an arbitrary > 0 there exists N. / 2 N such that
Definition E.1.10. RThe vague topology on MC .X / is the coarsest topology such that
the functions 7! f d are continuous for every function f 2 C00 .X /.
Theorem E.1.13. Suppose that X is locally compact and let p > 0. The set
f 2 MbC .X / W .X / pg
is vaguely compact. If X is compact, then the set of all Radon probability measures
on X is compact in the weak topology.
If and are Radon measures on spaces X and Y , then there is a unique Radon
measure on X Y , called product measure, satisfying
The next theorem states that the product measure depends continuously on both of its
arguments.
Theorem E.1.17. Let .O˛ /˛2I be an open covering of X and on each O˛ let a Radon
measure ˛ be given such that ˛ .B/ D ˇ .B/ for each pair of indices ˛; ˇ 2 I
and for each Borel set B O˛ \ Oˇ . Then there is a uniquely determined Radon
measure on X such that .B/ D ˛ .B/ for every Borel set B O˛ .
9 Recall that, by definition, a sequence of random vectors converges in distribution to a random vector
X if the sequence of the corresponding distributions converges weakly to the distribution of X .
344 Appendix E Measure theory
holds for all compact sets S X . Then f is -integrable and the same inequality
holds with S replaced by X .
holds for all compact sets S X . We show first that h is j j-integrable. Since
jRe zj jzj and jIm zj jzj; z 2 C, we may suppose that h is real-valued. Set-
ting hC D max .0; h/; h D min .0; h/ and ShC D fx 2 X W h.x/ 0g, we have
h D hC h and
Z Z
0 hC dj j D h dj j K:
S S \ShC
The validity of the last inequality follows from the fact that is a Radon measure and
hence the integral over S \ ShC can be approximated by integrals over compact sets.
From this we conclude that hC is j j-integrable. The integrability of h follows in the
same way. Thus h is integrable. Since jf j D jhj, the function f is integrable as well.
As is a Radon measure, there exist compact sets Kn X; n 2 N, such that
Z Z
lim f d D f d
n!1 K S
n
lim fn D f
n!1
Then Z
lim jfn f j d D 0:
n!1
Section E.2 Convolution of measures and functions 345
E.2.2. The convolution and the multiplicative convolution ? are again finite
Borel measures satisfying
Z ZZ
f d D f .x C y/ d .x/ d.y/
and Z ZZ
gd ? D g.xy/ d .x/ d.y/
E.2.3. Let 2 Mb .Rd / and f; g 2 L1 .Rd /. Recall that D d denotes the Lebesgue
measure on Rd . We define the functions f and f g for x 2 Rd by
Z
f .x/ D f .x y/ d.y/
and Z
f g.x/ D f .x y/g.y/ d.y/
Theorem E.2.5. If the nets f ˛ g; f˛ g converge weakly to and , respectively, then
f ˛ ˛ g converges weakly to .
Appendix F
Probability
In this chapter we collect some basic definitions, facts, and notation of probability
theory. For more details see, e.g., [10, 16, 17, 37, 45].
P .X 2 B; Y 2 D/ D P .X 2 B/ P .Y 2 D/
X .B/ WD P .X 2 B/; B 2 Bd
is called the distribution of the random vector X , while the distribution function FX
of X is defined by
Z
X .B/ D p.x/ d.x/; B 2 Bd
B
X D p1 d C p2 ac C p3 sc
XCY D X Y
E .X / WD .EX1 ; : : : ; EXd /:
Suppose that Xj is square integrable for all j and put mj D EXj . The matrix
cov.X / D .cj k /dj;kD1 where
If the entries of a matrix Y D .Yj k / are integrable random variables, then we write
E.Y / WD .EYj k /:
Using this notation, the covariance matrix can also be written in the form3
cov.AX / D A cov.X / A :
the moment of order ˛ of (of X ). In this case we also say that the moment of order
˛ of exists. Note that M˛ exists if and only if
Z
A˛ D A˛ . / D A˛ .X / D jx ˛ j d .x/ < 1:
Rd
3 Recall our convention from Section A.2 saying that in expressions involving matrix operations the
elements of Rd and Cd are considered as column vectors.
350 Appendix F Probability
The number A˛ is called the absolute moment of order ˛. The existence of M˛ implies
the existence of Mˇ for all ˇ ˛. In the case d D 1 the inequality
x 2k C x 2k2
jxj2k1 ; x 2 R; k 1
2
shows that
M2k C M2k2
A2k1 ; k 2 N: (4)
2
Let n > k 1 and suppose that An exists. By Hölder’s inequality,
Z Z 1=p
Ak D jxjk 1 d .x/ jxjpk d .x/
R R
n
for every p > 1. Setting p D k
we find
1=k
Ak A1=n
n ; 1 k n: (5)
lim P .kX Xn k / D 0:
n!1
Lemma F.2.2. Let fXn g and fYn g be sequences of random vectors such that
X
P .Xn ¤ Yn / < 1:
P P
Then Xn is almost surely convergent if and only if Yn is almost surely convergent
(and they converge to the same limit).
Proof. The lemma follows immediately from the fact that, by the zero–one law of
Borel–Cantelli (see Theorem F.2.1), the probability that the event ŒXn ¤ Yn happens
for infinitely many n is zero.
1
5 Note that E jX j .E jX j2 / 2 in view of (F.1.5) and therefore the condition supn E jXn j2 < 1
implies that supn E jXn j < 1.
352 Appendix F Probability
P
from which it follows that n jcn Xn .!/j is finite with probability one. Thus, the se-
quence of partial sums of (1) converges almost surely to a limit X .
If K WD supn E jXn j2 < 1 and 0 < m < n, then
ˇ ˇ2
ˇ X ˇ X X X 2
ˇ ˇ
Eˇ cj Xj ˇ D cj ck E .Xj X k / K jcj j :
ˇ ˇ
m<j n m<j n m<kn m<j n
1
\
A2 .Xn ; XnC1 ; : : : /:
nD1
Remark F.3.4. Let t1 ;:::;tk be as in Theorem F.3.3 and denote by ft1 ;:::;tk the corre-
sponding characteristic function. Using Theorem 1.1.8 and Corollary 1.1.9 we see that
the consistency conditions can be formulated in terms of the characteristic functions
as follows:
(i) For all permutations of f1; : : : ; kg and all x1 ; : : : ; xk 2 Rd
E .X jY D y/ WD g.y/; y 2 Rd
which holds for any complex-valued Borel measurable function h on Rd such that hg
is Y -integrable.
The conditional variance of a square integrable random variable X with respect to
a -algebra B A is defined by
Var.X jB/ WD E .X E.X jB//2 jB D E.X 2 jB/ .E.X jB//2 :
Section F.4 Conditional expectation 355
In the next two theorems we collect some basic properties of the conditional ex-
pectation. Equations and inequalities for conditional expectations are meant to hold
almost surely.
Theorem F.4.1. Let X and Y be integrable random variables. The following relations
hold:
(i) EB .aX C bY / D aEB .X / C bEB .Y /; a; b 2 R or C;
(ii) if X Y , then EB .X / EB .Y /;
(iii) if X is B-measurable, then EB .X / D X ;
(iv) E .EB .X // D E.X /;
(v) jEB .X /j EB .jX j/ and kEB .X /k1 kX k1 ;
(vi) if Z is a B-measurable random variable such that Z X is integrable, then
EB .Z X / D Z EB .X /;
(vii) if B1 and B2 are sub- -algebras of A and B1 B2 , then EB2 .EB1 .X // D
EB1 .EB2 .X // D EB1 .X /;
(viii) if B and .X / are independent,6 then EB .X / D E .X /.
By the previous theorem, EB is a linear operator from L1 .P / into L1 .P /. This op-
erator is nonnegative, contractive and idempotent, the B-measurable random variables
are fixed points of EB .
Proof. Multiplying both sides of (1) by ei.t;X/ and taking expectations yields
E ei.t;X/ E.X0 jX / D E ei.t;X/ g.X / :
6 Two subsets B1 and B2 of A are called independent if the events B1 and B2 are independent for
arbitrary B1 2 B1 ; B2 2 B2 .
356 Appendix F Probability
By the uniqueness theorem for the Fourier–Stieltjes transform (cf. Theorem 1.8.3) we
have E.X0 jX D x/ d .x/ D g.x/ d .x/. Thus, E.X0 jX D x/ D g.x/ for -almost
all x from which (1) follows.
Bibliography
[1] N. I. Akhiezer, The Classical Moment Problem, Oliver & Boyd, Edinburgh, 1965.
[2] H. Bauer, Measure and Integration Theory, De Gruyter, Berlin-New York, 2001.
[3] C. Berg, J. P. R. C. Christensen, and P. Ressel, Harmonic Analysis on Semigroups,
Springer, Berlin, 1984.
[4] G. Berschneider, Spectral representation of intrinsically stationary fields, Stoch. Proc.
Appl. 122 (2012), 3837–3851.
[5] G. Berschneider and Z. Sasvári, On a theorem of Karhunen and related moment prob-
lems and quadrature formulae, in: Spectral Theory, Mathematical System Theory, Evolu-
tion Equations, Differential and Difference Equations. 21st International Workshop on
Operator Theory and Applications. Berlin, July 2010, Operator Theory: Advances and
Applications 221, pp. 171–185, Springer, Basel, 2012.
[6] T. M. Bisgaard and Z. Sasvári, Characteristic Functions and Moment Sequences. Posi-
tive Definiteness in Probability, Nova Science Publishers, Huntington, 2000.
[7] T. M. Bisgaard and Z. Sasvári, When does E.X k X j / D E.X k / E.X j / imply inde-
pendence?, Statist. Probab. Lett. 76 (2006), 1111–1116.
[8] L. E. Blumenson, A derivation of n-dimensional spherical coordinates, Amer. Math.
Monthly 67 (1960), 63–66.
[9] J.-P. Chilès and P. Delfiner, Geostatistics. Modelling Spatial Uncertainty, Wiley, New
York, 1999.
[10] B. De Finetti, Theory of Probability. Vol. 2, Wiley, New York, 1975.
[11] K. de Leeuw and I. Glicksberg, The decomposition of certain group representations,
J. Anal. Math. 15 (1965), 135–192.
[12] F. Derrien, Strictly positive definite functions on the real line, HAL: Hyper Articles
en Ligne, hal-00519325, version 1 (2010). http://hal-univ-artois.archives-ouvertes.fr/
docs/00/51/93/25/PDF/SPD.pdf.
[13] R. E. Edwards, Fourier Series. A Modern Introduction. Vol. 1, Springer, New York-
Heidelberg-Berlin, 1979.
[14] A. V. Efimov, An analogue of Rudin’s theorem for continuous radial positive definite
functions of several variables (in Russian), Trudi Instituta Matematiki i Mehaniki UrO
RAN 18 (2012), 1–8.
[15] W. Ehm, T. Gneiting, and D. Richards, Convolution roots of radial positive definite
functions with compact support, Trans. Amer. Math. Soc. 356 (2004), 4655–4685.
358 Bibliography
[16] W. Feller, An Introduction to Probability Theory and its Applications. Vol. 1, 3rd edn.,
Wiley, New York, 1968.
[17] W. Feller, An Introduction to Probability Theory and its Applications, Vol. 2, 2nd edn.,
Wiley, New York, 1971.
[18] G. B. Folland, Real Analysis, Wiley, New York, 1984.
[19] I. I. Gikhman and I. V. Skorokhod, The Theory of Stochastic Processes. I, translated
from the Russian by S. Kotz. Corrected printing of the 1st edn. Classics in Mathematics,
Springer, Berlin, 2004.
[20] T. Gneiting, Radial positive definite functions generated by Euclid’s hat, J. Multivar.
Anal. 69 (1999), 88–119.
[21] T. Gneiting, Decomposition theorems for ˛-symmetric positive definite functions, Math.
Nachr. 208 (1999), 117–120.
[22] T. Gneiting, Criteria of Pólya type for radial positive definite functions, Proc. Amer.
Math. Soc. 129 (2001), 2309–2318.
[23] T. Gneiting, Strictly and non-strictly positive definite functions on spheres, Cornell Uni-
versity Library, arXiv:1111.7077v4 [math.PR] (2012), pp. 1–30.
[24] N. Goloshchapova, M. Malamud, and V. Zastavnyi, Radial positive definite functions
and spectral theory of the Schrödinger operators with point interactions, Math. Nachr.
285 (2012), 1839–1859.
[25] V. I. Gorbachuk and M. L. Gorbachuk, M.G. Krein’s Lectures on Entire Operators,
Birkhäuser, Basel-Boston-Berlin, 1997.
[26] L. Grafakos and G. Teschl, On Fourier transforms of radial functions and distributions,
Cornell University Library, arXiv:1112.5469 [math.CA] (2012), pp. 1–12.
[27] E. Hewitt and K. A. Ross, Abstract Harmonic Analysis. I, Springer, Berlin, 1963.
[28] E. Hewitt and K. Stromberg, Real and Abstract Analysis, Springer, New York-Heidel-
berg-Berlin, 1965.
[29] N. J. Higham, Functions of Matrices. Theory and Computation, SIAM, Philadelphia,
2008.
[30] B. Jessen, Abstrakt Maalog Integraltheori, Ch. Johansens Bogtrykkeri, Kobenhavn,
1947.
[31] B. Jessen and A. Wintner, Distribution functions and the Riemann zeta function, Trans.
Amer. Math. Soc. 38 (1935), 48–88.
[32] Johnson, W. P., The curious history of Faà di Bruno’s formula, Amer. Math. Monthly
109 (2002), 217–234.
[33] A. M. Kagan, Yu. V. Linnik, and C. R. Rao, Characterization Problems in Mathematical
Statistics, Wiley, New York-London-Sydney-Toronto, 1973.
[34] K. Karhunen, Über lineare Methoden in der Wahrscheinlichkeitsrechnung, Ann. Acad.
Sci. Fenn. Ser. A I 37 (1947), 1–79.
Bibliography 359
[35] C. Kleiber and J. Stoyanov, Multivariate distributions and the moment problem, J. Mul-
tivar. Anal. 113 (2013), 7–18.
[36] M. G. Krein, On the integral representation of a continuous Hermitian-indefinite func-
tion with a finite number of negative squares. (Russian), Doklady Akademii Nauk SSSR,
Serija matematika 125 (1959), 31–34.
[37] R. G. Laha and V. K. Rohatgi, Probability Theory, Wiley, New York–Chichester–
Brisbane–Toronto, 1979.
[38] H. Langer, M. Langer, and Z. Sasvári, Continuations of Hermitian indefinite functions
and corresponding canonical systems: an example, Meth. Func. Anal. Top. 10 (2004),
39–53.
[39] G. Letac and Q. I. Rahman, A factorisation of the Askey’s characteristic function
.1 kt k2nC1 /nC1
C , Ann. Inst. Henri Poincaré, Sect. B 22 (1986), 169–174.
[40] P. Lévy, Extensions d’un théorème de D. Dugué et M. Girault, Z. Wahrscheinlichkeits-
theorie Verw. Geb. 1 (1962), 159–173.
[41] W. Maak, Fastperiodische Funktionen, Springer, Berlin, 1950.
[42] G. Matheron, The intrinsic random functions and their applications, Adv. Appl. Prob. 5
(1973), 439–468.
[43] G. Pólya, Remarks on characteristic functions, Proc. Berkeley Symp. Math. Stat. Probab.
1 (1949), 115–123.
[44] G. Pólya and G. Szegö, Problems and Theorems in Analysis, Springer, Berlin-Heidel-
berg-New York, 1972.
[45] A. Rényi, Foundations of Probability, Holden-Day, San Francisco, 1970.
[46] W. Rudin, Real and Complex Analysis, McGraw-Hill, New York, 1966.
[47] W. Rudin, Principles of Mathematical Analysis, 3rd edn., McGraw-Hill, New York,
1976.
[48] Z. Sasvári, On common fixed points of linear contractions, Proc. Amer. Math. Soc. 108
(1990), 565–566.
[49] Z. Sasvári, Positive Definite and Definitizable Functions, Akademie Verlag, Berlin,
1994.
[50] Z. Sasvári, On norm dependent positive definite functions, Monatsh. Math. 120 (1995),
319–325.
[51] Z. Sasvári, On the number of negative squares of certain functions, in: Contributions
to operator theory in spaces with an indefinite metric: The Heinz Langer anniversary
volume, Operator Theory: Advances and Applications 106, pp. 337–353, Birkhäuser,
Basel, 1998.
[52] Z. Sasvári, On generalized correlation functions of intrinsically stationary processes of
order k, Statist. Probab. Lett. 78 (2008), 3381–3387.
[53] Z. Sasvári, Correlation functions of intrinsically stationary random fields, in: Modern
Analysis and Applications: The Mark Krein Centenary Conference. Vol. 1, Operator
Theory: Advances and Applications 190, pp. 451–470, Birkhäuser, Basel–Berlin, 2009.
360 Bibliography
[54] R. Schaback and Z. Wu, Operators on radial functions, J. Comput. Appl. Math. 73
(1996), 257–270.
[55] J. A. Shohat and J. D. Tamarkin, The Problem of Moments, American Math. Soc., Prov-
idence, 1943, published in 1950.
[56] E. C. Titchmarsh, The Theory of Functions, 2nd edn., Oxford University Press, London,
1939.
[57] N. G. Ushakov, Selected Topics in Characteristic Functions, VSP, Utrecht, 1999.
[58] H. Wendland, Scattered Data Approximation, Cambridge University Press, Cambridge,
2005.
[59] E. T. Whittaker and G. N. Watson, A Course of Modern Analysis, 4th edn. (repr. 1927),
Cambridge University Press, Cambridge, 1999.
[60] D. V. Widder, The Laplace Transform, Princeton University Press, Princeton, 1946.
[61] J. Yeh, Lectures on Real Analysis, World Scientific Publishing, Singapore, 2000.
Index