0% found this document useful (0 votes)
60 views16 pages

Using The Gibbs Sampler For Conditional Simulation of Gaussian-Based Random Fields

The document discusses models of random fields with continuous univariate distributions defined by operations on Gaussian random fields. It presents a model where a Gaussian random field is perturbed by a vectorial Gaussian random field, resulting in a new random field with Gaussian univariate distributions. The simulation procedure for this model is illustrated with a case study of nickel concentrations. It also describes how the univariate and bivariate distributions of the perturbed random field can be characterized, allowing it to be conditioned to data using algorithms like Gibbs sampling.

Uploaded by

haibinbn
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
60 views16 pages

Using The Gibbs Sampler For Conditional Simulation of Gaussian-Based Random Fields

The document discusses models of random fields with continuous univariate distributions defined by operations on Gaussian random fields. It presents a model where a Gaussian random field is perturbed by a vectorial Gaussian random field, resulting in a new random field with Gaussian univariate distributions. The simulation procedure for this model is illustrated with a case study of nickel concentrations. It also describes how the univariate and bivariate distributions of the perturbed random field can be characterized, allowing it to be conditioned to data using algorithms like Gibbs sampling.

Uploaded by

haibinbn
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

ARTICLE IN PRESS

Computers & Geosciences 33 (2007) 522–537


www.elsevier.com/locate/cageo

Using the Gibbs sampler for conditional simulation


of Gaussian-based random fields
Xavier Emery
Department of Mining Engineering, University of Chile, Avenida Tupper 2069, Santiago 837 0451, Chile
Received 12 May 2006; received in revised form 11 July 2006; accepted 9 August 2006

Abstract

This article presents models of random fields with continuous univariate distributions that are defined by simple
operations on stationary or intrinsic Gaussian fields. Realizations of these models can be conditioned to a set of data by
using iterative algorithms based on the Gibbs sampler, while parameter inference relies on the fitting of the sample
univariate and bivariate distributions. The proposed models are suited to the description of regionalized variables with a
spatial clustering of high or low values, patterns of connectivity and curvilinearity, or an asymmetry in the spatial
correlation of indicator variables with respect to the median threshold. The simulation procedure is illustrated by a case
study in environmental science dealing with nickel concentrations in the topsoil of a polluted site.
r 2006 Elsevier Ltd. All rights reserved.

Keywords: Geostatistics; Stochastic simulation; Gaussian random field; Indicator variogram; Bivariate distributions

1. Introduction property, pollutant concentration, mineral grade,


etc.) are often modeled by transforms of Gaussian
Geostatistical simulation offers a simple ap- random fields, for which every finite-dimensional
proach for risk assessment and decision-making in distribution is multivariate Gaussian and every
the face of uncertainty and has found wide weighted average of the variables has a Gaussian
acceptance in reservoir modeling, groundwater distribution. Many algorithms have been designed
hydrology, mining engineering, soil and environ- to produce realizations of such random fields
mental sciences, to name but a few application (Davis, 1987; Dietrich and Newsam, 1993; Lantué-
domains. This approach consists in interpreting the joul, 1994; Chilès and Delfiner, 1999). However,
attribute under study as a realization of a random despite its simplicity, the Gaussian model has
field and in generating alternative realizations of specific properties that limit its scope of application.
this field, conditioned to the available information In particular, it is characterized by maximum
on the attribute. multivariate entropy beyond the input mean and
In practice, attributes with continuous univariate covariance function (Cover and Thomas, 1991).
distributions (rock permeability, porosity, soil Entropy imposes spatial ‘disorder’ or spatial scat-
tering to the values of the realizations and prevents
Tel.:+56 2 978 4498; fax:+56 2 672 3504. patterns such as connected strings or clusters of
E-mail address: [email protected]. extreme values.

0098-3004/$ - see front matter r 2006 Elsevier Ltd. All rights reserved.
doi:10.1016/j.cageo.2006.08.003
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 523

Other random field models are needed when these 1993), which consists in composing two random
properties are not suitable for the data at hand, e.g. fields: a directing function that introduces a random
Boolean, object-based or plurigaussian models perturbation of coordinates (in the present case, the
(Lantuéjoul, 1997, 2002; Chilès and Delfiner, 1999; identity plus the vectorial field T), and a coding
Emery, 2005b). So far, most of such models process (X). Also note that Eq. (1) is quite general,
correspond to random fields with discrete univariate as the multivariate Gaussian model is met when the
distributions. This work aims at presenting new variograms in CT have infinitely small sills.
models with continuous univariate distributions and
at proposing algorithms for their conditional 2.1.2. Characterization of univariate and bivariate
simulation. The principle will be to construct distributions
complex random fields via operations on simpler Given the realization of the perturbation field, say
(Gaussian) fields, for which prior and posterior T ¼ t, Y has the same univariate distribution as X
distributions are known. The next section gives (standard Gaussian) and its multivariate distribu-
examples of random fields with Gaussian univariate tions are Gaussian, as any weighted average of Y-
distributions. Models with skewed distributions will variables is also a weighted average of X-variables,
be presented in the subsequent section. hence is normally distributed. In particular, any
pair (Y(x+h),Y(x)) has a standard bivariate
2. Random fields with Gaussian univariate Gaussian distribution with conditional correlation
distributions coefficient
 
2.1. First model: Gaussian field with random corr Y ðx þ hÞ; Y ðxÞjT ¼ t
 
coordinate perturbation ¼ cov X ðx þ h þ tðx þ hÞÞ; X ðx þ tðxÞÞ
¼ rX ðh þ tðx þ hÞ  tðxÞÞ. ð2Þ
2.1.1. Presentation of the model
A first idea to construct a random field with When restoring the randomness of T, it is seen that
Gaussian univariate distributions is to rearrange the the prior univariate distribution of Y is standard
spatial distribution of the values of a multivariate Gaussian and its prior bivariate distributions
Gaussian random field in a random fashion. For are mixtures of standard bivariate Gaussian dis-
instance, let us consider the following model in tributions with randomized correlation coeffici-
Rd ðd 2 Nn Þ: ents. Such bivariate distributions are known as
‘Hermitian’ distributions in the geostatistical litera-
8x 2 Rd ; Y ðxÞ ¼ X ðx þ T ðxÞÞ, (1)
ture (see Appendix B). Because T(x+h)T(x) is a
where T ¼ (T1, y, Td) is a vectorial Gaussian Gaussian vector with zero mean and variance-
random field with d components defined in Rd and covariance matrix 2CT(h), the prior distribution
X is an independent standard Gaussian random of (Y(x+h),Y(x)) is characterized by the random
field with stationary correlogram rX in Rd . The variable
vectorial field T appears as a random perturbation  pffiffiffiffiffiffiffiffiffiffiffiffiffiffi 
of coordinates that alters the spatial distribution of RðhÞ ¼ rX h þ 2CT ðhÞU (3)
the subjacent Gaussian field X. In the following,
assume that T1, y, Td have jointly stationary in which U is a d-dimensional random vector with
increments (intrinsic stationarity) and let CT(h) independent standard Gaussian components. The
be the d  d matrix of their simple and cross- square root of 2CT(h) is obtained by diagonalizing
variograms for lag vector h 2 Rd . this matrix in an orthonormal basis and square
When mapping the realizations of Y, a large rooting the entries of the diagonal matrix. As R(h)
variety of textures can be obtained, depending on does not depend on x, the prior bivariate distribu-
the choice of CT and rX. Four examples are shown tions of Y are second-order stationary (invariant
in Fig. 1, in which one observes patterns of spatial under translation). The prior correlogram of Y for
clustering, connectivity, curvilinearity and local lag vector h is the expected value of R(h), i.e.
anisotropies that do not occur with realizations of  
corr Y ðx þ hÞ; Y ðxÞ
stationary Gaussian random fields. Z  pffiffiffiffiffiffiffiffiffiffiffiffiffiffi 
The model presented in Eq. (1) is a particular case ¼ rX h þ 2CT ðhÞu gU ðuÞ du, ð4Þ
of the so-called substitution model (Lantuéjoul, Rd
ARTICLE IN PRESS
524 X. Emery / Computers & Geosciences 33 (2007) 522–537

Fig. 1. Four realizations of a perturbed Gaussian random field with isotropic exponential correlogram. Components of perturbation
field are assumed independent and identically distributed. Each realization corresponds to a specific perturbation variogram
model: (A) exponential, (B) cubic, (C) nested exponential plus cubic, and (D) cardinal sine.

where gU is the probability density function of U 2.1.3. Conditional simulation


(product of marginal standard Gaussian densities). Suppose now that the values of Y have been
Monte Carlo integration may be used if no observed at n sampling locations and that one
analytical expression is available for the integral in wishes to simulate Y at m other locations in Rd . To
Eq. (4). simplify notations, let us introduce A ¼ {1yn} and
To ease the inference of model parameters, I ¼ {1ym}. Let xA ¼ {xa, a ¼ 1yn} be the set of
further restrictions may be imposed to the simple sampling locations, yA ¼ {ya, a ¼ 1yn} the values
and cross-variograms of T, for instance: at these locations, and xI0 ¼ {xi0 , i ¼ 1ym} the set
of locations targeted for conditional simulation.
 Independent components: The cross-variograms The simulation problem can be solved in three
are set to zero, hence CT is defined by its diagonal steps:
entries (simple variograms of T1, y, Td).
 Intrinsic correlation: The simple and cross-vario- (1) Simulate T(xA) conditionally to Y(xA) ¼ yA.
grams are proportional to a single variogram Obtain a set of vectors tA.
model. CT is therefore characterized by this (2) Simulate T(xI0 ) conditionally to T(xA) ¼ tA and
model and the proportionality coefficients (sym- Y(xA) ¼ yA. Let tI0 denote the set of simulated
metric d  d matrix). vectors. The conditioning information actually
 Independent and identically distributed compo- reduces to T(xA) ¼ tA as, given this condition,
nents: Same as above, except that the pro- Y(xA) ¼ X(xA+tA) is independent of T [Eq. (1)].
portionality coefficient matrix is the identity (3) Simulate Y(xI0 ) conditionally to Y(xA) ¼ yA,
matrix. T(xA) ¼ tA and T(xI0 ) ¼ tI0 . Because T and X
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 525

are independent, the last two constraints can be generating two other realizations with the proposed
omitted and it remains to simulate X(xI0 +tI0 ) algorithm (Figs. 2C and D). The Gibbs sampler has
conditionally to X(xA+tA) ¼ yA. been stopped after 100 000 iterations, which corre-
sponds to an average of 500 update attempts (about
Steps (2) and (3) are straightforward, since they 340 successful updates) for each datum.
consist of the conditional simulation of second-
order stationary or intrinsic Gaussian random 2.2. Second model: random mixture of Gaussian
fields. Regarding step (1), an iterative algorithm fields
based on the Gibbs sampler is proposed hereafter.
The Gibbs sampler (Geman and Geman, 1984; 2.2.1. Model definition
Casella and George, 1992) is a particular case of Consider a weighted average of K independent
Markov Chain Monte Carlo method and allows stationary standard Gaussian fields {Y1, y, YK}
simulating random vectors with complicated dis- with the same covariance function C(h)
tributions; the key idea is to replace the sampling of
X
K
a joint probability distribution by that (simpler) of 8x 2 Rd ; Y ðxÞ ¼ wk Y k ðxÞ, (6)
univariate conditional distributions (see Appendix k¼1
C). In the present case, the procedure consists of the
where w ¼ (w1, y, wK) is a vector on the unit
following steps:
hypersphere of RK . Based on this model, Hu
(2000) proposed an iterative procedure (named
(a) Initialize T(xA) ¼ tA. For instance, t1,ytn can
gradual deformation) to calibrate the realizations
be independent Gaussian vectors with variance-
of Y to nonlinear data. At each iteration, K1
covariance matrix equal to the sill matrix
independent realizations, say {Y1, y, YK1}, are
CT(N).
combined with the optimized realization YK ob-
(b) Select an index a at random (uniformly) in A.
tained at the previous iteration.
Define B ¼ A{a}.
Let us generalize the model defined in Eq. (6) by
(c) Simulate T(xa) conditionally to T(xB) ¼ tB. The
putting
components of T(xa) have Gaussian distribu-
XK
tions; their first- and second-order moments are 8x 2 Rd ; Y ðxÞ ¼ W k ðxÞY k ðxÞ (7)
their cokriging estimates and variances (simple k¼1
cokriging if T is second-order stationary,
ordinary cokriging if T is intrinsic) (Emery, with
2004). Let ta0 be the new simulated value of Y
K 1
W 1 ðxÞ ¼ cos½T k ðxÞ,
T(xa) and ta the former value. k¼1
(d) Calculate the conditional densities associated
Y
K1
with the former and new states W k ðxÞ ¼ sin½T k1 ðxÞ cos½T ‘ ðxÞ ðk ¼ 2; :::K  1Þ,
    ‘¼k
p ¼ gY yA jta ; tB and p0 ¼ gY yA jt0a ; t0B . (5)
W K ðxÞ ¼ sin½T K1 ðxÞ; ð8Þ
As mentioned earlier, conditionally to T ¼ t,
Y is a standard Gaussian random field. Hence where T ¼ (T1, y, TK1) is a stationary vectorial
gY is a multivariate Gaussian probability density random field independent of Y1yYK, and
function; the entries of the covariance matrix are W ¼ (W1, y, WK) is a vectorial random field on
given by the correlation function in Eq. (2). the unit hypersphere of RK .
(e) Draw a uniform value u on [0,1].
(f) Substitute ta0 for ta if upp0 =p, in accordance 2.2.2. Univariate and bivariate distributions
with Metropolis algorithm (Metropolis et al., It is easy to show that the univariate distribution
1953). of Y is standard Gaussian. Let us now examine its
(g) Return to step (b) and loop many times. bivariate distributions. Given T ¼ t, the distribution
of a pair (Y(x),Y(x0 )) is standard bivariate Gaussian
As an illustration, a non-conditional realization with correlation coefficient
has been drawn on a 400  400 grid (Fig. 2A) and  

corr Y ðx0 Þ; Y ðxÞjT ¼ t ¼ Cðx0  xÞ wðx0 Þ; wðxÞ ,


sampled to produce a set of 200 data (Fig. 2B).
These data are used as conditioning values for (9)
ARTICLE IN PRESS
526 X. Emery / Computers & Geosciences 33 (2007) 522–537

Fig. 2. Coordinate perturbation model. (A) non-conditional realization, (B) conditioning data, (C) and (D) two conditioned realizations
(data locations are superimposed).

where w(x0 ) and w(x) are the realizations of W(x0 ) versa. Another example is obtained when T1 is a
and W(x), respectively, while /,S is the inner random field with stationary Gaussian increments
product in RK . Accordingly, the prior distribution (Fig. 3B).
of (Y(x0 ),Y(x)) is Hermitian (mixture of standard
bivariate Gaussian distributions) and is character- 2.2.3. Conditional simulation
ized by the random variable Let us come back to the problem of simulating Y
at locations xI0 given the values yA monitored at
Rðx0 ; xÞ ¼ Cðx0  xÞhWðx0 Þ; WðxÞi. (10)
locations xA (same notations as in Section 2.1.3).
Since T is stationary, the distribution of /W(x0 ), This problem is solved in five steps:
W(x)S only depends on x0 x, hence the bivariate
distributions of Y are stationary, i.e. shift-invariant. (1) Simulate T(xA) conditionally to Y(xA) ¼ yA and
This property holds even if T1 has stationary obtain a set of vectors tA. A Gibbs sampler
increments (intrinsic stationarity), hence the statio- procedure similar to that described in Section
narity assumption on the first component of T can 2.1.3 can be designed. At each iteration, one
be released. selects a uniform index a in A and attempts to
For instance, if K ¼ 2, one has (Matheron, 1982) update T(xa) by using Metropolis algorithm.
Note that the explicit values of Y1(xA), y, YK
Rðx0 ; xÞ ¼ Cðx0  xÞ cos½T 1 ðx0 Þ  T 1 ðxÞ. (11)
(xA) need not be simulated to achieve this step.
If the outcomes of T1 are 0 or p/2, the realizations (2) Simulate T(xI0 ) conditionally to T(xA) ¼ tA and
of Y consist of pieces of Y1 and Y2 fitted together as Y(xA) ¼ yA. The last constraint can actually be
in a jigsaw puzzle (Fig. 3A); sharp transitions may dropped, as {Y(xA)|T(xA) ¼ tA} only depends
be observed when T1 moves from 0 to p/2 or vice on Y1,y, YK [Eqs. (7) and (8)], not on T.
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 527

Fig. 3. Two realizations of a random mixture of two Gaussian fields. Random field T1 is (A) p/2 times a truncated Gaussian field, and (B)
a random field with linear variogram and stationary Gaussian increments.

(3) For k ¼ 1yK, derive the simulated values following steps:


wk(xA) and wk(xI0 ) of Wk at locations xA and
xI0 [Eq. (8)]. (1) Transform the original data into normal scores,
(4) Simulate Y1,yYK at locations xI0 conditionally i.e. data with standard Gaussian univariate
to the equality constraints distribution. Declustering weights may be
needed if the data locations are not evenly
X
K
8a 2 A; wk ðxa ÞY k ðxa Þ ¼ ya . (12) spaced in Rd .
k¼1 (2) Choose a type of Hermitian model (random
coordinate perturbation or random mixture of
Because of the linearity of these constraints and Gaussian fields).
of the independence of Y1,yYK, the conditional (3) Determine the parameters that characterize the
simulation of Yk amounts to adding the simple bivariate Hermitian distributions of Y. This step
kriging estimate of Yk and the simple kriging can be performed by trials and errors, by fitting
error calculated on a non-conditional simulation several indicator simple and cross variograms
of Yk. The kriging system is obtained from that (see Appendixes A and B).
of Y by filtering the components that do not (4) Construct realizations conditioned to the normal
correspond to Yk in the right-hand side member scores data, by using the proposed algorithms.
of the system (Wackernagel, 2003, p. 107). Also (5) Back-transform the simulated values to the scale
note that the T-values obtained in steps (1) and of the original attribute.
(2) are omitted in this step, insofar as T is
independent of Y1,yYK.
(5) Deliver Y at locations xI0 [Eq. (7)]. 3. Random fields with skewed univariate distributions

Unlike realizations of stationary Gaussian fields,


In practice, to ease steps (1) and (2), the
realizations of random fields with bivariate Hermi-
components of T should be either truncated
tian distributions can exhibit a significant spatial
Gaussian fields or stationary Gaussian fields (except
clustering of high or low values. However some
T1, which can be a random field with stationary
limitations remain, in particular regarding the
Gaussian increments).
symmetry properties of the bivariate distributions:
the indicator associated with a given percentile p has
2.3. Outline of the simulation procedure (Hermitian the same spatial correlation as the indicator
models) associated with the symmetrical percentile 1p
(Appendix B). To broaden the class of phenomena
Conditional simulation in the scope of bivari- that can be described, two random field models with
ate Hermitian models proceeds according to the skewed distributions are now proposed.
ARTICLE IN PRESS
528 X. Emery / Computers & Geosciences 33 (2007) 522–537

3.1. First model: maximum of independent Gaussian


random fields

3.1.1. Model definition


Let us consider a set of independent stationary
Gaussian random fields {T1, y, TK} and define Y in
the following fashion:
 
8x 2 Rd ; Y ðxÞ ¼ max T k ðxÞ . (13)
k2f1; :::; Kg

If K ¼ 2 and T1 and T2 have standard Gaussian


univariate distributions, then Y has a skew normal
univariate distribution. More generally, the uni-
variate and bivariate cumulative distribution func-
tions of Y are the products of that of T1, y, TK

Y
K
8y 2 R; ProbðY ðxÞoyÞ ¼ ProbðT k ðxÞoyÞ,
k¼1
8y; y0 2 R; ProbðY ðx þ hÞoy; Y ðxÞoy0 Þ
Y
K
¼ ProbðT k ðx þ hÞoy; T k ðxÞoy0 Þ. ð14Þ
k¼1

These identities are helpful for inferring the model


parameters (number K of Gaussian fields; expecta-
tion, variance and covariance function of each
Gaussian field), as they allow determining any
indicator covariance of Y as a function of that of
T1yTK. Unlike the models presented in the Fig. 4. Maximum of independent Gaussian fields. (A) Realiza-
previous section, here the spatial correlation of tion obtained by using two Gaussian fields with spherical
indicators is not symmetrical around the median covariance functions, and (B) associated quartile indicator
threshold. For instance, the upper quartile indicator correlograms.
is often better structured in space than the lower
quartile indicator (Fig. 4). A model for which the redundant with that of T1(xA), y, TK(xA), hence
lower quartile indicator has a better spatial correla- it does not need to be accounted for.
tion than the upper quartile indicator can easily be (3) Deliver Y [Eq. (13)].
obtained by using the dual model
Again, an iterative algorithm based on the Gibbs
8x 2 Rd ; Y ðxÞ ¼ min fT k ðxÞg sampler is proposed to achieve step (1). The sampler
k2f1;...;Kg
is initialized by drawing values fulfilling Eq. (13) at
¼  max fT k ðxÞg. ð15Þ the data locations
k2f1;...;Kg

3.1.2. Conditional simulation 8a 2 A; ya ¼ max ftk;a g. (16)


k2f1;:::Kg
The conditional simulation of the random field
defined in Eq. (13) can be performed in three steps Then, each iteration consists of the following steps:
(same notations as in Section 2.1.3):
(a) Select an index a at random (uniformly) in A.
(1) Simulate T1, y, TK at the data locations xA, Define B ¼ A{a}.
conditionally to Y(xA) ¼ yA. (b) For jA{1, y, K}, the distribution of Tj(xa) given
(2) Simulate T1, y, TK at locations xI0 , condition- {Tk(xB), k ¼ 1yK} is Gaussian with expected
ally to the values obtained in the previous step. value equal to the simple kriging estimate of
Because of Eq. (13), the information on Y(xA) is Tj(xa) from Tj(xB) and variance equal to the
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 529

corresponding simple kriging variance. Denote


by gj(.|tj,B) and Gj(.|tj,B) the probability density
and cumulative distribution functions of this
Gaussian distribution, and define
Y
K
pj;a ¼ gj ðya jtj;B Þ G k ðya jtk;B Þ. (17)
k¼1
kaj

(c) Simulate k in {1, y, K} with a probability mass


function proportional to (p1,a, y, pK,a).
(d) Set tk,a ¼ ya and, for j6¼k, simulate Tj(xa)
conditionally to Tj(xB) ¼ tj,B and Tj(xa)oya
(this amounts to simulating a truncated Gaus-
sian variable). Update the values of {tj,a, j6¼k}.
(e) Go back to step (a) and loop many times.

3.2. Second model: chi square random field

3.2.1. Model definition


Consider now a stationary standard Gaussian
random field T in Rd and a nonnegative real
number a. A random field Y with non-central chi
square univariate distribution is defined in the
following fashion:
8x 2 Rd ; Y ðxÞ ¼ ½a þ TðxÞ2 . (18)
If a is infinitely large, then a+T(x) is always
positive and Y turns out to be an increasing Fig. 5. Square of a Gaussian random field. (A) Realization
obtained with a ¼ 0.1 and with a spherical covariance function
transform of the Gaussian random field T; in
for T, and (B) associated quartile indicator correlograms.
particular, the spatial correlation of indicators is
symmetrical around the median threshold. In
(same notations as in Section 2.1.3):
contrast, if a is equal to zero, one obtains a random
field with bivariate gamma distributions, for which
the indicator associated with a percentile p above (1) Simulate T(xA) conditionally to Y(xA) ¼ yA.
the median is spatially more continuous than the (2) Simulate T(xI0 ), conditionally to the values
indicator associated with percentile 1p (Fig. 5) obtained in step (1). Because of Eq. (18), the
(Emery, 2005a). When a varies between 0 and N, information on Y(xA) is redundant with that of
the model defined in Eq. (18) spans intermediate T(xA), hence it does not need to be accounted
cases between these two extremes. for.
A model with a greater spatial correlation of the (3) Deliver Y [Eq. (18)].
indicators associated with percentiles below
the median can be obtained by considering a Step (1) can be achieved by Gibbs sampling. The
decreasing function of the previous random field, initialization is done by putting
pffiffiffiffiffi
for instance 8a 2 A; ta ¼ a ya  a, (20)
 
8x 2 Rd ; Y ðxÞ ¼ exp ½a þ TðxÞ2 . (19) where {ea, aAA} is a set of independent random
variables equal to 1 or 1 with equal probability.
Each iteration consists of the following steps:
3.2.2. Conditional simulation
Let us consider the model defined in Eq. (18). The (a) Select an index a at random (uniformly) in A.
steps for conditional simulation are the following Define B ¼ A{a}.
ARTICLE IN PRESS
530 X. Emery / Computers & Geosciences 33 (2007) 522–537

(b) Set ta0 ¼ ta2a as a candidate value to replace ta. chromium, copper, nickel, lead and zinc) have been
(c) Calculate the conditional densities associated measured. The reader is referred to Atteia et al.
with the former and new states (1994) for a description of the data and sampling
protocol.
p ¼ gT ðta jtB Þ and p0 ¼ gT ðt0a jtB Þ. (21)

Since T is a Gaussian random field, gT is the


density of a Gaussian variable with mean equal
to the simple kriging estimate of T(xa) from
T(xB) ¼ tB, and variance equal to the corre-
sponding simple kriging variance.
(d) Draw a uniform value u in [0,1].
(e) If upp0 =p, substitute ta0 for ta.
(f) Go back to step (a) and loop many times.

3.3. Steps for inference and simulation (skewed


models)

To specify the model parameters, the following


approach is proposed:

(1) Choose a type of model (maximum/minimum of


independent Gaussian fields, or chi square
random field).
(2) Calculate the sample indicator variograms
associated with a given set of percentiles, e.g.
the quartiles, quintiles or deciles.
(3) Choose a set of parameters for the model and
determine the indicator variograms induced by
these parameters.
(4) Check whether the theoretical expressions ob-
tained in the previous step are suitable for the
sample variograms obtained in the second step.
If not, go back to step (3) or change the model
(step (1)).

Once the parameters are specified, the original


Fig. 6. Jura pollution data set. (A) Location map of available
data can be transformed in order to fit the model data, (B) sample indicator variograms associated with extreme
univariate distribution. The transformed data are quintiles of univariate nickel distribution.
then used as conditioning information in the
simulation process. The simulated values are finally
Table 1
back-transformed to the initial scale.
Basic statistics on measured nickel concentrations

3.4. Case study: Swiss Jura data set Number of data 359
Minimum (mg/kg) 1.98
Maximum (mg/kg) 53.20
An application of the previous procedure to the Mean (mg/kg) 20.02
modeling of a pollutant concentration is now Standard deviation (mg/kg) 8.09
considered. The data set under study consists of First quintile (mg/kg) 12.48
359 soil samples that have been collected by the Second quintile (mg/kg) 18.68
Swiss Federal Institute of Technology in the Swiss Third quintile (mg/kg) 22.59
Fourth quintile (mg/kg) 26.40
Jura over a region of 1450 ha. In each sample, the
concentrations of heavy metals (cadmium, cobalt, Jura pollution data set.
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 531

Fig. 7. Sample (circles) and modeled (solid lines) direct variograms of quintile indicators (Jura pollution data set). Only omni-directional
calculations are considered.

In the following, one is interested in simulating Table 2


the nickel concentrations over the polluted site. Parameters of elementary Gaussian random fields
A map showing the location of the samples is shown First Gaussian Second Gaussian
in Fig. 6A, while basic statistics of the nickel field field
distribution are indicated in Table 1. To facilitate
the choice of a random field model, the sample Expectation 2 0
Standard deviation 5 0.1
indicator variograms associated with the extreme Covariance
quintiles of the distribution (12.48 and 26.40 mg/kg) Model Spherical Spherical
are examined. These variograms indicate that the Type Isotropic Isotropic
lower quintile indicator has a better correlation than Range (km) 1.3 0.6
the upper quintile indicator (Fig. 6B). For this
Final model is the minimum of these two Gaussian fields.
reason, a model with bivariate Hermitian distribu-
tions is discarded, since this model assumes sym- The model parameters (number K of Gaussian
metry in the spatial correlation of the indicators random fields; expectations, variances and covar-
with respect to the median threshold. Instead, the iance functions of these fields) are inferred on the
model defined in Eq. (15) (minimum of several basis of the quintile indicator direct variograms
independent Gaussian fields) will be preferred. (Fig. 7). A reasonably satisfactory fitting of these
ARTICLE IN PRESS
532 X. Emery / Computers & Geosciences 33 (2007) 522–537

Fig. 8. Sample (circles) and modeled (solid lines) cross variograms of quintile indicators (Jura pollution data set).
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 533

variograms is found when the model is defined by (3) Calculate the sample variograms of the values
the minimum of two independent Gaussian fields T1 obtained in the previous step and check whether
and T2 with the parameters given in Table 2. These they match (on average over the realizations) the
parameters are validated by examining the cross theoretical models defined in Table 2 or not.
variograms between quintile indicators: the fittings
remain quite satisfactory (Fig. 8), which proves that
the modeled bivariate distributions are suited to Fig. 9 presents the average sample variograms of
that of the data. Note that no check is done about T1 and T2 obtained by using 10 000, 30 000 and
the distributions of higher order, as the small 100 000 iterations in the implementation of the
number of data prevents one to calculate reliable Gibbs sampler. In the last case one observes that
sample statistics on these distributions. The data the sample variograms almost perfectly match the
values can now be transformed into a new set of theoretical models, indicating that convergence is
data that fit the univariate distribution of the model practically reached. Based on these results, it is
and that will be used as conditioning information to decided to achieve simulations of nickel concentra-
simulate nickel concentrations over the area of tions by stopping the Gibbs sampler after 100 000
interest. iterations, which corresponds to an average of 278
The implementation of the proposed simulation updates for each datum.
procedure (Section 3.1.2) raises the question of how As an illustration, two conditional realizations
many iterations are needed to ensure the conver- are displayed on the left-hand side of Fig. 10. The
gence of the Gibbs sampler. Since theoretical results right-hand side of this figure corresponds to two
in this respect are hard to derive, convergence will realizations obtained in the scope of the multi-
be assessed from an empirical point of view. The variate Gaussian model, in which nickel concentra-
proposed exercise consists of the following steps: tions are represented by an increasing transform of
a stationary Gaussian random field. One observes
that multivariate Gaussian realizations display no
(1) Construct a set of non-conditional realizations specific structural pattern, due to their maximum
of the two Gaussian fields T1 and T2 at the data entropy property. In particular, high and low nickel
locations, and derive non-conditional realiza- concentrations are scattered in space, giving an
tions of Y ¼ min(T1,T2) at these locations. impression of ‘diffusiveness’. In contrast, the
(2) For each realization of Y, simulate T1 and T2 at proposed model (Fig. 10, left) leads to a spatial
the data locations by using step (1) of the clustering of small nickel concentrations that reflects
algorithm described in Section 3.1.2 with a pre- the better spatial correlation of the lower quintile
specified number of iterations. indicator. These differences in the spatial distribu-

Fig. 9. Theoretical and sample variograms for Gaussian data obtained with Gibbs sampler. Sample variograms have been averaged over
50 realizations.
ARTICLE IN PRESS
534 X. Emery / Computers & Geosciences 33 (2007) 522–537

Fig. 10. Conditional realizations of nickel concentrations (Jura pollution data set). Left-hand side: model defined by minimum of two
Gaussian random fields. Right-hand side: multivariate Gaussian model.

tion may have an impact in the response to transfer ditioned to existing data. The combination of
functions, e.g. in the distribution of the upscaled models and of their parameters allow one to
nickel concentrations over remediation units. reproduce varied structural features, in particular
a spatial clustering of high or low values, patterns of
4. Conclusions connectivity and curvilinearity, local anisotropies,
or an asymmetry in the correlation of indicator
This paper presented four models of random variables with respect to the median threshold. The
fields with continuous univariate distributions and list of proposed models is far from being exhaustive
proposed algorithms to construct realizations con- and many variants could be designed.
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 535

The choice of a model rather than another may be The indicator cross variograms are derived as follows
guided by the analysis of indicator variograms. (Wackernagel, 2003, p. 147):
Hermitian models should be used if the indicator
8x; x þ h 2 Rd ; gðy; y0 ; hÞ ¼ Cðy; y0 ; 0Þ
variogram at percentile p is similar to that at
percentile 1p. Otherwise models with skewed 1
 ½Cðy; y0 ; hÞ þ Cðy0 ; y; hÞ. ðA3Þ
distributions should be preferred. Mapping the 2
values of a few non-conditional realizations can
also help practitioners to decide or to reject a Appendix B. Bivariate Hermitian distributions
particular model, depending on their background
understanding of the spatial distribution of the A bivariate distribution is said to be ‘Hermitian’
attribute under study. if its probability density function has a representa-
The Gibbs sampler proves to be a useful tool to tion of the form (Wackernagel, 2003, p. 254)
deal with conditioning information in simulation
algorithms. As for any iterative procedure, practi- 8ðy; y0 Þ 2 R2 ; gðy; y0 Þ ¼ gðyÞgðy0 Þ
tioners may wonder about its rate of convergence. Xþ1

This issue has been studied by several authors  T p H p ðyÞH p ðy0 Þ with T p ¼ EðRp Þ,
p¼0
(Chan, 1993; Roberts and Sahu, 1997; Galli and
Gao, 2001), but theoretical results are often limited ðB1Þ
or unpractical. A simple alternative is to assess the where g is the standard Gaussian probability density
rate of convergence empirically, by comparing the function, R a random variable in [1,1] and
sample variograms of the Gaussian data obtained fH p ; p 2 Ng the set of normalized Hermite poly-
with the Gibbs sampler over a few realizations with nomials
the corresponding variogram models, in order to
decide for a suitable number of iterations. 1 dp gðyÞ
8p 2 N; 8y 2 R; H p ðyÞ ¼ pffiffiffiffi . (B2)
p!gðyÞ dyp
Acknowledgements The standard bivariate Gaussian distribution with
correlation coefficient r is found when the random
This research was funded by the National Fund variable R is almost surely equal to r. In such a
for Science and Technology of Chile (FONDECYT) case, Eq. (B1) amounts to Mehler’s expansion of the
and is part of project number 1061103. The author bivariate Gaussian density (Barrett and Lampard,
acknowledges the two reviewers for constructive 1955). In the general case, the bivariate Hermitian
comments that helped to improve the manuscript. distribution is a mixture of standard bivariate
Gaussian distributions with random correlation
Appendix A. Tools for inference and validation coefficients and is fully determined by the prob-
ability distribution of R.
In general, the parameters of a random field model Let now Y ¼ fY ðxÞ; x 2 Rd g be a stationary
are inferred from the sample univariate and bivariate random field with bivariate Hermitian distributions,
distributions. Let Y be the random field of interest and R(h) the random variable characterizing the dis-
define the indicator variable associated with threshold y tribution of (Y(x),Y(x+h)) and Tp(h) the pth
moment of R(h). For any pair (y,y0 ) of real numbers,
1 if Y oy;
1Y oy ¼ (A1) the cross-variogram between 1Y(x+h)oy and 1Y(x)oy0
0 otherwise: is (Chilès and Delfiner, 1999, p. 399)
The bivariate distributions of Y are characterized by gðy; y0 ; hÞ ¼ Gðminðy; y0 ÞÞ  GðyÞGðy0 Þ  gðyÞgðy0 Þ
the indicator simple and cross non-centered covariance X H p1 ðyÞH p1 ðy0 Þ
functions. Under an assumption of stationarity, these  T p ðhÞ, ðB3Þ
functions do not depend on the current locations, but pX1
p
only on the separation vector where G is the standard Gaussian cumulative
2
8x; x þ h 2 Rd ; Cðy; y0 ; hÞ distribution function. Since Hp1 is an even
polynomial, the indicator simple variogram at
¼ Ef1Y ðxþhÞoy 1Y ðxÞoy0 g
threshold y is the same as that at threshold
¼ ProbfY ðx þ hÞoy; Y ðxÞoy0 g. ðA2Þ y: g(y; y; h) ¼ g(–y; –y; h).
ARTICLE IN PRESS
536 X. Emery / Computers & Geosciences 33 (2007) 522–537

Appendix C. Classical and metropolized Gibbs the chain, which in turn implies the invariance
samplers property (Tierney, 1994).

The Gibbs sampler aims to simulate a random References


vector T with a probability distribution pT defined
over a state space O endowed with a s-algebra A. Atteia, O., Dubois, J.P., Webster, R., 1994. Geostatistical
The principle is to construct a homogeneous analysis of soil contamination in the Swiss Jura. Environ-
Markov chain of random states, say fT i ; i 2 Ng, mental Pollution 86 (3), 315–327.
Barker, A.A., 1965. Monte Carlo calculations of the radial
the distribution of which converges to the target
distribution functions for a proton–electron plasma. Austra-
distribution. Transitions from one state to the lian Journal of Physics 18, 119–133.
subsequent one are governed by a transition Barrett, J.F., Lampard, D.G., 1955. An expansion for some
kernel P second-order probability distributions and its application to
noise problems. IRE Transactions on Information Theory 1,
8ðt; AÞ 2 ðO; AÞ; Pðt; AÞ ¼ ProbfT iþ1 2 AjT i ¼ tg 10–15.
(C1) Casella, G., George, E.I., 1992. Explaining the Gibbs sampler.
American Statistician 46 (3), 167–174.
such that P(.,A) is measurable and P(t,.) is a Chan, K.S., 1993. Asymptotic behavior of the Gibbs sampler.
probability measure on (O, A). Journal of the American Statistical Association 88 (421),
It can be shown (Tierney, 1994; Lantuéjoul, 2002) 320–326.
Chilès, J.P., Delfiner, P., 1999. Geostatistics: Modeling Spatial
that the distribution of fT i ; i 2 Ng converges to pT if
Uncertainty. Wiley, New York, NY, 695pp.
and only if the three following properties are Cover, T.M., Thomas, J.A., 1991. Elements of Information
fulfilled: (1) the chain is irreducible, i.e. one can Theory. Wiley, New York, NY, 542pp.
visit from any state to any state of positive Davis, M.W., 1987. Production of conditional simulations via the
occurrence probability in a finite number of steps; LU triangular decomposition of the covariance matrix.
Mathematical Geology 19 (2), 91–98.
(2) the chain is aperiodic, i.e. it is not forced into
Dietrich, C.R., Newsam, G.N., 1993. A fast and exact method for
cycles of fixed length between certain states; and (3) multidimensional Gaussian stochastic simulations. Water
pT is an invariant distribution for the transition Resources Research 29 (8), 2861–2869.
kernel P, that is Emery, X., 2004. Testing the correctness of the sequential
Z algorithm for simulating Gaussian random fields. Stochastic
Environmental Research and Risk Assessment 18 (6),
8A 2 A; Pðt; AÞpT ðdtÞ ¼ pT ðAÞ. (C2)
O 401–413.
Emery, X., 2005a. Conditional simulation of random fields with
When convergence takes place, the chain is said to bivariate gamma isofactorial distributions. Mathematical
be ergodic. Since all the Markov chains used in this Geology 37 (4), 419–445.
work are irreducible and aperiodic, the only critical Emery, X., 2005b. Geostatistical simulation of random fields with
bivariate isofactorial distributions by adding mosaic models.
condition to ensure ergodicity is that of invariance.
Stochastic Environmental Research and Risk Assessment 19
In the classical Gibbs sampler, the transition from (5), 348–360.
Ti to Ti+1 consists in selecting one component of the Galli, A., Gao, H., 2001. Rate of convergence of the Gibbs
current vector Ti and replacing its value by a new sampler in the Gaussian case. Mathematical Geology 33 (6),
one drawn from its univariate distribution condi- 653–677.
tional to all the other components. It can be shown Geman, S., Geman, D., 1984. Stochastic relaxation, Gibbs
distribution and the Bayesian restoration of images. IEEE
(Tierney, 1994) that pT is invariant for the transition Transactions on Pattern Analysis and Machine Intelligence 6,
kernel so defined. 721–741.
Another option to construct a transition kernel Hastings, W.K., 1970. Monte Carlo sampling methods using
fulfilling the invariance property is to use Metro- Markov chains and their applications. Biometrika 57 (1),
polis algorithm or its variants. Except for Section 97–109.
Hu, L.Y., 2000. Gradual deformation and iterative calibration of
3.1.2, this version of the Gibbs sampler is actually Gaussian-related stochastic models. Mathematical Geology
the one that has been used throughout this paper. 32 (1), 87–108.
The transition consists of the proposition of a Lantuéjoul, C., 1993. Substitution random functions. In: Soares,
candidate state according to an irreducible transi- A. (Ed.), Geostatistics Tróia’92. Kluwer Academic Publishers,
Dordrecht, pp. 37–48.
tion kernel, followed by its acceptance or rejection.
Lantuéjoul, C., 1994. Non conditional simulation of stationary
Metropolis et al. (1953), Barker (1965) and Hastings isotropic multigaussian random functions. In: Armstrong,
(1970) proposed several alternatives to define an M., Dowd, P.A. (Eds.), Geostatistical Simulations. Kluwer
acceptance criterion that ensures the reversibility of Academic Publishers, Dordrecht, pp. 147–177.
ARTICLE IN PRESS
X. Emery / Computers & Geosciences 33 (2007) 522–537 537

Lantuéjoul, C., 1997. Conditional simulation of object-based Metropolis, N., Rosenbluth, A.W., Teller, A.H., Teller, E., 1953.
models. In: Jeulin, D. (Ed.), Advances in Theory and Equation of state calculation by fast computing machines.
Applications of Random Sets. World Scientific Publishing Journal of Chemical Physics 21 (6), 1087–1092.
Company, Singapore, pp. 271–288. Roberts, G.O., Sahu, S.K., 1997. Updating scheme, correlation
Lantuéjoul, C., 2002. Geostatistical Simulation, Models and structure, blocking and parametrization for the Gibbs sampler.
Algorithms. Springer, Berlin, 256pp. Journal of the Royal Statistical Society B 59 (2), 291–317.
Matheron, G., 1982. La déstructuration des hautes teneurs et le Tierney, L., 1994. Markov chains for exploring posterior
krigeage des indicatrices. Internal Report N-761, Centre de distributions. The Annals of Statistics 22 (4), 1701–1762.
Géostatistique, Ecole des Mines de Paris, Fontainebleau, Wackernagel, H., 2003. Multivariate Geostatistics, An Introduc-
France, 33pp. tion with Applications, third ed. Springer, Berlin, 387pp.

You might also like