MPDF
MPDF
MPDF
Sampling measurements made at discrete points, such as measurements of contaminant concentrations, can be used to
build a model for the whole site. Different methods are available to make models for contaminant concentrations at all
points within the site. Simple, more complex, and advanced interpolation methods can support these models, provided that
appropriate data requirements are met.
▼Read more
Sample points and monitoring points are individual (discrete) points collected at a particular time and at a particular place
(point). Because all points in time and space cannot be sampled, values for nearby unsampled points are inferred based on
the data collected. Geospatial methods offer systematic approaches to fill in the gaps in between sampling locations. Before
interpolating or making predictions, an appropriate geospatial method must be selected based on the characteristics of the
available data and the project requirements. See Work Flow and Flow Charts for guidance on selecting methods. Many
different interpolation approaches can be used, ranging from simple methods that are easy to apply, to more complex or
advanced methods that require significant effort to estimate the parameters used by the method. Ideally, the interpolation
approach would be both easy-to-implement and accurate. Practitioners often must compromise, however, to find an
approach that is both useable and realistic.
Data Requirements Works best with Works best when other predictor Works best when sufficient data are
larger data sets variables are available. available to estimate correlation model.
Statistical None Regression residuals are spatially Residuals after trend removal are
Assumptions uncorrelated and normally stationary and normally distributed
distributed (after transformation). (after transformation).
Provide Prediction None Yes. Prediction standard error or Yes. Prediction standard error or
Uncertainty variance provides measure of variance provides measure of
uncertainty. True uncertainty is uncertainty. True uncertainty is higher
higher when the residuals are because standard error does not
spatially correlated. include uncertainty from estimation of
model parameters.
*Conditional simulation provides a set of possible interpolated values that can be used to estimate the probability
distribution of predictions at each location
Simple methods provide a quick, simple approach to modeling spatial data. First, simple methods are conceptually simpler
because they require almost no assumptions be made about the data or the variable to be mapped, only that the sampled
data relate to one another in space or time, or both. Second, these methods are computationally simpler, so that large data
sets can be efficiently mapped. Because the simple methods do not impose much structure on the data, they generally
require more data in order to produce a good interpolation. Third, these methods are statistically simpler because they do
not include a statistical error component that can be used to estimate the error in the predictions at unsampled locations.
In contrast, more complex and advanced methods make assumptions about the statistical distribution of a sampled
population and can provide estimates of prediction error. This error derives from the inability to sample everywhere, and
from the inability of the method to match the data. More complex methods include regression type methods, which are
based on statistical assumptions about the data but do not include an explicit representation of spatial correlation. See
GSMC-1, Section 5.5.1 for information about linear regression. The advanced methods, including kriging and conditional
simulation, are based on more stringent statistical assumptions and include an explicit model of spatial correlation. The
advanced methods are also commonly known as geostatistical methods.
The output of every geospatial interpolation method is the set of interpolated values for unsampled locations of interest. The
inputs to the method are the observed data. The method can be thought of as a mathematical equation that converts the
sampled inputs into the interpolated outputs. This equation or model of the data will be complicated in some cases, but the
mathematical details are handled by the software and do not need to be fully understood by the user. One way of thinking
about how the geospatial interpolation methods differ is to distinguish between the different components (spatial trend,
spatial correlation, error) that are represented within the methods. The different components for the categories of methods
are illustrated here:
Usually, the purpose of geospatial modeling is to predict the value of a variable of interest at a set of points that have not
been sampled (called point prediction, or if kriging is being used, point kriging). In geostatistics, the term sample support is
used to describe the larger mass, length, area or time represented by a smaller sample or group of composite samples. Most
samples have point support. In some cases, however, the goal is to not to predict the variable at individual points, but rather
to predict a function of the variable (such as the average) over a set of points or an area. Calculating the average over a set
of areas using kriging is called block kriging. Block kriging allows a change from point support to block support.
If a nonlinear function of the predicted variable (such as the maximum) is of interest, or the area over which the variable
exceeds some prescribed value, then a method such as indicator kriging or conditional simulation may be appropriate.
Indicator kriging is a nonlinear, nonparametric form of kriging in which continuous variables are converted to binary
(indicator) variables. Because it is a nonparametric method, indicator kriging can handle distributions of any kind and can
handle nondetected concentrations. Conditional simulation is used to produce a series of randomly simulated predictions
that match both the data points and the geospatial model. Conditional simulation is the only method that attempts to
reconstitute the intrinsic heterogeneity of the sampled environment and can therefore provide a more robust measure of
overall uncertainty in generating spatial predictions.
Measures of Uncertainty ▼Read more
Regression and kriging methods can also provide a measure of uncertainty for each prediction called the prediction variance
or standard error. The prediction variance does not indicate the full extent of the uncertainty in the predictions because it
assumes that the geospatial model parameters are known perfectly, when in fact they must be estimated from the data. The
prediction variance is still useful, however, as a relative measure of uncertainty; it shows, for example, where additional data
collection would be most useful.
Almost all geospatial methods (except conditional simulation) have a smoothing effect, meaning that the predicted surface
is much smoother than the actual surface. As a result, the predictions tend to under-predict the high values and over-predict
the low values. One of the benefits of conditional simulation is that the simulated surfaces are not overly smooth, which is
particularly important when the focus of the investigation is on identifying values that are below or above a threshold.
More recent research in geostatistical prediction methods has focused on the development of methods that can account for
the additional prediction uncertainty that results from estimating the model parameters. These methods are generally
Bayesian methods, which assume that the model parameters are also random variables. These methods are extensions of
regression and kriging that are called hierarchical models or model-based geostatistics. Bayesian methods (Cressie and
Wikle 2011; Diggle and Ribeiro 2007) are not described in this document because these methods are rarely used for
remediation optimization.
Additional information is provided elsewhere in this guidance on sources of uncertainty and the measures of uncertainty for
the different geospatial methods.