Statistical and Geostatistical Applications in Geology: Chapter Outline
Statistical and Geostatistical Applications in Geology: Chapter Outline
Chapter Outline
9.1 Why Geostatistics? 167 9.2.15 The t-Test 177
9.2 Statistical Applications 168 9.2.16 The F-Test 178
9.2.1 Universe 168 9.2.17 The Chi-Square Test 178
9.2.2 Population, Sample, and Sampling Unit 168 9.2.18 Analysis of Variance 179
9.2.3 Probability Distribution 168 9.2.19 Trend Surface Analysis 180
9.2.4 Frequency Distribution 168 9.2.20 Moving Average 183
9.2.5 Normal or Gaussian Distribution 169 9.2.20.1 Moving Weighted Average of Block
9.2.6 Minimum, Maximum, Range, Median, Mode, Mean 183
and Mean 169 9.2.20.2 Moving Arithmetic Average 183
9.2.7 Sample Variance and Standard Deviation 171 9.2.20.3 Other Mathematical Techniques 184
9.2.8 Probability Plot 171 9.3 Misclassified Tonnage 184
9.2.9 Log-Normal or Pareto Distribution 171 9.4 Geostatistical Applications 185
9.2.9.1 Log-Normal Computation 172 9.4.1 Block Variance 185
9.2.9.2 Sichel’s t-Estimate 173 9.4.2 Semivariogram 185
9.2.9.3 Confidence Limits 173 9.4.2.1 Properties of a Semivariogram 187
9.2.9.4 Coefficient of Skewness 174 9.4.2.2 Semivariogram Model 188
9.2.10 Covariance 174 9.4.2.3 Smoothing a Semivariogram 189
9.2.10.1 Computation of Covariance 174 9.4.3 Estimation Variance 190
9.2.11 Correlation Coefficient 175 9.4.4 Kriging 191
9.2.12 Scatter Diagram 175 9.4.5 Kriging Estimation: An Example 192
9.2.13 Regression 175 9.4.6 Benefits of Statistics and Geostatistics 194
9.2.14 The Null Hypothesis 175 References 194
Statistical and geostatistical study enhances the analytical experience factors are applied to compromise as and when
and interpretive skill of the operator to visualize the inside the estimate goes wrong, often resulting in write-off
characteristics of the orebody. payable deposits or overvalued uneconomic deposits that
never deliver the predicted grades.
Author. Workers in the field of mineral resource and ore reserve
estimation (Davis, 1971) corrected this shortcoming by
9.1 WHY GEOSTATISTICS? borrowing techniques from formal statistical theory. These
include analysis of sample population, range, frequency
Geologists and mining engineers advocate that the grade of
distribution, histogram plot, mean, variance, standard de-
a mineral deposit depends on the location of a subblock
viation, correlation coefficients, analysis of variances, t-test,
within the orebody and sample value of a small block. Thus
F-test, trend analysis, and inverse distance for univariate
conventional methods of resource and reserve estimation
and multivariate elements. The statistical techniques
(refer to Chapter 8) are based on spatial position and value
compute the global reserves with an overall confidence
of the surrounding ground. There is no objective way to
limit of estimation (tonnage and grade). The classical
measure the reliability of the estimation. Professional
statistical methods assume that sample values are randomly 9.2.2 Population, Sample, and Sampling
distributed and are independent of each other. They do not Unit
include the inherent geological variance within the deposit,
and ignore spatial relationships between samples. Population consists of a well-defined set of elementsd
Krige (1951) and Matheron (1971) developed the finite or infinite. Sample is a subset of elements taken from
theory of regionalized variables to resolve this problem. a population. Geological sample collection, preparation,
Regionalized variances are random quantities that as- and chemical analysis are the most critical issues for the
sume different values depending on their position within correct representation of the extremely variable nature of
a region. In ore reserve estimation, the orebody is the mineral occurrences. The sampling unit is part of the
region and grade, thickness, etc. are the regionalized universe on which the measurement is made. It is 5 kg of
variables. This technique offers the best linear unbiased rock or an ore sample, a truckload of bulk sample, or a 3 m
estimate (BLUE) of the reserves in the context of both piece of drill core. One must specify the sampling unit to
local and global bases. It also provides a direct quanti- recognize different populations, and treat them separately.
tative measure of reliability. The procedure involves the The population of 1 m drill core, channel, or chip samples
creation of a semivariogram, sample cross-validation, is likely to be statistically very different from the popula-
and an estimation technique known as point and block tion of 10 10 10 m3 blocks of ore. The size of the
kriging. sampling unit is significant. One should never make a
statement about the population of a grade without speci-
fying what grade it isdsay 2% Cu.
9.2 STATISTICAL APPLICATIONS
Statistics is a branch of applied mathematics. It deals with
9.2.3 Probability Distribution
the development and applications of procedures to collect,
organize, analyze, interpret, present, and summarize quan- Probability is the determination of the probable from the
titative data for understanding and drawing conclusions. It possible. The possible outcome of a random selection is
uses probability theory to estimate the population parame- described by the probability distribution of its parameter.
ters present, including future prediction. Univariate sta- When a coin is tossed the possible outcomes are either
tistics describes the distribution of individual variables heads or tails, each with an independent equal probability
(Koch and Link, 1986; Davis, 2002; Sahu; 2005). The data of ½. Similarly, when a perfect dice is thrown, the possible
in earth science often demonstrate inherent relationships outcomes are either 1, 2, 3, 4, 5, or 6, each with an equal
and interdependencies between variables, and are called probability of 1/6. This is the case of a discrete variable that
bivariate or multivariate
statistics. The important param- assumes only integer values; the distribution will associate
eters are the mean X , median, mode, variance (s2 or S2), with each possible value x of probability P(x). The value
standard deviation (s or s), and coefficient of variation will be nonnegative, and summation of all possible P(x)
(CV). Statistical application prefers at least 30 samples for will be equal to one. Probability, in an applied sense, is a
drawing any inference. measure of the likeliness that a random event will occur.
Statistical procedures, tests of hypotheses, and appli- Geological events are typically not discrete, having
cations involve different assumptions, and accordingly are infinite and continuous numbers of possible outcomes. The
assembled in two groups: parametric, and nonparametric range of possible outcomes is finite. The exact result that
(Henley, 1981). Parametric statistics fundamentally relies may appear cannot be predicted within the range. Such
on the assumption that the parent populations are normally events are called continuous random variables. For
distributed, and preserve homogeneity of variance within example, one may wish to know the chance of obtaining a
groups. Nonparametric statistics does not require normal grade in the interval of 2%e4% or 6%e8% in a mineral
distribution or variance assumptions about the populations deposit. In practice, such a distribution will never be
from which the samples are drawn. known. All that can be done is to compute an experimental
probability distribution and then try to infer which theo-
retical distribution may have produced such an experi-
9.2.1 Universe mental sampling distribution, e.g., normal curve.
Universe is the total mass of interest and total source of all
possible data. Universe is characterized by one or more
attributes and can be single or multidimensional. An
9.2.4 Frequency Distribution
example of a geological universe is the area of our The measurements of some attributes, say the analytical
interestda stratigraphic province, a mineral deposit with a value of samples of a mineral deposit, can be classified into
defined boundary. The characteristics of universe change grade groups of 0%e1%, 1%e2%, 2%e3%, and so forth.
with changing boundary limits. The frequency of occurrences (number of assays per grade
Statistical and Geostatistical Applications in Geology Chapter | 9 169
Sample Class of % Zn
numbers or the highest frequency in grouped data or in a Sample mean grade X ¼ ð1=nÞ Xi
i¼1
sample population. In a series of 1, 2, 3, 4, 4, 4, 5, 5, 6 , 4
is the most frequently occurring number. There may be The % zinc contents of an exploration drill hole
more than one mode when two or more numbers have an sampled at 1 m uniform length have been tabulated in
equal or relatively high number of instances (bimode or Table 9.1 for the computation of mean grade:
multimode). A mode does not exist if no number has more P
n
than one instance. xi ¼ 248:80
i¼1
Quartiles split ascending or descending order data sets
into quarters. A quarter (25%) of the data falls below the n ¼ 72
lower or first quartile (Q1). Similarly, a quarter (75%) of
Mean X ¼ 248:80=72 ¼ 3:45 % Zn
the data lies above the upper or third quartile (Q3). Deciles
split the order data set into 10 divisions. One-tenth of the The computation of arithmetic mean assumes that
data falls below the lower or the first decile, and 1/50th is all observed values are composed of the same size of
Statistical and Geostatistical Applications in Geology Chapter | 9 171
FIGURE 9.3 Schematic diagram showing (A) asymmetric and (B) symmetric probability distribution.
pffiffiffi
sample length. Therefore unequal sampling units cannot be Confidence limit at 95% probability level ¼ S= n
utilized together to calculate valid statistical parameters. 1:96 ¼ 0:34 % Zn
Transformation of raw data to equal length of the sample
CV or relative standard deviation is the ratio between
unit is a prerequisite in statistical applications.
standard deviation and mean. CV represents the alternative
The three measures of central tendency, i.e., mode,
to skewness to describe shape of the distribution. The CV
median, and mean, will be different in an asymmetric dis-
will be 1 for normal distribution. A CV of 1 indicates
tribution curve (Fig. 9.3A). The normal distribution curve is
the presence of some erratic high sample values that have a
symmetric, and the three statistical measures of central
significant impact on estimated mean grade. The erratic
tendency will coincide (Fig. 9.3B).
high values may be due to sampling error. These samples
are sorted, located, and rejected if they do not fit in the
9.2.7 Sample Variance and Standard geological system:
Deviation
CV ¼ S X ¼ 1:476=3:455 ¼ 0:427
A reasonable way to measure sample variation about the
The computation of mean from grouped data
mean value is to subtract the mean from the observed value
(Table 9.2) is:
and sum as follows: .X
X
X Mean X ¼ ðF mpÞ F ¼ ð249=72Þ
n
Xi e X
i¼1
¼ 3:458 % Zn
As the algebraic sum of the deviation of a set of where F ¼ frequency and mp ¼ midpoint:
numbers from its mean is always zero, the foregoing for- n X . o 2
mula becomes meaningless as a measure of variance. The Sample variance S2 ¼ F mp2 N X
average variance is thus expressed as the sum of the
¼ ð1012=72Þeð3:458Þ
2
squared deviation of the mean from each observed value
(Table 9.1) divided by the number of observations 1,
¼ 14:05e11:95 ¼ 2:10
representing the degree of freedom. The population sam-
ple variance estimator is expressed as:
Pn 2
Sample varianceðS2 Þ ¼ Xi e X ðn 1Þ 9.2.8 Probability Plot
i¼1
( ! 2 ) Mean and standard deviation can be obtained from the plot
Pn X n
or; ¼ Xi2 e Xi ðn 1Þ of the upper class value limit (grade) versus the cumulative
i¼1 i¼1 frequency percentages on linear probability paper
r S2 for Zn distribution ¼ 154:638=71 ¼ 2:178 (Fig. 9.4). A normal distribution will create a straight line
on normal probability paper. The grade corresponding to
The square root of sample variance is the sample the 50% probability line gives the mean. The difference in
standard deviation estimator: grade between 84% and 50% probability gives an estima-
pffiffiffi2 pffiffiffiffiffiffiffiffiffiffiffi
¼
pffiffiffi S ¼ 2:178
pffiffiffiffiffi ¼ 1:476
Sample standard deviation
(S) tion of standard deviation.
Variance of the mean ¼ S2 n ¼p2:178ffiffiffi 72 ¼ 0:256
pffiffiffiffiffi
Standard error of the mean ¼ S= n ¼ 1:476 72 ¼ 9.2.9 Log-Normal or Pareto Distribution
0:174 pffiffiffi
pffiffiffiffiffi limit at 68% probability level ¼ S= n ¼
Or confidence The log-normal or Pareto distribution plot exhibits
1:476 72 ¼ 0:174 exponentially and is skewed to the right or left. The
172 Mineral Exploration
FIGURE 9.4 Probability plot of Table 9.2 for computation of sample mean and standard deviation of Zn grade.
TABLE 9.3 Grouped Borehole Sample Data of Baroi Lead-Zinc Mine, India
FIGURE 9.6 Plot and computation of X, S2, and confidence limit of lead sample values by double logarithmic paper.
9.2.9.4 Coefficient of Skewness parameters. Covariance (COV) is the joint variation of two
variables about their common mean:
The quantity of symmetry of the sample distribution is the ( )
coefficient of skewness or skewness. The statistics are X n
plotted as a histogram and defined as: COVðX; YÞ ¼ ðxi xÞ yi y ðn 1Þ
" # i¼1
X n
3 3 ( ! )
Coefficient of skewness ¼ 1=n ðxi xÞ d X
n X
n X
n
i¼1 ¼ x i yi e xi yi n ðn 1Þ
i¼1 i¼1 i¼1
FIGURE 9.7 Plot of scatter diagrams of two variables showing different types of correlations between them depending on mutual interdependability.
Difference (d)
Sample Set I Set II Set (IeII) (Difference)2
April 2010 49 51 2 4
May 38 31 7 49
June 33 33 0 0
FIGURE 9.9 Schematic diagram showing analysis of the t-test.
July 41 41 0 0
August 43 40 3 9
where X ¼ mean of the sample, m0 ¼ hypothetical mean
September 52 43 9 81
of the population, n ¼ number of observations, and
October 45 51 6 36
S ¼ standard deviation of observations.
November 41 32 9 81 If the computed value exceeds the table value of t at a
December 36 35 1 1 specific degree of freedom, level of significance, or lies in
January 33 30 3 9
the critical region (Fig. 9.9) or region of rejection, the null
2011 hypothesis (m1 m0) is rejected, leaving the alternative that
m1 > m0. If the computed value of t is less than the table
N 10 10 10 10
value, it can only be stated that there is nothing in the
SUM (Ʃ) 411 387 24 270 sample to suggest that m1 > m0, but it cannot specify that
AVG X 41.1 38.7 2.4 27.0 m1 < m0. This indecisiveness is a consequence of the
2 manner in which statistical tests are formulated. They can
VAR (S ) 40.77 61.57 23.60 1078.67
demonstrate, with specified probabilities, what things are
STD (S) 6.38 7.85 4.86 32.84 not. They cannot stipulate what they are.
The average monthly mill feed silver grade (g/t) of a
zinc-lead mine over 10 months is given in Table 9.5. There
are two sets of samples collected at different time intervals.
variance and shape of the distributions are also equal. The hypothetical mean grade (m0) of the deposit is 40 g/t
Alternative statistical applications such as t, F, and chi- Ag. The t-value of samples at Set I is computed totest the
square can only reject a null hypothesis or fail to reject it. hypothesis of likelihood of the means of samples X and
The evidence can state that the mean of the population from population (m0):
which the samples are drawn does not equal the specified pffiffiffi
population mean and is expressed as: t ¼ X m0 S n
. .pffiffiffiffiffi
H0 : m1 sm0 ¼ ð41:10 40:00Þ 6:38 10 ¼ ð1:10=2:02Þ
¼ 0:54
9.2.15 The t-Test The calculated value of 0.54 is much less than the table
value of t (1.83) for 9 degrees of freedom and 5% (a ¼ 0.05
The t-test follows parametric statistics assuming certain for a one-tail test) level of significance (Box 9.1). The
conditions of normal probability distribution and equality t-computed value lies away from the region of rejection.
of variance between groups and parent populations. The The test is indecisive to suggest that the population mean is
t-test or Student’s t-test is based on t-probability distribu- greater than 40 g/t Ag.
tion, similar to normal distribution with widespread, and The two sets of samples (I and II) collected at different
dependent upon the size of the samples. This is powerful time intervals are unpaired and independent. The pooled
for testing the hypothesis and useful for establishing the
likelihood that a given sample could be a member of a
population with specified characteristics (mean), or for
testing a hypothesis about the equivalency of two samples. BOX 9.1 CRITICAL VALUE OF T
The t-statistic is computed by: Critical Values of t for Various Degrees of Freedom and
pffiffiffi Selected Levels of Significance from Table 21,
t ¼ X m0 S n Penguin-Honeywell Book of Tables (Kellaway, 1968).
178 Mineral Exploration
t-estimate tests the null hypothesis and the equality of the The estimated t-value of 1.56 is less than the table value
mean of two normally distributed populations. The test of 1.83 for 9 degrees of freedom and the 5% (a ¼ 0.05 for
statistical form is stated as: the one-tail test) level of significance. It does not fall into
.n pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi o the critical region and it will be concluded that there is no
t ¼ X1 X2 Sp ð1=n1 þ 1=n2 Þ evidence to suggest that the two samples came from pop-
ulations having different means. Three assumptions are
where Sp is the pooled estimate of the population standard necessary to perform this test:
deviation based on both sample sets:
1. Both samples are selected at random.
Sp2 ¼ ðn1 1Þ S1 2 þ ðn2 1Þ S2 2 ðn1 þ n2 2Þ 2. Populations from which the samples were drawn are
. normally distributed.
Pooled t ¼ ð41:10e38:70Þ 3. Variances of the two populations are equal.
hnpffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi o pffiffiffiffiffiffiffi i
ð9 40:77Þ þ ð9 61:57Þ=18 0:2 9.2.16 The F-Test
.pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffiffiffiffiffi The F-test pertains to parametric statistics and follows the
¼ 2:40 fð366:93 þ 544:13Þ=18g 0:20 assumptions and conditions of normal probability distribu-
. pffiffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffiffiffiffiffi tion of populations from which the samples are drawn.
¼ 2:40 51:17 0:20 The equality of variances of two data sets can be found and
tested based on a probability distribution known as the
¼ 2:40=ð7:15 0:44Þ ¼ 2:40=3:146 ¼ 0:76 F-distribution under the null hypothesis. The F-test is sen-
sitive to nonnormality. F is the ratio between the variances of
The table value of t for a two-tailed test with 18 or
two populations. The F-distribution is dependent upon two
9 degrees of freedom and the 10% (5% in each tail) level of
values: each associated with each variance in the ratio. The
significance are 1.73 and þ1.73. The calculated t-value of
samples are expected to be randomly collected from a
0.76 does not fall into either critical region. The null hy-
normal distribution to satisfy the condition of F-distribution.
pothesis cannot be rejected. There is no evidence to suggest
The F-hypothesis test is defined as:
that the two samples came from populations having
different means. H 0 : d1 2 ¼ d2 2
The paired t-test provides a hypothesis test of the
differences between population means for a pair of random d1 2 < d2 2 for a lower one tailed test
samples whose differences are approximately normally
distributed. Let us assume that the two sets of samples in d1 2 > d2 2 for a lower one tailed test
Table 9.5 are a repeat analysis of silver at two different
laboratories. The paired t-value can be computed for testing d1 2 sd1 2 for a two tailed test
equivalency of the means: F ¼ Explained variance=Unexplained variance
.
Paired t ¼ d Sd
F ¼ S1 2 S2 2
where di ¼ difference of each paired sample and d ¼ mean where S1 2 and S2 2 are the sample variances of the population.
difference: It is concluded that the greater this ratio deviates from 1,
( !2 ) the stronger the evidence for unequal population variances.
2 X n Xn
In other words, the variance in the populations is not same
Sd ¼ n 2
di e di nðn 1Þ
i¼1 i¼1
in the two groups if the computed F-value exceeds the table
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi value. If the calculated value is less than the table value we
would have no evidence that the variances are different.
Sd ¼ ðSd2 nÞ
The two-sample data sets will represent the same popula-
From Table 9.5: tion in the case of S1 2 ¼ S2 2 .
d ¼ 2:4 Sd ¼ 24 Sd2 ¼ 270
9.2.17 The Chi-Square Test
Sd2 ¼ fð10 270Þe576g=ð10 9Þ ¼ 23:60
The chi-square (c2) test relates to nonparametric statistics,
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffiffiffiffiffi
Sd ¼ ð23:60=10Þ ¼ 2:36 ¼ 1:536 is typically easy to compute, and is capable of processing
data measured on a nominal scale (categorical variables).
Paired t ¼ 2:40=1:536 ¼ 1:56 c2 is a statistical test frequently used to compare observed
Statistical and Geostatistical Applications in Geology Chapter | 9 179
2
¼ 95:20eð52:77Þ 30 ¼ 2:37
BOX 9.2 CRITICAL VALUES OF c2
Critical Values of c2 for Various Degrees of Freedom and SSW ¼ SST SSA ¼ 3:84 e 2:37 ¼ 1:47
Selected Levels of Significance from Table 24,
Penguin-Honeywell Book of Tables (Kellaway, 1968).
TABLE 9.8 % Pb Assay Value of Five Samples Analyzed at Six Standard Laboratories
m[5
Zn ¼ a0 þ a1 x þ a2 y þ a3 x2 þ a4 xy þ a5 y2 þ e
ðsecond degree equationÞ
and by similar types of equations, where:
Zn ¼ dependent variable ðsample valueÞ
X; Y and Z ðelevationÞ ¼ independent variables ðlocation
of sample coordinatesÞ
e ¼ random error component;
FIGURE 9.10 Analysis of the F-test from the table value at a 5% level of a0 .an ¼ unknown coefficients.
confidence.
The number of coefficients is dependent on the degree
(NDEG) of the polynomial used, and is computed as
fðNDEG þ 1Þ ðNDEG þ 2Þg=2. The higher degree of
The results of the analysis of variances as derived from polynomial attempts to reduce the noise level in data
data set in Table 9.8 are given in Table 9.9. minimizes the sum of squares of residuals, and allows
The computed F-value of 10.17 is greater than the table definition of the trend in a unique manner. However, it is
value of 2.76 at a 5% level of confidence (Fig. 9.10). observed that after reaching a certain degree, the linear fit
Therefore it can be concluded that the variation is not the becomes static due to error propagation and the noisy na-
same in the two groups. ture of the large matrices.
Statistical and Geostatistical Applications in Geology Chapter | 9 181
In a first-degree equation we need three normal equa- The dependent variables, e.g., Pb, Zn, Fe, and width, are
tions to find the three coefficients: assumed as a numerical functional relation of criteria var-
P X X iables of x (strike) and y (level) coordinates. The trend
Zn ¼ a0 n þ a1 x þ a2 y
surface maps of lead and zinc at the fourth degree are
P X X X
generated on a longitudinal vertical section (Figs. 9.11 and
xZn ¼ a0 x þ a1 x2 þ a2 xy
P 9.12, respectively) considering local coordinates of x and y.
X X X
yZn ¼ a0 y þ a1 xy þ a2 y2 The zinc value has been computed at every observed
drill intersection point of the orebody using the fourth-
The equation in matrix form is:
degree trend surface equation. The comparison between
2 P P 3 2 3 2 P 3
n x y a0 Zn actual sample and computed values is given in Table 9.10.
6P P 2 P 7 6 7 6P 7 The analysis and observations from actual samples and
4 x x xy 5 4 a1 5 ¼ 4 xZn 5
P P P 2 P the trend surface equation (Table 9.10) are:
y xy y a2 yZn
The coefficients for the best-fitting linear trend surface Total variance (TV) ¼ 71.80
are obtained by solving this set of simultaneous equations. Unexplained variance (UV) ¼ 12.10
The trend surface technique is illustrated by a 2D case
study. A zinc-lead orebody extends along N42 WeS42 E Explained variance (EV) ¼ (TV UV) ¼ 59.70
over a strike length of 90 m. It is lens shaped and widest at
the center. The width varies between 2 and 16 m with an Goodness of fit (GF) ¼ (EV/TV) 100 ¼ 83.15%
average of 7 m. It pinches at both ends. Mineralization dips GF is a measure of efficiency of the trend equation
to the west and plunges 55 NW. The orebody has been applied, and depends on input data and inherent structure.
drilled from underground at four levels at 15 m intervals. The extreme values will lower the GF even with a large
The mineralized intersections are predominantly zinc rich database. GF is a useful measure for a homogeneous
with moderate iron and subordinate amounts of lead. The population, but is not appropriate for skewed or mixed types.
grades vary between 3.96 and 11.18 with an average of Hence lack of significance or high significance needs to be
6.87% Zn, 0.00 and 5.98 with an average of 0.40% Pb, and supported by geological interpretation. It determines the
2.44 and 15.25 with an average of 6.05% Fe. The drill functional relationship between predictor and criteria variates.
intersection values are plotted along a longitudinal section.
FIGURE 9.11 Fourth-degree trend surface map of lead content of orebody 7W, Balaria deposit, Rajasthan, India.
182 Mineral Exploration
FIGURE 9.12 Fourth-degree trend surface map of zinc content of orebody 7W, Balaria deposit, Rajasthan, India.
TABLE 9.10 Computed Value of Zinc Content of Orebody 7W, Balaria Deposit, Rajasthan, India
% Zn % Zn % Zn % Zn
Level Latitude (Sampled) (Computed) Level Latitude (Sampled) (Computed)
343 68 6.11 6.05 288 102 4.85 4.27
343 86 7.17 7.46 263 15 6.44 6.67
343 102 11.18 10.24 263 38 8.28 8.26
343 105 10.01 10.62 263 54 6.81 7.92
343 134 4.41 4.49 263 67 8.54 7.04
314 46 6.63 6.75 263 84 4.51 4.81
314 60 6.37 6.21 250 5 7.30 7.15
314 82 7.70 7.55 250 35 8.22 8.23
314 91 8.73 7.88 250 64 4.13 4.27
314 101 7.44 7.17 317 59 6.34 6.17
288 28 6.20 6.19 286 34 6.29 6.41
288 49 7.58 6.98 330 81 6.88 7.18
288 74 8.74 7.92 309 80 6.11 7.56
288 90 5.34 6.96
The accuracy of map generation is centered on the midpoint The trend surface map can indicate (1) possible conti-
of the 2D grid. The computed trend value and actual values nuity of mineralization in depth and strike directions, (2)
may show a strong divergence away from the midpoint. inherent pattern of mineralization like metal zoning and
The margins are the most inaccurate zones. Extrapolation of association, and (3) the grain of longer and shorter axes of
the trend equation beyond the control point will result in variations. It may often generate false anomalies since the
erroneous values. separation of noise into positive and negative sets is closely
Statistical and Geostatistical Applications in Geology Chapter | 9 183
linked and together they sum to zero. Smoothing of higher to extreme grade fluctuation due to the volume/variance
values and enhancement of lower values are an inbuilt part relations.
of the procedure. Subset trend maps show no relation to The weight factor (W) of a block is to be determined to
composite trend maps. In spite of those constraints, these establish a moving average based on block mean value Y .
are utilized for a generalized pattern of global variation. A block moving average design is shown in Fig. 9.13. Y e is
the estimated grade of block 1 from the averages of the
9.2.20 Moving Average exploration sample in blocks 1e9. Ye is the estimated grade
Moving average is a technique of smoothing data that occur of ore to be produced from the center block and Y 1 is the
infrequently at high and low values. It is an estimator mean assay value of the drill core in the same block. The
operating within a pattern of adjacent values by designing sample design will move to a new region once the equation
appropriate weighting criteria. The pattern may be linear is obtained:
combinations, circular, rectangular, or square window. X
n
fij ¼
Y Wk Yk
k¼1
9.2.20.1 Moving Weighted Average of
X
n
Block Mean Y ¼ bo þ bi Y i
i¼1
A sizable block of ore is removed during a mining opera-
tion. The minimum size depends on the orebody configu- Y ¼ Actual from past record
ration and extraction method. It varies from deposit to Y ¼ Mean of exploration data
deposit and mine to mine. Large blocks are less subjective
b ¼ Coefficients
The estimated Y e is based on the weighted sum of
adjacent observation Y:
X
n
e ij ¼
Y Wk Y k
k¼1
e ¼ estimated value
Y
W ¼ weight
Y ¼ adjacent block value
The sum of total weight must be equal to 1:
X
n
Wk ¼ 1
k¼1
FIGURE 9.14 Concept of moving arithmetic average grade computation of (A) underground workings and (B) geochemical grid samples on the surface.
184 Mineral Exploration
9.2.20.3 Other Mathematical Techniques sample grade. In reality, a number of blocks would have a
similar estimated grade with wide variation from true
The occurrence of erratic high values in the spatial distri-
grade. This indicates that any estimated value has a bias
bution, as in the case of gold, silver, platinum, and similar
(error) when compared with the true block values except
precious metals, can be smoothed by log transformation
where the line of best fit intersects the 45 degree line. The
(Fig. 9.15A and B).
average of all the blocks’ estimates would not necessarily
be much different than the true average grade. If we apply
9.3 MISCLASSIFIED TONNAGE a desired cutoff grade, four regions, namely, I, II, III
and IV, will be formed on the estimated true grade
Krige (1951) observed that blocks of ore estimated as high
distribution:
grade using long-standing traditional methods repetitively
indicate lower true grade after mill feed-grade reconcilia- 1. Region I is comprised of all those blocks that have both
tion. Similarly, estimated low-grade blocks signify a con- an estimated and true grade above the cutoff grade,
stant higher true grade. The true versus estimated grades of which would be and should be mined.
all blocks is plotted in Fig. 9.16. An elliptical envelope is 2. Region II includes all those blocks that have an esti-
observed with a major axis (least square fit) not parallel to mated and true grade below the cutoff grade, which
the 45 degree line from the origin. would not and should not be mined.
All the estimated versus actual points would fall on or 3. Region III includes blocks estimated to be above the
close to the 45 degree line if the blocks had been estimated cutoff grade but have a true grade below the cutoff
correctly. The estimation technique is expected to provide grade. These blocks would be mined when they should
close proximity between estimated and actual production not and would thus reduce the mine head grade.
FIGURE 9.15 Transformation of (A) fluctuating raw samples to (B) smooth values.
FIGURE 9.16 Misclassified tonnages as conceived by Krige (1951) portray that the “estimated” and “true production” grades vary considerably due to
inadequate sampling, estimation flaw, and complexity of mineralization.
Statistical and Geostatistical Applications in Geology Chapter | 9 185
pffiffiffi
sum of differences square is known, and a variogram is The next shortest sample distance is 2 unit distance
computed. Similarly, variograms in other directions within apart with total sample pairs of 12:
the study area are computed and compared. pffiffiffi pffiffiffiffiffiffiffi
In practice, a theoretical semivariogram is never real- r g 2 ¼ 26=24 or g 1:4 ¼ 1:08
ized. The gamma function g(h) is estimated from limited
points and is called the experimental variogram (Fig. 9.20). The gamma functions for all the possible lags are
Fig. 9.20 displays a number of sample points at equal computed and plotted as g(h) against lag. The semivario-
distances apart with their sample values, say thickness next gram is shown in Fig. 9.21.
to each point. We wish to compute g(1), i.e., the semi-
variogram value of all samples points at 1 unit distance 9.4.2.1 Properties of a Semivariogram
(lag) apart, irrespective of direction. There are 17 sample
9.4.2.1.1 Continuity
pairs at 1 unit apart:
Continuity is reflected by the growth rate of g(h) for a
gð1Þ ¼ 1=2Nð1 2Þ2 þ ð2 2Þ2 þ ð2 3Þ2 þ ð2 1Þ2 þ small value of h. The growth curve exhibits the region-
alized element of samples. The steady and smooth in-
ð1 4Þ2 þ ð4 3Þ2 þ ð2 1Þ2 þ ð1 1Þ2 þ crease is indicative of the high degree of continuity of
mineralization until it plateaus off at some distance. This
ð1 2Þ þ ð1 2Þ þ ð2 2Þ þ ð2 1Þ þ is known as the structured variance or explained variance
2 2 2 2
FIGURE 9.21 Typical semivariogram plot with curve fitting following the population variance after reaching the zone of influence.
188 Mineral Exploration
gðhÞ ¼ C0 þ C1 ; for h a
where C0 ¼ nugget value, C1 ¼ explained variance,
C0 þ C1 ¼ sill, h ¼ distance between points, and
a ¼ zone of influence or range.
2. Random model
FIGURE 9.23 Typical semivariogram from massive base metal deposits gðhÞ ¼ C0 þ C1 ; for all h
with moderate unexplained variance (C0).
Statistical and Geostatistical Applications in Geology Chapter | 9 189
However, b1 þ b2 must be equal to 1, but in this case 9.4.5 Kriging Estimation: An Example
(0.536 þ 0.882) ¼ 1.1418. Therefore the lag range multi-
plier is to be introduced to ensure b1 þ b2 ¼ 1. The new Point and block kriging estimates the value at a point or
matrix is: block, exclusively supported by a semivariogram. The ac-
2 3 2 3 2 3 curacy of kriging estimation, to a large extent, is propor-
yðw1 w1 Þ yðw1 w2 Þ 1 b1 yðw1 ; WÞ tional to the precision factor of the variogram. Point kriging
6 7 6 7 6 7
4 yðw2 w1 Þ yðw2 w2 Þ 11 5 4 b2 5 ¼ 4 yðw2 ; WÞ 5 has been suggested as a test for a semivariogram model.
1 1 0 1 Each sample point is kriged by a selected semivariogram
l
model without actual sample values being used in the
or: matrix. The kriged point values are compared with actual
2 3 2 3 2 3 sample values and the model is adjusted until the errors are
0 1 1 b1 0:536
6 7 6 7 6 7 minimized. This is known as the jackknife test.
41 0 1 5 4 b2 5 ¼ 4 0:882 5 The stepwise kriging computation for part of an ore-
1 1 0 l 1 body (sublevel stope) is demonstrated by seven sample
points. The sample details, semivariogram for kriging at a
r b2 þ l ¼ 0:536 central point “Q” of block ABCD (25 m 25 m), are given
in Table 9.11, Figs. 9.28 and 9.29.
b1 þ l ¼ 0:882
The semivariogram of the total population provides
b1 þ b2 ¼ 1 C0 ¼ 0.8, C1 ¼ 2.5, and a ¼ 50 m. Five samples, namely,
1, 2, 3, 5, and 6, are selected that fall within the radius of This relates interassay variance to variance contribution
influence of 30 m from computation point Q. The distances to point Q. The solution to the foregoing equations will
d1 (1 to Q), d2 (2 to Q), d3 (3 to Q), d5 (5 to Q), and d6 (6 to provide five weighting coefficients for five sample assays,
Q) are necessary for interpolation. The kriging technique and another coefficient l for the kriging estimator. The
computes distances and semivariances between 1 and 1, 1 coefficients are computed as:
and 2, 1 and 3, 1 and 5, 1 and 6, 2 and 1, 2 and 2, 2 and 3, 2
and 5, 2 and 6, 3 and 1, 3 and 2, 3 and 3, 3 and 5, 3 and a1 ¼ 0:10; a2 ¼ 0:31; a3 ¼ 0:07; a4 ¼ 0:31; a5 ¼ 0:21
6, 5 and 1, 5 and 2, 5 and 3, 5 and 5, 5 and 6, 6 and 1, 6 and and l ¼ 0:27
2, 6 and 3, 6 and 5, and 6 and 6, i.e., with five samples an
Therefore the estimated value at Q will be:
additional 5 5 intersample distances are computed.
Intersample variances eliminate the shadowing effect pre- a1 2:4 þ a2 2:76 þ a3 4:84 þ a4 6:38 þ a5 7:04
sent in sample distribution within the zone of influence.
The equations are formulated in matrix form to compute ¼ 0:1 2:4 þ 0:31 2:76 þ 0:07 4:84 þ
a semivariogram for expressing the relation between dis- 0:31 6:38 þ 0:21 7:04
tances and variances:
2 3 2 3 2 3 ¼ 4:82 % Zn
V11 V12 V13 V15 V16 a1 V1 Q
6V 7 6 7 6V Q7 The estimated value of 4.82% Zn has an associated
6 21 V22 V23 V25 V26 7 6 a2 7 6 2 7
6 7 6 7 6 7 variance. The process of computation provides extension
6 V31 V32 V33 V35 V36 7 6 a3 7 6V Q7
6 76 7 ¼ 6 3 7 variance of 2.2 from Q to 1 (blockepoint covariance),
6V 7 6 7 6V Q7
6 51 V52 V53 V55 V56 7 6 a4 7 6 5 7 while we utilized only 0.10 (a1) of this variance to get an
6 7 6 7 6 7
4 V61 V62 V63 V65 V66 5 4 a5 5 4 V6 Q 5 estimate of Q. Similarly, 0.31 has been used for the second
a1 a2 a3 a4 a5 l 1 sample out of 1.9 and so on. Summation of the variances
utilized for estimation at Q will be:
where a1 þ a2 þ a3 þ a4 þ a5 ¼ 1 (weights for coeffi-
cients) and V11 . Vnn represent the variances instead of a1 2:2 þ a2 1:9 þ a3 2:3 þ a4 1:8 þ a5 1:8
the distances, which were measured earlier. In the present ¼ 1:94
data set the following equation is obtained:
The total covariance utilized for the estimation, i.e.,
a1 0:0 þ a2 1:8 þ a3 2:5 þ a4 2:7 þ a5 1:9 ¼ 2:2
blockepoint covariance (1.94) and pointepoint kriging
a1 1:8 þ a2 0:0 þ a3 2:8 þ a4 2:5 þ a5 2:1 ¼ 1:9 covariance l (0.27), is 2.21. The semivariogram of the
orebody can explain 1.72 in terms of variance for a
a1 2:5 þ a2 2:8 þ a3 0:0 þ a4 1:9 þ a5 1:6 ¼ 2:3 block size of 25 m 25 m (an average distance of 12.5 m
from any point to another point within the block). The
a1 2:7 þ a2 2:5 þ a3 1:9 þ a4 0:0 þ a5 1:8 ¼ 1:8 residual variance of the estimate is 2.21 1.72 ¼ 0.49,
which is not explained by the estimate at Q and will be
a1 1:9 þ a2 2:1 þ a3 1:6 þ a4 1:8 þ a5 0:0 ¼ 1:8
equivalent to 0.7 in terms of standard deviation. The mean
a1 þ a2 þ a3 þ a4 þ a5 ¼ 1:0 grade (4.82% Zn) has a possible variability of 0.7 over the
194 Mineral Exploration
mean.pffiffiThe
ffi confidence interval at 95% probability will be directions have to be blended with the mathematical for-
(0.7/ 5) 1.96 or 0.61. The error of the mean is 4.82% mula supported by statistical and geostatistical tests and
Zn 0.61. Keeping other conditions constant, if we change case studies (Haldar et al., 1990).
the block size to 15 m 15 m the explained variance
equivalent to 7.5 m is 1.5 and the error of the mean by
computation will be 0.74. It can be concluded that as the
REFERENCES
block size decreases, the confidence limit value (range) will Armstrong, M., 1998. Basic Linear Geostatistics. Springer, p. 155.
increase. Clark, I., 1982. Practical Geostatistics. Applied Science Publishers, Lon-
Kriging methodology relies largely on the precision of don, p. 129.
the variogram, which in reality is not smooth and indicative David, M., 1977. Geostatistical ore reserve estimation. Developments in
of disturbances inherent within the samples. Since the Geo-mathematics 2. Elsevier Scientific Publishing Company,
Amsterdam, p. 364.
semivariogram used is a generalized average, the estimate
David, M., 1988. Handbook of applied advanced geostatistical ore reserve
may differ from the true value. Local trends existing in the
estimation. In: Developments in Geo-mathematics, vol. 6. Elsevier
data set should be removed before kriging. Scientific Publishing Company, New York, p. 232.
Davis, J.C., 1971. Statistics and Data Analysis in Geology, third ed. John
9.4.6 Benefits of Statistics and Wiley & Sons, New York, p. 550.
Geostatistics Davis, J.C., 2002. Statistics and Data Analysis in Geology, third ed. John
Wiley & Sons, New York, p. 656.
1. Statistical techniques (mean, variance, standard devia- Haldar, S.K., 1982. A study of the Balaria base metal deposit, Zawar
tion, confidence limit, correlation coefficient, and Group of Mines, Rajasthan, using numerical procedures with special
various tests) will provide sequential analysis of an reference to derivation of parameters for mine planning unpublished
exploration sample database with broad parameters at Ph. D thesis. Indian Institute of Technology, Kharagpur, India, p. 641.
any stage of exploration. It will focus on the adequacy Haldar, S.K., Bhatnagar, S.N., Paliwal, H.V., 1990. Application of geo-
statistics to exploration data for sample optimization, mine sub-block
of the exploration input and the power of decision-
estimation, and grade forecast system at Rampura-Agucha lead-zinc
making toward investing in mining, continuing explora-
mine, India. In: XXII APCOM-90, Berlin, West Germany, p. 11.
tion, or keeping the project in abeyance and waiting for Hans, W., 2013. Multivariate Geostatistics: An Introduction with Appli-
the next opportune time. cations. Springer Science & Business Media, p. 257.
2. The semivariogram parameters (g(h)) in geostatistics Henley, S., 1981. Nonparametric Geostatistics. Applied Science Pub-
quantify the range of influence of the samples in all di- lishers, London, p. 145.
rections, indicating the isotropic/anisotropic nature of Isaak, H.E., Srivastava, R.M., 1989. An Introduction to Applied Geo-
the deposit and selection of sample for estimation. statistics. Oxford University Press, Inc., p. 559
3. The semivariogram also extracts the explained (C1) and Journel, A.G., Huijbregts, C.J., 1978. Mining Geostatistics. Academic
unexplained nugget (C0) variances of the orebody that Press, London, p. 600.
control the estimation of block grades. Kellaway, F.W., 1968. Penguin-Honeywell Book of Tables. Penguin
Books Limited, p. 76.
4. Point, area, and block estimation by kriging techniques
Koch, G.S., Link, R.F., 1986. Statistical Analysis of Geological Data,
represent BLUE, which minimizes error variance.
second ed. Wiley, New York, p. 375.
5. Statistical and geostatistical applications compute asso- Krige, D.G., 1951. A statistical approach to some basic mine valuation
ciated estimation errors that are risk factors for invest- problems on the Witwatersrand. J. Chem. Metall. Min. Soc. S. Afr. 52,
ment decision. 119e139.
6. The advance estimation methods estimate reserves and Krige, D.G., 1962. Statistical application in mine valuation. J. Inst. Mine
resources (tonnage), and average grade of individual Survey. S. Afr. 12 (2 and 3), 82.
blocks for planning of selective mining and grade Krige, D.G., 1978. Lognormal-de Wijssian Geostatistics for Ore Evalua-
control. tion. South African Institute of Mining and Metallurgy, Johannesburg,
7. The techniques can optimize sampling design and den- p. 91.
sity (refer to Chapter 15, Section 15.10). Matheron, G., 1971. The Theory of Regionalized Variables and its Ap-
plications. Les Cahiers du Centre de Morphologies Mathematique de
8. The techniques are capable of analyzing the estimated
Fontainebleau, Ecole Nationale Superieure des Mines de Paris, p. 211.
and actual performances.
Rendu, J.-M., 1981. An introduction to geostatistical methods in mineral
9. Finally, a good knowledge of statistics, geostatistics, and evaluation, South African institute of mining and metallurgy mono-
the application of geology and mining software will not graph series. Geostatistics 2, 84.
be enough to estimate the reserves and resources of a de- Sahu, B.K., 2005. Statistical Models in Earth Sciences. B, S. Publication,
posit with full confidence. An inherent and in-depth un- Hyderabad, India, p. 211.
derstanding of mineralization by extensive fieldwork, an Sinclair, A.J., Blackwell, G.H., 2002. Applied Mineral Inventory Esti-
unbiased and unique sample database, and visualization mation. Cambridge University Press, UK, p. 381.
of physical and chemical variations of grade in various