Confidence Interval Estimation by Bootstrap Method For Uncertainty Quantification Using Random Sampling Method
Confidence Interval Estimation by Bootstrap Method For Uncertainty Quantification Using Random Sampling Method
To cite this article: Tomohiro Endo, Tomoaki Watanabe & Akio Yamamoto (2015)
Confidence interval estimation by bootstrap method for uncertainty quantification using
random sampling method, Journal of Nuclear Science and Technology, 52:7-8, 993-999, DOI:
10.1080/00223131.2015.1034216
Confidence interval estimation by bootstrap method for uncertainty quantification using random
sampling method
Tomohiro Endo a∗
, Tomoaki Watanabeb, † and Akio Yamamotob, †
a
Department of Materials, Physics and Energy Engineering, Aichi 464-8603, Japan; b Graduate School of Engineering,
Nagoya University, Furo-cho, Chikusa-ku, Aichi 464-8603, Japan
(Received 21 January 2015; accepted final version for publication 18 March 2015)
Confidence interval estimation by the bootstrap method is investigated for the uncertainty quantification
of neutronics calculation using the random sampling method. The random sampling method is a simple
and practical technique to quantify an uncertainty (standard deviation) of the target parameter calculated
by a core analysis code. It is noted that a statistical error is inevitably included in the estimated uncertainty
because of the probabilistic method using random numbers. In order to estimate the statistical error of
uncertainty, we focus on the bootstrap method. The bootstrap method is one of the resampling techniques
to evaluate variance and confidence interval of a sample estimate (e.g. variance) without the assumption of
normality. Through a lattice burnup calculation for a simplified boiling water reactor (BWR) fuel assembly,
it is verified that the bootstrap method can reasonably estimate the confidence interval of uncertainty of
infinite neutron multiplication factor (kinf ) due to covariance data of JENDL-4.0. In the case of this prob-
lem, the distribution of kinf is well approximated by a normal distribution; thus, the confidence interval
of uncertainty can be also estimated by the aid of chi-squared distribution. The merit using the bootstrap
method is to simply estimate the confidence interval of uncertainty without the assumption of normality.
Keywords: bootstrap method; uncertainty quantification; random sampling method; uncertainty; covariance;
numerical analysis
∗
Corresponding author. Email: [email protected].
†
Present address: Department of Materials, Physics and Energy Engineering, Aichi 464-8603, Japan.
C 2015 Atomic Energy Society of Japan. All rights reserved.
994 T. Endo et al.
the statistical errors can be reduced; however, it requires where σ is an RS vector of microscopic cross-
more computational costs. In actual situation, compu- section data, μ
is the mean vector, z is a column
tational resources are finite; therefore, the number of vector whose elements are standard normal ran-
sampling is limited depending on the computational dom numbers and are independent of each other,
costs of core analysis. In order to judge the adequacy of and A is a square root matrix of the covariance
estimated uncertainties, their confidence intervals are matrix :
useful information, i.e. the number of sampling can be
properly determined whether their confidence intervals = AAT , (2)
are acceptable. If a population of target parameters
follows a normal distribution, the confidence interval where the superscript T means transpose of a
of variance could be estimated by the aid of chi-squared matrix. The square root matrix A is numerically
distribution. However, a population does not necessar- calculated by the singular value decomposition
ily follow an ideal normal distribution; thus, a practical (SVD) for [9,11]. Namely, using the SVD, a
estimation method of the confidence interval is desirable real symmetric matrix can be decomposed as
without the assumption of normality. follows:
For this purpose, we investigate the applicability of
bootstrap method [12–14] to evaluate the confidence in- = UWUT , (3)
terval of uncertainty of neutronics calculation. With-
out the assumption of normality, the bootstrap method where U is an unitary matrix where each col-
enables to practically estimate the histogram, variance, umn corresponds to eigenvector of T and
and confidence interval for a sample estimate (e.g. mean, W is a diagonal matrix where each diagonal
variance, median) by a large number of resamples, which element corresponds to the square roots of
are obtained by “random sampling with replacement” the non-negative eigenvalue of T . Conse-
from an original data. It is noted that “random sampling quently, several hundreds of microscopic cross-
with replacement” is one of the technical terms in statis- section libraries σi are randomly generated as
tics, i.e. each element in the original data has an equal follows:
probability to be selected and it is permissible to be re-
dundantly selected.
σi = μ
+ UW1/2 zi (i = 1, 2, . . . , N) , (4)
The contents of the present paper are as follows. In
Section 2, methodologies of the RS and the bootstrap
where W1/2 is the square root matrix of W and
methods are described. Section 3 shows numerical re-
N indicates the total number of RS libraries.
sults using the bootstrap method for a burnup calcula- (3) By a forward calculation using a core analysis
tion of a typical boiling water reactor (BWR) fuel as- code system with each of sampled libraries, a
sembly. Finally, concluding remarks are summarized in target parameter xi (e.g. neutron multiplication
Section 4. factor, reactivity worth, or reaction-rate ratio) is
evaluated for each of corresponding ith libraries.
2. Methodology Consequently, N sets of target parameters are
randomly sampled.
2.1. Random sampling method (4) The uncertainty of target parameter due to co-
In this section, the RS method for core analysis is variance data can be estimated by standard de-
briefly explained as an example utilizing covariance data viation sx for the sampled target parameters xi ,
of microscopic cross-section data. Other uncertainties of i.e.
input parameters, e.g. fabrication tolerances for geome-
tries and material densities, can be similarly treated. ⎛ ⎞2
1 ⎝ 1 ⎠
N N
sx = xi − xj . (5)
(1) The covariance matrix for multi-group micro- N−1 N
i =1 j =1
scopic cross-section of each nuclide is generated
using the NJOY code [15]. It is noted that the
energy group structure of the covariance matrix If N is sufficiently large, it is expected that sx can
should be consistent with that of a cross-section be estimated with statistically satisfactory preci-
library for a lattice physics code. sion.
(2) A multi energy-group microscopic cross-section
library is generated for a lattice physics code, As described above, the UQ using the RS method
based on a multivariate normal distribution with is simple and practical procedure without any (gener-
the covariance matrix generated in Step (1). alized) adjoint calculations. Furthermore, the merit of
Namely, we have the RS method is high affinity to the burnup and the
thermal-hydraulics feedback effects in the core analysis
σ = μ
+ Az, (1) [10].
Journal of Nuclear Science and Technology, Volume 52, Nos. 7–8, July–August 2015 995
where #{sx∗b ≤ sx , b = 1, . . . , B} means the As explained above, using the bootstrap method,
number of sx∗b , satisfying the condition of sx∗b ≤ limited number of samples can be effectively utilized to
sx , and −1 ( p) is a quantile function of standard estimate the confidence interval of uncertainty without
996 T. Endo et al.
3. Numerical analysis (5) The procedures of Step (4) were carried out 200
3.1. Verification of the bootstrap method times, thereby 200 bootstrap confidence intervals
were obtained.
In order to verify the confidence interval estimated
(6) Using Step (5), we estimated a coverage prob-
by the bootstrap method (i.e. the BCa method), the un-
ability ps that the reference standard deviation
certainty of neutron multiplication factor is investigated
sk,ref was contained within the estimated confi-
in the burnup calculation using a lattice physics code. ∗ ∗
dence interval [sk,L , sk,U ]. If the confidence in-
The verification procedures are given as follows:
terval estimated by the BCa method is adequate,
(1) Using CASMO-4 as a lattice physics code [16], the value of ps should be close to 95%.
the burnup calculation was carried out for a sim-
plified BWR 9 × 9 type-B fuel assembly. For
As an example, Figure 1 shows the 95% bootstrap
simplicity, the 235 U enrichment was 3.7wt% for ∗ ∗
confidence intervals [sk,L , sk,U ] at 30 GWd/t assembly-
all fuel pins and there were no burnable poison
average burnup. In Figure 1, each plot point of +
fuel rods.
and × corresponds to the values of sk (50) obtained by
(2) The covariance data of JENDL-4.0 were used as
Equation (5); the gray bars (marked + ) indicate that
uncertainties in microscopic cross-section data
the confidence interval contains the reference value sk,ref ;
[3,4]. The uncertainties of nuclides were treated
on the other hand, the black bars (marked × ) indicate
for 18 heavy nuclides: 234 U, 235 U, 236 U, 238 U,
237 that sk,ref exists outside of the intervals. In the case of 30
Np, 238 Pu, 239 Pu, 240 Pu, 241 Pu, 242 Pu, 241 Am,
242 GWd/t, the coverage probability ps is (200 − 11)/200 ≈
Am, 243 Am, 242 Cm, 243 Cm, 244 Cm, 245 Cm and
246 94.5% and nearly equal to the confidence level (95%).
Cm. In addition, their capture, fission, elastic
Figure 2 shows the variation of ps with respect to
scattering microscopic cross-section (σc , σf , σs ),
the assembly-average burnup. As shown in Figure 2,
and number of neutrons emitted per fission (ν̄)
ps ranges from 91% to 96%. According to the above-
were randomly sampled on the basis of multi-
mentioned results, it is confirmed that the confidence
variate normal distribution with their covariance
data (see Section 2.1).
(3) CASMO-4 burnup calculations were carried out 100%
for 10,000 times with randomly sampled 10,000
libraries using the RS method. Then the uncer-
tainties (standard deviations) of infinite neutron 98%
multiplication factor kinf were estimated for all
burnup steps. Because of the adequately large 96%
number of sampling, the estimated uncertainties
ps [-]
0.010 0.010
0.008 0.008
0.004 0.004
0.002 0.002
0.000 0.000
10 100 1000 10000 10 100 1000 10000
Figure 3. Variation of 95% confidence intervals with respect to the number of randomly sampled libraries (30 GWd/t assembly-
average burnup).
interval for the uncertainty (standard deviation) of kinf n ( p is a probability, 0 < p < 1); and the con-
can be reasonably estimated using the BCa method. fidence level corresponds to (1 − α). For exam-
ple, if (1 − α) = 95% and N = 50, X−1 49 (0.975) ≈
70.222 and X−1 49 (0.025) ≈ 31.555 in Equations
3.2. Comparison with the confidence interval (13) and (14), respectively.
under the assumption of normality
(4) Steps (1), (2) and (3) were carried out from N =
In this section, the bootstrap confidence interval is 10 to N = 10,000.
compared with another confidence interval, which is es-
timated under the assumption of normality. The com- As an example, Figure 3 shows the variation of 95%
∗ ∗
parison procedures are as follows: confidence intervals [sk,L , sk,U N
] and [sk,L , sk,U
N
] with re-
spect to the number of randomly sampled libraries N,
(1) N microscopic cross-section libraries were ran- at the assembly-average burnup of 30 GWd/t. For com-
domly sampled without replacement from the parison, Figure 4 shows both widths of confidence in-
∗ ∗
original 10,000 libraries obtained in the verifi- tervals, i.e. (sk,U − sk,L N
) and (sk,U − sk,L
N
). As shown in
cation of Section 3.1. By the CASMO-4 burnup Figures 3 and 4, the width of confidence interval be-
calculations with these sampled libraries, the un- comes narrower as the number of sampled libraries N
∗ ∗
certainties (standard deviations) of kinf (denoted increases. Approximately, both widths of (sk,U − sk,L )
as sk (N)) were estimated in the same way as de-
scribed in Section 3.1.
(2) Using the BCa method for the result of Step (1), bootstrap method assumption of normality
∗ ∗
the 95% confidence intervals [sk,L , sk,U ] were 0.0100
evaluated. The total number of the bootstrap
Width of 95% confidence interval
0.0001
N−1
N
sk,U ≈ sk (N) , (14) 10 100 1000 10000
X−1
N−1 (α/2)
Number of sampled libraries
where X−1
n ( p) is the quantile function of chi- Figure 4. Comparison of widths of 95% confidence intervals
squared distribution with a degree of freedom (30 GWd/t assembly-average burnup).
998 T. Endo et al.
1.05
of hypothesis tests are needed to assess the normality.
On the other hand, the bootstrap method has an
advantage to simply estimate the confidence intervals
Infinite neutron multiplication factor
1.04
-4 -2 0 2 4 4. Conclusion
The RS method is one of the simple and practical
Theoretical quantile
UQ techniques. It is expected that the RS method has an
Figure 5. Normal quantile–quantile plot of kinf (30 GWd/t advantage for complicated numerical analysis, such as
assembly-average burnup). an LWR analysis which is a multi-step calculation with
burnup and thermal-hydraulics feedback effects. Note
that statistical errors are inevitably included in the es-
timated uncertainties in the RS method. Therefore, the
and (sk,UN
− sk,L
N
) decrease inversely proportional to confidence interval for estimated uncertainties is useful
√ information to determine the number of RS of input pa-
N, as suggested in the double-logarithmic graph of
rameters. In the present paper, the applicability of the
Figure 4. In addition, the bootstrap confidence intervals
bootstrap method, specifically the BCa method, was in-
are nearly equal to those under the assumption of nor-
∗ ∗ vestigated to evaluate the confidence interval of uncer-
mality, although the widths of (sk,U − sk,L ) are slightly
tainty estimated by the RS method. The merit of the
narrower than those of (sk,U − sk,L ).
N N
bootstrap method is to easily estimate the confidence in-
Figure 5 shows a normal quantile–quantile plot of terval of uncertainty only using limited number of sam-
kinf , which consists of 10,000 results obtained from ples, without further sampling.
all microscopic cross-section libraries. As shown in Through the burnup calculation for a BWR fuel as-
Figure 5, the linearity of plot points suggests that the sembly, it is confirmed that the bootstrap confidence
frequency distribution of kinf is well approximated by a interval is a reasonable and practical estimator for the
normal distribution. According to the Jarque–Bera test uncertainty (standard deviation) of neuron multiplica-
[17] for results of Figure 5, the null hypothesis of nor- tion factor. In the present problem, the distribution of
mality is not rejected, since the p-value is 0.28 and larger neutron multiplication factor is well approximated by
than the typical significance level 5%. Therefore, the as- a normal distribution. In such a case, based on the as-
sumption of normality is reasonable for kinf in this bur- sumption of normality, the confidence interval can be
nup calculation problem. Consequently, it is considered also estimated by the aid of quantile function of chi-
N
that the confidence intervals [sk,L , sk,U
N
] can be reason- squared distribution. On the other hand, the merit of
ably estimated in this case. the bootstrap method is to simply estimate the con-
In the case of an actual UQ procedure using the RS fidence interval without the assumption of normality;
method, the number of core analyses with randomly thus, it is expected that the bootstrap method is ap-
sampled library gradually increases until the statistical plicable to estimate the confidence intervals of uncer-
dispersions of estimated uncertainties are acceptably tainties without complicated mathematical procedures
small. As shown in Figures 3 and 4, the number of RS in the case of non-normal distribution. Furthermore, the
can be reasonably determined only from the calculated bootstrap method has capability to estimate confidence
results themselves without additional core analysis, intervals for other statistical estimates, e.g. covariance
by judging whether the estimated confidence interval or correlation coefficient between two different kinds of
is acceptable. A merit of the estimation method by target parameters. One of future tasks is further verifi-
assumption of normality is less calculation cost than cation of bootstrap method for the non-normal distri-
the bootstrap method and the confidence interval can bution and the other statistical estimates, which was not
be deterministically evaluated by Equations (13) and fully investigated in the present paper. The results of the
(14). It is noted that, however, Equations (13) and (14) present study indicate that the application of the boot-
are appropriate if the normal distribution model holds strap method for UQ of reactor physics calculations is
true for the target parameters; therefore, some kinds promising.
Journal of Nuclear Science and Technology, Volume 52, Nos. 7–8, July–August 2015 999