remarkRemark \newsiamremarkhypothesisHypothesis \newsiamthmclaimClaim
Random ordinate method for mitigating the ray effect in radiative transport equation simulations
Abstract
The Discrete Ordinates Method (DOM) is the most widely used velocity discretization method for simulating the radiative transport equation. The ray effect stands as a long-standing drawback of DOM. In benchmark tests displaying the ray effect, we observe low regularity in velocity within the solution. To address this issue, we propose a random ordinate method (ROM) to mitigate the ray effect. Compared with other strategies proposed in the literature for mitigating the ray effect, ROM offers several advantages: 1) the computational cost is comparable to DOM; 2) it is simple and requires minimal changes to existing code based on DOM; 3) it is easily parallelizable and independent of the problem setup. Analytical results are presented for the convergence orders of the error and bias, and numerical tests demonstrate its effectiveness in mitigating the ray effect.
keywords:
Random ordinate method, ray effect, discrete ordinate method, radiative transport equation.1 Introduction
The radiative transport equation (RTE) stands as a fundamental equation governing the evolution of angular flux as particles traverse through a material medium. It provides a statistical description of the density distribution of particles. The RTE has found extensive applications across diverse fields, including astrophysics [22, 26], fusion [25, 23], biomedical optics [16, 7], and biology, among others.
The steady state RTE with anisotropic scattering reads
(1.1) |
subject to the following inflow boundary conditions:
(1.2) |
Here, represents the spatial variable; denotes the direction of particle movement, and ; stands for the outward normal vector at position . gives the density of particles moving in the direction at position . The coefficients , , and represent the total, scattering cross-sections, and the source term, respectively. For physically meaningful situations, , for . The kernel is the scattering kernel, which provides the probability that particles moving in the direction scatter to the direction . For the notational convenience, we will use the symbol to denote the average over the domain associated with the indicated measure. Then, the scaled scattering kernel satisfies:
The numerical methods for solving the RTE are mainly divided into two categories: particle methods and PDE-based methods. Particle methods, specifically Monte Carlo (MC) methods, simulate the trajectories of numerous particles and collect the density distribution of all particles in the phase space to obtain the RTE solution. The MC methods are known to be slow and noisy but are easy to parallelize and suitable for all geometries [10, 3]. Meanwhile, the PDE-based methods are more accurate and can be faster, but they are not as flexible as the MC method for parallel computation and complex geometries [21]. In this paper, we are interested in the PDE-based method.
The Discrete Ordinates Method (DOM) [2, 6] is the most popular velocity discretization method. DOM approximates the solution to (1.2) using a set of discrete velocity directions , which are referred to as ordinates. The integral term on the right-hand side of Equation (1.1) is represented by weighted summations of the discrete velocities. DOM retains the positive angular flux and facilitates the determination of boundary conditions. However, solving the RTE using the standard DOM is expensive due to its high dimensionality since the RTE has three spatial variables and two velocity direction variables.
In real applications, people are usually interested in the spatial distributions of some macroscopic quantities, such as the particle density , the momentum , etc. Thus, the spatial resolution must be adequate. The computational costs can be significantly reduced if one can obtain the right macroscopic quantities by using a small number of ordinates in DOM.
One natural question arises: Can we improve the accuracy of macroscopic quantities without increasing the number of ordinates? Considerable effort has been invested in discovering a quadrature set with high-order convergence. For the 1D velocity in slab geometry, Gaussian quadrature exists, exhibiting spectral convergence when the solution maintains sufficient smoothness in velocity. However, devising a spectrally convergent 2 or 3-dimensional Gaussian quadrature remains unclear. Furthermore, in section 2, we highlight, through numerical tests, that the solution’s regularity in the velocity direction can be considerably low in some benchmark tests. Consequently, it remains uncertain whether better approximations can be expected, even if a 2 or 3-dimensional Gaussian quadrature with high convergence order for smooth solutions is identified.
When employing a limited number of ordinates in DOM, the ray effect becomes noticeable in numerous 2D spatial benchmark tests [11, 19]. The macroscopic particle density exhibits nonphysical oscillations along the ray paths, particularly noticeable when the inflow boundary conditions or radiation sources demonstrate strong spatial variations or discontinuities. The ray effect stems from particles being confined to move in a limited number of directions. As highlighted in section 2, in benchmark tests displaying the ray effect, we observe low regularity in velocity within the solution, indicating a low convergence order for DOM. In order to mitigate the ray effect, one has to increase the number of ordinates [27], which will significantly increase the computational cost.
The ray effect stands as a long-standing drawback in DOM simulations, and several strategies have been proposed to mitigate these ray effects at reasonable costs. Examples include approaches discussed in [4, 24, 28, 33]. The main idea is to use biased or rotated quadratures or combine the spectral method with DOM. In this paper, inspired by the randomized integration method [29], we introduce a random ordinate method (ROM) for solving RTE. Compared with other ray effect mitigating strategies proposed in the literature, the advantages of ROM are: 1) the computational cost is comparable to DOM; 2) it is simple and makes almost no change to all previous code based on DOM; 3) It is easy to parallelize and independent of the problem setup.
Randomized algorithms can achieve higher convergence order when the solution regularity is low [29]. The concept of the randomized method is straightforward: when approximating , the integral interval is partitioned into cells with a maximum size of . Subsequently, an is chosen from each interval, and is approximated by , where represents the quadrature weights. With a fixed set of , one can expect uniform first-order convergence for general that is Lipschitz continuous. On the other hand, when one randomly chooses a point inside each interval and keeps using the same , the expected error can achieve convergence, whereas the expectation of all randomly chosen quadrature provides convergence. Randomized algorithms have been developed and analyzed for simple problems, such as the initial value problem of ordinary differential equation (ODE) systems [32, 31], whose complexity has been studied in [13, 9], and for stochastic differential equations [8], whose complexity is analyzed in [5].
The main idea of ROM is that the velocity space is partitioned into cells, and a random ordinate is selected from each cell. A DOM system with those randomly chosen ordinates is then solved. The ROM solutions’ expected values can achieve a higher convergence order in velocity space. Both theoretical and numerical results indicate that, even with carefully chosen quadratures in DOM, it can only achieve a similar convergence order as ROM when the solution regularity is low. Consequently, the accuracy doesn’t decrease for a single run. However, averaging multiple runs can lead to a higher-order convergence and mitigate the ray effect. Since different runs employ distinct ordinates, ROM allows for easy parallel computation. Therefore, one can mitigate the ray effect by running a lot of samples with a few ordinates and then calculating their expectation.
This paper is structured as follows: Section 2 delves into the ray effects of DOM and illustrates the low regularity of the solution in velocity space. Details of the ROM are described in Section 3. In Section 4, analytical results are presented for the convergence orders of the error and bias when ROM is applied to RTE with isotropic scattering in slab geometry. Section 5 displays the numerical performance of the ROM, demonstrating its ability to mitigate the ray effect. Finally, Section 6 concludes the paper with discussions.
2 Ray effects and low regularity
2.1 Discrete Ordinate Method and Ray Effects
The DOM is the most popular angular discretization for RTE simulations, it writes [18]:
subject to the following inflow boundary conditions:
where is the weight of the quadrature node , satisfying ; represents the index set of the quadrature ; denotes the discrete scattering kernel; . Then and, for all ,
In slab geometry, where and , the RTE reads, for :
subject to the boundary conditions:
(2.1) |
The DOM in slab geometry takes , where is an integer. The discrete ordinates () satisfy:
Therefore, the DOM in slab geometry becomes:
(2.2) |
subject to the boundary conditions:
Two commonly used quadratures are the Uniform quadrature and the Gaussian quadrature. For the uniform quadrature, is divided into equally spaced cells, each of size . The values represent the midpoint of each cell, i.e., for and for , while . For the Gaussian quadrature, consists of distinct roots of Legendre polynomials of degree denoted by , and the weights .
For RTE in the X-Y geometry, where the 3D velocity on a unit sphere is projected to a 2D disk, suppose that the DOM has points in each quadrant. Then, the 2D discrete velocity directions defined on a disk are for when is an integer. The DOM in X-Y geometry is expressed as follows, for and :
where
The boundary conditions become
Two kinds of quadratures discussed in [21] are considered. Each quadrant has discrete velocities, and we only show the details for the first quadrant such that and . The discrete velocities in other quadrants are obtained by symmetry.
The first one is referred to as ”2D uniform quadrature”. Each quadrant has ordinates, and the nodes are uniform in the plane. More precisely, for ; and . Then for and all , there exists a pair of integers with such that and
(2.3) |
The second one is the 2D Gaussian quadrature described in [21], for which each quadrant has ordinates. Each quadrant has distinct , , which are the positive roots of , the Legendre polynomial of degree . They are arranged as
Each corresponds to distinct , and the weight for the velocity direction is uniform in such that
Then for and all , there exists a pair of integers such that , , and and
The rest part of the quadrature set can be constructed by symmetry:
Let’s take , the chosen discrete ordinates of Uniform and Gaussian quadratures in one quadrant are plotted in Figure 1. The selected ordinates on the surface of a 3D unit sphere and their corresponding projections to the 2D unit disk are displayed.
It has long been known that the solution of DOM exhibits the ray effect, especially when there are discontinuous source terms in the computational domain [4, 24, 33]. This phenomenon cannot be improved by increasing the spatial resolution. We show one typical example to demonstrate the ray effects.
Example 2.1.
We consider RTE in the X-Y geometry with a localized source term at the center of the computational domain. Let
The inflow boundary conditions are zero.
We consider the isotropic and anisotropic scattering with the following scattering kernel
(2.4) |
where is the included angle between and . When , (2.4) gives isotropic scattering, meaning particles moving with velocity will scatter into a new velocity with uniform probability for all . When , (2.4) gives anisotropic scattering, implying that when the included angle between and is smaller, the probability that particles moving with velocity scatter into is higher.
The computational domain is partitioned into spatial cells. We employ Uniform quadrature sets with various numbers of ordinates. The numerical results of the average density are displayed in Figure 2. From left to right, , , and ordinates of Uniform quadrature as in (2.3) are used. We can observe that the solutions exhibit rays that correspond to the chosen ordinates of the DOM. The solutions have poor accuracy, violate the rotational invariance property, and this phenomenon does not disappear with the refinement of the spatial grids.
2.2 Low regularity and Convergence order
In this subsection, we first demonstrate numerically that for the tests that exhibit ray effects, the solutions usually have a low regularity in velocity space. The 2D solution has sharp transition in velocity space and the positions of the transitional points are spatially dependent. No matter what quadrature sets are chosen, high order convergence can only be achieved for smooth functions. Therefore, it is difficult to improve the solution accuracy without increasing the quadrature nodes when the regularity is low. Then, we show numerically, in both slab and X-Y geometries, that the convergence order of DOM decreases as the solution regularity in the velocity space decreases.
2.2.1 The slab geometry case
We solve equation (2.2) and test both the Uniform and Gaussian quadratures. We employ second-order finite difference spatial discretization in [14] to obtain the numerical results. The number of spatial cells is fixed to be , and the grid points are (). Let be the solution to (2.2), and the average density . The reference solution is computed using ordinates. The errors of the numerical solutions with different ordinates are defined by
Though there is no ray effect in slab geometry, we will show in the following example that when the inflow boundary conditions have low regularity in velocity space, the convergence orders of DOM are low.
Example 2.2.
Let the computational domain, total and absorption cross sections, scattering kernel and source term be respectively
We consider three different inflow boundary conditions:
-
Case 1:
Continuous and smooth inflow boundary conditions:
(2.5) -
Case 2:
Continuous but non-differentiable inflow boundary conditions:
(2.6) -
Case 3:
Discontinuous inflow boundary conditions:
(2.7)
The convergence orders of DOM for different cases are shown in Figure 3. One can observe that for both Uniform and Gaussian quadratures, the convergence orders decrease from 2 to 1 when the inflow boundary conditions change from Case 1 to Case 3. Therefore, one cannot expect a high convergence order when the regularity of the inflow boundary conditions is low. In particular, Gaussian quadrature does not reach spectral convergence as shown in Figure 3(b). This is because, though the inflow boundary condition is smooth in , at the boundary, the solution jumps at . Gaussian quadrature does not provide spectral convergence for solutions with a jump at .
2.2.2 The X-Y geometry case
We solve Example 2.1. The classical second-order diamond difference (DD) method [21] is adopted for the spatial discretization. Since only the convergence order in the velocity variable is considered in this paper, after spatial discretization, the reference solution can be considered as a large integral system for unknowns at the given spatial grids. We use the same spatial mesh and discretizations throughout the paper, eliminating the need to consider the error introduced by the spatial discretization. We would like to emphasize that the main purpose of our current work is to show the convergence orders of the velocity discretization; any spatial discretization can be chosen to obtain the numerical results.
The spatial domain is divided into uniform cells. Let , , , and
We use DD with , . The grid points of the two-dimensional DD method are at the cell centers, i.e. approximations of and the average density (for , ) are obtained. The errors between the reference solution and the numerical solutions obtained by different quadratures are defined by
(2.8) |
The reference solution of the Uniform (Gaussian) quadrature () is computed by , which indicates () ordinates on the 2D disk. As shown in Table 1, the convergence orders of both quadratures are around 0.74. In Figure 4, we plot the velocity distribution of on the plane near two midpoints of the left and right boundaries. It can be seen that the solution varies rapidly in the velocity variable.
Order | ||||||
---|---|---|---|---|---|---|
Uniform | g=0 | 4.337E-03 | 5.560E-03 | 8.672E-03 | 1.629E-02 | 0.73 |
g=0.9 | 4.354E-03 | 5.580E-03 | 8.707E-03 | 2.075E-02 | 0.73 | |
Gaussian | g=0 | 3.966E-03 | 5.496E-03 | 8.862E-03 | 1.477E-02 | 0.75 |
g=0.9 | 3.978E-03 | 5.518E-03 | 8.902E-03 | 1.491E-02 | 0.75 |
3 Random ordinate method
The ROM is based on DOM, but the ordinates are chosen randomly. More precisely, the ROM is performed as following:
-
1.
The velocity space is divided into cells and each cell is denoted by (). The maximum area of all is denoted by . For example in 1D, , if uniform mesh is employed, then .
-
2.
Sample randomly one ordinate from each cell with uniform probability. Denote as the tuple of random ordinates and as the index set.
-
3.
Solve the resulting discrete ordinate system with the randomly chosen velocity directions.
(3.1) subject to the boundary conditions
(3.2)
Since are now randomly chosen, to guarantee the solution accuracy, one has to determine the corresponding discrete weights and discrete scattering kernel . We will simply choose the following approximation:
(3.3) |
where
(3.4) |
Such a choice clearly satisfies . If , then
Therefore, the expectation of the weighted summation provides a good approximation to the integration on the right hand side of (1.1). More precisely,
where means the diameter of the -th cell. By employing such a straightforward strategy, there exists the possibility that , potentially leading to a violation of mass conservation at the discrete level. Nonetheless, the positivity of the solution is maintained. Since we consider only total and scattering cross sections in this current work, only a small error is introduced.
According to [6, 17], symmetric ordinates perform better, especially when there are multiscale parameters in the computational domain. Although we do not consider multiscale parameters in this current paper, symmetric ordinates are used in the ROM. More precisely, in slab geometry, let and . () are randomly sampled from with a uniform distribution. Then
In the X-Y geometry, let . is the 1/4 disk in the first quadrant, and () are randomly sampled from with a uniform distribution. The symmetric ordinates indicate that, for ,
(3.5a) | |||
(3.5b) |
4 The convergence of ROM
As demonstrated in section 2, when the solution regularity is low in velocity space, the convergence orders of given quadratures are also low. At first glance, there is no benefit of using the ROM. For a given quadrature , one can measure the numerical errors by the difference between and the reference average density . The accuracy of cannot be improved by using random ordinates. However, if we consider the following two quantities: the first one is the expected single-run error , which gives the expected error of one run; the second quantity is the bias , which gives the distance between the expected value of and the reference solution, we will see from the convergence analysis in the subsequent part that the convergence order of bias is higher even if the solution regularity in velocity space is low. Therefore, if we take more samples of , run the system (3.1) multiple times in parallel, and then take the expectation , the solution accuracy can be improved.
To illustrate the idea, we provide the convergence proof for isotropic scattering in slab geometry. We will show that a single typical run of ROM can give a order of convergence, and the expectation of multiple runs gives a rd order convergence. The main idea is to expand the solution into the summation of a convergent sequence, then estimate the error and bias of the proposed ROM. The idea can be extended to a higher dimensional case (X-Y geometry, etc.) However, the proof is more technical, and we will leave it for future work.
4.1 Expansion of the solution.
The RTE in slab geometry with isotropic scattering kernel (i.e., and ) writes
(4.1) |
where , subject to the inflow boundary conditions (2.1).
Let , the equation (4.1) can be rewritten into
(4.2) |
where
The solution to (4.2) can be expanded as
(4.3) |
where satisfies
(4.4) |
with the boundary conditions (2.1). () satisfies
(4.5) |
with zero inflow boundary conditions. Since , if are uniformly bounded for all , the summation on the right hand side of (4.3) converges. Then it is easy to verify that (4.3) satisfies the original equation (4.1)[15].
Expansion in operator form
Solving (4.4) yields
(4.6a) | ||||
(4.6b) |
Similarity, the solution to (4.5) is
(4.7) |
It would be convenient to denote the solution operator in (4.7) by such that
Then, the solution in (4.6) can be rewritten as
where
In (4.1), multiplying and taking the integral with respect to , the skew-symmetric term can be eliminated to yield some boundary terms. This then gives the weighted norm of with weight . Hence, we consider the space with inner product
(4.8) |
Then, can be viewed as the integral operator on with kernel
More precisely,
We then introduce the iteration operator :
(4.9) |
which satisfies
and
Therefore, the average density is given by
(4.10) |
In ROM, the magnitude of relates to the mesh size thus in order to estimate the convergence order, we introduce the rescaled weights
so that and each . One run of ROM is to solve
with
Similar as for (4.10), the average density for ROM can be written as
(4.11) |
where the iteration operator becomes
(4.12) |
Our goal is to estimate the difference between and in terms of the expected single run error and bias, using the expansions listed above.
4.2 Main result and the proof
Similar as in [12], the singularity near will affect the estimates of the order of convergence. We will thus consider a truncated approximation similar to Grad’s angular cutoff for the Boltzmann equation and prove the convergence of ROM for the truncated system, using the expansion introduced above.
We take and consider the truncated velocity space . The difference between the truncated systems and the original systems can be controlled by . With this truncated approximation, one has
The average density for this truncated system is then given by (4.10) with being replaced by and being replaced by . We choose to consider the average on to approximate the average density so that the notations in formulas like (4.10) will not change. One may divide and into subintervals respectively and perform the ROM. Then, the weights are changed into . Correspondingly, the formula (4.11) will not change as well, and one just uses the new weights. From a practical viewpoint, this truncated system makes sense since the numerical velocities will not touch . For notational convenience, we will drop , and understand and to be the truncated ones.
The main convergence result for ROM is the following.
Theorem 4.1 (main result).
Consider the truncated systems and suppose that the rescaled weights are uniformly bounded. Then, there exists such that for , the expected single run error satisfies
and the bias satisfies
Here, the norm used is the norm in (4.8).
Proof 4.2 (Proof of Theorem 4.1).
Let . Then
By the definitions of , in (4.9) and (4.12) and the symmetry of the ordinates, we can then write
Here, and () are not independent, and this is why we put them together. Since (), one has
(4.13) |
Comparing (4.10) and (4.11), we denote
and
(4.14) |
Here, and are not independent either. In the estimate of , one needs to put them together as well. One can refer to the supplementary material for the details.
Below, the norm for functions will be norm and the norm for the operators will be the operator norm from to . According to (4.10) and (4.11) the expected single run error is then controlled as below
The bias can be controlled similarly.
Here, the difference of bias from the expected single run error is that the terms involving a single or vanishes under expectation. The summation index starts from 1 in and from 2 in , and the inner index starts from . This is because, from (4.13), . For , the index starts from and from , because by (4.14).
Therefore, we need to control , , and for to estimate the error and bias. We can establish the following estimates.
-
•
Lemma SM1.1 shows that , and .
-
•
Corollary SM1.8 tells that .
-
•
Lemma SM1.9 proves that .
The estimate for is the most difficult one as it involves the concentration of norms for random operators in Hilbert spaces. It is established by a type of Rosenthal inequality. Other estimates are relatively straightforward. See the detailed proof in the Supplementary material.
Using the estimates above, one then has
and
Since the series and converges, one then concludes that
Since , one has . When is large enough, , then converges and the series in the front of is controlled by a constant independent of . The estimation of then follows from the estimates of and the Hölder inequality. The estimates for and are similar to and we omit the details. The estimates for the expected single run error then follows.
Next, we consider the bias. The estimate for the three terms are similar and we take the one for as the example. By the definition of , one has
For the first inequality above, we have used the simple bound . For the second inequality above, we have used the fact due to . Then, we repeat the argument as above for . The is fine due to the convergence of . The terms are exactly the same as above where one can simply control . Hence, is bounded by a constant that is independent of multiplying .
In , when , one may bound . Besides this, the remaining estimates for and are the same as for . We omit the details.
4.3 Formal results for higher dimensional case
One can generalize the analysis to higher dimensions. The orders of error and bias depend on the error for and . If one divides the region into cells, the random ordinates are independent (Due to symmetry, strictly there are independent ordinates, but there is no significant difference), is like , where . The variance on is like , where means the diameter of the -th cell. Similar analysis holds for . One can refer to the supplementary material for the details in the 1D case. Consequently, the bias scales like
The error scales like
For , a typical order of error for ROM is then , and the bias scales like so the order is . The order of bias also improves compared to DOM. Hence, ROM could improve accuracy.
For the X-Y geometry (i.e., ), if the required accuracy for the average density is , one has to use ordinates in DOM. Since all ordinates are coupled together in the RTE simulations, the computational complexity in the velocity variable is for the standard source iteration method with anisotropic scattering kernel. If one solves the large coupled linear system directly, the computational complexity is usually higher than . On the other hand, the bias of ROM is of order . If we repeat the simulation for times, the error of the Monte Carlo approximation is . The variance can be controlled by the square of the error, which is of order . Hence, the number of ordinates for each run could be so that and . Then, taking will make the average error . Since the complexity for each run is , the total complexity is thus . The total computational costs of DOM and ROM are comparable. If one chooses to use ordinates for ROM, then a typical single run could yield also error, but it suffers from the ray effect as well. Hence, choosing ordinates and repeat times could result in the same error, but the ray effect could be improved since the total number of velocity directions is . Moreover, ROM is easy to parallel.
5 Numerical experiments
The numerical performance of the ROM is presented in this section. The numerical convergence orders of errors and bias in both slab and X-Y geometries are displayed, while its ability to mitigate the ray effect is demonstrated through a lattice problem.
5.1 The slab geometry
To apply ROM in the slab geometry, we equally divide the velocity interval into cells. The th cell is denoted by , where . We consider only even , and when , one ordinate is sampled randomly from the cell with uniform probability. When , . The weights are for all .
All settings are the same as in Example 2.2 in Section 2. We use the same spatial discretizations and the number of spatial grids as for Example 2.2. The reference solution is obtained by the DOM using 2560 ordinates with uniform quadrature. We define the error between the reference solution and the numerical solutions of ROM with spatial cells by
(5.1) |
where . The bias of ROM is defined by
(5.2) |
1/8 | 1/4 | 1/2 | 1 | Order | ||
---|---|---|---|---|---|---|
error ( | Case 1 | 1.83E-02 | 4.99E-02 | 1.33E-01 | 3.10E-01 | 1.37 |
Case 2 | 9.37E-03 | 2.30E-02 | 5.76E-02 | 1.46E-01 | 1.32 | |
Case 3 | 2.86E-02 | 5.98E-02 | 1.20E-01 | 2.51E-01 | 1.04 | |
bias ( | Case 1 | 9.18E-05 | 6.51E-04 | 2.72E-03 | 2.17E-02 | 2.57 |
Case 2 | 4.48E-05 | 6.00E-04 | 2.73E-03 | 2.31E-02 | 2.92 | |
Case 3 | 1.07E-04 | 5.63E-04 | 2.43E-03 | 1.58E-02 | 2.37 |
Figure 5 displays the convergence orders of ROM in the slab geometry. The results of different numbers of sampled simulations are shown. To obtain the convergence order of the error, a smaller number of samples is enough. As observed from Figure 5, when the number of samples increases, the convergence order of bias will gradually increase and eventually stabilize near the theoretical value. For different cases as in Example 2.2 in Section 2, Table 2 displays the error and bias using different mesh sizes when there are enough samples.
When the inflow boundary conditions are smooth and regular as in Case 1 in (2.5), the convergence orders of DOM are , while the convergence orders of the error and bias of ROM are respectively and . Due to stochastic noise, this can be considered consistent with the analysis. When the boundary conditions are continuous but nondifferentiable as in Case 2 in (2.6), the convergence orders of the DOM decrease to , and the convergence orders of the error and bias of ROM remain and , respectively. Moreover, if the boundary conditions are discontinuous at some points as in Case 3 in (2.7), the convergence order of DOM decreases to . The convergence order of the error of ROM decreases to , while the bias has a convergence order of . In summary, the errors of ROM converge no slower than DOM, and the bias converges faster, especially when the solution regularity is low in the velocity variable.
5.2 The X-Y geometry case
In X-Y geometry, we show the ordinates in the first quadrant and consider uniform partition in the plane. and are equally divided, the nodes are denoted by for . Each quadrant has cells. For any , ,
Randomly sample and with uniform probability. The ordinates in other quadrant can be obtained by symmetry as in (3.5). The weights are chosen to be . The ordinates projected to the 2D unit disk are
Let’s take as an example, the partition and one sample of the chosen discrete ordinates are plotted in Figure 6.
5.2.1 Convergence order
Similar to section 2.2.2, we use the diamond difference method to discretize the spatial variable. The unknowns at the cell centers are calculated, and the notations are the same as in section 2.2.2.
We use the same setup and spatial grid as Example 2.1 in section 2. The numerical error and bias are similar to (5.1) and (5.2), expect that the norm is given as in (2.8). The reference solution is computed by 80400 ordinates by Gaussian quadrature.
Figure 7 and Table 3 show the convergence order of the error and bias. We can observe that the convergence order of the error is between and , while the convergence order of the bias is between and for both isotropic and anisotropic scattering kernels. This is due to the non-smoothness of the solution, which leads to a slight difference between the theoretical value and the actual result.
Figure 8 demonstrates the ability of ROM to mitigate the ray effect. The expectation of the average density calculated by ROM with different number of ordinates and different number of sampled simulations are displayed. We can observe that the ray effect is effectively mitigated after multiple parallel calculations and taking the expectations, even when the sampled simulations use a very small number of ordinates. The results with the anisotropic scattering are similar.
Order | ||||||
---|---|---|---|---|---|---|
error ( | iso: g=0 | 8.09E-03 | 1.25E-02 | 2.26E-02 | 6.28E-02 | 0.74 |
aniso: g=0.9 | 8.18E-03 | 1.26E-02 | 2.28E-02 | 6.36E-02 | 0.74 | |
bias ( | iso: g=0 | 1.12E-04 | 2.62E-04 | 6.44E-04 | 6.22E-03 | 1.44 |
aniso: g=0.9 | 1.09E-04 | 2.01E-04 | 3.95E-04 | 5.42E-03 | 1.41 |
5.2.2 The lattice problem
This example is to demonstrate that the ROM can mitigate the ray effect independent of the problem setup. A benchmark test is the lattice problem in X-Y plane. The spatial domain is and the cross sections are , . The layout of the source term is shown in Figure 9. The number of spatial cells is and Diamond difference method is employed for the spatial discretization. In Figure 10, the average densities calculated with , , , , and ordinates DOM are displayed. The ray effects can be visually seen even with 144 ordinate.
The results of ROM are shown in Figure 11, where columns 1 to 4 show the expectation of average density with 5, 10, 20 and 50 simulations, respectively. As can be seen from the results, the ray effect is invisible in the ROM results when 50 sampled simulations with 4 ordinates, 10 sampled simulations with 16 ordinates, and 5 sampled simulations with 36 ordinates.
6 Discussion
In the slab geometry, the discontinuities in velocity are usually at fixed velocity directions, thus the decrease in convergence order is not as severe as in the spatial 2D case. On the other hand, the spatial discontinuities or sharp transitions of the inflow boundary conditions and source terms can induce discontinuities in the velocity along the direction of the ray. Thus, the discontinuities in velocity depend on spatial variables for the spatial 2D case. As we can see from Table 1, the commonly used quadrature sets can only achieve a convergence order around , which is the main reason for the observed ray effect in spatial 2D test cases.
We remark that other strategies for the random ordinates are possible. For example, does not have to be chosen with uniform probability from . One may make use of the transition kernel for importance sampling of . If these strategies are adopted, the weights have to be adjusted correspondingly. One may consider multiscale total and scattering cross sections as well, then, the conservation of mass may become important. In our future work, we will investigate the extension of ROM to the multiscale cases, such as the diffusion limit or Fokker-Planck limit cases.
Acknowledgments
This work is partially supported by the National Key R&D Program of China No. 2020YFA0712000. The work of L. Li was partially supported by Shanghai Municipal Science and Technology Major Project 2021SHZDZX0102, NSFC 12371400 and 12031013, the Strategic Priority Research Program of Chinese Academy of Sciences, Grant No. XDA25010403.
References
- [1] A. D. Acosta, Inequalities for B-valued random vectors with applications to the strong law of large numbers, The Annals of Probability, 9 (1981), pp. 157–161, https://doi.org/10.1214/aop/1176994517, https://www.jstor.org/stable/2243188.
- [2] G. I. Bell and S. Glasstone, Nuclear reactor theory, Van Nostrand Reinhold Company, 1970, https://www.osti.gov/biblio/4074688.
- [3] K. Bhan and J. Spanier, Condensed history monte carlo methods for photon transport problems, Journal of computational physics, 225 (2007), pp. 1673–1694, https://doi.org/10.1016/j.jcp.2007.02.012, https://linkinghub.elsevier.com/retrieve/pii/S0021999107000770.
- [4] T. Camminady, M. Frank, K. Küpper, and J. Kusch, Ray effect mitigation for the discrete ordinates method through quadrature rotation, Journal of Computational Physics, 382 (2019), pp. 105–123, https://doi.org/10.1016/j.jcp.2019.01.016, https://linkinghub.elsevier.com/retrieve/pii/S0021999119300415.
- [5] Y. Cao, J. Lu, and L. Wang, Complexity of randomized algorithms for underdamped langevin dynamics, Communications in Mathematical Sciences, 19 (2021), pp. 1827–1853, https://doi.org/10.4310/cms.2021.v19.n7.a4, http://arxiv.org/abs/2003.09906.
- [6] B. G. Carlson, Solution of the transport equation by approximations, (1955), https://doi.org/10.2172/4376236, https://www.osti.gov/biblio/4376236.
- [7] M. K. Chu, A. D. Klose, and H. Dehghani, Light transport in soft tissue based on simplified spherical harmonics approximation to radiative transport equation, biomedical optics, (2008), https://doi.org/10.1364/BIOMED.2008.BSuE42, https://opg.optica.org/abstract.cfm?URI=BIOMED-2008-BSuE42.
- [8] J. M. C. Clark and R. J. Cameron, The maximum rate of convergence of discrete approximations for stochastic differential equations, Stochastic Differential Systems Filtering and Control. Lecture Notes in Control and Information Sciences, 25 (1980), https://doi.org/10.1007/BFb0004007, https://link.springer.com/chapter/10.1007/BFb0004007.
- [9] T. Daun, On the randomized solution of initial value problems, Journal of Complexity, 27 (2011), pp. 300–311, https://doi.org/10.1016/j.jco.2010.07.002, https://linkinghub.elsevier.com/retrieve/pii/S0885064X1000066X.
- [10] J. A. Fleck and J. D. Cummings, An implicit monte carlo scheme for calculating time and frequency dependent nonlinear radiation transport, Journal of Computational Physics, 8 (1971), pp. 313–342, https://doi.org/10.1016/0021-9991(71)90015-5, https://linkinghub.elsevier.com/retrieve/pii/0021999171900155.
- [11] C. K. Garrett and C. D. Hauck, A comparison of moment closures for linear kinetic transport equations: The line source benchmark, Transport Theory and Statistical Physics, 42 (2013), pp. 203–235, https://doi.org/10.1080/00411450.2014.910226, https://doi.org/10.1080/00411450.2014.910226.
- [12] F. Golse, P. L. Lions, B. Perthame, and R. Sentis, Regularity of the moments of the solution of a transport equation, Journal of Functional Analysis, 76 (1988), pp. 110–125, https://doi.org/10.1016/0022-1236(88)90051-1, https://www.sciencedirect.com/science/article/pii/0022123688900511.
- [13] S. Heinrich and B. Milla, The randomized complexity of initial value problems, Journal of Complexity, 24 (2008), pp. 77–88, https://doi.org/doi.org/10.1016/j.jco.2007.09.002, https://www.sciencedirect.com/science/article/pii/S0885064X07000957.
- [14] S. Jin, M. Tang, and H. Houde, A uniformly second order numerical method for the one-dimensional discrete-ordinate transport equation and its diffusion limit with interface, Networks Heterogeneous Media, 4 (2009), pp. 35–65, https://doi.org/10.3934/nhm.2009.4.35, http://aimsciences.org//article/doi/10.3934/nhm.2009.4.35.
- [15] J. Kevorkian and J. D. Cole, Perturbation methods in applied mathematics, Springer-Verlag Berlin Heidelberg, 1981, https://doi.org/10.1007/978-1-4757-4213-8, http://link.springer.com/10.1007/978-1-4757-4213-8.
- [16] H. K. Kim, M. Flexman, D. J. Yamashiro, J. J. Kandel, and A. H. Hielscher, PDE-constrained multispectral imaging of tissue chromophores with the equation of radiative transfer, Biomedical Optics Express, 1 (2010), pp. 812–824, https://doi.org/10.1364/BOE.1.000812, https://opg.optica.org/boe/abstract.cfm?uri=boe-1-3-812.
- [17] E. W. larsen and J. E. Morel, Asymptotic solutions of numerical transport problems in optically thick, diffusive regimes II, Journal of Computational Physics, 83 (1989), pp. 212–236, https://doi.org/10.1016/0021-9991(89)90229-5, https://linkinghub.elsevier.com/retrieve/pii/0021999189902295.
- [18] E. W. Larsen and J. E. Morel, Advances in Discrete-Ordinates Methodology, Springer Netherlands, Dordrecht, 2010, pp. 1–84, https://doi.org/10.1007/978-90-481-3411-3_1, https://doi.org/10.1007/978-90-481-3411-3_1.
- [19] K. D. Lathrop, Ray effects in discrete ordinates equations, Nuclear Science and Engineering, 32 (1968), pp. 357–369, https://doi.org/10.13182/NSE68-4, https://www.tandfonline.com/doi/full/10.13182/NSE68-4.
- [20] P. D. Lax, Functional analysis, John Wiley & Sons, Incorporated, 2007.
- [21] E. Lewis and W. Miller, Computational methods of neutron transport, John Wiley and Sons, Incorporated, 1984.
- [22] R. B. Lowrie, J. E. Morel, and J. A. Hittinger, The coupling of radiation and hydrodynamics, Astrophysical Journal, 521 (1999), pp. 432–450, https://doi.org/10.1086/307515, https://iopscience.iop.org/article/10.1086/307515/meta.
- [23] M. M. Marinak, G. D. Kerbel, N. A. Gentile, O. Jones, D. Munro, S. Pollaine, T. R. Dittrich, and S. W. Haan, Three-dimensional hydra simulations of national ignition facility targets, Physics of Plasmas, 8 (2001), pp. 2275–2280, https://doi.org/10.1063/1.1356740, https://pubs.aip.org/pop/article/8/5/2275/859677/Three-dimensional-HYDRA-simulations-of-National.
- [24] F. Martin, K. Jonas, C. Thomas, and C. D. Hauck, Ray effect mitigation for the discrete ordinates method using artificial scattering, Nuclear Science and Engineering, 194 (2020), pp. 971–988, https://doi.org/10.1080/00295639.2020.1730665, https://www.tandfonline.com/doi/full/10.1080/00295639.2020.1730665.
- [25] M. K. Matzen, M. A. Sweeney, R. G. Adams, J. R. Asay, and E. Al, Pulsed-power-driven high energy density physics and inertial confinement fusion research, Physics of Plasmas, 12 (2005), p. 055503, https://doi.org/10.1063/1.1891746, https://pubs.aip.org/pop/article/12/5/055503/1015377/Pulsed-power-driven-high-energy-density-physics.
- [26] R. G. Mcclarren and R. B. Lowrie, Manufactured solutions for the radiation-hydrodynamics equations, Journal of Quantitative Spectroscopy & Radiative Transfer, 109 (2008), pp. 2590–2602, https://doi.org/10.1016/j.jqsrt.2008.06.003, https://linkinghub.elsevier.com/retrieve/pii/S0022407308001428.
- [27] W. F. Miller Jr. and W. H. Reed, Ray-effect mitigation methods for two-dimensional neutron transport theory, Nuclear Science and Engineering, 62 (1977), pp. 391–411, https://doi.org/10.13182/NSE62-391, https://www.tandfonline.com/doi/abs/10.13182/NSE01-48.
- [28] J. E. Morel, T. A. Wareing, R. B. Lowrie, and D. K. Parsons, Analysis of ray-effect mitigation techniques, Nuclear Science and Engineering, 144 (2003), pp. 1–22, https://doi.org/10.13182/NSE01-48, https://www.tandfonline.com/doi/abs/10.13182/NSE01-48.
- [29] E. Novak, Deterministic and stochastic error bounds in numerical analysis, lecture notes in mathematics, Springer Berlin, 1 ed., 1 1988, https://doi.org/10.1007/BFb0079792, https://link.springer.com/book/10.1007/BFb0079792.
- [30] H. P. Rosenthal, On the subspaces of spanned by sequences of independent random variables, Israel J. Math., (1970), pp. 273–303, https://doi.org/10.1007/BF02771562, http://link.springer.com/10.1007/BF02771562.
- [31] G. Stengle, Numerical methods for systems with measurable coefficients, Applied Mathematics Letters, 3 (1990), pp. 25–29, https://doi.org/10.1016/0893-9659(90)90040-I, https://www.sciencedirect.com/science/article/pii/089396599090040I.
- [32] G. Stengle, Error analysis of a randomized numerical method, Numerische Mathematik, 70 (1995), pp. 119–128, https://doi.org/10.1007/s002110050113, https://link.springer.com/article/10.1007/s002110050113.
- [33] J. Tencer, Ray effect mitigation through reference frame rotation, Journal of Heat Transfer, 138 (2016), p. 112701, https://doi.org/10.1115/1.4033699, https://asmedigitalcollection.asme.org/heattransfer/article/doi/10.1115/1.4033699/384240/Ray-Effect-Mitigation-Through-Reference-Frame.
- [34] J. A. Tropp, The expected norm of a sum of independent random matrices: An elementary approach, in High Dimensional Probability VII, Cham, 2016, Springer International Publishing, pp. 173–202, https://doi.org/doi.org/10.1007/978-3-319-40519-3_8, https://link.springer.com/chapter/10.1007/978-3-319-40519-3_8.
Appendix A Supplementary Materials: Estimates for and
Recall that , and that the operator maps to by solving
(A.1) | ||||
Recall that we have divide the velocity space into cells and the each random ordinates , is chosen randomly from the th cell. The other half and the random ordinate are chosen in asymmetric fashion. The weight and . The operator is defined as
Let then
Therefore, from the definition of and ,
(A.2) |
Here, we note that and are not independent. This is why we put and together.
For the truncated system, we consider . The equation is changed to
We divide and into subintervals respectively and perform the ROM for the approximation. Then, the weight is changed to . The operators , , , and are changed accordingly.
Lemma A.1.
The operator satisfies that
(A.3) |
and for that
(A.4) |
Consequently, , , and it holds for the truncated system that
(A.5) |
Proof A.2.
We only consider here, as the case for is similar. Multiplying in (A.1) and integrating, one has
Hence,
which implies that .
Taking the derivative of both sides of (A.1) with respect to
Hence, one has
By (A.3), one then has
The inequality (A.4) holds.
Since and are the averages of , the second claims are straightforward. For the truncated system, one has
then
The claim then follows.
Next, our main goal is to estimate where is given in (A.2).
If the independent random variables take values in a Hilbert space, some classical concentration inequalities like the Rosenthal inequality [30] can be used to achieve this. However, here the variables are operators over Hilbert spaces so they are in a Banach algebra. One cannot apply the classical Rosenthal inequality directly. Our goal in this section is then to establish a Rosenthal type inequality for these operator-valued random variables.
First, we observe the following
The first term on the right-hand side can be estimated by the following fact.
Lemma A.3.
For any separable Banach space and any finite sequence of independent -valued random vectors , with . Let . Then,
This result is a special case of [1, Theorem 2.1], where the general case is considered. The proof is a consequence of the Birkholder inequality for martingales. Here, we sketch the proof for the special case here. Consider the filtration where
and . Define , where . Then, one has
By the definition, one has
Here is an independent copy of . This then gives that .
Hence, the problem is reduced to estimation of . We will mainly make use the approach in [34]. The analysis in [34] is for matrices and the final result relies on the dimension of the space. In our case, the operator is in an infinite dimensional space. We find that the dependence on the dimension is due to the trace of the operators. Fortunately, for our case, the operator is compact and we could possibly bound the trace.
Firstly, we introduce the Mercer’s theorem about trace class mentioned in [20, Sec. 30.5, Theorem 11].
Lemma A.4.
Consider an integral operator of the form
acting on , where is a real-valued symmetric, continuous function of and is a continuous positive weight. Then, the operator is positive in the usual sense:
and is of trace class with the trace equal to the integral of its kernel along the diagonal:
The result in [20, Sec. 30.5, Theorem 11] is about the uniform weight . It is not hard to check that the proof holds for general continuous positive weight on . Based on the above lemma, we naturally deduce the following proposition.
Proposition A.5.
For any , is a compact operator. is the adjoint operator of , then and are in the trace class. Moreover,
Proof A.6.
According to [20], an integral operator with a square integrable kernel is compact. The first claim follows directly by the solution in expanded form.
We only consider and the proof for is similar. By the definition of adjoint operators, , one finds that
Then, one has
Similarly,
Hence,
(A.6a) | ||||
(A.6b) |
According to these two formulas, it is clear that both kernels are continuous in .
We adopt the argument in [34] to our case for .
First, define the symmetrization of the operators. Because is not a self-adjoint operator, we let be the symmetrization operator of , given by
where
Clearly, each are operators in . The symmetrization has a good property that is
(A.8) |
In fact, since is a self-adjoint operator, and
The other relation follows from the same argument. Hence, it reduces to estimate .
Next, we consider the Rademacher symmetrization:
where are independent Rademacher random variables (i.e., indepedent Bernoulli variables taking values in ). Introduction of brings extra randomness to utilize the independence. The following is well-known (see [34, Fact 3.1]), which indicates that introduction of would not lose control of the random variables.
Lemma A.7.
The Rademacher symmetrization satisfies
where means that the expectation is with respect to all randomness, including those in Rademacher variables and random ordinates.
The following estimate gives the control by taking the expectation over , following the approach in [34].
Lemma A.8.
It holds that
where means the expectation over the Rademacher variables.
Proof A.9.
Note that is self-adjoint, and is nonnegative and compact. Hence, for each nonnegative integer so that
The above holds because the norm of is the largest eigenvalue while the trace is the sum of all eigenvalues. For each index , define the random operators
Denote . Then, it holds that
The second line is due to the equality
and . The third line is due to the following Geometric Mean-Arithmetic Mean (GM-AM) trace inequality in [34, Fact 2.4]
(A.9) |
which can be generalized to self-adjoint compact operators in trace class. Here, are integers and , and are two arbitrary self-adjoint compact operators in trace class. In fact, one can approximate the compact operators using finite rank self-adjoint operators, and the finite rank operators (essentially matrices) satisfy (A.9). Passing the limit for the finite rank approximation then verifies the inequality for self-adjoint compact operators. The last line follows since .
Theorem A.10.
Proof A.11.
By Lemma A.7, one finds that
We conclude the following.
Corollary A.12 (Bounds of ).
It holds for the truncated system that
where is a constant that is independent of .
Proof A.13.
In (A.11), we note that
where and is similarly defined. The inequality above holds because . Moreover, using the simple control,
we conclude that by Lemma A.5 for the truncated system that
Moreover, by Lemma A.1 and noting ,
The claim then follows.
Next, we move to the estimation of , which is much easier than since it is in the Hilbert space . Here, we prove the lemma without truncation (the truncated version is clearly correct if the un-truncated one holds).
Lemma A.14.
Suppose that is Lipschitz on and is Lipschitz on . Then, it holds that
where is a constant that is independent of .
Proof A.15.
By the Hölder inequality
Define . Then it holds that
(A.12) | ||||
Above, the summation can be moved out of the norm because the expectation of the cross terms are zero, i.e.,
for . The last inequality is due to the Hölder inequality.
Using the explicit formula of , we will show now for and that
(A.13) |
Below, we consider only ( is similar). Recall the boundary propagator for :
Clearly, for , and , one has and thus
where is the Lipschitz constant of .
Let . Note the simple fact
If , since is bounded, for some constant . Since , one has
If , then is bounded. Using the simple bound , one has
The second inequality here follows by the substitution and the fact
Hence, (A.13) is proved.