
Random ordinate method for mitigating the ray effect in radiative transport equation simulations

Lei Li (School of Mathematical Sciences, Institute of Natural Sciences, MOE-LSC, Shanghai Jiao Tong University, Shanghai, P.R. China), Min Tang (School of Mathematical Sciences, Institute of Natural Sciences and MOE-LSC, Shanghai Jiao Tong University, Shanghai, P.R. China), Yuqi Yang (School of Mathematical Sciences, Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai, P.R. China)
Abstract

The Discrete Ordinates Method (DOM) is the most widely used velocity discretization method for simulating the radiative transport equation. The ray effect stands as a long-standing drawback of DOM. In benchmark tests displaying the ray effect, we observe low regularity in velocity within the solution. To address this issue, we propose a random ordinate method (ROM) to mitigate the ray effect. Compared with other strategies proposed in the literature for mitigating the ray effect, ROM offers several advantages: 1) the computational cost is comparable to DOM; 2) it is simple and requires minimal changes to existing code based on DOM; 3) it is easily parallelizable and independent of the problem setup. Analytical results are presented for the convergence orders of the error and bias, and numerical tests demonstrate its effectiveness in mitigating the ray effect.

keywords:
Random ordinate method, ray effect, discrete ordinate method, radiative transport equation.

1 Introduction

The radiative transport equation (RTE) stands as a fundamental equation governing the evolution of angular flux as particles traverse through a material medium. It provides a statistical description of the density distribution of particles. The RTE has found extensive applications across diverse fields, including astrophysics [22, 26], fusion [25, 23], biomedical optics [16, 7], and biology, among others.

The steady state RTE with anisotropic scattering reads

$$\boldsymbol{u}\cdot\nabla\psi(\boldsymbol{z},\boldsymbol{u})+\sigma_{T}(\boldsymbol{z})\psi(\boldsymbol{z},\boldsymbol{u})=\sigma_{S}(\boldsymbol{z})\frac{1}{|S|}\int_{S}P(\boldsymbol{u}',\boldsymbol{u})\psi(\boldsymbol{z},\boldsymbol{u}')\,\mathrm{d}\boldsymbol{u}'+q(\boldsymbol{z}), \tag{1.1}$$

subject to the following inflow boundary conditions:

$$\psi(\boldsymbol{z},\boldsymbol{u})=\psi_{\Gamma}^{-}(\boldsymbol{z},\boldsymbol{u}),\quad\boldsymbol{z}\in\Gamma^{-}=\partial\Omega,\quad\boldsymbol{u}\cdot\boldsymbol{n}_{\boldsymbol{z}}<0. \tag{1.2}$$

Here, $\boldsymbol{z}\in\Omega\subset\mathbb{R}^{3}$ represents the spatial variable; $\boldsymbol{u}$ denotes the direction of particle movement, and $S=\{\boldsymbol{u}\mid\boldsymbol{u}\in\mathbb{R}^{3},|\boldsymbol{u}|=1\}$ is the unit sphere; $\boldsymbol{n}_{\boldsymbol{z}}$ stands for the outward normal vector at position $\boldsymbol{z}$. The function $\psi(\boldsymbol{z},\boldsymbol{u})$ gives the density of particles moving in the direction $\boldsymbol{u}$ at position $\boldsymbol{z}$. The coefficients $\sigma_{T}(\boldsymbol{z})$, $\sigma_{S}(\boldsymbol{z})$, and $q(\boldsymbol{z})$ represent the total cross-section, the scattering cross-section, and the source term, respectively. For physically meaningful situations, $\sigma_{T}(\boldsymbol{z})>\sigma_{S}(\boldsymbol{z})$ for all $\boldsymbol{z}\in\Omega$.
The kernel $k(\boldsymbol{u}',\boldsymbol{u}):=\frac{1}{|S|}P(\boldsymbol{u}',\boldsymbol{u})$ is the scattering kernel, which provides the probability that particles moving in the direction $\boldsymbol{u}'$ scatter to the direction $\boldsymbol{u}$. For notational convenience, we use the symbol $\fint_{S}:=\frac{1}{|S|}\int_{S}$ to denote the average over the domain $S$ associated with the indicated measure. Then, the scaled scattering kernel $P$ satisfies:

$$P(\boldsymbol{u}',\boldsymbol{u})=P(\boldsymbol{u},\boldsymbol{u}'),\qquad\fint_{S}P(\boldsymbol{u}',\boldsymbol{u})\,\mathrm{d}\boldsymbol{u}=\frac{1}{|S|}\int_{S}P(\boldsymbol{u}',\boldsymbol{u})\,\mathrm{d}\boldsymbol{u}=1.$$
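The two kernel properties above (symmetry and unit average over $S$) can be checked numerically for a concrete kernel. The sketch below uses the Henyey-Greenstein kernel purely as an assumed example; it is not taken from the text, and the function names and the value $g=0.5$ are hypothetical. Because this kernel depends only on $\boldsymbol{u}'\cdot\boldsymbol{u}$, it is symmetric by construction, and its average over the sphere reduces to $\frac{1}{2}\int_{-1}^{1}$ over the cosine of the scattering angle.

```python
import numpy as np

def hg_kernel(cos_theta, g):
    """Henyey-Greenstein kernel, scaled so that its average over the sphere is 1.

    Assumed example kernel; g in (-1, 1) is the anisotropy parameter.
    """
    return (1.0 - g**2) / (1.0 + g**2 - 2.0 * g * cos_theta) ** 1.5

def sphere_average(g, n_nodes=64):
    """Average of the kernel over the unit sphere.

    For a kernel that depends only on the cosine of the scattering angle,
    the sphere average equals (1/2) * integral over [-1, 1] of that cosine.
    """
    nodes, weights = np.polynomial.legendre.leggauss(n_nodes)
    return 0.5 * np.sum(weights * hg_kernel(nodes, g))

avg = sphere_average(0.5)
print(f"sphere average of the scaled kernel: {avg:.12f}")
```

Setting `g = 0` recovers the isotropic case $P\equiv 1$.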

The numerical methods for solving the RTE are mainly divided into two categories: particle methods and PDE-based methods. Particle methods, specifically Monte Carlo (MC) methods, simulate the trajectories of numerous particles and collect the density distribution of all particles in the phase space to obtain the RTE solution. The MC methods are known to be slow and noisy but are easy to parallelize and suitable for all geometries [10, 3]. Meanwhile, the PDE-based methods are more accurate and can be faster, but they are not as flexible as the MC method for parallel computation and complex geometries [21]. In this paper, we are interested in the PDE-based method.

The Discrete Ordinates Method (DOM) [2, 6] is the most popular velocity discretization method. DOM approximates the solution to (1.1)-(1.2) using a set of discrete velocity directions $\boldsymbol{u}_{m}$, which are referred to as ordinates. The integral term on the right-hand side of Equation (1.1) is represented by weighted summations over the discrete velocities. DOM retains the positivity of the angular flux and facilitates the determination of boundary conditions. However, solving the RTE using the standard DOM is expensive due to its high dimensionality, since the RTE has three spatial variables and two velocity direction variables.

In real applications, people are usually interested in the spatial distributions of some macroscopic quantities, such as the particle density $\int_{S}\psi(\boldsymbol{z},\boldsymbol{u})\,\mathrm{d}\boldsymbol{u}$, the momentum $\int_{S}\boldsymbol{u}\psi(\boldsymbol{z},\boldsymbol{u})\,\mathrm{d}\boldsymbol{u}$, etc. Thus, the spatial resolution must be adequate. The computational costs can be significantly reduced if one can obtain the right macroscopic quantities by using a small number of ordinates in DOM.

One natural question arises: Can we improve the accuracy of macroscopic quantities without increasing the number of ordinates? Considerable effort has been invested in discovering a quadrature set with high-order convergence. For the 1D velocity in slab geometry, Gaussian quadrature exists, exhibiting spectral convergence when the solution is sufficiently smooth in velocity. However, how to devise a spectrally convergent two- or three-dimensional Gaussian quadrature remains unclear. Furthermore, in Section 2, we highlight, through numerical tests, that the solution’s regularity in the velocity direction can be considerably low in some benchmark tests. Consequently, it remains uncertain whether better approximations can be expected, even if a two- or three-dimensional Gaussian quadrature with high convergence order for smooth solutions is identified.

When employing a limited number of ordinates in DOM, the ray effect becomes noticeable in numerous 2D spatial benchmark tests [11, 19]. The macroscopic particle density exhibits nonphysical oscillations along the ray paths, particularly noticeable when the inflow boundary conditions or radiation sources demonstrate strong spatial variations or discontinuities. The ray effect stems from particles being confined to move in a limited number of directions. As highlighted in section 2, in benchmark tests displaying the ray effect, we observe low regularity in velocity within the solution, indicating a low convergence order for DOM. In order to mitigate the ray effect, one has to increase the number of ordinates [27], which will significantly increase the computational cost.

The ray effect stands as a long-standing drawback in DOM simulations, and several strategies have been proposed to mitigate it at reasonable cost. Examples include the approaches discussed in [4, 24, 28, 33]. The main idea is to use biased or rotated quadratures or to combine the spectral method with DOM. In this paper, inspired by the randomized integration method [29], we introduce a random ordinate method (ROM) for solving the RTE. Compared with other ray effect mitigating strategies proposed in the literature, the advantages of ROM are: 1) the computational cost is comparable to DOM; 2) it is simple and requires almost no change to existing code based on DOM; 3) it is easy to parallelize and independent of the problem setup.

Randomized algorithms can achieve higher convergence order when the solution regularity is low [29]. The concept of the randomized method is straightforward: when approximating $\int_{0}^{1}f(x)\,dx$, the integral interval $[0,1]$ is partitioned into cells with a maximum size of $h$. Subsequently, a point $x_{\ell}$ is chosen from each cell, and $\int_{0}^{1}f(x)\,dx$ is approximated by $\sum_{\ell=1}^{n}\omega_{\ell}f(x_{\ell})$, where $\omega_{\ell}$ represents the quadrature weights. With a fixed set of $\{x_{\ell},\omega_{\ell}\}$, one can expect uniform first-order convergence for general $f(x)$ that is Lipschitz continuous.
On the other hand, when one randomly chooses a point $x_{\ell}$ inside each cell and keeps using the same $\omega_{\ell}$, the expected error can achieve $O(h^{\frac{3}{2}})$ convergence, whereas the expectation of all randomly chosen quadratures provides $O(h^{3})$ convergence. Randomized algorithms have been developed and analyzed for simple problems, such as the initial value problem of ordinary differential equation (ODE) systems [32, 31], whose complexity has been studied in [13, 9], and for stochastic differential equations [8], whose complexity is analyzed in [5].
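The difference between a fixed and a randomized quadrature on $[0,1]$ can be explored with a small experiment. The following is a minimal sketch under stated assumptions: the test function $f(x)=|x-1/3|$ (Lipschitz continuous, with exact integral $5/18$), the uniform cells, the weights $\omega_{\ell}=h$, and all function names are illustrative choices, not taken from the text. It reports the error of the fixed midpoint rule, the root-mean-square error of a single randomized run, and the deviation of the sample mean over many runs.

```python
import numpy as np

EXACT = 5.0 / 18.0  # integral of |x - 1/3| over [0, 1]

def f(x):
    """Lipschitz continuous test function with a kink at x = 1/3 (assumed example)."""
    return np.abs(x - 1.0 / 3.0)

def fixed_rule(n):
    """Fixed quadrature: the midpoint of each of n uniform cells, weight h = 1/n."""
    h = 1.0 / n
    x = (np.arange(n) + 0.5) * h
    return h * np.sum(f(x))

def randomized_rule(n, rng):
    """Randomized quadrature: one uniformly drawn point per cell, same weights h."""
    h = 1.0 / n
    x = (np.arange(n) + rng.random(n)) * h
    return h * np.sum(f(x))

def rms_error_and_bias(n, runs, rng):
    """RMS error of a single randomized run and error of the mean over all runs."""
    values = np.array([randomized_rule(n, rng) for _ in range(runs)])
    rms = np.sqrt(np.mean((values - EXACT) ** 2))
    bias = abs(values.mean() - EXACT)
    return rms, bias

rng = np.random.default_rng(0)
for n in (16, 64):
    rms, bias = rms_error_and_bias(n, 2000, rng)
    print(f"n={n:3d}  fixed error={abs(fixed_rule(n) - EXACT):.2e}  "
          f"single-run RMS={rms:.2e}  |mean - exact|={bias:.2e}")
```

The averaged result is typically much closer to the exact value than a single randomized run, which is the property the averaging over runs relies on.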

The main idea of ROM is that the velocity space is partitioned into $n$ cells, and a random ordinate is selected from each cell. A DOM system with those randomly chosen ordinates is then solved. The expected values of the ROM solutions can achieve a higher convergence order in velocity space. Both theoretical and numerical results indicate that, even with carefully chosen quadratures, DOM can only achieve a convergence order similar to that of ROM when the solution regularity is low. Consequently, the accuracy does not decrease for a single run. Moreover, averaging multiple runs can lead to higher-order convergence and mitigate the ray effect. Since different runs employ distinct ordinates, ROM allows for easy parallel computation. Therefore, one can mitigate the ray effect by running many samples with a few ordinates each and then averaging them.

This paper is structured as follows: Section 2 delves into the ray effects of DOM and illustrates the low regularity of the solution in velocity space. Details of the ROM are described in Section 3. In Section 4, analytical results are presented for the convergence orders of the error and bias when ROM is applied to RTE with isotropic scattering in slab geometry. Section 5 displays the numerical performance of the ROM, demonstrating its ability to mitigate the ray effect. Finally, Section 6 concludes the paper with discussions.

2 Ray effects and low regularity

2.1 Discrete Ordinate Method and Ray Effects

The DOM is the most popular angular discretization for RTE simulations; it reads [18]:

$$\boldsymbol{u}_{\ell}\cdot\nabla\psi_{\ell}(\boldsymbol{z})+\sigma_{T}(\boldsymbol{z})\psi_{\ell}(\boldsymbol{z})=\sigma_{S}(\boldsymbol{z})\sum_{\ell'\in V}\omega_{\ell'}P_{\ell,\ell'}\psi_{\ell'}(\boldsymbol{z})+q_{\ell}(\boldsymbol{z}),\quad\ell\in V,$$

subject to the following inflow boundary conditions:

$$\psi(\boldsymbol{z},\boldsymbol{u}_{\ell})=\psi_{\ell}^{-}(\boldsymbol{z},\boldsymbol{u}),\quad\boldsymbol{z}\in\Gamma^{-}=\partial\Omega,\quad\boldsymbol{u}_{\ell}\cdot\boldsymbol{n}_{\boldsymbol{z}}<0,$$

where $\omega_{\ell}$ is the weight of the quadrature node $\boldsymbol{u}_{\ell}$, satisfying $\sum_{\ell\in V}\omega_{\ell}=1$; $V$ represents the index set of the quadrature $\{\boldsymbol{u}_{\ell},\omega_{\ell}\}$; $P_{\ell,\ell'}\approx P(\boldsymbol{u}_{\ell},\boldsymbol{u}_{\ell'})$ denotes the discrete scattering kernel; and $q_{\ell}(\boldsymbol{z})\approx q(\boldsymbol{z},\boldsymbol{u}_{\ell})$. Then $\psi_{\ell}(\boldsymbol{z})\approx\psi(\boldsymbol{z},\boldsymbol{u}_{\ell})$ and, for all $\ell\in V$,

$$\sum_{\ell'\in V}\omega_{\ell'}P_{\ell,\ell'}\psi_{\ell'}(\boldsymbol{z})\approx\fint_{S}P(\boldsymbol{u}_{\ell},\boldsymbol{u}_{\ell'})\psi(\boldsymbol{z},\boldsymbol{u}_{\ell'})\,\mathrm{d}\boldsymbol{u}_{\ell'}.$$

In slab geometry, where $S=[-1,1]$ and $\Omega=[x_{L},x_{R}]$, the RTE reads, for $(\mu,x)\in[-1,1]\times[x_{L},x_{R}]$:

$$\mu\partial_{x}\psi(x,\mu)+\sigma_{T}(x)\psi(x,\mu)=\sigma_{S}(x)\frac{1}{2}\int_{-1}^{1}P(\mu',\mu)\psi(x,\mu')\,d\mu'+q(x),$$

subject to the boundary conditions:

$$\psi(x_{L},\mu)=\psi_{L}(\mu),\quad\mu>0;\qquad\psi(x_{R},\mu)=\psi_{R}(\mu),\quad\mu<0. \tag{2.1}$$

The DOM in slab geometry takes $V=\{-M,-M+1,\cdots,-1,1,\cdots,M-1,M\}$, where $M$ is a positive integer. The discrete ordinates $\mu_{\ell}$ ($\ell\in V$) satisfy:

$$0<\mu_{1}<\cdots<\mu_{M-1}<\mu_{M}<1,\qquad\mu_{-\ell}=-\mu_{\ell}.$$

Therefore, the DOM in slab geometry becomes:

$$\Big(\mu_{\ell}\partial_{x}+\sigma_{T}(x)\Big)\psi_{\ell}(x)=\sigma_{S}(x)\sum_{\ell'\in V}\omega_{\ell'}P_{\ell,\ell'}\psi_{\ell'}(x)+q_{\ell}(x), \tag{2.2}$$

subject to the boundary conditions:

$$\psi_{\ell}(x_{L})=\psi_{L}(\mu_{\ell}),\quad\mu_{\ell}>0;\quad\psi_{\ell}(x_{R})=\psi_{R}(\mu_{\ell}),\quad\mu_{\ell}<0.$$
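The slab system (2.2) with its inflow boundary conditions can be sketched numerically as follows. This is a minimal sketch under stated assumptions, not the solver used in the paper: it assumes isotropic scattering ($P_{\ell,\ell'}=1$), a first-order upwind difference in $x$ on a uniform grid, plain source iteration, and Gauss-Legendre ordinates with weights rescaled to sum to 1. The function name `solve_dom_slab` and all parameter values are hypothetical.

```python
import numpy as np

def solve_dom_slab(mu, w, x, sigma_t, sigma_s, q, psi_left, psi_right,
                   tol=1e-12, max_iter=1000):
    """Source iteration for the slab DOM system with isotropic scattering (assumed).

    mu, w               : ordinates (none equal to 0) and weights summing to 1
    x                   : uniform grid nodes on [x_L, x_R]
    sigma_t, sigma_s, q : nodal values of the coefficients and the source
    psi_left, psi_right : callables giving inflow data at x_L (mu > 0), x_R (mu < 0)
    Returns the angular fluxes psi[l, i] and the density phi[i] = sum_l w[l] psi[l, i].
    """
    dx = x[1] - x[0]
    n = len(x)
    psi = np.zeros((len(mu), n))
    phi = np.zeros(n)
    for _ in range(max_iter):
        source = sigma_s * phi + q          # scattering term lagged by one iteration
        for l, m in enumerate(mu):
            a = abs(m) / dx
            if m > 0:                       # sweep left to right from the inflow at x_L
                psi[l, 0] = psi_left(m)
                for i in range(1, n):
                    psi[l, i] = (a * psi[l, i - 1] + source[i]) / (a + sigma_t[i])
            else:                           # sweep right to left from the inflow at x_R
                psi[l, -1] = psi_right(m)
                for i in range(n - 2, -1, -1):
                    psi[l, i] = (a * psi[l, i + 1] + source[i]) / (a + sigma_t[i])
        phi_new = w @ psi
        if np.max(np.abs(phi_new - phi)) < tol:
            return psi, phi_new
        phi = phi_new
    return psi, phi

# Example: constant coefficients with inflow data equal to q / (sigma_T - sigma_S),
# for which the constant state is an exact solution of the discrete system.
M = 4
mu, w = np.polynomial.legendre.leggauss(2 * M)
w = w / w.sum()                             # rescale so the weights sum to 1
x = np.linspace(0.0, 1.0, 51)
sigma_t = np.full_like(x, 2.0)
sigma_s = np.full_like(x, 1.0)
q = np.ones_like(x)
psi, phi = solve_dom_slab(mu, w, x, sigma_t, sigma_s, q,
                          lambda m: 1.0, lambda m: 1.0)
print("max deviation of the density from 1:", np.max(np.abs(phi - 1.0)))
```

Source iteration is chosen here only for brevity; it converges at a rate bounded by $\sigma_{S}/\sigma_{T}$, so it becomes slow when scattering dominates.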

Two commonly used quadratures are the uniform quadrature and the Gaussian quadrature. For the uniform quadrature, $[-1,1]$ is divided into $2M$ equally spaced cells, each of size $\Delta\mu=1/M$. The values $\{\mu_{\ell}\mid\ell\in V\}$ are the midpoints of the cells, i.e., $\mu_{\ell}=\frac{2\ell-1}{2M}$ for $\ell>0$ and $\mu_{\ell}=\frac{2\ell+1}{2M}$ for $\ell<0$, while $\omega_{\ell}=1/M$. For the Gaussian quadrature, $\{\mu_{\ell}\mid\ell\in V\}$ consists of the $2M$ distinct roots of the Legendre polynomial of degree $2M$, denoted by $L_{2M}(x)$, and the weights are $\omega_{\ell}=2/[(1-\mu_{\ell}^{2})(L_{2M}'(\mu_{\ell}))^{2}]$.
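Both slab quadratures can be generated in a few lines. The sketch below is illustrative only: the function names are hypothetical, and the returned weights are halved relative to the formulas in the text so that they sum to 1, matching the normalization $\sum_{\ell\in V}\omega_{\ell}=1$ stated earlier (the unscaled values $\omega_{\ell}=1/M$ and the Gaussian weight formula sum to 2). The last function evaluates the Gaussian weight formula directly so that it can be compared with the weights returned by NumPy.

```python
import numpy as np
from numpy.polynomial import legendre

def uniform_quadrature(M):
    """Midpoints of 2M equal cells of [-1, 1]; weights rescaled to sum to 1."""
    cell = 1.0 / M                                  # cell size, Delta mu = 1/M
    mu = -1.0 + (np.arange(2 * M) + 0.5) * cell     # cell midpoints, ascending
    w = np.full(2 * M, cell / 2.0)
    return mu, w

def gaussian_quadrature(M):
    """Roots of the degree-2M Legendre polynomial; weights rescaled to sum to 1."""
    mu, w = legendre.leggauss(2 * M)                # NumPy weights sum to 2
    return mu, w / 2.0

def gaussian_weights_from_formula(mu, M):
    """Unscaled weights 2 / [(1 - mu^2) (L'_{2M}(mu))^2] at the given nodes."""
    dL = legendre.Legendre.basis(2 * M).deriv()
    return 2.0 / ((1.0 - mu**2) * dL(mu) ** 2)

M = 4
mu_u, w_u = uniform_quadrature(M)
mu_g, w_g = gaussian_quadrature(M)
# Both sets approximate the average of mu^2 over [-1, 1], whose exact value is 1/3.
print("uniform :", np.sum(w_u * mu_u**2))
print("Gaussian:", np.sum(w_g * mu_g**2))
```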

For the RTE in X-Y geometry, where the 3D velocity on the unit sphere is projected onto a 2D disk, suppose that the DOM has $M$ points in each quadrant. Then, the 2D discrete velocity directions defined on the disk are $\boldsymbol{u}_{\ell}=(c_{\ell},s_{\ell})$ for $\ell\in\bar{V}=\{1,2,\cdots,M,\cdots,4M\}$, where $M$ is a positive integer. The DOM in X-Y geometry is expressed as follows, for $\ell\in\bar{V}$ and $(x,y)\in[x_{L},x_{R}]\times[y_{B},y_{T}]$:

$$\Big(c_{\ell}\partial_{x}+s_{\ell}\partial_{y}+\sigma_{T}(x,y)\Big)\psi_{\ell}(x,y)=\sigma_{S}(x,y)\sum_{\ell'\in\bar{V}}\bar{\omega}_{\ell'}P_{\ell,\ell'}\psi_{\ell'}(x,y)+q_{\ell}(x,y),$$

where

$$(c_{\ell},s_{\ell})=\Big(\left(1-\zeta_{\ell}^{2}\right)^{\frac{1}{2}}\cos\theta_{\ell},\left(1-\zeta_{\ell}^{2}\right)^{\frac{1}{2}}\sin\theta_{\ell}\Big),\quad\mbox{with }\zeta_{\ell}\in(0,1),\ \theta_{\ell}\in(0,2\pi).$$

The boundary conditions become

\[
\left\{\begin{array}{llll}
\psi_{\ell}(x_{L},y)=\psi_{L,\ell}(y), & c_{\ell}>0; & \psi_{\ell}(x_{R},y)=\psi_{R,\ell}(y), & c_{\ell}<0;\\
\psi_{\ell}(x,y_{B})=\psi_{B,\ell}(x), & s_{\ell}>0; & \psi_{\ell}(x,y_{T})=\psi_{T,\ell}(x), & s_{\ell}<0.
\end{array}\right.
\]

Two kinds of quadratures discussed in [21] are considered. Each quadrant has $M$ discrete velocities, and we only show the details for the first quadrant, where $\zeta_{\ell}\in(0,1)$ and $\theta_{\ell}\in(0,\frac{\pi}{2})$. The discrete velocities in the other quadrants are obtained by symmetry.

The first one is referred to as the "2D uniform quadrature". Each quadrant has $M=N^{2}$ ordinates, and the nodes are uniform in the $(\zeta,\theta)$ plane. More precisely, $(\zeta_{i},\theta_{j})=\big(\frac{2N-2i+1}{2N},\frac{2j-1}{4N}\pi\big)$ for $i=1,\cdots,N$; $j=1,\cdots,N$, and $\bar{\omega}_{\ell}=\frac{1}{4N^{2}}$. Then for $M=N^{2}$ and all $\ell\in\{1,\cdots,M\}$, there exists a pair of integers $(i,j)$ with $i,j\in\{1,\cdots,N\}$ such that $\ell=(i-1)N+j$ and

\[
(c_{\ell},s_{\ell},\bar{\omega}_{\ell})=\Big(\big(1-\zeta_{i}^{2}\big)^{\frac{1}{2}}\cos\theta_{j},\ \big(1-\zeta_{i}^{2}\big)^{\frac{1}{2}}\sin\theta_{j},\ \frac{1}{4N^{2}}\Big).\tag{2.3}
\]
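The construction in (2.3) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `uniform_quadrature_first_quadrant` is hypothetical, NumPy is assumed to be available, and the ordinates are stored in the row-major order $\ell=(i-1)N+j$.

```python
import numpy as np

def uniform_quadrature_first_quadrant(N):
    """First-quadrant ordinates (c, s) and weights w of the 2D uniform quadrature (2.3).

    Hypothetical helper for illustration; returns three arrays of length N*N.
    """
    i = np.arange(1, N + 1)
    j = np.arange(1, N + 1)
    zeta = (2 * N - 2 * i + 1) / (2 * N)   # zeta_i, decreasing in i, inside (0, 1)
    theta = (2 * j - 1) * np.pi / (4 * N)  # theta_j, inside (0, pi/2)
    # ell = (i - 1) * N + j: i is the slow index and j the fast one.
    Z, T = np.meshgrid(zeta, theta, indexing="ij")
    r = np.sqrt(1.0 - Z**2)                # radius of the projection onto the unit disk
    c = (r * np.cos(T)).ravel()
    s = (r * np.sin(T)).ravel()
    w = np.full(N * N, 1.0 / (4 * N**2))   # equal weights, summing to 1/4 per quadrant
    return c, s, w

c, s, w = uniform_quadrature_first_quadrant(3)
print(len(c), w.sum())
```

With this normalization the weights of one quadrant sum to $1/4$, so the four quadrants together carry total weight one.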

The second one is the 2D Gaussian quadrature described in [21], for which each quadrant has $M=N(N+1)/2$ ordinates. Each quadrant has $N$ distinct values $\zeta_{i}$, $i\in\{1,\cdots,N\}$, which are the $N$ positive roots of $L_{2N}(\zeta)$, the Legendre polynomial of degree $2N$. They are arranged as

\[
1>\zeta_{1}>\zeta_{2}>\cdots>\zeta_{N}>0.
\]

Each $\zeta_{i}$ corresponds to $i$ distinct angles $\theta_{i,j}=\frac{2j-1}{4i}\pi$, $j=1,2,\cdots,i$, and the weight for the velocity direction $(\zeta_{i},\theta_{i,j})$ is uniform in $j$ such that

\[
\bar{\omega}_{i}=\frac{1}{2i\left(1-\zeta_{i}^{2}\right)\left[L_{2N}^{\prime}\left(\zeta_{i}\right)\right]^{2}}.
\]

Then for $M=N(N+1)/2$ and all $\ell\in\{1,\cdots,M\}$, there exists a pair of integers $(i,j)$ with $i\in\{1,\cdots,N\}$ and $1\leq j\leq i$ such that $\ell=\frac{i(i-1)}{2}+j$ and

\[
\left(c_{\ell},s_{\ell},\bar{\omega}_{\ell}\right)=\Big(\big(1-\zeta_{i}^{2}\big)^{\frac{1}{2}}\cos\theta_{i,j},\ \big(1-\zeta_{i}^{2}\big)^{\frac{1}{2}}\sin\theta_{i,j},\ \frac{1}{2i\left(1-\zeta_{i}^{2}\right)\left[L_{2N}^{\prime}\left(\zeta_{i}\right)\right]^{2}}\Big).
\]
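A minimal sketch of the Gaussian set follows, assuming NumPy. It relies on the standard Gauss-Legendre weight formula $w_{i}=2/\big((1-\zeta_{i}^{2})[L_{2N}'(\zeta_{i})]^{2}\big)$, so that the weight above equals $w_{i}/(4i)$; the function name `gaussian_quadrature_first_quadrant` is hypothetical and not taken from [21].

```python
import numpy as np

def gaussian_quadrature_first_quadrant(N):
    """First-quadrant ordinates (c, s) and weights w of the 2D Gaussian quadrature.

    Hypothetical helper for illustration; returns arrays of length N*(N+1)/2.
    """
    # Gauss-Legendre nodes and weights of degree 2N on (-1, 1), nodes ascending.
    x, gl_w = np.polynomial.legendre.leggauss(2 * N)
    positive = x > 0
    zeta = x[positive][::-1]     # 1 > zeta_1 > ... > zeta_N > 0
    gl_w = gl_w[positive][::-1]  # matching Gauss-Legendre weights
    c, s, w = [], [], []
    for i in range(1, N + 1):
        r = np.sqrt(1.0 - zeta[i - 1] ** 2)
        for j in range(1, i + 1):             # level i carries i azimuthal angles
            theta = (2 * j - 1) * np.pi / (4 * i)
            c.append(r * np.cos(theta))
            s.append(r * np.sin(theta))
            w.append(gl_w[i - 1] / (4 * i))   # equals 1 / (2 i (1 - zeta^2) L'(zeta)^2)
    return np.array(c), np.array(s), np.array(w)

c, s, w = gaussian_quadrature_first_quadrant(3)
print(len(c), w.sum())
```

Since the positive Gauss-Legendre weights sum to one, the quadrant weights again sum to $1/4$, matching the normalization of the uniform set.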

The rest of the quadrature set is constructed by symmetry:

\[
\bar{\omega}_{\ell}=\bar{\omega}_{\ell+M}=\bar{\omega}_{\ell+2M}=\bar{\omega}_{\ell+3M},
\]
\[
\theta_{\ell}=\pi-\theta_{\ell+M}=\theta_{\ell+2M}+\pi=-\theta_{\ell+3M},\qquad\zeta_{\ell}=\zeta_{\ell+M}=\zeta_{\ell+2M}=\zeta_{\ell+3M}.
\]
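The symmetry relations can be applied mechanically to any first-quadrant set. The sketch below assumes NumPy and stores the four quadrants as four consecutive blocks of $M$ ordinates obtained by the reflections $\theta\mapsto\pi-\theta$, $\theta\mapsto\theta-\pi$ and $\theta\mapsto-\theta$; the name `extend_to_four_quadrants` is hypothetical.

```python
import numpy as np

def extend_to_four_quadrants(zeta, theta, w):
    """Extend first-quadrant (zeta, theta, w) to all four quadrants by symmetry.

    Hypothetical helper for illustration; returns (c, s, w) arrays of length 4*M.
    """
    zeta = np.asarray(zeta, dtype=float)
    theta = np.asarray(theta, dtype=float)
    w = np.asarray(w, dtype=float)
    # Block order: first quadrant, then pi - theta, theta - pi, -theta.
    theta_all = np.concatenate([theta, np.pi - theta, theta - np.pi, -theta])
    zeta_all = np.tile(zeta, 4)   # zeta is shared by the four reflected copies
    w_all = np.tile(w, 4)         # so is the weight
    r = np.sqrt(1.0 - zeta_all**2)
    return r * np.cos(theta_all), r * np.sin(theta_all), w_all

# One first-quadrant ordinate with weight 1/4 yields four symmetric ordinates.
c, s, w = extend_to_four_quadrants([0.5], [np.pi / 4], [0.25])
print(c, s, w.sum())
```

By construction the resulting set is invariant under reflection about both coordinate axes, so the weighted sums of $c_{\ell}$ and $s_{\ell}$ vanish.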

Taking $N=3$, the discrete ordinates of the Uniform and Gaussian quadratures in one quadrant are plotted in Figure 1. The selected ordinates on the surface of the 3D unit sphere and their projections onto the 2D unit disk are displayed.

Figure 1: Schematic diagram of selected ordinates on the surface of a 3D unit sphere and their corresponding projection to the 2D unit disk. (a)(b) Uniform quadrature; (c)(d) Gaussian quadrature.

It has long been known that the solution of DOM exhibits the ray effect, especially when there are discontinuous source terms in the computational domain [4, 24, 33]. The ray effect cannot be alleviated by increasing the spatial resolution. We show one typical example to demonstrate it.

Example 2.1.

We consider RTE in the X-Y geometry with a localized source term at the center of the computational domain. Let

\[
(x,y)\in\Omega=[0,1]\times[0,1],\quad\sigma_{T}=1,\quad\sigma_{S}=0.5,
\]
\[
q(x,y)=\begin{cases}2,&(x,y)\in[0.4,0.6]\times[0.4,0.6],\\ 0,&\text{elsewhere}.\end{cases}
\]

The inflow boundary conditions are zero.

We consider both isotropic and anisotropic scattering with the following scattering kernel

\[
P(\boldsymbol{u},\boldsymbol{u}')=G(\boldsymbol{u}\cdot\boldsymbol{u}')=G(\cos\xi)=1+g\cos\xi,\tag{2.4}
\]

where $\xi$ is the angle between $\boldsymbol{u}$ and $\boldsymbol{u}'$. When $g=0$, (2.4) gives isotropic scattering, meaning that particles moving with velocity $\boldsymbol{u}$ scatter into a new velocity $\boldsymbol{u}'$ with uniform probability over all $\boldsymbol{u}'$. When $g=0.9$, (2.4) gives anisotropic scattering: the smaller the angle $\xi$ between $\boldsymbol{u}$ and $\boldsymbol{u}'$, the higher the probability that particles moving with velocity $\boldsymbol{u}$ scatter into $\boldsymbol{u}'$.
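As a quick sanity check on (2.4), one can verify that the kernel averages to one over the unit sphere for any $g$. This assumes the scattering integral is taken against the normalized uniform measure on the sphere, which is not restated in this section. Taking the polar axis along $\boldsymbol{u}$ and writing $t=\cos\xi$,

\[
\frac{1}{4\pi}\int_{\mathbb{S}^{2}}\big(1+g\cos\xi\big)\,\mathrm{d}\boldsymbol{u}'=\frac{1}{2}\int_{-1}^{1}(1+gt)\,\mathrm{d}t=1.
\]

The kernel is nonnegative for all directions only when $|g|\leq 1$, so $g=0.9$ is an admissible, strongly forward-peaked choice: the ratio between forward ($\xi=0$) and backward ($\xi=\pi$) scattering is $1.9/0.1=19$.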

The computational domain is partitioned into $100\times 100$ spatial cells. We employ Uniform quadrature sets with various numbers of ordinates. The numerical results for the average density $\phi(x,y)=\sum_{\ell\in V}\bar{\omega}_{\ell}\psi_{\ell}(x,y)$ are displayed in Figure 2. From left to right, $4$, $16$, and $36$ ordinates of the Uniform quadrature in (2.3) are used. The solutions exhibit rays that correspond to the chosen ordinates of the DOM. They have poor accuracy and violate rotational invariance, and these defects do not disappear when the spatial grid is refined.

Figure 2: Demonstration of the ray effects. The average densities $\phi(x,y)$ calculated with different numbers of ordinates are displayed. The numerical results are calculated with $100\times 100$ spatial cells and the Uniform quadrature. (a)(b)(c): isotropic scattering kernel with $g=0$ in (2.4), using 4, 16, and 36 ordinates, respectively; (d)(e)(f): anisotropic scattering kernel with $g=0.9$ in (2.4), using 4, 16, and 36 ordinates, respectively.

2.2 Low regularity and Convergence order

In this subsection, we first demonstrate numerically that for the tests that exhibit ray effects, the solutions usually have low regularity in velocity space. The 2D solution has sharp transitions in velocity space, and the positions of the transition points depend on the spatial location. Whatever quadrature set is chosen, high-order convergence can only be achieved for smooth functions. Therefore, when the regularity is low, it is difficult to improve the accuracy of the solution without increasing the number of quadrature nodes. Then, we show numerically, in both slab and X-Y geometries, that the convergence order of DOM decreases as the regularity of the solution in velocity space decreases.

2.2.1 The slab geometry case

We solve equation (2.2) and test both the Uniform and Gaussian quadratures. We employ the second-order finite difference spatial discretization in [14] to obtain the numerical results. The number of spatial cells is fixed at $I=50$, and the grid points are $x_{i}$ ($i=0,1,\cdots,I$). Let $\psi_{\ell}(x_{i})$ be the solution to (2.2), and let the average density be $\phi(x_{i})=\sum_{\ell\in V}\omega_{\ell}\psi_{\ell}(x_{i})$. The reference solution is computed using $1280$ ordinates. The $\ell^{2}$ errors of the numerical solutions with different numbers of ordinates are defined by

\[
\mathcal{E}=\sqrt{\frac{1}{I+1}\sum_{i=0}^{I}\big|\phi(x_{i})-\phi^{ref}(x_{i})\big|^{2}}.
\]

Though there is no ray effect in slab geometry, we will show in the following example that when the inflow boundary conditions have low regularity in velocity space, the convergence orders of DOM are low.

Example 2.2.

Let the computational domain, total and scattering cross sections, scattering kernel and source term be respectively

\[
x\in[0,1],\quad\sigma_{T}(x)=10x^{2}+1,\quad\sigma_{S}(x)=5x^{2}+0.5,\quad P(\mu^{\prime},\mu)=1,\quad q(x)=1+x.
\]

We consider three different inflow boundary conditions:

  1. Case 1:

    Continuous and smooth inflow boundary conditions:

    \[\psi(0,\mu)=3\mu,\quad\mu>0;\qquad\psi(1,\mu)=-5\mu,\quad\mu<0.\tag{2.5}\]
  2. Case 2:

    Continuous but non-differentiable inflow boundary conditions:

    \[\psi(0,\mu)=\begin{cases}\frac{4}{3}-\mu,&\frac{1}{3}<\mu<1,\\ 3\mu,&0<\mu\leq\frac{1}{3},\end{cases}\qquad\psi(1,\mu)=\begin{cases}2+\mu,&-1<\mu<-\frac{1}{3},\\ -5\mu,&-\frac{1}{3}\leq\mu<0.\end{cases}\tag{2.6}\]
  3. Case 3:

    Discontinuous inflow boundary conditions:

    \[\psi(0,\mu)=\begin{cases}3-\mu,&\frac{1}{3}<\mu<1,\\ 3\mu,&0<\mu\leq\frac{1}{3},\end{cases}\qquad\psi(1,\mu)=\begin{cases}4+\mu,&-1<\mu<-\frac{1}{3},\\ -5\mu,&-\frac{1}{3}\leq\mu<0.\end{cases}\tag{2.7}\]
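The three inflow conditions (2.5)-(2.7) differ only in the branch used for $|\mu|>\frac{1}{3}$. The short Python sketch below encodes them and evaluates the one-sided jump at $|\mu|=\frac{1}{3}$; the function names `inflow_left` and `inflow_right` are hypothetical and serve only to illustrate the regularity of each case.

```python
def inflow_left(mu, case):
    """psi(0, mu) for 0 < mu < 1, following (2.5)-(2.7)."""
    if case == 1 or mu <= 1 / 3:
        return 3 * mu
    return 4 / 3 - mu if case == 2 else 3 - mu

def inflow_right(mu, case):
    """psi(1, mu) for -1 < mu < 0, following (2.5)-(2.7)."""
    if case == 1 or mu >= -1 / 3:
        return -5 * mu
    return 2 + mu if case == 2 else 4 + mu

eps = 1e-9  # small offset used to approach |mu| = 1/3 from the outer branch
for case in (1, 2, 3):
    jump_left = inflow_left(1 / 3 + eps, case) - inflow_left(1 / 3, case)
    jump_right = inflow_right(-1 / 3 - eps, case) - inflow_right(-1 / 3, case)
    print(case, round(jump_left, 6), round(jump_right, 6))
```

Cases 1 and 2 are continuous at $|\mu|=\frac{1}{3}$ (Case 2 has a kink there), while Case 3 jumps by $\frac{5}{3}$ on the left boundary and by $2$ on the right boundary.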

The convergence orders of DOM for the different cases are shown in Figure 3. For both Uniform and Gaussian quadratures, the convergence order decreases from 2 to 1 as the inflow boundary conditions change from Case 1 to Case 3. Therefore, one cannot expect a high convergence order when the regularity of the inflow boundary conditions is low. In particular, Gaussian quadrature does not reach spectral convergence, as shown in Figure 3(b). This is because, although the inflow boundary condition is smooth in $\mu$, the solution at the boundary jumps at $\mu=0$, and Gaussian quadrature does not provide spectral convergence for solutions with a jump at $\mu=0$.

Figure 3: Example 2.2: Convergence orders of DOM with different inflow boundary conditions in (2.5)-(2.7). (a): Uniform quadratures of different sizes; (b): Gaussian quadratures of different sizes. Here $\Delta\mu=\frac{1}{M}$.

2.2.2 The X-Y geometry case

We solve Example 2.1. The classical second-order diamond difference (DD) method [21] is adopted for the spatial discretization. Since only the convergence order in the velocity variable is considered in this paper, after spatial discretization the reference problem can be regarded as a large integral system for the unknowns at the given spatial grid points. We use the same spatial mesh and discretization throughout the paper, so the error introduced by the spatial discretization does not need to be considered. We emphasize that the main purpose of the current work is to show the convergence orders of the velocity discretization; any spatial discretization can be chosen to obtain the numerical results.

The spatial domain $\Omega=[x_{L},x_{R}]\times[y_{B},y_{T}]$ is divided into $I\times J$ uniform cells. Let $\Delta x=\frac{x_{R}-x_{L}}{I}$, $\Delta y=\frac{y_{T}-y_{B}}{J}$, $x_{0}=x_{L}$, $y_{0}=y_{B}$ and

\[
x_{i}=x_{L}+i\Delta x,\quad x_{i-\frac{1}{2}}=x_{L}+\big(i-\tfrac{1}{2}\big)\Delta x,\quad\text{for }i=1,\cdots,I,
\]
\[
y_{j}=y_{B}+j\Delta y,\quad y_{j-\frac{1}{2}}=y_{B}+\big(j-\tfrac{1}{2}\big)\Delta y,\quad\text{for }j=1,\cdots,J.
\]

We use DD with $I=100$, $J=100$. The grid points of the two-dimensional DD method are at the cell centers, i.e. approximations of $\psi_{\ell}(x_{i-\frac{1}{2}},y_{j-\frac{1}{2}})$ and of the average density $\phi(x_{i-\frac{1}{2}},y_{j-\frac{1}{2}})=\sum_{\ell\in V}\bar{\omega}_{\ell}\psi_{\ell}(x_{i-\frac{1}{2}},y_{j-\frac{1}{2}})$ (for $i=1,\cdots,I$, $j=1,\cdots,J$) are obtained. The $\ell^{2}$ errors between the reference solution $\phi^{ref}$ and the numerical solutions $\phi$ obtained by different quadratures are defined by

\[
\mathcal{E}=\sqrt{\frac{1}{IJ}\Big(\sum_{i=0}^{I-1}\sum_{j=0}^{J-1}\big|\phi(x_{i+\frac{1}{2}},y_{j+\frac{1}{2}})-\phi^{ref}(x_{i+\frac{1}{2}},y_{j+\frac{1}{2}})\big|^{2}\Big)}.\tag{2.8}
\]
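The error measure (2.8) is a root-mean-square difference over the cell centers. A minimal sketch, assuming NumPy and that both solutions are stored as $I\times J$ arrays of cell-center values (the helper names `cell_centers` and `rms_error` are hypothetical):

```python
import numpy as np

def cell_centers(lo, hi, n):
    """Centers of n uniform cells covering [lo, hi]."""
    h = (hi - lo) / n
    return lo + (np.arange(n) + 0.5) * h

def rms_error(phi, phi_ref):
    """Discrete l2 error as in (2.8): sqrt of the mean squared difference."""
    phi = np.asarray(phi, dtype=float)
    phi_ref = np.asarray(phi_ref, dtype=float)
    return np.sqrt(np.mean((phi - phi_ref) ** 2))

# Synthetic data on the 100 x 100 mesh of Example 2.1 (not a transport solution).
x = cell_centers(0.0, 1.0, 100)
y = cell_centers(0.0, 1.0, 100)
X, Y = np.meshgrid(x, y, indexing="ij")
phi_ref = X + Y
phi = phi_ref + 0.01   # a uniform perturbation of size 0.01
print(rms_error(phi, phi_ref))
```

Because both solutions live on the same spatial mesh, this quantity isolates the error caused by the velocity discretization.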

The reference solution of the Uniform (Gaussian) quadrature $\phi_{U}^{ref}$ ($\phi_{G}^{ref}$) is computed with $N=20$, which corresponds to $1600$ ($840$) ordinates on the 2D disk. As shown in Table 1, the convergence orders of both quadratures are around 0.74. In Figure 4, we plot the velocity distribution of $\psi_{U}^{ref}$ on the $(c,s)$ plane near the midpoints of the left and right boundaries. It can be seen that the solution varies rapidly in the velocity variable.

Table 1: Example 2.1: Convergence orders of DOM with the scattering kernel as in (2.4). Here $\Delta S=\frac{\pi}{4M}$.
\begin{tabular}{llccccc}
Quadrature & $g$ & $\Delta S=\pi/100$ & $\Delta S=\pi/64$ & $\Delta S=\pi/36$ & $\Delta S=\pi/16$ & Order \\
Uniform & $0$ & 4.337E-03 & 5.560E-03 & 8.672E-03 & 1.629E-02 & 0.73 \\
Uniform & $0.9$ & 4.354E-03 & 5.580E-03 & 8.707E-03 & 2.075E-02 & 0.73 \\
Gaussian & $0$ & 3.966E-03 & 5.496E-03 & 8.862E-03 & 1.477E-02 & 0.75 \\
Gaussian & $0.9$ & 3.978E-03 & 5.518E-03 & 8.902E-03 & 1.491E-02 & 0.75 \\
\end{tabular}
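An observed convergence order can be estimated from such data by a least-squares fit of $\log\mathcal{E}$ against $\log\Delta S$. The sketch below applies this to the Uniform, $g=0$ row of Table 1. It assumes NumPy, the helper name `observed_order` is hypothetical, and the fit is one plausible way to obtain an order, not necessarily the procedure used for the table.

```python
import numpy as np

def observed_order(h, err):
    """Slope of the least-squares line through (log h, log err)."""
    slope, _intercept = np.polyfit(np.log(h), np.log(err), 1)
    return slope

# Uniform quadrature, g = 0, values copied from Table 1.
delta_s = np.pi / np.array([100.0, 64.0, 36.0, 16.0])
errors = np.array([4.337e-3, 5.560e-3, 8.672e-3, 1.629e-2])
order = observed_order(delta_s, errors)
print(round(order, 2))
```

The fitted slope is close to the value 0.73 reported in the last column, well below the first-order rate one would expect for a smooth integrand.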
Figure 4: Example 2.1: The velocity distribution of $\psi$ at given spatial positions. (a)(d): the locations of the two points $a$ and $b$. (b)(c): heat maps of $(c_{m},s_{m},\psi_{m}(a))$, $\forall m\in\bar{V}$, for $g=0$ and $g=0.9$, respectively. (e)(f): heat maps of $(c_{m},s_{m},\psi_{m}(b))$, $\forall m\in\bar{V}$, for $g=0$ and $g=0.9$, respectively. The black '+' marks indicate the locations of all ordinates $(c_{m},s_{m})$.

3 Random ordinate method

The ROM is based on DOM, but the ordinates are chosen randomly. More precisely, the ROM proceeds as follows:

  1. The velocity space $S$ is divided into $n$ cells and each cell is denoted by $S_{\ell}$ ($\ell=1,\cdots,n$). The maximum area of all $S_{\ell}$ is denoted by $\Delta S=\max_{\ell=1,\cdots,n}|S_{\ell}|$. For example, in 1D, $S=[-1,1]$; if a uniform mesh is employed, then $\Delta S=2/n$.

  2. Sample randomly one ordinate from each cell with uniform probability. Denote $\mathbb{V}^{\xi}=\{\boldsymbol{u}_{1},\cdots,\boldsymbol{u}_{n}\}$ as the tuple of random ordinates and $V^{\xi}$ as the index set.

  3. Solve the resulting discrete ordinate system with the randomly chosen velocity directions,

     \[
     \boldsymbol{u}_{\ell}\cdot\nabla\psi_{\ell}(\boldsymbol{z})+\sigma_{T}(\boldsymbol{z})\psi_{\ell}(\boldsymbol{z})=\sigma_{S}(\boldsymbol{z})\sum_{\ell'=1}^{n}\omega_{\ell'}P_{\ell',\ell}\psi_{\ell'}(\boldsymbol{z})+q_{\ell}(\boldsymbol{z}),\quad\boldsymbol{u}_{\ell}\in\mathbb{V}^{\xi},
     \tag{3.1}
     \]

     subject to the boundary conditions

     \[
     \psi_{\ell}(\boldsymbol{z})=\psi_{\Gamma}^{-}(\boldsymbol{z},\boldsymbol{u}_{\ell}),\quad\boldsymbol{z}\in\Gamma^{-}=\partial\Omega,\quad\boldsymbol{u}_{\ell}\in\mathbb{V}^{\xi},\quad\boldsymbol{u}_{\ell}\cdot\boldsymbol{n}_{\boldsymbol{z}}<0.
     \tag{3.2}
     \]
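The first two steps can be sketched in a few lines for slab geometry. This is a minimal illustration, assuming a uniform partition of $S=[-1,1]$ and the weights $\omega_{\ell}=|S_{\ell}|/|S|$ given in (3.4); the function name `sample_random_ordinates` and the seed are hypothetical choices, and the transport solve of step 3 is left to an existing DOM code.

```python
import numpy as np

def sample_random_ordinates(n, rng):
    """Draw one ordinate per cell of a uniform partition of S = [-1, 1].

    Returns (mu, omega): the sampled ordinates and the weights
    omega_l = |S_l| / |S|.
    """
    edges = np.linspace(-1.0, 1.0, n + 1)      # boundaries of S_1, ..., S_n
    widths = np.diff(edges)                    # |S_l|, all equal to 2/n here
    mu = edges[:-1] + widths * rng.random(n)   # uniform sample inside each cell
    omega = widths / 2.0                       # |S_l| / |S| with |S| = 2
    return mu, omega

rng = np.random.default_rng(0)   # arbitrary seed, only for reproducibility
mu, omega = sample_random_ordinates(8, rng)
```

The arrays `mu` and `omega` would then replace the fixed quadrature nodes and weights of a DOM solver, which is why the change to existing code is small.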

Since $\boldsymbol{u}_{\ell}$ are now randomly chosen, to guarantee the solution accuracy, one has to determine the corresponding discrete weights $\omega_{\ell}$ and discrete scattering kernel $P_{\ell',\ell}$. We will simply choose the following approximation:

\[
\frac{1}{|S|}\int_{S}P(\boldsymbol{u}',\boldsymbol{u}_{\ell})\psi(\boldsymbol{z},\boldsymbol{u}')\,d\boldsymbol{u}'\approx\sum_{\ell'=1}^{n}\omega_{\ell'}P_{\ell',\ell}\psi_{\ell'}(\boldsymbol{z}),
\tag{3.3}
\]

where

\[
\omega_{\ell'}=\frac{|S_{\ell'}|}{|S|},\qquad P_{\ell',\ell}=P(\boldsymbol{u}_{\ell'},\boldsymbol{u}_{\ell}).
\tag{3.4}
\]

Such a choice clearly satisfies $\sum_{\ell=1}^{n}\omega_{\ell}=1$. If $\ell'\neq\ell$, then

\[
\mathbb{E}_{\mu_{\ell'}}P_{\ell',\ell}\psi_{\ell'}(\boldsymbol{z})=\frac{1}{|S_{\ell'}|}\int_{S_{\ell'}}P(\boldsymbol{u}',\boldsymbol{u}_{\ell})\psi(\boldsymbol{z},\boldsymbol{u}')\,d\boldsymbol{u}'.
\]

Therefore, the expectation of the weighted summation provides a good approximation to the integration on the right hand side of (1.1). More precisely,

\begin{align*}
\sum_{\ell'=1}^{n}\omega_{\ell'}\mathbb{E}_{\mu_{\ell'}}P_{\ell',\ell}\psi_{\ell'}(\boldsymbol{z})
={}&\frac{1}{|S|}\int_{S}P(\boldsymbol{u}',\boldsymbol{u}_{\ell})\psi(\boldsymbol{z},\boldsymbol{u}')\,d\boldsymbol{u}' \\
&-\frac{|S_{\ell}|}{|S|}\int_{S_{\ell}}P(\boldsymbol{u}',\boldsymbol{u}_{\ell})\psi(\boldsymbol{z},\boldsymbol{u}')\,d\boldsymbol{u}'+\omega_{\ell}P(\boldsymbol{u}_{\ell},\boldsymbol{u}_{\ell})\psi(\boldsymbol{z},\boldsymbol{u}_{\ell}) \\
={}&\frac{1}{|S|}\int_{S}P(\boldsymbol{u}',\boldsymbol{u}_{\ell})\psi(\boldsymbol{z},\boldsymbol{u}')\,d\boldsymbol{u}'+O(|S_{\ell}|^{2}D(S_{\ell})),
\end{align*}

where $D(S_{\ell})$ denotes the diameter of the $\ell$-th cell. By employing such a straightforward strategy, there exists the possibility that $\sum_{\ell'}P_{\ell,\ell'}\omega_{\ell'}\neq 1$, potentially leading to a violation of mass conservation at the discrete level. Nonetheless, the positivity of the solution is maintained. Since we consider only $O(1)$ total and scattering cross sections in the current work, only a small error is introduced.
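The possible loss of discrete mass conservation can be seen numerically. The sketch below is only an illustration in slab geometry: it assumes a hypothetical linearly anisotropic kernel $P(\mu',\mu)=1+3g\mu'\mu$ (not the kernel (2.4) used in the paper), which is normalized in the continuous sense, and evaluates $\sum_{\ell'}\omega_{\ell'}P_{\ell',\ell}$ for one draw of random ordinates. The helper names and the value $g=0.3$ are assumptions made for the example.

```python
import numpy as np

def sample_ordinates(n, rng):
    """One uniform random ordinate per cell of a uniform partition of [-1, 1]."""
    edges = np.linspace(-1.0, 1.0, n + 1)
    widths = np.diff(edges)
    mu = edges[:-1] + widths * rng.random(n)
    return mu, widths / 2.0                  # ordinates and weights |S_l| / |S|

def kernel(mu_prime, mu, g=0.3):
    # HYPOTHETICAL kernel, chosen only because it is easy to check:
    # (1/2) * int_{-1}^{1} (1 + 3 g mu' mu) dmu' = 1 for every mu,
    # and it stays positive for g = 0.3.
    return 1.0 + 3.0 * g * mu_prime * mu

rng = np.random.default_rng(2)               # arbitrary seed
mu, omega = sample_ordinates(16, rng)
P = kernel(mu[:, None], mu[None, :])         # P[l', l] = P(mu_l', mu_l)
row_sums = omega @ P                         # sum over l' of omega_l' P[l', l]
max_defect = np.max(np.abs(row_sums - 1.0))  # deviation from exact normalization
print(f"largest normalization defect: {max_defect:.2e}")
```

For this particular kernel the defect is proportional to $\sum_{\ell'}\omega_{\ell'}\mu_{\ell'}$, so it is small but generally nonzero for independently sampled ordinates.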

According to [6, 17], symmetric ordinates perform better, especially when there are multiscale parameters in the computational domain. Although we do not consider multiscale parameters in the current paper, symmetric ordinates are used in the ROM. More precisely, in slab geometry, let $n=2m$ and $S_{1}\cup S_{2}\cup\cdots\cup S_{m}=[-1,0]$. The ordinates $\mu_{\ell}$ ($\ell=1,\cdots,m$) are randomly sampled from $S_{\ell}$ with a uniform distribution. Then

\[
\mu_{m+\ell}=-\mu_{m+1-\ell},\quad\mbox{for $\ell=1,2,\cdots,m$}.
\]
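The symmetric construction in slab geometry can be sketched as follows, assuming a uniform partition of $[-1,0]$ into $m$ cells; the function name `sample_symmetric_ordinates` is illustrative.

```python
import numpy as np

def sample_symmetric_ordinates(m, rng):
    """Sample m ordinates in [-1, 0] and mirror them into [0, 1]."""
    edges = np.linspace(-1.0, 0.0, m + 1)          # S_1, ..., S_m partition [-1, 0]
    widths = np.diff(edges)
    mu_neg = edges[:-1] + widths * rng.random(m)   # mu_1, ..., mu_m
    mu_pos = -mu_neg[::-1]                         # mu_{m+l} = -mu_{m+1-l}
    mu = np.concatenate([mu_neg, mu_pos])
    # Mirrored cells have the same size, so the weights are mirrored too.
    omega = np.concatenate([widths, widths[::-1]]) / 2.0
    return mu, omega

rng = np.random.default_rng(1)   # arbitrary seed
mu, omega = sample_symmetric_ordinates(4, rng)
```

Only $m$ random numbers are drawn for $n=2m$ ordinates, and the weighted first moment $\sum_{\ell}\omega_{\ell}\mu_{\ell}$ vanishes by construction.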

In the X-Y geometry, let $n=4m$. The union $S_{1}\cup S_{2}\cup\cdots\cup S_{m}$ is the quarter disk in the first quadrant, and $\mu_{\ell}$ ($\ell=1,\cdots,m$) are randomly sampled from $S_{\ell}$ with a uniform distribution. The symmetric ordinates indicate that, for $\ell=1,\cdots,m$,

\begin{align*}
\zeta_{\ell}&=\zeta_{\ell+m}=\zeta_{\ell+2m}=\zeta_{\ell+3m}, \tag{3.5a}\\
\theta_{\ell}&=\pi-\theta_{\ell+m}=\pi+\theta_{\ell+2m}=2\pi-\theta_{\ell+3m}. \tag{3.5b}
\end{align*}
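A possible way to apply (3.5) in code is sketched below. It takes first-quadrant values $(\zeta_{\ell},\theta_{\ell})$ as given and only performs the mirroring; how the quarter disk is partitioned and sampled is not specified here. The mapping from $(\zeta,\theta)$ to the projected direction $(c,s)$ is an assumption, since its definition lies outside this section, and the function name is hypothetical.

```python
import numpy as np

def mirror_to_four_quadrants(zeta, theta):
    """Extend m first-quadrant ordinates to 4m symmetric ones following (3.5)."""
    zeta = np.asarray(zeta, dtype=float)
    theta = np.asarray(theta, dtype=float)
    zeta_all = np.tile(zeta, 4)                # (3.5a): zeta is repeated
    theta_all = np.concatenate([
        theta,                 # theta_l
        np.pi - theta,         # theta_{l+m}
        theta - np.pi,         # theta_{l+2m}; same direction as theta + pi
        2.0 * np.pi - theta,   # theta_{l+3m}
    ])
    # ASSUMPTION: projected direction (c, s) = sqrt(1 - zeta^2) * (cos, sin)(theta).
    radius = np.sqrt(1.0 - zeta_all**2)
    return radius * np.cos(theta_all), radius * np.sin(theta_all)

rng = np.random.default_rng(4)                 # arbitrary seed
zeta = rng.random(3)                           # placeholder values in [0, 1)
theta = 0.5 * np.pi * rng.random(3)            # placeholder angles in [0, pi/2)
c, s = mirror_to_four_quadrants(zeta, theta)
```

With this mirroring the four copies of each ordinate cancel in both components, so the sums of `c` and of `s` are zero up to rounding.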

4 The convergence of ROM

As demonstrated in section 2, when the solution regularity is low in velocity space, the convergence orders of given quadratures are also low. At first glance, there is no benefit to using the ROM. For a given quadrature $\mathbb{V}^{\xi}$, one can measure the numerical errors by the difference between $\phi^{\xi}(\boldsymbol{z})=\sum_{\boldsymbol{u}_{\ell}\in\mathbb{V}^{\xi}}\omega_{\ell}\psi_{\ell}(\boldsymbol{z})$ and the reference average density $\phi(\boldsymbol{z})$. The accuracy of $\phi^{\xi}(\boldsymbol{z})$ cannot be improved by using random ordinates.
However, consider the following two quantities. The first is the expected single-run error $\mathbb{E}\|\phi^{\xi}-\phi\|$, which gives the expected error of one run. The second is the bias $\left\|\mathbb{E}\phi^{\xi}-\phi\right\|$, which gives the distance between the expected value of $\phi^{\xi}$ and the reference solution. We will see from the convergence analysis in the subsequent part that the convergence order of the bias is higher even if the solution regularity in velocity space is low. Therefore, if we take more samples of $\mathbb{V}^{\xi}$, run the system (3.1) multiple times in parallel, and then take the expectation $\mathbb{E}\phi^{\xi}$, the solution accuracy can be improved.

To illustrate the idea, we provide the convergence proof for isotropic scattering in slab geometry. We will show that a single typical run of ROM gives a convergence order of $3/2$, and the expectation of multiple runs gives third-order convergence. The main idea is to expand the solution as a convergent series, then estimate the error and bias of the proposed ROM. The idea can be extended to higher-dimensional cases (X-Y geometry, etc.). However, the proof is more technical, and we leave it for future work.
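The difference between the two error measures can be illustrated on a toy problem. The sketch below is not the transport solve and does not reproduce the rates proved in this section; it only applies the random-ordinate quadrature to a hypothetical step function in $\mu$, standing in for a solution with low regularity in velocity, and compares the average error of single runs with the error of the averaged result. All names and parameters are assumptions made for the example.

```python
import numpy as np

def rom_average(f, n, rng):
    """One realization of sum_l omega_l f(mu_l) on S = [-1, 1]."""
    edges = np.linspace(-1.0, 1.0, n + 1)
    widths = np.diff(edges)
    mu = edges[:-1] + widths * rng.random(n)   # one random ordinate per cell
    return np.sum(widths / 2.0 * f(mu))

# Hypothetical low-regularity "solution": a jump at mu = 0.3.
step = lambda mu: (mu > 0.3).astype(float)
phi_exact = 0.35                               # (1/2) * length of (0.3, 1]

rng = np.random.default_rng(3)                 # arbitrary seed
phi_runs = np.array([rom_average(step, 10, rng) for _ in range(2000)])

expected_single_run_error = np.mean(np.abs(phi_runs - phi_exact))
bias_estimate = abs(np.mean(phi_runs) - phi_exact)
print(expected_single_run_error, bias_estimate)
```

In this toy setting every single run misses the jump by the same amount, while the average over independent runs is much closer to the exact value; the runs are independent, so they can be executed in parallel.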

4.1 Expansion of the solution.

The RTE in slab geometry with isotropic scattering kernel (i.e., $P(\mu',\mu)=1$ and $|S|=2$) reads

\[
\mu\partial_{x}\psi(x,\mu)+\sigma_{T}(x)\psi(x,\mu)=\sigma_{S}(x)\phi(x)+q(x),
\tag{4.1}
\]

where $\phi(x)=\frac{1}{2}\int_{-1}^{1}\psi(x,\mu)\,d\mu$, subject to the inflow boundary conditions (2.1).

Let $\lambda=\big\|\frac{\sigma_{S}(x)}{\sigma_{T}(x)}\big\|_{\infty}\in(0,1)$. Then equation (4.1) can be rewritten as

\[
\mu\partial_{x}\psi(x,\mu)+\sigma_{T}(x)\psi(x,\mu)=\lambda\sigma_{r}(x)\phi(x)+q(x),
\tag{4.2}
\]

where

\[
\sigma_{r}(x)=\frac{\sigma_{S}(x)}{\|\sigma_{S}(x)/\sigma_{T}(x)\|_{\infty}}\leq\sigma_{T}(x).
\]

The solution to (4.2) can be expanded as

\[
\psi(x,\mu)=\sum_{p=0}^{\infty}\lambda^{p}\psi^{(p)}(x,\mu),
\tag{4.3}
\]

where $\psi^{(0)}$ satisfies

\[
\mu\partial_{x}\psi^{(0)}(x,\mu)+\sigma_{T}(x)\psi^{(0)}(x,\mu)=q(x),
\tag{4.4}
\]

with the boundary conditions (2.1), and $\psi^{(p)}$ ($p\geq 1$) satisfies

\[
\mu\partial_{x}\psi^{(p)}(x,\mu)+\sigma_{T}(x)\psi^{(p)}(x,\mu)=\sigma_{r}(x)\frac{1}{2}\int_{-1}^{1}\psi^{(p-1)}(x,\mu)\,d\mu=\sigma_{r}(x)\phi^{(p-1)}(x)
\tag{4.5}
\]

with zero inflow boundary conditions. Since $\lambda<1$, if $\|\psi^{(p)}(x,\mu)\|_{\infty}$ are uniformly bounded for all $p=0,1,\cdots$, the summation on the right hand side of (4.3) converges. Then it is easy to verify that (4.3) satisfies the original equation (4.1) [15].
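The verification can be spelled out as follows. This is only a formal sketch, assuming the series (4.3) converges well enough that summation can be interchanged with $\mu\partial_{x}$ and with the integral in $\mu$, and writing $\phi^{(p)}=\frac{1}{2}\int_{-1}^{1}\psi^{(p)}\,d\mu$ so that $\phi=\sum_{p\geq 0}\lambda^{p}\phi^{(p)}$. Adding (4.4) to $\lambda^{p}$ times (4.5), summed over $p\geq 1$, gives

```latex
\begin{align*}
\mu\partial_{x}\psi+\sigma_{T}\psi
&=\sum_{p=0}^{\infty}\lambda^{p}\bigl(\mu\partial_{x}\psi^{(p)}+\sigma_{T}\psi^{(p)}\bigr)
 =q+\sum_{p=1}^{\infty}\lambda^{p}\sigma_{r}\phi^{(p-1)} \\
&=q+\lambda\sigma_{r}\sum_{p=0}^{\infty}\lambda^{p}\phi^{(p)}
 =q+\lambda\sigma_{r}\phi ,
\end{align*}
```

which is (4.2); since $\lambda\sigma_{r}=\sigma_{S}$, this is the same as (4.1). The inflow data are carried entirely by $\psi^{(0)}$, because every $\psi^{(p)}$ with $p\geq 1$ has zero inflow.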

Expansion in operator form

Solving (4.4) yields

\begin{align*}
\mu>0,\quad&\psi^{(0)}(x,\mu)=\frac{1}{\mu}\int_{x_{L}}^{x}e^{-\frac{1}{\mu}\int_{y}^{x}\sigma_{T}(z)dz}q(y)\,dy+e^{-\frac{1}{\mu}\int_{x_{L}}^{x}\sigma_{T}(y)dy}\psi_{L}(\mu), \tag{4.6a}\\
\mu<0,\quad&\psi^{(0)}(x,\mu)=-\frac{1}{\mu}\int_{x}^{x_{R}}e^{\frac{1}{\mu}\int_{x}^{y}\sigma_{T}(z)dz}q(y)\,dy+e^{\frac{1}{\mu}\int_{x}^{x_{R}}\sigma_{T}(y)dy}\psi_{R}(\mu). \tag{4.6b}
\end{align*}

Similarly, the solution to (4.5) is

\[
\psi^{(p)}(x,\mu)=\begin{cases}
\frac{1}{\mu}\int_{x_{L}}^{x}e^{-\frac{1}{\mu}\int_{y}^{x}\sigma_{T}(z)dz}\sigma_{r}(y)\phi^{(p-1)}(y)\,dy,&\mbox{for $\mu>0$},\\
-\frac{1}{\mu}\int_{x}^{x_{R}}e^{\frac{1}{\mu}\int_{x}^{y}\sigma_{T}(z)dz}\sigma_{r}(y)\phi^{(p-1)}(y)\,dy,&\mbox{for $\mu<0$}.
\end{cases}
\tag{4.7}
\]

It is convenient to denote the solution operator in (4.7) by $\mathcal{A}_{\mu}$, such that

\[
\psi^{(p)}(\cdot,\mu):=\mathcal{A}_{\mu}(\phi^{(p-1)}).
\]
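As a sanity check on (4.7), the operator $\mathcal{A}_{\mu}$ can be evaluated numerically in a simple case. The sketch below assumes constant $\sigma_{T}$ and $\sigma_{r}$ (an assumption made only so that the inner integral is explicit) and compares a trapezoidal evaluation of (4.7) for $\phi^{(p-1)}\equiv 1$ with the closed form $\frac{\sigma_{r}}{\sigma_{T}}\bigl(1-e^{-\sigma_{T}d/|\mu|}\bigr)$, where $d$ is the distance from $x$ to the inflow boundary. The function name `apply_A` and all parameter values are illustrative.

```python
import numpy as np

def apply_A(mu, x, phi, sigma_t, sigma_r, x_left, x_right, num=4001):
    """Evaluate (A_mu phi)(x) from (4.7) for CONSTANT sigma_T and sigma_r.

    mu must be nonzero; phi is a callable acting on arrays of y values.
    """
    if mu > 0:
        y = np.linspace(x_left, x, num)
        integrand = np.exp(-sigma_t * (x - y) / mu) * sigma_r * phi(y) / mu
    else:
        y = np.linspace(x, x_right, num)
        integrand = -np.exp(sigma_t * (y - x) / mu) * sigma_r * phi(y) / mu
    # Composite trapezoidal rule on the grid y.
    return np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(y))

sigma_t, sigma_r, x_left, x_right = 1.0, 0.5, 0.0, 1.0   # illustrative values
ones = lambda y: np.ones_like(y)
x = 0.6

num_pos = apply_A(0.5, x, ones, sigma_t, sigma_r, x_left, x_right)
num_neg = apply_A(-0.5, x, ones, sigma_t, sigma_r, x_left, x_right)
exact_pos = sigma_r / sigma_t * (1.0 - np.exp(-sigma_t * (x - x_left) / 0.5))
exact_neg = sigma_r / sigma_t * (1.0 - np.exp(-sigma_t * (x_right - x) / 0.5))
```

Both branches stay below $\sigma_{r}/\sigma_{T}\leq 1$ in this constant-coefficient case.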

Then, the solution in (4.6) can be rewritten as

\[
\psi^{(0)}(x,\mu)=\mathcal{A}_{\mu}(q/\sigma_{r})(x)+b_{\mu}(x),
\]

where

\[
b_{\mu}(x)=\begin{cases}B_{\mu}(x)\psi_{L}(\mu),&\mu>0,\\ B_{\mu}(x)\psi_{R}(\mu),&\mu<0,\end{cases}
\quad\text{with}\quad
B_{\mu}(x)=\begin{cases}e^{-\frac{1}{\mu}\int_{x_{L}}^{x}\sigma_{T}(y)dy},&\mu>0,\\ e^{\frac{1}{\mu}\int_{x}^{x_{R}}\sigma_{T}(y)dy},&\mu<0.\end{cases}
\]

In (4.1), multiplying by $\psi$ and integrating with respect to $x$, the skew-symmetric term $\mu\partial_{x}$ can be eliminated, leaving only boundary terms. This gives the weighted $L^{2}$ norm of $\psi$ with weight $\sigma_{T}$. Hence, we consider the space $L^{2}(I;\sigma_{T})$ with inner product

\[
\langle f,g\rangle:=\int_{x_{L}}^{x_{R}}fg\,\sigma_{T}\,dx. \tag{4.8}
\]

Then, $\mathcal{A}_{\mu}$ can be viewed as the integral operator on $L^{2}(\Omega;\sigma_{T})$ with kernel

\[
k_{\mu}(x,y)=\begin{cases}\frac{1}{\mu}\mathbb{I}_{(y\leq x)}e^{-\frac{1}{\mu}\int_{y}^{x}\sigma_{T}(z)\,dz}\frac{\sigma_{r}(y)}{\sigma_{T}(y)},&\mu>0,\\ -\frac{1}{\mu}\mathbb{I}_{(y\geq x)}e^{\frac{1}{\mu}\int_{x}^{y}\sigma_{T}(z)\,dz}\frac{\sigma_{r}(y)}{\sigma_{T}(y)},&\mu<0.\end{cases}
\]

More precisely,

\[
\mathcal{A}_{\mu}(\phi)(x)=\int_{x_{L}}^{x_{R}}k_{\mu}(x,y)\phi(y)\sigma_{T}(y)\,dy.
\]

We then introduce the iteration operator $\mathcal{T}$:

\[
\mathcal{T}=\fint_{S}\mathcal{A}_{\mu}\,d\mu=\frac{1}{2}\int_{-1}^{1}\mathcal{A}_{\mu}\,d\mu, \tag{4.9}
\]

which satisfies

\[
\phi^{(p)}=\fint_{S}\psi^{(p)}(\cdot,\mu)\,d\mu=\fint_{S}\mathcal{A}_{\mu}(\phi^{(p-1)})\,d\mu=\mathcal{T}(\phi^{(p-1)}),\quad p\geq 1,
\]

and

\[
\phi^{(0)}(x)=\mathcal{T}(q/\sigma_{r})(x)+\fint_{S}b_{\mu}(x)\,d\mu.
\]

Therefore, the average density is given by

\[
\phi(x)=\sum_{p=0}^{\infty}\lambda^{p}\phi^{(p)}(x)=\sum_{p=0}^{\infty}\lambda^{p}\left(\mathcal{T}^{p+1}(q/\sigma_{r})(x)+\mathcal{T}^{p}\fint_{S}b_{\mu}(x)\,d\mu\right). \tag{4.10}
\]
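The series (4.10) can be checked numerically on a small example. The following is a minimal sketch, not the discretization used in this paper: it assumes constant $\sigma_T$ and $\sigma_r$, zero inflow (so $b_\mu=0$), a piecewise-constant $\phi$ on uniform spatial cells, and a Gauss-Legendre rule in $\mu$. The names `transport_matrix` and `iteration_operator` are hypothetical. It compares partial sums of the series with a direct solve of $(I-\lambda\mathcal{T})\phi=\mathcal{T}(q/\sigma_r)$.

```python
import numpy as np

def transport_matrix(mu, x, h, sigma_t, sigma_r):
    """Matrix of A_mu acting on piecewise-constant functions on uniform cells.

    Assumes constant sigma_t and sigma_r, so the kernel k_mu is integrated
    exactly over each upstream cell.
    """
    a = abs(mu)
    # Upstream distance from evaluation point x_i to the centre of cell j.
    d = np.sign(mu) * (x[:, None] - x[None, :])
    lo = np.clip(d - h / 2, 0.0, None)  # distance to the near edge of cell j
    hi = np.clip(d + h / 2, 0.0, None)  # distance to the far edge of cell j
    return (sigma_r / sigma_t) * (np.exp(-sigma_t * lo / a) - np.exp(-sigma_t * hi / a))

def iteration_operator(mus, weights, x, h, sigma_t, sigma_r):
    """Weighted average of A_mu over the given ordinates (weights sum to 1)."""
    return sum(w * transport_matrix(m, x, h, sigma_t, sigma_r)
               for m, w in zip(mus, weights))

# Illustrative parameters (assumptions, not taken from the paper).
x_left, x_right, n_x = 0.0, 1.0, 50
h = (x_right - x_left) / n_x
x = x_left + h * (np.arange(n_x) + 0.5)
sigma_t, sigma_r, lam = 1.0, 0.5, 1.0
q = np.ones(n_x)

# Gauss-Legendre nodes on [-1, 1]; dividing the weights by 2 gives the average over S.
mus, w = np.polynomial.legendre.leggauss(16)
T = iteration_operator(mus, w / 2.0, x, h, sigma_t, sigma_r)

# phi^(0) = T(q / sigma_r), since b_mu = 0 for zero inflow.
phi0 = T @ (q / sigma_r)

# Partial sum of (4.10): sum over p of lam^p T^p phi^(0).
phi_series = np.zeros(n_x)
term = phi0.copy()
for p in range(60):
    phi_series += term
    term = lam * (T @ term)

# Direct solve of (I - lam T) phi = phi^(0) for comparison.
phi_direct = np.linalg.solve(np.eye(n_x) - lam * T, phi0)
print(np.max(np.abs(phi_series - phi_direct)))
```

In this setting every row of `T` sums to at most $\sigma_r/\sigma_T$, which is why the partial sums converge geometrically.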

In ROM, the magnitude of $\omega_{\ell}$ relates to the mesh size. Thus, in order to estimate the convergence order, we introduce the rescaled weights

\[
\alpha_{\ell}=n\cdot\omega_{\ell},
\]

so that $\sum_{\ell=1}^{n}\alpha_{\ell}=n$ and each $\alpha_{\ell}=O(1)$. One run of ROM is to solve

\[
\mu_{\ell}\frac{d\psi_{\ell}(x)}{dx}+\sigma_{T}(x)\psi_{\ell}(x)=\lambda\sigma_{r}(x)\phi^{\xi}(x)+q(x),\quad\mu_{\ell}\in\mathbb{V}^{\xi}
\]

with

\[
\phi^{\xi}(x)=\sum_{\ell'\in V^{\xi}}\omega_{\ell'}\psi_{\ell'}(x)=\frac{1}{n}\sum_{\ell'=1}^{n}\alpha_{\ell'}\psi_{\ell'}(x).
\]

Similarly to (4.10), the average density $\phi^{\xi}$ for ROM can be written as

\[
\phi^{\xi}(x)=\sum_{p=0}^{\infty}\lambda^{p}\left((\mathcal{T}^{\xi})^{p+1}(q/\sigma_{r})(x)+(\mathcal{T}^{\xi})^{p}\left(\frac{1}{n}\sum_{\ell'}\alpha_{\ell'}b_{\mu_{\ell'}}(x)\right)\right), \tag{4.11}
\]

where the iteration operator $\mathcal{T}^{\xi}$ becomes

\[
\mathcal{T}^{\xi}=\frac{1}{n}\sum_{\ell}\alpha_{\ell}\mathcal{A}_{\mu_{\ell}}. \tag{4.12}
\]

Our goal is to estimate the difference between $\phi$ and $\phi^{\xi}$ in terms of the expected single run error and bias, using the expansions listed above.

4.2 Main result and the proof

As in [12], the singularity near $\mu=0$ affects the estimates of the order of convergence. We thus consider a truncated approximation similar to Grad's angular cutoff for the Boltzmann equation and prove the convergence of ROM for the truncated system, using the expansion introduced above.

We take $\delta>0$ and consider the truncated velocity space $S^{\delta}=[-1,-\delta)\cup(\delta,1]$. The difference between the truncated systems and the original systems can be controlled by $\delta$. With this truncated approximation, one has

\[
\mathcal{T}^{\delta}:=\fint_{S^{\delta}}\mathcal{A}_{\mu}\,d\mu.
\]

The average density $\phi^{\delta}=\fint_{S^{\delta}}\psi(x,\mu)\,d\mu$ for this truncated system is then given by (4.10) with $\mathcal{T}$ replaced by $\mathcal{T}^{\delta}$ and $S$ replaced by $S^{\delta}$. We choose to consider the average on $S^{\delta}$ to approximate the average density so that the notation in formulas like (4.10) does not change. One may divide each of $[-1,-\delta)$ and $(\delta,1]$ into $n/2$ subintervals and perform the ROM. The weights then become $|S_{\ell}|/|S^{\delta}|$. Correspondingly, the formula (4.11) does not change either, and one just uses the new weights. From a practical viewpoint, this truncated system makes sense since the numerical velocities never touch $\mu=0$. For notational convenience, we will drop $\delta$, and understand $\mathcal{T}$ and $S$ to be the truncated ones.
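The construction on the truncated velocity space can be sketched in a few lines. This is an illustration under stated assumptions rather than the paper's implementation: uniform velocity cells, one ordinate drawn uniformly in each cell of $(\delta,1]$ and mirrored to $[-1,-\delta)$ so that $\mu_{n+1-\ell}=-\mu_\ell$, constant $\sigma_T$ and $\sigma_r$, zero inflow, and a piecewise-constant $\phi$ in space. All function names and parameter values are hypothetical. The reference operator uses a Gauss-Legendre rule on $S^\delta$.

```python
import numpy as np

def transport_matrix(mu, x, h, sigma_t, sigma_r):
    """A_mu on piecewise-constant functions over uniform cells (constant coefficients)."""
    a = abs(mu)
    d = np.sign(mu) * (x[:, None] - x[None, :])  # upstream distance from x_i to cell j
    lo = np.clip(d - h / 2, 0.0, None)
    hi = np.clip(d + h / 2, 0.0, None)
    return (sigma_r / sigma_t) * (np.exp(-sigma_t * lo / a) - np.exp(-sigma_t * hi / a))

def averaged_operator(mus, weights, x, h, sigma_t, sigma_r):
    """Sum of omega_l * A_{mu_l}, i.e. (1/n) * sum of alpha_l * A_{mu_l}."""
    return sum(w * transport_matrix(m, x, h, sigma_t, sigma_r)
               for m, w in zip(mus, weights))

def sample_ordinates(n, delta, rng):
    """One random ordinate per cell of (delta, 1], mirrored to the negative side."""
    m = n // 2
    edges = np.linspace(delta, 1.0, m + 1)
    mu_pos = rng.uniform(edges[:-1], edges[1:])
    mus = np.concatenate([-mu_pos[::-1], mu_pos])  # mus[n-1-i] == -mus[i]
    cell = np.diff(edges)
    omega = np.concatenate([cell[::-1], cell]) / (2.0 * (1.0 - delta))  # |S_l| / |S^delta|
    return mus, omega

def density(T, q_over_sr, lam):
    """Solve (I - lam T) phi = T(q / sigma_r); zero inflow, so b_mu = 0."""
    return np.linalg.solve(np.eye(T.shape[0]) - lam * T, T @ q_over_sr)

# Illustrative parameters (assumptions, not taken from the paper).
n_x, delta, lam, sigma_t, sigma_r = 40, 0.1, 1.0, 1.0, 0.5
h = 1.0 / n_x
x = h * (np.arange(n_x) + 0.5)
q_over_sr = np.ones(n_x) / sigma_r

# Reference: Gauss-Legendre nodes mapped to (delta, 1] and mirrored; weights sum to 1.
t, w = np.polynomial.legendre.leggauss(64)
mu_ref = delta + (1.0 - delta) * (t + 1.0) / 2.0
T_ref = averaged_operator(np.concatenate([-mu_ref, mu_ref]),
                          np.concatenate([w, w]) / 4.0, x, h, sigma_t, sigma_r)
phi_ref = density(T_ref, q_over_sr, lam)

rng = np.random.default_rng(0)
mean_error = {}
for n in (8, 32):
    errs = []
    for _ in range(20):  # 20 independent ROM runs
        mus, omega = sample_ordinates(n, delta, rng)
        T_xi = averaged_operator(mus, omega, x, h, sigma_t, sigma_r)
        phi_xi = density(T_xi, q_over_sr, lam)
        # Discrete L^2(sigma_T) norm of the single-run error.
        errs.append(np.sqrt(h * sigma_t * np.sum((phi_xi - phi_ref) ** 2)))
    mean_error[n] = float(np.mean(errs))
print(mean_error)
```

Because the same spatial discretization is used for the reference and for each ROM run, the reported error isolates the effect of the random velocity quadrature.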

The main convergence result for ROM is the following.

Theorem 4.1 (main result).

Consider the truncated systems and suppose that the rescaled weights $\alpha_{\ell}$ are uniformly bounded. Then, there exists $n_{0}>0$ such that for $n>n_{0}$, the expected single run error satisfies

\[
\mathbb{E}\|\phi^{\xi}-\phi\|\leq Cn^{-3/2}(\log n)^{1/2},
\]

and the bias satisfies

\[
\|\mathbb{E}\phi^{\xi}-\phi\|\leq C\lambda n^{-3}\log n.
\]

Here, the norm used is the $L^{2}(\sigma_{T})$ norm induced by the inner product (4.8).

Proof 4.2 (Proof of Theorem 4.1).

Let $\mathcal{T}_{\ell}^{\xi}=\alpha_{\ell}\mathcal{A}_{\mu_{\ell}}$. Then

\[
\mathcal{T}_{\ell}=\mathbb{E}\mathcal{T}_{\ell}^{\xi}=\alpha_{\ell}\fint_{S_{\ell}}\mathcal{A}_{\mu}\,d\mu=n\omega_{\ell}\frac{1}{|S_{\ell}|}\int_{S_{\ell}}\mathcal{A}_{\mu}\,d\mu=\frac{n}{|S|}\int_{S_{\ell}}\mathcal{A}_{\mu}\,d\mu.
\]

By the definitions of $\mathcal{T}_{\ell}$, $\mathcal{T}_{\ell}^{\xi}$ in (4.9) and (4.12) and the symmetry of the ordinates, we can then write

\[
\delta\mathcal{T}^{\xi}:=\mathcal{T}^{\xi}-\mathcal{T}=\frac{1}{m}\sum_{\ell=1}^{m}\frac{1}{2}\left(\mathcal{T}_{\ell}^{\xi}-\mathcal{T}_{\ell}+\mathcal{T}_{n+1-\ell}^{\xi}-\mathcal{T}_{n+1-\ell}\right)=:\frac{1}{m}\sum_{\ell=1}^{m}\delta\mathcal{T}_{\ell}^{\xi}.
\]

Here, $\mathcal{T}_{\ell}^{\xi}$ and $\mathcal{T}_{n+1-\ell}^{\xi}$ ($\ell=1,\cdots,m$) are not independent, which is why we group them together. Since $\mathbb{E}\mathcal{T}_{\ell}^{\xi}=\mathcal{T}_{\ell}$ ($\ell=1,\cdots,n$), one has

\[
\mathbb{E}\delta\mathcal{T}_{\ell}^{\xi}=0\quad\text{for $\ell=1,\cdots,m$,}\qquad\mathbb{E}\delta\mathcal{T}^{\xi}=0. \tag{4.13}
\]

Comparing (4.10) and (4.11), we denote

\[
b(x)=\fint_{S}b_{\mu}(x)\,d\mu,\quad\delta b(x):=\frac{1}{n}\sum_{\ell}\alpha_{\ell}b_{\mu_{\ell}}(x)-b(x),
\]

and

\[
\mathbb{E}\delta b=\frac{1}{n}\sum_{\ell}\alpha_{\ell}\mathbb{E}b_{\mu_{\ell}}(x)-\fint_{S}b_{\mu}(x)\,d\mu=\frac{1}{|S|}\sum_{\ell}\int_{S_{\ell}}b_{\mu_{\ell}}(x)\,d\mu_{\ell}-\fint_{S}b_{\mu}(x)\,d\mu=0. \tag{4.14}
\]

Here, $b_{\mu_{\ell}}$ and $b_{\mu_{n+1-\ell}}$ are not independent either. In the estimate of $\mathbb{E}\|\delta b\|$, one needs to group them together as well. One can refer to the supplementary material for the details.

Below, the norm for functions will be the $L^{2}(\sigma_{T})$ norm and the norm for operators will be the operator norm from $L^{2}(\sigma_{T})$ to $L^{2}(\sigma_{T})$. According to (4.10) and (4.11), the expected single run error is then controlled as follows:

\[
\begin{split}
\mathbb{E}\|\phi^{\xi}-\phi\|&\leq\sum_{p=0}^{\infty}\lambda^{p}\sum_{k=1}^{p+1}\binom{p+1}{k}\cdot\|\mathcal{T}\|^{p+1-k}\cdot\|q/\sigma_{r}\|\cdot\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\\
&+\sum_{p=1}^{\infty}\lambda^{p}\sum_{k=1}^{p}\binom{p}{k}\cdot\|\mathcal{T}\|^{p-k}\cdot\|b(x)\|\cdot\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\\
&+\sum_{p=0}^{\infty}\lambda^{p}\sum_{k=0}^{p}\binom{p}{k}\cdot\|\mathcal{T}\|^{p-k}\cdot\mathbb{E}\|\delta b(x)\|\cdot\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\\
&=:E_{1}+E_{2}+E_{3}.
\end{split}
\]

The bias can be controlled similarly.

\begin{align*}
\|\mathbb{E}\phi^{\xi}-\phi\|&\leq\sum_{p=1}^{\infty}\lambda^{p}\sum_{k=2}^{p+1}\binom{p+1}{k}\cdot\|\mathcal{T}\|^{p+1-k}\cdot\|q/\sigma_{r}\|\cdot\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\\
&+\sum_{p=2}^{\infty}\lambda^{p}\sum_{k=2}^{p}\binom{p}{k}\cdot\|\mathcal{T}\|^{p-k}\cdot\|b(x)\|\cdot\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\\
&+\sum_{p=1}^{\infty}\lambda^{p}\sum_{k=1}^{p}\binom{p}{k}\cdot\|\mathcal{T}\|^{p-k}\cdot\mathbb{E}\|\delta b(x)\|\cdot\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\\
&=:B_{1}+B_{2}+B_{3}.
\end{align*}

Here, the bias differs from the expected single run error in that the terms involving a single $\delta\mathcal{T}^{\xi}$ or $\delta b$ vanish under expectation. The summation index $p$ starts from 1 in $B_{1}$ and from 2 in $B_{2}$, and the inner index $k$ starts from $k=2$. This is because, from (4.13), $\mathbb{E}(\mathcal{T}+\delta\mathcal{T}^{\xi})=\mathcal{T}$. For $B_{3}$, the index $p$ starts from $1$ and $k$ from $1$, because $\mathbb{E}\delta b=0$ by (4.14).

Therefore, we need to control $\|\mathcal{T}\|$, $\|\mathcal{T}^{\xi}\|$, $\mathbb{E}\|\delta b(x)\|$ and $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{p}$ for $p>0$ to estimate the error and bias. We can establish the following estimates.

  • Lemma SM1.1 shows that $\|\mathcal{T}\|\leq 1$, $\|\mathcal{T}^{\xi}\|\leq 1$ and $\sup_{\xi}\|\delta\mathcal{T}^{\xi}\|\leq Cn^{-1}$.

  • Corollary SM1.8 shows that $\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2})\leq C(|\log n|+1)n^{-3}$.

  • Lemma SM1.9 proves that $\mathbb{E}\|\delta b\|\leq C\sqrt{n^{-3}|\log n|}$.

The estimate for $\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2})$ is the most difficult one, as it involves the concentration of norms for random operators in Hilbert spaces. It is established by a type of Rosenthal inequality. The other estimates are relatively straightforward. See the Supplementary material for the detailed proofs.

Using the estimates above, one then has

\[ E_{1}\leq\|q/\sigma_{r}\|\left(\mathbb{E}\|\delta\mathcal{T}^{\xi}\|+\sum_{p=2}^{\infty}\lambda^{p-1}\sum_{k=1}^{p}{p\choose k}\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\right), \]

and

\[ \begin{aligned} \sum_{p=2}^{\infty}\lambda^{p-1}\sum_{k=1}^{p}{p\choose k}\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\leq{}&\sum_{p=2}^{\infty}\lambda^{p-1}\Bigg(p\,\mathbb{E}\|\delta\mathcal{T}^{\xi}\|\\ &+\frac{p(p-1)}{2}\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}+\sum_{k=3}^{p}{p\choose k}\sup_{\xi}\|\delta\mathcal{T}^{\xi}\|^{k-2}\,\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}\Bigg). \end{aligned} \]

Since the series $\sum_{p=2}^{\infty}\lambda^{p-1}p$ and $\sum_{p=2}^{\infty}\lambda^{p-1}p^{2}$ converge, one then concludes that

\[ E_{1}\lesssim\mathbb{E}\|\delta\mathcal{T}^{\xi}\|+\left(1+\sum_{p=3}^{\infty}\lambda^{p-1}\sum_{k=3}^{p}{p\choose k}(C/n)^{k-2}\right)\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}. \]

Since ${p\choose k}=\frac{p(p-1)}{k(k-1)}{p-2\choose k-2}<p(p-1){p-2\choose k-2}$, one has $\sum_{k=3}^{p}{p\choose k}(C/n)^{k-2}<p(p-1)(1+\frac{C}{n})^{p-2}$. When $n$ is large enough that $\lambda(1+\frac{C}{n})<1$, the series $\sum_{p=3}^{\infty}p(p-1)\left(\lambda(1+\frac{C}{n})\right)^{p}$ converges, and the series in front of $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}$ is controlled by a constant independent of $n$.
The estimate of $E_{1}$ then follows from the estimate of $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}$ and the Hölder inequality. The estimates for $E_{2}$ and $E_{3}$ are similar to that for $E_{1}$, and we omit the details. The estimate for the expected single-run error then follows.
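The binomial bound used above can be checked numerically. The following Python sketch is only an illustration: the function names and the sample values of $p$ and $x=C/n$ are assumptions made here, not part of the analysis.

```python
from math import comb


def binomial_tail(p, x):
    """Left-hand side: sum_{k=3}^{p} C(p, k) * x^(k-2)."""
    return sum(comb(p, k) * x ** (k - 2) for k in range(3, p + 1))


def binomial_bound(p, x):
    """Right-hand side: p (p-1) (1 + x)^(p-2)."""
    return p * (p - 1) * (1.0 + x) ** (p - 2)


def bound_holds(p, x):
    """True when the tail is strictly below the bound, with x playing the role of C/n."""
    return binomial_tail(p, x) < binomial_bound(p, x)


if __name__ == "__main__":
    # Illustrative values only; x = C/n shrinks as n grows.
    for p in (3, 5, 10, 20):
        for x in (0.01, 0.1, 1.0):
            print(p, x, bound_holds(p, x))
```

The check reflects the identity $\sum_{k=3}^{p}{p-2\choose k-2}x^{k-2}=(1+x)^{p-2}-1$, so the bound holds for every $p\geq 3$ and $x>0$.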

Next, we consider the bias. The estimates for the three terms are similar, and we take the one for $B_{1}$ as the example. By the definition of $B_{1}$, one has

\[ \begin{split} B_{1}&\leq\|q/\sigma_{r}\|\left(\lambda\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}+\sum_{p=2}^{\infty}\lambda^{p}\sum_{k=2}^{p+1}{p+1\choose k}\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\right)\\ &\leq\|q/\sigma_{r}\|\lambda\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}\left(1+\sum_{p=2}^{\infty}\lambda^{p-1}\sum_{k=2}^{p+1}{p+1\choose k}(C/n)^{k-2}\right). \end{split} \]

For the first inequality above, we have used the simple bound $\|\mathcal{T}\|\leq 1$. For the second inequality, we have used the fact that $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{k}\leq(C/n)^{k-2}\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}$, due to $\sup_{\xi}\|\delta\mathcal{T}^{\xi}\|\leq Cn^{-1}$. Then, we repeat the argument above for $B_{1}$. The $k=2$ terms are fine due to the convergence of $\sum_{p=2}^{\infty}\lambda^{p-1}p^{2}$.
The $k\geq 3$ terms are handled exactly as above, where one can simply control ${p\choose k}=\frac{p(p-1)}{k(k-1)}{p-2\choose k-2}<p(p-1){p-2\choose k-2}$. Hence, $B_{1}$ is bounded by $\lambda\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}$ multiplied by a constant that is independent of $n$.

In $B_{3}$, when $k=1$, one may bound $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|\leq\sqrt{\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}}$. Besides this, the remaining estimates for $B_{2}$ and $B_{3}$ are the same as for $B_{1}$. We omit the details.

4.3 Formal results for higher dimensional case

One can generalize the analysis to higher dimensions. The orders of error and bias depend on the size of $\mathbb{E}(\|\delta b\|^{2})$ and $\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2})$. If one divides the region $S$ into $n$ cells and the random ordinates are independent (due to symmetry, strictly there are $n/2^{d}$ independent ordinates, but this makes no significant difference), then $\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2})$ behaves like $n^{-1}\max_{\ell}\mathrm{Var}(\delta\mathcal{T}_{\ell})$, where $\delta\mathcal{T}_{\ell}=\omega_{\ell}\mathcal{A}_{\mu_{\ell}}-\frac{1}{|S|}\int_{S_{\ell}}\mathcal{A}_{\mu}d\mu$.
The variance on $S_{\ell}$ behaves like $D(S_{\ell})^{2}$, where $D(S_{\ell})$ denotes the diameter of the $\ell$-th cell; for $n$ cells of comparable size in $d$ dimensions, $D(S_{\ell})\sim n^{-1/d}$. A similar analysis holds for $\mathbb{E}(\|\delta b\|^{2})$. One can refer to the supplementary material for the details in the 1D case. Consequently, the bias scales like

\[ \|\phi-\mathbb{E}\phi^{\xi}\|\sim\frac{\max_{\ell}D(S_{\ell})^{2}}{n}\sim\frac{1}{n^{1+2/d}}. \]

The error scales like

\[ \mathbb{E}\|\phi-\phi^{\xi}\|\sim\sqrt{\frac{\max_{\ell}D(S_{\ell})^{2}}{n}}\sim\frac{1}{n^{1/2+1/d}}. \]

For $d=2$, a typical order of error for ROM is then $1$, and the bias scales like $O(n^{-2})$, so its order is $2$. The order of bias also improves compared to DOM. Hence, ROM could improve accuracy.
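As a quick check of the formal scalings $n^{-(1/2+1/d)}$ for the error and $n^{-(1+2/d)}$ for the bias, the following Python sketch tabulates the two exponents for a given dimension. The function name `rom_formal_orders` is hypothetical and introduced here only for illustration.

```python
from fractions import Fraction


def rom_formal_orders(d):
    """Return (error order, bias order) in n for dimension d.

    Uses the formal scalings error ~ n^-(1/2 + 1/d) and bias ~ n^-(1 + 2/d).
    """
    if d < 1:
        raise ValueError("d must be a positive integer")
    error_order = Fraction(1, 2) + Fraction(1, d)
    bias_order = 1 + Fraction(2, d)
    return error_order, bias_order


if __name__ == "__main__":
    for d in (1, 2, 3):
        err, bias = rom_formal_orders(d)
        print(f"d={d}: error order {err}, bias order {bias}")
```

For $d=1$ the exponents are $3/2$ and $3$, which agrees with the $n^{-3}$ rate of $\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2})$ in the slab case up to the logarithmic factor.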

For the X-Y geometry (i.e., $d=2$), if the required accuracy for the average density $\phi(x,y)$ is $\epsilon$, one has to use $O(\epsilon^{-1})$ ordinates in DOM. Since all ordinates are coupled together in the RTE simulations, the computational complexity in the velocity variable is $O(\epsilon^{-2})$ for the standard source iteration method with an anisotropic scattering kernel. If one solves the large coupled linear system directly, the computational complexity is usually higher than $O(\epsilon^{-2})$. On the other hand, the bias $\mathcal{B}$ of ROM is of order $2$. If we repeat the simulation $t$ times, the error of the Monte Carlo approximation is $O(\sqrt{\mathrm{Var}/t+\mathcal{B}^{2}})$. The variance can be controlled by the square of the error, which is of order $1$. Hence, the number of ordinates for each run could be $O(\epsilon^{-\frac{1}{2}})$ so that $\mathcal{B}=O(\epsilon)$ and $\mathrm{Var}=O(\epsilon)$. Then, taking $t=O(\epsilon^{-1})$ will make the average error $O(\epsilon)$.
Since the complexity for each run is $O([\epsilon^{-\frac{1}{2}}]^{2})=O(\epsilon^{-1})$, the total complexity is thus $O(t\epsilon^{-1})=O(\epsilon^{-2})$. The total computational costs of DOM and ROM are therefore comparable. If one chooses to use $O(\epsilon^{-1})$ ordinates for ROM, then a typical single run could also yield an $O(\epsilon)$ error, but it suffers from the ray effect as well. Hence, choosing $O(\epsilon^{-1/2})$ ordinates and repeating $t$ times could result in the same error, but the ray effect could be mitigated since the total number of velocity directions is $O(\epsilon^{-3/2})$. Moreover, ROM is easy to parallelize.
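The cost comparison above can be made concrete with a small Python sketch. All hidden constants are set to $1$, which is an assumption made purely for illustration, and the function names `dom_cost` and `rom_cost` are hypothetical.

```python
import math


def dom_cost(eps):
    """DOM in X-Y geometry: O(1/eps) ordinates, cost quadratic in the ordinates."""
    n = math.ceil(1.0 / eps)
    return {"ordinates": n, "cost": n * n}


def rom_cost(eps):
    """ROM in X-Y geometry: O(eps^-1/2) ordinates per run, O(1/eps) runs."""
    n = math.ceil(math.sqrt(1.0 / eps))  # ordinates per run, so that bias ~ n^-2 ~ eps
    t = math.ceil(1.0 / eps)             # runs, so that Var / t ~ eps^2
    return {
        "ordinates_per_run": n,
        "runs": t,
        "total_directions": n * t,       # ~ eps^-3/2 distinct velocity directions
        "cost": t * n * n,               # ~ eps^-2, comparable to DOM
    }


if __name__ == "__main__":
    for eps in (1.0 / 16, 1.0 / 64, 1.0 / 256):
        print(eps, dom_cost(eps), rom_cost(eps))
```

Under these assumptions the two costs coincide, while ROM visits more distinct directions than a single DOM run.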

5 Numerical experiments

The numerical performance of the ROM is presented in this section. The numerical convergence orders of errors and bias in both slab and X-Y geometries are displayed, while its ability to mitigate the ray effect is demonstrated through a lattice problem.

5.1 The slab geometry

To apply ROM in the slab geometry, we equally divide the velocity interval $[-1,1]$ into $n$ cells. The $\ell$th cell is denoted by $S_{\ell}=[-1+\frac{2\ell-2}{n},-1+\frac{2\ell}{n}]$, where $\ell=1,2,\cdots,n$. We consider only even $n$, and when $S_{\ell}\subset[-1,0]$, one ordinate $\mu_{\ell}$ is sampled randomly from the cell $S_{\ell}$ with uniform probability. When $S_{\ell}\subset[0,1]$, $\mu_{\ell}=-\mu_{n+1-\ell}$. The weights are $\omega_{\ell}=2/n$ for all $\mu_{\ell}$.
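The sampling rule just described can be sketched in Python as follows. This is a minimal illustration using only the standard library; the function name `sample_slab_ordinates` and the `seed` parameter are assumptions introduced here, not part of the original implementation.

```python
import random


def sample_slab_ordinates(n, seed=None):
    """Sample n random ordinates on [-1, 1] with uniform weights 2/n.

    One ordinate is drawn uniformly in each cell contained in [-1, 0];
    the ordinates in [0, 1] follow from mu_{n+1-l} = -mu_l.
    """
    if n <= 0 or n % 2 != 0:
        raise ValueError("n must be a positive even integer")
    rng = random.Random(seed)
    h = 2.0 / n                      # cell width
    mu = [0.0] * n
    for ell in range(1, n // 2 + 1):
        left = -1.0 + (ell - 1) * h  # left end of the cell S_ell
        mu[ell - 1] = rng.uniform(left, left + h)
        mu[n - ell] = -mu[ell - 1]   # mirrored ordinate in cell n + 1 - ell
    weights = [h] * n
    return mu, weights


if __name__ == "__main__":
    mu, weights = sample_slab_ordinates(8, seed=0)
    print(mu)
    print(weights)
```

Each call with a different seed gives one realization of the ordinate set; independent realizations can be generated and simulated in parallel.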

All settings are the same as in Example 2.2 in Section 2. We use the same spatial discretization and the same number of spatial grid cells, $I=50$, as in Example 2.2. The reference solution $\phi^{ref}$ is obtained by DOM using 2560 ordinates with uniform quadrature. We define the error between the reference solution and the numerical solutions of ROM with $I$ spatial cells by

\[ \mathcal{E}=\mathbb{E}\|\phi^{\xi}(x)-\phi^{ref}(x)\|_{2}=\mathbb{E}\left(\frac{1}{I+1}\sum_{i=0}^{I}|\phi^{\xi}(x_{i})-\phi^{ref}(x_{i})|^{2}\right)^{\frac{1}{2}}, \tag{5.1} \]

where $\phi^{\xi}(x_{i})=\sum_{m=1}^{n}\omega_{m}\psi^{\xi}_{m}(x_{i})$. The bias of ROM is defined by

\[ \mathcal{B}=\|\mathbb{E}\phi^{\xi}(x)-\phi^{ref}(x)\|_{2}=\left(\frac{1}{I+1}\sum_{i=0}^{I}|\mathbb{E}\phi^{\xi}(x_{i})-\phi^{ref}(x_{i})|^{2}\right)^{\frac{1}{2}}. \tag{5.2} \]
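In practice the expectations in (5.1) and (5.2) are replaced by averages over $t$ sampled simulations. The following Python sketch shows one way to form these estimates; the function names and the data layout (a list of $t$ sample vectors, each holding $\phi^{\xi}$ at the $I+1$ grid points) are assumptions made for illustration.

```python
import math


def rms(values):
    """Discrete l2 norm: sqrt of the mean of squares over the grid points."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def error_and_bias(samples, phi_ref):
    """Estimate the error (5.1) and the bias (5.2) from t sampled solutions.

    samples: list of t lists, each holding phi^xi at the grid points.
    phi_ref: reference solution at the same grid points.
    """
    t = len(samples)
    # Error: average over the samples of the norm of the difference.
    error = sum(
        rms([s - r for s, r in zip(sample, phi_ref)]) for sample in samples
    ) / t
    # Bias: norm of the difference between the sample mean and the reference.
    mean = [sum(sample[i] for sample in samples) / t for i in range(len(phi_ref))]
    bias = rms([m - r for m, r in zip(mean, phi_ref)])
    return error, bias


if __name__ == "__main__":
    # Toy data: two samples that are symmetric about the reference.
    print(error_and_bias([[1.0, 1.0], [-1.0, -1.0]], [0.0, 0.0]))
```

The toy data illustrate the difference between the two quantities: fluctuations that cancel in the mean contribute to the error but not to the bias.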
Figure 5: The convergence orders of ROM in velocity in slab geometry. Here $\Delta\mu=\frac{2}{n}$, $n=2,4,8,16$. (a): the errors $\mathcal{E}$ defined in (5.1) for different cases in (2.5)-(2.7); (b): the bias $\mathcal{B}$ defined in (5.2) for different cases in (2.5)-(2.7). $t$ is the number of sampled simulations in ROM.
Table 2: Error, bias and slope of convergence curves in Figure 5 for $t=20480$.
\begin{tabular}{ll|cccc|c}
\hline
 & $\Delta\mu$ & 1/8 & 1/4 & 1/2 & 1 & Order \\
\hline
error ($\mathcal{E}$) & Case 1 & 1.83E-02 & 4.99E-02 & 1.33E-01 & 3.10E-01 & 1.37 \\
 & Case 2 & 9.37E-03 & 2.30E-02 & 5.76E-02 & 1.46E-01 & 1.32 \\
 & Case 3 & 2.86E-02 & 5.98E-02 & 1.20E-01 & 2.51E-01 & 1.04 \\
\hline
bias ($\mathcal{B}$) & Case 1 & 9.18E-05 & 6.51E-04 & 2.72E-03 & 2.17E-02 & 2.57 \\
 & Case 2 & 4.48E-05 & 6.00E-04 & 2.73E-03 & 2.31E-02 & 2.92 \\
 & Case 3 & 1.07E-04 & 5.63E-04 & 2.43E-03 & 1.58E-02 & 2.37 \\
\hline
\end{tabular}

Figure 5 displays the convergence orders of ROM in the slab geometry. The results of different numbers of sampled simulations are shown. To obtain the convergence order of the error, a smaller number of samples is enough. As observed from Figure 5, when the number of samples increases, the convergence order of bias will gradually increase and eventually stabilize near the theoretical value. For different cases as in Example 2.2 in Section 2, Table 2 displays the error and bias using different mesh sizes when there are enough samples.

When the inflow boundary conditions are smooth and regular as in Case 1 in (2.5), the convergence order of DOM is $2$, while the convergence orders of the error and bias of ROM are $1.37$ and $2.57$, respectively. Allowing for stochastic noise, this can be considered consistent with the analysis. When the boundary conditions are continuous but nondifferentiable as in Case 2 in (2.6), the convergence order of DOM decreases to $1.5$, while the convergence orders of the error and bias of ROM remain $1.32$ and $2.92$, respectively. Moreover, if the boundary conditions are discontinuous at some points as in Case 3 in (2.7), the convergence order of DOM decreases to $1$. The convergence order of the error of ROM decreases to about $1$, while the bias has a convergence order of $2.37$. In summary, the errors of ROM converge no slower than DOM, and the bias converges faster, especially when the solution regularity is low in the velocity variable.

5.2 The X-Y geometry case

In X-Y geometry, we show the ordinates in the first quadrant and consider a uniform partition in the $(\zeta,\theta)$ plane. The intervals $\zeta\in[0,1]$ and $\theta\in[0,\pi/2]$ are equally divided, and the nodes are denoted by $(\zeta_{i},\theta_{j})=(\frac{N-i+1}{N},\frac{j-1}{2N}\pi)$ for $i,j=1,\cdots,N+1$. Each quadrant has $m=N^{2}$ cells. For any $\ell=(i-1)N+j$, $i,j=1,\cdots,N$,

\[ S_\ell=\{(\zeta,\theta)\,|\,\zeta_{i+1}\leq\zeta\leq\zeta_i,\ \theta_j\leq\theta\leq\theta_{j+1}\}. \]

Randomly sample $\zeta^\xi_i\in[\zeta_{i+1},\zeta_i]$ and $\theta^\xi_j\in[\theta_j,\theta_{j+1}]$ with uniform probability. The ordinates in the other quadrants can be obtained by symmetry as in (3.5). The weights are chosen to be $\bar{\omega}_\ell=\frac{1}{4N^2}$. The ordinates projected to the 2D unit disk are

\[ (c_\ell,s_\ell,\bar{\omega}_\ell)=\Big(\big(1-(\zeta^\xi_i)^2\big)^{\frac{1}{2}}\cos\theta_j^\xi,\ \big(1-(\zeta^\xi_i)^2\big)^{\frac{1}{2}}\sin\theta_j^\xi,\ \frac{1}{4N^2}\Big). \]

Taking $N=3$ as an example, the partition and one sample of the chosen discrete ordinates are plotted in Figure 6.
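The sampling procedure above can be sketched in a few lines. The code below is a minimal illustration under two assumptions that are not confirmed by this section: the samples $\zeta^\xi_i$ and $\theta^\xi_j$ are drawn once per index and combined as a tensor product (a literal reading of the notation), and the symmetry in (3.5) is taken to be reflection of $(c_\ell,s_\ell)$ across the coordinate axes. The function name `sample_rom_ordinates_xy` is hypothetical.

```python
import numpy as np

def sample_rom_ordinates_xy(N, rng):
    """Draw one random ordinate set (c, s, w) for an N x N partition per quadrant."""
    idx = np.arange(1, N + 2)
    zeta = (N - idx + 1) / N              # zeta_i, decreasing from 1 to 0
    theta = (idx - 1) * np.pi / (2 * N)   # theta_j, increasing from 0 to pi/2
    # One uniform sample in [zeta_{i+1}, zeta_i] and in [theta_j, theta_{j+1}].
    zeta_xi = rng.uniform(zeta[1:], zeta[:-1])
    theta_xi = rng.uniform(theta[:-1], theta[1:])
    # Tensor product; flattened index matches l = (i - 1) * N + j (1-based).
    Z, T = np.meshgrid(zeta_xi, theta_xi, indexing="ij")
    r = np.sqrt(1.0 - Z**2)
    c = (r * np.cos(T)).ravel()
    s = (r * np.sin(T)).ravel()
    # Assumed symmetry: reflect the first-quadrant ordinates into the other three.
    signs = [(1, 1), (-1, 1), (-1, -1), (1, -1)]
    c_all = np.concatenate([sx * c for sx, _ in signs])
    s_all = np.concatenate([sy * s for _, sy in signs])
    w_all = np.full(c_all.size, 1.0 / (4 * N * N))  # equal weights summing to 1
    return c_all, s_all, w_all

if __name__ == "__main__":
    c, s, w = sample_rom_ordinates_xy(3, np.random.default_rng(0))
    print(c.size, w.sum())
```

Each call with a fresh random state gives an independent ordinate set that can be passed to an existing DOM solver in place of the fixed quadrature.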

Figure 6: Schematic diagram of the ordinates selected by ROM on the surface of the 3D unit sphere and their corresponding projection onto the 2D unit disk.

5.2.1 Convergence order

Similar to section 2.2.2, we use the diamond difference method to discretize the spatial variable. The unknowns at the cell centers are calculated, and the notations are the same as in section 2.2.2.

We use the same setup and spatial grid as Example 2.1 in section 2. The numerical error $\mathcal{E}$ and bias $\mathcal{B}$ are defined similarly to (5.1) and (5.2), except that the $\ell^2$ norm is given as in (2.8). The reference solution $\phi^{ref}$ is computed with 80400 ordinates using Gaussian quadrature.
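For concreteness, the two quantities can be estimated from a batch of sampled simulations as sketched below. Since (5.1), (5.2) and (2.8) are not reproduced in this section, the formulas used here are an assumed reading: the error is taken as the root mean square of $\|\phi^\xi-\phi^{ref}\|$ over the samples, the bias as $\|\mathbb{E}\phi^\xi-\phi^{ref}\|$ with the expectation replaced by the sample mean, and the norm as a cell-area-weighted discrete $\ell^2$ norm. The names `error_and_bias` and `cell_area` are hypothetical.

```python
import numpy as np

def error_and_bias(phi_samples, phi_ref, cell_area):
    """Estimate error and bias from sampled densities.

    phi_samples: array of shape (n_samples, nx, ny), one density per simulation.
    phi_ref:     array of shape (nx, ny), the reference density.
    cell_area:   area of one spatial cell (assumed uniform grid).
    """
    diffs = phi_samples - phi_ref
    # Squared weighted l2 norm of each sample's deviation from the reference.
    sq_norms = np.sum(cell_area * diffs**2, axis=(1, 2))
    error = np.sqrt(sq_norms.mean())
    # Norm of the deviation of the sample mean from the reference.
    mean_diff = phi_samples.mean(axis=0) - phi_ref
    bias = np.sqrt(np.sum(cell_area * mean_diff**2))
    return error, bias

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    ref = np.ones((4, 4))
    samples = ref + 0.1 * rng.standard_normal((50, 4, 4))
    print(error_and_bias(samples, ref, cell_area=1.0 / 16))
```

With these definitions the bias never exceeds the error, since the norm of an average is at most the root mean square of the norms.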

Figure 7 and Table 3 show the convergence order of the error and bias. We can observe that the convergence order of the error is between $0.5$ and $1$, while the convergence order of the bias is between $1$ and $2$, for both isotropic and anisotropic scattering kernels. The slight difference between the theoretical values and the observed orders is due to the non-smoothness of the solution.

Figure 8 demonstrates the ability of ROM to mitigate the ray effect. The expectation of the average density calculated by ROM is displayed for different numbers of ordinates and different numbers of sampled simulations. We can observe that the ray effect is effectively mitigated after running multiple simulations in parallel and taking the expectation, even when each sampled simulation uses a very small number of ordinates. The results with anisotropic scattering are similar.

Figure 7: The convergence orders of ROM with different velocity meshes in X-Y geometry. Here $\Delta S=\frac{\pi}{4N^2}$, $N=1,2,3,4$. (a) The error ($\mathcal{E}$) for isotropic and anisotropic scattering kernels; (b) the bias ($\mathcal{B}$) for isotropic and anisotropic scattering kernels. $S$ is the number of sampled simulations in ROM.
Table 3: Error, bias and slope of convergence curves in Figure 7 for $t=10240$.
\begin{tabular}{llccccc}
\hline
 & $\Delta S$ & $\pi/64$ & $\pi/36$ & $\pi/16$ & $\pi/4$ & Order \\
\hline
error ($\mathcal{E}$) & iso: $g=0$ & 8.09E-03 & 1.25E-02 & 2.26E-02 & 6.28E-02 & 0.74 \\
 & aniso: $g=0.9$ & 8.18E-03 & 1.26E-02 & 2.28E-02 & 6.36E-02 & 0.74 \\
bias ($\mathcal{B}$) & iso: $g=0$ & 1.12E-04 & 2.62E-04 & 6.44E-04 & 6.22E-03 & 1.44 \\
 & aniso: $g=0.9$ & 1.09E-04 & 2.01E-04 & 3.95E-04 & 5.42E-03 & 1.41 \\
\hline
\end{tabular}
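The orders in the last column can be cross-checked from the tabulated values. The sketch below fits a straight line to the isotropic rows in log-log scale. How the slopes were actually obtained is not stated here, so the least-squares fit is an assumption, and `convergence_order` is a hypothetical helper.

```python
import numpy as np

def convergence_order(delta_s, values):
    """Least-squares slope of log(values) against log(delta_s)."""
    slope, _intercept = np.polyfit(np.log(delta_s), np.log(values), 1)
    return slope

# Isotropic rows of Table 3, ordered as in the table (N = 4, 3, 2, 1).
delta_s = np.pi / np.array([64.0, 36.0, 16.0, 4.0])
err_iso = np.array([8.09e-3, 1.25e-2, 2.26e-2, 6.28e-2])
bias_iso = np.array([1.12e-4, 2.62e-4, 6.44e-4, 6.22e-3])

order_err = convergence_order(delta_s, err_iso)
order_bias = convergence_order(delta_s, bias_iso)

if __name__ == "__main__":
    print(f"error order: {order_err:.2f}, bias order: {order_bias:.2f}")
```

The fitted slopes land close to the tabulated orders for the isotropic case. Small differences in the last digit are possible if a different fitting procedure was used.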
Figure 8: Example 2.1 (isotropic). The expectation of the average density of ROM with different numbers of sampled simulations and ordinates. Panels (a)-(c): 5 simulations with 4, 16 and 36 ordinates; (d)-(f): 20 simulations with 4, 16 and 36 ordinates; (g)-(i): 50 simulations with 4, 16 and 36 ordinates.

5.2.2 The lattice problem

This example demonstrates that ROM can mitigate the ray effect independently of the problem setup. A benchmark test is the lattice problem in the X-Y plane. The spatial domain is $(x,y)\in[0,1]\times[0,1]$ and the cross sections are $\sigma_T=1$, $\sigma_S=0.5$. The layout of the source term $q(x,y)$ is shown in Figure 9. The number of spatial cells is $50\times 50$, and the diamond difference method is employed for the spatial discretization. In Figure 10, the average densities calculated by DOM with $4$, $16$, $36$, $64$, $100$ and $144$ ordinates are displayed. The ray effect is clearly visible even with 144 ordinates.

The results of ROM are shown in Figure 11, where columns 1 to 4 show the expectation of the average density $\mathbb{E}\phi^\xi$ with 5, 10, 20 and 50 simulations, respectively. As can be seen from the results, the ray effect is no longer visible in the ROM results with 50 sampled simulations of 4 ordinates, 10 sampled simulations of 16 ordinates, or 5 sampled simulations of 36 ordinates.

Figure 9: The source term for the lattice problem. The black area is 1, and the other areas are vacant.
Figure 10: The average density of the discrete ordinate method with uniform quadrature. Panels (a)-(f): 4, 16, 36, 64, 100 and 144 ordinates.
Figure 11: The expectation of the average density of ROM for the lattice problem. Panels (a)-(d): $t=5,10,20,50$ with 4 ordinates; (e)-(h): $t=5,10,20,50$ with 9 ordinates; (i)-(l): $t=5,10,20,50$ with 36 ordinates.

6 Discussion

In slab geometry, the discontinuities in velocity usually occur at fixed velocity directions, so the decrease in convergence order is not as severe as in the spatial 2D case. In the spatial 2D case, by contrast, spatial discontinuities or sharp transitions in the inflow boundary conditions and source terms can induce discontinuities in velocity along the direction of the ray, so the location of the velocity discontinuities depends on the spatial variables. As we can see from Table 1, the commonly used quadrature sets can only achieve a convergence order of around $0.74$, which is the main reason for the observed ray effect in spatial 2D test cases.

We remark that other strategies for the random ordinates are possible. For example, $\mathbf{u}_\ell$ does not have to be chosen with uniform probability from $S_\ell$. One may make use of the transition kernel for importance sampling of $\mathbf{u}_\ell$. If such strategies are adopted, the weights $\omega_\ell$ have to be adjusted correspondingly. One may also consider multiscale total and scattering cross sections, in which case the conservation of mass may become important. In future work, we will investigate the extension of ROM to multiscale cases, such as the diffusion limit or the Fokker-Planck limit.
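As a hedged illustration of how the weights might be adjusted under non-uniform sampling, the sketch below draws one ordinate from a one-dimensional cell $[a,b]$ with a piecewise-constant density and uses the standard importance-sampling weight $1/(|S|\,p(\mu))$, which reduces to $|S_\ell|/|S|$ when $p$ is uniform. This is a generic construction and not a scheme taken from the paper: the density (probability `p_left` of landing in the left half of the cell) is purely illustrative and is not derived from a transition kernel.

```python
import numpy as np

def sample_cell_importance(a, b, p_left, size_S, rng):
    """Sample one ordinate in [a, b] non-uniformly and return (mu, weight).

    With probability p_left the sample is uniform in the left half of the cell,
    otherwise uniform in the right half. The weight 1 / (size_S * density)
    keeps the one-point estimate of (1/|S|) * integral over the cell unbiased.
    """
    mid = 0.5 * (a + b)
    half = 0.5 * (b - a)
    if rng.random() < p_left:
        mu = rng.uniform(a, mid)
        density = p_left / half
    else:
        mu = rng.uniform(mid, b)
        density = (1.0 - p_left) / half
    return mu, 1.0 / (size_S * density)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    draws = [sample_cell_importance(0.0, 0.5, 0.8, 2.0, rng) for _ in range(20000)]
    estimate = np.mean([w * mu for mu, w in draws])
    print(estimate)  # should be close to (1/2) * integral of mu over [0, 0.5]
```

The same idea carries over to cells $S_\ell$ on the sphere, with the density and the cell measure replaced accordingly.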

Acknowledgments

This work is partially supported by the National Key R&D Program of China No. 2020YFA0712000. The work of L. Li was partially supported by Shanghai Municipal Science and Technology Major Project 2021SHZDZX0102, NSFC 12371400 and 12031013, the Strategic Priority Research Program of Chinese Academy of Sciences, Grant No. XDA25010403.

References

Appendix A Supplementary Materials: Estimates for $\mathcal{T}$, $\mathcal{T}^\xi$, $\delta\mathcal{T}^\xi$ and $\delta b$

Recall that $\Omega=[x_L,x_R]$, and that the operator $\mathcal{A}_\mu(\cdot):L^2(\Omega;\sigma_T)\to L^2(\Omega;\sigma_T)$ maps $\phi(x)$ to $\psi(\cdot,\mu)$ by solving

\[ \begin{aligned} &\mu\partial_x\psi+\sigma_T\psi=\sigma_r\phi,\quad \mu\in[-1,1],\\ &\psi(x_L,\mu)=0,\quad \mu>0;\quad \psi(x_R,\mu)=0,\quad \mu<0. \end{aligned} \tag{A.1} \]

Recall that we have divided the velocity space $S=[-1,1]$ into $n=2m$ cells, and that each random ordinate $\mu_\ell$, $1\leq\ell\leq m$, is chosen randomly from the $\ell$th cell in $[-1,0)$. The random ordinates $\mu_{\ell+m}$ in the other half $(0,1]$ are chosen in a symmetric fashion. The weight is $\omega_\ell=|S_\ell|/|S|$ and $\alpha_\ell=n\omega_\ell$. The operator $\mathcal{T}$ is defined as

\[ \mathcal{T}=\fint_S\mathcal{A}_\mu\,d\mu=\frac{1}{2}\int_{-1}^{1}\mathcal{A}_\mu\,d\mu. \]

Let $\mathcal{T}_\ell^\xi=\alpha_\ell\mathcal{A}_{\mu_\ell}$, then

\[ \mathcal{T}_\ell=\mathbb{E}\mathcal{T}_\ell^\xi=\alpha_\ell\fint_{S_\ell}\mathcal{A}_\mu\,d\mu=n\omega_\ell\frac{1}{|S_\ell|}\int_{S_\ell}\mathcal{A}_\mu\,d\mu=\frac{n}{|S|}\int_{S_\ell}\mathcal{A}_\mu\,d\mu. \]

Therefore, from the definition of $\mathcal{T}_\ell$ and $\mathcal{T}_\ell^\xi$,

\[ \delta\mathcal{T}^\xi:=\mathcal{T}^\xi-\mathcal{T}=\frac{1}{m}\sum_{\ell=1}^{m}\frac{1}{2}\big(\mathcal{T}_\ell^\xi-\mathcal{T}_\ell+\mathcal{T}_{n+1-\ell}^\xi-\mathcal{T}_{n+1-\ell}\big)=:\frac{1}{m}\sum_{\ell=1}^{m}\delta\mathcal{T}_\ell^\xi. \tag{A.2} \]

Here, we note that $\delta\mathcal{T}_\ell^\xi$ and $\delta\mathcal{T}_{n+1-\ell}^\xi$ are not independent. This is why we put $\ell$ and $n+1-\ell$ together.

For the truncated system, we consider $\mu\in[-1,-\delta)\cup(\delta,1]=:S^\delta$. The equation is changed to

\[ \mu\partial_x\psi+\sigma_T\psi=\sigma_S\fint_{S^\delta}\psi(x,\mu)\,d\mu+q. \]

We divide $[-1,-\delta)$ and $(\delta,1]$ into $n/2$ subintervals each and perform the ROM for the approximation. Then, the weight is changed to $|S_\ell|/|S^\delta|$. The operators $\mathcal{T}$, $\mathcal{T}^\xi$, $\mathcal{T}_\ell$, and $\mathcal{T}^\xi_\ell$ are changed accordingly.
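The truncated ordinate set can be sketched as follows. This is a minimal illustration assuming equal-length subintervals (the text does not require this) and assuming that the ordinates in $(\delta,1]$ are mirror images of those in $[-1,-\delta)$, so that $\mu_{n+1-\ell}=-\mu_\ell$. The function name `sample_truncated_ordinates` is hypothetical.

```python
import numpy as np

def sample_truncated_ordinates(n, delta, rng):
    """Draw n random ordinates in [-1, -delta) U (delta, 1] with weights |S_l| / |S^delta|."""
    if n % 2 != 0:
        raise ValueError("n must be even")
    m = n // 2
    # Assumed uniform partition of [-1, -delta) into m cells.
    edges = np.linspace(-1.0, -delta, m + 1)
    mu_neg = rng.uniform(edges[:-1], edges[1:])
    # Assumed symmetric pairing: mu_{n+1-l} = -mu_l.
    mu = np.concatenate([mu_neg, -mu_neg[::-1]])
    size_S_delta = 2.0 * (1.0 - delta)          # |S^delta|
    w_half = np.diff(edges) / size_S_delta      # |S_l| / |S^delta|
    w = np.concatenate([w_half, w_half[::-1]])
    return mu, w

if __name__ == "__main__":
    mu, w = sample_truncated_ordinates(8, 0.1, np.random.default_rng(0))
    print(mu, w.sum())
```

Because every sampled ordinate satisfies $|\mu|\geq\delta$, any bound that scales like $1/|\mu|$ stays controlled by $1/\delta$ on this set.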

Lemma A.1.

The operator $\mathcal{A}_\mu(\cdot)$ satisfies

\[ \|\mathcal{A}_\mu(\phi)\|_{L^2(\Omega;\sigma_T)}\leq\|\phi\|_{L^2(\Omega;\sigma_T)}\ \Rightarrow\ \|\mathcal{A}_\mu\|\leq 1,\quad \forall\mu\in[-1,1], \tag{A.3} \]

and for $\mu\in(-1,0)\cup(0,1)$ that

\[ \|\partial_\mu\mathcal{A}_\mu\|\leq\frac{1}{|\mu|}\left(1+\left\|\frac{\sigma_T}{\sigma_r}\right\|_\infty\right). \tag{A.4} \]

Consequently, $\|\mathcal{T}\|\leq 1$, $\|\mathcal{T}^\xi\|\leq 1$, and it holds for the truncated system that

\[ \|\delta\mathcal{T}_\ell^\xi\|\leq\frac{C}{\delta}n^{-1}. \tag{A.5} \]

Proof A.2.

We only consider $\mu\geq 0$ here, as the case $\mu<0$ is similar. Multiplying (A.1) by $\psi$ and integrating over $\Omega$, one has

\[ \begin{split} \mu\frac{1}{2}|\psi(x_R,\mu)|^2+\int_{x_L}^{x_R}\sigma_T|\psi|^2\,dx&=\int_{x_L}^{x_R}\sigma_r\phi\psi\,dx\leq\int_{x_L}^{x_R}\sigma_T|\phi||\psi|\,dx\\ &\leq\left(\int_{x_L}^{x_R}\sigma_T|\phi|^2\,dx\right)^{1/2}\left(\int_{x_L}^{x_R}\sigma_T|\psi|^2\,dx\right)^{1/2}. \end{split} \]

Hence,

\[ \left(\int_{x_L}^{x_R}\sigma_T|\psi|^2\,dx\right)^{1/2}\leq\left(\int_{x_L}^{x_R}\sigma_T|\phi|^2\,dx\right)^{1/2}, \]

which implies that $\|\mathcal{A}_\mu\|\leq 1$.

Taking the derivative of both sides of (A.1) with respect to $\mu$ gives

\[ \partial_x\psi+\mu\partial_x(\partial_\mu\psi)+\sigma_T\partial_\mu\psi=0,\quad \partial_\mu\psi(x_L,\mu)=0. \]

Hence, one has

\[ \partial_\mu\psi=\mathcal{A}_\mu(-\partial_x\psi/\sigma_r)=-\mathcal{A}_\mu\left(\mu^{-1}\Big(\phi-\frac{\sigma_T}{\sigma_r}\psi\Big)\right). \]

By (A.3), one then has

\[ \|\partial_\mu\psi\|\leq\mu^{-1}\left\|\phi-\frac{\sigma_T}{\sigma_r}\mathcal{A}_\mu(\phi)\right\|\leq\mu^{-1}\left(\|\phi\|+\left\|\frac{\sigma_T}{\sigma_r}\right\|_\infty\|\phi\|\right). \]

The inequality (A.4) holds.
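The two bounds can be checked numerically on a discretized version of (A.1). The sketch below is an illustration only: it uses an implicit upwind scheme on a uniform grid for $\mu>0$ (a choice made here for simplicity, not the diamond difference scheme used elsewhere in the paper), illustrative cross sections with $\sigma_r\leq\sigma_T$, and a centered finite difference in $\mu$ as a stand-in for $\partial_\mu\mathcal{A}_\mu$. All function names are hypothetical.

```python
import numpy as np

def apply_A(phi, mu, sigma_T, sigma_r, h):
    """Implicit upwind solve of mu*psi' + sigma_T*psi = sigma_r*phi for mu > 0, psi(x_L) = 0."""
    psi = np.empty_like(phi)
    prev = 0.0  # inflow boundary value
    for i in range(phi.size):
        prev = (mu / h * prev + sigma_r[i] * phi[i]) / (mu / h + sigma_T[i])
        psi[i] = prev
    return psi

def weighted_norm(v, sigma_T, h):
    """Discrete analogue of the L^2(Omega; sigma_T) norm."""
    return np.sqrt(np.sum(h * sigma_T * v**2))

def check_bounds(mu=0.3, eps=1e-4, nx=400, seed=0):
    """Return (||A phi|| / ||phi||, ||d_mu A phi|| / ||phi||, bound from (A.4) at mu - eps)."""
    rng = np.random.default_rng(seed)
    h = 1.0 / nx
    x = (np.arange(nx) + 0.5) * h
    sigma_T = 1.0 + 0.5 * np.sin(2 * np.pi * x)  # illustrative, bounded away from 0
    sigma_r = 0.6 * sigma_T                      # illustrative, sigma_r <= sigma_T
    phi = rng.standard_normal(nx)
    norm_phi = weighted_norm(phi, sigma_T, h)
    psi = apply_A(phi, mu, sigma_T, sigma_r, h)
    # Centered difference in mu approximates the derivative of A_mu(phi).
    dpsi = (apply_A(phi, mu + eps, sigma_T, sigma_r, h)
            - apply_A(phi, mu - eps, sigma_T, sigma_r, h)) / (2 * eps)
    ratio_A3 = weighted_norm(psi, sigma_T, h) / norm_phi
    ratio_A4 = weighted_norm(dpsi, sigma_T, h) / norm_phi
    bound_A4 = (1.0 + np.max(sigma_T / sigma_r)) / (mu - eps)
    return ratio_A3, ratio_A4, bound_A4

if __name__ == "__main__":
    print(check_bounds())
```

For this upwind discretization the same energy argument applies at the discrete level, so the first ratio stays below 1 and the second stays below the bound from (A.4) evaluated at the smallest $\mu$ used in the difference quotient.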

Since $\mathcal{T}^\xi$ and $\mathcal{T}$ are averages of $\mathcal{A}_\mu$, the bounds $\|\mathcal{T}\|\leq 1$ and $\|\mathcal{T}^\xi\|\leq 1$ follow directly from (A.3). For the truncated system, one has

\[ \delta\mathcal{T}^\xi=\alpha_\ell|S_\ell|^{-1}\int_{S_\ell}\big(\mathcal{A}_\mu-\mathcal{A}_{\mu_\ell}\big)\,d\mu, \]

then

\[ \|\delta\mathcal{T}_\ell^\xi\|\leq\alpha_\ell\sup_{\mu\in S_\ell\cup S_{\ell+m}}\|\partial_\mu\mathcal{A}_\mu\|\,|S_\ell|=n^{-1}|S|\alpha_\ell^2\sup_{\mu\in S_\ell\cup S_{\ell+m}}\|\partial_\mu\mathcal{A}_\mu\|. \]

The claim then follows.

Next, our main goal is to estimate $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}$, where $\delta\mathcal{T}^{\xi}$ is given in (A.2).

If the independent random variables $\delta\mathcal{T}_{\ell}^{\xi}$ took values in a Hilbert space, classical concentration inequalities such as the Rosenthal inequality [30] could be used to achieve this. However, here the variables are operators over Hilbert spaces, so they lie in a Banach algebra, and one cannot apply the classical Rosenthal inequality directly. Our goal in this section is therefore to establish a Rosenthal-type inequality for these operator-valued random variables.

First, we observe the following:

\[
\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}\leq\mathbb{E}\big|\|\delta\mathcal{T}^{\xi}\|-\mathbb{E}\|\delta\mathcal{T}^{\xi}\|\big|^{2}+(\mathbb{E}\|\delta\mathcal{T}^{\xi}\|)^{2}.
\]

The first term on the right-hand side can be estimated by the following fact.

Lemma A.3.

Let $B$ be a separable Banach space and let $X_{j}$, $1\leq j\leq m$, be a finite sequence of independent $B$-valued random vectors with $\mathbb{E}\|X_{j}\|^{2}<\infty$. Let $S_{m}=\sum_{j=1}^{m}X_{j}$. Then,

\[
\mathbb{E}\big|\|S_{m}\|-\mathbb{E}\|S_{m}\|\big|^{2}\leq 4\sum_{j=1}^{m}\mathbb{E}\|X_{j}\|^{2}.
\]

This result is a special case of [1, Theorem 2.1], where the general $L^{p}$ case is considered. The proof is a consequence of the Burkholder inequality for martingales. Here, we sketch the proof for this special case. Consider the filtration $\{\mathcal{F}_{j},\,0\leq j\leq m\}$, where

\[
\mathcal{F}_{j}=\sigma(X_{\ell}:1\leq\ell\leq j),\quad 1\leq j\leq m,
\]

and $\mathcal{F}_{0}=\{\Omega,\emptyset\}$. Define $Y_{j}=\mathbb{E}(\|S_{m}\|\,|\,\mathcal{F}_{j})-\mathbb{E}(\|S_{m}\|\,|\,\mathcal{F}_{j-1})$, where $\mathbb{E}(\|S_{m}\|\,|\,\mathcal{F}_{0}):=\mathbb{E}(\|S_{m}\|)$. The $Y_{j}$ are martingale differences and hence orthogonal in $L^{2}$, so one has

\[
\mathbb{E}\big|\|S_{m}\|-\mathbb{E}\|S_{m}\|\big|^{2}=\mathbb{E}\Big|\sum_{j=1}^{m}Y_{j}\Big|^{2}=\sum_{j=1}^{m}\mathbb{E}(|Y_{j}|^{2}).
\]

By the definition, one has

\[
\begin{split}
|Y_{j}|=\Bigg|&\mathbb{E}_{X_{1},\cdots,X_{j-1},X_{j}^{\prime}}\|X_{1}+\cdots+X_{j-1}+X_{j}^{\prime}+X_{j+1}+\cdots\|\\
&-\mathbb{E}_{X_{1},\cdots,X_{j-1}}\|X_{1}+\cdots+X_{j-1}+X_{j}+X_{j+1}+\cdots\|\Bigg|\leq\mathbb{E}_{X_{j}^{\prime}}\|X_{j}^{\prime}-X_{j}\|.
\end{split}
\]

Here $X_{j}^{\prime}$ is an independent copy of $X_{j}$. By Jensen's inequality and $\|X_{j}^{\prime}-X_{j}\|^{2}\leq 2\|X_{j}^{\prime}\|^{2}+2\|X_{j}\|^{2}$, this gives $\mathbb{E}|Y_{j}|^{2}\leq 4\mathbb{E}(\|X_{j}\|^{2})$.
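The bound of Lemma A.3 can be sanity-checked numerically in a simple setting. The sketch below is only an illustration and not part of the proof: it assumes $B=\mathbb{R}^{d}$ with the Euclidean norm and takes the $X_{j}$ uniform on $[-1,1]^{d}$. The function name `check_lemma_a3` and all parameter values are hypothetical choices made for this example.

```python
import math
import random


def check_lemma_a3(m=5, d=2, trials=20000, seed=0):
    """Monte Carlo estimates of both sides of Lemma A.3.

    Assumption (not from the paper): B = R^d with the Euclidean norm and
    X_j uniform on [-1, 1]^d, independent across j.
    Returns (lhs, rhs) with
      lhs ~ E| ||S_m|| - E||S_m|| |^2,   rhs ~ 4 * sum_j E||X_j||^2.
    """
    rng = random.Random(seed)
    norms = []          # samples of ||S_m||
    sq_sums = []        # samples of sum_j ||X_j||^2
    for _ in range(trials):
        s = [0.0] * d
        sq = 0.0
        for _ in range(m):
            x = [rng.uniform(-1.0, 1.0) for _ in range(d)]
            sq += sum(t * t for t in x)
            s = [a + b for a, b in zip(s, x)]
        norms.append(math.sqrt(sum(t * t for t in s)))
        sq_sums.append(sq)
    mean_norm = sum(norms) / trials
    lhs = sum((n - mean_norm) ** 2 for n in norms) / trials
    rhs = 4.0 * sum(sq_sums) / trials
    return lhs, rhs


lhs, rhs = check_lemma_a3()
print(f"variance of ||S_m|| ~ {lhs:.3f}, bound 4*sum E||X_j||^2 ~ {rhs:.3f}")
```

For these sample parameters the right-hand side is close to its exact value $4\cdot m\cdot d/3=40/3$, and the estimated left-hand side lies well below it, consistent with the lemma.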

Hence, the problem is reduced to the estimation of $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|$. We mainly follow the approach in [34]. The analysis in [34] is for matrices, and the final result depends on the dimension of the space. In our case, the operators act on an infinite-dimensional space. We find that the dependence on the dimension enters through the trace of the operators. Fortunately, in our case the operators are compact and their traces can be bounded.

First, we recall Mercer's theorem on trace-class operators, as stated in [20, Sec. 30.5, Theorem 11].

Lemma A.4.

Consider an integral operator $\boldsymbol{K}$ of the form

\[
(\boldsymbol{K}u)(s)=\int_{I}K(s,t)u(t)w(t)\,dt
\]

acting on $L^{2}(I;w)$, where $K$ is a real-valued, symmetric, continuous function of $(s,t)$ and $w(t)$ is a continuous positive weight. If the operator $\boldsymbol{K}$ is positive in the usual sense,

\[
(\boldsymbol{K}u,u)\geq 0\quad\text{for all $u$ in $L^{2}(I;w)$},
\]

then it is of trace class, with the trace equal to the integral of its kernel along the diagonal:

\[
\operatorname{tr}(\boldsymbol{K})=\int_{I}K(s,s)w(s)\,ds.
\]
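A quick numerical illustration of the trace formula, under assumptions that are not taken from the text: let $I=[0,1]$, $w=1$, and $K(s,t)=\min(s,t)$, a standard positive kernel whose eigenvalues are known to be $1/((k-\tfrac{1}{2})^{2}\pi^{2})$ for $k\geq 1$. The sketch compares the (truncated) sum of eigenvalues with the diagonal integral $\int_{0}^{1}K(s,s)\,ds$. The function name `mercer_trace_check` is a hypothetical choice for this example.

```python
import math


def mercer_trace_check(n_terms=100000, n_quad=1000):
    """Compare sum of eigenvalues with the diagonal integral of the kernel.

    Assumption (illustrative only): I = [0, 1], w = 1, K(s, t) = min(s, t),
    whose eigenvalues are 1 / ((k - 1/2)^2 pi^2), k = 1, 2, ...
    """
    # Truncated sum of the known eigenvalues (the trace of the operator).
    eig_sum = sum(1.0 / (((k - 0.5) * math.pi) ** 2)
                  for k in range(1, n_terms + 1))
    # Midpoint rule for the diagonal integral of K(s, s) = min(s, s) = s.
    h = 1.0 / n_quad
    diag_integral = sum(min(s, s) * h
                        for s in ((i + 0.5) * h for i in range(n_quad)))
    return eig_sum, diag_integral


eig_sum, diag_integral = mercer_trace_check()
print(eig_sum, diag_integral)
```

Both quantities agree with the value $1/2$ up to the truncation error of the eigenvalue series.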

The result in [20, Sec. 30.5, Theorem 11] is stated for the uniform weight $w=1$. It is not hard to check that the proof carries over to a general continuous positive weight on $I$. Based on the above lemma, we deduce the following proposition.

Proposition A.5.

For any $\mu\neq 0$, $\mathcal{A}_{\mu}$ is a compact operator. Let $\mathcal{A}_{\mu}^{*}$ denote the adjoint operator of $\mathcal{A}_{\mu}$; then $\mathcal{A}_{\mu}^{*}\mathcal{A}_{\mu}$ and $\mathcal{A}_{\mu}\mathcal{A}_{\mu}^{*}$ are of trace class. Moreover,

\[
\operatorname{tr}\mathcal{A}_{\mu}^{*}\mathcal{A}_{\mu}=\operatorname{tr}\mathcal{A}_{\mu}\mathcal{A}_{\mu}^{*}\leq\frac{1}{|\mu|}|x_{R}-x_{L}|\,\|\sigma_{T}\|_{\infty}^{2}\|\sigma_{T}^{-1}\|_{\infty}.
\]

Proof A.6.

According to [20], an integral operator $L^{2}(I;w)\to L^{2}(I;w)$ with a square-integrable kernel is compact. The first claim then follows directly from the expanded form of the solution.

We only consider $\mu>0$; the proof for $\mu<0$ is similar. By the definition of the adjoint operator, $\langle\mathcal{A}_{\mu}g,h\rangle=\langle g,\mathcal{A}_{\mu}^{*}h\rangle$, one finds that

\[
(\mathcal{A}_{\mu}^{*}\varphi)(x)=\int_{x_{L}}^{x_{R}}\tilde{k}_{\mu}(x,y)\varphi(y)\sigma_{T}(y)\,dy,\quad\tilde{k}_{\mu}(x,y)=k_{\mu}(y,x).
\]

Then, one has

\[
\mathcal{A}_{\mu}\mathcal{A}_{\mu}^{*}\varphi=\int_{x_{L}}^{x_{R}}K_{1}(x,y)\varphi(y)\sigma_{T}(y)\,dy,\quad K_{1}(x,y)=\int_{x_{L}}^{x_{R}}k_{\mu}(x,z)\tilde{k}_{\mu}(z,y)\sigma_{T}(z)\,dz.
\]

Similarly,

\[
\mathcal{A}_{\mu}^{*}\mathcal{A}_{\mu}\varphi=\int_{x_{L}}^{x_{R}}K_{2}(x,y)\varphi(y)\sigma_{T}(y)\,dy,\quad K_{2}(x,y)=\int_{x_{L}}^{x_{R}}\tilde{k}_{\mu}(x,z)k_{\mu}(z,y)\sigma_{T}(z)\,dz.
\]

Hence,

\begin{align}
K_{1}(x,y)&=\int_{x_{L}}^{\min(x,y)}\frac{1}{\mu^{2}}\frac{\sigma_{r}^{2}(z)}{\sigma_{T}(z)}\exp\left(-\frac{1}{\mu}\Big(\int_{z}^{x}\sigma_{T}(w)\,dw+\int_{z}^{y}\sigma_{T}(w)\,dw\Big)\right)dz \tag{A.6a}\\
K_{2}(x,y)&=\int_{\max(x,y)}^{x_{R}}\frac{1}{\mu^{2}}\frac{\sigma_{r}(x)\sigma_{r}(y)}{\sigma_{T}(x)\sigma_{T}(y)}\exp\left(-\frac{1}{\mu}\Big(\int_{x}^{z}\sigma_{T}(w)\,dw+\int_{y}^{z}\sigma_{T}(w)\,dw\Big)\right)\sigma_{T}(z)\,dz. \tag{A.6b}
\end{align}

According to these two formulas, it is clear that both kernels are continuous in $(x,y)$.

Since both operators are positive and have continuous symmetric kernels, Lemma A.4 shows that both are of trace class and

\[
\operatorname{tr}(\mathcal{A}_{\mu}\mathcal{A}_{\mu}^{*})=\int_{x_{L}}^{x_{R}}K_{1}(x,x)\sigma_{T}(x)\,dx,\quad\operatorname{tr}(\mathcal{A}_{\mu}^{*}\mathcal{A}_{\mu})=\int_{x_{L}}^{x_{R}}K_{2}(x,x)\sigma_{T}(x)\,dx.
\]

These two traces are in fact equal by (A.6) and Fubini's theorem, and both equal

\[
\int_{x_{L}}^{x_{R}}\int_{x_{L}}^{x_{R}}|k_{\mu}(x,y)|^{2}\sigma_{T}(x)\sigma_{T}(y)\,dx\,dy.
\]

Then, one finds

\begin{equation}\tag{A.7}
\begin{split}
&\operatorname{tr}(\mathcal{A}_{\mu}\mathcal{A}_{\mu}^{*})=\operatorname{tr}(\mathcal{A}_{\mu}^{*}\mathcal{A}_{\mu})\\
&\leq\frac{1}{\mu^{2}}\|\sigma_{T}\|_{\infty}^{2}\int_{x_{L}}^{x_{R}}\int_{x_{L}}^{x_{R}}\mathbb{I}_{z\leq x}\exp\left(-\frac{2}{\mu}\frac{1}{\|\sigma_{T}^{-1}\|_{\infty}}(x-z)\right)dx\,dz\\
&\leq\frac{1}{\mu}|x_{R}-x_{L}|\,\|\sigma_{T}\|_{\infty}^{2}\|\sigma_{T}^{-1}\|_{\infty}.
\end{split}
\end{equation}

The case $\mu<0$ is similar and is omitted.
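The last inequality in (A.7) can be checked numerically for sample values. The sketch below is an illustration under stated assumptions: the values of $\mu$, of $\|\sigma_{T}^{-1}\|_{\infty}$ (called `c` in the code), and of the interval $[x_{L},x_{R}]$ are hypothetical, and the common factor $\|\sigma_{T}\|_{\infty}^{2}$ is dropped from both sides. It evaluates the double integral by a midpoint rule, compares it with the closed form $L/a-(1-e^{-aL})/a^{2}$, where $a=2/(\mu c)$ and $L=x_{R}-x_{L}$, and then compares the second and third lines of (A.7).

```python
import math


def double_integral(mu, c, x_left, x_right, n=400):
    """Midpoint-rule value of the double integral in (A.7):
    integral over {z <= x} of exp(-2 (x - z) / (mu * c)) dx dz."""
    h = (x_right - x_left) / n
    total = 0.0
    for i in range(n):              # cell index of z
        for j in range(i, n):       # cell index of x, only cells with x >= z
            # Diagonal cells lie only half inside the region z <= x.
            weight = 0.5 if i == j else 1.0
            total += weight * math.exp(-2.0 * (j - i) * h / (mu * c))
    return total * h * h


def closed_form(mu, c, x_left, x_right):
    """Exact value of the same integral, with a = 2 / (mu * c)."""
    a = 2.0 / (mu * c)
    length = x_right - x_left
    return length / a - (1.0 - math.exp(-a * length)) / a ** 2


# Hypothetical sample values, chosen only for illustration.
mu, c, x_left, x_right = 0.5, 1.0, 0.0, 1.0
numeric = double_integral(mu, c, x_left, x_right)
exact = closed_form(mu, c, x_left, x_right)
middle = numeric / mu ** 2             # second line of (A.7), factor dropped
bound = (x_right - x_left) * c / mu    # third line of (A.7), factor dropped
print(numeric, exact, middle, bound)
```

The closed form also shows why the bound holds in general: the integral is at most $L/a=\mu c L/2$, so the second line of (A.7) is at most half of the third.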

We adapt the argument in [34] to our case to estimate $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|$.

First, we define the symmetrization of the operators. Because $\delta\mathcal{T}^{\xi}$ is not a self-adjoint operator, we let $\delta\mathcal{H}$ be the symmetrization of $\delta\mathcal{T}^{\xi}$, given by

\[
\delta\mathcal{H}=\frac{1}{m}\sum_{\ell=1}^{m}\delta\mathcal{H}_{\ell},
\]

where

\[
\delta\mathcal{H}_{\ell}:=\left[\begin{array}{cc}0&\delta\mathcal{T}_{\ell}^{\xi}\\ (\delta\mathcal{T}_{\ell}^{\xi})^{*}&0\end{array}\right].
\]

Clearly, each $\delta\mathcal{H}_{\ell}$ is an operator $(L^{2}(\sigma_{T}))^{\otimes 2}\to(L^{2}(\sigma_{T}))^{\otimes 2}$. The symmetrization has the useful property that

\begin{equation}\tag{A.8}
\|\delta\mathcal{T}_{\ell}^{\xi}\|=\|\delta\mathcal{H}_{\ell}\|,\quad\|\delta\mathcal{T}^{\xi}\|=\|\delta\mathcal{H}\|.
\end{equation}

In fact, since $\delta\mathcal{H}_{\ell}$ is a self-adjoint operator, we have

\[
\begin{aligned}
\|\delta\mathcal{H}_{\ell}\|^{2} &= \|\delta\mathcal{H}_{\ell}^{2}\| = \Bigg\|\left[\begin{array}{cc}
\delta\mathcal{T}_{\ell}^{\xi}(\delta\mathcal{T}_{\ell}^{\xi})^{*} & 0 \\
0 & (\delta\mathcal{T}_{\ell}^{\xi})^{*}\delta\mathcal{T}_{\ell}^{\xi}
\end{array}\right]\Bigg\| \\
&= \max\{\|\delta\mathcal{T}_{\ell}^{\xi}(\delta\mathcal{T}_{\ell}^{\xi})^{*}\|, \|(\delta\mathcal{T}_{\ell}^{\xi})^{*}\delta\mathcal{T}_{\ell}^{\xi}\|\} = \|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}.
\end{aligned}
\]

The other relation $\|\delta\mathcal{T}^{\xi}\| = \|\delta\mathcal{H}\|$ follows from the same argument. Hence, it suffices to estimate $\|\delta\mathcal{H}\|$.
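As a finite-dimensional sanity check of the block structure used above, the following minimal sketch assumes that $\delta\mathcal{H}_{\ell}$ is the self-adjoint dilation of $T = \delta\mathcal{T}_{\ell}^{\xi}$, with $T$ in the upper-right block and $T^{*}$ in the lower-left block. This form is inferred from the displayed square; the test matrix `T` and all helper names are illustrative and not taken from the paper. With exact integer arithmetic it checks that the dilation is symmetric and that its square is block diagonal.

```python
# Sketch: for a real matrix T, the dilation H = [[0, T], [T^T, 0]] is symmetric
# and H^2 = diag(T T^T, T^T T). All names here are illustrative assumptions.

def matmul(A, B):
    """Product of two matrices given as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    """Transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*A)]

def dilation(T):
    """Self-adjoint dilation [[0, T], [T^T, 0]] of an r x c real matrix T."""
    r, c = len(T), len(T[0])
    Tt = transpose(T)
    top = [[0] * r + list(T[i]) for i in range(r)]
    bottom = [list(Tt[j]) + [0] * c for j in range(c)]
    return top + bottom

def block_diag(A, B):
    """Block-diagonal matrix diag(A, B) for square A and B."""
    a, b = len(A), len(B)
    return ([list(A[i]) + [0] * b for i in range(a)]
            + [[0] * a + list(B[j]) for j in range(b)])

T = [[1, 2, 0], [-1, 3, 4]]  # arbitrary 2 x 3 test matrix
H = dilation(T)
H_squared = matmul(H, H)
expected = block_diag(matmul(T, transpose(T)), matmul(transpose(T), T))

is_symmetric = (H == transpose(H))
is_block_diagonal = (H_squared == expected)
print(is_symmetric, is_block_diagonal)
```

Since the norm of a block-diagonal operator is the maximum of the norms of its blocks, this structure is what gives $\|\delta\mathcal{H}_{\ell}\|^{2} = \|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}$.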

Next, we consider the Rademacher symmetrization:

\[
\delta\mathcal{H}^{\epsilon} = \frac{1}{m}\sum_{\ell=1}^{m}\epsilon_{\ell}\delta\mathcal{H}_{\ell},
\]

where $\epsilon_{\ell}$ are independent Rademacher random variables (i.e., independent random variables taking the values $-1$ and $1$ with equal probability). Introducing $\epsilon_{\ell}$ brings extra randomness that makes it possible to exploit the independence. The following is well known (see [34, Fact 3.1]) and indicates that introducing $\epsilon_{\ell}$ does not lose control of the random variables.

Lemma A.7.

The Rademacher symmetrization satisfies

\[
\frac{1}{2}\mathbb{E}\|\delta\mathcal{H}^{\epsilon}\| \leq \mathbb{E}\|\delta\mathcal{H}\| \leq 2\mathbb{E}\|\delta\mathcal{H}^{\epsilon}\|,
\]

where $\mathbb{E}$ denotes the expectation with respect to all randomness, including the Rademacher variables and the random ordinates.

The following estimate bounds the expectation over $\epsilon_{\ell}$, following the approach in [34].

Lemma A.8.

It holds that

\[
\mathbb{E}_{\epsilon}\|\delta\mathcal{H}^{\epsilon}\| \leq \sqrt{3+2\left[\log\frac{m^{-1}\sum_{\ell}\operatorname{tr}\left(\delta\mathcal{H}_{\ell}^{2}\right)}{m^{-1}\sum_{\ell}\|\delta\mathcal{H}_{\ell}^{2}\|}\right]}\left(\frac{1}{m^{2}}\sum_{\ell}\|\delta\mathcal{H}_{\ell}^{2}\|\right)^{1/2},
\]

where $\mathbb{E}_{\epsilon}$ denotes the expectation over the Rademacher variables.

Proof A.9.

Note that $\delta\mathcal{H}^{\epsilon}$ is self-adjoint, and $(\delta\mathcal{H}^{\epsilon})^{2}$ is nonnegative and compact. Hence, $\|\delta\mathcal{H}^{\epsilon}\|^{2p} = \|(\delta\mathcal{H}^{\epsilon})^{2p}\|$ for each positive integer $p$, so that

\[
\mathbb{E}_{\epsilon}\|\delta\mathcal{H}^{\epsilon}\| \leq \left(\mathbb{E}_{\epsilon}\|(\delta\mathcal{H}^{\epsilon})^{2p}\|\right)^{1/(2p)} \leq \left[\mathbb{E}_{\epsilon}\operatorname{tr}\left((\delta\mathcal{H}^{\epsilon})^{2p}\right)\right]^{1/(2p)}.
\]

The first inequality is Jensen's inequality. The second holds because the norm of the nonnegative operator $(\delta\mathcal{H}^{\epsilon})^{2p}$ is its largest eigenvalue, while the trace is the sum of all its eigenvalues. For each index $\ell \in \{1,\cdots,m\}$, define the random operators

\[
\delta\mathcal{H}_{+\ell} := \frac{1}{m}\delta\mathcal{H}_{\ell} + \frac{1}{m}\sum_{j\neq\ell}\epsilon_{j}\delta\mathcal{H}_{j} \quad\text{and}\quad \delta\mathcal{H}_{-\ell} := -\frac{1}{m}\delta\mathcal{H}_{\ell} + \frac{1}{m}\sum_{j\neq\ell}\epsilon_{j}\delta\mathcal{H}_{j}.
\]

Denote $\hat{\epsilon}_{\ell} = \{\epsilon_{1},\cdots,\epsilon_{\ell-1},\epsilon_{\ell+1},\cdots,\epsilon_{m}\}$. Then, it holds that

\[
\begin{aligned}
\mathbb{E}_{\epsilon}\operatorname{tr}\left((\delta\mathcal{H}^{\epsilon})^{2p}\right) &= \frac{1}{2}\sum_{\ell}\mathbb{E}_{\hat{\epsilon}_{\ell}}\operatorname{tr}\left(\frac{1}{m}\delta\mathcal{H}_{\ell}\left(\delta\mathcal{H}_{+\ell}^{2p-1}-\delta\mathcal{H}_{-\ell}^{2p-1}\right)\right) \\
&= \frac{1}{m^{2}}\sum_{\ell=1}^{m}\sum_{j=0}^{2p-2}\mathbb{E}_{\hat{\epsilon}_{\ell}}\operatorname{tr}\left(\delta\mathcal{H}_{\ell}\delta\mathcal{H}_{+\ell}^{j}\delta\mathcal{H}_{\ell}\delta\mathcal{H}_{-\ell}^{2p-2-j}\right) \\
&\leq \frac{1}{m^{2}}\sum_{\ell=1}^{m}\frac{2p-1}{2}\mathbb{E}_{\hat{\epsilon}_{\ell}}\operatorname{tr}\left[\delta\mathcal{H}_{\ell}^{2}\cdot\left(\delta\mathcal{H}_{+\ell}^{2p-2}+\delta\mathcal{H}_{-\ell}^{2p-2}\right)\right] \\
&= (2p-1)\operatorname{tr}\left[\frac{1}{m^{2}}\left(\sum_{\ell=1}^{m}\delta\mathcal{H}_{\ell}^{2}\right)\mathbb{E}_{\epsilon}(\delta\mathcal{H}^{\epsilon})^{2p-2}\right].
\end{aligned}
\]

The second line is due to the equality

\[
\delta\mathcal{H}_{+\ell}^{2p-1}-\delta\mathcal{H}_{-\ell}^{2p-1} = \sum_{q=0}^{2p-2}\delta\mathcal{H}_{+\ell}^{q}(\delta\mathcal{H}_{+\ell}-\delta\mathcal{H}_{-\ell})\delta\mathcal{H}_{-\ell}^{2p-2-q},
\]

and $\delta\mathcal{H}_{+\ell}-\delta\mathcal{H}_{-\ell} = \frac{2}{m}\delta\mathcal{H}_{\ell}$. The third line is due to the following Geometric Mean-Arithmetic Mean (GM-AM) trace inequality in [34, Fact 2.4]:

\[
\operatorname{tr}(HW^{q}HY^{2r-q}) + \operatorname{tr}(HW^{2r-q}HY^{q}) \leq \operatorname{tr}(H^{2}(W^{2r}+Y^{2r})), \tag{A.9}
\]

which can be generalized to self-adjoint compact operators in trace class. Here, $q, r$ are integers with $0 \leq q \leq 2r$, and $H$, $W$, and $Y$ are arbitrary self-adjoint compact operators in trace class. In fact, one can approximate the compact operators by finite-rank self-adjoint operators, and the finite-rank operators (essentially matrices) satisfy (A.9). Passing to the limit in the finite-rank approximation then verifies the inequality for self-adjoint compact operators. The last line follows since $\delta\mathcal{H}_{+\ell}^{2p-2}+\delta\mathcal{H}_{-\ell}^{2p-2} = 2\mathbb{E}_{\epsilon_{\ell}}(\delta\mathcal{H}^{\epsilon})^{2p-2}$.
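The matrix case of the GM-AM trace inequality (A.9) can be probed numerically. The sketch below is only an illustration under stated assumptions: it uses small real symmetric matrices with integer entries (so every trace is computed exactly), a fixed random seed, and hypothetical helper names such as `gm_am_gap`. It checks a few samples and is not a proof.

```python
# Sketch: check tr(H W^q H Y^(2r-q)) + tr(H W^(2r-q) H Y^q) <= tr(H^2 (W^(2r) + Y^(2r)))
# on random symmetric integer matrices. Helper names are illustrative assumptions.
import random

def matmul(A, B):
    """Product of two square matrices given as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matpow(A, k):
    """k-th power of a square matrix, with A^0 the identity."""
    n = len(A)
    result = [[int(i == j) for j in range(n)] for i in range(n)]
    for _ in range(k):
        result = matmul(result, A)
    return result

def matadd(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def random_symmetric(n, rng, bound=3):
    """Symmetric n x n matrix with integer entries in [-bound, bound]."""
    M = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            M[i][j] = M[j][i] = rng.randint(-bound, bound)
    return M

def gm_am_gap(H, W, Y, q, r):
    """Right-hand side minus left-hand side of (A.9); expected to be >= 0."""
    lhs = (trace(matmul(matmul(H, matpow(W, q)), matmul(H, matpow(Y, 2 * r - q))))
           + trace(matmul(matmul(H, matpow(W, 2 * r - q)), matmul(H, matpow(Y, q)))))
    rhs = trace(matmul(matmul(H, H), matadd(matpow(W, 2 * r), matpow(Y, 2 * r))))
    return rhs - lhs

rng = random.Random(0)
gaps = []
for _ in range(20):
    H, W, Y = (random_symmetric(3, rng) for _ in range(3))
    for r in range(3):
        for q in range(2 * r + 1):
            gaps.append(gm_am_gap(H, W, Y, q, r))
min_gap = min(gaps)
print(min_gap >= 0)
```

In the proof, the inequality is applied with $H = \delta\mathcal{H}_{\ell}$, $W = \delta\mathcal{H}_{+\ell}$, $Y = \delta\mathcal{H}_{-\ell}$ and $2r = 2p-2$, pairing the terms with indices $q$ and $2p-2-q$.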

Recall that (see [20, section 30.2, Theorem 2]) if $A$ is self-adjoint, nonnegative and in trace class, then $\|A\|_{\operatorname{tr}} = \operatorname{tr}(A)$ and

\[
\operatorname{tr}(AB) \leq \|AB\|_{\operatorname{tr}} \leq \|B\|\|A\|_{\operatorname{tr}} = \|B\|\operatorname{tr}(A). \tag{A.10}
\]

Applying (A.10) and repeating the above process, one then has

\[
\begin{split}
\mathbb{E}_{\epsilon}\operatorname{tr}\left((\delta\mathcal{H}^{\epsilon})^{2p}\right) &\leq (2p-1)\left\|\frac{1}{m^{2}}\left(\sum_{\ell=1}^{m}\delta\mathcal{H}_{\ell}^{2}\right)\right\|\mathbb{E}_{\epsilon}\operatorname{tr}\left((\delta\mathcal{H}^{\epsilon})^{2p-2}\right) \\
&\leq (2p-1)!!\left\|\frac{1}{m^{2}}\left(\sum_{\ell=1}^{m}\delta\mathcal{H}_{\ell}^{2}\right)\right\|^{p-1}\mathbb{E}_{\epsilon}\operatorname{tr}\left((\delta\mathcal{H}^{\epsilon})^{2}\right).
\end{split}
\]

Since

\[
\mathbb{E}_{\epsilon}(\delta\mathcal{H}^{\epsilon})^{2} = \frac{1}{m^{2}}\sum_{\ell}\delta\mathcal{H}_{\ell}^{2}
\]

and $(2p-1)!! \leq \left(\frac{2p+1}{\mathrm{e}}\right)^{p}$, we then arrive at

\[
\mathbb{E}_{\epsilon}\|\delta\mathcal{H}^{\epsilon}\| \leq \sqrt{\frac{2p+1}{\mathrm{e}}}\left(\frac{1}{m^{2}}\sum_{\ell}\|\delta\mathcal{H}_{\ell}^{2}\|\right)^{1/2-1/(2p)}\left(\frac{1}{m^{2}}\sum_{\ell}\operatorname{tr}\left(\delta\mathcal{H}_{\ell}^{2}\right)\right)^{1/(2p)}.
\]

Taking

\[
p = \left[\log\frac{\sum_{\ell}\operatorname{tr}\left(\delta\mathcal{H}_{\ell}^{2}\right)}{\sum_{\ell}\|\delta\mathcal{H}_{\ell}^{2}\|}\right] + 1
\]

gives the result.
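The two elementary facts behind this last step can be checked numerically. The sketch below assumes that $[\cdot]$ denotes the integer part (floor) and writes `ratio` for the quotient $\sum_{\ell}\operatorname{tr}(\delta\mathcal{H}_{\ell}^{2}) / \sum_{\ell}\|\delta\mathcal{H}_{\ell}^{2}\|$, which is at least $1$ because the trace of a nonnegative operator dominates its norm. The function names and sample values are illustrative only. It tests the double-factorial bound for a range of $p$, and tests that with $p = [\log(\text{ratio})] + 1$ the squared prefactor $\frac{2p+1}{\mathrm{e}}\,\text{ratio}^{1/p}$ does not exceed $3 + 2[\log(\text{ratio})]$.

```python
# Sketch: numerical check of (2p-1)!! <= ((2p+1)/e)^p and of the choice of p.
# Function names and sample ratios are illustrative assumptions.
import math

def odd_double_factorial(p):
    """(2p-1)!! = 1 * 3 * ... * (2p-1), computed exactly with integers."""
    out = 1
    for k in range(1, p + 1):
        out *= 2 * k - 1
    return out

def double_factorial_bound_holds(p):
    """Check (2p-1)!! <= ((2p+1)/e)^p for a given positive integer p."""
    return odd_double_factorial(p) <= ((2 * p + 1) / math.e) ** p

def prefactor_bound_holds(ratio):
    """Check ((2p+1)/e) * ratio^(1/p) <= 3 + 2*floor(log ratio), p = floor(log ratio) + 1."""
    k = math.floor(math.log(ratio))
    p = k + 1
    lhs = (2 * p + 1) / math.e * ratio ** (1.0 / p)
    return lhs <= 3 + 2 * k

all_p_ok = all(double_factorial_bound_holds(p) for p in range(1, 41))
sample_ratios = [1.0, 1.5, 2.0, 5.0, 10.0, 1e2, 1e4, 1e8]
all_ratios_ok = all(prefactor_bound_holds(x) for x in sample_ratios)
print(all_p_ok, all_ratios_ok)
```

The second check reflects the reasoning that $p > \log(\text{ratio})$ forces $\text{ratio}^{1/p} < \mathrm{e}$, so the prefactor is at most $2p+1 = 3 + 2[\log(\text{ratio})]$.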

Combining Lemma A.3 and Lemma A.8, we obtain a Rosenthal-type inequality for $\delta\mathcal{T}^{\xi}$.

Theorem A.10.

For $p=2$ and $\delta\mathcal{T}^{\xi}$ defined in (A.2), it holds that

\[
\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2}) \leq 8\mathbb{E}\left[\left(\log\frac{m^{-1}\sum_{\ell}\operatorname{tr}\left((\delta\mathcal{T}_{\ell}^{\xi})^{*}\delta\mathcal{T}_{\ell}^{\xi}\right)}{m^{-1}\sum_{\ell}\|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}} + 2.7\right)\frac{1}{m^{2}}\sum_{\ell=1}^{m}\|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}\right]. \tag{A.11}
\]

Proof A.11.

By Lemma A.7, one finds that

\[
\mathbb{E}\|\delta\mathcal{T}^{\xi}\| = \mathbb{E}\|\delta\mathcal{H}\| \leq 2\mathbb{E}\|\delta\mathcal{H}^{\epsilon}\| = 2\mathbb{E}(\mathbb{E}_{\epsilon}\|\delta\mathcal{H}^{\epsilon}\|).
\]

Then according to Lemma A.3 and Lemma A.8, one has

\[
\begin{split}
\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2}) &\leq (\mathbb{E}\|\delta\mathcal{T}^{\xi}\|)^{2} + 4\sum_{\ell=1}^{m}\frac{1}{m^{2}}\mathbb{E}\|\delta\mathcal{T}_{\ell}^{\xi}\|^{2} \\
&\leq 4\mathbb{E}\left[\left(4 + 2\log\frac{m^{-1}\sum_{\ell}\operatorname{tr}\left((\delta\mathcal{T}_{\ell}^{\xi})^{*}\delta\mathcal{T}_{\ell}^{\xi}\right)}{m^{-1}\sum_{\ell}\|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}} + 2\log 2\right)\frac{1}{m^{2}}\sum_{\ell}\|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}\right],
\end{split}
\]

where we have used $[x] \leq x$. Since $2\log 2 < 1.4$, the result follows.
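The passage from the last display to the constant $2.7$ in (A.11) is a short piece of arithmetic, which the following sketch spells out. Here `log_ratio` stands for the logarithm of the trace-to-norm quotient appearing in both bounds; the function names are hypothetical and serve only to compare the two coefficients.

```python
# Sketch: compare the coefficient from the proof with the one stated in (A.11).
# Both multiply the same factor (1/m^2) * sum ||dT_l||^2; names are illustrative.
import math

def proof_coefficient(log_ratio):
    """4 * (4 + 2*log_ratio + 2*log 2), from the last display of the proof."""
    return 4 * (4 + 2 * log_ratio + 2 * math.log(2))

def theorem_coefficient(log_ratio):
    """8 * (log_ratio + 2.7), as stated in (A.11)."""
    return 8 * (log_ratio + 2.7)

# The difference does not depend on log_ratio: it equals 8 * (0.7 - log 2).
slack = theorem_coefficient(0.0) - proof_coefficient(0.0)
dominated = all(proof_coefficient(x) <= theorem_coefficient(x)
                for x in (0.0, 1.0, 10.0))
print(slack > 0, dominated)
```

Equivalently, $4(4 + 2\log 2) < 4 \cdot 5.4 = 8 \cdot 2.7$, which is exactly the use of $2\log 2 < 1.4$.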

We conclude the following.

Corollary A.12 (Bounds of $\mathbb{E}\|\delta\mathcal{T}^{\xi}\|^{2}$).

It holds for the truncated system that

\[
\mathbb{E}(\|\delta\mathcal{T}^{\xi}\|^{2}) \leq C(\log n + 1)n^{-3},
\]

where $C$ is a constant independent of $n$.

Proof A.13.

In (A.11), we note that

\[
\mathbb{E}\operatorname{tr}((\delta\mathcal{T}_{\ell}^{\xi})^{*}\delta\mathcal{T}_{\ell}^{\xi}) = \mathbb{E}\operatorname{tr}((\bar{\mathcal{T}}_{\ell}^{\xi})^{*}\bar{\mathcal{T}}_{\ell}^{\xi}) - \operatorname{tr}(\bar{\mathcal{T}}_{\ell}^{*}\bar{\mathcal{T}}_{\ell}) \leq \mathbb{E}\operatorname{tr}((\bar{\mathcal{T}}_{\ell}^{\xi})^{*}\bar{\mathcal{T}}_{\ell}^{\xi}),
\]

where $\bar{\mathcal{T}}_{\ell}=\frac{1}{2}(\mathcal{T}_{\ell}+\mathcal{T}_{\ell+m})$ and $\bar{\mathcal{T}}_{\ell}^{\xi}$ is similarly defined. The inequality above holds because $\operatorname{tr}(\bar{\mathcal{T}}_{\ell}^{*}\bar{\mathcal{T}}_{\ell})=\operatorname{tr}(\bar{\mathcal{T}}_{\ell}\bar{\mathcal{T}}_{\ell}^{*})\geq 0$. Moreover, using the simple control,

\[
\operatorname{tr}(T_{1}T_{2}^{*}+T_{2}T_{1}^{*})\leq\operatorname{tr}(T_{1}T_{1}^{*}+T_{2}T_{2}^{*}),\quad\operatorname{tr}(T_{1}^{*}T_{2}+T_{2}^{*}T_{1})\leq\operatorname{tr}(T_{1}^{*}T_{1}+T_{2}^{*}T_{2}),
\]

we conclude by Lemma A.5 for the truncated system that

\[
n^{-1}\sum_{\ell}\operatorname{tr}(\delta\mathcal{T}_{\ell}^{\xi})^{*}\delta\mathcal{T}_{\ell}^{\xi}\leq C.
\]
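For the reader's convenience, the simple control used above can be checked directly. The following is a sketch, assuming that $T_{1}$ and $T_{2}$ are Hilbert–Schmidt so that every trace below is finite:

\[
0\leq\operatorname{tr}\big((T_{1}-T_{2})(T_{1}-T_{2})^{*}\big)=\operatorname{tr}(T_{1}T_{1}^{*}+T_{2}T_{2}^{*})-\operatorname{tr}(T_{1}T_{2}^{*}+T_{2}T_{1}^{*}),
\]

and the second inequality follows in the same way from $\operatorname{tr}((T_{1}-T_{2})^{*}(T_{1}-T_{2}))\geq 0$. Reading "similarly defined" as $\bar{\mathcal{T}}_{\ell}^{\xi}=\frac{1}{2}(\mathcal{T}_{\ell}^{\xi}+\mathcal{T}_{\ell+m}^{\xi})$ (an assumption on the notation), expanding the product and applying the control with $T_{1}=\mathcal{T}_{\ell}^{\xi}$, $T_{2}=\mathcal{T}_{\ell+m}^{\xi}$ gives

\[
\operatorname{tr}\big((\bar{\mathcal{T}}_{\ell}^{\xi})^{*}\bar{\mathcal{T}}_{\ell}^{\xi}\big)\leq\frac{1}{2}\operatorname{tr}\big((\mathcal{T}_{\ell}^{\xi})^{*}\mathcal{T}_{\ell}^{\xi}+(\mathcal{T}_{\ell+m}^{\xi})^{*}\mathcal{T}_{\ell+m}^{\xi}\big).
\]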

Moreover, by Lemma A.1 and noting $n=2m$,

\[
\frac{1}{m^{2}}\sum_{\ell}\|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}\leq C\frac{1}{n^{3}},\quad\Big|\log\Big(m^{-1}\sum_{\ell}\|\delta\mathcal{T}_{\ell}^{\xi}\|^{2}\Big)\Big|\leq C(1+\log n).
\]

The claim then follows.

Next, we move to the estimation of $\|\delta b(x)\|$, which is much easier than $\delta\mathcal{T}^{\xi}$ since it is in the Hilbert space $L^{2}(\sigma_{T})$. Here, we prove the lemma without truncation (the truncated version is clearly correct if the un-truncated one holds).

Lemma A.14.

Suppose that $\mu\mapsto\psi_{\mu}(x_{L})$ is Lipschitz on $(-1,0)$ and $\mu\mapsto\psi_{\mu}(x_{R})$ is Lipschitz on $(0,1)$. Then, it holds that

\[
\mathbb{E}\|\delta b\|_{L^{2}(\sigma_{T})}\leq C\sqrt{n^{-3}(1+\log n)},
\]

where $C$ is a constant that is independent of $n$.

Proof A.15.

By the Hölder inequality

\[
\mathbb{E}\|\delta b\|\leq\sqrt{\mathbb{E}\|\delta b\|^{2}}.
\]

Define $\tilde{b}_{\ell}:=b_{\mu_{\ell}}+b_{\mu_{\ell+m}}$. Then it holds that

\begin{align*}
\mathbb{E}\|\delta b\|^{2}
&=\mathbb{E}\Big\|\frac{1}{n}\sum_{\ell}\alpha_{\ell}b_{\mu_{\ell}}-\frac{1}{2}\int_{-1}^{1}b_{\mu}\,d\mu\Big\|^{2}
=\mathbb{E}\Big\|\sum_{\ell=1}^{m}\left(\omega_{\ell}\tilde{b}_{\ell}-\frac{1}{|S|}\int_{S_{\ell}\cup S_{\ell+m}}b_{\mu}\,d\mu\right)\Big\|^{2} \tag{A.12}\\
&=\sum_{\ell=1}^{m}\mathbb{E}\Big\|\omega_{\ell}\tilde{b}_{\ell}-\frac{1}{|S|}\int_{S_{\ell}\cup S_{\ell+m}}b_{\mu}\,d\mu\Big\|^{2}\\
&=\frac{1}{|S|^{2}}\sum_{\ell=1}^{m}\mathbb{E}\Big\|\int_{S_{\ell}}(b_{\mu_{\ell}}-b_{\mu})\,d\mu+\int_{S_{\ell+m}}(b_{\mu_{\ell+m}}-b_{\mu})\,d\mu\Big\|^{2}\\
&\leq\frac{2}{|S|^{2}}\sum_{\ell=1}^{n}\mathbb{E}\left(\int_{S_{\ell}}\|b_{\mu_{\ell}}-b_{\mu}\|\,d\mu\right)^{2}
\leq\sum_{\ell=1}^{n}\frac{2|S_{\ell}|}{|S|^{2}}\mathbb{E}\int_{S_{\ell}}\|b_{\mu_{\ell}}-b_{\mu}\|^{2}\,d\mu.
\end{align*}

Above, the summation can be moved out of the norm because the expectations of the cross terms are zero, i.e.,

\[
\mathbb{E}\left(\omega_{\ell}\tilde{b}_{\ell}-\frac{1}{|S|}\int_{S_{\ell}\cup S_{\ell+m}}b_{\mu}\,d\mu\right)\left(\omega_{\ell^{\prime}}\tilde{b}_{\ell^{\prime}}-\frac{1}{|S|}\int_{S_{\ell^{\prime}}\cup S_{\ell^{\prime}+m}}b_{\mu}\,d\mu\right)=0
\]

for $\ell\neq\ell^{\prime}$. The last inequality is due to the Hölder inequality.
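The two elementary inequalities behind the last line of (A.12) can be written out as follows. This is a sketch in which $a$, $b$ denote generic elements of the Hilbert space and $f$ a generic nonnegative square-integrable function on $S_{\ell}$; these symbols are introduced here only for illustration:

\[
\|a+b\|^{2}\leq 2\|a\|^{2}+2\|b\|^{2},\qquad\Big(\int_{S_{\ell}}f\,d\mu\Big)^{2}\leq|S_{\ell}|\int_{S_{\ell}}f^{2}\,d\mu.
\]

The first, together with $\|\int g\,d\mu\|\leq\int\|g\|\,d\mu$, produces the factor $2$ and merges the contributions of $S_{\ell}$ and $S_{\ell+m}$ into a single sum over $\ell=1,\dots,n$. The second is the Hölder (Cauchy–Schwarz) inequality applied with $f=\|b_{\mu_{\ell}}-b_{\mu}\|$.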

Using the explicit formula of $b_{\mu}$, we now show that, for $\ell=1,\cdots,m$ and $\mu\in S_{\ell}$,

\[
\|b_{\mu_{\ell}}-b_{\mu}\|^{2}\leq\begin{cases}C\dfrac{1}{|\mu|n^{2}}&\inf\{|\mu|:\mu\in S_{\ell}\}\geq 2n^{-1},\\ Cn^{-1}&\text{otherwise}.\end{cases}\tag{A.13}
\]

Below, we consider only $\mu>0$ ($\mu<0$ is similar). Recall the boundary propagator for $\mu>0$:

\[
B_{\mu}(x)=\exp\left(-\frac{1}{\mu}\int_{x_{L}}^{x}\sigma_{T}(y)\,dy\right).
\]

Clearly, for $\mu_{1},\mu_{2}\in S_{\ell}$, $\mu_{1}>0$, $\mu_{2}>0$ and $\mu_{1}<\mu_{2}$, one has $\delta\mu:=\mu_{2}-\mu_{1}\leq\alpha_{\ell}|S|/n$ and thus

\[
\|b_{\mu_{1}}-b_{\mu_{2}}\|^{2}\leq 2\|B_{\mu_{1}}(\cdot)-B_{\mu_{2}}(\cdot)\|^{2}\sup_{\mu}|\psi(x_{L},\mu)|+2\alpha_{\ell}^{2}|S|^{2}\|B_{\mu_{2}}(\cdot)\|^{2}L_{\psi}^{2}n^{-2},
\]

where $L_{\psi}$ is the Lipschitz constant of $\psi$.

Let $M(x)=\int_{x_{L}}^{x}\sigma_{T}(y)\,dy$. Note the simple fact

\[
B_{\mu_{1}}(x)-B_{\mu_{2}}(x)=\exp\left(-\frac{M(x)}{\mu_{2}}\right)\left(\exp\left(-\frac{M(x)\delta\mu}{\mu_{1}\mu_{2}}\right)-1\right).
\]

If $\inf\{\mu:\mu\in S_{\ell}\}<2/n$, since $\alpha_{\ell}$ is bounded, $\mu_{2}\leq C/n$ for some constant $C>0$. Since $\left|\exp\left(-\frac{M(x)\delta\mu}{\mu_{1}\mu_{2}}\right)-1\right|<1$, one has

\[
\left\|B_{\mu_{1}}(\cdot)-B_{\mu_{2}}(\cdot)\right\|^{2}\leq\left\|\exp\left(-\frac{M(x)}{\mu_{2}}\right)\right\|^{2}\leq Cn^{-1}.
\]
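The last bound can be made explicit by the change of variables $s=M(x)$, $ds=\sigma_{T}(x)\,dx$. A short sketch, assuming that $\|\cdot\|$ denotes the weighted $L^{2}(\sigma_{T})$ norm and that $\sigma_{T}\geq 0$:

\[
\left\|\exp\left(-\frac{M(x)}{\mu_{2}}\right)\right\|^{2}=\int_{x_{L}}^{x_{R}}\exp\left(-\frac{2M(x)}{\mu_{2}}\right)\sigma_{T}(x)\,dx=\int_{0}^{M(x_{R})}e^{-2s/\mu_{2}}\,ds\leq\frac{\mu_{2}}{2},
\]

which is of order $n^{-1}$ because $\mu_{2}\leq C/n$ in this case.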

If $\inf\{\mu:\mu\in S_{\ell}\}\geq 2/n$, then $\mu_{2}/\mu_{1}=1+\delta\mu/\mu_{1}$ is bounded. Using the simple bound $\left|\exp\left(-\frac{M(x)\delta\mu}{\mu_{1}\mu_{2}}\right)-1\right|\leq M(x)\delta\mu/(\mu_{1}\mu_{2})$, one has

\[
\begin{split}
\left\|B_{\mu_{1}}(\cdot)-B_{\mu_{2}}(\cdot)\right\|^{2}&\leq\int_{x_{L}}^{x_{R}}\frac{\delta\mu^{2}}{\mu_{1}^{2}\mu_{2}^{2}}M^{2}(x)\exp\left(-\frac{2M(x)}{\mu_{2}}\right)\sigma_{T}(x)\,dx\\
&\leq C\frac{\mu_{2}}{n^{2}\mu_{1}^{2}}\leq C^{\prime}\frac{1}{\mu_{1}n^{2}}.
\end{split}
\]

The second inequality here follows by the substitution $w=x/\mu_{2}$ and the fact

\[
z^{2}\exp(-2z)\leq Ce^{-z}.
\]
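Both ingredients of this step can be made explicit. The following sketch uses the change of variables $s=M(x)$ followed by $s=\mu_{2}z$, which is one possible reading of the substitution mentioned above, again with the weighted $L^{2}(\sigma_{T})$ norm:

\[
\int_{x_{L}}^{x_{R}}M^{2}(x)\exp\left(-\frac{2M(x)}{\mu_{2}}\right)\sigma_{T}(x)\,dx=\int_{0}^{M(x_{R})}s^{2}e^{-2s/\mu_{2}}\,ds\leq\mu_{2}^{3}\int_{0}^{\infty}z^{2}e^{-2z}\,dz=\frac{\mu_{2}^{3}}{4}.
\]

Multiplying by $\delta\mu^{2}/(\mu_{1}^{2}\mu_{2}^{2})$ and using $\delta\mu\leq\alpha_{\ell}|S|/n$ gives the bound $C\mu_{2}/(n^{2}\mu_{1}^{2})$. The elementary fact itself holds with $C=4e^{-2}$, since $z\mapsto z^{2}e^{-z}$ attains its maximum on $[0,\infty)$ at $z=2$:

\[
z^{2}e^{-2z}=\big(z^{2}e^{-z}\big)e^{-z}\leq 4e^{-2}e^{-z},\qquad z\geq 0.
\]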

Hence, (A.13) is proved.

By (A.13), one finds easily that (A.12) is controlled by

\[
\mathbb{E}\|\delta b\|^{2}\leq C\frac{1}{n^{2}}\sum_{\ell:\inf\{\mu:\mu\in S_{\ell}\}<2/n}|S_{\ell}|+C\frac{1}{n}\int_{2n^{-1}}^{1}\frac{1}{n^{2}\mu}\,d\mu
\]

and thus the result follows.
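The final step can be spelled out as follows. This is a sketch, assuming (as in the case analysis above) that every cell $S_{\ell}$ with $\inf\{\mu:\mu\in S_{\ell}\}<2/n$ is contained in $[0,C/n]$, so that the total length of such cells is at most $C/n$; the case $\mu<0$ is treated symmetrically:

\[
C\frac{1}{n^{2}}\sum_{\ell:\inf\{\mu:\mu\in S_{\ell}\}<2/n}|S_{\ell}|\leq C^{\prime}n^{-3},\qquad C\frac{1}{n}\int_{2n^{-1}}^{1}\frac{1}{n^{2}\mu}\,d\mu=Cn^{-3}\log\frac{n}{2}\leq Cn^{-3}\log n.
\]

Hence $\mathbb{E}\|\delta b\|^{2}\leq Cn^{-3}(1+\log n)$, and the lemma follows from $\mathbb{E}\|\delta b\|\leq\sqrt{\mathbb{E}\|\delta b\|^{2}}$.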