A novel model reduction method to solve inverse problems of parabolic type

Wenlong Zhang Corresponding author. Department of Mathematics &\&& National Center for Applied Mathematics Shenzhen, Southern University of Science and Technology (SUSTech), 1088 Xueyuan Boulevard, University Town of Shenzhen, Xili, Nanshan, Shenzhen, Guangdong Province, P.R.China. ([email protected]).    Zhiwen Zhang Corresponding author. Department of Mathematics, The University of Hong Kong, Pokfulam Road, Hong Kong SAR, P.R.China. ([email protected]).
Abstract

In this paper, we propose novel proper orthogonal decomposition (POD)–based model reduction methods that effectively address the issue of inverse crime in solving parabolic inverse problems. Both the inverse initial value problems and inverse source problems are studied. By leveraging the inherent low-dimensional structures present in the data, our approach enables a reduction in the forward model complexity without compromising the accuracy of the inverse problem solution. Besides, we prove the convergence analysis of the proposed methods for solving parabolic inverse problems. Through extensive experimentation and comparative analysis, we demonstrate the effectiveness of our method in overcoming inverse crime and achieving improved inverse problem solutions. The proposed POD model reduction method offers a promising direction for improving the reliability and applicability of inverse problem-solving techniques in various domains.

AMS subject classification: 35R30, 65J20, 65M12, 65N21, 78M34.

Keywords: parabolic inverse problem; regularization method; model reduction method; inverse crime; convergence analysis.

1 Introduction

Inverse crime, the phenomenon where the forward model used for solving an inverse problem is the same as the one used for generating the data, poses a significant challenge in accurate and reliable inverse problem solutions.

Inverse problems arise in various fields of science and engineering, ranging from medical imaging and geophysics to material science and finance. Inverse problems require the estimation of an unknown parameter or field of interest from indirect measurements, which are often noisy and incomplete. The solution of inverse problems is challenging due to the ill-posedness of the problem, which leads to unstable and non-unique solutions. To overcome these challenges, various regularization techniques have been proposed to impose constraints on the solution space. However, the accuracy and reliability of inverse problem solutions can be significantly impacted by inverse crime.

Inverse crime refers to a situation where the forward model used for generating the data is the same as the one used for solving the inverse problem. This scenario leads to overly optimistic results and underestimates the uncertainties associated with the solution. Inverse crime can be a significant issue in practical applications, where the forward model is often an approximation of the underlying physical system and contains modeling errors and uncertainties.

To overcome the issue of inverse crime, we propose a novel Proper Orthogonal Decomposition (POD) model reduction method for solving inverse problems. The POD method is a data-driven technique that enables the identification of the dominant modes of variability in the data and the construction of a low-dimensional representation of the data. By leveraging the inherent low-dimensional structures present in the data, the POD method enables the reduction of the forward model complexity without compromising the accuracy of the inverse problem solution.

In this paper, we outline our new POD model reduction method for solving inverse problems and demonstrate its effectiveness in overcoming inverse crime. We first introduce the basic principles of the POD method and its application in inverse problems. We then present our new method for addressing the issue of inverse crime by incorporating the POD method into the inverse problem solution process. We demonstrate the performance of our method through extensive experimentation and comparative analysis with state-of-the-art methods. The results show that our proposed POD model reduction method outperforms existing methods in terms of accuracy and reliability, and offers a promising avenue for enhancing the applicability of inverse problem-solving techniques in various domains.

One of the successful model reduction ideas in solving time-evolution problems is the proper orthogonal decomposition (POD) method [14, 4]. The POD method uses the data from an experiment or an accurate numerical simulation and extracts the most energetic modes in the system by using the singular value decomposition. This approach generates low-dimensional structures that can approximate the solutions to the time-evolution problem with high accuracy. The POD method has been used to solve many types of PDEs, including linear parabolic equations [16, 11], Navier‐Stokes equations [11], viscous G-equations [8], Hamilton–Jacobi–Bellman (HJB) equations [12], and optimal control problems [2]. The interested reader is referred to [13, 3, 9] for a comprehensive introduction to the model reduction methods.

In this paper, we will develop a novel POD method to solve the forward and inverse problems of the parabolic type.

To start with, we consider a parabolic equation as follows:

{ut+u=f(x)in Ω×(0,T),u(x,t)=0on Ω×(0,T),u(x,0)=g(x)in Ω,\left\{\begin{aligned} u_{t}+\mathcal{L}u&=f(x)&\mbox{in }\Omega\times(0,T),\\ u(x,t)&=0&\mbox{on }\partial\Omega\times(0,T),\\ u(x,0)&=g(x)&\mbox{in }\Omega\,,\end{aligned}\right.{ start_ROW start_CELL italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + caligraphic_L italic_u end_CELL start_CELL = italic_f ( italic_x ) end_CELL start_CELL in roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL italic_u ( italic_x , italic_t ) end_CELL start_CELL = 0 end_CELL start_CELL on ∂ roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL italic_u ( italic_x , 0 ) end_CELL start_CELL = italic_g ( italic_x ) end_CELL start_CELL in roman_Ω , end_CELL end_ROW (1.1)

where ΩdΩsuperscript𝑑\Omega\subset\mathbb{R}^{d}roman_Ω ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT (d=1,2,3)𝑑123(d=1,2,3)( italic_d = 1 , 2 , 3 ) is a bounded domain with a C2superscript𝐶2C^{2}italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT boundary or a convex domain satisfying the uniform cone condition, \mathcal{L}caligraphic_L denotes a second-order elliptic operator given by u=(q(x)u)+c(x)u𝑢𝑞𝑥𝑢𝑐𝑥𝑢\mathcal{L}u=-\nabla\cdot(q(x)\nabla u)+c(x)ucaligraphic_L italic_u = - ∇ ⋅ ( italic_q ( italic_x ) ∇ italic_u ) + italic_c ( italic_x ) italic_u, and g(x)𝑔𝑥g(x)italic_g ( italic_x ) is the initial condition. We assume the elliptic operator \mathcal{L}caligraphic_L is uniform elliptic, i.e., there exist qmin,amax>0subscript𝑞subscript𝑎0q_{\min},a_{\max}>0italic_q start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT > 0 such that qmin<q(x)<qmaxsubscript𝑞𝑞𝑥subscript𝑞q_{\min}<q(x)<q_{\max}italic_q start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT < italic_q ( italic_x ) < italic_q start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT for all xΩ𝑥Ωx\in\Omegaitalic_x ∈ roman_Ω. Additionally, we assume q(x)C1(Ω¯)𝑞𝑥superscript𝐶1¯Ωq(x)\in C^{1}(\bar{\Omega})italic_q ( italic_x ) ∈ italic_C start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( over¯ start_ARG roman_Ω end_ARG ), c(x)C(Ω¯)𝑐𝑥𝐶¯Ωc(x)\in C(\bar{\Omega})italic_c ( italic_x ) ∈ italic_C ( over¯ start_ARG roman_Ω end_ARG ) and c(x)0𝑐𝑥0c(x)\geq 0italic_c ( italic_x ) ≥ 0.

Let u𝑢uitalic_u represent the solution of the parabolic equation (1.1). We define the forward operator 𝒮::𝒮absent\mathcal{S}:caligraphic_S : 𝒮(f,g)=u(,T)𝒮𝑓𝑔𝑢𝑇\mathcal{S}(f,g)=u(\cdot,T)caligraphic_S ( italic_f , italic_g ) = italic_u ( ⋅ , italic_T ). The forward problem involves computing the solution u(,t)𝑢𝑡u(\cdot,t)italic_u ( ⋅ , italic_t ) for t>0𝑡0t>0italic_t > 0 given the source term f(x)𝑓𝑥f(x)italic_f ( italic_x ) and initial condition g(x)𝑔𝑥g(x)italic_g ( italic_x ). The inverse problem, on the other hand, aims to reconstruct f(x)𝑓𝑥f(x)italic_f ( italic_x ) or g(x)𝑔𝑥g(x)italic_g ( italic_x ) from the final time measurement m=u(,T)𝑚𝑢𝑇m=u(\cdot,T)italic_m = italic_u ( ⋅ , italic_T ). Typically, iterative methods are employed to solve the inverse problem. During each iterative step, one may need to solve the forward problem one or more times. Consequently, the majority of computations expenses are attributed to the computation of the forward problems.

In this paper, we will solve two types of inverse problems:

  1. 1.

    Inverse source problem: recover the source term f(x)𝑓𝑥f(x)italic_f ( italic_x ) using the final time measurement m=u(,T)𝑚𝑢𝑇m=u(\cdot,T)italic_m = italic_u ( ⋅ , italic_T ) and the known initial term g(x)𝑔𝑥g(x)italic_g ( italic_x ).

  2. 2.

    Backward problem: recover the initial term g(x)𝑔𝑥g(x)italic_g ( italic_x ) using the final time measurement m=u(,T)𝑚𝑢𝑇m=u(\cdot,T)italic_m = italic_u ( ⋅ , italic_T ) and the known source term f(x)𝑓𝑥f(x)italic_f ( italic_x ).

Iterative methods are usually used to solve the inverse problems. For each iterative step, one may have to solve the forward problem one or more times, thus most of the computations are costed by the computation of the forward problem.

To solve the inverse problem in a faster way, the authors construct the POD basis functions from the snapshot solutions of the parabolic equation (1.1) with fixed source functions in [17]. The proposed method accelerates the computation of the inverse source problem, yet In this paper, we develop a novel POD method to solve the forward and inverse problems of the parabolic type. We will give a brief review of the traditional POD method in the appendix A, including the construction of the POD basis functions.

The rest of the paper is organized as follows. In Section 2, we introduce the Ajoint-POD method for solving parabolic inverse source problems and provide the error estimate for the proposed methods. Similarly, in Section 3, we propose the Ajoint-POD method for solving parabolic backward problem and provide the corresponding error estimate. In Section 4, we present numerical results to demonstrate the accuracy of our method. Finally, concluding remarks are made in Section 5.

2 Ajoint-POD method for parabolic inverse source problems

The traditional POD method has a drawback: to construct the POD basis functions, one needs to know the source term f(x)𝑓𝑥f(x)italic_f ( italic_x ) or the initial term g(x)𝑔𝑥g(x)italic_g ( italic_x ) in advance. However, in inverse problems, the source term or the initial term is precisely what we want to find. This can lead to the so-called inverse crime, which should be avoided in practice. In [17], the authors studied this issue by assuming that the true source term belongs to a known function class, thus avoiding the inverse crime. However, this approach does not completely address the issue of the inverse crime.

To tackle this challenge, we propose a novel model reduction method for this type of inverse problem: the Adjoint-POD method in this paper. Our new method efficiently solves inverse problems without requiring a priori information about the source term or initial term. By combining the Adjoint method’s strengths with the POD method’s model reduction capabilities, the Adjoint-POD method can efficiently and quickly solve inverse problems while avoiding the inverse crime issue.

2.1 Ajoint POD method

To demonstrate the idea of the Adjoint-POD method, we will first apply it to solve the inverse source problem. For the inverse source problem of the parabolic equation, the objective is to recover the unknown source term f(x)𝑓𝑥f(x)italic_f ( italic_x ), given the final time measurement m(x)=𝒮(f)=u(,T)𝑚𝑥𝒮𝑓𝑢𝑇m(x)=\mathcal{S}(f)=u(\cdot,T)italic_m ( italic_x ) = caligraphic_S ( italic_f ) = italic_u ( ⋅ , italic_T ). In this case, u𝑢uitalic_u satisfies the following equation:

{ut+u=f(x)in Ω×(0,T),u(x,t)=0on Ω×(0,T),u(x,0)=0in Ω.\left\{\begin{aligned} u_{t}+\mathcal{L}u&=f(x)&\mbox{in }\Omega\times(0,T),\\ u(x,t)&=0&\mbox{on }\partial\Omega\times(0,T),\\ u(x,0)&=0&\mbox{in }\Omega\,.\end{aligned}\right.{ start_ROW start_CELL italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + caligraphic_L italic_u end_CELL start_CELL = italic_f ( italic_x ) end_CELL start_CELL in roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL italic_u ( italic_x , italic_t ) end_CELL start_CELL = 0 end_CELL start_CELL on ∂ roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL italic_u ( italic_x , 0 ) end_CELL start_CELL = 0 end_CELL start_CELL in roman_Ω . end_CELL end_ROW (2.1)

Here we assume u(x,t)=0𝑢𝑥𝑡0u(x,t)=0italic_u ( italic_x , italic_t ) = 0 and u(x,0)=0𝑢𝑥00u(x,0)=0italic_u ( italic_x , 0 ) = 0 for simplicity, otherwise, one just need to subtract the background solution from the measurement m(x)𝑚𝑥m(x)italic_m ( italic_x ). Since the source term f(x)𝑓𝑥f(x)italic_f ( italic_x ) is unknown, we cannot use the traditional POD method to obtain snapshots. Instead, we will acquire the snapshots from the following adjoint equation:

{u~t+u~=m(x)in Ω×(0,T),u~(x,t)=0on Ω×(0,T),u~(x,0)=0in Ω.\left\{\begin{aligned} \tilde{u}_{t}+\mathcal{L}\tilde{u}&=m(x)&\mbox{in }% \Omega\times(0,T),\\ \tilde{u}(x,t)&=0&\mbox{on }\partial\Omega\times(0,T),\\ \tilde{u}(x,0)&=0&\mbox{in }\Omega\,.\end{aligned}\right.{ start_ROW start_CELL over~ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + caligraphic_L over~ start_ARG italic_u end_ARG end_CELL start_CELL = italic_m ( italic_x ) end_CELL start_CELL in roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL over~ start_ARG italic_u end_ARG ( italic_x , italic_t ) end_CELL start_CELL = 0 end_CELL start_CELL on ∂ roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL over~ start_ARG italic_u end_ARG ( italic_x , 0 ) end_CELL start_CELL = 0 end_CELL start_CELL in roman_Ω . end_CELL end_ROW (2.2)

Denote the snapshots y~k=u~(,tk1)subscript~𝑦𝑘~𝑢subscript𝑡𝑘1\tilde{y}_{k}=\tilde{u}(\cdot,t_{k-1})over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ), k=1,,M+1𝑘1𝑀1k=1,\ldots,M+1italic_k = 1 , … , italic_M + 1 with M=TΔt𝑀𝑇Δ𝑡M=\frac{T}{\Delta t}italic_M = divide start_ARG italic_T end_ARG start_ARG roman_Δ italic_t end_ARG, and y~k=¯u~(,tkM1)subscript~𝑦𝑘¯~𝑢subscript𝑡𝑘𝑀1\tilde{y}_{k}=\overline{\partial}\tilde{u}(\cdot,t_{k-M-1})over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = over¯ start_ARG ∂ end_ARG over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k - italic_M - 1 end_POSTSUBSCRIPT ), k=M+2,,2m+1𝑘𝑀22𝑚1k=M+2,\ldots,2m+1italic_k = italic_M + 2 , … , 2 italic_m + 1 with ¯u~(,tk)=u~(,tk)u~(,tk1)Δt¯~𝑢subscript𝑡𝑘~𝑢subscript𝑡𝑘~𝑢subscript𝑡𝑘1Δ𝑡\overline{\partial}\tilde{u}(\cdot,t_{k})=\frac{\tilde{u}(\cdot,t_{k})-\tilde{% u}(\cdot,t_{k-1})}{\Delta t}over¯ start_ARG ∂ end_ARG over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = divide start_ARG over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG roman_Δ italic_t end_ARG, k=1,,M𝑘1𝑀k=1,\ldots,Mitalic_k = 1 , … , italic_M. Then we construct the new POD basis {ψ1,,ψNpod}subscript𝜓1subscript𝜓subscript𝑁pod\{\psi_{1},...,\psi_{N_{\text{pod}}}\}{ italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ψ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUBSCRIPT } using the method described in Appendix A from the adjoint equation (2.2). Denote VPOD=span{ψ1,,ψNpod}subscript𝑉POD𝑠𝑝𝑎𝑛subscript𝜓1subscript𝜓subscript𝑁podV_{\text{POD}}=span\{\psi_{1},...,\psi_{N_{\text{pod}}}\}italic_V start_POSTSUBSCRIPT POD end_POSTSUBSCRIPT = italic_s italic_p italic_a italic_n { italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ψ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUBSCRIPT }

We consider using these new POD basis functions {ψ1,,ψNpod}subscript𝜓1subscript𝜓subscript𝑁pod\{\psi_{1},...,\psi_{N_{\text{pod}}}\}{ italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ψ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUBSCRIPT } to approximate the forward problem to accelerate the computation. The fully discrete scheme is constructed on Vpodsubscript𝑉podV_{\text{pod}}italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT and the solution is denoted by Uksubscript𝑈𝑘U_{k}italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for k=1M𝑘1𝑀k=1\cdots Mitalic_k = 1 ⋯ italic_M. To be precise, we seek numerical solutions Uksubscript𝑈𝑘U_{k}italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT’s such that

(¯Uk,ψ)+a(Uk,ψ)=(f,ψ),ψVpod.formulae-sequence¯subscript𝑈𝑘𝜓𝑎subscript𝑈𝑘𝜓𝑓𝜓for-all𝜓subscript𝑉pod(\bar{\partial}U_{k},\psi)+a(U_{k},\psi)=(f,\psi),\quad\forall\psi\in{V_{\text% {pod}}}.( over¯ start_ARG ∂ end_ARG italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ψ ) + italic_a ( italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ψ ) = ( italic_f , italic_ψ ) , ∀ italic_ψ ∈ italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT . (2.3)

Here the bilinear form a(u,v)=(qu,v)+(cu,v)𝑎𝑢𝑣𝑞𝑢𝑣𝑐𝑢𝑣a(u,v)=(q\nabla u,\nabla v)+(cu,v)italic_a ( italic_u , italic_v ) = ( italic_q ∇ italic_u , ∇ italic_v ) + ( italic_c italic_u , italic_v ). We define the solution operator from the source term f𝑓fitalic_f to the final time solution UMsubscript𝑈𝑀U_{M}italic_U start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT as 𝒮podsubscript𝒮pod\mathcal{S}_{\text{pod}}caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT, i.e., 𝒮podf=UMsubscript𝒮pod𝑓subscript𝑈𝑀\mathcal{S}_{\text{pod}}f=U_{M}caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_f = italic_U start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT. Using the new POD basis functions and the reduced-order model represented by 𝒮podsubscript𝒮pod\mathcal{S}_{\text{pod}}caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT, we can efficiently solve the forward problem for each time step, significantly reducing the computational cost compared to the full-scale model. This approach is particularly useful when solving inverse problems, where multiple forward problem evaluations are required.

2.2 Convergence of the Adjoint-POD method

We will first revisit an important property of the eigenvalue distribution for the classical elliptic operator \mathcal{L}caligraphic_L [1, 7].

Proposition 2.1

Suppose ΩΩ\Omegaroman_Ω is a bounded domain in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and a(x),c(x)C0(Ω¯)𝑎𝑥𝑐𝑥superscript𝐶0¯Ωa(x),c(x)\in C^{0}(\bar{\Omega})italic_a ( italic_x ) , italic_c ( italic_x ) ∈ italic_C start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( over¯ start_ARG roman_Ω end_ARG ), c(x)0𝑐𝑥0c(x)\geq 0italic_c ( italic_x ) ≥ 0, then the eigenvalue problem

ψ=μψwithψΩ=0𝜓𝜇𝜓withsubscript𝜓Ω0\displaystyle\mathcal{L}\psi=\mu\,\psi~{}~{}\text{with}~{}~{}\psi_{\partial% \Omega}=0caligraphic_L italic_ψ = italic_μ italic_ψ with italic_ψ start_POSTSUBSCRIPT ∂ roman_Ω end_POSTSUBSCRIPT = 0 (2.4)

has a countable set of positive eigenvalues μ1μ2subscript𝜇1subscript𝜇2\mu_{1}\leq\mu_{2}\leq\cdotsitalic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ ⋯, with its corresponding eigenfunctions {ϕk}k=1superscriptsubscriptsubscriptitalic-ϕ𝑘𝑘1\{\phi_{k}\}_{k=1}^{\infty}{ italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT forming an orthogonal basis of L2(Ω)superscript𝐿2ΩL^{2}(\Omega)italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ). Moreover, there exist constants C1,C2>0subscript𝐶1subscript𝐶20C_{1},C_{2}>0italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0 such that C1k2/dμkC2k2/dsubscript𝐶1superscript𝑘2𝑑subscript𝜇𝑘subscript𝐶2superscript𝑘2𝑑C_{1}k^{2/d}\leq\mu_{k}\leq C_{2}k^{2/d}italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT 2 / italic_d end_POSTSUPERSCRIPT ≤ italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT 2 / italic_d end_POSTSUPERSCRIPT for all k=1,2,.𝑘12k=1,2,\cdots.italic_k = 1 , 2 , ⋯ .

From the Proposition above, eigenfunction set {ϕk}k=1superscriptsubscriptsubscriptitalic-ϕ𝑘𝑘1\{\phi_{k}\}_{k=1}^{\infty}{ italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT forms an orthogonal basis of L2(Ω)superscript𝐿2ΩL^{2}(\Omega)italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ). Then for any fL2(Ω)𝑓superscript𝐿2Ωf\in L^{2}(\Omega)italic_f ∈ italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ), we write f=k=1fkϕk𝑓superscriptsubscript𝑘1subscript𝑓𝑘subscriptitalic-ϕ𝑘f=\sum_{k=1}^{\infty}f_{k}\phi_{k}italic_f = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for a set of coefficients fksubscript𝑓𝑘f_{k}italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Let u=k=1uk(t)ϕk𝑢superscriptsubscript𝑘1subscript𝑢𝑘𝑡subscriptitalic-ϕ𝑘u=\sum_{k=1}^{\infty}u_{k}(t)\phi_{k}italic_u = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be the solution of the problem (2.1). Substituting these two expressions of f𝑓fitalic_f and u𝑢uitalic_u into the first equation of (2.1), we get by noting the fact that ϕk=μkϕksubscriptitalic-ϕ𝑘subscript𝜇𝑘subscriptitalic-ϕ𝑘\mathcal{L}\phi_{k}=\mu_{k}\phi_{k}caligraphic_L italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and comparing the coefficients of ϕksubscriptitalic-ϕ𝑘\phi_{k}italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT on both sides of the equation that uk(0)=0subscript𝑢𝑘00u_{k}(0)=0italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( 0 ) = 0 and

uk(t)+μkuk=fkin(0,T).subscriptsuperscript𝑢𝑘𝑡subscript𝜇𝑘subscript𝑢𝑘subscript𝑓𝑘in0𝑇u^{\prime}_{k}(t)+\mu_{k}u_{k}=f_{k}\quad\quad\text{in}~{}(0,T)\,.italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) + italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in ( 0 , italic_T ) . (2.5)

This equation expresses the time evolution of the coefficients uk(t)subscript𝑢𝑘𝑡u_{k}(t)italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) in terms of the coefficients fksubscript𝑓𝑘f_{k}italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT of the source term f𝑓fitalic_f. We can write the solution as uk(T)=αkfksubscript𝑢𝑘𝑇subscript𝛼𝑘subscript𝑓𝑘u_{k}(T)=\alpha_{k}\,f_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_T ) = italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, with αk=eμkT0Teμks𝑑s=1μk(1eμkT)subscript𝛼𝑘superscript𝑒subscript𝜇𝑘𝑇superscriptsubscript0𝑇superscript𝑒subscript𝜇𝑘𝑠differential-d𝑠1subscript𝜇𝑘1superscript𝑒subscript𝜇𝑘𝑇\alpha_{k}=e^{-\mu_{k}T}\int_{0}^{T}e^{\mu_{k}s}ds=\frac{1}{\mu_{k}}(1-e^{-\mu% _{k}T})italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_s end_POSTSUPERSCRIPT italic_d italic_s = divide start_ARG 1 end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT ). Noting that Sf=u(,T)=k=1uk(T)ϕk𝑆𝑓𝑢𝑇subscriptsuperscript𝑘1subscript𝑢𝑘𝑇subscriptitalic-ϕ𝑘Sf=u(\cdot,T)=\sum^{\infty}_{k=1}u_{k}(T)\phi_{k}italic_S italic_f = italic_u ( ⋅ , italic_T ) = ∑ start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_T ) italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, we can formally write

S(k=1fkϕk)=k=1αkfkϕk.𝑆superscriptsubscript𝑘1subscript𝑓𝑘subscriptitalic-ϕ𝑘superscriptsubscript𝑘1subscript𝛼𝑘subscript𝑓𝑘subscriptitalic-ϕ𝑘S\Big{(}\sum_{k=1}^{\infty}f_{k}\phi_{k}\Big{)}=\sum_{k=1}^{\infty}\alpha_{k}f% _{k}\phi_{k}.italic_S ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT .

This representation of the solution operator S𝑆Sitalic_S provides a convenient way to compute the solution u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) using the eigenfunctions ϕksubscriptitalic-ϕ𝑘\phi_{k}italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and the coefficients αksubscript𝛼𝑘\alpha_{k}italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. For simplicity, we approximate the source term f(x)𝑓𝑥f(x)italic_f ( italic_x ) by a finite-dimensional truncation, i.e.

fapp=k=1Lfkϕk.subscript𝑓appsuperscriptsubscript𝑘1𝐿subscript𝑓𝑘subscriptitalic-ϕ𝑘f_{\text{app}}=\sum_{k=1}^{L}f_{k}\phi_{k}.italic_f start_POSTSUBSCRIPT app end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (2.6)

Then, the solution u(x,t)𝑢𝑥𝑡u(x,t)italic_u ( italic_x , italic_t ) of the parabolic equation has the form:

u(,T)=k=1L1μk(1eμkT)fkϕk.𝑢𝑇subscriptsuperscript𝐿𝑘11subscript𝜇𝑘1superscript𝑒subscript𝜇𝑘𝑇subscript𝑓𝑘subscriptitalic-ϕ𝑘u(\cdot,T)=\sum^{L}_{k=1}\frac{1}{\mu_{k}}(1-e^{-\mu_{k}T})f_{k}\phi_{k}.italic_u ( ⋅ , italic_T ) = ∑ start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT ) italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (2.7)

After simple calculation, we will also derive that

u~(x,t)=k=1L1μk(1eμkT)(1eμkt)fkϕk.~𝑢𝑥𝑡subscriptsuperscript𝐿𝑘11subscript𝜇𝑘1superscript𝑒subscript𝜇𝑘𝑇1superscript𝑒subscript𝜇𝑘𝑡subscript𝑓𝑘subscriptitalic-ϕ𝑘\tilde{u}(x,t)=\sum^{L}_{k=1}\frac{1}{\mu_{k}}(1-e^{-\mu_{k}T})(1-e^{-\mu_{k}t% })f_{k}\phi_{k}.over~ start_ARG italic_u end_ARG ( italic_x , italic_t ) = ∑ start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT ) ( 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT ) italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (2.8)

Actually the POD basis (A.4) is nothing but the singular value decomposition of the matrix A~=(y~1,,y~M)~𝐴subscript~𝑦1subscript~𝑦𝑀\tilde{A}=(\tilde{y}_{1},...,\tilde{y}_{M})over~ start_ARG italic_A end_ARG = ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ), where y~j=(u~(x1,tj),,u~(xN,tj))Tsubscript~𝑦𝑗superscript~𝑢subscript𝑥1subscript𝑡𝑗~𝑢subscript𝑥𝑁subscript𝑡𝑗𝑇\tilde{y}_{j}=(\tilde{u}(x_{1},t_{j}),...,\tilde{u}(x_{N},t_{j}))^{T}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( over~ start_ARG italic_u end_ARG ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , … , over~ start_ARG italic_u end_ARG ( italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT. Here x1,,xNsubscript𝑥1subscript𝑥𝑁x_{1},...,x_{N}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT are the finite element nodes in ΩΩ\Omegaroman_Ω. Suppose A𝐴Aitalic_A has the singular value decomposition: A=UΣV𝐴𝑈Σ𝑉A=U\Sigma Vitalic_A = italic_U roman_Σ italic_V, then ψkssubscript𝜓𝑘𝑠\psi_{k}sitalic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_s are exactly the first M𝑀Mitalic_M columns of U𝑈Uitalic_U.

Let us denote A=(y1,,yM)𝐴subscript𝑦1subscript𝑦𝑀A=(y_{1},...,y_{M})italic_A = ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) and A~=(y~1,,y~M)~𝐴subscript~𝑦1subscript~𝑦𝑀\tilde{A}=(\tilde{y}_{1},...,\tilde{y}_{M})over~ start_ARG italic_A end_ARG = ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ), the matrix Φ=(ϕ1(x,t1),,ϕL(x,t1))Φsubscriptitalic-ϕ1𝑥subscript𝑡1subscriptitalic-ϕ𝐿𝑥subscript𝑡1\Phi=(\phi_{1}(\vec{x},t_{1}),...,\phi_{L}(\vec{x},t_{1}))roman_Φ = ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( over→ start_ARG italic_x end_ARG , italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_ϕ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ( over→ start_ARG italic_x end_ARG , italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ), F=diag(f1,,fL)𝐹diagsubscript𝑓1subscript𝑓𝐿F=\text{diag}(f_{1},...,f_{L})italic_F = diag ( italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_f start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ) and D=diag(1μk(1eμkT),,1μL(1eμLT))𝐷diag1subscript𝜇𝑘1superscript𝑒subscript𝜇𝑘𝑇1subscript𝜇𝐿1superscript𝑒subscript𝜇𝐿𝑇D=\text{diag}(\frac{1}{\mu_{k}}(1-e^{-\mu_{k}T}),...,\frac{1}{\mu_{L}}(1-e^{-% \mu_{L}T}))italic_D = diag ( divide start_ARG 1 end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT ) , … , divide start_ARG 1 end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT end_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT ) ). Additionally, let us define an L×M𝐿𝑀L\times Mitalic_L × italic_M matrix J𝐽Jitalic_J with entries J(i,j)=1μi(1eμitj)𝐽𝑖𝑗1subscript𝜇𝑖1superscript𝑒subscript𝜇𝑖subscript𝑡𝑗J(i,j)=\frac{1}{\mu_{i}}(1-e^{-\mu_{i}t_{j}})italic_J ( italic_i , italic_j ) = divide start_ARG 1 end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ( 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ). ΦΦ\Phiroman_Φ is a column orthogonal matrix due to the normal orthogonality of the eigenfunctions ϕksubscriptitalic-ϕ𝑘\phi_{k}italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Utilizing the formulations of u𝑢uitalic_u and u~~𝑢\tilde{u}over~ start_ARG italic_u end_ARG, we can represent the matrices A𝐴Aitalic_A and A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG as follows:

A=ΦFJ,andA~=ΦDFJ.formulae-sequence𝐴Φ𝐹𝐽and~𝐴Φ𝐷𝐹𝐽A=\Phi FJ,~{}~{}\text{and}~{}~{}\tilde{A}=\Phi DFJ.italic_A = roman_Φ italic_F italic_J , and over~ start_ARG italic_A end_ARG = roman_Φ italic_D italic_F italic_J . (2.9)

Proposition A.1 demonstrates that the low-rank space Vpodsubscript𝑉podV_{\text{pod}}italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT is the best Npodsubscript𝑁podN_{\text{pod}}italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT-rank approximation of the column space of A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG. Our objective is to show that Vpodsubscript𝑉podV_{\text{pod}}italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT is also a good approximation of the column space of A𝐴Aitalic_A, which will validate the efficiency of the new POD method. To begin, let us establish the relationship between the matrices A𝐴Aitalic_A and A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG.

Lemma 2.2

If LM𝐿𝑀L\leq Mitalic_L ≤ italic_M, then span{y1,,yM}=span{y~1,,y~M}𝑠𝑝𝑎𝑛subscript𝑦1subscript𝑦𝑀𝑠𝑝𝑎𝑛subscript~𝑦1subscript~𝑦𝑀span\{y_{1},...,y_{M}\}=span\{\tilde{y}_{1},...,\tilde{y}_{M}\}italic_s italic_p italic_a italic_n { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } = italic_s italic_p italic_a italic_n { over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }, i.e. C(A)=C(A~)𝐶𝐴𝐶~𝐴C(A)=C(\tilde{A})italic_C ( italic_A ) = italic_C ( over~ start_ARG italic_A end_ARG ).

Proof. Here we provide a concise proof for the case when the eigenvalues μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are distinct from each other. The proof for the case of repeated eigenvalues follows a similar approach. To demonstrate the desired results, we need to show that the existence of matrices P𝑃Pitalic_P and P~~𝑃\tilde{P}over~ start_ARG italic_P end_ARG such that,

ΦDFJP=ΦFJ,Φ𝐷𝐹𝐽𝑃Φ𝐹𝐽\Phi DFJP=\Phi FJ,roman_Φ italic_D italic_F italic_J italic_P = roman_Φ italic_F italic_J ,
ΦDFJ=ΦFJP~.Φ𝐷𝐹𝐽Φ𝐹𝐽~𝑃\Phi DFJ=\Phi FJ\tilde{P}.roman_Φ italic_D italic_F italic_J = roman_Φ italic_F italic_J over~ start_ARG italic_P end_ARG .

We will only present a brief proof for the first equality, as the second can be derived in a similar manner.

Since the columns of ΦΦ\Phiroman_Φ are independent and the diagonal matrix F𝐹Fitalic_F is invertible, it suffices to prove the existence of a matrix P𝑃Pitalic_P such that

JP=DJ.𝐽𝑃𝐷𝐽JP=DJ.italic_J italic_P = italic_D italic_J .

First, we show that the matrix JL×Lsuperscriptsubscript𝐽𝐿𝐿J_{L\times L}^{\prime}italic_J start_POSTSUBSCRIPT italic_L × italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with entries J(i,j)=1eμitjsuperscript𝐽𝑖𝑗1superscript𝑒subscript𝜇𝑖subscript𝑡𝑗J^{\prime}(i,j)=1-e^{-\mu_{i}t_{j}}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_i , italic_j ) = 1 - italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is invertible. Denote the vector e=(1,,1)Tesuperscript11𝑇\textbf{e}=(1,...,1)^{T}e = ( 1 , … , 1 ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT. Then, we can express Jsuperscript𝐽J^{\prime}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT as

J=eeTVL,superscript𝐽superscriptee𝑇subscript𝑉𝐿J^{\prime}=\textbf{e}\textbf{e}^{T}-V_{L},italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = bold_e bold_e start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT - italic_V start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ,

where VL(i,j)=eμitjsubscript𝑉𝐿𝑖𝑗superscript𝑒subscript𝜇𝑖subscript𝑡𝑗V_{L}(i,j)=e^{-\mu_{i}t_{j}}italic_V start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ( italic_i , italic_j ) = italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is a Vandermonde matrix. To prove the invertibility of Jsuperscript𝐽J^{\prime}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, we assume, by contradiction, that Jsuperscript𝐽J^{\prime}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is singular. In that case, there exists a nonzero vector c=(c1,,cL)Tcsuperscriptsubscript𝑐1subscript𝑐𝐿𝑇\textbf{c}=(c_{1},...,c_{L})^{T}c = ( italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_c start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT such that

Jc=0,superscript𝐽c0J^{\prime}\textbf{c}=0,italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT c = 0 ,

or equivalently,

VLc=eeTc.subscript𝑉𝐿csuperscriptee𝑇cV_{L}\textbf{c}=\textbf{e}\textbf{e}^{T}\textbf{c}.italic_V start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT c = bold_e bold_e start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT c .

Now, consider the function f(x)=j=1Lcjextj𝑓𝑥superscriptsubscript𝑗1𝐿subscript𝑐𝑗superscript𝑒𝑥subscript𝑡𝑗f(x)=\sum_{j=1}^{L}c_{j}e^{xt_{j}}italic_f ( italic_x ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_x italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Under this assumption, we have that

f(0)=f(μ1)=f(μ2)==f(μL),𝑓0𝑓subscript𝜇1𝑓subscript𝜇2𝑓subscript𝜇𝐿f(0)=f(\mu_{1})=f(\mu_{2})=\cdots=f(\mu_{L}),italic_f ( 0 ) = italic_f ( italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = italic_f ( italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = ⋯ = italic_f ( italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ) ,

which implies that the function f𝑓fitalic_f has L+1𝐿1L+1italic_L + 1 distinct zeros. This implies that the derivative f(x)=j=1Lcjtjextjsuperscript𝑓𝑥superscriptsubscript𝑗1𝐿subscript𝑐𝑗subscript𝑡𝑗superscript𝑒𝑥subscript𝑡𝑗f^{\prime}(x)=\sum_{j=1}^{L}c_{j}t_{j}e^{xt_{j}}italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_x italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT has L𝐿Litalic_L distinct zeros. Since c is a nonzero vector and all μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPTs are all nonzero, this will imply that the Vandermonde matrix VLsubscript𝑉𝐿V_{L}italic_V start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT is singular, which contradicts the fact that VLsubscript𝑉𝐿V_{L}italic_V start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT is an invertible matrix. Consequently, Jsuperscript𝐽J^{\prime}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT must be a nonsingular matrix.

Since the invertibility of Jsuperscript𝐽J^{\prime}italic_J start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, the first L𝐿Litalic_L columns of J𝐽Jitalic_J are independent and thus form a basis for RLsuperscript𝑅𝐿R^{L}italic_R start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT. Similarly, the matrix DJ𝐷𝐽DJitalic_D italic_J also has independent columns that form a basis for RLsuperscript𝑅𝐿R^{L}italic_R start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT. Consequently, there must exist a matrix P𝑃Pitalic_P such that

JP=DJ.𝐽𝑃𝐷𝐽JP=DJ.italic_J italic_P = italic_D italic_J .

This result establishes that the spans of the sets {y1,,yM}subscript𝑦1subscript𝑦𝑀\{y_{1},...,y_{M}\}{ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } and {y~1,,y~M}subscript~𝑦1subscript~𝑦𝑀\{\tilde{y}_{1},...,\tilde{y}_{M}\}{ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } are equivalent, i.e., span{y1,,yM}=span{y~1,,y~M}spansubscript𝑦1subscript𝑦𝑀spansubscript~𝑦1subscript~𝑦𝑀\operatorname{span}\{y_{1},\dots,y_{M}\}=\operatorname{span}\{\tilde{y}_{1},% \dots,\tilde{y}_{M}\}roman_span { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } = roman_span { over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }. This ends the proof.

Based on Proposition A.1, the new POD basis effectively approximates the set {y~1,,y~M}subscript~𝑦1subscript~𝑦𝑀\{\tilde{y}_{1},...,\tilde{y}_{M}\}{ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }. Given the previous results, we can now demonstrate that the new POD basis also serves as a good approximation for the original set {y1,,yM}subscript𝑦1subscript𝑦𝑀\{y_{1},...,y_{M}\}{ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }.

Theorem 2.3

Using the same notation as in Proposition A.1, if a sufficient number of snapshots are available, i.e. LM𝐿𝑀L\leq Mitalic_L ≤ italic_M, then the following approximation error bound holds:

i=1MyiPpodyiL2(Ω)2i=1MyiL2(Ω)2CL4/dρ,superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖subscript𝑃podsubscript𝑦𝑖superscript𝐿2Ω2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖superscript𝐿2Ω2𝐶superscript𝐿4𝑑𝜌\frac{\sum_{i=1}^{M}\left|\left|y_{i}-P_{\text{pod}}y_{i}\right|\right|_{L^{2}% (\Omega)}^{2}}{\sum_{i=1}^{M}\left|\left|y_{i}\right|\right|_{L^{2}(\Omega)}^{% 2}}\leq CL^{4/d}\rho,divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ≤ italic_C italic_L start_POSTSUPERSCRIPT 4 / italic_d end_POSTSUPERSCRIPT italic_ρ , (2.10)

where Ppodsubscript𝑃podP_{\text{pod}}italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT is the projection operator onto the adjoint-POD space span{ψ1,,ψNpod}spansubscript𝜓1subscript𝜓subscript𝑁pod\operatorname{span}\{\psi_{1},\dots,\psi_{N_{\text{pod}}}\}roman_span { italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ψ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUBSCRIPT } and ρ=k=Npod+12M+1λkk=12M+1λk𝜌superscriptsubscript𝑘subscript𝑁pod12𝑀1subscript𝜆𝑘superscriptsubscript𝑘12𝑀1subscript𝜆𝑘\rho=\frac{\sum_{k={N_{\text{pod}}}+1}^{2M+1}\lambda_{k}}{\sum_{k=1}^{2M+1}% \lambda_{k}}italic_ρ = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG.

Proof. In the following proof, we assume L=M𝐿𝑀L=Mitalic_L = italic_M for simplicity. For the case L<M𝐿𝑀L<Mitalic_L < italic_M, the proof is similar.

Using the same notation of Lemma 2.2, ΦΦ\Phiroman_Φ and J𝐽Jitalic_J are both invertible square matrices. Then there exists a unique matrix P𝑃Pitalic_P such that,

ΦDFJP=ΦFJ,Φ𝐷𝐹𝐽𝑃Φ𝐹𝐽\Phi DFJP=\Phi FJ,roman_Φ italic_D italic_F italic_J italic_P = roman_Φ italic_F italic_J ,

and P=J1D1J𝑃superscript𝐽1superscript𝐷1𝐽P=J^{-1}D^{-1}Jitalic_P = italic_J start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_J.

Hence yj=i=1LPijy~isubscript𝑦𝑗subscriptsuperscript𝐿𝑖1subscript𝑃𝑖𝑗subscript~𝑦𝑖y_{j}=\sum^{L}_{i=1}P_{ij}\tilde{y}_{i}italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∑ start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Then by using the Cauchy-Schwarz inequality, for any 1jL1𝑗𝐿1\leq j\leq L1 ≤ italic_j ≤ italic_L, we have that

yjPpodyj2i=1LPij2i=1Ly~iPpody~i2.superscriptnormsubscript𝑦𝑗subscript𝑃podsubscript𝑦𝑗2superscriptsubscript𝑖1𝐿subscriptsuperscript𝑃2𝑖𝑗superscriptsubscript𝑖1𝐿superscriptnormsubscript~𝑦𝑖subscript𝑃podsubscript~𝑦𝑖2\displaystyle\|y_{j}-P_{\text{pod}}y_{j}\|^{2}\leq\sum_{i=1}^{L}P^{2}_{ij}\sum% _{i=1}^{L}\|\tilde{y}_{i}-P_{\text{pod}}\tilde{y}_{i}\|^{2}.∥ italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (2.11)

Hence,

j=1LyjPpodyj2i,j=1LPij2i=1Ly~iPpody~i2=PF2i=1Ly~iPpody~i2.superscriptsubscript𝑗1𝐿superscriptnormsubscript𝑦𝑗subscript𝑃podsubscript𝑦𝑗2superscriptsubscript𝑖𝑗1𝐿subscriptsuperscript𝑃2𝑖𝑗superscriptsubscript𝑖1𝐿superscriptnormsubscript~𝑦𝑖subscript𝑃podsubscript~𝑦𝑖2subscriptsuperscriptnorm𝑃2𝐹superscriptsubscript𝑖1𝐿superscriptnormsubscript~𝑦𝑖subscript𝑃podsubscript~𝑦𝑖2\displaystyle\sum_{j=1}^{L}\|y_{j}-P_{\text{pod}}y_{j}\|^{2}\leq\sum_{i,j=1}^{% L}P^{2}_{ij}\sum_{i=1}^{L}\|\tilde{y}_{i}-P_{\text{pod}}\tilde{y}_{i}\|^{2}=\|% P\|^{2}_{F}\sum_{i=1}^{L}\|\tilde{y}_{i}-P_{\text{pod}}\tilde{y}_{i}\|^{2}.∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_i , italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_P ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (2.12)

The rest is to estimate the Frobenius norm of P𝑃Pitalic_P. Since P=J1D1J𝑃superscript𝐽1superscript𝐷1𝐽P=J^{-1}D^{-1}Jitalic_P = italic_J start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_J, we define Pd=D12subscriptnorm𝑃𝑑subscriptnormsuperscript𝐷12\|P\|_{d}=\|D^{-1}\|_{2}∥ italic_P ∥ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = ∥ italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. It is easy to verify that d\|\cdot\|_{d}∥ ⋅ ∥ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT is a matrix norm. Then,

PFCPd=CD12CμL.subscriptnorm𝑃𝐹𝐶subscriptnorm𝑃𝑑𝐶subscriptnormsuperscript𝐷12𝐶subscript𝜇𝐿\|P\|_{F}\leq C\|P\|_{d}=C\|D^{-1}\|_{2}\leq C\mu_{L}.∥ italic_P ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ≤ italic_C ∥ italic_P ∥ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = italic_C ∥ italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_C italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT .

On the other hand, since ΦΦ\Phiroman_Φ is an orthogonal matrix, we have,

j=1Ly~j2superscriptsubscript𝑗1𝐿superscriptnormsubscript~𝑦𝑗2\displaystyle\sum_{j=1}^{L}\|\tilde{y}_{j}\|^{2}∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =ΦDFJF2absentsuperscriptsubscriptnormΦ𝐷𝐹𝐽𝐹2\displaystyle=\|\Phi DFJ\|_{F}^{2}= ∥ roman_Φ italic_D italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (2.13)
=DFJF2DF2FJF2absentsuperscriptsubscriptnorm𝐷𝐹𝐽𝐹2superscriptsubscriptnorm𝐷𝐹2superscriptsubscriptnorm𝐹𝐽𝐹2\displaystyle=\|DFJ\|_{F}^{2}\leq\|D\|_{F}^{2}\|FJ\|_{F}^{2}= ∥ italic_D italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∥ italic_D ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (2.14)
CFJF2absent𝐶superscriptsubscriptnorm𝐹𝐽𝐹2\displaystyle\leq C\|FJ\|_{F}^{2}≤ italic_C ∥ italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (2.15)
Cj=1Lyj2.absent𝐶superscriptsubscript𝑗1𝐿superscriptnormsubscript𝑦𝑗2\displaystyle\leq C\sum_{j=1}^{L}\|y_{j}\|^{2}.≤ italic_C ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (2.16)

Alighed with Proposition A.1, we finally have,

i=1MyiPpodyiL2(Ω)2i=1MyiL2(Ω)2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖subscript𝑃podsubscript𝑦𝑖superscript𝐿2Ω2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖superscript𝐿2Ω2\displaystyle\frac{\sum_{i=1}^{M}\left|\left|y_{i}-P_{\text{pod}}y_{i}\right|% \right|_{L^{2}(\Omega)}^{2}}{\sum_{i=1}^{M}\left|\left|y_{i}\right|\right|_{L^% {2}(\Omega)}^{2}}divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG CPF2i=1My~iPpody~iL2(Ω)2i=1My~iL2(Ω)2absent𝐶subscriptsuperscriptnorm𝑃2𝐹superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript~𝑦𝑖subscript𝑃podsubscript~𝑦𝑖superscript𝐿2Ω2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript~𝑦𝑖superscript𝐿2Ω2\displaystyle\leq C\|P\|^{2}_{F}\frac{\sum_{i=1}^{M}\left|\left|\tilde{y}_{i}-% P_{\text{pod}}\tilde{y}_{i}\right|\right|_{L^{2}(\Omega)}^{2}}{\sum_{i=1}^{M}% \left|\left|\tilde{y}_{i}\right|\right|_{L^{2}(\Omega)}^{2}}≤ italic_C ∥ italic_P ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (2.17)
μL2ρ.absentsuperscriptsubscript𝜇𝐿2𝜌\displaystyle\leq\mu_{L}^{2}\rho.≤ italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ρ . (2.18)

The conclusion comes with the estimation μiCi2/dsubscript𝜇𝑖𝐶superscript𝑖2𝑑\mu_{i}\leq Ci^{2/d}italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_C italic_i start_POSTSUPERSCRIPT 2 / italic_d end_POSTSUPERSCRIPT.

2.3 Convergence of inverse parabolic source problem

To solve this inverse source problem, we use the well-established Tikhonov regularization method, expressed as

minfX𝒮(f)mL2(Ω)2+λfL2(Ω)2.subscriptmin𝑓𝑋superscriptsubscriptnorm𝒮𝑓𝑚superscript𝐿2Ω2𝜆superscriptsubscriptnorm𝑓superscript𝐿2Ω2\displaystyle\mathop{\rm min}\limits_{f\in X}\|\mathcal{S}(f)-m\|_{L^{2}(% \Omega)}^{2}+\lambda\|f\|_{L^{2}(\Omega)}^{2}.roman_min start_POSTSUBSCRIPT italic_f ∈ italic_X end_POSTSUBSCRIPT ∥ caligraphic_S ( italic_f ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ∥ italic_f ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (2.19)

However, in the conventional application of the POD method, the source term f𝑓fitalic_f and the initial condition g𝑔gitalic_g must be determined initially to generate snapshots and obtain the POD basis functions. In the context of inverse problems, the only available information is the measurement m(x)𝑚𝑥m(x)italic_m ( italic_x ). This predicament, referred to as the inverse crime, makes this method impossible to implement in practice. Our new method could overcome this vital drawback by setting the forward solver to be our new POD forward solver.

In the general discrete approximation of problem (2.19), we seek to solve the following least-squares regularized optimization problem:

minfVpod𝒮pod(f)mL2(Ω)2+λfL2(Ω)2.subscriptmin𝑓subscript𝑉podsuperscriptsubscriptnormsubscript𝒮pod𝑓𝑚superscript𝐿2Ω2𝜆superscriptsubscriptnorm𝑓superscript𝐿2Ω2\displaystyle\mathop{\rm min}\limits_{f\in V_{\text{pod}}}\|\mathcal{S}_{\text% {pod}}(f)-m\|_{L^{2}(\Omega)}^{2}+\lambda\|f\|_{L^{2}(\Omega)}^{2}.roman_min start_POSTSUBSCRIPT italic_f ∈ italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT ( italic_f ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ∥ italic_f ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (2.20)

Consider the functional 𝒥pod[f]=𝒮podfmL2(Ω)2+λfL2(Ω)2subscript𝒥poddelimited-[]𝑓superscriptsubscriptnormsubscript𝒮pod𝑓𝑚superscript𝐿2Ω2𝜆superscriptsubscriptnorm𝑓superscript𝐿2Ω2\mathcal{J}_{\text{pod}}[f]=\|\mathcal{S}_{\text{pod}}f-m\|_{L^{2}(\Omega)}^{2% }+\lambda\|f\|_{L^{2}(\Omega)}^{2}caligraphic_J start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT [ italic_f ] = ∥ caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_f - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ∥ italic_f ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. By computing the Fre´´𝑒\acute{e}over´ start_ARG italic_e end_ARGchet derivative of 𝒥pod[f]subscript𝒥poddelimited-[]𝑓\mathcal{J}_{\text{pod}}[f]caligraphic_J start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT [ italic_f ], we can derive the subsequent iterative scheme:

fk+1=fkβd𝒥pod[fk],k,formulae-sequencesubscript𝑓𝑘1subscript𝑓𝑘𝛽𝑑subscript𝒥poddelimited-[]subscript𝑓𝑘for-all𝑘\displaystyle f_{k+1}=f_{k}-\beta d\mathcal{J}_{\text{pod}}[f_{k}],\quad% \forall k\in\mathbb{N},italic_f start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_β italic_d caligraphic_J start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT [ italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] , ∀ italic_k ∈ blackboard_N , (2.21)

where β𝛽\betaitalic_β is the step size, d𝒥pod[f]=𝒮pod(𝒮podfm)+λf𝑑subscript𝒥poddelimited-[]𝑓superscriptsubscript𝒮podsubscript𝒮pod𝑓𝑚𝜆𝑓d\mathcal{J}_{\text{pod}}[f]=\mathcal{S}_{\text{pod}}^{*}(\mathcal{S}_{\text{% pod}}f-m)+\lambda fitalic_d caligraphic_J start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT [ italic_f ] = caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_f - italic_m ) + italic_λ italic_f denotes the Fre´´𝑒\acute{e}over´ start_ARG italic_e end_ARGchet derivative, and f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is an initial guess [17].

The above theory is based on noise-free case, i.e. the final time measurement m=u(,T)𝑚𝑢𝑇m=u(\cdot,T)italic_m = italic_u ( ⋅ , italic_T ) is assumed to be precisely known. However, in practical applications, measurement data often contains uncertainties. We assume the measurement data is blurred by noise and takes the discrete form

min=u(di,T)+ei,i=1,,n,formulae-sequencesubscriptsuperscript𝑚𝑛𝑖𝑢subscript𝑑𝑖𝑇subscript𝑒𝑖𝑖1𝑛m^{n}_{i}=u(d_{i},T)+e_{i},i=1,\cdots,n,italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_u ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_T ) + italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i = 1 , ⋯ , italic_n ,

where disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPTs represent the positions of detectors and {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent and identically distributed (i.i.d.) random variables on an appropriate probability space (𝔛,,)\mathfrak{X},\mathcal{F},\mathbb{P})fraktur_X , caligraphic_F , blackboard_P ). Based on [6] and the analysis therein, we know that uC([0,T];H2(Ω))CfL2(Ω)subscriptnorm𝑢𝐶0𝑇superscript𝐻2Ω𝐶subscriptnorm𝑓superscript𝐿2Ω\|u\|_{C([0,T];H^{2}(\Omega))}\leq C\|f\|_{L^{2}(\Omega)}∥ italic_u ∥ start_POSTSUBSCRIPT italic_C ( [ 0 , italic_T ] ; italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) ) end_POSTSUBSCRIPT ≤ italic_C ∥ italic_f ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT. According to the embedding theorem of Sobolev spaces, we know that H2(Ω)superscript𝐻2ΩH^{2}(\Omega)italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) is continuously embedded into C(Ω¯)𝐶¯ΩC(\bar{\Omega})italic_C ( over¯ start_ARG roman_Ω end_ARG ) so that u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) is well defined point-wisely for all diΩsubscript𝑑𝑖Ωd_{i}\in\Omegaitalic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω. Without loss of generality, we assume that the scattered locations {di}i=1nsuperscriptsubscriptsubscript𝑑𝑖𝑖1𝑛\{d_{i}\}_{i=1}^{n}{ italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT are uniformly distributed in ΩΩ\Omegaroman_Ω. That is, there exists a constant B>0𝐵0B>0italic_B > 0 such that dmax/dminBsubscript𝑑subscript𝑑𝐵{d_{\max}}/{d_{\min}}\leq Bitalic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT / italic_d start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ≤ italic_B, where dmaxsubscript𝑑{d_{\max}}italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT and dminsubscript𝑑{d_{\min}}italic_d start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT are defined by

dmax=supxΩinf1in|xdi|anddmin=inf1ijn|didj|.subscript𝑑subscriptsup𝑥Ωsubscriptinf1𝑖𝑛𝑥subscript𝑑𝑖andsubscript𝑑subscriptinf1𝑖𝑗𝑛subscript𝑑𝑖subscript𝑑𝑗\displaystyle d_{\max}=\mathop{\rm sup}\limits_{x\in\Omega}\mathop{\rm inf}% \limits_{1\leq i\leq n}|x-d_{i}|~{}~{}~{}\mbox{and}~{}~{}~{}d_{\min}=\mathop{% \rm inf}\limits_{1\leq i\neq j\leq n}|d_{i}-d_{j}|.italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = roman_sup start_POSTSUBSCRIPT italic_x ∈ roman_Ω end_POSTSUBSCRIPT roman_inf start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_n end_POSTSUBSCRIPT | italic_x - italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | and italic_d start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = roman_inf start_POSTSUBSCRIPT 1 ≤ italic_i ≠ italic_j ≤ italic_n end_POSTSUBSCRIPT | italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | . (2.22)

We will first use the technique developed in [5] to recover the final time measurement u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) from the noisy data minsubscriptsuperscript𝑚𝑛𝑖m^{n}_{i}italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i=1,,n𝑖1𝑛i=1,...,nitalic_i = 1 , … , italic_n. We approximate u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) by solving the following minimization problem:

m=argminuX1ni=1n(u(xi)min)2+α|u|H2(Ω)2.𝑚subscriptargmin𝑢𝑋1𝑛superscriptsubscript𝑖1𝑛superscript𝑢subscript𝑥𝑖subscriptsuperscript𝑚𝑛𝑖2𝛼superscriptsubscript𝑢superscript𝐻2Ω2\displaystyle m=\mathop{\rm argmin}\limits_{u\in X}\frac{1}{n}\sum_{i=1}^{n}(u% (x_{i})-m^{n}_{i})^{2}+\alpha|u|_{H^{2}(\Omega)}^{2}.italic_m = roman_argmin start_POSTSUBSCRIPT italic_u ∈ italic_X end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_u ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_α | italic_u | start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (2.23)

Assume the pointwise noise eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has a bounded variance σ𝜎\sigmaitalic_σ, which is referred to as the noise level. [5] analyzed this problem and provided optimal convergence results. Moreover, they proposed an a posteriori algorithm to obtain the best approximation without knowing the true solution m𝑚mitalic_m and noise level σ𝜎\sigmaitalic_σ. Here, we list their main results. If one chooses the optimal regularization parameter

α1/2+d/8=O(σn1/2u(,T)H2(Ω)1),superscript𝛼12𝑑8𝑂𝜎superscript𝑛12subscriptsuperscriptnorm𝑢𝑇1superscript𝐻2Ω\alpha^{1/2+d/8}=O(\sigma n^{-1/2}\|u(\cdot,T)\|^{-1}_{H^{2}(\Omega)}),italic_α start_POSTSUPERSCRIPT 1 / 2 + italic_d / 8 end_POSTSUPERSCRIPT = italic_O ( italic_σ italic_n start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ) ,

then the solution m𝑚mitalic_m of (2.23) achieves the optimal convergence

𝔼[u(,T)mL2(Ω)]Cα1/2u(,T)H2(Ω).𝔼delimited-[]subscriptnorm𝑢𝑇𝑚superscript𝐿2Ω𝐶superscript𝛼12subscriptnorm𝑢𝑇superscript𝐻2Ω\mathbb{E}\big{[}\|u(\cdot,T)-m\|_{L^{2}(\Omega)}\big{]}\leq C\alpha^{1/2}\|u(% \cdot,T)\|_{H^{2}(\Omega)}.blackboard_E [ ∥ italic_u ( ⋅ , italic_T ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ] ≤ italic_C italic_α start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT . (2.24)

And if the noise {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent Gaussian random variables with variance σ𝜎\sigmaitalic_σ, we further have,

(u(,T)mL2(Ω)α1/2u(,T)H2(Ω)z)2eCz2.subscriptnorm𝑢𝑇𝑚superscript𝐿2Ωsuperscript𝛼12subscriptnorm𝑢𝑇superscript𝐻2Ω𝑧2superscript𝑒𝐶superscript𝑧2\mathbb{P}(\|u(\cdot,T)-m\|_{L^{2}(\Omega)}\geq\alpha^{1/2}\|u(\cdot,T)\|_{H^{% 2}(\Omega)}z)\leq 2e^{-Cz^{2}}.blackboard_P ( ∥ italic_u ( ⋅ , italic_T ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ≥ italic_α start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT italic_z ) ≤ 2 italic_e start_POSTSUPERSCRIPT - italic_C italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT . (2.25)

Using this recovered function m(x)𝑚𝑥m(x)italic_m ( italic_x ), we generate the adjoint POD basis functions in Section 2.1. It can be easily shown that, with uncertainty, the POD basis functions are still good low-rank approximation of the snapshots {y1,,yM}subscript𝑦1subscript𝑦𝑀\{y_{1},...,y_{M}\}{ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }. Combining the Theorem 2.3 and (2.24), we shall have that for any 1iM1𝑖𝑀1\leq i\leq M1 ≤ italic_i ≤ italic_M,

yiPpodyiL2(Ω)2C(ML4/dρ+α)fL2(Ω)2,superscriptsubscriptnormsubscript𝑦𝑖subscript𝑃podsubscript𝑦𝑖superscript𝐿2Ω2𝐶𝑀superscript𝐿4𝑑𝜌𝛼subscriptsuperscriptnorm𝑓2superscript𝐿2Ω\left|\left|y_{i}-P_{\text{pod}}y_{i}\right|\right|_{L^{2}(\Omega)}^{2}\leq C(% ML^{4/d}\rho+\alpha)\|f\|^{2}_{L^{2}(\Omega)},| | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_C ( italic_M italic_L start_POSTSUPERSCRIPT 4 / italic_d end_POSTSUPERSCRIPT italic_ρ + italic_α ) ∥ italic_f ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT , (2.26)

Since we replace the source term by a finite truncation (2.6), and if fH1(Ω)𝑓superscript𝐻1Ωf\in H^{1}(\Omega)italic_f ∈ italic_H start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( roman_Ω ),

ffappL2CfL2μLCfL2L1/d.subscriptnorm𝑓subscript𝑓appsuperscript𝐿2𝐶subscriptnorm𝑓superscript𝐿2subscript𝜇𝐿𝐶subscriptnorm𝑓superscript𝐿2superscript𝐿1𝑑\|f-f_{\text{app}}\|_{L^{2}}\leq C\frac{\|\nabla f\|_{L^{2}}}{\sqrt{\mu_{L}}}% \leq C\frac{\|\nabla f\|_{L^{2}}}{{L^{1/d}}}.∥ italic_f - italic_f start_POSTSUBSCRIPT app end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≤ italic_C divide start_ARG ∥ ∇ italic_f ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT end_ARG end_ARG ≤ italic_C divide start_ARG ∥ ∇ italic_f ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_L start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT end_ARG . (2.27)

If fL2(Ω)𝑓superscript𝐿2Ωf\in L^{2}(\Omega)italic_f ∈ italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ), then fappfsubscript𝑓app𝑓f_{\text{app}}\rightarrow fitalic_f start_POSTSUBSCRIPT app end_POSTSUBSCRIPT → italic_f as L+𝐿L\rightarrow+\inftyitalic_L → + ∞. We assume

ffappL22ε,subscriptsuperscriptnorm𝑓subscript𝑓𝑎𝑝𝑝2superscript𝐿2𝜀\|f-f_{app}\|^{2}_{L^{2}}\leq\varepsilon,∥ italic_f - italic_f start_POSTSUBSCRIPT italic_a italic_p italic_p end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≤ italic_ε , (2.28)

where ε𝜀\varepsilonitalic_ε depends on L𝐿Litalic_L. With those results, using a similar technique to prove the Theorem 4.1 in [17], we have the following convergence results.

Theorem 2.4

Let {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT be independent random variables satisfying 𝔼[ei]=0𝔼delimited-[]subscript𝑒𝑖0\mathbb{E}[e_{i}]=0blackboard_E [ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] = 0 and 𝔼[ei2]σ2𝔼delimited-[]subscriptsuperscript𝑒2𝑖superscript𝜎2\mathbb{E}[e^{2}_{i}]\leq\sigma^{2}blackboard_E [ italic_e start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ≤ italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for i=1,,n𝑖1𝑛i=1,\cdots,nitalic_i = 1 , ⋯ , italic_n. Set α1/2+d/8=O(σn1/2u(,T)H2(Ω)1)superscript𝛼12𝑑8𝑂𝜎superscript𝑛12subscriptsuperscriptnormsuperscript𝑢𝑇1superscript𝐻2Ω\alpha^{1/2+d/8}=O(\sigma n^{-1/2}\|u^{*}(\cdot,T)\|^{-1}_{H^{2}(\Omega)})italic_α start_POSTSUPERSCRIPT 1 / 2 + italic_d / 8 end_POSTSUPERSCRIPT = italic_O ( italic_σ italic_n start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( ⋅ , italic_T ) ∥ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ) in (2.23), and λ=O(ML4/dρ+α)𝜆𝑂𝑀superscript𝐿4𝑑𝜌𝛼\lambda=O(ML^{4/d}\rho+\alpha)italic_λ = italic_O ( italic_M italic_L start_POSTSUPERSCRIPT 4 / italic_d end_POSTSUPERSCRIPT italic_ρ + italic_α ), then

𝔼[𝒮f𝒮podfpodL2(Ω)2]CλfL2(Ω)2+Cε,𝔼delimited-[]superscriptsubscriptnorm𝒮superscript𝑓subscript𝒮podsubscript𝑓podsuperscript𝐿2Ω2𝐶𝜆subscriptsuperscriptnormsuperscript𝑓2superscript𝐿2Ω𝐶𝜀\displaystyle\mathbb{E}\big{[}\|\mathcal{S}f^{*}-\mathcal{S}_{\text{pod}}f_{% \text{pod}}\|_{L^{2}(\Omega)}^{2}\big{]}\leq C\lambda\|f^{*}\|^{2}_{L^{2}(% \Omega)}+C\varepsilon,blackboard_E [ ∥ caligraphic_S italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ italic_C italic_λ ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT + italic_C italic_ε , (2.29)
𝔼[ffpodL2(Ω)2]CfL2(Ω)2+Cε,𝔼delimited-[]superscriptsubscriptnormsuperscript𝑓subscript𝑓podsuperscript𝐿2Ω2𝐶subscriptsuperscriptnormsuperscript𝑓2superscript𝐿2Ω𝐶𝜀\displaystyle\mathbb{E}\big{[}\|f^{*}-f_{\text{pod}}\|_{L^{2}(\Omega)}^{2}\big% {]}\leq C\|f^{*}\|^{2}_{L^{2}(\Omega)}+C\varepsilon,blackboard_E [ ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ italic_C ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT + italic_C italic_ε , (2.30)

and

𝔼[ffpodH1(Ω)2]𝔼delimited-[]superscriptsubscriptnormsuperscript𝑓subscript𝑓podsuperscript𝐻1Ω2\displaystyle\mathbb{E}\big{[}\|f^{*}-f_{\text{pod}}\|_{H^{-1}(\Omega)}^{2}% \big{]}blackboard_E [ ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] Cλ1/2fL2(Ω)2+Cε.absent𝐶superscript𝜆12subscriptsuperscriptnormsuperscript𝑓2superscript𝐿2Ω𝐶𝜀\displaystyle\leq C\lambda^{1/2}\|f^{*}\|^{2}_{L^{2}(\Omega)}+C\varepsilon.≤ italic_C italic_λ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT + italic_C italic_ε . (2.31)

Furthermore, if we assume the noise {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent Gaussian random variables with variance σ𝜎\sigmaitalic_σ, we will have a stronger type of convergence, one can refer to [6] for a similar proof. We just list the results here.

Theorem 2.5

Let {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT be independent Gaussian random variables with variance σ𝜎\sigmaitalic_σ. Set α1/2+d/8=O(σn1/2u(,T)H2(Ω)1)superscript𝛼12𝑑8𝑂𝜎superscript𝑛12subscriptsuperscriptnormsuperscript𝑢𝑇1superscript𝐻2Ω\alpha^{1/2+d/8}=O(\sigma n^{-1/2}\|u^{*}(\cdot,T)\|^{-1}_{H^{2}(\Omega)})italic_α start_POSTSUPERSCRIPT 1 / 2 + italic_d / 8 end_POSTSUPERSCRIPT = italic_O ( italic_σ italic_n start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( ⋅ , italic_T ) ∥ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ) in (2.23), and λ=O(ML4/dρ+α)𝜆𝑂𝑀superscript𝐿4𝑑𝜌𝛼\lambda=O(ML^{4/d}\rho+\alpha)italic_λ = italic_O ( italic_M italic_L start_POSTSUPERSCRIPT 4 / italic_d end_POSTSUPERSCRIPT italic_ρ + italic_α ), then there exists a constant C, for any z>0𝑧0z>0italic_z > 0,

(SpodfpodSfL2(Ω)(λ1/2fL2+ε)z)2eCz2,subscriptnormsubscript𝑆podsubscript𝑓pod𝑆superscript𝑓superscript𝐿2Ωsuperscript𝜆12subscriptnormsuperscript𝑓superscript𝐿2𝜀𝑧2superscript𝑒𝐶superscript𝑧2\displaystyle\mathbb{P}(\|S_{\text{pod}}f_{\text{pod}}-Sf^{*}\|_{L^{2}(\Omega)% }\geq(\lambda^{1/2}\|f^{*}\|_{L^{2}}+\varepsilon)z)\leq 2e^{-Cz^{2}},blackboard_P ( ∥ italic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT - italic_S italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ≥ ( italic_λ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ε ) italic_z ) ≤ 2 italic_e start_POSTSUPERSCRIPT - italic_C italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT , (2.32)
(fpodfL2(Ω)(fL2+ε)z)2eCz2,subscriptnormsubscript𝑓podsuperscript𝑓superscript𝐿2Ωsubscriptnormsuperscript𝑓superscript𝐿2𝜀𝑧2superscript𝑒𝐶superscript𝑧2\displaystyle\mathbb{P}(\|f_{\text{pod}}-f^{*}\|_{L^{2}(\Omega)}\geq(\|f^{*}\|% _{L^{2}}+\varepsilon)z)\leq 2e^{-Cz^{2}},blackboard_P ( ∥ italic_f start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT - italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ≥ ( ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ε ) italic_z ) ≤ 2 italic_e start_POSTSUPERSCRIPT - italic_C italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT , (2.33)

and

(fpodfH1(Ω)(λ1/4fL2+ε)z)2eCz2.subscriptnormsubscript𝑓podsuperscript𝑓superscript𝐻1Ωsuperscript𝜆14subscriptnormsuperscript𝑓superscript𝐿2𝜀𝑧2superscript𝑒𝐶superscript𝑧2\displaystyle\mathbb{P}(\|f_{\text{pod}}-f^{*}\|_{H^{-1}(\Omega)}\geq(\lambda^% {1/4}\|f^{*}\|_{L^{2}}+\varepsilon)z)\leq 2e^{-Cz^{2}}.blackboard_P ( ∥ italic_f start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT - italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ≥ ( italic_λ start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT ∥ italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ε ) italic_z ) ≤ 2 italic_e start_POSTSUPERSCRIPT - italic_C italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT . (2.34)

3 Parabolic backward problem

For the backward problem of the parabolic equation, our goal is to recover the initial term g(x)𝑔𝑥g(x)italic_g ( italic_x ), given the final time measurement m=𝒮(g)=u(,T)𝑚𝒮𝑔𝑢𝑇m=\mathcal{S}(g)=u(\cdot,T)italic_m = caligraphic_S ( italic_g ) = italic_u ( ⋅ , italic_T ). In this case, u𝑢uitalic_u satisfies the following equation:

{ut+u=0in Ω×(0,T),u(x,t)=0on Ω×(0,T),u(x,0)=g(x)in Ω,\left\{\begin{aligned} u_{t}+\mathcal{L}u&=0&\mbox{in }\Omega\times(0,T),\\ u(x,t)&=0&\mbox{on }\partial\Omega\times(0,T),\\ u(x,0)&=g(x)&\mbox{in }\Omega\,,\end{aligned}\right.{ start_ROW start_CELL italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + caligraphic_L italic_u end_CELL start_CELL = 0 end_CELL start_CELL in roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL italic_u ( italic_x , italic_t ) end_CELL start_CELL = 0 end_CELL start_CELL on ∂ roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL italic_u ( italic_x , 0 ) end_CELL start_CELL = italic_g ( italic_x ) end_CELL start_CELL in roman_Ω , end_CELL end_ROW (3.1)

Unlike the traditional POD method, we will get the snapshots from the following adjoint equation:

{u~t+u~=0in Ω×(0,T),u~(x,t)=0on Ω×(0,T),u~(x,0)=m(x)in Ω,\left\{\begin{aligned} \tilde{u}_{t}+\mathcal{L}\tilde{u}&=0&\mbox{in }\Omega% \times(0,T),\\ \tilde{u}(x,t)&=0&\mbox{on }\partial\Omega\times(0,T),\\ \tilde{u}(x,0)&=m(x)&\mbox{in }\Omega\,,\end{aligned}\right.{ start_ROW start_CELL over~ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + caligraphic_L over~ start_ARG italic_u end_ARG end_CELL start_CELL = 0 end_CELL start_CELL in roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL over~ start_ARG italic_u end_ARG ( italic_x , italic_t ) end_CELL start_CELL = 0 end_CELL start_CELL on ∂ roman_Ω × ( 0 , italic_T ) , end_CELL end_ROW start_ROW start_CELL over~ start_ARG italic_u end_ARG ( italic_x , 0 ) end_CELL start_CELL = italic_m ( italic_x ) end_CELL start_CELL in roman_Ω , end_CELL end_ROW (3.2)

In this case, the snapshots are generated by solving the adjoint equation (3.2) with the given final time measurement m(x)𝑚𝑥m(x)italic_m ( italic_x ) as the initial condition.

Repeat the standard procedure in section 2.2, we generate the new POD basis ψksubscript𝜓𝑘\psi_{k}italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPTs from the snapshots {u~(,t0),u~(,t1),,u~(,tM)}~𝑢subscript𝑡0~𝑢subscript𝑡1~𝑢subscript𝑡𝑀\big{\{}\tilde{u}(\cdot,t_{0}),\tilde{u}(\cdot,t_{1}),\ldots,\tilde{u}(\cdot,t% _{M})\big{\}}{ over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , over~ start_ARG italic_u end_ARG ( ⋅ , italic_t start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) }, where tk=kΔtsubscript𝑡𝑘𝑘Δ𝑡t_{k}=k\Delta titalic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_k roman_Δ italic_t with Δt=TMΔ𝑡𝑇𝑀\Delta t=\frac{T}{M}roman_Δ italic_t = divide start_ARG italic_T end_ARG start_ARG italic_M end_ARG and k=0,,M𝑘0𝑀k=0,\ldots,Mitalic_k = 0 , … , italic_M. Then we have the following error formula similar to Proposition A.1:

i=12M+1y~ik=1Npod(y~i,ψk())L2(Ωψk()L2(Ω)2i=12M+1y~iL2(Ω)2=ρ,\frac{\sum_{i=1}^{2M+1}\left|\left|\tilde{y}_{i}-\sum_{k=1}^{{N_{\text{pod}}}}% \big{(}\tilde{y}_{i},\psi_{k}(\cdot)\big{)}_{L^{2}(\Omega}\psi_{k}(\cdot)% \right|\right|_{L^{2}(\Omega)}^{2}}{\sum_{i=1}^{2M+1}\left|\left|\tilde{y}_{i}% \right|\right|_{L^{2}(\Omega)}^{2}}=\rho,divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( ⋅ ) ) start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω end_POSTSUBSCRIPT italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( ⋅ ) | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = italic_ρ , (3.3)

where the number Npodsubscript𝑁podN_{\text{pod}}italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT is determined according to the decay of the ratio ρ=k=Npod+12M+1λkk=12M+1λk𝜌superscriptsubscript𝑘subscript𝑁pod12𝑀1subscript𝜆𝑘superscriptsubscript𝑘12𝑀1subscript𝜆𝑘\rho=\frac{\sum_{k={N_{\text{pod}}}+1}^{2M+1}\lambda_{k}}{\sum_{k=1}^{2M+1}% \lambda_{k}}italic_ρ = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG.

We consider using these new POD basis functions to approximate the forward problem to accelerate the computation. The fully discrete scheme is constructed on Vpodsubscript𝑉podV_{\text{pod}}italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT and the solution is denoted by Uksubscript𝑈𝑘U_{k}italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for k=1M𝑘1𝑀k=1\cdots Mitalic_k = 1 ⋯ italic_M with M=TΔt𝑀𝑇Δ𝑡M=\frac{T}{\Delta t}italic_M = divide start_ARG italic_T end_ARG start_ARG roman_Δ italic_t end_ARG. To be precise, we seek numerical solutions Uksubscript𝑈𝑘U_{k}italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT’s such that

(¯Uk,ψ)+a(Uk,ψ)=0,ψVpod,formulae-sequence¯subscript𝑈𝑘𝜓𝑎subscript𝑈𝑘𝜓0for-all𝜓subscript𝑉𝑝𝑜𝑑(\bar{\partial}U_{k},\psi)+a(U_{k},\psi)=0,\quad\forall\psi\in{V_{pod}},( over¯ start_ARG ∂ end_ARG italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ψ ) + italic_a ( italic_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ψ ) = 0 , ∀ italic_ψ ∈ italic_V start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT , (3.4)

with U0=g(x)subscript𝑈0𝑔𝑥U_{0}=g(x)italic_U start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_g ( italic_x ). We define the solution operator from the ini term g𝑔gitalic_g to the final time solution UMsubscript𝑈𝑀U_{M}italic_U start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT as 𝒮podsubscript𝒮pod\mathcal{S}_{\text{pod}}caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT, i.e., 𝒮podg=UMsubscript𝒮pod𝑔subscript𝑈𝑀\mathcal{S}_{\text{pod}}g=U_{M}caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_g = italic_U start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT.

3.1 Convergence of the adjoint-POD method

For any gL2(Ω)𝑔superscript𝐿2Ωg\in L^{2}(\Omega)italic_g ∈ italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ), we write g=k=1gkϕk𝑔superscriptsubscript𝑘1subscript𝑔𝑘subscriptitalic-ϕ𝑘g=\sum_{k=1}^{\infty}g_{k}\phi_{k}italic_g = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for a set of coefficients gksubscript𝑔𝑘g_{k}italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Let u=k=1uk(t)ϕk𝑢superscriptsubscript𝑘1subscript𝑢𝑘𝑡subscriptitalic-ϕ𝑘u=\sum_{k=1}^{\infty}u_{k}(t)\phi_{k}italic_u = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be the solution of the problem (2.1). Substituting these two expressions of g𝑔gitalic_g and u𝑢uitalic_u into the first equation of (2.1), we get by noting the fact that Lϕk=μkϕk𝐿subscriptitalic-ϕ𝑘subscript𝜇𝑘subscriptitalic-ϕ𝑘L\phi_{k}=\mu_{k}\phi_{k}italic_L italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and comparing the coefficients of ϕksubscriptitalic-ϕ𝑘\phi_{k}italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT on both sides of the equation that uk(0)=gksubscript𝑢𝑘0subscript𝑔𝑘u_{k}(0)=g_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( 0 ) = italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and

uk(t)+μkuk=0in (0,T).subscriptsuperscript𝑢𝑘𝑡subscript𝜇𝑘subscript𝑢𝑘0in 0𝑇u^{\prime}_{k}(t)+\mu_{k}u_{k}=0\ \ \ \ \mbox{in }~{}(0,T)\,.italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) + italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0 in ( 0 , italic_T ) .

We can write the solution as uk(T)=αkgksubscript𝑢𝑘𝑇subscript𝛼𝑘subscript𝑔𝑘u_{k}(T)=\alpha_{k}\,g_{k}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_T ) = italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, with αk=eμkTsubscript𝛼𝑘superscript𝑒subscript𝜇𝑘𝑇\alpha_{k}=e^{-\mu_{k}T}italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT. Noting that Sg=u(,T)=k=1uk(T)ϕk𝑆𝑔𝑢𝑇subscriptsuperscript𝑘1subscript𝑢𝑘𝑇subscriptitalic-ϕ𝑘Sg=u(\cdot,T)=\sum^{\infty}_{k=1}u_{k}(T)\phi_{k}italic_S italic_g = italic_u ( ⋅ , italic_T ) = ∑ start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_T ) italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, we can formally write

S(k=1gkϕk)=k=1αkgkϕk.𝑆superscriptsubscript𝑘1subscript𝑔𝑘subscriptitalic-ϕ𝑘superscriptsubscript𝑘1subscript𝛼𝑘subscript𝑔𝑘subscriptitalic-ϕ𝑘S\Big{(}\sum_{k=1}^{\infty}g_{k}\phi_{k}\Big{)}=\sum_{k=1}^{\infty}\alpha_{k}g% _{k}\phi_{k}.italic_S ( ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT .

This representation shows the relationship between the initial condition g𝑔gitalic_g and the solution u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) at the final time T𝑇Titalic_T. The operator S𝑆Sitalic_S maps the initial condition to the solution at time T𝑇Titalic_T through the coefficients αksubscript𝛼𝑘\alpha_{k}italic_α start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, which depend on the eigenvalues μksubscript𝜇𝑘\mu_{k}italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT of the operator L𝐿Litalic_L and the final time T𝑇Titalic_T. This relationship can be used to analyze the properties of the solution and the backward problem.

For simplicity, we approximate the source term g(x)𝑔𝑥g(x)italic_g ( italic_x ) by a finite-dimensional truncation, i.e.

gapp=k=1Lgkϕk.subscript𝑔appsuperscriptsubscript𝑘1𝐿subscript𝑔𝑘subscriptitalic-ϕ𝑘g_{\text{app}}=\sum_{k=1}^{L}g_{k}\phi_{k}.italic_g start_POSTSUBSCRIPT app end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (3.5)

Then the solution u(x,t)𝑢𝑥𝑡u(x,t)italic_u ( italic_x , italic_t ) of the parabolic equation has the form: u(,T)=k=1LeμkTfkϕk𝑢𝑇subscriptsuperscript𝐿𝑘1superscript𝑒subscript𝜇𝑘𝑇subscript𝑓𝑘subscriptitalic-ϕ𝑘u(\cdot,T)=\sum^{L}_{k=1}e^{-\mu_{k}T}f_{k}\phi_{k}italic_u ( ⋅ , italic_T ) = ∑ start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. After simple calculation, we can also have that u~(x,t)=k=1LeμkTeμktfkϕk~𝑢𝑥𝑡subscriptsuperscript𝐿𝑘1superscript𝑒subscript𝜇𝑘𝑇superscript𝑒subscript𝜇𝑘𝑡subscript𝑓𝑘subscriptitalic-ϕ𝑘\tilde{u}(x,t)=\sum^{L}_{k=1}e^{-\mu_{k}T}e^{-\mu_{k}t}f_{k}\phi_{k}over~ start_ARG italic_u end_ARG ( italic_x , italic_t ) = ∑ start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Choosing the POD basis is to compute the singular value decompositio of the matrix A~=(y~1,,y~M)~𝐴subscript~𝑦1subscript~𝑦𝑀\tilde{A}=(\tilde{y}_{1},...,\tilde{y}_{M})over~ start_ARG italic_A end_ARG = ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ), where y~j=(u~(x1,tj),,u~(xN,tj))Tsubscript~𝑦𝑗superscript~𝑢subscript𝑥1subscript𝑡𝑗~𝑢subscript𝑥𝑁subscript𝑡𝑗𝑇\tilde{y}_{j}=(\tilde{u}(x_{1},t_{j}),...,\tilde{u}(x_{N},t_{j}))^{T}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( over~ start_ARG italic_u end_ARG ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , … , over~ start_ARG italic_u end_ARG ( italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT. Here x1,,xNsubscript𝑥1subscript𝑥𝑁x_{1},...,x_{N}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT are the finite element nodes in ΩΩ\Omegaroman_Ω. Suppose A𝐴Aitalic_A has the singular value decomposition: A=UΣV𝐴𝑈Σ𝑉A=U\Sigma Vitalic_A = italic_U roman_Σ italic_V, then the ψkssubscript𝜓𝑘𝑠\psi_{k}sitalic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_s are exactly the first M𝑀Mitalic_M columns of U𝑈Uitalic_U.

Denote A=(y1,,yM)𝐴subscript𝑦1subscript𝑦𝑀A=(y_{1},...,y_{M})italic_A = ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) and A~=(y~1,,y~M)~𝐴subscript~𝑦1subscript~𝑦𝑀\tilde{A}=(\tilde{y}_{1},...,\tilde{y}_{M})over~ start_ARG italic_A end_ARG = ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ), the matrix Φ=(ϕ1(x,t1),,ϕL(x,t1))Φsubscriptitalic-ϕ1𝑥subscript𝑡1subscriptitalic-ϕ𝐿𝑥subscript𝑡1\Phi=(\phi_{1}(\vec{x},t_{1}),...,\phi_{L}(\vec{x},t_{1}))roman_Φ = ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( over→ start_ARG italic_x end_ARG , italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_ϕ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ( over→ start_ARG italic_x end_ARG , italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ), F=diag(f1,,fL)𝐹diagsubscript𝑓1subscript𝑓𝐿F=\text{diag}(f_{1},...,f_{L})italic_F = diag ( italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_f start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ) and D=diag(eμ1T,,eμLT)𝐷diagsuperscript𝑒subscript𝜇1𝑇superscript𝑒subscript𝜇𝐿𝑇D=\text{diag}(e^{-\mu_{1}T},...,e^{-\mu_{L}T})italic_D = diag ( italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT , … , italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT ), and the L×M𝐿𝑀L\times Mitalic_L × italic_M matrix J𝐽Jitalic_J with entries: J(i,j)=eμitj𝐽𝑖𝑗superscript𝑒subscript𝜇𝑖subscript𝑡𝑗J(i,j)=e^{-\mu_{i}t_{j}}italic_J ( italic_i , italic_j ) = italic_e start_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Obviously, ΦΦ\Phiroman_Φ is a column orthogonal matrix since the normal orthogonality of eigenfunctions ϕksubscriptitalic-ϕ𝑘\phi_{k}italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPTs. With the formulations of u𝑢uitalic_u and u~~𝑢\tilde{u}over~ start_ARG italic_u end_ARG, we could represent the matrix A𝐴Aitalic_A and A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG by:

A=ΦFJ,andA~=ΦDFJ.formulae-sequence𝐴Φ𝐹𝐽and~𝐴Φ𝐷𝐹𝐽A=\Phi FJ,\quad\quad\text{and}\quad\quad\tilde{A}=\Phi DFJ.italic_A = roman_Φ italic_F italic_J , and over~ start_ARG italic_A end_ARG = roman_Φ italic_D italic_F italic_J . (3.6)

These matrix representations of A𝐴Aitalic_A and A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG provide a compact way to express the relationship between the coefficients of the eigenfunctions ϕksubscriptitalic-ϕ𝑘\phi_{k}italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and the solutions u(x,t)𝑢𝑥𝑡u(x,t)italic_u ( italic_x , italic_t ) and u~(x,t)~𝑢𝑥𝑡\tilde{u}(x,t)over~ start_ARG italic_u end_ARG ( italic_x , italic_t ).

Proposition A.1 shows that the low-rank space Vpodsubscript𝑉podV_{\text{pod}}italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT is the best Npodsubscript𝑁podN_{\text{pod}}italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT rank approximation of the column space of A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG, we want to show that Vpodsubscript𝑉𝑝𝑜𝑑V_{pod}italic_V start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT is also a good approximation of the column space of A𝐴Aitalic_A, which will certify the efficiency of the new POD method. First of all, let us show the connection of the matrices A𝐴Aitalic_A and A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG.

Lemma 3.1

If LM𝐿𝑀L\leq Mitalic_L ≤ italic_M, then span{y1,,yM}=span{y~1,,y~M}𝑠𝑝𝑎𝑛subscript𝑦1subscript𝑦𝑀𝑠𝑝𝑎𝑛subscript~𝑦1subscript~𝑦𝑀span\{y_{1},...,y_{M}\}=span\{\tilde{y}_{1},...,\tilde{y}_{M}\}italic_s italic_p italic_a italic_n { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } = italic_s italic_p italic_a italic_n { over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }. This means the column spaces of A𝐴Aitalic_A and A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG are identical, i.e., C(A)=C(A~)𝐶𝐴𝐶~𝐴C(A)=C(\tilde{A})italic_C ( italic_A ) = italic_C ( over~ start_ARG italic_A end_ARG ).

Proof. The proof is similar to the Lemma 2.2, just using the fact that the matrix J𝐽Jitalic_J is actually a Vandermonde matrix.

From (3.3), the new POD basis is a good approximation of {y~1,,y~M}subscript~𝑦1subscript~𝑦𝑀\{\tilde{y}_{1},...,\tilde{y}_{M}\}{ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }. With the above preparation, we will show the new POD basis is also a good approximation of {y1,,yM}subscript𝑦1subscript𝑦𝑀\{y_{1},...,y_{M}\}{ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }.

Theorem 3.2

Using the same notation in this section, if one has enough snapshots, i.e. LM𝐿𝑀L\leq Mitalic_L ≤ italic_M, then

i=1MyiPpodyiL2(Ω)2i=1MyiL2(Ω)2Ce2μLTρ,superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖subscript𝑃𝑝𝑜𝑑subscript𝑦𝑖superscript𝐿2Ω2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖superscript𝐿2Ω2𝐶superscript𝑒2subscript𝜇𝐿𝑇𝜌\frac{\sum_{i=1}^{M}\left|\left|y_{i}-P_{pod}y_{i}\right|\right|_{L^{2}(\Omega% )}^{2}}{\sum_{i=1}^{M}\left|\left|y_{i}\right|\right|_{L^{2}(\Omega)}^{2}}\leq Ce% ^{2\mu_{L}T}\rho,divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ≤ italic_C italic_e start_POSTSUPERSCRIPT 2 italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT italic_ρ , (3.7)

where Ppodsubscript𝑃𝑝𝑜𝑑P_{pod}italic_P start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT is the projection operator on the adjoint-POD space span{ψ1,,ψNpod}𝑠𝑝𝑎𝑛subscript𝜓1subscript𝜓subscript𝑁𝑝𝑜𝑑span\{\psi_{1},...,\psi_{N_{pod}}\}italic_s italic_p italic_a italic_n { italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ψ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT } and ρ=k=Npod+1Mλkk=1Mλk𝜌superscriptsubscript𝑘subscript𝑁𝑝𝑜𝑑1𝑀subscript𝜆𝑘superscriptsubscript𝑘1𝑀subscript𝜆𝑘\rho=\frac{\sum_{k={N_{pod}}+1}^{M}\lambda_{k}}{\sum_{k=1}^{M}\lambda_{k}}italic_ρ = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = italic_N start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG

Proof. In the following proof, we assume L=M𝐿𝑀L=Mitalic_L = italic_M for simplicity. For the case L<M𝐿𝑀L<Mitalic_L < italic_M, the proof is similar.

Using the same notation of Lemma 3.1, ΦΦ\Phiroman_Φ and J𝐽Jitalic_J are both invertible square matrices. Then, there exists a unique matrix P𝑃Pitalic_P such that,

ΦDFJP=ΦFJ,Φ𝐷𝐹𝐽𝑃Φ𝐹𝐽\Phi DFJP=\Phi FJ,roman_Φ italic_D italic_F italic_J italic_P = roman_Φ italic_F italic_J ,

and P=J1D1J𝑃superscript𝐽1superscript𝐷1𝐽P=J^{-1}D^{-1}Jitalic_P = italic_J start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_J.

Hence

yj=i=1LPijy~i.subscript𝑦𝑗subscriptsuperscript𝐿𝑖1subscript𝑃𝑖𝑗subscript~𝑦𝑖y_{j}=\sum^{L}_{i=1}P_{ij}\tilde{y}_{i}.italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∑ start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT .

Then by Cauchy-Schwarz inequality, for any 1jL1𝑗𝐿1\leq j\leq L1 ≤ italic_j ≤ italic_L, we have the estimate

yjPpodyj2i=1LPij2i=1Ly~iPpody~i2.superscriptnormsubscript𝑦𝑗subscript𝑃podsubscript𝑦𝑗2superscriptsubscript𝑖1𝐿subscriptsuperscript𝑃2𝑖𝑗superscriptsubscript𝑖1𝐿superscriptnormsubscript~𝑦𝑖subscript𝑃podsubscript~𝑦𝑖2\displaystyle\|y_{j}-P_{\text{pod}}y_{j}\|^{2}\leq\sum_{i=1}^{L}P^{2}_{ij}\sum% _{i=1}^{L}\|\tilde{y}_{i}-P_{\text{pod}}\tilde{y}_{i}\|^{2}.∥ italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (3.8)

Hence,

j=1LyjPpodyj2i,j=1LPij2i=1Ly~iPpody~i2=PF2i=1Ly~iPpody~i2.superscriptsubscript𝑗1𝐿superscriptnormsubscript𝑦𝑗subscript𝑃podsubscript𝑦𝑗2superscriptsubscript𝑖𝑗1𝐿subscriptsuperscript𝑃2𝑖𝑗superscriptsubscript𝑖1𝐿superscriptnormsubscript~𝑦𝑖subscript𝑃podsubscript~𝑦𝑖2subscriptsuperscriptnorm𝑃2𝐹superscriptsubscript𝑖1𝐿superscriptnormsubscript~𝑦𝑖subscript𝑃podsubscript~𝑦𝑖2\displaystyle\sum_{j=1}^{L}\|y_{j}-P_{\text{pod}}y_{j}\|^{2}\leq\sum_{i,j=1}^{% L}P^{2}_{ij}\sum_{i=1}^{L}\|\tilde{y}_{i}-P_{\text{pod}}\tilde{y}_{i}\|^{2}=\|% P\|^{2}_{F}\sum_{i=1}^{L}\|\tilde{y}_{i}-P_{\text{pod}}\tilde{y}_{i}\|^{2}.∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_i , italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_P ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (3.9)

The rest is to estimate the Frobenius norm of P𝑃Pitalic_P. Since P=J1D1J𝑃superscript𝐽1superscript𝐷1𝐽P=J^{-1}D^{-1}Jitalic_P = italic_J start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_J, we define Pd=D12subscriptnorm𝑃𝑑subscriptnormsuperscript𝐷12\|P\|_{d}=\|D^{-1}\|_{2}∥ italic_P ∥ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = ∥ italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. It is easy to verify that d\|\cdot\|_{d}∥ ⋅ ∥ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT is a matrix norm. Then,

PFCPd=CD12CeμLT.subscriptnorm𝑃𝐹𝐶subscriptnorm𝑃𝑑𝐶subscriptnormsuperscript𝐷12𝐶superscript𝑒subscript𝜇𝐿𝑇\|P\|_{F}\leq C\|P\|_{d}=C\|D^{-1}\|_{2}\leq Ce^{\mu_{L}T}.∥ italic_P ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ≤ italic_C ∥ italic_P ∥ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = italic_C ∥ italic_D start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_C italic_e start_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT .

On the other hand, since ΦΦ\Phiroman_Φ is an orthogonal matrix, we have

j=1Ly~j2superscriptsubscript𝑗1𝐿superscriptnormsubscript~𝑦𝑗2\displaystyle\sum_{j=1}^{L}\|\tilde{y}_{j}\|^{2}∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =ΦDFJF2=DFJF2DF2FJF2absentsuperscriptsubscriptnormΦ𝐷𝐹𝐽𝐹2superscriptsubscriptnorm𝐷𝐹𝐽𝐹2superscriptsubscriptnorm𝐷𝐹2superscriptsubscriptnorm𝐹𝐽𝐹2\displaystyle=\|\Phi DFJ\|_{F}^{2}=\|DFJ\|_{F}^{2}\leq\|D\|_{F}^{2}\|FJ\|_{F}^% {2}= ∥ roman_Φ italic_D italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_D italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ∥ italic_D ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (3.10)
CFJF2absent𝐶superscriptsubscriptnorm𝐹𝐽𝐹2\displaystyle\leq C\|FJ\|_{F}^{2}≤ italic_C ∥ italic_F italic_J ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (3.11)
=Cj=1Lyj2.\displaystyle=\leq C\sum_{j=1}^{L}\|y_{j}\|^{2}.= ≤ italic_C ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ∥ italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (3.12)

Alighed with (3.3), we finally have,

i=1MyiPpodyiL2(Ω)2i=1MyiL2(Ω)2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖subscript𝑃𝑝𝑜𝑑subscript𝑦𝑖superscript𝐿2Ω2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript𝑦𝑖superscript𝐿2Ω2\displaystyle\frac{\sum_{i=1}^{M}\left|\left|y_{i}-P_{pod}y_{i}\right|\right|_% {L^{2}(\Omega)}^{2}}{\sum_{i=1}^{M}\left|\left|y_{i}\right|\right|_{L^{2}(% \Omega)}^{2}}divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG CPF2i=1My~iPpody~iL2(Ω)2i=1My~iL2(Ω)2absent𝐶subscriptsuperscriptnorm𝑃2𝐹superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript~𝑦𝑖subscript𝑃𝑝𝑜𝑑subscript~𝑦𝑖superscript𝐿2Ω2superscriptsubscript𝑖1𝑀superscriptsubscriptnormsubscript~𝑦𝑖superscript𝐿2Ω2\displaystyle\leq C\|P\|^{2}_{F}\frac{\sum_{i=1}^{M}\left|\left|\tilde{y}_{i}-% P_{pod}\tilde{y}_{i}\right|\right|_{L^{2}(\Omega)}^{2}}{\sum_{i=1}^{M}\left|% \left|\tilde{y}_{i}\right|\right|_{L^{2}(\Omega)}^{2}}≤ italic_C ∥ italic_P ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (3.13)
Ce2μLTρ.absent𝐶superscript𝑒2subscript𝜇𝐿𝑇𝜌\displaystyle\leq Ce^{2\mu_{L}T}\rho.≤ italic_C italic_e start_POSTSUPERSCRIPT 2 italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT italic_ρ . (3.14)

The conclusion comes with the estimation μiCi2/dsubscript𝜇𝑖𝐶superscript𝑖2𝑑\mu_{i}\leq Ci^{2/d}italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_C italic_i start_POSTSUPERSCRIPT 2 / italic_d end_POSTSUPERSCRIPT.

3.2 Convergence of backward problem

To solve this inverse problem, we use the traditional Tikhonov regularization method,

mingX𝒮(g)mL2(Ω)2+λgL2(Ω)2.subscriptmin𝑔𝑋superscriptsubscriptnorm𝒮𝑔𝑚superscript𝐿2Ω2𝜆superscriptsubscriptnorm𝑔superscript𝐿2Ω2\displaystyle\mathop{\rm min}\limits_{g\in X}\|\mathcal{S}(g)-m\|_{L^{2}(% \Omega)}^{2}+\lambda\|g\|_{L^{2}(\Omega)}^{2}.roman_min start_POSTSUBSCRIPT italic_g ∈ italic_X end_POSTSUBSCRIPT ∥ caligraphic_S ( italic_g ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ∥ italic_g ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (3.15)

In the general discrete approximation of the problem (3.15), we solve the following least-squares regularized optimization problem:

mingVpod𝒮pod(g)mL2(Ω)2+λgL2(Ω)2.subscriptmin𝑔subscript𝑉podsuperscriptsubscriptnormsubscript𝒮pod𝑔𝑚superscript𝐿2Ω2𝜆superscriptsubscriptnorm𝑔superscript𝐿2Ω2\displaystyle\mathop{\rm min}\limits_{g\in V_{\text{pod}}}\|\mathcal{S}_{\text% {pod}}(g)-m\|_{L^{2}(\Omega)}^{2}+\lambda\|g\|_{L^{2}(\Omega)}^{2}.roman_min start_POSTSUBSCRIPT italic_g ∈ italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT ( italic_g ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ∥ italic_g ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (3.16)

But in the traditional setting of the POD method, one has to know the source term f𝑓fitalic_f and the initial condition g𝑔gitalic_g first to derive the snapshots to get the POD basis functions, but the only information known in inverse problems is the measurement m(x)𝑚𝑥m(x)italic_m ( italic_x ). This is called inverse crime which makes this method impossible to implement in practice.

Define the functional 𝒥pod[g]=𝒮podgmL2(Ω)2+λgL2(Ω)2subscript𝒥poddelimited-[]𝑔superscriptsubscriptnormsubscript𝒮pod𝑔𝑚superscript𝐿2Ω2𝜆superscriptsubscriptnorm𝑔superscript𝐿2Ω2\mathcal{J}_{\text{pod}}[g]=\|\mathcal{S}_{\text{pod}}g-m\|_{L^{2}(\Omega)}^{2% }+\lambda\|g\|_{L^{2}(\Omega)}^{2}caligraphic_J start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT [ italic_g ] = ∥ caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_g - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ∥ italic_g ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. We can compute the Fre´´𝑒\acute{e}over´ start_ARG italic_e end_ARGchet derivative of 𝒥pod[g]subscript𝒥poddelimited-[]𝑔\mathcal{J}_{\text{pod}}[g]caligraphic_J start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT [ italic_g ] and obtain the following iterative scheme:

gk+1=gkβd𝒥pod[gk],k,formulae-sequencesubscript𝑔𝑘1subscript𝑔𝑘𝛽𝑑subscript𝒥𝑝𝑜𝑑delimited-[]subscript𝑔𝑘for-all𝑘\displaystyle g_{k+1}=g_{k}-\beta d\mathcal{J}_{pod}[g_{k}],\quad\forall k\in% \mathbb{N},italic_g start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT = italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_β italic_d caligraphic_J start_POSTSUBSCRIPT italic_p italic_o italic_d end_POSTSUBSCRIPT [ italic_g start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] , ∀ italic_k ∈ blackboard_N , (3.17)

where β𝛽\betaitalic_β is the step size, d𝒥pod[g]=𝒮pod(𝒮podgm)+λf𝑑subscript𝒥poddelimited-[]𝑔superscriptsubscript𝒮podsubscript𝒮pod𝑔𝑚𝜆𝑓d\mathcal{J}_{\text{pod}}[g]=\mathcal{S}_{\text{pod}}^{*}(\mathcal{S}_{\text{% pod}}g-m)+\lambda fitalic_d caligraphic_J start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT [ italic_g ] = caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_g - italic_m ) + italic_λ italic_f, and g0subscript𝑔0g_{0}italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is an initial guess.

The above theory is based on noise free case, i.e. the final time measurement m=u(,T)𝑚𝑢𝑇m=u(\cdot,T)italic_m = italic_u ( ⋅ , italic_T ) is exactly known. For practical consideration, the measurement data always contains uncertainty. We assume the measurement data is always blurred by noise and takes the discrete form min=u(di,T)+eisubscriptsuperscript𝑚𝑛𝑖𝑢subscript𝑑𝑖𝑇subscript𝑒𝑖m^{n}_{i}=u(d_{i},T)+e_{i}italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_u ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_T ) + italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i=1,,n𝑖1𝑛i=1,\cdots,nitalic_i = 1 , ⋯ , italic_n, where disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPTs are the positions of detectors and {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent and identically distributed (i.i.d.) random variables on a proper probability space (𝔛,,)\mathfrak{X},\mathcal{F},\mathbb{P})fraktur_X , caligraphic_F , blackboard_P ). From property of paraboic equation, we know that uC([0,T];H2(Ω))CgL2(Ω)subscriptnorm𝑢𝐶0𝑇superscript𝐻2Ω𝐶subscriptnorm𝑔superscript𝐿2Ω\|u\|_{C([0,T];H^{2}(\Omega))}\leq C\|g\|_{L^{2}(\Omega)}∥ italic_u ∥ start_POSTSUBSCRIPT italic_C ( [ 0 , italic_T ] ; italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) ) end_POSTSUBSCRIPT ≤ italic_C ∥ italic_g ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT. According to the embedding theorem of Sobolev spaces, we know that H2(Ω)superscript𝐻2ΩH^{2}(\Omega)italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) is continuously embedded into C(Ω¯)𝐶¯ΩC(\bar{\Omega})italic_C ( over¯ start_ARG roman_Ω end_ARG ) so that u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) is well defined point-wisely for all diΩsubscript𝑑𝑖Ωd_{i}\in\Omegaitalic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ roman_Ω.

We will first use the technique developed in [5] to recover the final time measurement u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) from the noisy data minsubscriptsuperscript𝑚𝑛𝑖m^{n}_{i}italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i=1,,n𝑖1𝑛i=1,...,nitalic_i = 1 , … , italic_n. We approximate u(,T)𝑢𝑇u(\cdot,T)italic_u ( ⋅ , italic_T ) by solving the following minimization problem:

m=argminuX1ni=1n(u(xi)min)2+α|u|H2(Ω)2.𝑚subscriptargmin𝑢𝑋1𝑛superscriptsubscript𝑖1𝑛superscript𝑢subscript𝑥𝑖subscriptsuperscript𝑚𝑛𝑖2𝛼superscriptsubscript𝑢superscript𝐻2Ω2\displaystyle m=\mathop{\rm argmin}\limits_{u\in X}\frac{1}{n}\sum_{i=1}^{n}(u% (x_{i})-m^{n}_{i})^{2}+\alpha|u|_{H^{2}(\Omega)}^{2}.italic_m = roman_argmin start_POSTSUBSCRIPT italic_u ∈ italic_X end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_u ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_α | italic_u | start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (3.18)

Assume the point wise noise eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT has bounded variance σ𝜎\sigmaitalic_σ, this is the so-called noise level. [5]analyzed this problem and give the optimal convergence results, moreover they proposed a posteriori algorithm to give the best approximation without knowing the true solution m𝑚mitalic_m and noise level σ𝜎\sigmaitalic_σ. Here we list their main results: If one chooses the optimal regularization parameter

α1/2+d/8=O(σn1/2u(,T)H2(Ω)1),superscript𝛼12𝑑8𝑂𝜎superscript𝑛12subscriptsuperscriptnorm𝑢𝑇1superscript𝐻2Ω\alpha^{1/2+d/8}=O(\sigma n^{-1/2}\|u(\cdot,T)\|^{-1}_{H^{2}(\Omega)}),italic_α start_POSTSUPERSCRIPT 1 / 2 + italic_d / 8 end_POSTSUPERSCRIPT = italic_O ( italic_σ italic_n start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ) ,

then the solution m𝑚mitalic_m of (3.18) achieves the optimal convergence

𝔼[u(,T)mL2(Ω)]Cα1/2u(,T)H2(Ω).𝔼delimited-[]subscriptnorm𝑢𝑇𝑚superscript𝐿2Ω𝐶superscript𝛼12subscriptnorm𝑢𝑇superscript𝐻2Ω\mathbb{E}\big{[}\|u(\cdot,T)-m\|_{L^{2}(\Omega)}\big{]}\leq C\alpha^{1/2}\|u(% \cdot,T)\|_{H^{2}(\Omega)}.blackboard_E [ ∥ italic_u ( ⋅ , italic_T ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ] ≤ italic_C italic_α start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT . (3.19)

And if the noise {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent Gaussian random variables with variance σ𝜎\sigmaitalic_σ, we further have,

(u(,T)mL2(Ω)α1/2u(,T)H2(Ω)z)2eCz2.subscriptnorm𝑢𝑇𝑚superscript𝐿2Ωsuperscript𝛼12subscriptnorm𝑢𝑇superscript𝐻2Ω𝑧2superscript𝑒𝐶superscript𝑧2\mathbb{P}(\|u(\cdot,T)-m\|_{L^{2}(\Omega)}\geq\alpha^{1/2}\|u(\cdot,T)\|_{H^{% 2}(\Omega)}z)\leq 2e^{-Cz^{2}}.blackboard_P ( ∥ italic_u ( ⋅ , italic_T ) - italic_m ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ≥ italic_α start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT italic_z ) ≤ 2 italic_e start_POSTSUPERSCRIPT - italic_C italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT . (3.20)

Using this recovered function m(x)𝑚𝑥m(x)italic_m ( italic_x ), we generate the adjoint POD basis in Section 2.1. It can be easily shown that, with uncertainty, the POD basis is still a good low rank approximation of the snapshots {y1,,yM}subscript𝑦1subscript𝑦𝑀\{y_{1},...,y_{M}\}{ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }. Combining the Theorem 3.2 and (3.19), we shall have that for any 1iM1𝑖𝑀1\leq i\leq M1 ≤ italic_i ≤ italic_M,

yiPpodyiL2(Ω)2C(Me2μLTρ+α)gL2(Ω)2,superscriptsubscriptnormsubscript𝑦𝑖subscript𝑃podsubscript𝑦𝑖superscript𝐿2Ω2𝐶𝑀superscript𝑒2subscript𝜇𝐿𝑇𝜌𝛼subscriptsuperscriptnorm𝑔2superscript𝐿2Ω\left|\left|y_{i}-P_{\text{pod}}y_{i}\right|\right|_{L^{2}(\Omega)}^{2}\leq C(% Me^{2\mu_{L}T}\rho+\alpha)\|g\|^{2}_{L^{2}(\Omega)},| | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_C ( italic_M italic_e start_POSTSUPERSCRIPT 2 italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT italic_ρ + italic_α ) ∥ italic_g ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT , (3.21)

Since we replace the source term by a finite truncation (3.5), we assume

ggappL22ε(L).subscriptsuperscriptnorm𝑔subscript𝑔app2superscript𝐿2𝜀𝐿\|g-g_{\text{app}}\|^{2}_{L^{2}}\leq\varepsilon(L).∥ italic_g - italic_g start_POSTSUBSCRIPT app end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≤ italic_ε ( italic_L ) . (3.22)

With those results, using a similar technique to prove the Theorem 3.7 in [18], we have the following convergence results.

Theorem 3.3

Let {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT be independent random variables satisfying 𝔼[ei]=0𝔼delimited-[]subscript𝑒𝑖0\mathbb{E}[e_{i}]=0blackboard_E [ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] = 0 and 𝔼[ei2]σ2𝔼delimited-[]subscriptsuperscript𝑒2𝑖superscript𝜎2\mathbb{E}[e^{2}_{i}]\leq\sigma^{2}blackboard_E [ italic_e start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ≤ italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for i=1,,n𝑖1𝑛i=1,\cdots,nitalic_i = 1 , ⋯ , italic_n. Set α1/2+d/8=O(σn1/2u(,T)H2(Ω)1)superscript𝛼12𝑑8𝑂𝜎superscript𝑛12subscriptsuperscriptnorm𝑢𝑇1superscript𝐻2Ω\alpha^{1/2+d/8}=O(\sigma n^{-1/2}\|u(\cdot,T)\|^{-1}_{H^{2}(\Omega)})italic_α start_POSTSUPERSCRIPT 1 / 2 + italic_d / 8 end_POSTSUPERSCRIPT = italic_O ( italic_σ italic_n start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ) in (2.23), and λ=O(Me2μLTρ+α)𝜆𝑂𝑀superscript𝑒2subscript𝜇𝐿𝑇𝜌𝛼\lambda=O(Me^{2\mu_{L}T}\rho+\alpha)italic_λ = italic_O ( italic_M italic_e start_POSTSUPERSCRIPT 2 italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT italic_ρ + italic_α ), then

𝔼[𝒮g𝒮podgpodL2(Ω)2]CλgL2(Ω)2+Cε,𝔼delimited-[]superscriptsubscriptnorm𝒮superscript𝑔subscript𝒮podsubscript𝑔podsuperscript𝐿2Ω2𝐶𝜆subscriptsuperscriptnormsuperscript𝑔2superscript𝐿2Ω𝐶𝜀\displaystyle\mathbb{E}\big{[}\|\mathcal{S}g^{*}-\mathcal{S}_{\text{pod}}g_{% \text{pod}}\|_{L^{2}(\Omega)}^{2}\big{]}\leq C\lambda\|g^{*}\|^{2}_{L^{2}(% \Omega)}+C\varepsilon,blackboard_E [ ∥ caligraphic_S italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - caligraphic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ italic_C italic_λ ∥ italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT + italic_C italic_ε , (3.23)
𝔼[ggpodL2(Ω)2]CgL2(Ω)2+Cε.𝔼delimited-[]superscriptsubscriptnormsuperscript𝑔subscript𝑔podsuperscript𝐿2Ω2𝐶subscriptsuperscriptnormsuperscript𝑔2superscript𝐿2Ω𝐶𝜀\displaystyle\mathbb{E}\big{[}\|g^{*}-g_{\text{pod}}\|_{L^{2}(\Omega)}^{2}\big% {]}\leq C\|g^{*}\|^{2}_{L^{2}(\Omega)}+C\varepsilon.blackboard_E [ ∥ italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_g start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ italic_C ∥ italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT + italic_C italic_ε . (3.24)

Furthermore, if we assume the noise {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent Gaussian random variables with variance σ𝜎\sigmaitalic_σ, we will have a stronger type of convergence, one can refer to [18] for a similar proof. We just list the results here.

Theorem 3.4

Let {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT be independent Gaussian random variables with variance σ𝜎\sigmaitalic_σ. Set α1/2+d/8=O(σn1/2u(,T)H2(Ω)1)superscript𝛼12𝑑8𝑂𝜎superscript𝑛12subscriptsuperscriptnorm𝑢𝑇1superscript𝐻2Ω\alpha^{1/2+d/8}=O(\sigma n^{-1/2}\|u(\cdot,T)\|^{-1}_{H^{2}(\Omega)})italic_α start_POSTSUPERSCRIPT 1 / 2 + italic_d / 8 end_POSTSUPERSCRIPT = italic_O ( italic_σ italic_n start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT ∥ italic_u ( ⋅ , italic_T ) ∥ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ) in (2.23), and λ=O(Me2μLTρ+α)𝜆𝑂𝑀superscript𝑒2subscript𝜇𝐿𝑇𝜌𝛼\lambda=O(Me^{2\mu_{L}T}\rho+\alpha)italic_λ = italic_O ( italic_M italic_e start_POSTSUPERSCRIPT 2 italic_μ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT italic_T end_POSTSUPERSCRIPT italic_ρ + italic_α ), then there exists a constant C, for any z>0𝑧0z>0italic_z > 0,

(SpodgpodSgL2(Ω)(λ1/2gL2+ε)z)2eCz2,subscriptnormsubscript𝑆podsubscript𝑔pod𝑆superscript𝑔superscript𝐿2Ωsuperscript𝜆12subscriptnormsuperscript𝑔superscript𝐿2𝜀𝑧2superscript𝑒𝐶superscript𝑧2\displaystyle\mathbb{P}(\|S_{\text{pod}}g_{\text{pod}}-Sg^{*}\|_{L^{2}(\Omega)% }\geq(\lambda^{1/2}\|g^{*}\|_{L^{2}}+\varepsilon)z)\leq 2e^{-Cz^{2}},blackboard_P ( ∥ italic_S start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT - italic_S italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ≥ ( italic_λ start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∥ italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ε ) italic_z ) ≤ 2 italic_e start_POSTSUPERSCRIPT - italic_C italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT , (3.25)
(gpodgL2(Ω)(gL2+ε)z)2eCz2.subscriptnormsubscript𝑔podsuperscript𝑔superscript𝐿2Ωsubscriptnormsuperscript𝑔superscript𝐿2𝜀𝑧2superscript𝑒𝐶superscript𝑧2\displaystyle\mathbb{P}(\|g_{\text{pod}}-g^{*}\|_{L^{2}(\Omega)}\geq(\|g^{*}\|% _{L^{2}}+\varepsilon)z)\leq 2e^{-Cz^{2}}.blackboard_P ( ∥ italic_g start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT - italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT ≥ ( ∥ italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ε ) italic_z ) ≤ 2 italic_e start_POSTSUPERSCRIPT - italic_C italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT . (3.26)

4 Numerical examples

In this section, we present several numerical examples to demonstrate the reconstruction results for the inverse source problem and the backward problem discussed in this paper. We consider the domain Ω=[0,π]2Ωsuperscript0𝜋2\Omega=[0,\pi]^{2}roman_Ω = [ 0 , italic_π ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. For each observation data set, we first apply the backward Euler scheme in time and the linear finite element method (specifically, the P1 element) in space with a mesh size of h=1/50150h=1/50italic_h = 1 / 50 and a time step of Δt=T/400Δ𝑡𝑇400\Delta t=T/400roman_Δ italic_t = italic_T / 400. We select 9 POD basis functions to compute the inverse problems, whereas, for the finite element method, there are approximately 2500 basis functions.

In [17], the authors have already compared the efficiency of the POD method and the finite element method for solving this inverse problem. They demonstrate that the POD method achieves a speed-up of at least 6 times, even with 400 finite element basis functions. Therefore, we do not include a comparison with the finite element method in this paper. As the number of finite element basis functions increases, the potential for the POD method to achieve greater speed-up also grows correspondingly.

4.1 Inverse source examples

In the following examples, we apply the adjoint POD method to recover the source term f𝑓fitalic_f as described in Section 2. We obtain the data for the forward problem with the exact source term f𝑓fitalic_f at the final time T=1𝑇1T=1italic_T = 1.

Example 4.1 We first demonstrate the importance of choosing the appropriate POD basis functions. For the same source term, we apply different right-hand sides of equation (2.1) to obtain the POD basis functions. Subsequently, we solve the inverse source problem using these different POD basis functions.

Figure 1 illustrates this process. The true source term is given by sin(2x)sin(2y)ex+yπ2𝑥2𝑦superscript𝑒𝑥𝑦𝜋\sin(2x)\sin(2y)e^{\frac{x+y}{\pi}}roman_sin ( 2 italic_x ) roman_sin ( 2 italic_y ) italic_e start_POSTSUPERSCRIPT divide start_ARG italic_x + italic_y end_ARG start_ARG italic_π end_ARG end_POSTSUPERSCRIPT, and its surface plot is shown in Figure 1 (a). Figure 1 (b) presents the reconstruction result obtained using the adjoint POD method proposed in this paper, indicating that our new method effectively recovers the source term in an efficient manner. Figure 1 (c) displays the result when an incorrect right-hand side is used to generate the POD basis functions. In this case, we use sin(x)sin(y)𝑥𝑦\sin(x)\sin(y)roman_sin ( italic_x ) roman_sin ( italic_y ) as the right-hand side in (2.1) to generate the POD basis functions. Figure 1 (d) shows the result when we use an A-shaped function as the right-hand side to derive the POD basis.

As can be seen, Figure 1 (b) provides a good reconstruction, whereas Figures 1 (c) and (d) yield inaccurate results. The result in Figure 1 (d) is particularly striking, as the recovered image deviates significantly from the exact source term.

In this example, we demonstrate the importance of selecting the appropriate basis functions for solving inverse problems. Utilizing an unsuitable set of basis functions can lead to wrong results. Our proposed adjoint POD method offers a set of suitable basis functions for such problems. In the following examples, we will compare our adjoint POD basis functions with the original true POD basis to validate Theorem 2.3.

Refer to caption
(a) Exact source term
Refer to caption
(b) Reconstructed by the Adjoint POD method
Refer to caption
(c) Using the POD basis generated with the right hand side sin(x)sin(y)𝑥𝑦\sin(x)\sin(y)roman_sin ( italic_x ) roman_sin ( italic_y )
Refer to caption
(d) Using the POD basis generated with the right hand side of A shaped function
Figure 1: The importance of choice of POD basis

Example 4.2 In this example, we will first use the true source term as the right-hand side in (2.1) to generate the POD basis. Then, we will generate the POD basis using our proposed adjoint POD method. To validate Theorem 2.3, we will compare both sets of basis functions and assess their similarity. Figure LABEL:basis-comp-source-sin shows the results when using the exact source term f=sin(2x)sin(2y)superscript𝑓2𝑥2𝑦f^{*}=\sin(2x)\sin(2y)italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_sin ( 2 italic_x ) roman_sin ( 2 italic_y ). Figure LABEL:basis-comp-source-sin (b) and Figure LABEL:basis-comp-source-sin (d) show that both the traditional POD and our adjoint POD work well to recover the true source term. However, our adjoint POD method does not require prior knowledge of the exact source term, while the traditional POD method does, leading to the so-called inverse crime. The basis functions for each method are depicted in Figure LABEL:basis-comp-source-sin (c) and Figure LABEL:basis-comp-source-sin (e), demonstrating that our adjoint POD basis is highly similar to the traditional one, even though we derived it solely from measured data.

Figure LABEL:basis-comp-source-Z presents the results when using an exact source term fsuperscript𝑓f^{*}italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in the form of a Z-shaped function. This example also illustrates the efficiency of the POD method in solving inverse problems compared to the finite element method. Figure LABEL:basis-comp-source-Z (c) and Figure LABEL:basis-comp-source-Z (e) show that both the basis functions of the traditional POD method and our adjoint POD method contain the critical information of the exact source term that we aim to recover. In contrast, the basis functions of finite element method do not contain any prior information about the true function we need to recover.

In the aforementioned cases, all the measured data were noise-free. We will now test the denoising method described in Section 2.3 by examining highly challenging cases with noise levels ranging from 10%percent1010\%10 % to 50%percent5050\%50 %.

Example 4.3 In this example, we will evaluate the robustness of the adjoint POD method in the presence of noise. We consider the measurement data to be min=u(di,T)+σeisubscriptsuperscript𝑚𝑛𝑖𝑢subscript𝑑𝑖𝑇𝜎subscript𝑒𝑖m^{n}_{i}=u(d_{i},T)+\sigma e_{i}italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_u ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_T ) + italic_σ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i=1,,n𝑖1𝑛i=1,\cdots,nitalic_i = 1 , ⋯ , italic_n, where disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT represents positions within the domain ΩΩ\Omegaroman_Ω, and {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent standard normal random variables. We will take 2500 positions disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT uniformly distributed over the domain ΩΩ\Omegaroman_Ω.

Figure 4 demonstrates the robustness of our method in the presence of significant noise. Even with a 50% noise level, where the measured data is entirely obscured by noise as shown in Figure 4 (c), our method is still able to recover the source term as depicted in Figure 4 (f).

Refer to caption
(a) Exact source term
Refer to caption
(b) Measured data with 25%percent\%% noise
Refer to caption
(c) Measured data with 50%percent\%% noise
Refer to caption
(d) Recovered result for 10%percent\%% noise
Refer to caption
(e) Recovered result for 25%percent\%% noise
Refer to caption
(f) Recovered result for 50%percent\%% noise
Figure 4: Robustness of the adjoint POD against the noise for f=sin(2x)sin(2y)ex+yπ𝑓𝑠𝑖𝑛2𝑥𝑠𝑖𝑛2𝑦superscript𝑒𝑥𝑦𝜋f=sin(2x)sin(2y)e^{\frac{x+y}{\pi}}italic_f = italic_s italic_i italic_n ( 2 italic_x ) italic_s italic_i italic_n ( 2 italic_y ) italic_e start_POSTSUPERSCRIPT divide start_ARG italic_x + italic_y end_ARG start_ARG italic_π end_ARG end_POSTSUPERSCRIPT.

4.2 Examples for backward problem

In this subsection, we will apply the new POD method to recover the initial term g𝑔gitalic_g as discussed in Section 3. We will collect the data at the time T=0.05𝑇0.05T=0.05italic_T = 0.05.

Example 4.4 We will demonstrate the importance of selecting the appropriate POD basis functions. For the same source term, we apply different right-hand sides of equation (3.1) to derive the POD basis functions. Then, we solve the backward problem using different POD basis functions.

Figure 5 illustrates the corresponding results. The true source term is sin(2x)sin(2y)ex+yπ2𝑥2𝑦superscript𝑒𝑥𝑦𝜋\sin(2x)\sin(2y)e^{\frac{x+y}{\pi}}roman_sin ( 2 italic_x ) roman_sin ( 2 italic_y ) italic_e start_POSTSUPERSCRIPT divide start_ARG italic_x + italic_y end_ARG start_ARG italic_π end_ARG end_POSTSUPERSCRIPT, and its surface plot is shown in Figure 5 (a). Figure 5 (b) presents the reconstruction result using our adjoint POD method proposed in this paper, which demonstrates the effectiveness and efficiency of our new POD method in recovering the source term. Figure 5 (c) shows the result when we use an incorrect right-hand side to generate the POD basis functions. In this case, we use sin(x)sin(y)𝑥𝑦\sin(x)\sin(y)roman_sin ( italic_x ) roman_sin ( italic_y ) as the right-hand side in (3.1) to generate the POD basis functions. Figure 5 (d) displays the result when we use an A-shaped function as the right-hand side to generate the POD basis functions. It can be observed that Figure 5 (b) provides a good reconstruction, while Figure 5 (c) and Figure 5 (d) yield incorrect results. Particularly in Figure 5 (d), the recovered image is entirely different from the exact source term.

In this example, we demonstrate the importance of choosing the correct basis to solve the inverse problem. Using an inappropriate set of basis functions may lead to unsatisfactory results. Our proposed adjoint POD provides a set of suitable basis functions. In the upcoming examples, we will compare our adjoint POD basis functions with the original true POD basis functions to verify Theorem 3.2.

Refer to caption
(a) Exact initial term
Refer to caption
(b) POD method proposed in this paper
Refer to caption
(c) Using the POD basis generated with the right hand side sin(x)sin(y)𝑠𝑖𝑛𝑥𝑠𝑖𝑛𝑦sin(x)sin(y)italic_s italic_i italic_n ( italic_x ) italic_s italic_i italic_n ( italic_y )
Refer to caption
(d) Using the POD basis generated with the right hand side of A-shaped function
Figure 5: The importance of choice of POD basis

Example 4.5 In the following two examples, we first use the true source term as the right-hand side in Eq. (3.1) to generate the POD basis functions. Then, we generate the POD basis functions using our proposed adjoint POD method. To verify Theorem 3.2, we will plot two sets of basis functions to determine if they are closely related. Figure LABEL:fig:ini-eg_sin2exp-basis shows the results using the exact source term g=sin(2x)sin(2y)ex+yπsuperscript𝑔2𝑥2𝑦superscript𝑒𝑥𝑦𝜋g^{*}=\sin(2x)\sin(2y)e^{\frac{x+y}{\pi}}italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_sin ( 2 italic_x ) roman_sin ( 2 italic_y ) italic_e start_POSTSUPERSCRIPT divide start_ARG italic_x + italic_y end_ARG start_ARG italic_π end_ARG end_POSTSUPERSCRIPT. Figures LABEL:fig:ini-eg_sin2exp-basis (b) and LABEL:fig:ini-eg_sin2exp-basis (d) demonstrate that both the traditional POD and our adjoint POD methods work well in recovering the true initial term. However, the difference is that our adjoint POD method does not require prior knowledge of the exact initial term, while the traditional POD method does, which is known as the inverse crime. Figures LABEL:fig:ini-eg_sin2exp-basis (c) and LABEL:fig:ini-eg_sin2exp-basis (e) display the basis functions of each method. We can conclude that the basis functions of our adjoint POD method are very close to the traditional one, and we obtain it solely from the measured data.

Figure LABEL:fig:ini-eg_A-basis shows the results using the exact initial term gsuperscript𝑔g^{*}italic_g start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT of the A-shaped function. This example also illustrates why the POD method is more efficient in solving inverse problems compared to the finite element method. Figures LABEL:fig:ini-eg_A-basis (c) and LABEL:fig:ini-eg_A-basis (e) reveal that both the traditional POD method and our adjoint POD method’s basis functions contain critical information about the exact initial term we aim to recover. In contrast, the finite element method’s basis lacks any priori information about the true function we need to recover.

In the above cases, all the measured data are noise-free. We will now test the denoising method discussed in Section 3.2 by examining very challenging cases with noise levels ranging from 10%percent1010\%10 % to 50%percent5050\%50 %.

Example 4.6 In this example, we will test the robustness of the adjoint POD method against noise. We take the measurement data as min=u(di,T)+σeisubscriptsuperscript𝑚𝑛𝑖𝑢subscript𝑑𝑖𝑇𝜎subscript𝑒𝑖m^{n}_{i}=u(d_{i},T)+\sigma e_{i}italic_m start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_u ( italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_T ) + italic_σ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, for i=1,,n𝑖1𝑛i=1,\cdots,nitalic_i = 1 , ⋯ , italic_n, where disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPTs represent the positions inside of ΩΩ\Omegaroman_Ω, and {ei}i=1nsubscriptsuperscriptsubscript𝑒𝑖𝑛𝑖1\{e_{i}\}^{n}_{i=1}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT are independent standard normal random variables. We will use 2500 positions disubscript𝑑𝑖d_{i}italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT uniformly distributed over the domain ΩΩ\Omegaroman_Ω. Figure 8 demonstrates that our method is robust even in the presence of big noise. Remarkably, even with 50%percent5050\%50 % noise, when the measured data is completely obscured by noise as shown in Figure 8 (c), we can still recover the source term, as seen in Figure 8 (f).

Refer to caption
(a) Exact initial term
Refer to caption
(b) Measured data with 25%percent\%% noise
Refer to caption
(c) Measured data with 50%percent\%% noise
Refer to caption
(d) Recovered result for 10%percent\%% noise
Refer to caption
(e) Recovered result for 25%percent\%% noise
Refer to caption
(f) Recovered result for 50%percent\%% noise
Figure 8: Robustness of the adjoint POD against the noise for g=sin(2x)sin(2y)𝑔𝑠𝑖𝑛2𝑥𝑠𝑖𝑛2𝑦g=sin(2x)sin(2y)italic_g = italic_s italic_i italic_n ( 2 italic_x ) italic_s italic_i italic_n ( 2 italic_y ).

Example 4.7 Finally, we study a more interesting case. While the inverse source problem and the backward problem are two distinct problems, we have observed from the previous numerical examples that they share some commonalities. Specifically, the POD basis for both problems contains critical information about the functions one wants to recover. As a result, we will employ the POD basis functions derived from the inverse source problem to solve the backward problem. Please refer to Figure LABEL:fig:ini-eg_A_using_source-basis for the reconstruction results.

5 Conclusion

We have developed a data-driven and model-based approach for solving parabolic inverse source problems with uncertain data. The key idea is to exploit the model-based intrinsic low-dimensional structure of the underlying parabolic PDEs and construct data-based POD basis functions to achieve significant dimension reduction in the solution space. Equipped with the POD basis functions, we develop a fast algorithm that can compute the optimization problem in the inverse source problems. Hence, we obtain an effective data-driven and model-based approach for the inverse source problems and overcome the typical computational bottleneck of FEM in solving these problems. Under a weak assumption on the regularity of the solution, we provide the convergence analysis of our POD algorithm in solving the forward parabolic PDEs and thus obtain the error estimate of the POD algorithm for the parabolic inverse source problems. Finally, we carry out numerical experiments to demonstrate the accuracy and efficiency of the proposed method. We also study other issues of the POD algorithm, such as the dependence of the error on the mesh size, the regularization parameter in the least-squares regularized minimization problems, and the number of POD basis functions. Through numerical results, we find that our POD algorithm provides significant computational savings over the FEM while yielding as good approximations as the FEM. We expect an even better performance of efficiency can be obtained for 3D problems, which will be studied in our future works.

Acknowledgement

The research of W. Zhang is supported by the National Natural Science Foundation of China No. 12371423 and 12241104. The research of Z. Zhang is supported by Hong Kong RGC grant project 17307921, National Natural Science Foundation of China No. 12171406, Seed Funding for Strategic Interdisciplinary Research Scheme 2021/22 (HKU), and Seed Funding from the HKU-TCL Joint Research Centre for Artificial Intelligence.

Appendix A Proper orthogonal decomposition (POD) method

Assuming that uH01(Ω)𝑢subscriptsuperscript𝐻10Ωu\in H^{1}_{0}(\Omega)italic_u ∈ italic_H start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( roman_Ω ) is the solution to the weak formulation of the parabolic equation (1.1), the construction of POD basis functions requires solution snapshots. These solution snapshots can be obtained by the appropriate technological means related to a specific application, such as experimental data or numerical methods.

Given a set of solutions at different time instances {u(,t0),u(,t1),,u(,tM)}𝑢subscript𝑡0𝑢subscript𝑡1𝑢subscript𝑡𝑀\big{\{}u(\cdot,t_{0}),u(\cdot,t_{1}),\ldots,u(\cdot,t_{M})\big{\}}{ italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) }, where tk=kΔtsubscript𝑡𝑘𝑘Δ𝑡t_{k}=k\Delta titalic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_k roman_Δ italic_t with Δt=TMΔ𝑡𝑇𝑀\Delta t=\frac{T}{M}roman_Δ italic_t = divide start_ARG italic_T end_ARG start_ARG italic_M end_ARG and k=0,,M𝑘0𝑀k=0,\ldots,Mitalic_k = 0 , … , italic_M, we first obtain the solution snapshots {y1,,yM+1,\{y_{1},\ldots,y_{M+1},{ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_M + 1 end_POSTSUBSCRIPT , yM+2,,y2M+1}y_{M+2},\ldots,y_{2M+1}\}italic_y start_POSTSUBSCRIPT italic_M + 2 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT 2 italic_M + 1 end_POSTSUBSCRIPT }, where yk=u(,tk1)subscript𝑦𝑘𝑢subscript𝑡𝑘1y_{k}=u(\cdot,t_{k-1})italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ), k=1,,M+1𝑘1𝑀1k=1,\ldots,M+1italic_k = 1 , … , italic_M + 1, and yk=¯u(,tkM1)subscript𝑦𝑘¯𝑢subscript𝑡𝑘𝑀1y_{k}=\overline{\partial}u(\cdot,t_{k-M-1})italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = over¯ start_ARG ∂ end_ARG italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k - italic_M - 1 end_POSTSUBSCRIPT ), k=M+2,,2m+1𝑘𝑀22𝑚1k=M+2,\ldots,2m+1italic_k = italic_M + 2 , … , 2 italic_m + 1 with ¯u(,tk)=u(,tk)u(,tk1)Δt¯𝑢subscript𝑡𝑘𝑢subscript𝑡𝑘𝑢subscript𝑡𝑘1Δ𝑡\overline{\partial}u(\cdot,t_{k})=\frac{u(\cdot,t_{k})-u(\cdot,t_{k-1})}{% \Delta t}over¯ start_ARG ∂ end_ARG italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = divide start_ARG italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ) end_ARG start_ARG roman_Δ italic_t end_ARG, k=1,,M𝑘1𝑀k=1,\ldots,Mitalic_k = 1 , … , italic_M.

Then, the POD basis functions {ψk}k=1Npodsuperscriptsubscriptsubscript𝜓𝑘𝑘1subscript𝑁pod\{\psi_{k}\}_{k=1}^{{N_{\text{pod}}}}{ italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUPERSCRIPT are constructed by minimizing the following projection error:

12m+1(j=0Mu(tj)k=1Npod(u(tj),ψk)L2(Ω)ψkL2(Ω)2\displaystyle\frac{1}{2m+1}\Big{(}\sum_{j=0}^{M}\big{\|}u(t_{j})-\sum_{k=1}^{{% N_{\text{pod}}}}(u(t_{j}),\psi_{k})_{L^{2}(\Omega)}\psi_{k}\big{\|}_{L^{2}(% \Omega)}^{2}divide start_ARG 1 end_ARG start_ARG 2 italic_m + 1 end_ARG ( ∑ start_POSTSUBSCRIPT italic_j = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ∥ italic_u ( italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_u ( italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (A.1)
+j=1M¯u(tj)k=1Npod(¯u(tj),ψk)L2(Ω)ψkL2(Ω)2)\displaystyle+\sum_{j=1}^{M}\big{\|}\overline{\partial}u(t_{j})-\sum_{k=1}^{{N% _{\text{pod}}}}(\overline{\partial}u(t_{j}),\psi_{k})_{L^{2}(\Omega)}\psi_{k}% \big{\|}_{L^{2}(\Omega)}^{2}\Big{)}+ ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ∥ over¯ start_ARG ∂ end_ARG italic_u ( italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( over¯ start_ARG ∂ end_ARG italic_u ( italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) (A.2)

subject to the constraints that (ψk1(),ψk2())L2(Ω)=δk1k2subscriptsubscript𝜓subscript𝑘1subscript𝜓subscript𝑘2superscript𝐿2Ωsubscript𝛿subscript𝑘1subscript𝑘2\big{(}\psi_{k_{1}}(\cdot),\psi_{k_{2}}(\cdot)\big{)}_{L^{2}(\Omega)}=\delta_{% k_{1}k_{2}}( italic_ψ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ) , italic_ψ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ) ) start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT = italic_δ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, 1k1,k2Npodformulae-sequence1subscript𝑘1subscript𝑘2subscript𝑁pod1\leq k_{1},k_{2}\leq N_{\text{pod}}1 ≤ italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT, where δk1k2=1subscript𝛿subscript𝑘1subscript𝑘21\delta_{k_{1}k_{2}}=1italic_δ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 1 if k1=k2subscript𝑘1subscript𝑘2k_{1}=k_{2}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, otherwise δk1k2=0subscript𝛿subscript𝑘1subscript𝑘20\delta_{k_{1}k_{2}}=0italic_δ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 0. Here, we use Npodsubscript𝑁podN_{\text{pod}}italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT to denote the number of POD basis functions that will be extracted from solution snapshots.

Let Vpod=span{ψ1,,ψNpod}subscript𝑉podspansubscript𝜓1subscript𝜓subscript𝑁pod{V_{\text{pod}}}=\text{span}\{\psi_{1},\ldots,\psi_{N_{\text{pod}}}\}italic_V start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT = span { italic_ψ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ψ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUBSCRIPT } denote the finite-dimensional space spanned by the POD basis functions. Using the method of snapshot proposed by Sirovich [15], we know that the minimizing problem can be reduced to the following eigenvalue problem:

Kv=μv,𝐾𝑣𝜇𝑣Kv=\mu v,italic_K italic_v = italic_μ italic_v , (A.3)

where the correlation matrix K𝐾Kitalic_K is computed from the solution snapshots {y1,y2,,y2M+1}subscript𝑦1subscript𝑦2subscript𝑦2𝑀1\{y_{1},y_{2},\ldots,y_{2M+1}\}{ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT 2 italic_M + 1 end_POSTSUBSCRIPT } with entries Kij=(yi,yj)L2(Ω)subscript𝐾𝑖𝑗subscriptsubscript𝑦𝑖subscript𝑦𝑗superscript𝐿2ΩK_{ij}=(y_{i},y_{j})_{L^{2}(\Omega)}italic_K start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT, i,j=1,,2M+1formulae-sequence𝑖𝑗12𝑀1i,j=1,\ldots,2M+1italic_i , italic_j = 1 , … , 2 italic_M + 1, and K𝐾Kitalic_K is symmetric and semi-positive definite. We sort the eigenvalues in a decreasing order as λ1λ2λ2m+1subscript𝜆1subscript𝜆2subscript𝜆2𝑚1\lambda_{1}\geq\lambda_{2}\geq...\geq\lambda_{2m+1}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … ≥ italic_λ start_POSTSUBSCRIPT 2 italic_m + 1 end_POSTSUBSCRIPT and the corresponding eigenvectors are denoted by vksubscript𝑣𝑘v_{k}italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, k=1,,2M+1𝑘12𝑀1k=1,...,2M+1italic_k = 1 , … , 2 italic_M + 1. It can be shown that if the POD basis functions are constructed by

φk()=1λkj=12M+1(vk)ju(,tj),1kNpod,formulae-sequencesubscript𝜑𝑘1subscript𝜆𝑘superscriptsubscript𝑗12𝑀1subscriptsubscript𝑣𝑘𝑗𝑢subscript𝑡𝑗1𝑘subscript𝑁pod\varphi_{k}(\cdot)=\frac{1}{\sqrt{\lambda_{k}}}\sum_{j=1}^{2M+1}(v_{k})_{j}u(% \cdot,t_{j}),\quad 1\leq k\leq N_{\text{pod}},italic_φ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( ⋅ ) = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT ( italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_u ( ⋅ , italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , 1 ≤ italic_k ≤ italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT , (A.4)

where (vk)jsubscriptsubscript𝑣𝑘𝑗(v_{k})_{j}( italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is the j𝑗jitalic_j-th component of the eigenvector vksubscript𝑣𝑘v_{k}italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, they minimize the projection error.

The approximation error for the POD method has been studied extensively in the literature, particularly in the works [10] and [3].

Proposition A.1 (Sec. 3.3.2, [10] or p. 502, [3])

Let λ1λ2λ2M+10subscript𝜆1subscript𝜆2subscript𝜆2𝑀10\lambda_{1}\geq\lambda_{2}\geq...\geq\lambda_{2M+1}\geq 0italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … ≥ italic_λ start_POSTSUBSCRIPT 2 italic_M + 1 end_POSTSUBSCRIPT ≥ 0 denote the non-negative eigenvalues of K𝐾Kitalic_K in the eigenvalue problem (A.3). Then, {ψk}k=1Npodsuperscriptsubscriptsubscript𝜓𝑘𝑘1subscript𝑁pod\{\psi_{k}\}_{k=1}^{N_{\text{pod}}}{ italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUPERSCRIPT constructed according to the method of snapshots (A.4) is the set of POD basis functions, and we have the following error formula:

i=12M+1y~ik=1Npod(y~i,ψk())L2(Ωψk()L2(Ω)2i=12M+1y~iL2(Ω)2=k=Npod+12M+1λkk=12M+1λk,\frac{\sum_{i=1}^{2M+1}\left|\left|\tilde{y}_{i}-\sum_{k=1}^{{N_{\text{pod}}}}% \big{(}\tilde{y}_{i},\psi_{k}(\cdot)\big{)}_{L^{2}(\Omega}\psi_{k}(\cdot)% \right|\right|_{L^{2}(\Omega)}^{2}}{\sum_{i=1}^{2M+1}\left|\left|\tilde{y}_{i}% \right|\right|_{L^{2}(\Omega)}^{2}}=\frac{\sum_{k={N_{\text{pod}}}+1}^{2M+1}% \lambda_{k}}{\sum_{k=1}^{2M+1}\lambda_{k}},divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( ⋅ ) ) start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω end_POSTSUBSCRIPT italic_ψ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( ⋅ ) | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT | | over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( roman_Ω ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG , (A.5)

where the number Npodsubscript𝑁podN_{\text{pod}}italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT is determined according to the decay of the ratio ρ=k=Npod+12M+1λkk=12M+1λk𝜌superscriptsubscript𝑘subscript𝑁pod12𝑀1subscript𝜆𝑘superscriptsubscript𝑘12𝑀1subscript𝜆𝑘\rho=\frac{\sum_{k={N_{\text{pod}}}+1}^{2M+1}\lambda_{k}}{\sum_{k=1}^{2M+1}% \lambda_{k}}italic_ρ = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = italic_N start_POSTSUBSCRIPT pod end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_M + 1 end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG.

References

  • [1] S. Agmon. Lectures on Elliptic Boundary Value Problems. Van Norstrand, Princeton, NJ, 1965.
  • [2] A. Alla and M. Falcone. A time-adaptive POD method for optimal control problems. IFAC Proceedings Volumes, 46(26):245–250, 2013.
  • [3] P. Benner, S. Gugercin, and K. Willcox. A survey of projection-based model reduction methods for parametric dynamical systems. SIAM Review, 57(4):483–531, 2015.
  • [4] G. Berkooz, P. Holmes, and J. Lumley. The proper orthogonal decomposition in the analysis of turbulent flows. Annual review of fluid mechanics, 25(1):539–575, 1993.
  • [5] Z. Chen, R. Tuo, and W. Zhang. Stochastic convergence of a nonconforming finite element method for the thin plate spline smoother for observational data. SIAM Journal on Numerical Analysis, 56(2):635–659, 2018.
  • [6] Z. Chen, W. Zhang, and J. Zou. Stochastic convergence of regularized solutions and their finite element approximations to inverse source problems. SIAM Journal on Numerical Analysis, 60(2):751–780, 2022.
  • [7] J. Fleckinger and M. Lapidus. Eigenvalues of elliptic boundary value problems with an indefinite weight function. Transactions of the American Mathematical Society, 295(1):305–324, 1986.
  • [8] H. Gu, J. Xin, and Z. Zhang. Error estimates for a POD method for solving viscous G-equations in incompressible cellular flows. SIAM Journal on Scientific Computing, 43(1):A636–A662, 2021.
  • [9] J. Hesthaven, G. Rozza, and B. Stamm. Certified reduced basis methods for parametrized partial differential equations. Springer, 2016.
  • [10] P. Holmes, J. Lumley, and G. Berkooz. Turbulence, coherent structures, dynamical systems and symmetry. Cambridge University Press, 1998.
  • [11] K. Kunisch and S. Volkwein. Galerkin proper orthogonal decomposition methods for parabolic problems. Numerische Mathematik, 90(1):117–148, 2001.
  • [12] K. Kunisch, S. Volkwein, and L. Xie. HJB-POD-based feedback design for the optimal control of evolution problems. SIAM Journal on Applied Dynamical Systems, 3(4):701–722, 2004.
  • [13] A. Quarteroni, A. Manzoni, and F. Negri. Reduced basis methods for partial differential equations: an introduction, volume 92. Springer, 2015.
  • [14] L. Sirovich. Turbulence and the dynamics of coherent structures. I. Coherent structures. Quarterly of applied mathematics, 45(3):561–571, 1987.
  • [15] L. Sirovich. Turbulence and the dynamics of coherent structures. I. Coherent structures. Quarterly of applied mathematics, 45(3):561–571, 1987.
  • [16] S. Volkwein. Proper orthogonal decomposition: Theory and reduced-order modelling. Lecture Notes, University of Konstanz, 4(4), 2013.
  • [17] Z. Wang, W. Zhang, and Z. Zhang. A data-driven model reduction method for parabolic inverse source problems and its convergence analysis. Journal of Computational Physics, 487:112156, 2023.
  • [18] Z. Wang, W. Zhang, and Z. Zhang. Stochastic convergence of regularized solutions for backward heat conduction problems. arXiv:2311.03623, 2023.