\newsiamremark

remarkRemark \newsiamremarkhypothesisHypothesis \newsiamthmclaimClaim \headersImplicitly Restarted LanczosP.S. Negi and C. Arratia \externaldocument[][nocite]ex_supplement

Structure preserving restarts of the non-symmetric Lanczos algorithm via the implicitly shifted LR algorithmthanks: Submitted to the editors DATE. \fundingThis work was funded by the Nordita the Swedish Research Council Grant No. 2018-04290. Nordita is partially supported by Nordforsk.

P. S. Negi Nordita, Stockholm University, KTH Royal Institute of Technology, Stockholm, Sweden (). [email protected]    C. Arratia Nordita, Stockholm University, KTH Royal Institute of Technology, Stockholm, Sweden (). [email protected]
Abstract

The implicitly shifted QR iteration is used as a restart procedure for the Arnoldi method for the calculation of a few dominant eigenvalues of a large matrix. We show that the underlying idea of implicit polynomial filtering can be utilized in much the same manner via the implicitly shifted LR iteration to create a restart procedure for the non-symmetric Lanczos algorithm for eigenvalue computations, which preserves the tri-diagonal structure of the reduced matrix.

keywords:
LR algorithm, non-symmetric Lanczos, implicit restart
{MSCcodes}

68Q25, 68R10, 68U05

1 Introduction

The Arnoldi iteration [1] is a popular Krylov space method for calculating a few eigenvalues of a large matrix. The method relies on the generation of a sequence of Krylov vectors which determine the subspace within which approximations of the eigenvalue-eigenvector pairs are obtained. Depending on the accuracy and number of eigenpair approximations needed, the Krylov space size can become exceedingly large so that the quality of the results may be limited by the available memory. Sorensen [18] introduced an elegant procedure for restarting the Arnoldi factorization based on polynomial filters, which are applied through the implicitly shifted QR iterations on the reduced Hessenberg matrix obtained through the Arnoldi method. In particular, the use of exact shifts was shown to be successful in the convergence process of the eigenspace[18] of the specified eigenvalues. The method has subsequently found widespread application through the ARPACK library [11]. An alternative, more modern method for eigenvalue computations is the Krylov-Schur method introduced by Stewart [19], which can be found implemented in the SLEPc library [9]. Krylov-Schur restarts are known to be less sensitive than the restarting based on the implicitly shifted QR method. However, they do not preserve the Hessenberg structure of the reduced matrix which may sometimes be required. The use of QR iterations ensures that the reduced matrix preserves its Hessenberg structure through the transforms that make up the restart process. If the underlying matrix is symmetric, the Arnoldi iteration reduces to the Lanczos algorithm, and the Hessenberg matrix reduces to a symmetric tridiagonal matrix. The QR iteration preserves the symmetric tridiagonal structure as well and, as pointed out by Sorensen [18], the implicit restart process applies equally well for the Lanczos method for symmetric matrices.

The Lanczos algorithm introduced in [10] is in fact the predecessor of the Arnoldi method and can be used for non-symmetric matrices as well. In such cases it is referred to as the non-symmetric or the bi-Lanczos. It has been used in investigationss for plasma instabilities [13], linear systems solvers [8], model reduction [12], and more recently for non-linear eigenvalue problems [5]. For the non-symmetric Lanczos one builds two Krylov subspaces referred to as the right and the left Krylov subspaces. The idea behind the two methods is similar, which is to obtain a projection of the original large matrix on to an appropriate reduced subspace such that, the eigenvalues may be approximated via the eigenvalues of the reduced operator. The difference being that in the Arnoldi method one obtains an orthogonal projection on to a subspace while, in the Lanczos method one obtains an oblique projection. One would therefore like to extend the idea of implicit restarts to the non-symmetric Lanczos algorithm as well. However, the reduced matrix that one obtains in such a case is a non-symmetric tridiagonal matrix, with the tridiagonal structure being the result of the recurrence relations of the Lanczos algorithm [17]. Since the QR iterations do not preserve the banded structure of non-symmetric matrices, a straightforward application of the restart procedure put forward by Sorensen will lead to a loss of this tridiagonal structure (the Hessenberg structure will still be preserved). This loss of structure can be circumvented if one looks to the predecessor of the QR algorithm namely, the LR algorithm proposed by Rutishauser [14, 16, 15], which has the attractive property of preserving the band structure of a matrix. This property was already pointed out by Rutishauser in [14] where the banded matrices were referred to as striped matrices. As we will show in the next section, the shifted LR transform is the appropriate analogue of the restart procedure in the case of non-symmetric Lanczos iteration. The process would necessarily require refining both the right as well as the left Krylov spaces simultaneously.

The rest of the paper is organized as follows. In section 2 we start with the introduction of the non-symmetric Lanczos iteration and then develop the restart procedure. In section 3 we apply the restart process to the Grcar matrix, and make some concluding remarks in section 4.

2 Non-symmetric Lanczos

Lanczos first introduced his algorithm in [10] as a method for tridiagonalizing a matrix, but also realized that the method could be used iteratively to find eigenvalues. For an arbitry matrix A𝐴Aitalic_A, the method generates a pair of Krylov subspaces {v1,,vm}subscript𝑣1subscript𝑣𝑚\{v_{1},\ldots,v_{m}\}{ italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } and {w1,,wm}subscript𝑤1subscript𝑤𝑚\{w_{1},\ldots,w_{m}\}{ italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT }, through repeated action of A𝐴Aitalic_A and AHsuperscript𝐴𝐻A^{H}italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT respectively. We refer to these as the right and left Krylov spaces respectively and they satisfy the biorthogonality relation wiHvj=δijsuperscriptsubscript𝑤𝑖𝐻subscript𝑣𝑗subscript𝛿𝑖𝑗w_{i}^{H}v_{j}=\delta_{ij}italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_δ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT. The two subspaces are generated through a three term recurrence relation which, for a Krylov space of size m𝑚mitalic_m, can be written in matrix form as

(1a) AVmVmTm𝐴subscript𝑉𝑚subscript𝑉𝑚subscript𝑇𝑚\displaystyle AV_{m}-V_{m}T_{m}italic_A italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT =\displaystyle== v~m+1emT,subscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇\displaystyle\widetilde{v}_{m+1}e_{m}^{T},over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ,
(1b) AHWmWmTmHsuperscript𝐴𝐻subscript𝑊𝑚subscript𝑊𝑚subscriptsuperscript𝑇𝐻𝑚\displaystyle A^{H}W_{m}-W_{m}T^{H}_{m}italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT =\displaystyle== w~m+1emT,subscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇\displaystyle\widetilde{w}_{m+1}e_{m}^{T},over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ,
(1c) WmHVmsuperscriptsubscript𝑊𝑚𝐻subscript𝑉𝑚\displaystyle W_{m}^{H}V_{m}italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT =\displaystyle== Im,subscript𝐼𝑚\displaystyle I_{m},italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ,

where, Imsubscript𝐼𝑚I_{m}italic_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT represents the Identity matrix of size m𝑚mitalic_m, Tmsubscript𝑇𝑚T_{m}italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is a tri-diagonal matrix of size m𝑚mitalic_m with TmHsuperscriptsubscript𝑇𝑚𝐻T_{m}^{H}italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT it’s Hermitian conjugate, and emsubscript𝑒𝑚e_{m}italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is the standard unit vector. v~m+1subscript~𝑣𝑚1\widetilde{v}_{m+1}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT and w~m+1subscript~𝑤𝑚1\widetilde{w}_{m+1}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT represent the residual vectors at the mthsuperscript𝑚𝑡m^{th}italic_m start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT step. If either v~m+1subscript~𝑣𝑚1\widetilde{v}_{m+1}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT or w~m+1subscript~𝑤𝑚1\widetilde{w}_{m+1}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT vanishes it represents the convergence of the right or the left Krylov subspaces to an invariant subspace of dimension m𝑚mitalic_m. A more serious breakdown occurs if w~m+1Hv~m+1=0superscriptsubscript~𝑤𝑚1𝐻subscript~𝑣𝑚10\widetilde{w}_{m+1}^{H}\widetilde{v}_{m+1}=0over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT = 0 with both v~m+10subscript~𝑣𝑚10\widetilde{v}_{m+1}\neq 0over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT ≠ 0 and w~m+10subscript~𝑤𝑚10\widetilde{w}_{m+1}\neq 0over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT ≠ 0, in which case a look-ahead strategy may be employed. We do not address the issues with breakdown here since it is not specifically related to the restart procedure. We refer the reader to [13] for the look-ahead Lanczos and to [8] for a comprehensive overview on Lanczos type solvers and the related issues of breakdown.

As Sorensen points out for the Arnoldi method [18], if one is interested in an invariant subspace of dimension m𝑚mitalic_m, the starting vector of the Krylov subspace must not contain components of the generator of a cyclic subspace of dimension greater than m𝑚mitalic_m. This applies equally for the right and left Krylov subspaces generated through the Lanczos recurrence relations. Hence a non-vanishing v~m+1subscript~𝑣𝑚1\widetilde{v}_{m+1}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT (respectively w~m+1subscript~𝑤𝑚1\widetilde{w}_{m+1}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT) implies that v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (respectively w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT) contains components of an invariant subspace of dimension greater than m𝑚mitalic_m. The idea behind restarts then is to discard the components of the starting vector v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (and w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT) along the unwanted dimensions, such that each restart process moves the Krylov space(s) closer to being invariant. For the Arnoldi method Sorensen [18] proposed to achieve this via polynomial filtering, i.e. replacing

(2a) v1ψ(A)v1,subscript𝑣1𝜓𝐴subscript𝑣1\displaystyle v_{1}\leftarrow\psi(A)v_{1},italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ← italic_ψ ( italic_A ) italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ,
(2b) ψ(λ)=(1/τ)Πj=1p(λμj).𝜓𝜆1𝜏superscriptsubscriptΠ𝑗1𝑝𝜆subscript𝜇𝑗\displaystyle\psi(\lambda)=(1/\tau)\Pi_{j=1}^{p}(\lambda-\mu_{j}).italic_ψ ( italic_λ ) = ( 1 / italic_τ ) roman_Π start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ( italic_λ - italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) .

Obviously ψ(λ)𝜓𝜆\psi(\lambda)italic_ψ ( italic_λ ) is the filtering polynomial, τ𝜏\tauitalic_τ is a normalization constant and each μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT specifies a node of the polynomial. The polynomial acts on v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT to filter out the part of the spectrum of A𝐴Aitalic_A that is close to each μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. If a particular μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT corresponds to an exact eigenvalue of A𝐴Aitalic_A, then components of the corresponding eigenvector are completely filtered out from v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (at least in exact arithmetic). The node μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is referred to as a shift since the application of the polynomial filtering relies on the shifted QR algorithm, where μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is used as the shift. As shown below for the case of a single shift, an analogous procedure can be followed using a shifted LR algorithm which achieves the same effect of applying a polynomial filter to the starting vector v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Starting with the Lanczos relation for the right subspace (1a), and adding and subtracting μVm𝜇subscript𝑉𝑚\mu V_{m}italic_μ italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT we obtain

(3a) (AμI)VmVm(TmμI)𝐴𝜇𝐼subscript𝑉𝑚subscript𝑉𝑚subscript𝑇𝑚𝜇𝐼\displaystyle(A-\mu I)V_{m}-V_{m}(T_{m}-\mu I)( italic_A - italic_μ italic_I ) italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_μ italic_I ) =\displaystyle== v~m+1emTsubscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇\displaystyle\widetilde{v}_{m+1}e_{m}^{T}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT
(3b) (AμI)VmVm(L1R1)𝐴𝜇𝐼subscript𝑉𝑚subscript𝑉𝑚subscript𝐿1subscript𝑅1\displaystyle(A-\mu I)V_{m}-V_{m}(L_{1}R_{1})( italic_A - italic_μ italic_I ) italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) =\displaystyle== v~m+1emTsubscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇\displaystyle\widetilde{v}_{m+1}e_{m}^{T}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT
(3c) (AμI)VmL1Vm(L1R1)L1𝐴𝜇𝐼subscript𝑉𝑚subscript𝐿1subscript𝑉𝑚subscript𝐿1subscript𝑅1subscript𝐿1\displaystyle(A-\mu I)V_{m}L_{1}-V_{m}(L_{1}R_{1})L_{1}( italic_A - italic_μ italic_I ) italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =\displaystyle== v~m+1emTL1subscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇subscript𝐿1\displaystyle\widetilde{v}_{m+1}e_{m}^{T}L_{1}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT
(3d) A(VmL1)(VmL1)(R1L1+μI)𝐴subscript𝑉𝑚subscript𝐿1subscript𝑉𝑚subscript𝐿1subscript𝑅1subscript𝐿1𝜇𝐼\displaystyle A(V_{m}L_{1})-(V_{m}L_{1})(R_{1}L_{1}+\mu I)italic_A ( italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - ( italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ( italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_μ italic_I ) =\displaystyle== v~m+1emTL1subscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇subscript𝐿1\displaystyle\widetilde{v}_{m+1}e_{m}^{T}L_{1}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT
(3e) AVmVmTm𝐴subscriptsuperscript𝑉𝑚subscriptsuperscript𝑉𝑚subscriptsuperscript𝑇𝑚\displaystyle AV^{\prime}_{m}-V^{\prime}_{m}T^{\prime}_{m}italic_A italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT =\displaystyle== v~m+1emTL1.subscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇subscript𝐿1\displaystyle\widetilde{v}_{m+1}e_{m}^{T}L_{1}.over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT .

Here we have set Vm=VmL1subscriptsuperscript𝑉𝑚subscript𝑉𝑚subscript𝐿1V^{\prime}_{m}=V_{m}L_{1}italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and Tm=(R1L1+μI)subscriptsuperscript𝑇𝑚subscript𝑅1subscript𝐿1𝜇𝐼T^{\prime}_{m}=(R_{1}L_{1}+\mu I)italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_μ italic_I ). The matrices L1,R1subscript𝐿1subscript𝑅1L_{1},R_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT are the lower and upper triangular matrices obtained from the LU decomposition of (TmμI)subscript𝑇𝑚𝜇𝐼(T_{m}-\mu I)( italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_μ italic_I ). The matrix L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT can be required to be unit triangular (all entries on the main diagonal are ones), in which case the LU decomposition is unique. Furthermore, L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT for a tridiagonal matrix only consists of one sub-diagonal (in addition to the main diagonal). One can easily recognize that the new reduced matrix Tmsubscriptsuperscript𝑇𝑚T^{\prime}_{m}italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is a result of one step of the shifted LR iteration. Hence Tmsubscriptsuperscript𝑇𝑚T^{\prime}_{m}italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT retains the tridiagonal structure of the Tmsubscript𝑇𝑚T_{m}italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT [14, 15]. The relationship between starting vectors of the two spaces Vmsubscript𝑉𝑚V_{m}italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT and Vmsubscriptsuperscript𝑉𝑚V^{\prime}_{m}italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT can be obtained by multiplying (3b) by e1subscript𝑒1e_{1}italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, i.e.

(AμI)Vme1(Vm)R1e1𝐴𝜇𝐼subscript𝑉𝑚subscript𝑒1subscriptsuperscript𝑉𝑚subscript𝑅1subscript𝑒1\displaystyle(A-\mu I)V_{m}e_{1}-(V^{\prime}_{m})R_{1}e_{1}( italic_A - italic_μ italic_I ) italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - ( italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =\displaystyle== v~m+1emTe1subscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇subscript𝑒1\displaystyle\widetilde{v}_{m+1}e_{m}^{T}e_{1}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT
(AμI)v1absent𝐴𝜇𝐼subscript𝑣1\displaystyle\implies(A-\mu I)v_{1}⟹ ( italic_A - italic_μ italic_I ) italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =\displaystyle== v1ρ11,subscriptsuperscript𝑣1subscript𝜌11\displaystyle v^{\prime}_{1}\rho_{11},italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ,

where ρ11=e1TR1e1subscript𝜌11subscriptsuperscript𝑒𝑇1subscript𝑅1subscript𝑒1\rho_{11}=e^{T}_{1}R_{1}e_{1}italic_ρ start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. This clearly shows the filtering operation done on the original vector v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT to generate the new vector v1subscriptsuperscript𝑣1v^{\prime}_{1}italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

Since the Lanczos method creates a biorthogonal basis, one must simultaneously transform the left basis Wmsubscript𝑊𝑚W_{m}italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT to maintain the biorthogonality property. It is easy to see that the necessary transform to maintain biorthogonality is Wm=WmL1Hsubscriptsuperscript𝑊𝑚subscript𝑊𝑚superscriptsubscript𝐿1𝐻W^{\prime}_{m}=W_{m}L_{1}^{-H}italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT, since,

WHVm=(WmL1H)H(VmL1)=L11(WmHVm)L1=Isuperscript𝑊𝐻subscriptsuperscript𝑉𝑚superscriptsubscript𝑊𝑚superscriptsubscript𝐿1𝐻𝐻subscript𝑉𝑚subscript𝐿1superscriptsubscript𝐿11subscriptsuperscript𝑊𝐻𝑚subscript𝑉𝑚subscript𝐿1𝐼\displaystyle W^{\prime H}V^{\prime}_{m}=(W_{m}L_{1}^{-H})^{H}(V_{m}L_{1})=L_{% 1}^{-1}(W^{H}_{m}V_{m})L_{1}=Iitalic_W start_POSTSUPERSCRIPT ′ italic_H end_POSTSUPERSCRIPT italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_I

Substituting the relation obtained from the LU decomposition TmH=(R1HL1H+μ¯I)superscriptsubscript𝑇𝑚𝐻superscriptsubscript𝑅1𝐻superscriptsubscript𝐿1𝐻¯𝜇𝐼T_{m}^{H}=(R_{1}^{H}L_{1}^{H}+\bar{\mu}I)italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT = ( italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + over¯ start_ARG italic_μ end_ARG italic_I ) in to equation (1b) we can obtain the modified Lanczos relation for the left Krylov space as

(4a) AHWmWm(R1HL1H+μ¯I)superscript𝐴𝐻subscript𝑊𝑚subscript𝑊𝑚subscriptsuperscript𝑅𝐻1subscriptsuperscript𝐿𝐻1¯𝜇𝐼\displaystyle A^{H}W_{m}-W_{m}(R^{H}_{1}L^{H}_{1}+\bar{\mu}I)italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_R start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + over¯ start_ARG italic_μ end_ARG italic_I ) =\displaystyle== w~m+1emTsubscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇\displaystyle\widetilde{w}_{m+1}e_{m}^{T}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT
(4b) AH(WmL1H)Wm(L1HL1H)(R1HL1H+μ¯I)L1Hsuperscript𝐴𝐻subscript𝑊𝑚superscriptsubscript𝐿1𝐻subscript𝑊𝑚superscriptsubscript𝐿1𝐻superscriptsubscript𝐿1𝐻subscriptsuperscript𝑅𝐻1subscriptsuperscript𝐿𝐻1¯𝜇𝐼superscriptsubscript𝐿1𝐻\displaystyle A^{H}(W_{m}L_{1}^{-H})-W_{m}(L_{1}^{-H}L_{1}^{H})(R^{H}_{1}L^{H}% _{1}+\bar{\mu}I)L_{1}^{-H}italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) - italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) ( italic_R start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + over¯ start_ARG italic_μ end_ARG italic_I ) italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT =\displaystyle== w~m+1emTL1Hsubscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT
(4c) AH(WmL1H)(WmL1H)(L1HR1H+μ¯I)superscript𝐴𝐻subscript𝑊𝑚superscriptsubscript𝐿1𝐻subscript𝑊𝑚superscriptsubscript𝐿1𝐻subscriptsuperscript𝐿𝐻1subscriptsuperscript𝑅𝐻1¯𝜇𝐼\displaystyle A^{H}(W_{m}L_{1}^{-H})-(W_{m}L_{1}^{-H})(L^{H}_{1}R^{H}_{1}+\bar% {\mu}I)italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) - ( italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) ( italic_L start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_R start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + over¯ start_ARG italic_μ end_ARG italic_I ) =\displaystyle== w~m+1emTL1Hsubscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT
(4d) AHWmWm(Tm)Hsuperscript𝐴𝐻subscriptsuperscript𝑊𝑚subscriptsuperscript𝑊𝑚superscriptsubscriptsuperscript𝑇𝑚𝐻\displaystyle A^{H}W^{\prime}_{m}-W^{\prime}_{m}(T^{\prime}_{m})^{H}italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT =\displaystyle== w~m+1emTL1H.subscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H}.over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT .

Conveniently the structure of the Lanczos iteration for the left Krylov space is also preserved. Noting that L1Hsuperscriptsubscript𝐿1𝐻L_{1}^{-H}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT is upper triangular, one can again expose the relationship between the generating vectors of the two left Krylov spaces as

We1superscript𝑊subscript𝑒1\displaystyle W^{\prime}e_{1}italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =\displaystyle== (WmL1H)e1subscript𝑊𝑚superscriptsubscript𝐿1𝐻subscript𝑒1\displaystyle(W_{m}L_{1}^{-H})e_{1}( italic_W start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT
w1absentsubscriptsuperscript𝑤1\displaystyle\implies w^{\prime}_{1}⟹ italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =\displaystyle== w1(e1TL1He1).subscript𝑤1superscriptsubscript𝑒1𝑇superscriptsubscript𝐿1𝐻subscript𝑒1\displaystyle w_{1}(e_{1}^{T}L_{1}^{-H}e_{1}).italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) .

Clearly w1subscriptsuperscript𝑤1w^{\prime}_{1}italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is simply a scalar multiple of the old vector w1subscript𝑤1w_{1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and no filtering of the generating vector has occurred. In order to ensure that we filter the left Krylov space as well, we perform one step of the shifted LR iteration with the conjugated shift μ¯¯𝜇\bar{\mu}over¯ start_ARG italic_μ end_ARG on the reduced matrix (Tm)Hsuperscriptsubscriptsuperscript𝑇𝑚𝐻(T^{\prime}_{m})^{H}( italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT obtained in equation (4d).

(5a) (AμI)HWmWm(TmμI)Hsuperscript𝐴𝜇𝐼𝐻subscriptsuperscript𝑊𝑚subscriptsuperscript𝑊𝑚superscriptsubscriptsuperscript𝑇𝑚𝜇𝐼𝐻\displaystyle(A-\mu I)^{H}W^{\prime}_{m}-W^{\prime}_{m}(T^{\prime}_{m}-\mu I)^% {H}( italic_A - italic_μ italic_I ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_μ italic_I ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT =\displaystyle== w~m+1emTL1H,subscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H},over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ,
(5b) (AμI)HWmWm(L2R2)superscript𝐴𝜇𝐼𝐻subscriptsuperscript𝑊𝑚subscriptsuperscript𝑊𝑚subscript𝐿2subscript𝑅2\displaystyle(A-\mu I)^{H}W^{\prime}_{m}-W^{\prime}_{m}(L_{2}R_{2})( italic_A - italic_μ italic_I ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) =\displaystyle== w~m+1emTL1H,subscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H},over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ,
(5c) (AμI)HWmL2Wm(L2R2)L2superscript𝐴𝜇𝐼𝐻subscriptsuperscript𝑊𝑚subscript𝐿2subscriptsuperscript𝑊𝑚subscript𝐿2subscript𝑅2subscript𝐿2\displaystyle(A-\mu I)^{H}W^{\prime}_{m}L_{2}-W^{\prime}_{m}(L_{2}R_{2})L_{2}( italic_A - italic_μ italic_I ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =\displaystyle== w~m+1emTL1HL2,subscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻subscript𝐿2\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H}L_{2},over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ,
(5d) A(WmL2)(WmL2)(R2L2+μ¯I)𝐴subscriptsuperscript𝑊𝑚subscript𝐿2subscriptsuperscript𝑊𝑚subscript𝐿2subscript𝑅2subscript𝐿2¯𝜇𝐼\displaystyle A(W^{\prime}_{m}L_{2})-(W^{\prime}_{m}L_{2})(R_{2}L_{2}+\bar{\mu% }I)italic_A ( italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - ( italic_W start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ( italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + over¯ start_ARG italic_μ end_ARG italic_I ) =\displaystyle== w~m+1emTL1HL2,subscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻subscript𝐿2\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H}L_{2},over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ,
(5e) AWm′′Wm′′(Tm′′)H𝐴subscriptsuperscript𝑊′′𝑚subscriptsuperscript𝑊′′𝑚superscriptsubscriptsuperscript𝑇′′𝑚𝐻\displaystyle AW^{\prime\prime}_{m}-W^{\prime\prime}_{m}(T^{\prime\prime}_{m})% ^{H}italic_A italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT =\displaystyle== w~m+1emTL1HL2.subscript~𝑤𝑚1superscriptsubscript𝑒𝑚𝑇superscriptsubscript𝐿1𝐻subscript𝐿2\displaystyle\widetilde{w}_{m+1}e_{m}^{T}L_{1}^{-H}L_{2}.over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT .

One may again obtain the relation between the starting vectors of the two spaces as

(6) (AHμ¯I)w1=w1′′(e1TR2e1),superscript𝐴𝐻¯𝜇𝐼subscriptsuperscript𝑤1subscriptsuperscript𝑤′′1superscriptsubscript𝑒1𝑇subscript𝑅2subscript𝑒1\displaystyle(A^{H}-\bar{\mu}I)w^{\prime}_{1}=w^{\prime\prime}_{1}(e_{1}^{T}R_% {2}e_{1}),( italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT - over¯ start_ARG italic_μ end_ARG italic_I ) italic_w start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_w start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ,

which clearly shows the filtering operation performed on the starting vector of the left Krylov space. Obviously the appropriate transform for the right subspace to maintain orthogonality is Vm′′=VmL2H=VmL1L2Hsubscriptsuperscript𝑉′′𝑚subscriptsuperscript𝑉𝑚superscriptsubscript𝐿2𝐻subscript𝑉𝑚subscript𝐿1superscriptsubscript𝐿2𝐻V^{\prime\prime}_{m}=V^{\prime}_{m}L_{2}^{-H}=V_{m}L_{1}L_{2}^{-H}italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_V start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT = italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT. Again, note that the upper triangular L2Hsuperscriptsubscript𝐿2𝐻L_{2}^{-H}italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT implies that the new v1′′subscriptsuperscript𝑣′′1v^{\prime\prime}_{1}italic_v start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is simply the scalar multiple of v1subscriptsuperscript𝑣1v^{\prime}_{1}italic_v start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and no additional filtering occurs for v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in this step. One can write the corresponding modified Lanczos relation as

(7) AVm′′Vm′′Tm′′𝐴subscriptsuperscript𝑉′′𝑚subscriptsuperscript𝑉′′𝑚subscriptsuperscript𝑇′′𝑚\displaystyle AV^{\prime\prime}_{m}-V^{\prime\prime}_{m}T^{\prime\prime}_{m}italic_A italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT =\displaystyle== v~m+1emTL1L2H.subscript~𝑣𝑚1superscriptsubscript𝑒𝑚𝑇subscript𝐿1superscriptsubscript𝐿2𝐻\displaystyle\widetilde{v}_{m+1}e_{m}^{T}L_{1}L_{2}^{-H}.over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT .

The above process can be repeated for p𝑝pitalic_p unwanted shifts. We denote by L1p=L11L12L1psuperscriptsubscript𝐿1𝑝subscript𝐿11subscript𝐿12subscript𝐿1𝑝L_{1}^{p}=L_{11}L_{12}\ldots L_{1p}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT = italic_L start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT … italic_L start_POSTSUBSCRIPT 1 italic_p end_POSTSUBSCRIPT as the product of the lower triangular matrices generated due to p𝑝pitalic_p shifted-LR steps for the right Krylov space Vmsubscript𝑉𝑚V_{m}italic_V start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, and by L2p=L21L22L2psubscriptsuperscript𝐿𝑝2subscript𝐿21subscript𝐿22subscript𝐿2𝑝L^{p}_{2}=L_{21}L_{22}\ldots L_{2p}italic_L start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_L start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT … italic_L start_POSTSUBSCRIPT 2 italic_p end_POSTSUBSCRIPT as the product of the lower triangular matrices due to the p𝑝pitalic_p shifted-LR iterations for the left Krylov space. Then for a Krylov space size of m=k+p𝑚𝑘𝑝m=k+pitalic_m = italic_k + italic_p we have two modified Lanczos relations

(8a) AVk+p′′Vk+p′′Tk+p′′𝐴subscriptsuperscript𝑉′′𝑘𝑝subscriptsuperscript𝑉′′𝑘𝑝subscriptsuperscript𝑇′′𝑘𝑝\displaystyle AV^{\prime\prime}_{k+p}-V^{\prime\prime}_{k+p}T^{\prime\prime}_{% k+p}italic_A italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT - italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT =\displaystyle== v~k+p+1ek+pTL1p(L2p)H,subscript~𝑣𝑘𝑝1superscriptsubscript𝑒𝑘𝑝𝑇superscriptsubscript𝐿1𝑝superscriptsuperscriptsubscript𝐿2𝑝𝐻\displaystyle\widetilde{v}_{k+p+1}e_{k+p}^{T}L_{1}^{p}(L_{2}^{p})^{-H},over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ,
(8b) AHWk+p′′Wk+p′′(Tk+p′′)Hsuperscript𝐴𝐻subscriptsuperscript𝑊′′𝑘𝑝subscriptsuperscript𝑊′′𝑘𝑝superscriptsubscriptsuperscript𝑇′′𝑘𝑝𝐻\displaystyle A^{H}W^{\prime\prime}_{k+p}-W^{\prime\prime}_{k+p}(T^{\prime% \prime}_{k+p})^{H}italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT =\displaystyle== w~k+p+1ek+pT(L1p)HL2p.subscript~𝑤𝑘𝑝1superscriptsubscript𝑒𝑘𝑝𝑇superscriptsuperscriptsubscript𝐿1𝑝𝐻superscriptsubscript𝐿2𝑝\displaystyle\widetilde{w}_{k+p+1}e_{k+p}^{T}(L_{1}^{p})^{-H}L_{2}^{p}.over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT .

We may take a closer look at the structure of the residual matrices on the right hand side of equation (8a). L1psuperscriptsubscript𝐿1𝑝L_{1}^{p}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT is a product of p𝑝pitalic_p matrices that are lower triangular with just one subdiagonal. L1psuperscriptsubscript𝐿1𝑝L_{1}^{p}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT then is lower triangular with p𝑝pitalic_p non-zero subdiagonals. (L2p)Hsuperscriptsuperscriptsubscript𝐿2𝑝𝐻(L_{2}^{p})^{-H}( italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT is upper triangular and the product L1p(L2p)Hsuperscriptsubscript𝐿1𝑝superscriptsuperscriptsubscript𝐿2𝑝𝐻L_{1}^{p}(L_{2}^{p})^{-H}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT therefore has p𝑝pitalic_p non-zero subdiagonals. Left multiplication by ek+pTsuperscriptsubscript𝑒𝑘𝑝𝑇e_{k+p}^{T}italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT therefore has the form

ek+pTL1p(L2p)H=(0,0,θk+pk,b1Tp)superscriptsubscript𝑒𝑘𝑝𝑇superscriptsubscript𝐿1𝑝superscriptsuperscriptsubscript𝐿2𝑝𝐻subscript00subscript𝜃𝑘𝑝𝑘subscriptsuperscriptsubscript𝑏1𝑇𝑝\displaystyle e_{k+p}^{T}L_{1}^{p}(L_{2}^{p})^{-H}=(\underbrace{0,0\ldots,% \theta_{k+p}}_{k},\underbrace{b_{1}^{T}}_{p})italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT = ( under⏟ start_ARG 0 , 0 … , italic_θ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , under⏟ start_ARG italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT )

where, θk+p=ek+pT(L1p(L2p)H)eksubscript𝜃𝑘𝑝superscriptsubscript𝑒𝑘𝑝𝑇superscriptsubscript𝐿1𝑝superscriptsuperscriptsubscript𝐿2𝑝𝐻subscript𝑒𝑘\theta_{k+p}=e_{k+p}^{T}(L_{1}^{p}(L_{2}^{p})^{-H})e_{k}italic_θ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Therefore the matrix on the right hand side of (8a) has zeros in the first k1𝑘1k-1italic_k - 1 columns and the kthsuperscript𝑘𝑡k^{th}italic_k start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT column is simply θk+pv~k+p+1subscript𝜃𝑘𝑝subscript~𝑣𝑘𝑝1\theta_{k+p}\widetilde{v}_{k+p+1}italic_θ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT. The remaining columns are non-zero in general. A very similar structure is obtained for the residual matrix in the right hand side of equation (8b) with zeros in the first k1𝑘1k-1italic_k - 1 columns and the kthsuperscript𝑘𝑡k^{th}italic_k start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT column being equal to ϕk+pw~k+p+1subscriptitalic-ϕ𝑘𝑝subscript~𝑤𝑘𝑝1\phi_{k+p}\widetilde{w}_{k+p+1}italic_ϕ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT, with ϕk+p=ek+pT((L1p)HL2p)eksubscriptitalic-ϕ𝑘𝑝superscriptsubscript𝑒𝑘𝑝𝑇superscriptsuperscriptsubscript𝐿1𝑝𝐻superscriptsubscript𝐿2𝑝subscript𝑒𝑘\phi_{k+p}=e_{k+p}^{T}((L_{1}^{p})^{-H}L_{2}^{p})e_{k}italic_ϕ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT

Partitioning the matrices such that

Vk+p′′=(Vk′′,Vp′′),subscriptsuperscript𝑉′′𝑘𝑝subscriptsuperscript𝑉′′𝑘subscriptsuperscript𝑉′′𝑝\displaystyle V^{\prime\prime}_{k+p}=(V^{\prime\prime}_{k},V^{\prime\prime}_{p% }),italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = ( italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) , Tk+p′′=(Tk′′δk+1eke1Tβk+1e1ekTTp′′),subscriptsuperscript𝑇′′𝑘𝑝matrixsubscriptsuperscript𝑇′′𝑘subscript𝛿𝑘1subscript𝑒𝑘superscriptsubscript𝑒1𝑇subscript𝛽𝑘1subscript𝑒1superscriptsubscript𝑒𝑘𝑇subscriptsuperscript𝑇′′𝑝\displaystyle\hskip 10.0ptT^{\prime\prime}_{k+p}=\begin{pmatrix}T^{\prime% \prime}_{k}&\delta_{k+1}e_{k}e_{1}^{T}\\ \beta_{k+1}e_{1}e_{k}^{T}&T^{\prime\prime}_{p}\end{pmatrix},italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = ( start_ARG start_ROW start_CELL italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL start_CELL italic_δ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_β start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL start_CELL italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) ,
Wk+p′′=(Wk′′,Wp′′),subscriptsuperscript𝑊′′𝑘𝑝subscriptsuperscript𝑊′′𝑘subscriptsuperscript𝑊′′𝑝\displaystyle W^{\prime\prime}_{k+p}=(W^{\prime\prime}_{k},W^{\prime\prime}_{p% }),italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = ( italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) , (Tk+p′′)H=((Tk′′)Hβ¯k+1eke1Tδ¯k+1e1ekT(Tp′′)H),superscriptsubscriptsuperscript𝑇′′𝑘𝑝𝐻matrixsuperscriptsubscriptsuperscript𝑇′′𝑘𝐻subscript¯𝛽𝑘1subscript𝑒𝑘superscriptsubscript𝑒1𝑇subscript¯𝛿𝑘1subscript𝑒1superscriptsubscript𝑒𝑘𝑇superscriptsubscriptsuperscript𝑇′′𝑝𝐻\displaystyle\hskip 10.0pt(T^{\prime\prime}_{k+p})^{H}=\begin{pmatrix}(T^{% \prime\prime}_{k})^{H}&\bar{\beta}_{k+1}e_{k}e_{1}^{T}\\ \bar{\delta}_{k+1}e_{1}e_{k}^{T}&(T^{\prime\prime}_{p})^{H}\end{pmatrix},( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT = ( start_ARG start_ROW start_CELL ( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_δ end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL start_CELL ( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ) ,

with the length of the eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT vectors understood to be such that the resulting matrices are consistent. We can write the modified Lanczos relations of (8a) and (8b) as

A(Vk′′,Vp′′)=(Vk′′,Vp′′)(Tk′′δk+1eke1Tβk+1e1ekTTp′′)+(θk+pv~k+p+1ekT,Mv),𝐴subscriptsuperscript𝑉′′𝑘subscriptsuperscript𝑉′′𝑝subscriptsuperscript𝑉′′𝑘subscriptsuperscript𝑉′′𝑝matrixsubscriptsuperscript𝑇′′𝑘subscript𝛿𝑘1subscript𝑒𝑘superscriptsubscript𝑒1𝑇subscript𝛽𝑘1subscript𝑒1superscriptsubscript𝑒𝑘𝑇subscriptsuperscript𝑇′′𝑝matrixsubscript𝜃𝑘𝑝subscript~𝑣𝑘𝑝1superscriptsubscript𝑒𝑘𝑇subscript𝑀𝑣\displaystyle A(V^{\prime\prime}_{k},V^{\prime\prime}_{p})=(V^{\prime\prime}_{% k},V^{\prime\prime}_{p})\begin{pmatrix}T^{\prime\prime}_{k}&\delta_{k+1}e_{k}e% _{1}^{T}\\ \beta_{k+1}e_{1}e_{k}^{T}&T^{\prime\prime}_{p}\end{pmatrix}+\begin{pmatrix}% \theta_{k+p}\widetilde{v}_{k+p+1}e_{k}^{T},M_{v}\end{pmatrix},italic_A ( italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) = ( italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ( start_ARG start_ROW start_CELL italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL start_CELL italic_δ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_β start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL start_CELL italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) + ( start_ARG start_ROW start_CELL italic_θ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , italic_M start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) ,
AH(Wk′′,Wp′′)=(Wk′′,Wp′′)((Tk′′)Hβ¯k+1eke1Tδ¯k+1e1ekT(Tp′′)H)+(ϕk+pw~k+p+1ekT,Mw),superscript𝐴𝐻subscriptsuperscript𝑊′′𝑘subscriptsuperscript𝑊′′𝑝subscriptsuperscript𝑊′′𝑘subscriptsuperscript𝑊′′𝑝matrixsuperscriptsubscriptsuperscript𝑇′′𝑘𝐻subscript¯𝛽𝑘1subscript𝑒𝑘superscriptsubscript𝑒1𝑇subscript¯𝛿𝑘1subscript𝑒1superscriptsubscript𝑒𝑘𝑇superscriptsubscriptsuperscript𝑇′′𝑝𝐻matrixsubscriptitalic-ϕ𝑘𝑝subscript~𝑤𝑘𝑝1superscriptsubscript𝑒𝑘𝑇subscript𝑀𝑤\displaystyle A^{H}(W^{\prime\prime}_{k},W^{\prime\prime}_{p})=(W^{\prime% \prime}_{k},W^{\prime\prime}_{p})\begin{pmatrix}(T^{\prime\prime}_{k})^{H}&% \bar{\beta}_{k+1}e_{k}e_{1}^{T}\\ \bar{\delta}_{k+1}e_{1}e_{k}^{T}&(T^{\prime\prime}_{p})^{H}\end{pmatrix}+% \begin{pmatrix}\phi_{k+p}\widetilde{w}_{k+p+1}e_{k}^{T},M_{w}\end{pmatrix},italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) = ( italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ( start_ARG start_ROW start_CELL ( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_δ end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT end_CELL start_CELL ( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ) + ( start_ARG start_ROW start_CELL italic_ϕ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , italic_M start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) ,

Finally, equating the individual columns on both sides and discarding columns k+1,,k+p𝑘1𝑘𝑝k+1,\ldots,k+pitalic_k + 1 , … , italic_k + italic_p we are left with the new Krylov spaces of order k𝑘kitalic_k and the Lanczos relations

(10a) AVk′′Vk′′Tk′′𝐴subscriptsuperscript𝑉′′𝑘subscriptsuperscript𝑉′′𝑘subscriptsuperscript𝑇′′𝑘\displaystyle AV^{\prime\prime}_{k}-V^{\prime\prime}_{k}T^{\prime\prime}_{k}italic_A italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =\displaystyle== v~k+1′′ekT,subscriptsuperscript~𝑣′′𝑘1superscriptsubscript𝑒𝑘𝑇\displaystyle\widetilde{v}^{\prime\prime}_{k+1}e_{k}^{T},over~ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ,
(10b) AHWk′′Wk′′(Tk′′)Hsuperscript𝐴𝐻subscriptsuperscript𝑊′′𝑘subscriptsuperscript𝑊′′𝑘superscriptsubscriptsuperscript𝑇′′𝑘𝐻\displaystyle A^{H}W^{\prime\prime}_{k}-W^{\prime\prime}_{k}(T^{\prime\prime}_% {k})^{H}italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_T start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT =\displaystyle== w~k+1′′ekT,subscriptsuperscript~𝑤′′𝑘1superscriptsubscript𝑒𝑘𝑇\displaystyle\widetilde{w}^{\prime\prime}_{k+1}e_{k}^{T},over~ start_ARG italic_w end_ARG start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ,
(10c) (Wk′′)HVksuperscriptsubscriptsuperscript𝑊′′𝑘𝐻subscript𝑉𝑘\displaystyle(W^{\prime\prime}_{k})^{H}V_{k}( italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =\displaystyle== I.𝐼\displaystyle I.italic_I .

The new residual vectors are defined as

(11a) v~k+1′′subscriptsuperscript~𝑣′′𝑘1\displaystyle\widetilde{v}^{\prime\prime}_{k+1}over~ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT =\displaystyle== βk+1Vp′′e1+θk+pv~k+p+1,subscript𝛽𝑘1subscriptsuperscript𝑉′′𝑝subscript𝑒1subscript𝜃𝑘𝑝subscript~𝑣𝑘𝑝1\displaystyle\beta_{k+1}V^{\prime\prime}_{p}e_{1}+\theta_{k+p}\widetilde{v}_{k% +p+1},italic_β start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_V start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT ,
(11b) w~k+1′′subscriptsuperscript~𝑤′′𝑘1\displaystyle\widetilde{w}^{\prime\prime}_{k+1}over~ start_ARG italic_w end_ARG start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT =\displaystyle== δ¯k+1Wp′′e1+ϕk+pw~k+p+1,subscript¯𝛿𝑘1subscriptsuperscript𝑊′′𝑝subscript𝑒1subscriptitalic-ϕ𝑘𝑝subscript~𝑤𝑘𝑝1\displaystyle\bar{\delta}_{k+1}W^{\prime\prime}_{p}e_{1}+\phi_{k+p}\widetilde{% w}_{k+p+1},over¯ start_ARG italic_δ end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_W start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_ϕ start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT ,

which may be normalized appropriately such that the inner product of the new Krylov vectors is unity. The Lanczos process may now be carried out again to generate the next p𝑝pitalic_p vectors of the right and left Krylov spaces and the cycle may be repeated till an adequately converged subspace has been obtained. In the case that exact shifts (eigenvalues of Tmsubscript𝑇𝑚T_{m}italic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT) are used for the restart procedure βk+1subscript𝛽𝑘1\beta_{k+1}italic_β start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT and δk+1subscript𝛿𝑘1\delta_{k+1}italic_δ start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT are both zero in exact arithmetic.

As a final note, we mention that Della-Dora [3] introduced a class of algorithms of the GR type of which, LR and QR are special cases. Watkins then introduced generic bulge-chasing algorithms for the entire GR class of methods[20]. The shifted LR algorithm used in the restart procedure outlined above can therefore be carried out in an implicit manner through the bulge-chase sequence of Watkins [20]. For real matrices, the operations can be confined in the real space by using the double-shift strategy introduced by Francis [4]. The entire restart process is put together in algorithm 1.

Algorithm 1 Restarted nonsymmetric Lanczos

Input: Vk+p,v~k+p+1,Wk+p,w~k+p+1,Tk+psubscript𝑉𝑘𝑝subscript~𝑣𝑘𝑝1subscript𝑊𝑘𝑝subscript~𝑤𝑘𝑝1subscript𝑇𝑘𝑝V_{k+p},\widetilde{v}_{k+p+1},W_{k+p},\widetilde{w}_{k+p+1},T_{k+p}italic_V start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT , over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT , italic_W start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT , over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT
Input: μ1,μ2,μpsubscript𝜇1subscript𝜇2subscript𝜇𝑝\mu_{1},\mu_{2},\ldots\mu_{p}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … italic_μ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT \triangleright Unwanted shifts
Output: Vk,v~k+1,Wk,w~k+1,Tksubscript𝑉𝑘subscript~𝑣𝑘1subscript𝑊𝑘subscript~𝑤𝑘1subscript𝑇𝑘V_{k},\widetilde{v}_{k+1},W_{k},\widetilde{w}_{k+1},T_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_W start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT

1:AVk+pVk+pTk+p=v~k+p+1ek+pT𝐴subscript𝑉𝑘𝑝subscript𝑉𝑘𝑝subscript𝑇𝑘𝑝subscript~𝑣𝑘𝑝1superscriptsubscript𝑒𝑘𝑝𝑇AV_{k+p}-V_{k+p}T_{k+p}=\widetilde{v}_{k+p+1}e_{k+p}^{T}italic_A italic_V start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT - italic_V start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT \triangleright Right Lanczos relation
2:AHWk+pWk+pTk+pH=w~k+p+1ek+pTsuperscript𝐴𝐻subscript𝑊𝑘𝑝subscript𝑊𝑘𝑝subscriptsuperscript𝑇𝐻𝑘𝑝subscript~𝑤𝑘𝑝1superscriptsubscript𝑒𝑘𝑝𝑇A^{H}W_{k+p}-W_{k+p}T^{H}_{k+p}=\widetilde{w}_{k+p+1}e_{k+p}^{T}italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT - italic_W start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_T start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT \triangleright Left Lanczos relation
3:Wk+pHVk+p=Isuperscriptsubscript𝑊𝑘𝑝𝐻subscript𝑉𝑘𝑝𝐼W_{k+p}^{H}V_{k+p}=Iitalic_W start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_V start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT = italic_I; Wk+pHv~k+p+1=0superscriptsubscript𝑊𝑘𝑝𝐻subscript~𝑣𝑘𝑝10W_{k+p}^{H}\widetilde{v}_{k+p+1}=0italic_W start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT = 0; Vk+pHw~k+p+1=0superscriptsubscript𝑉𝑘𝑝𝐻subscript~𝑤𝑘𝑝10V_{k+p}^{H}\widetilde{w}_{k+p+1}=0italic_V start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT = 0; \triangleright Bi-orthogonality
4:procedure Refine right subspace
5:     LrIk+psubscript𝐿𝑟subscript𝐼𝑘𝑝L_{r}\leftarrow I_{k+p}italic_L start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ← italic_I start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT
6:     for j1𝑗1j\leftarrow 1italic_j ← 1 to p𝑝pitalic_p do
7:         Tk+pL1Tk+pLsubscript𝑇𝑘𝑝superscript𝐿1subscript𝑇𝑘𝑝𝐿T_{k+p}\leftarrow L^{-1}T_{k+p}Litalic_T start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ← italic_L start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_L \triangleright Implicitly shifted LR with shift μjsubscript𝜇𝑗\mu_{j}italic_μ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT
8:         LrLLrsubscript𝐿𝑟𝐿subscript𝐿𝑟L_{r}\leftarrow LL_{r}italic_L start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ← italic_L italic_L start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT
9:     end for
10:end procedure
11:procedure Refine left subspace
12:     LlIk+psubscript𝐿𝑙subscript𝐼𝑘𝑝L_{l}\leftarrow I_{k+p}italic_L start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ← italic_I start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT
13:     for j1𝑗1j\leftarrow 1italic_j ← 1 to p𝑝pitalic_p do
14:         Tk+pHL1Tk+pHLsubscriptsuperscript𝑇𝐻𝑘𝑝superscript𝐿1subscriptsuperscript𝑇𝐻𝑘𝑝𝐿T^{H}_{k+p}\leftarrow L^{-1}T^{H}_{k+p}Litalic_T start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ← italic_L start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_T start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_L \triangleright Implicitly shifted LR with shift μ¯jsubscript¯𝜇𝑗\bar{\mu}_{j}over¯ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT
15:         LlLLlsubscript𝐿𝑙𝐿subscript𝐿𝑙L_{l}\leftarrow LL_{l}italic_L start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ← italic_L italic_L start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT
16:     end for
17:end procedure
18:procedure Update
19:     βek+1TTk+pek𝛽subscriptsuperscript𝑒𝑇𝑘1subscript𝑇𝑘𝑝subscript𝑒𝑘\beta\leftarrow e^{T}_{k+1}T_{k+p}e_{k}italic_β ← italic_e start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT;       δekTTk+pek+1𝛿subscriptsuperscript𝑒𝑇𝑘subscript𝑇𝑘𝑝subscript𝑒𝑘1\delta\leftarrow e^{T}_{k}T_{k+p}e_{k+1}italic_δ ← italic_e start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT
20:     θ=ek+pT(Lr(Ll)H)ek𝜃superscriptsubscript𝑒𝑘𝑝𝑇subscript𝐿𝑟superscriptsubscript𝐿𝑙𝐻subscript𝑒𝑘\theta=e_{k+p}^{T}(L_{r}(L_{l})^{-H})e_{k}italic_θ = italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT;        ϕ=ek+pT(LrHLl)ekitalic-ϕsuperscriptsubscript𝑒𝑘𝑝𝑇superscriptsubscript𝐿𝑟𝐻subscript𝐿𝑙subscript𝑒𝑘\phi=e_{k+p}^{T}(L_{r}^{-H}L_{l})e_{k}italic_ϕ = italic_e start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_L start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
21:     v~k+1βVk+pek+1+θv~k+p+1subscript~𝑣𝑘1𝛽subscript𝑉𝑘𝑝subscript𝑒𝑘1𝜃subscript~𝑣𝑘𝑝1\widetilde{v}_{k+1}\leftarrow\beta V_{k+p}e_{k+1}+\theta\widetilde{v}_{k+p+1}over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ← italic_β italic_V start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT + italic_θ over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT \triangleright New right residual
22:     w~k+1δ¯Wk+pek+1+ϕw~k+p+1subscript~𝑤𝑘1¯𝛿subscript𝑊𝑘𝑝subscript𝑒𝑘1italic-ϕsubscript~𝑤𝑘𝑝1\widetilde{w}_{k+1}\leftarrow\bar{\delta}W_{k+p}e_{k+1}+\phi\widetilde{w}_{k+p% +1}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT ← over¯ start_ARG italic_δ end_ARG italic_W start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT + italic_ϕ over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + italic_p + 1 end_POSTSUBSCRIPT \triangleright New left residual
23:     VkVk+p(LrLlH)(Ik0p)subscript𝑉𝑘subscript𝑉𝑘𝑝subscript𝐿𝑟superscriptsubscript𝐿𝑙𝐻matrixsubscript𝐼𝑘subscript0𝑝V_{k}\leftarrow V_{k+p}(L_{r}L_{l}^{-H})\begin{pmatrix}I_{k}\\ 0_{p}\end{pmatrix}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ← italic_V start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT ) ( start_ARG start_ROW start_CELL italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) \triangleright New right Krylov space
24:     WkWk+p(LrHLl)(Ik0p)subscript𝑊𝑘subscript𝑊𝑘𝑝superscriptsubscript𝐿𝑟𝐻subscript𝐿𝑙matrixsubscript𝐼𝑘subscript0𝑝W_{k}\leftarrow W_{k+p}(L_{r}^{-H}L_{l})\begin{pmatrix}I_{k}\\ 0_{p}\end{pmatrix}italic_W start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ← italic_W start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_H end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) ( start_ARG start_ROW start_CELL italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) \triangleright New left Krylov space
25:     Tk(Ik0p)Tk+p(Ik0p)subscript𝑇𝑘matrixsubscript𝐼𝑘subscript0𝑝subscript𝑇𝑘𝑝matrixsubscript𝐼𝑘subscript0𝑝T_{k}\leftarrow\begin{pmatrix}I_{k}&0_{p}\end{pmatrix}T_{k+p}\begin{pmatrix}I_% {k}\\ 0_{p}\end{pmatrix}italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ← ( start_ARG start_ROW start_CELL italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL start_CELL 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) italic_T start_POSTSUBSCRIPT italic_k + italic_p end_POSTSUBSCRIPT ( start_ARG start_ROW start_CELL italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) \triangleright New tridiagonal matrix
26:end procedure
27:return Vk,v~k+1,Wk,w~k+1,Tksubscript𝑉𝑘subscript~𝑣𝑘1subscript𝑊𝑘subscript~𝑤𝑘1subscript𝑇𝑘V_{k},\widetilde{v}_{k+1},W_{k},\widetilde{w}_{k+1},T_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_W start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT

3 Computational Results

We present the results of computational tests performed in the Grcar matrix which is highly non-normal and has presented problems with convergence in previous studies. In his implicit restart work Sorensen [18] indeed points out that the restarted Arnoldi has trouble converging to the left-most part of the spectrum and even though the algorithm claimed convergence, what was obtained was in fact part of the pseudospectrum.

We present the results of the restarted Lanczos method applied to the 50×50505050\times 5050 × 50 Grcar matrix for k=10𝑘10k=10italic_k = 10 and p=10𝑝10p=10italic_p = 10. Figure 2 shows the spectrum obtained after performing eleven restarts of the Lanczos algorithm. The restarts were performed such that the k𝑘kitalic_k eigenvalues with the largest imaginary part were retained for the reduced operator and the remaining were used as shifts to be discarded from the subspace. The same part of the spectrum was sought by Sorensen in [18]. In this particular case we obtained w~m+1Tv~m+1O(1011)similar-tosuperscriptsubscript~𝑤𝑚1𝑇subscript~𝑣𝑚1𝑂superscript1011\widetilde{w}_{m+1}^{T}\widetilde{v}_{m+1}\sim O(10^{-11})over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT over~ start_ARG italic_v end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT ∼ italic_O ( 10 start_POSTSUPERSCRIPT - 11 end_POSTSUPERSCRIPT ) even though the individual residuals were both of order 103104superscript103superscript10410^{-3}-10^{-4}10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT - 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT. At this particular moment, one would need to employ a look-ahead step of the Lanczos however, we have shown the spectrum of the reduced matrix at this point. One may think of the spectrum to have converged to O(104)𝑂superscript104O(10^{-4})italic_O ( 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT ). As seen from the figure, the spectrum of the reduced operator matches quite well with the original spectrum of the Grcar matrix. In figure 2, we show the convergence history of the individual eigenvalues with each restart step as the Lanczos algorithm progresses. Clearly the restart process is working well to shift the Krylov spaces towards the wanted region of the spectrum. The error in the eigenvalues is of O(107)𝑂superscript107O(10^{-7})italic_O ( 10 start_POSTSUPERSCRIPT - 7 end_POSTSUPERSCRIPT ) even though the perturbation to the reduced matrix is of order 104superscript10410^{-4}10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT. This is in contrast to the results obtained in [18] which reported large perturbations to the eigenvalues even when the Arnoldi method reported convergence. We suspect this is due to the fact that the non-symmetric Lanczos approximates both the right and left eigenspaces simultaneously and leads to a lower error in the truncated matrix. We expect this to be particularly useful in hydrodynamic problems where highly non-normal matrices are a routine occurrence and where problems with convergence of the spectrum have often been reported (see [2] for example).

Refer to caption
Figure 1: Spectrum obtained from the restarted Lanczos after 11111111 restart steps.
Refer to caption
Figure 2: Error in the eigenvalues after each restart i𝑖iitalic_i.

At this point we make a note of caution that for large Grcar matrices our attempts have been somewhat less successful and the convergence sometimes fails. We expect this is due to the naive implementations of the implicit LR method that we have done in Julia where, the standard checks for small sub-diagonal elements have not been performed. In our investigations of such cases we indeed do find small sub diagonal elements which lead to loss of precision. Very small sub diagonal elements also lead to rapid loss of biorthogonality of the two subspaces. Similarly we have not paid attention to the issue arising out of the Lanczos algorithm itself, except for employing a double (two-sided) Gram-Schmidt to ensure biorthogonality. We also note that for large number of restarts, the biorthogonality property of the two subspaces is progressively lost, probably due to accruing floating point errors. Hence we expect some method of reorthogonalization would be required to ensure stability for very long calculations, which has not been done in the current work. The (implicit) LR decomposition is not unique. Uniqueness is ensured for unit main diagonal of the lower triangular matrix however, this does not pay any heed to conditioning of the transforming matrices. A better strategy of building the matrices could be pursued which has better conditioning while at the same time preserves the tridiagonal structure. These issues would require careful implementation of all the individual components and we do not address those in the current work.

Finally we note that very similar work has been reported in [7] where hyperbolic transforms (HR) are used for restarts of the bi-Lanczos method, and in [6] where the LR transformations are used, albeit in the context of model reduction.

4 Conclusion

We present an algorithm to restart the non-symmetric Lanczos method which is based on the idea of polynomial filtering via the implicit QR method proposed by Sorensen [18] for restarting the Arnoldi iteration. The (implicitly) shifted LR method is shown to be the appropriate analogue for restarting the non-symmetric Lanczos algorithm for structure preserving restarts. It is shown that the polynomial filtering process needs to be carried out for both the right and the left Krylov spaces and the appropriate transforms for maintaining biorthogonality of the two spaces are highlighted. Computational results are shown for a Grcar matrix which is known to be highly non-normal and the spectrum is found to converge adequately even when the residual error is relatively large.

Acknowledgments

The authors would like to thank Professor Elias Jarlebring for his helpful comments on the manuscript. The authors acknowledge support of Nordita and the Swedish Research Council Grant No. 2018-04290. Nordita is partially supported by Nordforsk.

References

  • [1] W. E. Arnoldi, The principle of minimized iterations in the solution of the matrix eigenvalue problem, Quarterly of applied mathematics, 9 (1951), pp. 17–29.
  • [2] M. Brynjell-Rahkola, N. Shahriari, P. Schlatter, A. Hanifi, and D. S. Henningson, Stability and sensitivity of a cross-flow-dominated falkner–skan–cooke boundary layer with discrete surface roughness, Journal of Fluid Mechanics, 826 (2017), p. 830–850.
  • [3] J. Della-Dora, Numerical linear algorithms and group theory, Linear Algebra and its Applications, 10 (1975), pp. 267–283.
  • [4] J. G. F. Francis, The QR Transformation—Part 2, The Computer Journal, 4 (1962), pp. 332–345.
  • [5] S. W. Gaaf and E. Jarlebring, The infinite bi-lanczos method for nonlinear eigenvalue problems, SIAM Journal on Scientific Computing, 39 (2017), pp. S898–S919.
  • [6] E. Grimme, D. Sorensen, and P. Van Dooren, Stable partial realizations via an implicitly restarted lanczos method, in Proceedings of 1994 American Control Conference - ACC ’94, vol. 3, 1994, pp. 2814–2818 vol.3.
  • [7] E. J. Grimme, D. C. Sorensen, and P. Van Dooren, Model reduction of state space systems via an implicitly restarted lanczos method, Numerical algorithms, 12 (1996), pp. 1–31.
  • [8] M. H. Gutknecht, Lanczos-type solvers for nonsymmetric linear systems of equations, Acta numerica, 6 (1997), pp. 271–397.
  • [9] V. Hernandez, J. E. Roman, and V. Vidal, SLEPc: A scalable and flexible toolkit for the solution of eigenvalue problems, ACM Trans. Math. Software, 31 (2005), pp. 351–362.
  • [10] C. Lanczos, An iteration method for the solution of the eigenvalue problem of linear differential and integral operators, (1950).
  • [11] R. B. Lehoucq, D. C. Sorensen, and C. Yang, ARPACK users’ guide: solution of large-scale eigenvalue problems with implicitly restarted Arnoldi methods, SIAM, 1998.
  • [12] V. Papakos and I. Jaimoukha, A deflated implicitly restarted lanczos algorithm for model reduction, in 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475), vol. 3, 2003, pp. 2902–2907 Vol.3.
  • [13] B. N. Parlett, D. R. Taylor, and Z. A. Liu, A look-ahead lanczos algorithm for unsymmetric matrices, Mathematics of Computation, 44 (1985), pp. 105–124.
  • [14] H. Rutishauser, Solution of eigenvalue problems with the lr-transformation, National Bureau of Standards, Applied Mathematics Series, 49 (1958), pp. 47–81.
  • [15] H. Rutishauser, Lectures on Numerical Mathematics, Birkhauser, 1990.
  • [16] H. Rutishauser and H. Schwarz, The lr transformation method for symmetric matrices, Numerische Mathematik, 5 (1963), pp. 273–289.
  • [17] Y. Saad, The lanczos biorthogonalization algorithm and other oblique projection methods for solving large unsymmetric systems, SIAM Journal on Numerical Analysis, 19 (1982), pp. 485–506.
  • [18] D. C. Sorensen, Implicit application of polynomial filters in a k-step arnoldi method, Siam journal on matrix analysis and applications, 13 (1992), pp. 357–385.
  • [19] G. W. Stewart, A krylov–schur algorithm for large eigenproblems, SIAM Journal on Matrix Analysis and Applications, 23 (2002), pp. 601–614.
  • [20] D. S. Watkins and L. Elsner, Chasing algorithms for the eigenvalue problem, SIAM Journal on Matrix Analysis and Applications, 12 (1991), pp. 374–384.