HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: threeparttablex

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: arXiv.org perpetual non-exclusive license
arXiv:2404.05808v1 [stat.ME] 08 Apr 2024
11footnotetext: Department of Statistics, Florida State University, Tallahassee, FL 32306, USA.22footnotetext: Department of Statistics, Texas A&M University, College Station, TX, 77843, USA.

Replicability analysis of high dimensional data accounting for dependence

Pengfei Lyu11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT, Xianyang Zhang22{}^{2}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT and Hongyuan Cao11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT Corresponding author: [email protected]
Abstract

Replicability is the cornerstone of scientific research. We study the replicability of data from high-throughput experiments, where tens of thousands of features are examined simultaneously. Existing replicability analysis methods either ignore the dependence among features or impose strong modeling assumptions, producing overly conservative or overly liberal results. Based on p𝑝pitalic_p-values from two studies, we use a four-state hidden Markov model to capture the structure of local dependence. Our method effectively borrows information from different features and studies while accounting for dependence among features and heterogeneity across studies. We show that the proposed method has better power than competing methods while controlling the false discovery rate, both empirically and theoretically. Analyzing datasets from genome-wide association studies reveals new biological insights that otherwise cannot be obtained by using existing methods. Keywords: False discovery rate; hidden Markov model; high dimensional replicability analysis; non-parametric maximum likelihood estimation.

1 Introduction

Replicability is the cornerstone of modern scientific research. Consistent findings at different times, places, and populations provide stronger scientific evidence. We study conceptual replicability, where consistent results are obtained using different procedures and populations that target the same scientific question. In high-throughput experiments, non-biological factors, such as batch effects, may confound signals in a single study. With multiple studies, if a signal is replicable, it suggests that the result is not due to chance or confounders, strengthening evidence from a single study. To focus on the main ideas, we study replicability analysis of high dimensional data from two studies. In high-throughput experiments, where tens of thousands of features are examined simultaneously, an acute problem is multiple comparisons. Compared with multiple testing from a single study, the null hypothesis for replicability analysis is composite. We regard a feature as replicable if it is non-null from both studies. The composite null hypothesis for replicability analysis consists of three states: the feature from both studies is null, the feature from one study is null, and from another study is non-null and vice versa. An ad hoc approach for replicability analysis of high dimensional data is to implement a multiple testing procedure, for instance, the Benjamini and Hochberg procedure (Benjamini and Hochberg, 1995), for each study, and get the intersection of discoveries from all studies. This commonly used and intuitive procedure does not control the false discovery rate and has low power because it does not borrow information from different studies (Bogomolov and Heller, 2023). As a conservative alternative, Benjamini et al. (2009) proposes to use the maximum of p𝑝pitalic_p-values from different studies as a test statistic and implement the Benjamini and Hochberg procedure afterward. To improve power, Lyu et al. (2023) estimates different proportions of composite null and gets a better approximation of the cumulative distribution function of the maximum of p𝑝pitalic_p-values, yet no theoretical results are provided. Li et al. (2011) proposes a reproducible discovery rate and graphical tools to assess the replicability. Philtron et al. (2018) uses the maximum rank of each feature to assess replicability non-parametrically. The procedures proposed by Li et al. (2011) and Philtron et al. (2018) both assume that the two studies share the same states in the sense that they are both null or non-null. This assumption is strong and does not incorporate the heterogeneity of different studies. Bogomolov and Heller (2018) first pre-screens the p𝑝pitalic_p-values and uses a cross-screening strategy to borrow information from two studies. An empirical Bayes approach was proposed by Heller and Yekutieli (2014) where unknown functions are estimated parametrically. Hung and Fithian (2020) discusses different criteria for replicability analysis. We refer the readers to Bogomolov and Heller (2023)for a recent survey on this topic. To the best of our knowledge, high dimensional replicability analysis accounting for dependence has not been studied before. In many high dimensional data, dependence among features is a norm rather than an exception. For instance, genome-wide association studies (GWAS) data exhibit linkage disequilibrium among single nucleotide polymorphisms (SNPs), where alleles at nearby sites can co-occur on the same phenotype more often than by chance alone. As a result, it is common to observe that phenotype associated SNPs form clusters (Pritchard and Przeworski, 2001). An effective strategy for dependence modeling is through the hidden Markov model (Li and Stephens, 2003; Sesia et al., 2021). Under the hidden Markov model, consistency of the maximum likelihood estimation is shown by Leroux (1992) with parametric assumption on the density functions of the mixture model. Bickel et al. (1998) further shows the asymptotic normality of the estimated parameters. Recently, Alexandrovich et al. (2016) gives nonparametric identification and maximum likelihood estimation for finite-state hidden Markov models, and Abraham et al. (2022) shows the optimal minimax rate of the supremum-norm convergence of preliminary estimators of the emission densities of the hidden Markov model. With one dataset, the existing false discovery rate control procedure accounting for dependence through the hidden Markov model uses Gaussian mixtures to estimate the non-null density function (Sun and Cai, 2009; Abraham et al., 2022). These procedures cannot be extended to the replicability analysis of two studies. In this paper, we develop a robust, efficient, and computationally scalable high dimensional replicability analysis method without any tuning parameters. Our method only requires paired p𝑝pitalic_p-values from two studies as the input and does not require the availability of individual data, which may be prohibitive due to privacy concerns or logistics. We use a four-state hidden Markov model to account for the heterogeneity of different states. Conditional on the hidden states, we assume that the paired p𝑝pitalic_p-values follow a four-group mixture model (Efron, 2012). We do not assume signals from different studies have the same effect size and account for such heterogeneity by modeling non-null density functions of p𝑝pitalic_p-values from two studies separately. In addition, to have robust inference, we do not impose parametric assumptions on the non-null density functions of p𝑝pitalic_p-values and develop a non-parametric maximum likelihood estimation procedure. Computationally, we combine the forward-backward algorithm (Baum et al., 1970), EM algorithm (Dempster et al., 1977) and the pool-adjacent-violator-algorithm (Robertson et al., 1988) to estimate unknown parameters and unknown functions. Theoretically, we show consistency of the estimated parameters and functions under minimum assumptions, and asymptotic false discovery rate control using the proposed estimation method.

2 Methodology

2.1 Notations and model set-up

Suppose that we have p𝑝pitalic_p-values of m𝑚mitalic_m hypotheses from two studies (y1j,y2j),j=1,,m.formulae-sequencesubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚(y_{1j},y_{2j}),j=1,\ldots,m.( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , italic_j = 1 , … , italic_m . These p𝑝pitalic_p-values can be obtained by the marginal association of each SNP with a phenotype in different populations. We are interested in identifying replicable SNPs associated with the phenotype in both studies. We use GWAS as a motivating and illustrative example and remark that our method is general and can be applied in other settings. Let θijsubscript𝜃𝑖𝑗\theta_{ij}italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT denote the hidden states of j𝑗jitalic_jth SNP in study i𝑖iitalic_i, where θij=1subscript𝜃𝑖𝑗1\theta_{ij}=1italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 1 indicates association of the j𝑗jitalic_jth SNP in study i𝑖iitalic_i and θij=0subscript𝜃𝑖𝑗0\theta_{ij}=0italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = 0 otherwise, i=1,2,j=1,,mformulae-sequence𝑖12𝑗1𝑚i=1,2,j=1,\ldots,mitalic_i = 1 , 2 , italic_j = 1 , … , italic_m. We use sj=0,1,2,3subscript𝑠𝑗0123s_{j}=0,1,2,3italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 , 1 , 2 , 3 to indicate the four possible values of the joint association status (θ1j,θ2j)=(0,0),(0,1),(1,0)subscript𝜃1𝑗subscript𝜃2𝑗000110(\theta_{1j},\theta_{2j})=(0,0),(0,1),(1,0)( italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) = ( 0 , 0 ) , ( 0 , 1 ) , ( 1 , 0 ) and (1,1)11(1,1)( 1 , 1 ). The replicability null hypothesis is composite with

H0j:sj{0,1,2},j=1,,m.:subscript𝐻0𝑗formulae-sequencesubscript𝑠𝑗012𝑗1𝑚H_{0j}:s_{j}\in\{0,1,2\},\quad j=1,\dots,m.italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT : italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } , italic_j = 1 , … , italic_m . (2.1)

To capture the local dependence structure, we assume that 𝒔=(s1,,sm)𝒔subscript𝑠1subscript𝑠𝑚\bm{s}=(s_{1},\dots,s_{m})bold_italic_s = ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) follows a four-state stationary, irreducible, and aperiodic Markov chain. The transition probabilities are akl=P(sj+1=lsj=k),j=1,,m1,formulae-sequencesubscript𝑎𝑘𝑙𝑃subscript𝑠𝑗1conditional𝑙subscript𝑠𝑗𝑘𝑗1𝑚1a_{kl}=P(s_{j+1}=l\mid s_{j}=k),\ j=1,\ldots,m-1,italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT = italic_P ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_l ∣ italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ) , italic_j = 1 , … , italic_m - 1 , where k,l=0,1,2,3formulae-sequence𝑘𝑙0123k,l=0,1,2,3italic_k , italic_l = 0 , 1 , 2 , 3 with constraint l=03akl=1superscriptsubscript𝑙03subscript𝑎𝑘𝑙1\sum_{l=0}^{3}a_{kl}=1∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT = 1. The stationary distribution of state sjsubscript𝑠𝑗s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is P(sj=k)=πk,j=1,,m;k=0,1,2,3,formulae-sequence𝑃subscript𝑠𝑗𝑘subscript𝜋𝑘formulae-sequence𝑗1𝑚𝑘0123P(s_{j}=k)=\pi_{k},j=1,\ldots,m;k=0,1,2,3,italic_P ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ) = italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_j = 1 , … , italic_m ; italic_k = 0 , 1 , 2 , 3 , where k=03πk=1superscriptsubscript𝑘03subscript𝜋𝑘1\sum_{k=0}^{3}\pi_{k}=1∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1. Denote A=(akl)R4×4𝐴subscript𝑎𝑘𝑙superscript𝑅44A=(a_{kl})\in R^{4\times 4}italic_A = ( italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ) ∈ italic_R start_POSTSUPERSCRIPT 4 × 4 end_POSTSUPERSCRIPT as the transition probability matrix and π=(π0,π1,π2,π3)𝜋subscript𝜋0subscript𝜋1subscript𝜋2subscript𝜋3\pi=(\pi_{0},\pi_{1},\pi_{2},\pi_{3})italic_π = ( italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) as the vector of stationary probabilities. Since the Markov chain is stationary, we have πA=π𝜋𝐴𝜋\pi A=\piitalic_π italic_A = italic_π. The convergence theorem of a Markov chain (Theorem 5.5.1 in Durrett (2019)) implies that m1j=1mI(sj=k)πk,superscript𝑚1superscriptsubscript𝑗1𝑚𝐼subscript𝑠𝑗𝑘subscript𝜋𝑘m^{-1}\sum_{j=1}^{m}I(s_{j}=k)\to\pi_{k},italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ) → italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , almost surely for k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3 as m𝑚m\to\inftyitalic_m → ∞. Conditional on the hidden states, we model the probability density function of p𝑝pitalic_p-values by a mixture model. Specifically,

y1jθ1j(1θ1j)f0+θ1jf1,similar-toconditionalsubscript𝑦1𝑗subscript𝜃1𝑗1subscript𝜃1𝑗subscript𝑓0subscript𝜃1𝑗subscript𝑓1\displaystyle y_{1j}\mid\theta_{1j}\sim(1-\theta_{1j})f_{0}+\theta_{1j}f_{1},italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ∼ ( 1 - italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , (2.2)
y2jθ2j(1θ2j)f0+θ2jf2,similar-toconditionalsubscript𝑦2𝑗subscript𝜃2𝑗1subscript𝜃2𝑗subscript𝑓0subscript𝜃2𝑗subscript𝑓2\displaystyle y_{2j}\mid\theta_{2j}\sim(1-\theta_{2j})f_{0}+\theta_{2j}f_{2},italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ∣ italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ∼ ( 1 - italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ,

where f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the probability density function of p𝑝pitalic_p-values when θ1j=θ2j=0,subscript𝜃1𝑗subscript𝜃2𝑗0\theta_{1j}=\theta_{2j}=0,italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT = 0 , and f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are the probability density functions of p𝑝pitalic_p-values when θ1j=1subscript𝜃1𝑗1\theta_{1j}=1italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT = 1 and θ2j=1,subscript𝜃2𝑗1\theta_{2j}=1,italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT = 1 , respectively. We assume that f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT follows the standard uniform distribution and impose the following monotone likelihood ratio condition (Sun and Cai, 2007; Cao et al., 2013).

f1(x)/f0(x) and f2(x)/f0(x) are monotonically non-increasing in x.subscript𝑓1𝑥subscript𝑓0𝑥 and subscript𝑓2𝑥subscript𝑓0𝑥 are monotonically non-increasing in 𝑥f_{1}(x)/f_{0}(x)\text{ and }f_{2}(x)/f_{0}(x)\text{ are monotonically non-% increasing in }x.italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) / italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) and italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) / italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) are monotonically non-increasing in italic_x . (2.3)

This condition is natural as small p𝑝pitalic_p-values indicate evidence against the null. The paired p𝑝pitalic_p-values are assumed to be conditionally independent given the hidden states. Based on (2.2), we have f(sj)(y1j,y2j)=f0(y1j)f0(y2j),f0(y1j)f2(y2j),f1(y1j)f0(y2j),superscript𝑓subscript𝑠𝑗subscript𝑦1𝑗subscript𝑦2𝑗subscript𝑓0subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript𝑓0subscript𝑦1𝑗subscript𝑓2subscript𝑦2𝑗subscript𝑓1subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗f^{(s_{j})}(y_{1j},y_{2j})=f_{0}(y_{1j})f_{0}(y_{2j}),f_{0}(y_{1j})f_{2}(y_{2j% }),f_{1}(y_{1j})f_{0}(y_{2j}),italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) = italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , and f1(y1j)f2(y2j)subscript𝑓1subscript𝑦1𝑗subscript𝑓2subscript𝑦2𝑗f_{1}(y_{1j})f_{2}(y_{2j})italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) for sj=0,1,2subscript𝑠𝑗012s_{j}=0,1,2italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 , 1 , 2 and 3,33,3 , respectively, where f(sj)superscript𝑓subscript𝑠𝑗f^{(s_{j})}italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT is conditional probability density function of (y1j,y2j)subscript𝑦1𝑗subscript𝑦2𝑗(y_{1j},y_{2j})( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) given state sjsubscript𝑠𝑗s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, j=1,,m𝑗1𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m.

2.2 Estimation

We use ϕ=(π,A,f1,f2)italic-ϕ𝜋𝐴subscript𝑓1subscript𝑓2\phi=(\pi,A,f_{1},f_{2})italic_ϕ = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) to denote the collection of unknown parameters and unknown functions, with the true value denoted as ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. The likelihood function for (y1j,y2j)j=1msuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚(y_{1j},y_{2j})_{j=1}^{m}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is

pm((y1j,y2j)j=1m;ϕ)=s1smπs1(ϕ)f(s1)(y11,y21;ϕ)j=2masj1,sj(ϕ)f(sj)(y1j,y2j;ϕ).subscript𝑝𝑚superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚italic-ϕsubscriptsubscript𝑠1subscriptsubscript𝑠𝑚subscript𝜋subscript𝑠1italic-ϕsuperscript𝑓subscript𝑠1subscript𝑦11subscript𝑦21italic-ϕsuperscriptsubscriptproduct𝑗2𝑚subscript𝑎subscript𝑠𝑗1subscript𝑠𝑗italic-ϕsuperscript𝑓subscript𝑠𝑗subscript𝑦1𝑗subscript𝑦2𝑗italic-ϕp_{m}\left((y_{1j},y_{2j})_{j=1}^{m};\phi\right)=\sum_{s_{1}}\dots\sum_{s_{m}}% \pi_{s_{1}}(\phi)f^{(s_{1})}(y_{11},y_{21};\phi)\prod_{j=2}^{m}a_{s_{j-1},s_{j% }}(\phi)f^{(s_{j})}(y_{1j},y_{2j};\phi).italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ; italic_ϕ ) = ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT … ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ; italic_ϕ ) ∏ start_POSTSUBSCRIPT italic_j = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ; italic_ϕ ) .

The maximum-likelihood estimate is defined as

ϕ^m=argmaxϕΦpm((y1j,y2j)j=1m;ϕ),subscript^italic-ϕ𝑚italic-ϕΦsubscript𝑝𝑚superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚italic-ϕ\displaystyle\hat{\phi}_{m}=\underset{\phi\in\Phi}{\arg\max}\>p_{m}\left((y_{1% j},y_{2j})_{j=1}^{m};\phi\right),over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = start_UNDERACCENT italic_ϕ ∈ roman_Φ end_UNDERACCENT start_ARG roman_arg roman_max end_ARG italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ; italic_ϕ ) , (2.4)

where ΦΦ\Phiroman_Φ is the parameter space of ϕ.italic-ϕ\phi.italic_ϕ . Given ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, define the forward probability as αj(sj)=Pϕ0((y1t,y2t)t=1j,sj)subscript𝛼𝑗subscript𝑠𝑗subscript𝑃subscriptitalic-ϕ0superscriptsubscriptsubscript𝑦1𝑡subscript𝑦2𝑡𝑡1𝑗subscript𝑠𝑗\alpha_{j}(s_{j})=P_{\phi_{0}}\left((y_{1t},y_{2t})_{t=1}^{j},s_{j}\right)italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) and the backward probability as βj(sj)=Pϕ0((y1t,y2t)t=j+1msj)subscript𝛽𝑗subscript𝑠𝑗subscript𝑃subscriptitalic-ϕ0conditionalsuperscriptsubscriptsubscript𝑦1𝑡subscript𝑦2𝑡𝑡𝑗1𝑚subscript𝑠𝑗\beta_{j}(s_{j})=P_{\phi_{0}}\left((y_{1t},y_{2t})_{t=j+1}^{m}\mid s_{j}\right)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ), respectively. The forward-backward procedure (Baum et al., 1970) is used in the calculation. Specifically, we first initialize α1(s1)=πs1f(s1)(y11,y21)subscript𝛼1subscript𝑠1subscript𝜋subscript𝑠1superscript𝑓subscript𝑠1subscript𝑦11subscript𝑦21\alpha_{1}(s_{1})=\pi_{s_{1}}f^{(s_{1})}(y_{11},y_{21})italic_α start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = italic_π start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ) and βm(sm)=1.subscript𝛽𝑚subscript𝑠𝑚1\beta_{m}(s_{m})=1.italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) = 1 . We can obtain αj()subscript𝛼𝑗\alpha_{j}(\cdot)italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ⋅ ) and βj()subscript𝛽𝑗\beta_{j}(\cdot)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ⋅ ) for j=1,,m𝑗1𝑚j=1,\dots,mitalic_j = 1 , … , italic_m recursively by αj+1(sj+1)=sj=03αj(sj)asjsj+1f(sj+1)(y1,j+1,y2,j+1),subscript𝛼𝑗1subscript𝑠𝑗1superscriptsubscriptsubscript𝑠𝑗03subscript𝛼𝑗subscript𝑠𝑗subscript𝑎subscript𝑠𝑗subscript𝑠𝑗1superscript𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1\alpha_{j+1}(s_{j+1})=\sum_{s_{j}=0}^{3}\alpha_{j}(s_{j})a_{s_{j}s_{j+1}}f^{(s% _{j+1})}(y_{1,j+1},y_{2,j+1}),italic_α start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) , and similarly, βj(sj)=sj+1=03βj+1(sj+1)f(sj+1)(y1,j+1,y2,j+1)asjsj+1.subscript𝛽𝑗subscript𝑠𝑗superscriptsubscriptsubscript𝑠𝑗103subscript𝛽𝑗1subscript𝑠𝑗1superscript𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1subscript𝑎subscript𝑠𝑗subscript𝑠𝑗1\beta_{j}(s_{j})=\sum_{s_{j+1}=0}^{3}\beta_{j+1}(s_{j+1})f^{(s_{j+1})}(y_{1,j+% 1},y_{2,j+1})a_{s_{j}s_{j+1}}.italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . Define two posterior probabilities as γj(sj)=Pϕ0(sj(y1j,y2j)j=1m)subscript𝛾𝑗subscript𝑠𝑗subscript𝑃subscriptitalic-ϕ0conditionalsubscript𝑠𝑗superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚\gamma_{j}(s_{j})=P_{\phi_{0}}(s_{j}\mid(y_{1j},y_{2j})_{j=1}^{m})italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) and ξj(sj,sj+1)=Pϕ0(sj,sj+1(y1j,y2j)j=1m)subscript𝜉𝑗subscript𝑠𝑗subscript𝑠𝑗1subscript𝑃subscriptitalic-ϕ0subscript𝑠𝑗conditionalsubscript𝑠𝑗1superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚\xi_{j}(s_{j},s_{j+1})=P_{\phi_{0}}(s_{j},s_{j+1}\mid(y_{1j},y_{2j})_{j=1}^{m})italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ). By definition, we have γj(sj)=sj+1=03ξj(sj,sj+1)subscript𝛾𝑗subscript𝑠𝑗superscriptsubscriptsubscript𝑠𝑗103subscript𝜉𝑗subscript𝑠𝑗subscript𝑠𝑗1\gamma_{j}(s_{j})=\sum_{s_{j+1}=0}^{3}\xi_{j}(s_{j},s_{j+1})italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ). They can be obtained from the forward and backward probabilities through

γj(sj)=αj(sj)βj(sj)sj=03αj(sj)βj(sj)subscript𝛾𝑗subscript𝑠𝑗subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗subscript𝑠𝑗superscriptsubscriptsubscript𝑠𝑗03subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗subscript𝑠𝑗\displaystyle\gamma_{j}(s_{j})=\frac{\alpha_{j}(s_{j})\beta_{j}(s_{j})}{\sum_{% s_{j}=0}^{3}\alpha_{j}(s_{j})\beta_{j}(s_{j})}italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = divide start_ARG italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG

and

ξj(sj,sj+1)=αj(sj)βj+1(sj+1)asjsj+1f(sj+1)(y1,j+1,y2,j+1)sj=03sj+1=03αj(sj)βj+1(sj+1)asjsj+1f(sj+1)(y1,j+1,y2,j+1).subscript𝜉𝑗subscript𝑠𝑗subscript𝑠𝑗1subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗1subscript𝑠𝑗1subscript𝑎subscript𝑠𝑗subscript𝑠𝑗1superscript𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1superscriptsubscriptsubscript𝑠𝑗03superscriptsubscriptsubscript𝑠𝑗103subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗1subscript𝑠𝑗1subscript𝑎subscript𝑠𝑗subscript𝑠𝑗1superscript𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1\displaystyle\xi_{j}(s_{j},s_{j+1})=\frac{\alpha_{j}(s_{j})\beta_{j+1}(s_{j+1}% )a_{s_{j}s_{j+1}}f^{(s_{j+1})}(y_{1,j+1},y_{2,j+1})}{\sum_{s_{j}=0}^{3}\sum_{s% _{j+1}=0}^{3}\alpha_{j}(s_{j})\beta_{j+1}(s_{j+1})a_{s_{j}s_{j+1}}f^{(s_{j+1})% }(y_{1,j+1},y_{2,j+1})}.italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) = divide start_ARG italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) end_ARG .

The likelihood function of the complete data (y1j,y2j,sj)j=1msuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗subscript𝑠𝑗𝑗1𝑚(y_{1j},y_{2j},s_{j})_{j=1}^{m}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is given by

L(ϕ;(y1j,y2j,sj)j=1m)=πs1j=2masj1sjj=1mf(sj)(y1j,y2j).𝐿italic-ϕsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗subscript𝑠𝑗𝑗1𝑚subscript𝜋subscript𝑠1superscriptsubscriptproduct𝑗2𝑚subscript𝑎subscript𝑠𝑗1subscript𝑠𝑗superscriptsubscriptproduct𝑗1𝑚superscript𝑓subscript𝑠𝑗subscript𝑦1𝑗subscript𝑦2𝑗\displaystyle L\left(\phi;(y_{1j},y_{2j},s_{j})_{j=1}^{m}\right)=\pi_{s_{1}}% \prod_{j=2}^{m}a_{s_{j-1}s_{j}}\cdot\prod_{j=1}^{m}f^{(s_{j})}(y_{1j},y_{2j}).italic_L ( italic_ϕ ; ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) = italic_π start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) .

We use the EM algorithm (Dempster et al., 1977) in combination of the pool-adjacent-violator-algorithm (Robertson et al., 1988) to estimate the unknowns ϕ=(π,A,f1,f2).italic-ϕ𝜋𝐴subscript𝑓1subscript𝑓2\phi=(\pi,A,f_{1},f_{2}).italic_ϕ = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) . With an appropriate initialization ϕ(0)=(π(0),A(0),f1(0),f2(0))superscriptitalic-ϕ0superscript𝜋0superscript𝐴0superscriptsubscript𝑓10superscriptsubscript𝑓20\phi^{(0)}=\left(\pi^{(0)},A^{(0)},f_{1}^{(0)},f_{2}^{(0)}\right)italic_ϕ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = ( italic_π start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , italic_A start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ), the EM algorithm proceeds by iteratively implementing the following two steps. E-step: Given current ϕ(t)=(π(t),A(t),f1(t),f2(t))superscriptitalic-ϕ𝑡superscript𝜋𝑡superscript𝐴𝑡superscriptsubscript𝑓1𝑡superscriptsubscript𝑓2𝑡\phi^{(t)}=\left(\pi^{(t)},A^{(t)},f_{1}^{(t)},f_{2}^{(t)}\right)italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ( italic_π start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , italic_A start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ), the forward and backward probabilities αj(t)(sj),βj(t)(sj),superscriptsubscript𝛼𝑗𝑡subscript𝑠𝑗superscriptsubscript𝛽𝑗𝑡subscript𝑠𝑗\alpha_{j}^{(t)}(s_{j}),\beta_{j}^{(t)}(s_{j}),italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , we can obtain the posterior probabilities γj(t)(sj),ξj(t)(sj,sj+1).superscriptsubscript𝛾𝑗𝑡subscript𝑠𝑗superscriptsubscript𝜉𝑗𝑡subscript𝑠𝑗subscript𝑠𝑗1\gamma_{j}^{(t)}(s_{j}),\xi_{j}^{(t)}(s_{j},s_{j+1}).italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) . The conditional expectation of the log-likelihood function is

D(ϕϕ(t))𝐷conditionalitalic-ϕsuperscriptitalic-ϕ𝑡\displaystyle D\left(\phi\mid\phi^{(t)}\right)italic_D ( italic_ϕ ∣ italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) =𝒔Pϕ(t)(𝒔(y1j,y2j)j=1m)logL(ϕ;(y1j,y2j)j=1m,𝒔)absentsubscript𝒔subscript𝑃superscriptitalic-ϕ𝑡conditional𝒔superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚𝐿italic-ϕsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚𝒔\displaystyle=\sum_{\bm{s}}P_{\phi^{(t)}}\left(\bm{s}\mid(y_{1j},y_{2j})_{j=1}% ^{m}\right)\log L\left(\phi;(y_{1j},y_{2j})_{j=1}^{m},\bm{s}\right)= ∑ start_POSTSUBSCRIPT bold_italic_s end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( bold_italic_s ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) roman_log italic_L ( italic_ϕ ; ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , bold_italic_s )
=𝒔[Pϕ(t)(𝒔(y1j,y2j)j=1m){logπs1+j=2mlogasj1sj+j=1mlogf(sj)(y1j,y2j)}].absentsubscript𝒔delimited-[]subscript𝑃superscriptitalic-ϕ𝑡conditional𝒔superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚subscript𝜋subscript𝑠1superscriptsubscript𝑗2𝑚subscript𝑎subscript𝑠𝑗1subscript𝑠𝑗superscriptsubscript𝑗1𝑚superscript𝑓subscript𝑠𝑗subscript𝑦1𝑗subscript𝑦2𝑗\displaystyle=\sum_{\bm{s}}\left[P_{\phi^{(t)}}\left(\bm{s}\mid(y_{1j},y_{2j})% _{j=1}^{m}\right)\left\{\log\pi_{s_{1}}+\sum_{j=2}^{m}\log a_{s_{j-1}s_{j}}+% \sum_{j=1}^{m}\log f^{(s_{j})}(y_{1j},y_{2j})\right\}\right].= ∑ start_POSTSUBSCRIPT bold_italic_s end_POSTSUBSCRIPT [ italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( bold_italic_s ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) { roman_log italic_π start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_j = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_log italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_log italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) } ] .

M-step: Update ϕ(t+1)superscriptitalic-ϕ𝑡1\phi^{(t+1)}italic_ϕ start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT by

ϕ(t+1)superscriptitalic-ϕ𝑡1\displaystyle\phi^{(t+1)}italic_ϕ start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT =argmaxπ,A,f1,f2D(π,A,f1,f2ϕ(t)).absent𝜋𝐴subscript𝑓1subscript𝑓2𝐷𝜋𝐴subscript𝑓1conditionalsubscript𝑓2superscriptitalic-ϕ𝑡\displaystyle=\underset{\pi,A,f_{1},f_{2}}{\arg\max}D\left(\pi,A,f_{1},f_{2}% \mid\phi^{(t)}\right).= start_UNDERACCENT italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_arg roman_max end_ARG italic_D ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∣ italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) .

By using the Lagrange multiplier, we can calculate π(t+1)=(π0(t+1),π1(t+1),π2(t+1),π3(t+1))superscript𝜋𝑡1superscriptsubscript𝜋0𝑡1superscriptsubscript𝜋1𝑡1superscriptsubscript𝜋2𝑡1superscriptsubscript𝜋3𝑡1\pi^{(t+1)}=(\pi_{0}^{(t+1)},\pi_{1}^{(t+1)},\pi_{2}^{(t+1)},\pi_{3}^{(t+1)})italic_π start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = ( italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) and each element (akl(t+1))superscriptsubscript𝑎𝑘𝑙𝑡1(a_{kl}^{(t+1)})( italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT ) of A(t+1)superscript𝐴𝑡1A^{(t+1)}italic_A start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT with k,l=0,1,2,3formulae-sequence𝑘𝑙0123k,l=0,1,2,3italic_k , italic_l = 0 , 1 , 2 , 3 as

πs(t+1)=γ1(t)(s),s{0,1,2,3}formulae-sequencesuperscriptsubscript𝜋𝑠𝑡1superscriptsubscript𝛾1𝑡𝑠𝑠0123\displaystyle\pi_{s}^{(t+1)}=\gamma_{1}^{(t)}(s),\quad s\in\{0,1,2,3\}italic_π start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = italic_γ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_s ) , italic_s ∈ { 0 , 1 , 2 , 3 }

and

akl(t+1)=j=2mξj1(t)(k,l)j=2ml=03ξj1(t)(k,l),k,l{0,1,2,3}.formulae-sequencesuperscriptsubscript𝑎𝑘𝑙𝑡1superscriptsubscript𝑗2𝑚superscriptsubscript𝜉𝑗1𝑡𝑘𝑙superscriptsubscript𝑗2𝑚superscriptsubscript𝑙03superscriptsubscript𝜉𝑗1𝑡𝑘𝑙𝑘𝑙0123\displaystyle a_{kl}^{(t+1)}=\frac{\sum_{j=2}^{m}\xi_{j-1}^{(t)}(k,l)}{\sum_{j% =2}^{m}\sum_{l=0}^{3}\xi_{j-1}^{(t)}(k,l)},\quad k,l\in\{0,1,2,3\}.italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ξ start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_k , italic_l ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_ξ start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_k , italic_l ) end_ARG , italic_k , italic_l ∈ { 0 , 1 , 2 , 3 } .

The two functions can be updated by

f1(t+1)=argmaxf1j=1m{(γj(t)(2)+γj(t)(3))logf1(y1j)},superscriptsubscript𝑓1𝑡1subscript𝑓1superscriptsubscript𝑗1𝑚superscriptsubscript𝛾𝑗𝑡2superscriptsubscript𝛾𝑗𝑡3subscript𝑓1subscript𝑦1𝑗\displaystyle f_{1}^{(t+1)}=\underset{f_{1}\in\mathcal{H}}{\arg\max}\sum_{j=1}% ^{m}\left\{\left(\gamma_{j}^{(t)}(2)+\gamma_{j}^{(t)}(3)\right)\log f_{1}(y_{1% j})\right\},italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = start_UNDERACCENT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ caligraphic_H end_UNDERACCENT start_ARG roman_arg roman_max end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { ( italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( 2 ) + italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( 3 ) ) roman_log italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) } , (2.5)

and

f2(t+1)=argmaxf2j=1m{(γj(t)(1)+γj(t)(3))logf2(y2j)},superscriptsubscript𝑓2𝑡1subscript𝑓2superscriptsubscript𝑗1𝑚superscriptsubscript𝛾𝑗𝑡1superscriptsubscript𝛾𝑗𝑡3subscript𝑓2subscript𝑦2𝑗\displaystyle f_{2}^{(t+1)}=\underset{f_{2}\in\mathcal{H}}{\arg\max}\sum_{j=1}% ^{m}\left\{\left(\gamma_{j}^{(t)}(1)+\gamma_{j}^{(t)}(3)\right)\log f_{2}(y_{2% j})\right\},italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = start_UNDERACCENT italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_H end_UNDERACCENT start_ARG roman_arg roman_max end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { ( italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( 1 ) + italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( 3 ) ) roman_log italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) } , (2.6)

where \mathcal{H}caligraphic_H is the space of non-increasing density functions with support (0,1)01(0,1)( 0 , 1 ). We iterate between the E-step and M-step until the algorithm converges. Next, we provide details to solve (2.5). We first order the p𝑝pitalic_p-values {y1j}j=1msuperscriptsubscriptsubscript𝑦1𝑗𝑗1𝑚\{y_{1j}\}_{j=1}^{m}{ italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT as 0=y1(0)y1(1)y1(m)y1(m+1)=10subscript𝑦10subscript𝑦11subscript𝑦1𝑚subscript𝑦1𝑚110=y_{1(0)}\leq y_{1(1)}\leq\ldots\leq y_{1(m)}\leq y_{1(m+1)}=10 = italic_y start_POSTSUBSCRIPT 1 ( 0 ) end_POSTSUBSCRIPT ≤ italic_y start_POSTSUBSCRIPT 1 ( 1 ) end_POSTSUBSCRIPT ≤ … ≤ italic_y start_POSTSUBSCRIPT 1 ( italic_m ) end_POSTSUBSCRIPT ≤ italic_y start_POSTSUBSCRIPT 1 ( italic_m + 1 ) end_POSTSUBSCRIPT = 1. Denote Γj(t)=γj(t)(2)+γj(t)(3)superscriptsubscriptΓ𝑗𝑡superscriptsubscript𝛾𝑗𝑡2superscriptsubscript𝛾𝑗𝑡3\Gamma_{j}^{(t)}=\gamma_{j}^{(t)}(2)+\gamma_{j}^{(t)}(3)roman_Γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( 2 ) + italic_γ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( 3 ) and let Γ(j)(t)superscriptsubscriptΓ𝑗𝑡\Gamma_{(j)}^{(t)}roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT correspond to y1(j)subscript𝑦1𝑗y_{1(j)}italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT. Denote zj=f1(y1(j))subscript𝑧𝑗subscript𝑓1subscript𝑦1𝑗z_{j}=f_{1}(y_{1(j)})italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT ) and 𝒛=(z1,,zm)𝒛subscript𝑧1subscript𝑧𝑚\bm{z}=(z_{1},\ldots,z_{m})bold_italic_z = ( italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ). Let 𝒬={𝒛:z1z2zm}𝒬conditional-set𝒛subscript𝑧1subscript𝑧2subscript𝑧𝑚\mathcal{Q}=\{\bm{z}:z_{1}\geq z_{2}\geq\ldots\geq z_{m}\}caligraphic_Q = { bold_italic_z : italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … ≥ italic_z start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } be the space of 𝒛𝒛\bm{z}bold_italic_z. We aim to find

𝒛^=argmax𝒛𝒬j=1m{Γ(j)(t)logzj}, subject to j=1m{(y1(j)y1(j1))zj}=1.formulae-sequence^𝒛𝒛𝒬superscriptsubscript𝑗1𝑚superscriptsubscriptΓ𝑗𝑡subscript𝑧𝑗 subject to superscriptsubscript𝑗1𝑚subscript𝑦1𝑗subscript𝑦1𝑗1subscript𝑧𝑗1\displaystyle\hat{\bm{z}}=\underset{\bm{z}\in\mathcal{Q}}{\arg\max}\sum_{j=1}^% {m}\left\{\Gamma_{(j)}^{(t)}\log z_{j}\right\},\quad\text{ subject to }\sum_{j% =1}^{m}\{(y_{1(j)}-y_{1(j-1)})z_{j}\}=1.over^ start_ARG bold_italic_z end_ARG = start_UNDERACCENT bold_italic_z ∈ caligraphic_Q end_UNDERACCENT start_ARG roman_arg roman_max end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT roman_log italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } , subject to ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { ( italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT 1 ( italic_j - 1 ) end_POSTSUBSCRIPT ) italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } = 1 .

Using the Lagrangian multiplier, the objective function we want to maximize becomes

L(𝒛,λ)=𝐿𝒛𝜆absent\displaystyle L(\bm{z},\lambda)=italic_L ( bold_italic_z , italic_λ ) = j=1m{Γ(j)(t)logzj}+λ[j=1m{(y1(j)y1(j1))zj}1].superscriptsubscript𝑗1𝑚superscriptsubscriptΓ𝑗𝑡subscript𝑧𝑗𝜆delimited-[]superscriptsubscript𝑗1𝑚subscript𝑦1𝑗subscript𝑦1𝑗1subscript𝑧𝑗1\displaystyle\sum_{j=1}^{m}\left\{\Gamma_{(j)}^{(t)}\log z_{j}\right\}+\lambda% \left[\sum_{j=1}^{m}\{(y_{1(j)}-y_{1(j-1)})z_{j}\}-1\right].∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT roman_log italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } + italic_λ [ ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT { ( italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT 1 ( italic_j - 1 ) end_POSTSUBSCRIPT ) italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } - 1 ] .

Taking derivatives with respect to zjsubscript𝑧𝑗z_{j}italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and λ,𝜆\lambda,italic_λ , we have

λ~=~𝜆absent\displaystyle\tilde{\lambda}=over~ start_ARG italic_λ end_ARG = j=1mΓ(j)(t),superscriptsubscript𝑗1𝑚superscriptsubscriptΓ𝑗𝑡\displaystyle-\sum_{j=1}^{m}\Gamma_{(j)}^{(t)},- ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ,
z~j=subscript~𝑧𝑗absent\displaystyle\tilde{z}_{j}=over~ start_ARG italic_z end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = Γ(j)(t)k=1mΓk(t)1y1(j)y1(j1),j=1,,m.formulae-sequencesuperscriptsubscriptΓ𝑗𝑡superscriptsubscript𝑘1𝑚superscriptsubscriptΓ𝑘𝑡1subscript𝑦1𝑗subscript𝑦1𝑗1𝑗1𝑚\displaystyle\frac{\Gamma_{(j)}^{(t)}}{\sum_{k=1}^{m}\Gamma_{k}^{(t)}}\cdot% \frac{1}{y_{1(j)}-y_{1(j-1)}},\quad j=1,\ldots,m.divide start_ARG roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG ⋅ divide start_ARG 1 end_ARG start_ARG italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT 1 ( italic_j - 1 ) end_POSTSUBSCRIPT end_ARG , italic_j = 1 , … , italic_m .

To incorporate the non-increasing constraint on zj,subscript𝑧𝑗z_{j},italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , we have

𝒛^=^𝒛absent\displaystyle\hat{\bm{z}}=over^ start_ARG bold_italic_z end_ARG = argmin𝒛𝒬{L(𝒛,λ~)}𝒛𝒬𝐿𝒛~𝜆\displaystyle\underset{\bm{z}\in\mathcal{Q}}{\arg\min}\left\{-L(\bm{z},\tilde{% \lambda})\right\}start_UNDERACCENT bold_italic_z ∈ caligraphic_Q end_UNDERACCENT start_ARG roman_arg roman_min end_ARG { - italic_L ( bold_italic_z , over~ start_ARG italic_λ end_ARG ) }
=\displaystyle== argmin𝒛𝒬j=1m(Γ(j)(t)[logzj{k=1mΓ(k)(t)}{y1(j)y1(j1)}Γ(j)(t)zj]).𝒛𝒬superscriptsubscript𝑗1𝑚superscriptsubscriptΓ𝑗𝑡delimited-[]subscript𝑧𝑗superscriptsubscript𝑘1𝑚superscriptsubscriptΓ𝑘𝑡subscript𝑦1𝑗subscript𝑦1𝑗1subscriptsuperscriptΓ𝑡𝑗subscript𝑧𝑗\displaystyle\underset{\bm{z}\in\mathcal{Q}}{\arg\min}\sum_{j=1}^{m}\left(% \Gamma_{(j)}^{(t)}\left[-\log z_{j}-\frac{-\{\sum_{k=1}^{m}\Gamma_{(k)}^{(t)}% \}\{y_{1(j)}-y_{1(j-1)}\}}{\Gamma^{(t)}_{(j)}}z_{j}\right]\right).start_UNDERACCENT bold_italic_z ∈ caligraphic_Q end_UNDERACCENT start_ARG roman_arg roman_min end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT [ - roman_log italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - divide start_ARG - { ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT ( italic_k ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } { italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT 1 ( italic_j - 1 ) end_POSTSUBSCRIPT } end_ARG start_ARG roman_Γ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT end_ARG italic_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ] ) .

Let 𝒖=(u1,,um)𝒖subscript𝑢1subscript𝑢𝑚\bm{u}=(u_{1},\ldots,u_{m})bold_italic_u = ( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) and

𝒖^=^𝒖absent\displaystyle\hat{\bm{u}}=over^ start_ARG bold_italic_u end_ARG = argmin𝒖𝒬j=1m(Γ(j)(t)[uj{k=1mΓ(k)(t)}{y1(j)y1(j1)}Γ(j)(t)]2).𝒖𝒬superscriptsubscript𝑗1𝑚superscriptsubscriptΓ𝑗𝑡superscriptdelimited-[]subscript𝑢𝑗superscriptsubscript𝑘1𝑚superscriptsubscriptΓ𝑘𝑡subscript𝑦1𝑗subscript𝑦1𝑗1superscriptsubscriptΓ𝑗𝑡2\displaystyle\underset{\bm{u}\in\mathcal{Q}}{\arg\min}\sum_{j=1}^{m}\left(% \Gamma_{(j)}^{(t)}\left[u_{j}-\frac{-\{\sum_{k=1}^{m}\Gamma_{(k)}^{(t)}\}\{y_{% 1(j)}-y_{1(j-1)}\}}{\Gamma_{(j)}^{(t)}}\right]^{2}\right).start_UNDERACCENT bold_italic_u ∈ caligraphic_Q end_UNDERACCENT start_ARG roman_arg roman_min end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT [ italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - divide start_ARG - { ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT ( italic_k ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } { italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT 1 ( italic_j - 1 ) end_POSTSUBSCRIPT } end_ARG start_ARG roman_Γ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

We have

u^j=maxbjminaj{k=1mΓ(k)(t)}k=ab{y1(k)y1(k1)}k=abΓ(k)(t),subscript^𝑢𝑗subscript𝑏𝑗subscript𝑎𝑗superscriptsubscript𝑘1𝑚superscriptsubscriptΓ𝑘𝑡superscriptsubscript𝑘𝑎𝑏subscript𝑦1𝑘subscript𝑦1𝑘1superscriptsubscript𝑘𝑎𝑏superscriptsubscriptΓ𝑘𝑡\displaystyle\hat{u}_{j}=\max_{b\geq j}\min_{a\leq j}\frac{-\left\{\sum_{k=1}^% {m}\Gamma_{(k)}^{(t)}\right\}\sum_{k=a}^{b}\{y_{1(k)}-y_{1(k-1)}\}}{\sum_{k=a}% ^{b}\Gamma_{(k)}^{(t)}},over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT italic_b ≥ italic_j end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_a ≤ italic_j end_POSTSUBSCRIPT divide start_ARG - { ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT ( italic_k ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } ∑ start_POSTSUBSCRIPT italic_k = italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT { italic_y start_POSTSUBSCRIPT 1 ( italic_k ) end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT 1 ( italic_k - 1 ) end_POSTSUBSCRIPT } end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT roman_Γ start_POSTSUBSCRIPT ( italic_k ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT end_ARG ,

which can be obtained by the pool-adjacent-violator-algorithm. According to Theorem 3.1 of Barlow and Brunk (1972), we have f^1(y1(j))=z^j=1/u^jsubscript^𝑓1subscript𝑦1𝑗subscript^𝑧𝑗1subscript^𝑢𝑗\hat{f}_{1}(y_{1(j)})=\hat{z}_{j}=-1/\hat{u}_{j}over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 ( italic_j ) end_POSTSUBSCRIPT ) = over^ start_ARG italic_z end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = - 1 / over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, j=1,,m𝑗1𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m. The calculation of f2(t+1)superscriptsubscript𝑓2𝑡1f_{2}^{(t+1)}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT in (2.6) follows the same line and we omit the details.

2.3 Testing

The test statistic we use is the replicability local index of significance (rLIS), which is defined as the posterior probability of being null given the observed paired p𝑝pitalic_p-values. Specifically,

rLISj:=Pϕ0(sj{0,1,2}(y1j,y2j)j=1m),j=1,,m.formulae-sequenceassignsubscriptrLIS𝑗subscript𝑃subscriptitalic-ϕ0subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚𝑗1𝑚{\rm rLIS}_{j}:=P_{\phi_{0}}\left(s_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{j=1}^{% m}\right),\quad j=1,\ldots,m.roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT := italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) , italic_j = 1 , … , italic_m .

Denote I(B)𝐼𝐵I(B)italic_I ( italic_B ) as the indicator function of event B𝐵Bitalic_B, i.e., I(B)=1𝐼𝐵1I(B)=1italic_I ( italic_B ) = 1 if B𝐵Bitalic_B is true and I(B)=0𝐼𝐵0I(B)=0italic_I ( italic_B ) = 0 otherwise. With a rejection threshold λ𝜆\lambdaitalic_λ, H0jsubscript𝐻0𝑗H_{0j}italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT is rejected if rLISjλsubscriptrLIS𝑗𝜆{\rm rLIS}_{j}\leq\lambdaroman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ. We can write the total number of rejections as

R(λ)=j=1mI(rLISjλ).𝑅𝜆superscriptsubscript𝑗1𝑚𝐼subscriptrLIS𝑗𝜆R(\lambda)=\sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda).italic_R ( italic_λ ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) .

The number of false rejections is

V(λ)=j=1mI(rLISjλ,sj{0,1,2}).𝑉𝜆superscriptsubscript𝑗1𝑚𝐼formulae-sequencesubscriptrLIS𝑗𝜆subscript𝑠𝑗012V(\lambda)=\sum_{j=1}^{m}I\left({\rm rLIS}_{j}\leq\lambda,s_{j}\in\{0,1,2\}% \right).italic_V ( italic_λ ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) .

The law of total expectation gives

E{V(λ)}=𝐸𝑉𝜆absent\displaystyle E\{V(\lambda)\}=italic_E { italic_V ( italic_λ ) } = E{j=1mI(rLISjλ,sj{0,1,2})}𝐸superscriptsubscript𝑗1𝑚𝐼formulae-sequencesubscriptrLIS𝑗𝜆subscript𝑠𝑗012\displaystyle E\left\{\sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda,s_{j}\in\{0,1,% 2\})\right\}italic_E { ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) }
=\displaystyle== E[E{j=1mI(rLISjλ,sj{0,1,2})(y1j,y2j)j=1m}]𝐸delimited-[]𝐸conditional-setsuperscriptsubscript𝑗1𝑚𝐼formulae-sequencesubscriptrLIS𝑗𝜆subscript𝑠𝑗012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚\displaystyle E\left[E\left\{\sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda,s_{j}% \in\{0,1,2\})\mid(y_{1j},y_{2j})_{j=1}^{m}\right\}\right]italic_E [ italic_E { ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT } ]
=\displaystyle== E{j=1mI(rLISjλ)rLISj}.𝐸superscriptsubscript𝑗1𝑚𝐼subscriptrLIS𝑗𝜆subscriptrLIS𝑗\displaystyle E\left\{\sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda){\rm rLIS}_{j}% \right\}.italic_E { ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } . (2.7)

We aim to control the false discovery rate at a pre-specified level q,𝑞q,italic_q , where

FDR(λ)=E{V(λ)R(λ)1}.FDR𝜆𝐸𝑉𝜆𝑅𝜆1{\rm FDR}(\lambda)=E\left\{\frac{V(\lambda)}{R(\lambda)\vee 1}\right\}.roman_FDR ( italic_λ ) = italic_E { divide start_ARG italic_V ( italic_λ ) end_ARG start_ARG italic_R ( italic_λ ) ∨ 1 end_ARG } .

The false discovery proportion is defined as

FDP(λ)=V(λ)R(λ)1=j=1mI(rLISjλ,sj{0,1,2}){j=1mI(rLISjλ)}1.FDP𝜆𝑉𝜆𝑅𝜆1superscriptsubscript𝑗1𝑚𝐼formulae-sequencesubscriptrLIS𝑗𝜆subscript𝑠𝑗012superscriptsubscript𝑗1𝑚𝐼subscriptrLIS𝑗𝜆1\text{FDP}(\lambda)=\frac{V(\lambda)}{R(\lambda)\vee 1}=\frac{\sum_{j=1}^{m}I(% {\rm rLIS}_{j}\leq\lambda,s_{j}\in\{0,1,2\})}{\left\{\sum_{j=1}^{m}I({\rm rLIS% }_{j}\leq\lambda)\right\}\vee 1}.FDP ( italic_λ ) = divide start_ARG italic_V ( italic_λ ) end_ARG start_ARG italic_R ( italic_λ ) ∨ 1 end_ARG = divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) end_ARG start_ARG { ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) } ∨ 1 end_ARG .

By (2.7),

FDP(λ)j=1mI(rLISjλ)rLISj{j=1mI(rLISjλ)}1.FDP𝜆superscriptsubscript𝑗1𝑚𝐼subscriptrLIS𝑗𝜆subscriptrLIS𝑗superscriptsubscript𝑗1𝑚𝐼subscriptrLIS𝑗𝜆1{\rm FDP}(\lambda)\approx\frac{\sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda){\rm rLIS% }_{j}}{\left\{\sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda)\right\}\vee 1}.roman_FDP ( italic_λ ) ≈ divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG { ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) } ∨ 1 end_ARG .

In the oracle case, we assume ϕ0=(π,A,f1,f2)subscriptitalic-ϕ0𝜋𝐴subscript𝑓1subscript𝑓2\phi_{0}=(\pi,A,f_{1},f_{2})italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) is known. With nominal FDR level q𝑞qitalic_q, we can apply the following testing procedure.

λOR=sup{λ0:j=1mI(rLISjλ)rLISj{j=1mI(rLISjλ)}1q},and reject H0j if rLISjλOR for j=1,2,,m.missing-subexpressionsubscript𝜆ORsupremumconditional-set𝜆0superscriptsubscript𝑗1𝑚𝐼subscriptrLIS𝑗𝜆subscriptrLIS𝑗superscriptsubscript𝑗1𝑚𝐼subscriptrLIS𝑗𝜆1𝑞missing-subexpressionformulae-sequenceand reject subscript𝐻0𝑗 if subscriptrLIS𝑗subscript𝜆OR for 𝑗12𝑚\displaystyle\begin{aligned} &\lambda_{\rm OR}=\sup\left\{\lambda\geq 0:\frac{% \sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda){\rm rLIS}_{j}}{\left\{\sum_{j=1}^{m% }I({\rm rLIS}_{j}\leq\lambda)\right\}\vee 1}\leq q\right\},\\ &\text{and reject }H_{0j}\text{ if }{\rm rLIS}_{j}\leq\lambda_{\rm OR}\text{ % for }j=1,2,\ldots,m.\end{aligned}start_ROW start_CELL end_CELL start_CELL italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT = roman_sup { italic_λ ≥ 0 : divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG { ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) } ∨ 1 end_ARG ≤ italic_q } , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and reject italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT if roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT for italic_j = 1 , 2 , … , italic_m . end_CELL end_ROW (2.8)

Let rLIS(1)rLIS(m)subscriptrLIS1subscriptrLIS𝑚{\rm rLIS}_{(1)}\leq\ldots\leq{\rm rLIS}_{(m)}roman_rLIS start_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT ≤ … ≤ roman_rLIS start_POSTSUBSCRIPT ( italic_m ) end_POSTSUBSCRIPT be the ordered rLISrLIS{\rm rLIS}roman_rLIS and H0(1),,H0(m)subscript𝐻01subscript𝐻0𝑚H_{0(1)},\ldots,H_{0(m)}italic_H start_POSTSUBSCRIPT 0 ( 1 ) end_POSTSUBSCRIPT , … , italic_H start_POSTSUBSCRIPT 0 ( italic_m ) end_POSTSUBSCRIPT be the corresponding hypotheses. Assume there are R𝑅Ritalic_R rejections, which means rLIS(R)λOR<rLIS(R+1)subscriptrLIS𝑅subscript𝜆ORsubscriptrLIS𝑅1{\rm rLIS}_{(R)}\leq\lambda_{\rm OR}<{\rm rLIS}_{(R+1)}roman_rLIS start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT < roman_rLIS start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT. Thus, the rejection criterion (2.8) is equivalent to the following step-up procedure

Let R=max{r:1rj=1rrLIS(j)q};Let 𝑅:𝑟1𝑟superscriptsubscript𝑗1𝑟subscriptrLIS𝑗𝑞\displaystyle\text{Let }R=\max\left\{r:\frac{1}{r}\sum_{j=1}^{r}{\rm rLIS}_{(j% )}\leq q\right\};Let italic_R = roman_max { italic_r : divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT roman_rLIS start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_q } ; (2.9)
then reject all H0(j) for j=1,,R.then reject all H0(j) for 𝑗1𝑅\displaystyle\text{then reject all $H_{0(j)}$ for }j=1,\dots,R.then reject all italic_H start_POSTSUBSCRIPT 0 ( italic_j ) end_POSTSUBSCRIPT for italic_j = 1 , … , italic_R .

We can write rLIS in terms of forward and backward probabilities. Specifically, we have

rLISj=sj=02αj(sj)βj(sj)sj=03αj(sj)βj(sj).subscriptrLIS𝑗superscriptsubscriptsubscript𝑠𝑗02subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗subscript𝑠𝑗superscriptsubscriptsubscript𝑠𝑗03subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗subscript𝑠𝑗{\rm rLIS}_{j}=\frac{\sum_{s_{j}=0}^{2}\alpha_{j}(s_{j})\beta_{j}(s_{j})}{\sum% _{s_{j}=0}^{3}\alpha_{j}(s_{j})\beta_{j}(s_{j})}.roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = divide start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG .

With the maximum likelihood estimator (2.4), we can estimate rLISj,j=1,,mformulae-sequencesubscriptrLISjj1m\rm{rLIS}_{j},j=1,\ldots,mroman_rLIS start_POSTSUBSCRIPT roman_j end_POSTSUBSCRIPT , roman_j = 1 , … , roman_m and plug the estimators in the step-up procedure (2.9). Its validity is shown in Theorem 3.2 in Section 3.

3 Theory

3.1 Notations and identifiability

Recall that sj=0,1,2,3subscript𝑠𝑗0123s_{j}=0,1,2,3italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 , 1 , 2 , 3 corresponds to (θ1j,θ2j)=(0,0),(0,1),(1,0),(1,1)subscript𝜃1𝑗subscript𝜃2𝑗00011011(\theta_{1j},\theta_{2j})=(0,0),(0,1),(1,0),(1,1)( italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) = ( 0 , 0 ) , ( 0 , 1 ) , ( 1 , 0 ) , ( 1 , 1 ) for j=1,,m𝑗1𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m, π=(π0,π1,π2,π3)𝜋subscript𝜋0subscript𝜋1subscript𝜋2subscript𝜋3\pi=(\pi_{0},\pi_{1},\pi_{2},\pi_{3})italic_π = ( italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) denotes the stationary probabilities of sjsubscript𝑠𝑗s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, and A=(akl)k,l=03𝐴superscriptsubscriptsubscript𝑎𝑘𝑙𝑘𝑙03A=(a_{kl})_{k,l=0}^{3}italic_A = ( italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k , italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT is its transition probability matrix. f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT denotes the probability density function of y1jsubscript𝑦1𝑗y_{1j}italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT given θ1j=1subscript𝜃1𝑗1\theta_{1j}=1italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT = 1 and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT denotes the probability density function of y2jsubscript𝑦2𝑗y_{2j}italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT given θ2j=1subscript𝜃2𝑗1\theta_{2j}=1italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT = 1, j=1,,m𝑗1𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m. Since the hidden Markov model is stationary, we have πA=π𝜋𝐴𝜋\pi A=\piitalic_π italic_A = italic_π, i.e., π𝜋\piitalic_π is the eigenvector of A𝐴Aitalic_A with the corresponding eigenvalue 1111. When rank(AI4)=3rank𝐴subscript𝐼43{\rm rank}(A-I_{4})=3roman_rank ( italic_A - italic_I start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) = 3, π𝜋\piitalic_π is uniquely determined by A𝐴Aitalic_A, where I4subscript𝐼4I_{4}italic_I start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT is a 4444-dimensional identity matrix. Let ΦΦ\Phiroman_Φ be the parameter space of ϕ=(π,A,f1,f2)italic-ϕ𝜋𝐴subscript𝑓1subscript𝑓2\phi=(\pi,A,f_{1},f_{2})italic_ϕ = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), where

Φ={ϕ=(π,A,f1,f2):\displaystyle\Phi=\bigg{\{}\phi=(\pi,A,f_{1},f_{2}):roman_Φ = { italic_ϕ = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) : πk(0,1),k=03πk=1;akl(0,1),formulae-sequencesubscript𝜋𝑘01formulae-sequencesuperscriptsubscript𝑘03subscript𝜋𝑘1subscript𝑎𝑘𝑙01\displaystyle~{}\pi_{k}\in(0,1),\sum_{k=0}^{3}\pi_{k}=1;a_{kl}\in(0,1),italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ ( 0 , 1 ) , ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 ; italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ∈ ( 0 , 1 ) ,
l=03akl=1, for k=0,1,2,3;πA=π;f1,f2},\displaystyle\sum_{l=0}^{3}a_{kl}=1,\text{ for }k=0,1,2,3;\pi A=\pi;f_{1},f_{2% }\in\mathcal{H}\bigg{\}},∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT = 1 , for italic_k = 0 , 1 , 2 , 3 ; italic_π italic_A = italic_π ; italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_H } ,

where \mathcal{H}caligraphic_H is a space of non-increasing probability density functions with support (0,1)01(0,1)( 0 , 1 ). For f1subscript𝑓1f_{1}\in\mathcal{H}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ caligraphic_H, let D𝐷Ditalic_D be the set of discontinuous points of f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. For any y0Dsubscript𝑦0𝐷y_{0}\in Ditalic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_D, let Ly0=limyy0f1(y)subscript𝐿subscript𝑦0subscript𝑦limit-fromsubscript𝑦0subscript𝑓1𝑦L_{y_{0}}=\lim_{y\rightarrow y_{0}-}f_{1}(y)italic_L start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT italic_y → italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y ) be the left limit of f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT at y0subscript𝑦0y_{0}italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and Ry0=limyy0+f1(y)subscript𝑅subscript𝑦0subscript𝑦limit-fromsubscript𝑦0subscript𝑓1𝑦R_{y_{0}}=\lim_{y\rightarrow y_{0}+}f_{1}(y)italic_R start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT italic_y → italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y ) be the right limit of f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT at y0subscript𝑦0y_{0}italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Since f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is non-increasing and y0subscript𝑦0y_{0}italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is a point of discontinuity, we know Ly0subscript𝐿subscript𝑦0L_{y_{0}}italic_L start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and Ry0subscript𝑅subscript𝑦0R_{y_{0}}italic_R start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT exist and Ly0>Ry0subscript𝐿subscript𝑦0subscript𝑅subscript𝑦0L_{y_{0}}>R_{y_{0}}italic_L start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT > italic_R start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Then there exists a rational number in the open interval (Ry0,Ly0)subscript𝑅subscript𝑦0subscript𝐿subscript𝑦0(R_{y_{0}},L_{y_{0}})( italic_R start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_L start_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). Thus, we can find an injection from rational numbers to D𝐷Ditalic_D, which means that D𝐷Ditalic_D, the set of discontinuous points of f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, has at most countable points. Consequently, f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is continuous almost everywhere. By the same token, f2subscript𝑓2f_{2}\in\mathcal{H}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_H is also continuous almost everywhere. For any ϕ=(π,A,f1,f2)italic-ϕ𝜋𝐴subscript𝑓1subscript𝑓2\phi=(\pi,A,f_{1},f_{2})italic_ϕ = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) and ϕ*=(π*,A*,f1*,f2*)superscriptitalic-ϕsuperscript𝜋superscript𝐴superscriptsubscript𝑓1superscriptsubscript𝑓2\phi^{*}=(\pi^{*},A^{*},f_{1}^{*},f_{2}^{*})italic_ϕ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = ( italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_A start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) in ΦΦ\Phiroman_Φ, denote the distance between them as

d(ϕ,ϕ*)=ππ*2+AA*F+dH(f1,f1*)+dH(f2,f2*),𝑑italic-ϕsuperscriptitalic-ϕsubscriptnorm𝜋superscript𝜋2subscriptnorm𝐴superscript𝐴𝐹subscript𝑑𝐻subscript𝑓1superscriptsubscript𝑓1subscript𝑑𝐻subscript𝑓2superscriptsubscript𝑓2d(\phi,\phi^{*})=\|\pi-\pi^{*}\|_{2}+\|A-A^{*}\|_{F}+d_{H}(f_{1},f_{1}^{*})+d_% {H}(f_{2},f_{2}^{*}),italic_d ( italic_ϕ , italic_ϕ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = ∥ italic_π - italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ∥ italic_A - italic_A start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) + italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) , (3.1)

where 2\|\cdot\|_{2}∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT denotes the L2subscript𝐿2L_{2}italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm of a vector, F\|\cdot\|_{F}∥ ⋅ ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT denotes the Frobenius norm of a matrix, and dH(,)subscript𝑑𝐻d_{H}(\cdot,\cdot)italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( ⋅ , ⋅ ) denotes the Hellinger distance of two density functions, where dH(g1,g2)2=2101{g1(y)1/2g2(y)1/2}2dy.subscript𝑑𝐻superscriptsubscript𝑔1subscript𝑔22superscript21superscriptsubscript01superscriptsubscript𝑔1superscript𝑦12subscript𝑔2superscript𝑦122differential-d𝑦d_{H}(g_{1},g_{2})^{2}=2^{-1}\int_{0}^{1}\left\{g_{1}(y)^{1/2}-g_{2}(y)^{1/2}% \right\}^{2}{\rm d}y.italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 2 start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT { italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT - italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_d italic_y . Under the distance d(,)𝑑d(\cdot,\cdot)italic_d ( ⋅ , ⋅ ) in (3.1), we can obtain identifiability of elements in ΦΦ\Phiroman_Φ. Identifiability means that d(ϕ,ϕ*)=0𝑑italic-ϕsuperscriptitalic-ϕ0d(\phi,\phi^{*})=0italic_d ( italic_ϕ , italic_ϕ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = 0 implies ϕ=ϕ*italic-ϕsuperscriptitalic-ϕ\phi=\phi^{*}italic_ϕ = italic_ϕ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT almost everywhere, where ϕ=(π,A,f1,f2)italic-ϕ𝜋𝐴subscript𝑓1subscript𝑓2\phi=(\pi,A,f_{1},f_{2})italic_ϕ = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) and ϕ*=(π*,A*,f1*,f2*)superscriptitalic-ϕsuperscript𝜋superscript𝐴superscriptsubscript𝑓1superscriptsubscript𝑓2\phi^{*}=(\pi^{*},A^{*},f_{1}^{*},f_{2}^{*})italic_ϕ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = ( italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_A start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). When d(ϕ,ϕ*)=0𝑑italic-ϕsuperscriptitalic-ϕ0d(\phi,\phi^{*})=0italic_d ( italic_ϕ , italic_ϕ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = 0, it follows that π=π*𝜋superscript𝜋\pi=\pi^{*}italic_π = italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, A=A*𝐴superscript𝐴A=A^{*}italic_A = italic_A start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, dH(f1,f1*)=0subscript𝑑𝐻subscript𝑓1superscriptsubscript𝑓10d_{H}(f_{1},f_{1}^{*})=0italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = 0 and dH(f2,f2*)=0subscript𝑑𝐻subscript𝑓2superscriptsubscript𝑓20d_{H}(f_{2},f_{2}^{*})=0italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = 0. Moreover, dH(f1,f1*)=0subscript𝑑𝐻subscript𝑓1superscriptsubscript𝑓10d_{H}(f_{1},f_{1}^{*})=0italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = 0 implies that f1=f1*subscript𝑓1superscriptsubscript𝑓1f_{1}=f_{1}^{*}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT almost everywhere. By the same token, dH(f2,f2*)=0subscript𝑑𝐻subscript𝑓2superscriptsubscript𝑓20d_{H}(f_{2},f_{2}^{*})=0italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = 0 implies f2=f2*subscript𝑓2superscriptsubscript𝑓2f_{2}=f_{2}^{*}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT almost everywhere.

Proposition 1.

ΦΦ\Phiroman_Φ is compact with respect to distance d(,)𝑑normal-⋅normal-⋅d(\cdot,\cdot)italic_d ( ⋅ , ⋅ ) defined in (3.1).

The proof of Proposition 1 is in the Supplementary Materials. This result is needed in the consistency proof of unknown parameters and unknown functions in the next subsection.

3.2 Consistency of the maximum likelihood estimation

We need the following conditions to show the consistency of the maximum likelihood estimator ϕ^msubscript^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT and asymptotic false discovery rate control. (C1) The transition probability matrix A(ϕ0)𝐴subscriptitalic-ϕ0A(\phi_{0})italic_A ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) is irreducible and ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, the true parameter, is an interior point of ΦΦ\Phiroman_Φ. (C2) There exist δ0>0subscript𝛿00\delta_{0}>0italic_δ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0 and 0<ε01/40subscript𝜀0140<\varepsilon_{0}\leq 1/40 < italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≤ 1 / 4 such that for any ϕitalic-ϕ\phiitalic_ϕ satisfying d(ϕ,ϕ0)<δ0𝑑italic-ϕsubscriptitalic-ϕ0subscript𝛿0d(\phi,\phi_{0})<\delta_{0}italic_d ( italic_ϕ , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) < italic_δ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we have πk(ϕ)ε0subscript𝜋𝑘italic-ϕsubscript𝜀0\pi_{k}(\phi)\geq\varepsilon_{0}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_ϕ ) ≥ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, π3(ϕ)<1qsubscript𝜋3italic-ϕ1𝑞\pi_{3}(\phi)<1-qitalic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_ϕ ) < 1 - italic_q and akl(ϕ)ε0subscript𝑎𝑘𝑙italic-ϕsubscript𝜀0a_{kl}(\phi)\geq\varepsilon_{0}italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ( italic_ϕ ) ≥ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for k,l=0,1,2,3formulae-sequence𝑘𝑙0123k,l=0,1,2,3italic_k , italic_l = 0 , 1 , 2 , 3, where q𝑞qitalic_q is a pre-specified false discovery rate level. Moreover, for

c=c(ε0,q)=12ε0+(12ε0)2+4(13ε0)ε03q/(2q)2ε03q/(2q),𝑐𝑐subscript𝜀0𝑞12subscript𝜀0superscript12subscript𝜀02413subscript𝜀0superscriptsubscript𝜀03𝑞2𝑞2superscriptsubscript𝜀03𝑞2𝑞\displaystyle c=c(\varepsilon_{0},q)=\frac{1-2\varepsilon_{0}+\sqrt{(1-2% \varepsilon_{0})^{2}+4(1-3\varepsilon_{0})\varepsilon_{0}^{3}q/(2-q)}}{2% \varepsilon_{0}^{3}q/(2-q)},italic_c = italic_c ( italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q ) = divide start_ARG 1 - 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + square-root start_ARG ( 1 - 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 ( 1 - 3 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_q / ( 2 - italic_q ) end_ARG end_ARG start_ARG 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_q / ( 2 - italic_q ) end_ARG ,

we have

limy0f1(y)>c,limy0f2(y)>c.formulae-sequencesubscript𝑦0subscript𝑓1𝑦𝑐subscript𝑦0subscript𝑓2𝑦𝑐\displaystyle\lim_{y\rightarrow 0}f_{1}(y)>c,\quad\lim_{y\rightarrow 0}f_{2}(y% )>c.roman_lim start_POSTSUBSCRIPT italic_y → 0 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y ) > italic_c , roman_lim start_POSTSUBSCRIPT italic_y → 0 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y ) > italic_c .

(C3) Eϕ0{|logf(k)(Y11,Y21;ϕ0)|}<,k=0,1,2,3formulae-sequencesubscript𝐸subscriptitalic-ϕ0superscript𝑓𝑘subscript𝑌11subscript𝑌21subscriptitalic-ϕ0𝑘0123E_{\phi_{0}}\left\{\left|\log f^{(k)}\left(Y_{11},Y_{21};\phi_{0}\right)\right% |\right\}<\infty,k=0,1,2,3italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { | roman_log italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_Y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ; italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) | } < ∞ , italic_k = 0 , 1 , 2 , 3. (C4) Eϕ0[supd(ϕ0,ϕ)<δ1{logf(k)(Y11,Y21;ϕ)}+]<subscript𝐸subscriptitalic-ϕ0delimited-[]subscriptsupremum𝑑subscriptitalic-ϕ0italic-ϕsubscript𝛿1superscriptsuperscript𝑓𝑘subscript𝑌11subscript𝑌21italic-ϕE_{\phi_{0}}\left[\sup_{d(\phi_{0},\phi)<\delta_{1}}\left\{\log f^{(k)}\left(Y% _{11},Y_{21};\phi\right)\right\}^{+}\right]<\inftyitalic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_sup start_POSTSUBSCRIPT italic_d ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) < italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_Y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ; italic_ϕ ) } start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ] < ∞ for some δ1>0subscript𝛿10\delta_{1}>0italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0 and all k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3, where x+=max{x,0}superscript𝑥𝑥0x^{+}=\max\{x,0\}italic_x start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT = roman_max { italic_x , 0 }. (C5) There exists δ2>0subscript𝛿20\delta_{2}>0italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0 such that Pϕ0{ρ0(Y11,Y21)<s1=k}>0subscript𝑃subscriptitalic-ϕ0subscript𝜌0subscript𝑌11subscript𝑌21brasubscript𝑠1𝑘0P_{\phi_{0}}\{\rho_{0}(Y_{11},Y_{21})<\infty\mid s_{1}=k\}>0italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_Y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ) < ∞ ∣ italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_k } > 0 for all k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3, where

ρ0(y1,y2)=supd(ϕ,ϕ0)<δ2max0k,k3{f(k)(y1,y2;ϕ)f(k)(y1,y2;ϕ)}.subscript𝜌0subscript𝑦1subscript𝑦2subscriptsupremum𝑑italic-ϕsubscriptitalic-ϕ0subscript𝛿2subscriptformulae-sequence0𝑘superscript𝑘3superscript𝑓𝑘subscript𝑦1subscript𝑦2italic-ϕsuperscript𝑓superscript𝑘subscript𝑦1subscript𝑦2italic-ϕ\displaystyle\rho_{0}(y_{1},y_{2})=\sup_{d(\phi,\phi_{0})<\delta_{2}}\max_{0% \leq k,k^{\prime}\leq 3}\left\{\frac{f^{(k)}(y_{1},y_{2};\phi)}{f^{(k^{\prime}% )}(y_{1},y_{2};\phi)}\right\}.italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = roman_sup start_POSTSUBSCRIPT italic_d ( italic_ϕ , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) < italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT 0 ≤ italic_k , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≤ 3 end_POSTSUBSCRIPT { divide start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG } .

(C1) ensures that the hidden states (sj)j=1msuperscriptsubscriptsubscript𝑠𝑗𝑗1𝑚(s_{j})_{j=1}^{m}( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT are irreducible, which guarantees that the distribution of the observed data (Y1j,Y2j)j=1msuperscriptsubscriptsubscript𝑌1𝑗subscript𝑌2𝑗𝑗1𝑚(Y_{1j},Y_{2j})_{j=1}^{m}( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is ergodic. Since ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is an interior point of ΦΦ\Phiroman_Φ, the compactness of ΦΦ\Phiroman_Φ can be used to show that ϕ^mϕ0subscript^italic-ϕ𝑚subscriptitalic-ϕ0\hat{\phi}_{m}\rightarrow\phi_{0}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT → italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in probability as m𝑚m\rightarrow\inftyitalic_m → ∞. (C2) bounds the stationary probabilities and elements in the transition probabilities away from 0 when ϕitalic-ϕ\phiitalic_ϕ is close to ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. In addition, it requires a lower bound on the non-null probability density function of p𝑝pitalic_p-values near 0,00,0 , which is mild for a non-increasing probability density function in (0,1).01(0,1).( 0 , 1 ) . (C3) is a technical assumption (Leroux, 1992). (C4) guarantees the existence of generalized Kullback–Leibler divergence of two distributions indexed by ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and ϕ,italic-ϕ\phi,italic_ϕ , where ϕitalic-ϕ\phiitalic_ϕ is in a small neighborhood of ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. (C5) requires that the ratio of density functions for any two hidden states is finite with positive probability.

Theorem 3.1.

Assume (C1)-(C5) hold. Let ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT be the true parameter value and ϕ^msubscriptnormal-^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT be the maximum likelihood estimator defined in (2.4). Then ϕ^msubscriptnormal-^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT converges to ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in probability.

The proof of Theorem 3.1 is relegated in the Supplementary Material. Denote the shift operator of a hidden Markov model as 𝒯(Y1j,Y2j)=(Y1,j+1,Y2,j+1)𝒯subscript𝑌1𝑗subscript𝑌2𝑗subscript𝑌1𝑗1subscript𝑌2𝑗1\mathcal{T}(Y_{1j},Y_{2j})=(Y_{1,j+1},Y_{2,j+1})caligraphic_T ( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) = ( italic_Y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ). Let \mathcal{I}caligraphic_I be any shift-invariant set, which means (Y1j,Y2j)subscript𝑌1𝑗subscript𝑌2𝑗(Y_{1j},Y_{2j})\in\mathcal{I}( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) ∈ caligraphic_I if and only if 𝒯(Y1j,Y2j)𝒯subscript𝑌1𝑗subscript𝑌2𝑗\mathcal{T}(Y_{1j},Y_{2j})\in\mathcal{I}caligraphic_T ( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) ∈ caligraphic_I. The distribution of (Y1j,Y2j)superscriptsubscriptsubscript𝑌1𝑗subscript𝑌2𝑗(Y_{1j},Y_{2j})_{-\infty}^{\infty}( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT is defined to be ergodic if P((Y1j,Y2j))=0𝑃subscript𝑌1𝑗subscript𝑌2𝑗0P((Y_{1j},Y_{2j})\in\mathcal{I})=0italic_P ( ( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) ∈ caligraphic_I ) = 0 or 1111 for any shift-invariant set \mathcal{I}caligraphic_I. Lemma 1 in Leroux (1992) shows that under (C1), (Y1j,Y2j)superscriptsubscriptsubscript𝑌1𝑗subscript𝑌2𝑗(Y_{1j},Y_{2j})_{-\infty}^{\infty}( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT are ergodic. Therefore, we can apply Birkhoff’s ergodic theorem (Birkhoff, 1931): For any g:[0,1]2R:𝑔superscript012𝑅g:[0,1]^{2}\rightarrow Ritalic_g : [ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → italic_R, where Eϕ0g(Y11,Y21)subscript𝐸subscriptitalic-ϕ0𝑔subscript𝑌11subscript𝑌21E_{\phi_{0}}g(Y_{11},Y_{21})italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g ( italic_Y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ) exists, we have

1mj=1mg(Y1j,Y2j)Eϕ0g(Y11,Y21)almost surely as m.formulae-sequence1𝑚superscriptsubscript𝑗1𝑚𝑔subscript𝑌1𝑗subscript𝑌2𝑗subscript𝐸subscriptitalic-ϕ0𝑔subscript𝑌11subscript𝑌21almost surely as 𝑚\displaystyle\frac{1}{m}\sum_{j=1}^{m}g(Y_{1j},Y_{2j})\rightarrow E_{\phi_{0}}% g(Y_{11},Y_{21})\quad\text{almost surely as }m\rightarrow\infty.divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_g ( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) → italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g ( italic_Y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ) almost surely as italic_m → ∞ .

Theorem 2 in Leroux (1992) gives the definitions of entropy and generalized Kullback–Leibler divergence when the probability density functions under different hidden states are parametric under the hidden Markov model. We extend their results to the setting where the probability density functions under different hidden states are estimated nonparametrically. Specifically, assume (C1) and (C4) hold. For ϕΦitalic-ϕΦ\phi\in\Phiitalic_ϕ ∈ roman_Φ, there is a constant H(ϕ0,ϕ)[,)𝐻subscriptitalic-ϕ0italic-ϕH(\phi_{0},\phi)\in[-\infty,\infty)italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) ∈ [ - ∞ , ∞ ) such that

  1. 1.

    m1Eϕ0[logpm({Y1j,Y2j}j=1m;ϕ)]H(ϕ0,ϕ)superscript𝑚1subscript𝐸subscriptitalic-ϕ0delimited-[]subscript𝑝𝑚superscriptsubscriptsubscript𝑌1𝑗subscript𝑌2𝑗𝑗1𝑚italic-ϕ𝐻subscriptitalic-ϕ0italic-ϕm^{-1}E_{\phi_{0}}[\log p_{m}(\{Y_{1j},Y_{2j}\}_{j=1}^{m};\phi)]\to H(\phi_{0}% ,\phi)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( { italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ; italic_ϕ ) ] → italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) as m𝑚m\rightarrow\inftyitalic_m → ∞;

  2. 2.

    m1logpm({Y1j,Y2j}j=1m;ϕ)H(ϕ0,ϕ)superscript𝑚1subscript𝑝𝑚superscriptsubscriptsubscript𝑌1𝑗subscript𝑌2𝑗𝑗1𝑚italic-ϕ𝐻subscriptitalic-ϕ0italic-ϕm^{-1}\log p_{m}(\{Y_{1j},Y_{2j}\}_{j=1}^{m};\phi)\to H(\phi_{0},\phi)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( { italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ; italic_ϕ ) → italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) almost surely as m𝑚m\rightarrow\inftyitalic_m → ∞.

Moreover, Assume (C1) - (C4) hold. For every ϕΦitalic-ϕΦ\phi\in\Phiitalic_ϕ ∈ roman_Φ, K(ϕ0,ϕ)=H(ϕ0,ϕ0)H(ϕ0,ϕ)0𝐾subscriptitalic-ϕ0italic-ϕ𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0𝐻subscriptitalic-ϕ0italic-ϕ0K(\phi_{0},\phi)=H(\phi_{0},\phi_{0})-H(\phi_{0},\phi)\geq 0italic_K ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) = italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) ≥ 0. If ϕϕ0italic-ϕsubscriptitalic-ϕ0\phi\neq\phi_{0}italic_ϕ ≠ italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, K(ϕ0,ϕ)>0𝐾subscriptitalic-ϕ0italic-ϕ0K(\phi_{0},\phi)>0italic_K ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) > 0. With the compactness of the parameter space ΦΦ\Phiroman_Φ and non-negativeness of the generalized Kullback–Leibler divergence K(ϕ0,ϕ)𝐾subscriptitalic-ϕ0italic-ϕK(\phi_{0},\phi)italic_K ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ), we obtain the consistency of the maximum likelihood estimator in Theorem 3.1.

3.3 Asymptotic false discovery rate control

With the maximum likelihood estimator ϕ^m=(ϕ^,A^,f^1,f^2)subscript^italic-ϕ𝑚^italic-ϕ^𝐴subscript^𝑓1subscript^𝑓2\hat{\phi}_{m}=(\hat{\phi},\hat{A},\hat{f}_{1},\hat{f}_{2})over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( over^ start_ARG italic_ϕ end_ARG , over^ start_ARG italic_A end_ARG , over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), we can estimate the forward probabilities by α^j(sj)=Pϕ^m((y1t,y2t)t=1j,sj)subscript^𝛼𝑗subscript𝑠𝑗subscript𝑃subscript^italic-ϕ𝑚superscriptsubscriptsubscript𝑦1𝑡subscript𝑦2𝑡𝑡1𝑗subscript𝑠𝑗\hat{\alpha}_{j}(s_{j})=P_{\hat{\phi}_{m}}\left((y_{1t},y_{2t})_{t=1}^{j},s_{j% }\right)over^ start_ARG italic_α end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) and the backward probabilities by β^j(sj)=Pϕ^m((y1t,y2t)t=j+1msj)subscript^𝛽𝑗subscript𝑠𝑗subscript𝑃subscript^italic-ϕ𝑚conditionalsuperscriptsubscriptsubscript𝑦1𝑡subscript𝑦2𝑡𝑡𝑗1𝑚subscript𝑠𝑗\hat{\beta}_{j}(s_{j})=P_{\hat{\phi}_{m}}\left((y_{1t},y_{2t})_{t=j+1}^{m}\mid s% _{j}\right)over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) for j=1,,m𝑗1𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m and sj=0,1,2,3.subscript𝑠𝑗0123s_{j}=0,1,2,3.italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 , 1 , 2 , 3 . The estimates can be obtained as follows.

α^1(s1)=subscript^𝛼1subscript𝑠1absent\displaystyle\hat{\alpha}_{1}(s_{1})=over^ start_ARG italic_α end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = π^s1f^(s1)(y11,y21),β^m(sm)=1,subscript^𝜋subscript𝑠1superscript^𝑓subscript𝑠1subscript𝑦11subscript𝑦21subscript^𝛽𝑚subscript𝑠𝑚1\displaystyle\hat{\pi}_{s_{1}}\hat{f}^{(s_{1})}(y_{11},y_{21}),\quad\quad\hat{% \beta}_{m}(s_{m})=1,over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_f end_ARG start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ) , over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) = 1 ,
α^j+1(sj+1)=subscript^𝛼𝑗1subscript𝑠𝑗1absent\displaystyle\hat{\alpha}_{j+1}(s_{j+1})=over^ start_ARG italic_α end_ARG start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) = sj=03α^j(sj)a^sj,sj+1f^(sj+1)(y1,j+1,y2,j+1),j=1,,m1, andformulae-sequencesuperscriptsubscriptsubscript𝑠𝑗03subscript^𝛼𝑗subscript𝑠𝑗subscript^𝑎subscript𝑠𝑗subscript𝑠𝑗1superscript^𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1𝑗1𝑚1 and\displaystyle\sum_{s_{j}=0}^{3}\hat{\alpha}_{j}(s_{j})\hat{a}_{s_{j},s_{j+1}}% \hat{f}^{(s_{j+1})}(y_{1,j+1},y_{2,j+1}),\quad j=1,\ldots,m-1,\text{ and }∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT over^ start_ARG italic_α end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) over^ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_f end_ARG start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) , italic_j = 1 , … , italic_m - 1 , and
β^j(sj)=subscript^𝛽𝑗subscript𝑠𝑗absent\displaystyle\hat{\beta}_{j}(s_{j})=over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = sj+1=03β^j+1(sj+1)a^sj,sj+1f^(sj+1)(y1,j+1,y2,j+1),j=1,,m1.formulae-sequencesuperscriptsubscriptsubscript𝑠𝑗103subscript^𝛽𝑗1subscript𝑠𝑗1subscript^𝑎subscript𝑠𝑗subscript𝑠𝑗1superscript^𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1𝑗1𝑚1\displaystyle\sum_{s_{j+1}=0}^{3}\hat{\beta}_{j+1}(s_{j+1})\hat{a}_{s_{j},s_{j% +1}}\hat{f}^{(s_{j+1})}(y_{1,j+1},y_{2,j+1}),\quad j=1,\ldots,m-1.∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) over^ start_ARG italic_a end_ARG start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_f end_ARG start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) , italic_j = 1 , … , italic_m - 1 .

We calculate the test statistic

rLIS^j=subscript^rLIS𝑗absent\displaystyle\widehat{\mathrm{rLIS}}_{j}=over^ start_ARG roman_rLIS end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = Pϕ^m(sj{0,1,2}(y1j,y2j)j=1m)subscript𝑃subscript^italic-ϕ𝑚subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚\displaystyle P_{\hat{\phi}_{m}}\left(s_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{j=% 1}^{m}\right)italic_P start_POSTSUBSCRIPT over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT )
=\displaystyle== sj=02α^j(sj)β^j(sj)sj=03α^j(sj)β^j(sj).superscriptsubscriptsubscript𝑠𝑗02subscript^𝛼𝑗subscript𝑠𝑗subscript^𝛽𝑗subscript𝑠𝑗superscriptsubscriptsubscript𝑠𝑗03subscript^𝛼𝑗subscript𝑠𝑗subscript^𝛽𝑗subscript𝑠𝑗\displaystyle\frac{\sum_{s_{j}=0}^{2}\hat{\alpha}_{j}(s_{j})\hat{\beta}_{j}(s_% {j})}{\sum_{s_{j}=0}^{3}\hat{\alpha}_{j}(s_{j})\hat{\beta}_{j}(s_{j})}.divide start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over^ start_ARG italic_α end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT over^ start_ARG italic_α end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG . (3.2)

Order the test statistics as rLIS^(1)rLIS^(m)subscript^rLIS1subscript^rLIS𝑚\widehat{\mathrm{rLIS}}_{(1)}\leq\ldots\leq\widehat{\mathrm{rLIS}}_{(m)}over^ start_ARG roman_rLIS end_ARG start_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT ≤ … ≤ over^ start_ARG roman_rLIS end_ARG start_POSTSUBSCRIPT ( italic_m ) end_POSTSUBSCRIPT with corresponding replicability null hypotheses H0(1),,H0(m)subscript𝐻01subscript𝐻0𝑚H_{0(1)},\ldots,H_{0(m)}italic_H start_POSTSUBSCRIPT 0 ( 1 ) end_POSTSUBSCRIPT , … , italic_H start_POSTSUBSCRIPT 0 ( italic_m ) end_POSTSUBSCRIPT. For a pre-specified false discovery rate level q(0,1)𝑞01q\in(0,1)italic_q ∈ ( 0 , 1 ), we have the following procedure.

R^=max{r:1rj=1rrLIS^(j)q},and reject H0(j) for j=1,,R^.^𝑅absent:𝑟1𝑟superscriptsubscript𝑗1𝑟subscript^rLIS𝑗𝑞missing-subexpressionand reject subscript𝐻0𝑗 for 𝑗1^𝑅\displaystyle\begin{aligned} \hat{R}=&\max\left\{r:\frac{1}{r}\sum_{j=1}^{r}% \widehat{\mathrm{rLIS}}_{(j)}\leq q\right\},\\ &\text{and reject }H_{0(j)}\text{ for }j=1,\ldots,\hat{R}.\end{aligned}start_ROW start_CELL over^ start_ARG italic_R end_ARG = end_CELL start_CELL roman_max { italic_r : divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT over^ start_ARG roman_rLIS end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_q } , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and reject italic_H start_POSTSUBSCRIPT 0 ( italic_j ) end_POSTSUBSCRIPT for italic_j = 1 , … , over^ start_ARG italic_R end_ARG . end_CELL end_ROW (3.3)

Theorem 3.2 shows asymptotic false discovery rate control at level q𝑞qitalic_q with the maximum likelihood estimator ϕ^msubscript^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT specified in (2.4) and the testing procedure (3.3).

Theorem 3.2.

Assume the paired p𝑝pitalic_p-values and the joint hidden states (Y1j,Y2j,Sj)j=1msuperscriptsubscriptsubscript𝑌1𝑗subscript𝑌2𝑗subscript𝑆𝑗𝑗1𝑚(Y_{1j},Y_{2j},S_{j})_{j=1}^{m}( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT follow a four dimensional hidden Markov model defined with true parameter ϕ0=(π,A,f1,f2)subscriptitalic-ϕ0𝜋𝐴subscript𝑓1subscript𝑓2\phi_{0}=(\pi,A,f_{1},f_{2})italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( italic_π , italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). Conditional on the hidden states, the paired p𝑝pitalic_p-values (y1j,y2j)j=1msuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚(y_{1j},y_{2j})_{j=1}^{m}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT follow a mixture model specified in (2.2). Assume the probability density function of null p𝑝pitalic_p-values is a standard uniform distribution and (2.3) holds. If (C1)-(C5) hold, procedure (3.3) controls the false discovery rate asymptotically at level q𝑞qitalic_q.

The proof of Theorem 3.2 is relegated in the Supplementary Material. Combining Theorem 3.1 and Theorem 3.2, we provide an automatic and self-contained replicability analysis of high dimensional data accounting for dependence without any tuning parameters.

4 Simulations

We conduct simulation studies to evaluate the finite sample performance of the proposed method in terms of false discovery rate control and power comparison. The data generating process is as follows. We set the total number of tests m=10,000.𝑚10000m=10,000.italic_m = 10 , 000 . We first simulate a Markov chain (sj)j=1msuperscriptsubscriptsubscript𝑠𝑗𝑗1𝑚(s_{j})_{j=1}^{m}( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT with the stationary probability π=(π0,π1,π2,π3)𝜋subscript𝜋0subscript𝜋1subscript𝜋2subscript𝜋3\pi=(\pi_{0},\pi_{1},\pi_{2},\pi_{3})italic_π = ( italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ). For π=(0.7,0.1,0.1,0.1)𝜋0.70.10.10.1\pi=(0.7,0.1,0.1,0.1)italic_π = ( 0.7 , 0.1 , 0.1 , 0.1 ) and (0.6,0.15,0.15,0.1)0.60.150.150.1(0.6,0.15,0.15,0.1)( 0.6 , 0.15 , 0.15 , 0.1 ), we have the corresponding transition probability matrices

A=(0.9050.0320.0320.0320.2220.3330.2220.2220.2220.2220.3330.2220.2220.2220.2220.333) and (0.8890.0370.0370.0370.1480.5560.1480.1480.1480.1480.5560.1480.2220.2220.2220.333),𝐴matrix0.9050.0320.0320.0320.2220.3330.2220.2220.2220.2220.3330.2220.2220.2220.2220.333 and matrix0.8890.0370.0370.0370.1480.5560.1480.1480.1480.1480.5560.1480.2220.2220.2220.333\displaystyle A=\begin{pmatrix}0.905&0.032&0.032&0.032\\ 0.222&0.333&0.222&0.222\\ 0.222&0.222&0.333&0.222\\ 0.222&0.222&0.222&0.333\end{pmatrix}\text{ and }\begin{pmatrix}0.889&0.037&0.0% 37&0.037\\ 0.148&0.556&0.148&0.148\\ 0.148&0.148&0.556&0.148\\ 0.222&0.222&0.222&0.333\\ \end{pmatrix},italic_A = ( start_ARG start_ROW start_CELL 0.905 end_CELL start_CELL 0.032 end_CELL start_CELL 0.032 end_CELL start_CELL 0.032 end_CELL end_ROW start_ROW start_CELL 0.222 end_CELL start_CELL 0.333 end_CELL start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL end_ROW start_ROW start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.333 end_CELL start_CELL 0.222 end_CELL end_ROW start_ROW start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.333 end_CELL end_ROW end_ARG ) and ( start_ARG start_ROW start_CELL 0.889 end_CELL start_CELL 0.037 end_CELL start_CELL 0.037 end_CELL start_CELL 0.037 end_CELL end_ROW start_ROW start_CELL 0.148 end_CELL start_CELL 0.556 end_CELL start_CELL 0.148 end_CELL start_CELL 0.148 end_CELL end_ROW start_ROW start_CELL 0.148 end_CELL start_CELL 0.148 end_CELL start_CELL 0.556 end_CELL start_CELL 0.148 end_CELL end_ROW start_ROW start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.333 end_CELL end_ROW end_ARG ) ,

where A=(akl)k,l=03𝐴superscriptsubscriptsubscript𝑎𝑘𝑙𝑘𝑙03A=(a_{kl})_{k,l=0}^{3}italic_A = ( italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k , italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT and the stationary property πA=π𝜋𝐴𝜋\pi A=\piitalic_π italic_A = italic_π holds. We generate s1subscript𝑠1s_{1}italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT from the stationary probability as P(s1=k)=πk𝑃subscript𝑠1𝑘subscript𝜋𝑘P(s_{1}=k)=\pi_{k}italic_P ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_k ) = italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3. For each j=2,,m𝑗2𝑚j=2,\ldots,mitalic_j = 2 , … , italic_m, we generate sjsubscript𝑠𝑗s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT based sj1subscript𝑠𝑗1s_{j-1}italic_s start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT as P(sj=lsj1=k)=akl𝑃subscript𝑠𝑗conditional𝑙subscript𝑠𝑗1𝑘subscript𝑎𝑘𝑙P(s_{j}=l\mid s_{j-1}=k)=a_{kl}italic_P ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l ∣ italic_s start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT = italic_k ) = italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT for k,l=0,1,2,3formulae-sequence𝑘𝑙0123k,l=0,1,2,3italic_k , italic_l = 0 , 1 , 2 , 3. Based on the definition of sjsubscript𝑠𝑗s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, the hidden states of two studies, θij,i=1,2formulae-sequencesubscript𝜃𝑖𝑗𝑖12\theta_{ij},i=1,2italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT , italic_i = 1 , 2, can be obtained from the value of sjsubscript𝑠𝑗s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, j=1,,m𝑗1𝑚j=1,\dots,mitalic_j = 1 , … , italic_m. Let N(μ,σ2)𝑁𝜇superscript𝜎2N(\mu,\sigma^{2})italic_N ( italic_μ , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) denote the normal distribution with mean μ𝜇\muitalic_μ and variance σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. For simplicity, we directly simulate observed z𝑧zitalic_z-values Xijsubscript𝑋𝑖𝑗X_{ij}italic_X start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT conditional on θijsubscript𝜃𝑖𝑗\theta_{ij}italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT for i=1,2𝑖12i=1,2italic_i = 1 , 2 and j=1,,m𝑗1𝑚j=1,\dots,mitalic_j = 1 , … , italic_m. Specifically, Xijθij(1θij)N(0,1)+θijN(μi,1)similar-toconditionalsubscript𝑋𝑖𝑗subscript𝜃𝑖𝑗1subscript𝜃𝑖𝑗𝑁01subscript𝜃𝑖𝑗𝑁subscript𝜇𝑖1X_{ij}\mid\theta_{ij}\sim(1-\theta_{ij})N(0,1)+\theta_{ij}N(\mu_{i},1)italic_X start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∣ italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∼ ( 1 - italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) italic_N ( 0 , 1 ) + italic_θ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_N ( italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 1 ), where μisubscript𝜇𝑖\mu_{i}italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denotes the signal strength in study i𝑖iitalic_i. One-sided p𝑝pitalic_p-values are calculated by yij=Xij(2π)1/2exp{t2/2}dtsubscript𝑦𝑖𝑗superscriptsubscriptsubscript𝑋𝑖𝑗superscript2𝜋12superscript𝑡22differential-d𝑡y_{ij}=\int_{X_{ij}}^{\infty}(2\pi)^{-1/2}\exp\{-t^{2}/2\}{\rm d}titalic_y start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = ∫ start_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( 2 italic_π ) start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT roman_exp { - italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 } roman_d italic_t. We compare the proposed method with replicability analysis methods that do not account for the dependence structure among hypotheses, including the ad hoc BH, MaxP, radjust, JUMP, and STAREG. Detailed descriptions of these methods are provided in the Supplementary Materials. For each setting, the simulations are repeated 100100100100 times to calculate the empirical false discovery rate and statistical power of different methods with nominal false discovery rate level q=0.05𝑞0.05q=0.05italic_q = 0.05. We first set π3=0.1subscript𝜋30.1\pi_{3}=0.1italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 0.1 and let π1=π2subscript𝜋1subscript𝜋2\pi_{1}=\pi_{2}italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. We vary π1subscript𝜋1\pi_{1}italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and the signal strengths μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and μ2subscript𝜇2\mu_{2}italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT to evaluate the false discovery rate and power of different methods in different settings. The results are summarized in Figure 1. We observe that ad hoc BH cannot control the false discovery rate, and we exclude it in the power comparison. All other methods have valid false discovery rate control. MaxP, radjust, JUMP are too conservative, and have low power. Our procedure has higher power than the other methods across all settings. The power gain is especially pronounced in the challenging weak signal scenario. The power increases with increased signal strengths for all methods.

Refer to caption
Figure 1: FDR control (left) and power (power) of different methods.

Figure 2 presents the FDR control and power comparison of different methods with varied FDR levels where m=10,000,π=(0.7,0.1,0.1,0.1),μ1=μ2=1.5,σ1=σ2=1formulae-sequenceformulae-sequence𝑚10000formulae-sequence𝜋0.70.10.10.1subscript𝜇1subscript𝜇21.5subscript𝜎1subscript𝜎21m=10,000,\pi=(0.7,0.1,0.1,0.1),\mu_{1}=\mu_{2}=1.5,\sigma_{1}=\sigma_{2}=1italic_m = 10 , 000 , italic_π = ( 0.7 , 0.1 , 0.1 , 0.1 ) , italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1.5 , italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1 and

A=(0.9050.0320.0320.0320.2220.3330.2220.2220.2220.2220.3330.2220.2220.2220.2220.333).𝐴matrix0.9050.0320.0320.0320.2220.3330.2220.2220.2220.2220.3330.2220.2220.2220.2220.333\displaystyle A=\begin{pmatrix}0.905&0.032&0.032&0.032\\ 0.222&0.333&0.222&0.222\\ 0.222&0.222&0.333&0.222\\ 0.222&0.222&0.222&0.333\end{pmatrix}.italic_A = ( start_ARG start_ROW start_CELL 0.905 end_CELL start_CELL 0.032 end_CELL start_CELL 0.032 end_CELL start_CELL 0.032 end_CELL end_ROW start_ROW start_CELL 0.222 end_CELL start_CELL 0.333 end_CELL start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL end_ROW start_ROW start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.333 end_CELL start_CELL 0.222 end_CELL end_ROW start_ROW start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.222 end_CELL start_CELL 0.333 end_CELL end_ROW end_ARG ) .

We examine the performance of different methods with nominal FDR ranging from 0.0010.0010.0010.001 to 0.20.20.20.2. We use the diagonal lines with slope 1111 as references. Our procedure and STAREG can control FDR while ad hoc BH, MaxP, radjust and JUMP are too conservative. Our procedure has the highest power.

Refer to caption
Figure 2: FDR control (left) and power (right) at different nominal levels

5 Data analysis

We illustrate our method by analyzing two GWAS datasets (Morris et al., 2012). A sex-differentiated meta-analysis was performed to test for the association of SNPs with type 2 diabetes. Type 2 diabetes occurs when blood glucose is too high. It affected approximately 329 million individuals in 2015 (Lipton et al., 2016). Identifying replicable SNPs that contribute to disease risk can be instrumental in a full understanding of disease biology and the development of therapeutics. In the first data set, there are 20,2192021920,21920 , 219 type 2 diabetes cases and 54,6045460454,60454 , 604 controls from the male population. In the second data set, there are 14,6211462114,62114 , 621 type 2 diabetes cases and 60,3776037760,37760 , 377 controls from the female population. We aim to find replicable SNPs that are associated with both genders. The datasets are downloaded from DIAbetes Genetics Replication and Meta-analysis Consortium at https://www.diagram-consortium.org/downloads.html. The male group contains summary statistics of 123,535123535123,535123 , 535 SNPs, and the female group contains summary statistics of 118,399118399118,399118 , 399 SNPs. We analyze the paired p𝑝pitalic_p-values of m=118,364𝑚118364m=118,364italic_m = 118 , 364 SNPs that are common to both studies, where y1j,j=1,,mformulae-sequencesubscript𝑦1𝑗𝑗1𝑚y_{1j},j=1,\ldots,mitalic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_j = 1 , … , italic_m denote p𝑝pitalic_p-values for male population and y2j,j=1,,mformulae-sequencesubscript𝑦2𝑗𝑗1𝑚y_{2j},j=1,\ldots,mitalic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT , italic_j = 1 , … , italic_m denote p𝑝pitalic_p-values for female population. The estimated transition matrix is

A^=(0.98400.00660.00400.00550.06570.92710.00040.00690.05460.00100.93790.00660.05010.00450.00500.9403),^𝐴matrix0.98400.00660.00400.00550.06570.92710.00040.00690.05460.00100.93790.00660.05010.00450.00500.9403\displaystyle\hat{A}=\begin{pmatrix}0.9840&0.0066&0.0040&0.0055\\ 0.0657&0.9271&0.0004&0.0069\\ 0.0546&0.0010&0.9379&0.0066\\ 0.0501&0.0045&0.0050&0.9403\end{pmatrix},over^ start_ARG italic_A end_ARG = ( start_ARG start_ROW start_CELL 0.9840 end_CELL start_CELL 0.0066 end_CELL start_CELL 0.0040 end_CELL start_CELL 0.0055 end_CELL end_ROW start_ROW start_CELL 0.0657 end_CELL start_CELL 0.9271 end_CELL start_CELL 0.0004 end_CELL start_CELL 0.0069 end_CELL end_ROW start_ROW start_CELL 0.0546 end_CELL start_CELL 0.0010 end_CELL start_CELL 0.9379 end_CELL start_CELL 0.0066 end_CELL end_ROW start_ROW start_CELL 0.0501 end_CELL start_CELL 0.0045 end_CELL start_CELL 0.0050 end_CELL start_CELL 0.9403 end_CELL end_ROW end_ARG ) ,

and the stationary probability of different states is

π^=(0.779,0.077,0.057,0.087),^𝜋0.7790.0770.0570.087\displaystyle\hat{\pi}=(0.779,0.077,0.057,0.087),over^ start_ARG italic_π end_ARG = ( 0.779 , 0.077 , 0.057 , 0.087 ) ,

which is the eigenvector of A^^𝐴\hat{A}over^ start_ARG italic_A end_ARG corresponding to eigenvalue 1.11.1 . The estimated probability density functions of non-null p𝑝pitalic_p-values, f^1subscript^𝑓1\hat{f}_{1}over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f^2subscript^𝑓2\hat{f}_{2}over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, are plotted in Figure 3.

Refer to caption
Figure 3: Estimates of f^1subscript^𝑓1\hat{f}_{1}over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f^2subscript^𝑓2\hat{f}_{2}over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in two type 2 diabetes studies

Next, we compare our method to the competing methods. The results of different methods at the FDR level q=105𝑞superscript105q=10^{-5}italic_q = 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT are summarized in Figure 4. The MaxP procedure is the most conservative with 176176176176 findings, and all of them are identified by other methods. Our method has 1,60416041,6041 , 604 findings, among which 1,20212021,2021 , 202 are uniquely identified by our method.

Refer to caption
Figure 4: Replicability analysis results of type 2 diabetes data

Among these 1,20212021,2021 , 202 unique findings, 107107107107 are recorded in the NHGRI-EBI GWAS Catalog (Sollis et al., 2023), which reports associations with type 2 diabetes in published GWAS at SNP level.

Refer to caption
Figure 5: The Manhattan plots of type 2 diabetes GWASs based on pmaxsubscript𝑝maxp_{\rm max}italic_p start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, LfdrLfdr{\rm Lfdr}roman_Lfdr and rLISrLIS{\rm rLIS}roman_rLIS. The horizontal blue lines denote the FDR cutoffs of 1×1051superscript1051\times 10^{-5}1 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT produced by maxP, STAREG and our method.

Figure 5 presents the Manhattan plots of MaxP, STAREG and our method. In Figure 5, the vertical axes are log10subscript10-\log_{10}- roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT transformations of test statistics for replicability analysis. They are pmaxsubscript𝑝maxp_{\rm max}italic_p start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT for MaxP, LfdrLfdr{\rm Lfdr}roman_Lfdr for STAREG and rLISrLIS{\rm rLIS}roman_rLIS for our method. Then we map the remaining SNPs to genes by R package snpGeneSets (Mei et al., 2016). The other 1,09510951,0951 , 095 SNPs are mapped to 77777777 genes, many of which have been reported to be related to type 2 diabetes in previous literature. For example, JAZF1, CDC123, THADA, ADAMTS9-AS2 and NOTCH2 were reported to be associated with type 2 diabetes (Zeggini et al., 2008). JAZF1 is a key transcriptional regulator of ribosome biogenesis, global protein, and insulin translation and has a significant association with type 2 diabetes (Kobiita et al., 2020). 33 SNPs are mapped to JAZF1 such as rs10245867 (rLIS: 2.64e-06; Male p𝑝pitalic_p-value: 1.03e-08; Female p𝑝pitalic_p-vale: 6.64e-05). ADAMTS9 can increase the risk of type 2 diabetes through impairment of insulin sensitivity (Graae et al., 2019). 25 SNPs are mapped to ADAMTS9, such as rs11914351 (rLIS: 3.71e-05; Male p𝑝pitalic_p-value: 8.53e-4; Female p𝑝pitalic_p-value: 5.70e-2). Increased expression of NOTCH2 may play a role in the pathogenesis of type 2 diabetes, and they may contribute to poor control of the glycemic state (Ghanem et al., 2020). 9 SNPs are mapped to NOTCH2, such as rs10127888 (rLIS: 4.44e-05; Male p𝑝pitalic_p-value: 2.82e-2; Female p𝑝pitalic_p-value: 1.52e-2).

6 Concluding remarks

In this paper, we propose a robust and powerful inference of high-dimensional replicability analysis accounting for dependence. We deal with summary statistics such as p𝑝pitalic_p-values from each study instead of the raw data since summary statistics are easier to access and store. We capture the local dependence of p𝑝pitalic_p-values by a hidden Markov model. We account for the heterogeneity of different studies by joint hidden states, allowing non-null density functions to have different distributions and estimating them non-parametrically. Furthermore, we obtain the identifiability condition of the unknown parameters and functions, consistency of estimated parameters and functions, and the asymptotic false discovery rate control. Simulation studies demonstrate valid false discovery rate control and higher power of our method. GWAS data analysis provides new biological insights that otherwise cannot be obtained using existing methods. For the maximum likelihood estimation with a hidden Markov model, theoretical results such as the rate of convergence is desirable, and we leave it as future work. We use two studies to illustrate the main ideas. In theory, our approach can be extended to more than two studies. In practice, for n𝑛nitalic_n studies, the total number of possible states is 2nsuperscript2𝑛2^{n}2 start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, which is computationally prohibitive, and a new approach is warranted.

Acknowledgement

We thank Yan Li for her help with the simulation studies. This research is partially supported by NSF 2311249 and NIH 2UL1TR001427-5.

References

  • Abraham et al. (2022) K. Abraham, I. Castillo, and E. Gassiat. Multiple testing in nonparametric hidden Markov models: An empirical Bayes approach. Journal of Machine Learning Research, 23(94):1–57, 2022.
  • Alexandrovich et al. (2016) G. Alexandrovich, H. Holzmann, and A. Leister. Nonparametric identification and maximum likelihood estimation for hidden Markov models. Biometrika, 103(2):423–434, 2016.
  • Barlow and Brunk (1972) R. Barlow and H. Brunk. The isotonic regression problem and its dual. Journal of the American Statistical Association, 67(337):140–147, 1972.
  • Baum et al. (1970) L. E. Baum, T. Petrie, G. Soules, and N. Weiss. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics, 41(1):164–171, 1970.
  • Benjamini and Hochberg (1995) Y. Benjamini and Y. Hochberg. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B (Methodological), 57(1):289–300, 1995.
  • Benjamini et al. (2009) Y. Benjamini, R. Heller, and D. Yekutieli. Selective inference in complex research. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 367(1906):4255–4271, 2009.
  • Bickel et al. (1998) P. J. Bickel, Y. Ritov, and T. Ryden. Asymptotic normality of the maximum-likelihood estimator for general hidden Markov models. Annals of Statistics, 26(4):1614–1635, 1998.
  • Birkhoff (1931) G. D. Birkhoff. Proof of the ergodic theorem. Proceedings of the National Academy of Sciences, 17(12):656–660, 1931.
  • Bogomolov and Heller (2018) M. Bogomolov and R. Heller. Assessing replicability of findings across two studies of multiple features. Biometrika, 105(3):505–516, 2018.
  • Bogomolov and Heller (2023) M. Bogomolov and R. Heller. Replicability across multiple studies. Statistical Science, 38(4):602–620, 2023.
  • Cao et al. (2013) H. Cao, W. Sun, and M. R. Kosorok. The optimal power puzzle: scrutiny of the monotone likelihood ratio assumption in multiple testing. Biometrika, 100(2):495–502, 2013.
  • Dempster et al. (1977) A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1):1–22, 1977.
  • Durrett (2019) R. Durrett. Probability: theory and examples. Cambridge University Press, 2019.
  • Efron (2012) B. Efron. Large-scale inference: empirical Bayes methods for estimation, testing, and prediction, volume 1. Cambridge University Press, 2012.
  • Fekete (1923) M. Fekete. Über die verteilung der wurzeln bei gewissen algebraischen gleichungen mit ganzzahligen koeffizienten. Mathematische Zeitschrift, 17(1):228–249, 1923.
  • Ghanem et al. (2020) Y. Ghanem, A. Ismail, R. Elsharkawy, R. Fathalla, and A. El Feky. Expression of Notch 2 and ABCC8 genes in patients with type 2 diabetes mellitus and their association with diabetic kidney disease. Clinical Diabetology, 9(5):306–312, 2020.
  • Graae et al. (2019) A.-S. Graae, N. Grarup, R. Ribel-Madsen, S. H. Lystbaek, T. Boesgaard, H. Staiger, A. Fritsche, N. Wellner, K. Sulek, M. Kjolby, et al. ADAMTS9 regulates skeletal muscle insulin sensitivity through extracellular matrix alterations. Diabetes, 68(3):502–514, 2019.
  • Heller and Yekutieli (2014) R. Heller and D. Yekutieli. Replicability analysis for genome-wide association studies. Annals of Applied Statistics, 8(1):481–498, 2014.
  • Hung and Fithian (2020) K. Hung and W. Fithian. Statistical methods for replicability assessment. Annals of Applied Statistics, 14(3):1063–1087, 2020.
  • Kobiita et al. (2020) A. Kobiita, S. Godbersen, E. Araldi, U. Ghoshdastider, M. W. Schmid, G. Spinas, H. Moch, and M. Stoffel. The diabetes gene JAZF1 is essential for the homeostatic control of ribosome biogenesis and function in metabolic stress. Cell Reports, 32(1), 2020.
  • Leroux (1992) B. G. Leroux. Maximum-likelihood estimation for hidden Markov models. Stochastic Processes and Their Applications, 40(1):127–143, 1992.
  • Li and Stephens (2003) N. Li and M. Stephens. Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data. Genetics, 165(4):2213–2233, 2003.
  • Li et al. (2011) Q. Li, J. B. Brown, H. Huang, and P. J. Bickel. Measuring reproducibility of high-throughput experiments. Annals of Applied Statistics, 5(3):1752–1779, 2011.
  • Li et al. (2023) Y. Li, X. Zhou, R. Chen, X. Zhang, and H. Cao. Stareg: an empirical Bayesian approach to detect replicable spatially variable genes in spatial transcriptomic studies. bioRxiv 10.1101/2023.05.30.542607, 2023.
  • Lipton et al. (2016) R. Lipton, T. Schwedt, B. Friedman, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet, 388(10053):1545–1602, 2016.
  • Lyu et al. (2023) P. Lyu, Y. Li, X. Wen, and H. Cao. JUMP: replicability analysis of high-throughput experiments with applications to spatial transcriptomic studies. Bioinformatics, 39(6):btad366, 2023.
  • Mei et al. (2016) H. Mei, L. Li, F. Jiang, J. Simino, M. Griswold, T. Mosley, and S. Liu. snpGeneSets: an r package for genome-wide study annotation. G3: Genes, Genomes, Genetics, 6(12):4087–4095, 2016.
  • Morris et al. (2012) B. Morris, Andrewand Voight, T. Teslovich, T. Ferreira, A. Segré, et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nature Genetics, 44(9):981–990, 2012.
  • Philtron et al. (2018) D. Philtron, Y. Lyu, Q. Li, and D. Ghosh. Maximum rank reproducibility: a nonparametric approach to assessing reproducibility in replicate experiments. Journal of the American Statistical Association, 113(523):1028–1039, 2018.
  • Pritchard and Przeworski (2001) J. K. Pritchard and M. Przeworski. Linkage disequilibrium in humans: models and data. The American Journal of Human Genetics, 69(1):1–14, 2001.
  • Riesz (1928) F. Riesz. Sur la convergence en moyenne. Acta Sci. Math, 4(1):58–64, 1928.
  • Robertson et al. (1988) T. Robertson, R. L. Dykstra, and F. T. Wright. Order restricted statistical inference. In Wiley Series in Probability and Mathematical Statistics. John Wiley and Sons, 1988.
  • Sesia et al. (2021) M. Sesia, S. Bates, E. Candès, J. Marchini, and C. Sabatti. False discovery rate control in genome-wide association studies with population structure. Proceedings of the National Academy of Sciences, 118(40):e2105841118, 2021.
  • Sollis et al. (2023) E. Sollis, A. Mosaku, A. Abid, A. Buniello, M. Cerezo, L. Gil, T. Groza, O. Güneş, P. Hall, J. Hayhurst, et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Research, 51(D1):D977–D985, 2023.
  • Storey (2002) J. D. Storey. A direct approach to false discovery rates. Journal of the Royal Statistical Society: Series B: Statistical Methodology, 64(3):479–498, 2002.
  • Storey and Tibshirani (2003) J. D. Storey and R. Tibshirani. Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences, 100(16):9440–9445, 2003.
  • Storey et al. (2004) J. D. Storey, J. E. Taylor, and D. Siegmund. Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. Journal of the Royal Statistical Society: Series B: Statistical Methodology, 66(1):187–205, 2004.
  • Sun and Cai (2009) W. Sun and T. Cai. Large-scale multiple testing under dependence. Journal of the Royal Statistical Society: Series B: Statistical Methodology, 71(2):393–424, 2009.
  • Sun and Cai (2007) W. Sun and T. T. Cai. Oracle and adaptive compound decision rules for false discovery rate control. Journal of the American Statistical Association, 102(479):901–912, 2007.
  • Williams (1991) D. Williams. Probability with martingales. Cambridge university press, 1991.
  • Zeggini et al. (2008) E. Zeggini, L. J. Scott, R. Saxena, B. F. Voight, J. L. Marchini, T. Hu, P. I. de Bakker, G. R. Abecasis, P. Almgren, G. Andersen, et al. Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nature Genetics, 40(5):638–645, 2008.

Supplementary Materials

Appendix A Proof of main results

A.1 Proof of Proposition 1

Proof.

Since the spaces of the transition matrix A𝐴Aitalic_A and stationary probability π𝜋\piitalic_π are bounded and closed with finite dimensions, they are compact. We just need to show the decreasing density function space \mathcal{H}caligraphic_H is compact under the Hellinger distance dH(,)subscript𝑑𝐻d_{H}(\cdot,\cdot)italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( ⋅ , ⋅ ). Consider any Cauchy sequence {gn}n=1superscriptsubscriptsubscript𝑔𝑛𝑛1\{g_{n}\}_{n=1}^{\infty}\subseteq\mathcal{H}{ italic_g start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ⊆ caligraphic_H. Our goal is to show there exists some g𝑔g\in\mathcal{H}italic_g ∈ caligraphic_H such that dH(gn,g)0subscript𝑑𝐻subscript𝑔𝑛𝑔0d_{H}(g_{n},g)\rightarrow 0italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_g ) → 0 as n𝑛n\rightarrow\inftyitalic_n → ∞. Denote hn=gn1/2subscript𝑛superscriptsubscript𝑔𝑛12h_{n}=g_{n}^{1/2}italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_g start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT for all n=1,2,𝑛12n=1,2,\ldotsitalic_n = 1 , 2 , …. Then

01gn(y)dy=01hn2(y)dy=hn22=1,superscriptsubscript01subscript𝑔𝑛𝑦differential-d𝑦superscriptsubscript01superscriptsubscript𝑛2𝑦differential-d𝑦superscriptsubscriptnormsubscript𝑛221\int_{0}^{1}g_{n}(y){\rm d}y=\int_{0}^{1}h_{n}^{2}(y){\rm d}y=\|h_{n}\|_{2}^{2% }=1,∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_y ) roman_d italic_y = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_y ) roman_d italic_y = ∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 1 ,

which means that hnL2[0,1]subscript𝑛superscript𝐿201h_{n}\in L^{2}[0,1]italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ 0 , 1 ]. Since L2[0,1]superscript𝐿201L^{2}[0,1]italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ 0 , 1 ] is compact, there exists some hL2[0,1]superscript𝐿201h\in L^{2}[0,1]italic_h ∈ italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT [ 0 , 1 ] satisfying

hnh22=01{hn(y)h(y)}2dy0 as n.superscriptsubscriptnormsubscript𝑛22superscriptsubscript01superscriptsubscript𝑛𝑦𝑦2differential-d𝑦0 as 𝑛\displaystyle\|h_{n}-h\|_{2}^{2}=\int_{0}^{1}\{h_{n}(y)-h(y)\}^{2}{\rm d}y% \rightarrow 0\text{ as }n\rightarrow\infty.∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT { italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_y ) - italic_h ( italic_y ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_d italic_y → 0 as italic_n → ∞ .

Denote g=h2𝑔superscript2g=h^{2}italic_g = italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Our goal is to show that g𝑔g\in\mathcal{H}italic_g ∈ caligraphic_H: g𝑔gitalic_g is a non-increasing density function. We next show that 01g(y)dy=1superscriptsubscript01𝑔𝑦differential-d𝑦1\int_{0}^{1}g(y){\rm d}y=1∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_y ) roman_d italic_y = 1 by contradiction. Note that 01g(y)dy=h22superscriptsubscript01𝑔𝑦differential-d𝑦superscriptsubscriptnorm22\int_{0}^{1}g(y){\rm d}y=\|h\|_{2}^{2}∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_y ) roman_d italic_y = ∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Assume h21subscriptnorm21\|h\|_{2}\neq 1∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≠ 1 and denote δ=|h21|>0𝛿subscriptnorm210\delta=|\|h\|_{2}-1|>0italic_δ = | ∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 | > 0. Thus

hnh22=superscriptsubscriptnormsubscript𝑛22absent\displaystyle\|h_{n}-h\|_{2}^{2}=∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 01{hn(y)h(y)}2dysuperscriptsubscript01superscriptsubscript𝑛𝑦𝑦2differential-d𝑦\displaystyle\int_{0}^{1}\{h_{n}(y)-h(y)\}^{2}{\rm d}y∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT { italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_y ) - italic_h ( italic_y ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_d italic_y
=\displaystyle== hn22+h22201hn(y)h(y)dysuperscriptsubscriptnormsubscript𝑛22superscriptsubscriptnorm222superscriptsubscript01subscript𝑛𝑦𝑦differential-d𝑦\displaystyle\|h_{n}\|_{2}^{2}+\|h\|_{2}^{2}-2\int_{0}^{1}h_{n}(y)h(y){\rm d}y∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_y ) italic_h ( italic_y ) roman_d italic_y
\displaystyle\geq hn22+h222hn2h2superscriptsubscriptnormsubscript𝑛22superscriptsubscriptnorm222subscriptnormsubscript𝑛2subscriptnorm2\displaystyle\|h_{n}\|_{2}^{2}+\|h\|_{2}^{2}-2\|h_{n}\|_{2}\|h\|_{2}∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 ∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT
=\displaystyle== (h21)2=δ2>0,superscriptsubscriptnorm212superscript𝛿20\displaystyle(\|h\|_{2}-1)^{2}=\delta^{2}>0,( ∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT > 0 ,

which is contradictory to the fact that hnh20subscriptnormsubscript𝑛20\|h_{n}-h\|_{2}\rightarrow 0∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT → 0. Then we can conclude that h22=01g(y)dy=1.superscriptsubscriptnorm22superscriptsubscript01𝑔𝑦differential-d𝑦1\|h\|_{2}^{2}=\int_{0}^{1}g(y){\rm d}y=1.∥ italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_g ( italic_y ) roman_d italic_y = 1 . Next, we show that hhitalic_h is also non-increasing. For any ε>0𝜀0\varepsilon>0italic_ε > 0, denote

En(ε)={y:|hn(y)h(y)|>ε}.subscript𝐸𝑛𝜀conditional-set𝑦subscript𝑛𝑦𝑦𝜀\displaystyle E_{n}(\varepsilon)=\{y:|h_{n}(y)-h(y)|>\varepsilon\}.italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_ε ) = { italic_y : | italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_y ) - italic_h ( italic_y ) | > italic_ε } . (A.1)

Denote μ()𝜇\mu(\cdot)italic_μ ( ⋅ ) as the Lebesgue measure. Thus

εμ{En(ε)}1/2=𝜀𝜇superscriptsubscript𝐸𝑛𝜀12absent\displaystyle\varepsilon\mu\{E_{n}(\varepsilon)\}^{1/2}=italic_ε italic_μ { italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_ε ) } start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT = (En(ε)ε2𝑑y)1/2superscriptsubscriptsubscript𝐸𝑛𝜀superscript𝜀2differential-d𝑦12\displaystyle\left(\int_{E_{n}(\varepsilon)}\varepsilon^{2}dy\right)^{1/2}( ∫ start_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_ε ) end_POSTSUBSCRIPT italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_y ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT
\displaystyle\leq (En(ε)|hn(y)h(y)|2𝑑y)1/2superscriptsubscriptsubscript𝐸𝑛𝜀superscriptsubscript𝑛𝑦𝑦2differential-d𝑦12\displaystyle\left(\int_{E_{n}(\varepsilon)}|h_{n}(y)-h(y)|^{2}dy\right)^{1/2}( ∫ start_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_ε ) end_POSTSUBSCRIPT | italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_y ) - italic_h ( italic_y ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_y ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT
\displaystyle\leq (01|hn(y)h(y)|2𝑑y)1/2superscriptsuperscriptsubscript01superscriptsubscript𝑛𝑦𝑦2differential-d𝑦12\displaystyle\left(\int_{0}^{1}|h_{n}(y)-h(y)|^{2}dy\right)^{1/2}( ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_y ) - italic_h ( italic_y ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_y ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT
=\displaystyle== hnh220 as n,superscriptsubscriptnormsubscript𝑛220 as 𝑛\displaystyle\|h_{n}-h\|_{2}^{2}\to 0\text{ as }n\to\infty,∥ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_h ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT → 0 as italic_n → ∞ ,

which implies that hnsubscript𝑛h_{n}italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT converges to hhitalic_h in measure, or equivalently, for any ε>0𝜀0\varepsilon>0italic_ε > 0,

limnμ(En{ε)}=0.\displaystyle\lim_{n\to\infty}\mu(E_{n}\{\varepsilon)\}=0.roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_μ ( italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT { italic_ε ) } = 0 .

By the theorem of Riesz [Riesz, 1928], there exists a subsequence {hnk}subscriptsubscript𝑛𝑘\{h_{n_{k}}\}{ italic_h start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT } of {hn}subscript𝑛\{h_{n}\}{ italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT }, such that hnkhsubscriptsubscript𝑛𝑘h_{n_{k}}\rightarrow hitalic_h start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT → italic_h almost everywhere. Since hnksubscriptsubscript𝑛𝑘h_{n_{k}}italic_h start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT are non-increasing, we could conclude that hhitalic_h is also non-increasing. In conclusion, g=h2𝑔superscript2g=h^{2}\in\mathcal{H}italic_g = italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∈ caligraphic_H and consequently, \mathcal{H}caligraphic_H is compact with respect to the Hellinger distance dH(,)subscript𝑑𝐻d_{H}(\cdot,\cdot)italic_d start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ( ⋅ , ⋅ ). ∎

A.2 Proof of Theorem 3.1

For any ϕΦitalic-ϕΦ\phi\in\Phiitalic_ϕ ∈ roman_Φ with d(ϕ,ϕ0)𝑑italic-ϕsubscriptitalic-ϕ0d(\phi,\phi_{0})italic_d ( italic_ϕ , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), define the conditional distribution of (y1j,y2j)j=1msuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚(y_{1j},y_{2j})_{j=1}^{m}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT given s1=k,k=0,1,2,3formulae-sequencesubscript𝑠1𝑘𝑘0123s_{1}=k,k=0,1,2,3italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_k , italic_k = 0 , 1 , 2 , 3 as

m(k;ϕ):=:subscript𝑚𝑘italic-ϕ\displaystyle\ell_{m}(k;\phi):=roman_ℓ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ; italic_ϕ ) : = f(k)(y11,y21;ϕ)s2smak,s2(ϕ)f(s2)(y12,y22;ϕ)j=3masj1,sj(ϕ)f(sj)(y1j,y2j;ϕ),superscript𝑓𝑘subscript𝑦11subscript𝑦21italic-ϕsubscriptsubscript𝑠2subscriptsubscript𝑠𝑚subscript𝑎𝑘subscript𝑠2italic-ϕsuperscript𝑓subscript𝑠2subscript𝑦12subscript𝑦22italic-ϕsuperscriptsubscriptproduct𝑗3𝑚subscript𝑎subscript𝑠𝑗1subscript𝑠𝑗italic-ϕsuperscript𝑓subscript𝑠𝑗subscript𝑦1𝑗subscript𝑦2𝑗italic-ϕ\displaystyle f^{(k)}(y_{11},y_{21};\phi)\sum_{s_{2}}\dots\sum_{s_{m}}a_{k,s_{% 2}}(\phi)f^{(s_{2})}(y_{12},y_{22};\phi)\prod_{j=3}^{m}a_{s_{j-1},s_{j}}(\phi)% f^{(s_{j})}(y_{1j},y_{2j};\phi),italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ; italic_ϕ ) ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT … ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT ; italic_ϕ ) ∏ start_POSTSUBSCRIPT italic_j = 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ; italic_ϕ ) ,

where sjsubscript𝑠𝑗s_{j}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT denotes the hidden state of the j𝑗jitalic_jth gene for j=1,,m𝑗1𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m. Denote the largest m(k;ϕ)subscript𝑚𝑘italic-ϕ\ell_{m}(k;\phi)roman_ℓ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ; italic_ϕ ) for k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3

qm(ϕ):=maxk=0,1,2,3m(k;ϕ).assignsubscript𝑞𝑚italic-ϕsubscript𝑘0123subscript𝑚𝑘italic-ϕq_{m}(\phi):=\max_{k=0,1,2,3}\ell_{m}(k;\phi).italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) := roman_max start_POSTSUBSCRIPT italic_k = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ; italic_ϕ ) .

Then the likelihood function pm(ϕ)=pm((y1j,y2j)j=1m;ϕ)subscript𝑝𝑚italic-ϕsubscript𝑝𝑚superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚italic-ϕp_{m}(\phi)=p_{m}\left((y_{1j},y_{2j})_{j=1}^{m};\phi\right)italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) = italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ; italic_ϕ ) satisfies

pm(ϕ)=subscript𝑝𝑚italic-ϕabsent\displaystyle p_{m}(\phi)=italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) = k=0,1,2,3πk(ϕ)m(k;ϕ)qm(ϕ),subscript𝑘0123subscript𝜋𝑘italic-ϕsubscript𝑚𝑘italic-ϕsubscript𝑞𝑚italic-ϕ\displaystyle\sum_{k=0,1,2,3}\pi_{k}(\phi)\ell_{m}(k;\phi)\leq q_{m}(\phi),∑ start_POSTSUBSCRIPT italic_k = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_ϕ ) roman_ℓ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ; italic_ϕ ) ≤ italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) , (A.2)

where πk(ϕ)=Pϕ(sj=k)subscript𝜋𝑘italic-ϕsubscript𝑃italic-ϕsubscript𝑠𝑗𝑘\pi_{k}(\phi)=P_{\phi}(s_{j}=k)italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_ϕ ) = italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ) for j=1,,m𝑗1𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m; k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3 and satisfies k=03πk(ϕ)=1superscriptsubscript𝑘03subscript𝜋𝑘italic-ϕ1\sum_{k=0}^{3}\pi_{k}(\phi)=1∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_ϕ ) = 1. In addition, assume qm(ϕ)=m(k0;ϕ)subscript𝑞𝑚italic-ϕsubscript𝑚subscript𝑘0italic-ϕq_{m}(\phi)=\ell_{m}(k_{0};\phi)italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) = roman_ℓ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) for some k0{0,1,2,3}subscript𝑘00123k_{0}\in\{0,1,2,3\}italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 , 3 }. Then

pm(ϕ)=subscript𝑝𝑚italic-ϕabsent\displaystyle p_{m}(\phi)=italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) = k=0,1,2,3πk(ϕ)m(k;ϕ)πk0(ϕ)m(k0;ϕ)ε0qm(ϕ),subscript𝑘0123subscript𝜋𝑘italic-ϕsubscript𝑚𝑘italic-ϕsubscript𝜋subscript𝑘0italic-ϕsubscript𝑚subscript𝑘0italic-ϕsubscript𝜀0subscript𝑞𝑚italic-ϕ\displaystyle\sum_{k=0,1,2,3}\pi_{k}(\phi)\ell_{m}(k;\phi)\geq\pi_{k_{0}}(\phi% )\ell_{m}(k_{0};\phi)\geq\varepsilon_{0}q_{m}(\phi),∑ start_POSTSUBSCRIPT italic_k = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_ϕ ) roman_ℓ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ; italic_ϕ ) ≥ italic_π start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) roman_ℓ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) ≥ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) , (A.3)

where (A.3) holds due to (C2): πk(ϕ)ε0subscript𝜋𝑘italic-ϕsubscript𝜀0\pi_{k}(\phi)\geq\varepsilon_{0}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_ϕ ) ≥ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for all k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3. Therefore, combining (A.2) and (A.3) and taking the logarithm, we have

log(ε0)logpm(ϕ)qm(ϕ)0.subscript𝜀0subscript𝑝𝑚italic-ϕsubscript𝑞𝑚italic-ϕ0\log\left(\varepsilon_{0}\right)\leq\log\frac{p_{m}(\phi)}{q_{m}(\phi)}\leq 0.roman_log ( italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≤ roman_log divide start_ARG italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) end_ARG start_ARG italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) end_ARG ≤ 0 . (A.4)

Dividing (A.4) by m𝑚mitalic_m, we have

1mlog(ε0)1mlogpm(ϕ)1mlogqm(ϕ)0.1𝑚subscript𝜀01𝑚subscript𝑝𝑚italic-ϕ1𝑚subscript𝑞𝑚italic-ϕ0\displaystyle\frac{1}{m}\log\left(\varepsilon_{0}\right)\leq\frac{1}{m}\log p_% {m}(\phi)-\frac{1}{m}\log q_{m}(\phi)\leq 0.divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log ( italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≤ divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) - divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) ≤ 0 . (A.5)

Letting m𝑚m\rightarrow\inftyitalic_m → ∞, the lower bound of inequality (A.5) tends to 00. Hence m1logqm(ϕ)superscript𝑚1subscript𝑞𝑚italic-ϕm^{-1}\log q_{m}(\phi)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) and m1logpm(ϕ)superscript𝑚1subscript𝑝𝑚italic-ϕm^{-1}\log p_{m}(\phi)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) converges to the same limit in probability. Taking the expectation on all terms of inequality (A.5), we know m1Eϕ0logqm(ϕ)superscript𝑚1subscript𝐸subscriptitalic-ϕ0subscript𝑞𝑚italic-ϕm^{-1}E_{\phi_{0}}\log q_{m}(\phi)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) has the same limit as m1Eϕ0logpm(ϕ)superscript𝑚1subscript𝐸subscriptitalic-ϕ0subscript𝑝𝑚italic-ϕm^{-1}E_{\phi_{0}}\log p_{m}(\phi)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ). By Theorem 2 in Leroux [1992], there exists some H(ϕ0,ϕ)<𝐻subscriptitalic-ϕ0italic-ϕH(\phi_{0},\phi)<\inftyitalic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) < ∞ satisfying

limm1mEϕ0{logpm(ϕ)}subscript𝑚1𝑚subscript𝐸subscriptitalic-ϕ0subscript𝑝𝑚italic-ϕ\displaystyle\lim_{m\rightarrow\infty}\frac{1}{m}E_{\phi_{0}}\{\log p_{m}(\phi)\}roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) } =H(ϕ0,ϕ), andabsent𝐻subscriptitalic-ϕ0italic-ϕ and\displaystyle=H(\phi_{0},\phi),\text{ and }= italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) , and
limm1mlogpm(ϕ)subscript𝑚1𝑚subscript𝑝𝑚italic-ϕ\displaystyle\lim_{m\rightarrow\infty}\frac{1}{m}\log p_{m}(\phi)roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) =H(ϕ0,ϕ) almost surely under ϕ0.absent𝐻subscriptitalic-ϕ0italic-ϕ almost surely under subscriptitalic-ϕ0\displaystyle=H(\phi_{0},\phi)\text{ almost surely under }\phi_{0}.= italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) almost surely under italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .

We also have

limm1mEϕ0{logqm(ϕ)}subscript𝑚1𝑚subscript𝐸subscriptitalic-ϕ0subscript𝑞𝑚italic-ϕ\displaystyle\lim_{m\rightarrow\infty}\frac{1}{m}E_{\phi_{0}}\{\log q_{m}(\phi)\}roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) } =H(ϕ0,ϕ), andabsent𝐻subscriptitalic-ϕ0italic-ϕ and\displaystyle=H(\phi_{0},\phi),\text{ and }= italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) , and
limm1mlogqm(ϕ)subscript𝑚1𝑚subscript𝑞𝑚italic-ϕ\displaystyle\lim_{m\rightarrow\infty}\frac{1}{m}\log q_{m}(\phi)roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) =H(ϕ0,ϕ) almost surely under ϕ0.absent𝐻subscriptitalic-ϕ0italic-ϕ almost surely under subscriptitalic-ϕ0\displaystyle=H(\phi_{0},\phi)\text{ almost surely under }\phi_{0}.= italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) almost surely under italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .

Replacing ϕitalic-ϕ\phiitalic_ϕ by ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we get the limit H(ϕ0,ϕ0)𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0H(\phi_{0},\phi_{0})italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). Lemma 6 in Leroux [1992] gives that H(ϕ0,ϕ)<H(ϕ0,ϕ0)𝐻subscriptitalic-ϕ0italic-ϕ𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0H(\phi_{0},\phi)<H(\phi_{0},\phi_{0})italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) < italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) for ϕϕ0italic-ϕsubscriptitalic-ϕ0\phi\neq\phi_{0}italic_ϕ ≠ italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Letting ε={H(ϕ0,ϕ0)H(ϕ0,ϕ)}/2𝜀𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0𝐻subscriptitalic-ϕ0italic-ϕ2\varepsilon=\{H(\phi_{0},\phi_{0})-H(\phi_{0},\phi)\}/2italic_ε = { italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) } / 2, there exists mεsubscript𝑚𝜀m_{\varepsilon}italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT such that,

1mεEϕ0{logqmε(ϕ)}<H(ϕ0,ϕ)+ε=H(ϕ0,ϕ0)ε.1subscript𝑚𝜀subscript𝐸subscriptitalic-ϕ0subscript𝑞subscript𝑚𝜀italic-ϕ𝐻subscriptitalic-ϕ0italic-ϕ𝜀𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0𝜀\displaystyle\frac{1}{m_{\varepsilon}}E_{\phi_{0}}\{\log q_{m_{\varepsilon}}(% \phi)\}<H(\phi_{0},\phi)+\varepsilon=H(\phi_{0},\phi_{0})-\varepsilon.divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) } < italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ) + italic_ε = italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ε . (A.6)

Denote Oϕ,r={ϕΦ:d(ϕ,ϕ)<r}subscript𝑂italic-ϕ𝑟conditional-setsuperscriptitalic-ϕΦ𝑑superscriptitalic-ϕitalic-ϕ𝑟O_{\phi,r}=\{\phi^{\prime}\in\Phi:d(\phi^{\prime},\phi)<r\}italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT = { italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Φ : italic_d ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_ϕ ) < italic_r } as a ball centered at ϕitalic-ϕ\phiitalic_ϕ with radius r>0𝑟0r>0italic_r > 0, where d(ϕ,ϕ)𝑑superscriptitalic-ϕitalic-ϕd(\phi^{\prime},\phi)italic_d ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_ϕ ) is the distance between ϕsuperscriptitalic-ϕ\phi^{\prime}italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and ϕitalic-ϕ\phiitalic_ϕ defined in (3.1). Eϕ0[{log(supϕOϕ,rqmε(ϕ))}+]<subscript𝐸subscriptitalic-ϕ0delimited-[]superscriptsubscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞subscript𝑚𝜀superscriptitalic-ϕE_{\phi_{0}}[\{\log(\sup_{\phi^{\prime}\in O_{\phi,r}}q_{m_{\varepsilon}}(\phi% ^{\prime}))\}^{+}]<\inftyitalic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) } start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ] < ∞ by (C4). Therefore, Eϕ0[{log(supϕOϕ,rqmε(ϕ))}+]subscript𝐸subscriptitalic-ϕ0delimited-[]superscriptsubscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞subscript𝑚𝜀superscriptitalic-ϕE_{\phi_{0}}[\{\log(\sup_{\phi^{\prime}\in O_{\phi,r}}q_{m_{\varepsilon}}(\phi% ^{\prime}))\}^{+}]italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) } start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ] is a bounded monotone increasing function of r𝑟ritalic_r. Since f1(ϕ),f2(ϕ)subscript𝑓1italic-ϕsubscript𝑓2italic-ϕf_{1}(\phi),f_{2}(\phi)italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ϕ ) , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_ϕ ) are continuous functions of ϕitalic-ϕ\phiitalic_ϕ, pm(ϕ)subscript𝑝𝑚italic-ϕp_{m}(\phi)italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) and qm(ϕ)subscript𝑞𝑚italic-ϕq_{m}(\phi)italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) are also continuous. By the Monotone Convergence Theorem and the continuity of qmε(ϕ)subscript𝑞subscript𝑚𝜀italic-ϕq_{m_{\varepsilon}}(\phi)italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ), we have

1mεEϕ0{log(supϕOϕ,rqmε(ϕ))}1mεEϕ0{logqmε(ϕ)} as r0.1subscript𝑚𝜀subscript𝐸subscriptitalic-ϕ0subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞subscript𝑚𝜀superscriptitalic-ϕ1subscript𝑚𝜀subscript𝐸subscriptitalic-ϕ0subscript𝑞subscript𝑚𝜀italic-ϕ as 𝑟0\displaystyle\frac{1}{m_{\varepsilon}}E_{\phi_{0}}\left\{\log\left(\sup_{\phi^% {\prime}\in O_{\phi,r}}q_{m_{\varepsilon}}(\phi^{\prime})\right)\right\}% \rightarrow\frac{1}{m_{\varepsilon}}E_{\phi_{0}}\left\{\log q_{m_{\varepsilon}% }(\phi)\right\}\text{ as }r\rightarrow 0.divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) } → divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) } as italic_r → 0 .

Then there exists r0>0subscript𝑟00r_{0}>0italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0, such that

1mεEϕ0{log(supϕOϕ,r0qmε(ϕ))}<1subscript𝑚𝜀subscript𝐸subscriptitalic-ϕ0subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕsubscript𝑟0subscript𝑞subscript𝑚𝜀superscriptitalic-ϕabsent\displaystyle\frac{1}{m_{\varepsilon}}E_{\phi_{0}}\left\{\log\left(\sup_{\phi^% {\prime}\in O_{\phi,r_{0}}}q_{m_{\varepsilon}}(\phi^{\prime})\right)\right\}<divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) } < 1mεEϕ0{logqmε(ϕ)}+ε/21subscript𝑚𝜀subscript𝐸subscriptitalic-ϕ0subscript𝑞subscript𝑚𝜀italic-ϕ𝜀2\displaystyle\frac{1}{m_{\varepsilon}}E_{\phi_{0}}\{\log q_{m_{\varepsilon}}(% \phi)\}+\varepsilon/2divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) } + italic_ε / 2
<\displaystyle<< H(ϕ0,ϕ0)ε/2,𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0𝜀2\displaystyle H(\phi_{0},\phi_{0})-\varepsilon/2,italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ε / 2 , (A.7)

where the second inequality holds due to (A.6). By the continuity of pm(ϕ)subscript𝑝𝑚italic-ϕp_{m}(\phi)italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) and qm(ϕ)subscript𝑞𝑚italic-ϕq_{m}(\phi)italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ), it follows from (A.5) that

1mlog(ε0)1mlog{supϕOϕ,rpm(ϕ)}1mlog{supϕOϕ,rqm(ϕ)}0.1𝑚subscript𝜀01𝑚subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑝𝑚superscriptitalic-ϕ1𝑚subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞𝑚superscriptitalic-ϕ0\frac{1}{m}\log\left(\varepsilon_{0}\right)\leq\frac{1}{m}\log\left\{\sup_{% \phi^{\prime}\in O_{\phi,r}}p_{m}(\phi^{\prime})\right\}-\frac{1}{m}\log\left% \{\sup_{\phi^{\prime}\in O_{\phi,r}}q_{m}(\phi^{\prime})\right\}\leq 0.divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log ( italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≤ divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log { roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } - divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log { roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } ≤ 0 .

Thus, m1log{supϕOϕ,rpm(ϕ)}superscript𝑚1subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑝𝑚superscriptitalic-ϕm^{-1}\log\{\sup_{\phi^{\prime}\in O_{\phi,r}}p_{m}(\phi^{\prime})\}italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log { roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } and m1log{supϕOϕ,rqm(ϕ)}superscript𝑚1subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞𝑚superscriptitalic-ϕm^{-1}\log\{\sup_{\phi^{\prime}\in O_{\phi,r}}q_{m}(\phi^{\prime})\}italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log { roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } converge to the same limit in probability. Define

J(ϕ0,ϕ;Oϕ,r)=limm1mEϕ0{log(supϕOϕ,rqm(ϕ))}.𝐽subscriptitalic-ϕ0italic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑚1𝑚subscript𝐸subscriptitalic-ϕ0subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞𝑚superscriptitalic-ϕJ(\phi_{0},\phi;O_{\phi,r})=\lim_{m\rightarrow\infty}\frac{1}{m}E_{\phi_{0}}% \left\{\log\left(\sup_{\phi^{\prime}\in O_{\phi,r}}q_{m}(\phi^{\prime})\right)% \right\}.italic_J ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ; italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT ) = roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) } .

In addition, we have

1mlog{supϕOϕ,rqm(ϕ)}1𝑚subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞𝑚superscriptitalic-ϕ\displaystyle\frac{1}{m}\log\left\{\sup_{\phi^{\prime}\in O_{\phi,r}}q_{m}(% \phi^{\prime})\right\}divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log { roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } J(ϕ0,ϕ;Oϕ,r) in probability, andabsent𝐽subscriptitalic-ϕ0italic-ϕsubscript𝑂italic-ϕ𝑟 in probability, and\displaystyle\rightarrow J(\phi_{0},\phi;O_{\phi,r})\text{ in probability, and }→ italic_J ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ; italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT ) in probability, and
1mlog{supϕOϕ,rpm(ϕ)}1𝑚subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑝𝑚superscriptitalic-ϕ\displaystyle\frac{1}{m}\log\left\{\sup_{\phi^{\prime}\in O_{\phi,r}}p_{m}(% \phi^{\prime})\right\}divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log { roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } J(ϕ0,ϕ;Oϕ,r) in probability.absent𝐽subscriptitalic-ϕ0italic-ϕsubscript𝑂italic-ϕ𝑟 in probability\displaystyle\rightarrow J(\phi_{0},\phi;O_{\phi,r})\text{ in probability}.→ italic_J ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ; italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT ) in probability . (A.8)

By the construction of qm(ϕ)=qm((y1j,y2j)j=1m;ϕ)subscript𝑞𝑚italic-ϕsubscript𝑞𝑚superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚italic-ϕq_{m}(\phi)=q_{m}((y_{1j},y_{2j})_{j=1}^{m};\phi)italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) = italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ; italic_ϕ ), Lemma 3 of Leroux [1992] shows that logqm((y1j,y2j)j=1m;ϕ)subscript𝑞𝑚superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚italic-ϕ\log q_{m}((y_{1j},y_{2j})_{j=1}^{m};\phi)roman_log italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ; italic_ϕ ) is subadditive, which means for any sequence (y1j,y2j)j=1msuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚(y_{1j},y_{2j})_{j=1}^{m}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT,

logqs+t((y1j,y2j)j=1s+t;ϕ)logqs((y1j,y2j)j=1s;ϕ)+logqt((y1j,y2j)j=s+1s+t;ϕ).subscript𝑞𝑠𝑡superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑠𝑡italic-ϕsubscript𝑞𝑠superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑠italic-ϕsubscript𝑞𝑡superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗𝑠1𝑠𝑡italic-ϕ\displaystyle\log q_{s+t}((y_{1j},y_{2j})_{j=1}^{s+t};\phi)\leq\log q_{s}((y_{% 1j},y_{2j})_{j=1}^{s};\phi)+\log q_{t}((y_{1j},y_{2j})_{j=s+1}^{s+t};\phi).roman_log italic_q start_POSTSUBSCRIPT italic_s + italic_t end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s + italic_t end_POSTSUPERSCRIPT ; italic_ϕ ) ≤ roman_log italic_q start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ; italic_ϕ ) + roman_log italic_q start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = italic_s + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s + italic_t end_POSTSUPERSCRIPT ; italic_ϕ ) .

By the property of subadditive processes [Fekete, 1923],

J(ϕ0,ϕ;Oϕ,r)=infm1mEϕ0{log(supϕOϕ,rqm(ϕ))}𝐽subscriptitalic-ϕ0italic-ϕsubscript𝑂italic-ϕ𝑟subscriptinfimum𝑚1𝑚subscript𝐸subscriptitalic-ϕ0subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞𝑚superscriptitalic-ϕJ(\phi_{0},\phi;O_{\phi,r})=\inf_{m}\frac{1}{m}E_{\phi_{0}}\left\{\log\left(% \sup_{\phi^{\prime}\in O_{\phi,r}}q_{m}(\phi^{\prime})\right)\right\}italic_J ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ; italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT ) = roman_inf start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) }

which implies that

J(ϕ0,ϕ;Oϕ,r)1mεEϕ0{log(supϕOϕ,rqmε(ϕ))}.𝐽subscriptitalic-ϕ0italic-ϕsubscript𝑂italic-ϕ𝑟1subscript𝑚𝜀subscript𝐸subscriptitalic-ϕ0subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞subscript𝑚𝜀superscriptitalic-ϕJ(\phi_{0},\phi;O_{\phi,r})\leq\frac{1}{m_{\varepsilon}}E_{\phi_{0}}\left\{% \log\left(\sup_{\phi^{\prime}\in O_{\phi,r}}q_{m_{\varepsilon}}(\phi^{\prime})% \right)\right\}.italic_J ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ; italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT ) ≤ divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) } . (A.9)

Consequently, by (A.8), (A.9) and (A.7), we have as m𝑚m\rightarrow\inftyitalic_m → ∞,

1mlog{supϕOϕ,rpm(ϕ)}1𝑚subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑝𝑚superscriptitalic-ϕabsent\displaystyle\frac{1}{m}\log\left\{\sup_{\phi^{\prime}\in O_{\phi,r}}p_{m}(% \phi^{\prime})\right\}\rightarrowdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log { roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } → J(ϕ0,ϕ;Oϕ,r) in probability, and𝐽subscriptitalic-ϕ0italic-ϕsubscript𝑂italic-ϕ𝑟 in probability, and\displaystyle J(\phi_{0},\phi;O_{\phi,r})\text{ in probability, and }italic_J ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ; italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT ) in probability, and
J(ϕ0,ϕ;Oϕ,r)𝐽subscriptitalic-ϕ0italic-ϕsubscript𝑂italic-ϕ𝑟absent\displaystyle J(\phi_{0},\phi;O_{\phi,r})\leqitalic_J ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ ; italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT ) ≤ 1mεEϕ0{log(supϕOϕ,rqmε(ϕ))}1subscript𝑚𝜀subscript𝐸subscriptitalic-ϕ0subscriptsupremumsuperscriptitalic-ϕsubscript𝑂italic-ϕ𝑟subscript𝑞subscript𝑚𝜀superscriptitalic-ϕ\displaystyle\frac{1}{m_{\varepsilon}}E_{\phi_{0}}\left\{\log\left(\sup_{\phi^% {\prime}\in O_{\phi,r}}q_{m_{\varepsilon}}(\phi^{\prime})\right)\right\}divide start_ARG 1 end_ARG start_ARG italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) }
<\displaystyle<< H(ϕ0,ϕ0)ε/2.𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0𝜀2\displaystyle H(\phi_{0},\phi_{0})-\varepsilon/2.italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) - italic_ε / 2 . (A.10)

Next, we use (A.10) to show the consistency of ϕ^msubscript^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT. Let C𝐶Citalic_C be any closed subset of ΦΦ\Phiroman_Φ, not containing ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Since ΦΦ\Phiroman_Φ is compact, C𝐶Citalic_C is also compact and is covered by the union of finite open sets h=1dOϕh,rsuperscriptsubscript1𝑑subscript𝑂subscriptitalic-ϕ𝑟\bigcup_{h=1}^{d}O_{\phi_{h},r}⋃ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT italic_O start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_r end_POSTSUBSCRIPT, where {ϕ1,,ϕd}subscriptitalic-ϕ1subscriptitalic-ϕ𝑑\{\phi_{1},\ldots,\phi_{d}\}{ italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT } is a finite set in C𝐶Citalic_C. Therefore,

supϕC{logpm(ϕ)logpm(ϕ0)}subscriptsupremumitalic-ϕ𝐶subscript𝑝𝑚italic-ϕsubscript𝑝𝑚subscriptitalic-ϕ0\displaystyle\sup_{\phi\in C}\left\{\log p_{m}(\phi)-\log p_{m}(\phi_{0})\right\}roman_sup start_POSTSUBSCRIPT italic_ϕ ∈ italic_C end_POSTSUBSCRIPT { roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) - roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) }
\displaystyle\leq max1hd[m{1mlog(supϕOϕh,rpm(ϕ))1mlogpm(ϕ0)}]subscript1𝑑𝑚1𝑚subscriptsupremumitalic-ϕsubscript𝑂subscriptitalic-ϕ𝑟subscript𝑝𝑚italic-ϕ1𝑚subscript𝑝𝑚subscriptitalic-ϕ0\displaystyle\max_{1\leq h\leq d}\left[m\left\{\frac{1}{m}\log\left(\sup_{\phi% \in O_{\phi_{h},r}}p_{m}(\phi)\right)-\frac{1}{m}\log p_{m}(\phi_{0})\right\}\right]roman_max start_POSTSUBSCRIPT 1 ≤ italic_h ≤ italic_d end_POSTSUBSCRIPT [ italic_m { divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log ( roman_sup start_POSTSUBSCRIPT italic_ϕ ∈ italic_O start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ ) ) - divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) } ]
\displaystyle\rightarrow  in probability, in probability\displaystyle-\infty\text{ in probability},- ∞ in probability ,

where the limit in the last line holds due to (A.10) and that m1logpm(ϕ0)H(ϕ0,ϕ0)superscript𝑚1subscript𝑝𝑚subscriptitalic-ϕ0𝐻subscriptitalic-ϕ0subscriptitalic-ϕ0m^{-1}\log p_{m}(\phi_{0})\to H(\phi_{0},\phi_{0})italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) → italic_H ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) almost surely as m𝑚m\to\inftyitalic_m → ∞ by Birkhoff’s ergodic theorem [Birkhoff, 1931]. Since ϕ^msubscript^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is a maximum likelihood estimator, logpm(ϕ^m)logpm(ϕ0)subscript𝑝𝑚subscript^italic-ϕ𝑚subscript𝑝𝑚subscriptitalic-ϕ0\log p_{m}(\hat{\phi}_{m})\geq\log p_{m}(\phi_{0})roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ≥ roman_log italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). Therefore, ϕ^msubscript^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT cannot be in C𝐶Citalic_C. In other words, for any open set Oϕ0,rΦsubscript𝑂subscriptitalic-ϕ0𝑟ΦO_{\phi_{0},r}\subseteq\Phiitalic_O start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_r end_POSTSUBSCRIPT ⊆ roman_Φ containing ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, ϕ^msubscript^italic-ϕ𝑚\hat{\phi}_{m}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT must be in Oϕ,rsubscript𝑂italic-ϕ𝑟O_{\phi,r}italic_O start_POSTSUBSCRIPT italic_ϕ , italic_r end_POSTSUBSCRIPT for large m𝑚mitalic_m. Letting r0𝑟0r\rightarrow 0italic_r → 0, we conclude that ϕ^mϕ0subscript^italic-ϕ𝑚subscriptitalic-ϕ0\hat{\phi}_{m}\rightarrow\phi_{0}over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT → italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in probability.

A.3 FDR control under oracle case

We consider the case that ϕ0=(A,f1,f2)subscriptitalic-ϕ0𝐴subscript𝑓1subscript𝑓2\phi_{0}=(A,f_{1},f_{2})italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( italic_A , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) is known. The following theorem shows that FDR can be controlled under the oracle case.

Theorem A.1.

Under the oracle case where ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is known, denote rLISj=Pϕ0(sj{0,1,2}(y1j,y2j)j=1m)subscriptnormal-rLIS𝑗subscript𝑃subscriptitalic-ϕ0subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚{\rm rLIS}_{j}=P_{\phi_{0}}(s_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{j=1}^{m})roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) for j=1,,m𝑗1normal-…𝑚j=1,\ldots,mitalic_j = 1 , … , italic_m. Order the test statistics rLIS(1)rLIS(m)subscriptnormal-rLIS1normal-…subscriptnormal-rLIS𝑚{\rm rLIS}_{(1)}\leq\ldots\leq{\rm rLIS}_{(m)}roman_rLIS start_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT ≤ … ≤ roman_rLIS start_POSTSUBSCRIPT ( italic_m ) end_POSTSUBSCRIPT with the corresponding null hypotheses H0(1),,H0(m)subscript𝐻01normal-…subscript𝐻0𝑚H_{0(1)},\ldots,H_{0(m)}italic_H start_POSTSUBSCRIPT 0 ( 1 ) end_POSTSUBSCRIPT , … , italic_H start_POSTSUBSCRIPT 0 ( italic_m ) end_POSTSUBSCRIPT. For a pre-specified false discovery rate level q𝑞qitalic_q, we have the following procedure

R=𝑅absent\displaystyle R=italic_R = max{r:1rj=1rrLIS(j)q}, and:𝑟1𝑟superscriptsubscript𝑗1𝑟subscriptrLIS𝑗𝑞 and\displaystyle\max\left\{r:\frac{1}{r}\sum_{j=1}^{r}{\rm rLIS}_{(j)}\leq q% \right\},\text{ and }roman_max { italic_r : divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT roman_rLIS start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_q } , and
reject H0(j) for j=1,,R.reject subscript𝐻0𝑗 for 𝑗1𝑅\displaystyle\text{ reject }H_{0(j)}\text{ for }j=1,\ldots,R.reject italic_H start_POSTSUBSCRIPT 0 ( italic_j ) end_POSTSUBSCRIPT for italic_j = 1 , … , italic_R .

Then the testing procedure can control the FDR at level q𝑞qitalic_q.

Proof.

Denote R𝑅Ritalic_R as the number of total rejections and V𝑉Vitalic_V as the number of false rejections. If we reject the replicability null hypothesis H0jsubscript𝐻0𝑗H_{0j}italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT if rLISjλsubscriptrLIS𝑗𝜆{\rm rLIS}_{j}\leq\lambdaroman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ for some threshold λ𝜆\lambdaitalic_λ, then λ𝜆\lambdaitalic_λ satisfies

rLIS(R)λ<rLIS(R+1).subscriptrLIS𝑅𝜆subscriptrLIS𝑅1\displaystyle{\rm rLIS}_{(R)}\leq\lambda<{\rm rLIS}_{(R+1)}.roman_rLIS start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT ≤ italic_λ < roman_rLIS start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT .

Let λ=rLIS(R)𝜆subscriptrLIS𝑅\lambda={\rm rLIS}_{(R)}italic_λ = roman_rLIS start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT for simplicity. Therefore,

FDR=FDRabsent\displaystyle{\rm FDR}=roman_FDR = E{VR1}𝐸𝑉𝑅1\displaystyle E\left\{\frac{V}{R\vee 1}\right\}italic_E { divide start_ARG italic_V end_ARG start_ARG italic_R ∨ 1 end_ARG }
=\displaystyle== E{E(VR1|(y1j,y2j)j=1m)}𝐸𝐸conditional𝑉𝑅1superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚\displaystyle E\left\{E\left(\frac{V}{R\vee 1}\bigg{|}(y_{1j},y_{2j})_{j=1}^{m% }\right)\right\}italic_E { italic_E ( divide start_ARG italic_V end_ARG start_ARG italic_R ∨ 1 end_ARG | ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) }
=\displaystyle== E{1R1E(V(y1j,y2j)j=1m)}.𝐸1𝑅1𝐸conditional𝑉superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚\displaystyle E\left\{\frac{1}{R\vee 1}E\left(V\mid(y_{1j},y_{2j})_{j=1}^{m}% \right)\right\}.italic_E { divide start_ARG 1 end_ARG start_ARG italic_R ∨ 1 end_ARG italic_E ( italic_V ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) } .

The last equality holds because R𝑅Ritalic_R is a function of (y1j,y2j)j=1msuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚(y_{1j},y_{2j})_{j=1}^{m}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT. Since V=j=1mI(rLISjλ,sj{0,1,2})=j=1RI(s(j){0,1,2})𝑉superscriptsubscript𝑗1𝑚𝐼formulae-sequencesubscriptrLIS𝑗𝜆subscript𝑠𝑗012superscriptsubscript𝑗1𝑅𝐼subscript𝑠𝑗012V=\sum_{j=1}^{m}I({\rm rLIS}_{j}\leq\lambda,s_{j}\in\{0,1,2\})=\sum_{j=1}^{R}I% (s_{(j)}\in\{0,1,2\})italic_V = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( roman_rLIS start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_I ( italic_s start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ), we have

E(V(y1j,y2j)j=1m)=𝐸conditional𝑉superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚absent\displaystyle E\left(V\mid(y_{1j},y_{2j})_{j=1}^{m}\right)=italic_E ( italic_V ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) = j=1RP(s(j){0,1,2}(y1j,y2j)j=1m)=j=1RrLIS(j).superscriptsubscript𝑗1𝑅𝑃subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚superscriptsubscript𝑗1𝑅subscriptrLIS𝑗\displaystyle\sum_{j=1}^{R}P(s_{(j)}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{j=1}^{m})% =\sum_{j=1}^{R}{\rm rLIS}_{(j)}.∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_P ( italic_s start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT roman_rLIS start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT .

Consequently,

FDR=E{1R1j=1RrLIS(j)}q.FDR𝐸1𝑅1superscriptsubscript𝑗1𝑅subscriptrLIS𝑗𝑞\displaystyle{\rm FDR}=E\left\{\frac{1}{R\vee 1}\sum_{j=1}^{R}{\rm rLIS}_{(j)}% \right\}\leq q.roman_FDR = italic_E { divide start_ARG 1 end_ARG start_ARG italic_R ∨ 1 end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT roman_rLIS start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT } ≤ italic_q .

A.4 Proof of Theorem 3.2

First, we introduce some notations used in the proof. Consider an infinite hidden Markov model with hidden states {Sj}superscriptsubscriptsubscript𝑆𝑗\{S_{j}\}_{-\infty}^{\infty}{ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT and p𝑝pitalic_p-values (y1j,y2j)superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗(y_{1j},y_{2j})_{-\infty}^{\infty}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT. Denote the following test statistics

Tj=subscript𝑇𝑗absent\displaystyle T_{j}=italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = Pϕ0(sj{0,1,2}(y1j,y2j)1m),subscript𝑃subscriptitalic-ϕ0subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗1𝑚\displaystyle P_{\phi_{0}}(s_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{1}^{m}),italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) ,
T^j=subscript^𝑇𝑗absent\displaystyle\hat{T}_{j}=over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = Pϕ^m(sj{0,1,2}(y1j,y2j)1m),subscript𝑃subscript^italic-ϕ𝑚subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗1𝑚\displaystyle P_{\hat{\phi}_{m}}(s_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{1}^{m}),italic_P start_POSTSUBSCRIPT over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) ,
Tj=superscriptsubscript𝑇𝑗absent\displaystyle T_{j}^{\infty}=italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = Pϕ0(sj{0,1,2}(y1j,y2j)),subscript𝑃subscriptitalic-ϕ0subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗\displaystyle P_{\phi_{0}}(s_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{-\infty}^{% \infty}),italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) ,
T^j=superscriptsubscript^𝑇𝑗absent\displaystyle\hat{T}_{j}^{\infty}=over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = Pϕ^m(sj{0,1,2}(y1j,y2j)).subscript𝑃subscript^italic-ϕ𝑚subscript𝑠𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗\displaystyle P_{\hat{\phi}_{m}}(s_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{-\infty% }^{\infty}).italic_P start_POSTSUBSCRIPT over^ start_ARG italic_ϕ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) .

For any test statistics ξj{Tj,T^j,Tj,T^j}subscript𝜉𝑗subscript𝑇𝑗subscript^𝑇𝑗superscriptsubscript𝑇𝑗superscriptsubscript^𝑇𝑗\xi_{j}\in\{T_{j},\hat{T}_{j},T_{j}^{\infty},\hat{T}_{j}^{\infty}\}italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } corresponding to the null hypothesis H0jsubscript𝐻0𝑗H_{0j}italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT, consider the testing procedure based on ordered ξ(1)ξ(m)subscript𝜉1subscript𝜉𝑚\xi_{(1)}\leq\ldots\leq\xi_{(m)}italic_ξ start_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT ≤ … ≤ italic_ξ start_POSTSUBSCRIPT ( italic_m ) end_POSTSUBSCRIPT with corresponding null hypotheses H0(1),,H0(m)subscript𝐻01subscript𝐻0𝑚H_{0(1)},\ldots,H_{0(m)}italic_H start_POSTSUBSCRIPT 0 ( 1 ) end_POSTSUBSCRIPT , … , italic_H start_POSTSUBSCRIPT 0 ( italic_m ) end_POSTSUBSCRIPT. We have the number of rejections given by

R0=max{r:1rj=1rξ(j)q}.subscript𝑅0:𝑟1𝑟superscriptsubscript𝑗1𝑟subscript𝜉𝑗𝑞\displaystyle R_{0}=\max\left\{r:\frac{1}{r}\sum_{j=1}^{r}\xi_{(j)}\leq q% \right\}.italic_R start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_max { italic_r : divide start_ARG 1 end_ARG start_ARG italic_r end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_ξ start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_q } . (A.11)

We reject H0(j)subscript𝐻0𝑗H_{0(j)}italic_H start_POSTSUBSCRIPT 0 ( italic_j ) end_POSTSUBSCRIPT for j=1,,R0𝑗1subscript𝑅0j=1,\ldots,R_{0}italic_j = 1 , … , italic_R start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. An equivalent algorithm is

λ0=sup{λ(0,1):j=1mξjI(ξjλ){j=1mI(ξjλ)}1q}.subscript𝜆0supremumconditional-set𝜆01superscriptsubscript𝑗1𝑚subscript𝜉𝑗𝐼subscript𝜉𝑗𝜆superscriptsubscript𝑗1𝑚𝐼subscript𝜉𝑗𝜆1𝑞\displaystyle\lambda_{0}=\sup\left\{\lambda\in(0,1):\frac{\sum_{j=1}^{m}\xi_{j% }I(\xi_{j}\leq\lambda)}{\left\{\sum_{j=1}^{m}I(\xi_{j}\leq\lambda)\right\}\vee 1% }\leq q\right\}.italic_λ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_sup { italic_λ ∈ ( 0 , 1 ) : divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_I ( italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) end_ARG start_ARG { ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ ) } ∨ 1 end_ARG ≤ italic_q } . (A.12)

The rejection threshold can be written as λ0=ξ(R0)subscript𝜆0subscript𝜉subscript𝑅0\lambda_{0}=\xi_{(R_{0})}italic_λ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_ξ start_POSTSUBSCRIPT ( italic_R start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT. The total number of false rejections is V0=j=1mI(ξjλ0 and sj{0,1,2})subscript𝑉0superscriptsubscript𝑗1𝑚𝐼subscript𝜉𝑗subscript𝜆0 and subscript𝑠𝑗012V_{0}=\sum_{j=1}^{m}I(\xi_{j}\leq\lambda_{0}\text{ and }s_{j}\in\{0,1,2\})italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ). Replacing ξjsubscript𝜉𝑗\xi_{j}italic_ξ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT by Tj,T^j,Tjsubscript𝑇𝑗subscript^𝑇𝑗superscriptsubscript𝑇𝑗T_{j},\hat{T}_{j},T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT and T^jsuperscriptsubscript^𝑇𝑗\hat{T}_{j}^{\infty}over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT, the number of rejections and number of false rejections are denoted by (R,V)𝑅𝑉(R,V)( italic_R , italic_V ), (R^,V^)^𝑅^𝑉(\hat{R},\hat{V})( over^ start_ARG italic_R end_ARG , over^ start_ARG italic_V end_ARG ), (R,V)superscript𝑅superscript𝑉(R^{\infty},V^{\infty})( italic_R start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT , italic_V start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) and (R^,V^)superscript^𝑅superscript^𝑉(\hat{R}^{\infty},\hat{V}^{\infty})( over^ start_ARG italic_R end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT , over^ start_ARG italic_V end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ). Moreover, we define the corresponding rejection thresholds as λ^OR,λ^rLIS,λ^OR,λ^rLISsubscript^𝜆ORsubscript^𝜆rLISsuperscriptsubscript^𝜆ORsuperscriptsubscript^𝜆rLIS\hat{\lambda}_{\mathrm{OR}},\hat{\lambda}_{\mathrm{rLIS}},\hat{\lambda}_{% \mathrm{OR}}^{\infty},\hat{\lambda}_{\mathrm{rLIS}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT , over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT , over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT , over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT. Next, we consider the distribution of Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT. Since {Sj}superscriptsubscriptsubscript𝑆𝑗\{S_{j}\}_{-\infty}^{\infty}{ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT is stationary, irreducible, and aperiodic, the two-sided generalization of Theorem 6.1.3 in Durrett [2019] implies that {Tj}superscriptsubscript𝑇𝑗\{T_{j}^{\infty}\}{ italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } is ergodic. Therefore, Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT are identically distributed. Denote the cumulative distribution function of Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT as

Pϕ0(Tjt)=G(t).subscript𝑃subscriptitalic-ϕ0superscriptsubscript𝑇𝑗𝑡superscript𝐺𝑡\displaystyle P_{\phi_{0}}(T_{j}^{\infty}\leq t)=G^{\infty}(t).italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) = italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) .

Denote the conditional cumulative distribution function of Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT given sj=ksubscript𝑠𝑗𝑘s_{j}=kitalic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k as

Pϕ0(Tjtsj=k)=Gk(t) for k=0,1,2,3.formulae-sequencesubscript𝑃subscriptitalic-ϕ0superscriptsubscript𝑇𝑗conditional𝑡subscript𝑠𝑗𝑘superscriptsubscript𝐺𝑘𝑡 for 𝑘0123\displaystyle P_{\phi_{0}}(T_{j}^{\infty}\leq t\mid s_{j}=k)=G_{k}^{\infty}(t)% \text{ for }k=0,1,2,3.italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ∣ italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ) = italic_G start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) for italic_k = 0 , 1 , 2 , 3 .

Thus

G(t)=π0G0(t)+π1G1(t)+π2G2(t)+π3G3(t).superscript𝐺𝑡subscript𝜋0superscriptsubscript𝐺0𝑡subscript𝜋1superscriptsubscript𝐺1𝑡subscript𝜋2superscriptsubscript𝐺2𝑡subscript𝜋3superscriptsubscript𝐺3𝑡G^{\infty}(t)=\pi_{0}G_{0}^{\infty}(t)+\pi_{1}G_{1}^{\infty}(t)+\pi_{2}G_{2}^{% \infty}(t)+\pi_{3}G_{3}^{\infty}(t).italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) .

Let

α*=inf{0t1:G(t)=1}.subscript𝛼infimumconditional-set0𝑡1superscript𝐺𝑡1\displaystyle\alpha_{*}=\inf\{0\leq t\leq 1:G^{\infty}(t)=1\}.italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT = roman_inf { 0 ≤ italic_t ≤ 1 : italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = 1 } . (A.13)

By the forward-backward algorithm [Baum et al., 1970],

Tj=sj=02αj(sj)βj(sj)sj=03αj(sj)βj(sj),superscriptsubscript𝑇𝑗superscriptsubscriptsubscript𝑠𝑗02subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗subscript𝑠𝑗superscriptsubscriptsubscript𝑠𝑗03subscript𝛼𝑗subscript𝑠𝑗subscript𝛽𝑗subscript𝑠𝑗\displaystyle T_{j}^{\infty}=\frac{\sum_{s_{j}=0}^{2}\alpha_{j}(s_{j})\beta_{j% }(s_{j})}{\sum_{s_{j}=0}^{3}\alpha_{j}(s_{j})\beta_{j}(s_{j})},italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = divide start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_ARG ,

where αj(sj)=Pϕ0((y1t,y2t)t=j,sj)subscript𝛼𝑗subscript𝑠𝑗subscript𝑃subscriptitalic-ϕ0superscriptsubscriptsubscript𝑦1𝑡subscript𝑦2𝑡𝑡𝑗subscript𝑠𝑗\alpha_{j}(s_{j})=P_{\phi_{0}}((y_{1t},y_{2t})_{t=-\infty}^{j},s_{j})italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t = - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) and βj(sj)=Pϕ0((y1t,y2t)t=j+1sj)subscript𝛽𝑗subscript𝑠𝑗subscript𝑃subscriptitalic-ϕ0conditionalsuperscriptsubscriptsubscript𝑦1𝑡subscript𝑦2𝑡𝑡𝑗1subscript𝑠𝑗\beta_{j}(s_{j})=P_{\phi_{0}}((y_{1t},y_{2t})_{t=j+1}^{\infty}\mid s_{j})italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_t end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ). αj()subscript𝛼𝑗\alpha_{j}(\cdot)italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ⋅ ) and βj()subscript𝛽𝑗\beta_{j}(\cdot)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ⋅ ) can be derived recursively by αj+1(sj+1)=sj=03αj(sj)asj,sj+1f(sj+1)(y1,j+1,y2,j+1)subscript𝛼𝑗1subscript𝑠𝑗1superscriptsubscriptsubscript𝑠𝑗03subscript𝛼𝑗subscript𝑠𝑗subscript𝑎subscript𝑠𝑗subscript𝑠𝑗1superscript𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1\alpha_{j+1}(s_{j+1})=\sum_{s_{j}=0}^{3}\alpha_{j}(s_{j})a_{s_{j},s_{j+1}}f^{(% s_{j}+1)}(y_{1,j+1},y_{2,j+1})italic_α start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_α start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) and βj(sj)=sj+1=03βj+1(sj+1)f(sj+1)(y1,j+1,y2,j+1)subscript𝛽𝑗subscript𝑠𝑗superscriptsubscriptsubscript𝑠𝑗103subscript𝛽𝑗1subscript𝑠𝑗1superscript𝑓subscript𝑠𝑗1subscript𝑦1𝑗1subscript𝑦2𝑗1\beta_{j}(s_{j})=\sum_{s_{j+1}=0}^{3}\beta_{j+1}(s_{j+1})f^{(s_{j+1})}(y_{1,j+% 1},y_{2,j+1})italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) italic_f start_POSTSUPERSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ). Since the joint distribution of (y1j,y2j)superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗(y_{1j},y_{2j})_{-\infty}^{\infty}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT is continuous, and Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT is a continuous map from (y1j,y2j)superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗(y_{1j},y_{2j})_{-\infty}^{\infty}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT to (0, 1), the probability density function of Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT is positive and continuous. It suffices to show that Gsuperscript𝐺G^{\infty}italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT is strictly increasing in (0,α*)0subscript𝛼(0,\alpha_{*})( 0 , italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ). For some threshold λ>0𝜆0\lambda>0italic_λ > 0, define the number of rejections and false rejections as

Rλ=superscriptsubscript𝑅𝜆absent\displaystyle R_{\lambda}^{\infty}=italic_R start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = j=1mI(Tjλ),superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑇𝑗𝜆\displaystyle\sum_{j=1}^{m}I(T_{j}^{\infty}\leq\lambda),∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_λ ) ,
Vλ=superscriptsubscript𝑉𝜆absent\displaystyle V_{\lambda}^{\infty}=italic_V start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = j=1mI(Tjλ,sj{0,1,2}).superscriptsubscript𝑗1𝑚𝐼formulae-sequencesuperscriptsubscript𝑇𝑗𝜆subscript𝑠𝑗012\displaystyle\sum_{j=1}^{m}I(T_{j}^{\infty}\leq\lambda,s_{j}\in\{0,1,2\}).∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_λ , italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) .

Thus we have the expectations

E(Rλ)=𝐸superscriptsubscript𝑅𝜆absent\displaystyle E(R_{\lambda}^{\infty})=italic_E ( italic_R start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) = mG(λ),𝑚superscript𝐺𝜆\displaystyle mG^{\infty}(\lambda),italic_m italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) ,
E(Vλ)=𝐸superscriptsubscript𝑉𝜆absent\displaystyle E(V_{\lambda}^{\infty})=italic_E ( italic_V start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) = m(π0G0(λ)+π1G1(λ)+π2G2(λ)).𝑚subscript𝜋0superscriptsubscript𝐺0𝜆subscript𝜋1superscriptsubscript𝐺1𝜆subscript𝜋2superscriptsubscript𝐺2𝜆\displaystyle m(\pi_{0}G_{0}^{\infty}(\lambda)+\pi_{1}G_{1}^{\infty}(\lambda)+% \pi_{2}G_{2}^{\infty}(\lambda)).italic_m ( italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) + italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) ) .

Therefore, the marginal false discovery rate is

QOR(λ)=E(Vλ)/E(Rλ)=(π0G0(λ)+π1G1(λ)+π2G2(λ))/G(λ).superscriptsubscript𝑄OR𝜆𝐸superscriptsubscript𝑉𝜆𝐸superscriptsubscript𝑅𝜆subscript𝜋0superscriptsubscript𝐺0𝜆subscript𝜋1superscriptsubscript𝐺1𝜆subscript𝜋2superscriptsubscript𝐺2𝜆superscript𝐺𝜆Q_{\text{OR}}^{\infty}(\lambda)=E(V_{\lambda}^{\infty})/E(R_{\lambda}^{\infty}% )=(\pi_{0}G_{0}^{\infty}(\lambda)+\pi_{1}G_{1}^{\infty}(\lambda)+\pi_{2}G_{2}^% {\infty}(\lambda))/G^{\infty}(\lambda).italic_Q start_POSTSUBSCRIPT OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) = italic_E ( italic_V start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) / italic_E ( italic_R start_POSTSUBSCRIPT italic_λ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) = ( italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) + italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) ) / italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) .

Theorem 1 of Sun and Cai [2009] implies that QOR(λ)superscriptsubscript𝑄OR𝜆Q_{\text{OR}}^{\infty}(\lambda)italic_Q start_POSTSUBSCRIPT OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) is increasing in λ𝜆\lambdaitalic_λ. Define the threshold based on the marginal false discovery rate as

λOR=sup{λ:QOR(λ)q}.superscriptsubscript𝜆ORsupremumconditional-set𝜆superscriptsubscript𝑄OR𝜆𝑞\lambda_{\text{OR}}^{\infty}=\sup\{\lambda:Q_{\text{OR}}^{\infty}(\lambda)\leq q\}.italic_λ start_POSTSUBSCRIPT OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = roman_sup { italic_λ : italic_Q start_POSTSUBSCRIPT OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ ) ≤ italic_q } .

Since G(t)=1superscript𝐺𝑡1G^{\infty}(t)=1italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = 1 is equivalent to the statement that Gs(t)=1superscriptsubscript𝐺𝑠𝑡1G_{s}^{\infty}(t)=1italic_G start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = 1 for s=0,1,2,3𝑠0123s=0,1,2,3italic_s = 0 , 1 , 2 , 3, we have

QOR(α*)=π0+π1+π2>qsuperscriptsubscript𝑄ORsubscript𝛼subscript𝜋0subscript𝜋1subscript𝜋2𝑞\displaystyle Q_{\mathrm{OR}}^{\infty}(\alpha_{*})=\pi_{0}+\pi_{1}+\pi_{2}>qitalic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ) = italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > italic_q

under (C2) with π3<1qsubscript𝜋31𝑞\pi_{3}<1-qitalic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT < 1 - italic_q. Without loss of generality, we assume λOR<α*.superscriptsubscript𝜆ORsubscript𝛼\lambda_{\mathrm{OR}}^{\infty}<\alpha_{*}.italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT < italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT . With the notations above, we will prove Theorem 3.2 as follows. In Step 1, we show that the total number of rejections R𝑅Ritalic_R and R^^𝑅\hat{R}over^ start_ARG italic_R end_ARG converge to infinity almost surely. In Step 2, we show that E|R/R^1|0𝐸𝑅^𝑅10E|R/\hat{R}-1|\rightarrow 0italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | → 0 and E|V/V^1|0𝐸𝑉^𝑉10E|V/\hat{V}-1|\rightarrow 0italic_E | italic_V / over^ start_ARG italic_V end_ARG - 1 | → 0 as m𝑚m\to\inftyitalic_m → ∞. Finally, we show the asymptotic false discovery rate control in Step 3. Step 1. Asymptotic behavior of rejection numbers. Recall that λ^ORsuperscriptsubscript^𝜆OR\hat{\lambda}_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT and λ^rLISsuperscriptsubscript^𝜆rLIS\hat{\lambda}_{\mathrm{rLIS}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT are the corresponding rejection threshold given by {Tj}j=1msuperscriptsubscriptsuperscriptsubscript𝑇𝑗𝑗1𝑚\{T_{j}^{\infty}\}_{j=1}^{m}{ italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and {T^j}j=1msuperscriptsubscriptsuperscriptsubscript^𝑇𝑗𝑗1𝑚\{\hat{T}_{j}^{\infty}\}_{j=1}^{m}{ over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT. First, we show λ^ORλORsuperscriptsubscript^𝜆ORsuperscriptsubscript𝜆OR\hat{\lambda}_{\mathrm{OR}}^{\infty}\rightarrow\lambda_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT and λ^rLISλORsuperscriptsubscript^𝜆rLISsuperscriptsubscript𝜆OR\hat{\lambda}_{\mathrm{rLIS}}^{\infty}\rightarrow\lambda_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability by Lemma 1.

Lemma 1.

Assume (C1)-(C4) hold. λ^ORλORnormal-→superscriptsubscriptnormal-^𝜆normal-ORsuperscriptsubscript𝜆normal-OR\hat{\lambda}_{\mathrm{OR}}^{\infty}\rightarrow\lambda_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT and λ^rLISλORnormal-→superscriptsubscriptnormal-^𝜆normal-rLISsuperscriptsubscript𝜆normal-OR\hat{\lambda}_{\mathrm{rLIS}}^{\infty}\rightarrow\lambda_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability.

We next show R^^𝑅\hat{R}\rightarrow\inftyover^ start_ARG italic_R end_ARG → ∞ almost surely. For simplicity, denote (y1j,y2j)j=j1j2superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑗subscript𝑗1subscript𝑗2(y_{1j},y_{2j})_{j=j_{1}}^{j_{2}}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_j = italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT as yj1j2superscriptsubscript𝑦subscript𝑗1subscript𝑗2y_{j_{1}}^{j_{2}}italic_y start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT for any j1<j2subscript𝑗1subscript𝑗2j_{1}<j_{2}italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. By (C2), ε0alk(ϕ)1subscript𝜀0subscript𝑎𝑙𝑘italic-ϕ1\varepsilon_{0}\leq a_{lk}(\phi)\leq 1italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≤ italic_a start_POSTSUBSCRIPT italic_l italic_k end_POSTSUBSCRIPT ( italic_ϕ ) ≤ 1 for all l,k𝑙𝑘l,kitalic_l , italic_k and (C5), for any k,k=0,1,2,3formulae-sequence𝑘superscript𝑘0123k,k^{\prime}=0,1,2,3italic_k , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 , 1 , 2 , 3, f(k)(y1,j+1,y2,j+1;ϕ)/f(k)(y1,j+1,y2,j+1;ϕ)ρ0(y1,j+1,y2,j+1)superscript𝑓superscript𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsuperscript𝑓𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsubscript𝜌0subscript𝑦1𝑗1subscript𝑦2𝑗1f^{(k^{\prime})}(y_{1,j+1},y_{2,j+1};\phi)/f^{(k)}(y_{1,j+1},y_{2,j+1};\phi)% \leq\rho_{0}(y_{1,j+1},y_{2,j+1})italic_f start_POSTSUPERSCRIPT ( italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) / italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) ≤ italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ). Then for any states k,k𝑘superscript𝑘k,k^{\prime}italic_k , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and l=0,1,2,3𝑙0123l=0,1,2,3italic_l = 0 , 1 , 2 , 3,

Pϕ(Sj+1=kSj=l,y1m)Pϕ(Sj+1=kSj=l,y1m)\displaystyle\frac{P_{\phi}(S_{j+1}=k^{\prime}\mid S_{j}=l,y_{1}^{m})}{P_{\phi% }(S_{j+1}=k\mid S_{j}=l,y_{1}^{m})}divide start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG
=\displaystyle== Pϕ(Sj+1=k,Sj=l,y1m)Pϕ(Sj+1=k,Sj=l,y1m)subscript𝑃italic-ϕformulae-sequencesubscript𝑆𝑗1superscript𝑘subscript𝑆𝑗𝑙superscriptsubscript𝑦1𝑚subscript𝑃italic-ϕformulae-sequencesubscript𝑆𝑗1𝑘subscript𝑆𝑗𝑙superscriptsubscript𝑦1𝑚\displaystyle\frac{P_{\phi}(S_{j+1}=k^{\prime},S_{j}=l,y_{1}^{m})}{P_{\phi}(S_% {j+1}=k,S_{j}=l,y_{1}^{m})}divide start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k , italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG
=\displaystyle== Pϕ(Sj+1=k,y1mSj=l)Pϕ(Sj+1=k,y1mSj=l)subscript𝑃italic-ϕformulae-sequencesubscript𝑆𝑗1superscript𝑘conditionalsuperscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑙subscript𝑃italic-ϕformulae-sequencesubscript𝑆𝑗1𝑘conditionalsuperscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑙\displaystyle\frac{P_{\phi}(S_{j+1}=k^{\prime},y_{1}^{m}\mid S_{j}=l)}{P_{\phi% }(S_{j+1}=k,y_{1}^{m}\mid S_{j}=l)}divide start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l ) end_ARG
=\displaystyle== k0=03Pϕ(Sj+1=k,Sj+2=k0,y1mSj=l)k0=03Pϕ(Sj+1=k,Sj+2=k0,y1mSj=l)superscriptsubscriptsubscript𝑘003subscript𝑃italic-ϕformulae-sequencesubscript𝑆𝑗1superscript𝑘formulae-sequencesubscript𝑆𝑗2subscript𝑘0conditionalsuperscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑙superscriptsubscriptsubscript𝑘003subscript𝑃italic-ϕformulae-sequencesubscript𝑆𝑗1𝑘formulae-sequencesubscript𝑆𝑗2subscript𝑘0conditionalsuperscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑙\displaystyle\frac{\sum_{k_{0}=0}^{3}P_{\phi}(S_{j+1}=k^{\prime},S_{j+2}=k_{0}% ,y_{1}^{m}\mid S_{j}=l)}{\sum_{k_{0}=0}^{3}P_{\phi}(S_{j+1}=k,S_{j+2}=k_{0},y_% {1}^{m}\mid S_{j}=l)}divide start_ARG ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k , italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l ) end_ARG
=\displaystyle== Pϕ(y1jSj=l;ϕ)k0=03alk(ϕ)akk0(ϕ)f(k)(y1,j+1,y2,j+1;ϕ)Pϕ(yj+2mSj+2=k0;ϕ)Pϕ(y1jSj=l;ϕ)k0=03alk(ϕ)akk0(ϕ)f(k)(y1,j+1,y2,j+1;ϕ)Pϕ(yj+2mSj+2=k0;ϕ)subscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦1𝑗subscript𝑆𝑗𝑙italic-ϕsuperscriptsubscriptsubscript𝑘003subscript𝑎𝑙superscript𝑘italic-ϕsubscript𝑎superscript𝑘subscript𝑘0italic-ϕsuperscript𝑓superscript𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsubscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦𝑗2𝑚subscript𝑆𝑗2subscript𝑘0italic-ϕsubscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦1𝑗subscript𝑆𝑗𝑙italic-ϕsuperscriptsubscriptsubscript𝑘003subscript𝑎𝑙𝑘italic-ϕsubscript𝑎𝑘subscript𝑘0italic-ϕsuperscript𝑓𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsubscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦𝑗2𝑚subscript𝑆𝑗2subscript𝑘0italic-ϕ\displaystyle\frac{P_{\phi}(y_{1}^{j}\mid S_{j}=l;\phi)\sum_{k_{0}=0}^{3}a_{lk% ^{\prime}}(\phi)a_{k^{\prime}k_{0}}(\phi)f^{(k^{\prime})}(y_{1,j+1},y_{2,j+1};% \phi)P_{\phi}(y_{j+2}^{m}\mid S_{j+2}=k_{0};\phi)}{P_{\phi}(y_{1}^{j}\mid S_{j% }=l;\phi)\sum_{k_{0}=0}^{3}a_{lk}(\phi)a_{kk_{0}}(\phi)f^{(k)}(y_{1,j+1},y_{2,% j+1};\phi)P_{\phi}(y_{j+2}^{m}\mid S_{j+2}=k_{0};\phi)}divide start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l ; italic_ϕ ) ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_l italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_a start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_f start_POSTSUPERSCRIPT ( italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l ; italic_ϕ ) ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_l italic_k end_POSTSUBSCRIPT ( italic_ϕ ) italic_a start_POSTSUBSCRIPT italic_k italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG
=\displaystyle== f(k)(y1,j+1,y2,j+1;ϕ)k0=03alk(ϕ)akk0(ϕ)Pϕ(yj+2mSj+2=k0;ϕ)f(k)(y1,j+1,y2,j+1;ϕ)k0=03alk(ϕ)akk0(ϕ)Pϕ(yj+2mSj+2=k0;ϕ)superscript𝑓superscript𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsuperscriptsubscriptsubscript𝑘003subscript𝑎𝑙superscript𝑘italic-ϕsubscript𝑎superscript𝑘subscript𝑘0italic-ϕsubscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦𝑗2𝑚subscript𝑆𝑗2subscript𝑘0italic-ϕsuperscript𝑓𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsuperscriptsubscriptsubscript𝑘003subscript𝑎𝑙𝑘italic-ϕsubscript𝑎𝑘subscript𝑘0italic-ϕsubscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦𝑗2𝑚subscript𝑆𝑗2subscript𝑘0italic-ϕ\displaystyle\frac{f^{(k^{\prime})}(y_{1,j+1},y_{2,j+1};\phi)\sum_{k_{0}=0}^{3% }a_{lk^{\prime}}(\phi)a_{k^{\prime}k_{0}}(\phi)P_{\phi}(y_{j+2}^{m}\mid S_{j+2% }=k_{0};\phi)}{f^{(k)}(y_{1,j+1},y_{2,j+1};\phi)\sum_{k_{0}=0}^{3}a_{lk}(\phi)% a_{kk_{0}}(\phi)P_{\phi}(y_{j+2}^{m}\mid S_{j+2}=k_{0};\phi)}divide start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_l italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_a start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_l italic_k end_POSTSUBSCRIPT ( italic_ϕ ) italic_a start_POSTSUBSCRIPT italic_k italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ϕ ) italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG
\displaystyle\leq f(k)(y1,j+1,y2,j+1;ϕ)k0=03Pϕ(yj+2mSj+2=k0;ϕ)f(k)(y1,j+1,y2,j+1;ϕ)k0=03ε02Pϕ(yj+2mSj+2=k0;ϕ)superscript𝑓superscript𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsuperscriptsubscriptsubscript𝑘003subscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦𝑗2𝑚subscript𝑆𝑗2subscript𝑘0italic-ϕsuperscript𝑓𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsuperscriptsubscriptsubscript𝑘003superscriptsubscript𝜀02subscript𝑃italic-ϕconditionalsuperscriptsubscript𝑦𝑗2𝑚subscript𝑆𝑗2subscript𝑘0italic-ϕ\displaystyle\frac{f^{(k^{\prime})}(y_{1,j+1},y_{2,j+1};\phi)\sum_{k_{0}=0}^{3% }P_{\phi}(y_{j+2}^{m}\mid S_{j+2}=k_{0};\phi)}{f^{(k)}(y_{1,j+1},y_{2,j+1};% \phi)\sum_{k_{0}=0}^{3}\varepsilon_{0}^{2}P_{\phi}(y_{j+2}^{m}\mid S_{j+2}=k_{% 0};\phi)}divide start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j + 2 end_POSTSUBSCRIPT = italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG
=\displaystyle== ε02f(k)(y1,j+1,y2,j+1;ϕ)f(k)(y1,j+1,y2,j+1;ϕ)superscriptsubscript𝜀02superscript𝑓superscript𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕsuperscript𝑓𝑘subscript𝑦1𝑗1subscript𝑦2𝑗1italic-ϕ\displaystyle\varepsilon_{0}^{-2}\frac{f^{(k^{\prime})}(y_{1,j+1},y_{2,j+1};% \phi)}{f^{(k)}(y_{1,j+1},y_{2,j+1};\phi)}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT divide start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG start_ARG italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ; italic_ϕ ) end_ARG
\displaystyle\leq ε02ρ0(y1,j+1,y2,j+1).superscriptsubscript𝜀02subscript𝜌0subscript𝑦1𝑗1subscript𝑦2𝑗1\displaystyle\varepsilon_{0}^{-2}\rho_{0}(y_{1,j+1},y_{2,j+1}).italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) .

Let τ0(y1,y2)=(1+3ε02ρ0(y1,y2))1subscript𝜏0subscript𝑦1subscript𝑦2superscript13superscriptsubscript𝜀02subscript𝜌0subscript𝑦1subscript𝑦21\tau_{0}(y_{1},y_{2})=(1+3\varepsilon_{0}^{-2}\rho_{0}(y_{1},y_{2}))^{-1}italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = ( 1 + 3 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Since k=03Pϕ(Sj+1=kSj=l,y1m)=1\sum_{k^{\prime}=0}^{3}P_{\phi}(S_{j+1}=k^{\prime}\mid S_{j}=l,y_{1}^{m})=1∑ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) = 1, we conclude that for all k,l=0,1,2,3formulae-sequence𝑘𝑙0123k,l=0,1,2,3italic_k , italic_l = 0 , 1 , 2 , 3,

Pϕ(Sj+1=kSj=l,y1m)=\displaystyle P_{\phi}(S_{j+1}=k\mid S_{j}=l,y_{1}^{m})=italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) = Pϕ(Sj+1=kSj=l,y1m)k=03Pϕ(Sj+1=kSj=l,y1m)\displaystyle\frac{P_{\phi}(S_{j+1}=k\mid S_{j}=l,y_{1}^{m})}{\sum_{k^{\prime}% =0}^{3}P_{\phi}(S_{j+1}=k^{\prime}\mid S_{j}=l,y_{1}^{m})}divide start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG
=\displaystyle== 11+kkPϕ(Sj+1=kSj=l,y1m)Pϕ(Sj+1=kSj=l,y1m)\displaystyle\frac{1}{1+\sum_{k^{\prime}\neq k}\frac{P_{\phi}(S_{j+1}=k^{% \prime}\mid S_{j}=l,y_{1}^{m})}{P_{\phi}(S_{j+1}=k\mid S_{j}=l,y_{1}^{m})}}divide start_ARG 1 end_ARG start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_k end_POSTSUBSCRIPT divide start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) end_ARG end_ARG
\displaystyle\geq {1+3ε02ρ0(y1,j+1,y2,j+1)}1.superscript13superscriptsubscript𝜀02subscript𝜌0subscript𝑦1𝑗1subscript𝑦2𝑗11\displaystyle\{1+3\varepsilon_{0}^{-2}\rho_{0}(y_{1,j+1},y_{2,j+1})\}^{-1}.{ 1 + 3 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) } start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .

Define

τ0(y1,j+1,y2,j+1)={1+3ε02ρ0(y1,j+1,y2,j+1)}1.subscript𝜏0subscript𝑦1𝑗1subscript𝑦2𝑗1superscript13superscriptsubscript𝜀02subscript𝜌0subscript𝑦1𝑗1subscript𝑦2𝑗11\displaystyle\tau_{0}(y_{1,j+1},y_{2,j+1})=\{1+3\varepsilon_{0}^{-2}\rho_{0}(y% _{1,j+1},y_{2,j+1})\}^{-1}.italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) = { 1 + 3 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) } start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT .

Then

Pϕ(Sj+1=kSj=l,y1m)τ0(y1,j+1,y2,j+1).\displaystyle P_{\phi}(S_{j+1}=k\mid S_{j}=l,y_{1}^{m})\geq\tau_{0}(y_{1,j+1},% y_{2,j+1}).italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) ≥ italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j + 1 end_POSTSUBSCRIPT ) . (A.14)

Then we apply Lemma 2 below to show R𝑅R\rightarrow\inftyitalic_R → ∞ and R^^𝑅\hat{R}\rightarrow\inftyover^ start_ARG italic_R end_ARG → ∞ almost surely as m𝑚m\rightarrow\inftyitalic_m → ∞.

Lemma 2.

If (A.14) and (C1)-(C3) hold, then R/mG(q/2)>0𝑅𝑚superscript𝐺𝑞20R/m\geq G^{\infty}(q/2)>0italic_R / italic_m ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) > 0 and R^/mG(q/2)>0normal-^𝑅𝑚superscript𝐺𝑞20\hat{R}/m\geq G^{\infty}(q/2)>0over^ start_ARG italic_R end_ARG / italic_m ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) > 0 almost surely.

Step 2. Convergence of R/R^𝑅^𝑅R/\hat{R}italic_R / over^ start_ARG italic_R end_ARG and V/V^𝑉^𝑉V/\hat{V}italic_V / over^ start_ARG italic_V end_ARG in expectation. In Step 2, we show E|R/R^1|0𝐸𝑅^𝑅10E|R/\hat{R}-1|\rightarrow 0italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | → 0 and E|V/V^1|0𝐸𝑉^𝑉10E|V/\hat{V}-1|\rightarrow 0italic_E | italic_V / over^ start_ARG italic_V end_ARG - 1 | → 0 under the general case that 0<λOR<α*0superscriptsubscript𝜆ORsubscript𝛼0<\lambda_{\rm{OR}}^{\infty}<\alpha_{*}0 < italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT < italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT.

Lemma 3.

If 0<λ𝑂𝑅<α*0superscriptsubscript𝜆𝑂𝑅subscript𝛼0<\lambda_{\text{OR}}^{\infty}<\alpha_{*}0 < italic_λ start_POSTSUBSCRIPT OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT < italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT, then E|R/R^1|0normal-→𝐸𝑅normal-^𝑅10E|R/\hat{R}-1|\rightarrow 0italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | → 0 and E|V/V^1|0normal-→𝐸𝑉normal-^𝑉10E|V/\hat{V}-1|\rightarrow 0italic_E | italic_V / over^ start_ARG italic_V end_ARG - 1 | → 0 as mnormal-→𝑚m\to\inftyitalic_m → ∞.

When λORα*subscript𝜆ORsubscript𝛼\lambda_{\rm OR}\geq\alpha_{*}italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ≥ italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT, R/m𝑅𝑚R/mitalic_R / italic_m tends to 1111 as m𝑚m\to\inftyitalic_m → ∞, which means all the null hypotheses will be rejected. This is not a feasible case. Step 3. Asymptotic FDR control. We have

V^R^1VR1^𝑉^𝑅1𝑉𝑅1absent\displaystyle\frac{\hat{V}}{\hat{R}\vee 1}-\frac{V}{R\vee 1}\leqdivide start_ARG over^ start_ARG italic_V end_ARG end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG - divide start_ARG italic_V end_ARG start_ARG italic_R ∨ 1 end_ARG ≤ V^R^1(1VV^1)+VR1(R1R^11).^𝑉^𝑅11𝑉^𝑉1𝑉𝑅1𝑅1^𝑅11\displaystyle\frac{\hat{V}}{\hat{R}\vee 1}\left(1-\frac{V}{\hat{V}\vee 1}% \right)+\frac{V}{R\vee 1}\left(\frac{R\vee 1}{\hat{R}\vee 1}-1\right).divide start_ARG over^ start_ARG italic_V end_ARG end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG ( 1 - divide start_ARG italic_V end_ARG start_ARG over^ start_ARG italic_V end_ARG ∨ 1 end_ARG ) + divide start_ARG italic_V end_ARG start_ARG italic_R ∨ 1 end_ARG ( divide start_ARG italic_R ∨ 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG - 1 ) .

Since V^/(R^1)1^𝑉^𝑅11\hat{V}/(\hat{R}\vee 1)\leq 1over^ start_ARG italic_V end_ARG / ( over^ start_ARG italic_R end_ARG ∨ 1 ) ≤ 1, we have

0|E{V^R^1(1VV^1)}|E|V^R^1(1VV^1)|E|1VV^1|.0𝐸^𝑉^𝑅11𝑉^𝑉1𝐸^𝑉^𝑅11𝑉^𝑉1𝐸1𝑉^𝑉1\displaystyle 0\leq\bigg{|}E\left\{\frac{\hat{V}}{\hat{R}\vee 1}\left(1-\frac{% V}{\hat{V}\vee 1}\right)\right\}\bigg{|}\leq E\bigg{|}\frac{\hat{V}}{\hat{R}% \vee 1}\left(1-\frac{V}{\hat{V}\vee 1}\right)\bigg{|}\leq E\bigg{|}1-\frac{V}{% \hat{V}\vee 1}\bigg{|}.0 ≤ | italic_E { divide start_ARG over^ start_ARG italic_V end_ARG end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG ( 1 - divide start_ARG italic_V end_ARG start_ARG over^ start_ARG italic_V end_ARG ∨ 1 end_ARG ) } | ≤ italic_E | divide start_ARG over^ start_ARG italic_V end_ARG end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG ( 1 - divide start_ARG italic_V end_ARG start_ARG over^ start_ARG italic_V end_ARG ∨ 1 end_ARG ) | ≤ italic_E | 1 - divide start_ARG italic_V end_ARG start_ARG over^ start_ARG italic_V end_ARG ∨ 1 end_ARG | .

By Lemma 3, we have

E{V^R^1(1VV^1)}0 as m.𝐸^𝑉^𝑅11𝑉^𝑉10 as 𝑚\displaystyle E\left\{\frac{\hat{V}}{\hat{R}\vee 1}\left(1-\frac{V}{\hat{V}% \vee 1}\right)\right\}\rightarrow 0\text{ as }m\to\infty.italic_E { divide start_ARG over^ start_ARG italic_V end_ARG end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG ( 1 - divide start_ARG italic_V end_ARG start_ARG over^ start_ARG italic_V end_ARG ∨ 1 end_ARG ) } → 0 as italic_m → ∞ .

Similarly, we also have

E{VR1(RR^11)}0 as m.𝐸𝑉𝑅1𝑅^𝑅110 as 𝑚\displaystyle E\left\{\frac{V}{R\vee 1}\left(\frac{R}{\hat{R}\vee 1}-1\right)% \right\}\rightarrow 0\text{ as }m\to\infty.italic_E { divide start_ARG italic_V end_ARG start_ARG italic_R ∨ 1 end_ARG ( divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG - 1 ) } → 0 as italic_m → ∞ .

Therefore,

FDRFDROR=E(V^R^1)E(VR1)0 as m.FDRsubscriptFDROR𝐸^𝑉^𝑅1𝐸𝑉𝑅10 as 𝑚\displaystyle{\rm FDR}-{\rm FDR}_{\rm OR}=E\left(\frac{\hat{V}}{\hat{R}\vee 1}% \right)-E\left(\frac{V}{R\vee 1}\right)\leq 0\text{ as }m\rightarrow\infty.roman_FDR - roman_FDR start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT = italic_E ( divide start_ARG over^ start_ARG italic_V end_ARG end_ARG start_ARG over^ start_ARG italic_R end_ARG ∨ 1 end_ARG ) - italic_E ( divide start_ARG italic_V end_ARG start_ARG italic_R ∨ 1 end_ARG ) ≤ 0 as italic_m → ∞ .

Since FDRORqsubscriptFDROR𝑞{\rm FDR}_{\rm OR}\leq qroman_FDR start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ≤ italic_q by Theorem A.1, we know FDRFDR{\rm FDR}roman_FDR is asymptotically controlled.

Appendix B Proof of lemmas

B.1 Proof of Lemma 1

Proof.

Recall that

λ^OR=superscriptsubscript^𝜆ORabsent\displaystyle\hat{\lambda}_{\mathrm{OR}}^{\infty}=over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = sup{t:Q^OR(t)q},supremumconditional-set𝑡superscriptsubscript^𝑄OR𝑡𝑞\displaystyle\sup\left\{t:\hat{Q}_{\mathrm{OR}}^{\infty}(t)\leq q\right\},roman_sup { italic_t : over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) ≤ italic_q } ,
λOR=superscriptsubscript𝜆ORabsent\displaystyle\lambda_{\mathrm{OR}}^{\infty}=italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = sup{t:QOR(t)q},supremumconditional-set𝑡superscriptsubscript𝑄OR𝑡𝑞\displaystyle\sup\left\{t:Q_{\mathrm{OR}}^{\infty}(t)\leq q\right\},roman_sup { italic_t : italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) ≤ italic_q } ,

where

Q^OR(t)=superscriptsubscript^𝑄OR𝑡absent\displaystyle\hat{Q}_{\mathrm{OR}}^{\infty}(t)=over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = j=1mI(Tjt)Tjj=1mI(Tjt),superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑇𝑗𝑡superscriptsubscript𝑇𝑗superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑇𝑗𝑡\displaystyle\frac{\sum_{j=1}^{m}I(T_{j}^{\infty}\leq t)T_{j}^{\infty}}{\sum_{% j=1}^{m}I(T_{j}^{\infty}\leq t)},divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) end_ARG , (B.1)
QOR(t)=superscriptsubscript𝑄OR𝑡absent\displaystyle Q_{\mathrm{OR}}^{\infty}(t)=italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = π0G0(t)+π1G1(t)+π2G2(t)G(t).subscript𝜋0superscriptsubscript𝐺0𝑡subscript𝜋1superscriptsubscript𝐺1𝑡subscript𝜋2superscriptsubscript𝐺2𝑡superscript𝐺𝑡\displaystyle\frac{\pi_{0}G_{0}^{\infty}(t)+\pi_{1}G_{1}^{\infty}(t)+\pi_{2}G_% {2}^{\infty}(t)}{G^{\infty}(t)}.divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) end_ARG .

Since T1=P(s1{0,1,2}(y1j,y2j))superscriptsubscript𝑇1𝑃subscript𝑠1conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗T_{1}^{\infty}=P(s_{1}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{-\infty}^{\infty})italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = italic_P ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ), it is a funtion of (y1j,y2j)superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗(y_{1j},y_{2j})_{-\infty}^{\infty}( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT. Thus

E{I(T1t)T1}=𝐸𝐼superscriptsubscript𝑇1𝑡superscriptsubscript𝑇1absent\displaystyle E\left\{I(T_{1}^{\infty}\leq t)T_{1}^{\infty}\right\}=italic_E { italic_I ( italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } = E{I(T1t)E[I(s1{0,1,2})(y1j,y2j)]}𝐸𝐼superscriptsubscript𝑇1𝑡𝐸delimited-[]conditional𝐼subscript𝑠1012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗\displaystyle E\left\{I(T_{1}^{\infty}\leq t)E\left[I(s_{1}\in\{0,1,2\})\mid(y% _{1j},y_{2j})_{-\infty}^{\infty}\right]\right\}italic_E { italic_I ( italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) italic_E [ italic_I ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ] }
=\displaystyle== E{E[I(T1t,s1{0,1,2})(y1j,y2j)]}𝐸𝐸delimited-[]conditional𝐼formulae-sequencesuperscriptsubscript𝑇1𝑡subscript𝑠1012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗\displaystyle E\left\{E[I(T_{1}^{\infty}\leq t,s_{1}\in\{0,1,2\})\mid(y_{1j},y% _{2j})_{-\infty}^{\infty}]\right\}italic_E { italic_E [ italic_I ( italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ) ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ] }
=\displaystyle== P(T1t,s1{0,1,2})𝑃formulae-sequencesuperscriptsubscript𝑇1𝑡subscript𝑠1012\displaystyle P\left(T_{1}^{\infty}\leq t,s_{1}\in\{0,1,2\}\right)italic_P ( italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } )
=\displaystyle== π0G0(t)+π1G1(t)+π2G2(t), andsubscript𝜋0superscriptsubscript𝐺0𝑡subscript𝜋1superscriptsubscript𝐺1𝑡subscript𝜋2superscriptsubscript𝐺2𝑡 and\displaystyle\pi_{0}G_{0}^{\infty}(t)+\pi_{1}G_{1}^{\infty}(t)+\pi_{2}G_{2}^{% \infty}(t),\text{ and }italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) , and
E{I(T1t)}=𝐸𝐼superscriptsubscript𝑇1𝑡absent\displaystyle E\left\{I(T_{1}^{\infty}\leq t)\right\}=italic_E { italic_I ( italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) } = G(t).superscript𝐺𝑡\displaystyle G^{\infty}(t).italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) .

Brikhoff’s ergodic theorem [Birkhoff, 1931] gives

1mj=1mI(Tjt)Tj1𝑚superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑇𝑗𝑡superscriptsubscript𝑇𝑗absent\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}^{\infty}\leq t)T_{j}^{\infty}\rightarrowdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → π0G0(t)+π1G1(t)+π2G2(t) almost surely for any t,subscript𝜋0superscriptsubscript𝐺0𝑡subscript𝜋1superscriptsubscript𝐺1𝑡subscript𝜋2superscriptsubscript𝐺2𝑡 almost surely for any 𝑡\displaystyle\pi_{0}G_{0}^{\infty}(t)+\pi_{1}G_{1}^{\infty}(t)+\pi_{2}G_{2}^{% \infty}(t)\text{ almost surely for any }t,italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) almost surely for any italic_t ,
1mj=1mI(Tjt)1𝑚superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑇𝑗𝑡absent\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}^{\infty}\leq t)\rightarrowdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) → G(t) almost surely for any t.superscript𝐺𝑡 almost surely for any 𝑡\displaystyle G^{\infty}(t)\text{ almost surely for any }t.italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) almost surely for any italic_t .

Consequently,

Q^OR(t)superscriptsubscript^𝑄OR𝑡absent\displaystyle\hat{Q}_{\mathrm{OR}}^{\infty}(t)\rightarrowover^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) → QOR(t) almost surely for any t such that G(t)>0.superscriptsubscript𝑄OR𝑡 almost surely for any 𝑡 such that superscript𝐺𝑡0\displaystyle Q_{\mathrm{OR}}^{\infty}(t)\text{ almost surely for any }t\text{% such that }G^{\infty}(t)>0.italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) almost surely for any italic_t such that italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) > 0 . (B.2)

In addition, G(λOR)>0superscript𝐺subscriptsuperscript𝜆OR0G^{\infty}(\lambda^{\infty}_{\rm OR})>0italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) > 0. Therefore,

P(limmQ^OR(λOR)q)=𝑃subscript𝑚superscriptsubscript^𝑄ORsuperscriptsubscript𝜆OR𝑞absent\displaystyle P\bigg{(}\lim_{m\rightarrow\infty}\hat{Q}_{\mathrm{OR}}^{\infty}% (\lambda_{\mathrm{OR}}^{\infty})\leq q\bigg{)}=italic_P ( roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) ≤ italic_q ) = P(limm(Q^OR(λOR)QOR(λOR))+QOR(λOR)q)=1,𝑃subscript𝑚superscriptsubscript^𝑄ORsuperscriptsubscript𝜆ORsuperscriptsubscript𝑄ORsuperscriptsubscript𝜆ORsuperscriptsubscript𝑄ORsuperscriptsubscript𝜆OR𝑞1\displaystyle P\bigg{(}\lim_{m\rightarrow\infty}\big{(}\hat{Q}_{\mathrm{OR}}^{% \infty}(\lambda_{\mathrm{OR}}^{\infty})-Q_{\mathrm{OR}}^{\infty}(\lambda_{% \mathrm{OR}}^{\infty})\big{)}+Q_{\mathrm{OR}}^{\infty}(\lambda_{\mathrm{OR}}^{% \infty})\leq q\bigg{)}=1,italic_P ( roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT ( over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) ) + italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) ≤ italic_q ) = 1 ,

which implies that P(limmλ^ORλOR)=1𝑃subscript𝑚superscriptsubscript^𝜆ORsuperscriptsubscript𝜆OR1P(\lim_{m\rightarrow\infty}\hat{\lambda}_{\mathrm{OR}}^{\infty}\geq\lambda_{% \mathrm{OR}}^{\infty})=1italic_P ( roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≥ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) = 1, or equivalently,

λ^ORλOR almost surely.superscriptsubscript^𝜆ORsuperscriptsubscript𝜆OR almost surely.\displaystyle\hat{\lambda}_{\mathrm{OR}}^{\infty}\geq\lambda_{\mathrm{OR}}^{% \infty}\text{ almost surely.}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≥ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT almost surely. (B.3)

By construction, Q^OR(t)superscriptsubscript^𝑄OR𝑡\hat{Q}_{\mathrm{OR}}^{\infty}(t)over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) is an increasing step function with jump at T(j)superscriptsubscript𝑇𝑗T_{(j)}^{\infty}italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT. For T(j)t<T(j+1)superscriptsubscript𝑇𝑗𝑡superscriptsubscript𝑇𝑗1T_{(j)}^{\infty}\leq t<T_{(j+1)}^{\infty}italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t < italic_T start_POSTSUBSCRIPT ( italic_j + 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT, construct the lower bound of Q^OR(t)superscriptsubscript^𝑄OR𝑡\hat{Q}_{\rm OR}^{\infty}(t)over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) as

L^OR(t)=superscriptsubscript^𝐿OR𝑡absent\displaystyle\hat{L}_{\mathrm{OR}}^{\infty}(t)=over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = T(j+1)tT(j+1)T(j)Q^OR(T(j1))+tT(j)T(j+1)T(j)Q^OR(T(j)).superscriptsubscript𝑇𝑗1𝑡superscriptsubscript𝑇𝑗1superscriptsubscript𝑇𝑗superscriptsubscript^𝑄ORsuperscriptsubscript𝑇𝑗1𝑡superscriptsubscript𝑇𝑗superscriptsubscript𝑇𝑗1superscriptsubscript𝑇𝑗superscriptsubscript^𝑄ORsuperscriptsubscript𝑇𝑗\displaystyle\frac{T_{(j+1)}^{\infty}-t}{T_{(j+1)}^{\infty}-T_{(j)}^{\infty}}% \hat{Q}_{\mathrm{OR}}^{\infty}(T_{(j-1)}^{\infty})+\frac{t-T_{(j)}^{\infty}}{T% _{(j+1)}^{\infty}-T_{(j)}^{\infty}}\hat{Q}_{\mathrm{OR}}^{\infty}(T_{(j)}^{% \infty}).divide start_ARG italic_T start_POSTSUBSCRIPT ( italic_j + 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_t end_ARG start_ARG italic_T start_POSTSUBSCRIPT ( italic_j + 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_ARG over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT ( italic_j - 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) + divide start_ARG italic_t - italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_ARG start_ARG italic_T start_POSTSUBSCRIPT ( italic_j + 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_ARG over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) .

Then L^OR(t)superscriptsubscript^𝐿OR𝑡\hat{L}_{\mathrm{OR}}^{\infty}(t)over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) is strictly increasing in t𝑡titalic_t. We also have

0Q^OR(t)L^OR(t)0superscriptsubscript^𝑄OR𝑡superscriptsubscript^𝐿OR𝑡absent\displaystyle 0\leq\hat{Q}_{\mathrm{OR}}^{\infty}(t)-\hat{L}_{\mathrm{OR}}^{% \infty}(t)\leq0 ≤ over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) - over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) ≤ Q^OR(T(j))Q^OR(T(j1))superscriptsubscript^𝑄ORsuperscriptsubscript𝑇𝑗superscriptsubscript^𝑄ORsuperscriptsubscript𝑇𝑗1\displaystyle\hat{Q}_{\mathrm{OR}}^{\infty}(T_{(j)}^{\infty})-\hat{Q}_{\mathrm% {OR}}^{\infty}(T_{(j-1)}^{\infty})over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT ( italic_j - 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT )
=\displaystyle== (j1)T(j)k=1j1T(k)j(j1)𝑗1superscriptsubscript𝑇𝑗superscriptsubscript𝑘1𝑗1superscriptsubscript𝑇𝑘𝑗𝑗1\displaystyle\frac{(j-1)T_{(j)}^{\infty}-\sum_{k=1}^{j-1}T_{(k)}^{\infty}}{j(j% -1)}divide start_ARG ( italic_j - 1 ) italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_k ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_ARG start_ARG italic_j ( italic_j - 1 ) end_ARG
\displaystyle\leq 1j1𝑗\displaystyle\frac{1}{j}divide start_ARG 1 end_ARG start_ARG italic_j end_ARG
=\displaystyle== 1R(t),1superscript𝑅𝑡\displaystyle\frac{1}{R^{\infty}(t)},divide start_ARG 1 end_ARG start_ARG italic_R start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) end_ARG ,

where R(t)=k=1m1(Tkt)superscript𝑅𝑡superscriptsubscript𝑘1𝑚1superscriptsubscript𝑇𝑘𝑡R^{\infty}(t)=\sum_{k=1}^{m}1(T_{k}^{\infty}\leq t)italic_R start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT 1 ( italic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t ) denotes the number of rejections yielded by threshold t𝑡titalic_t, satisfying R(t)=jsuperscript𝑅𝑡𝑗R^{\infty}(t)=jitalic_R start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) = italic_j if T(j)t<T(j+1)superscriptsubscript𝑇𝑗𝑡superscriptsubscript𝑇𝑗1T_{(j)}^{\infty}\leq t<T_{(j+1)}^{\infty}italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_t < italic_T start_POSTSUBSCRIPT ( italic_j + 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT. By Birkhoff’s ergodic theorem [Birkhoff, 1931], R(t)/mG(t)superscript𝑅𝑡𝑚superscript𝐺𝑡R^{\infty}(t)/m\to G^{\infty}(t)italic_R start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) / italic_m → italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) almost surely as m𝑚m\to\inftyitalic_m → ∞. Then for any t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ],

Q^OR(t)L^OR(t)0 almost surely.superscriptsubscript^𝑄OR𝑡superscriptsubscript^𝐿OR𝑡0 almost surely\displaystyle\hat{Q}_{\mathrm{OR}}^{\infty}(t)-\hat{L}_{\mathrm{OR}}^{\infty}(% t)\rightarrow 0\text{ almost surely}.over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) - over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) → 0 almost surely .

By (B.2), L^OR(t)QOR(t)superscriptsubscript^𝐿OR𝑡superscriptsubscript𝑄OR𝑡\hat{L}_{\mathrm{OR}}^{\infty}(t)\rightarrow Q_{\mathrm{OR}}^{\infty}(t)over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) → italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) almost surely for any t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ]. Denote

λ^L,OR=sup{t(0,1):L^OR(t)q}.superscriptsubscript^𝜆LORsupremumconditional-set𝑡01superscriptsubscript^𝐿OR𝑡𝑞\displaystyle\hat{\lambda}_{\mathrm{L,OR}}^{\infty}=\sup\{t\in(0,1):\hat{L}_{% \mathrm{OR}}^{\infty}(t)\leq q\}.over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_L , roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = roman_sup { italic_t ∈ ( 0 , 1 ) : over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) ≤ italic_q } .

As Q^OR(t)L^OR(t)superscriptsubscript^𝑄OR𝑡superscriptsubscript^𝐿OR𝑡\hat{Q}_{\mathrm{OR}}^{\infty}(t)\geq\hat{L}_{\mathrm{OR}}^{\infty}(t)over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) ≥ over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) with probability 1111, we have

λ^ORλ^L,OR with probability 1.superscriptsubscript^𝜆ORsuperscriptsubscript^𝜆LOR with probability 1\displaystyle\hat{\lambda}_{\mathrm{OR}}^{\infty}\leq\hat{\lambda}_{\mathrm{L,% OR}}^{\infty}\text{ with probability }1.over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_L , roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT with probability 1 . (B.4)

By (B.3), we also have λ^L,ORλORsuperscriptsubscript^𝜆LORsuperscriptsubscript𝜆OR\hat{\lambda}_{\rm L,OR}^{\infty}\geq\lambda_{\rm OR}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_L , roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≥ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT almost surely. We claim that λ^L,ORλORsuperscriptsubscript^𝜆LORsuperscriptsubscript𝜆OR\hat{\lambda}_{\mathrm{L,OR}}^{\infty}\rightarrow\lambda_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_L , roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability. If not, there exist ε2>0subscript𝜀20\varepsilon_{2}>0italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0 and η0>0subscript𝜂00\eta_{0}>0italic_η start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0 such that for any M>0𝑀0M>0italic_M > 0, there exists m1Msubscript𝑚1𝑀m_{1}\geq Mitalic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_M satisfying

P(Km11)2η0,𝑃superscriptsubscript𝐾subscript𝑚112subscript𝜂0\displaystyle P(K_{m_{1}}^{1})\geq 2\eta_{0},italic_P ( italic_K start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) ≥ 2 italic_η start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ,

where Km11superscriptsubscript𝐾subscript𝑚11K_{m_{1}}^{1}italic_K start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT denotes the even that λ^L,ORλOR>ε2superscriptsubscript^𝜆LORsuperscriptsubscript𝜆ORsubscript𝜀2\hat{\lambda}_{\mathrm{L,OR}}^{\infty}-\lambda_{\mathrm{OR}}^{\infty}>% \varepsilon_{2}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_L , roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT > italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Let

2δ1=QOR(λOR+ε2)q>0.2subscript𝛿1superscriptsubscript𝑄ORsuperscriptsubscript𝜆ORsubscript𝜀2𝑞0\displaystyle 2\delta_{1}=Q_{\mathrm{OR}}^{\infty}(\lambda_{\mathrm{OR}}^{% \infty}+\varepsilon_{2})-q>0.2 italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_q > 0 .

Since L^OR(t)QOR(t)superscriptsubscript^𝐿OR𝑡superscriptsubscript𝑄OR𝑡\hat{L}_{\mathrm{OR}}^{\infty}(t)\rightarrow Q_{\mathrm{OR}}^{\infty}(t)over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) → italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) in probability for any t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ], there exists M>0𝑀0M>0italic_M > 0, such that for any m2Msubscript𝑚2𝑀m_{2}\geq Mitalic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ italic_M,

P(Km22)1η0,𝑃superscriptsubscript𝐾subscript𝑚221subscript𝜂0\displaystyle P(K_{m_{2}}^{2})\geq 1-\eta_{0},italic_P ( italic_K start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ≥ 1 - italic_η start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ,

where Km22superscriptsubscript𝐾subscript𝑚22K_{m_{2}}^{2}italic_K start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT denotes the event that |L^OR(λOR+ε2)QOR(λOR+ε2)|<δ1superscriptsubscript^𝐿ORsuperscriptsubscript𝜆ORsubscript𝜀2superscriptsubscript𝑄ORsuperscriptsubscript𝜆ORsubscript𝜀2subscript𝛿1|\hat{L}_{\mathrm{OR}}^{\infty}(\lambda_{\mathrm{OR}}^{\infty}+\varepsilon_{2}% )-Q_{\mathrm{OR}}^{\infty}(\lambda_{\mathrm{OR}}^{\infty}+\varepsilon_{2})|<% \delta_{1}| over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) | < italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Without loss of generality, assume m1=m2=msubscript𝑚1subscript𝑚2𝑚m_{1}=m_{2}=mitalic_m start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_m start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_m. Letting Km=Km1Km2subscript𝐾𝑚superscriptsubscript𝐾𝑚1superscriptsubscript𝐾𝑚2K_{m}=K_{m}^{1}\bigcap K_{m}^{2}italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ⋂ italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, we have

P(Km)=𝑃subscript𝐾𝑚absent\displaystyle P(K_{m})=italic_P ( italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) = 1P((Km1)c(Km2)c)1𝑃superscriptsuperscriptsubscript𝐾𝑚1𝑐superscriptsuperscriptsubscript𝐾𝑚2𝑐\displaystyle 1-P((K_{m}^{1})^{c}\cup(K_{m}^{2})^{c})1 - italic_P ( ( italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ∪ ( italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT )
\displaystyle\geq 1{(12η0)+η0}112subscript𝜂0subscript𝜂0\displaystyle 1-\{(1-2\eta_{0})+\eta_{0}\}1 - { ( 1 - 2 italic_η start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) + italic_η start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }
=\displaystyle== η0.subscript𝜂0\displaystyle\eta_{0}.italic_η start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .

Thus Kmsubscript𝐾𝑚K_{m}italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT has positive probability. Additionally, L^OR(t)superscriptsubscript^𝐿OR𝑡\hat{L}_{\mathrm{OR}}^{\infty}(t)over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) is strictly increasing over t𝑡titalic_t with probability 1111. On Kmsubscript𝐾𝑚K_{m}italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, we have

q=𝑞absent\displaystyle q=italic_q = L^OR(λ^L,OR)superscriptsubscript^𝐿ORsuperscriptsubscript^𝜆LOR\displaystyle\hat{L}_{\mathrm{OR}}^{\infty}(\hat{\lambda}_{\mathrm{L,OR}}^{% \infty})over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_L , roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT )
>\displaystyle>> L^OR(λOR+ε2)superscriptsubscript^𝐿ORsuperscriptsubscript𝜆ORsubscript𝜀2\displaystyle\hat{L}_{\mathrm{OR}}^{\infty}(\lambda_{\mathrm{OR}}^{\infty}+% \varepsilon_{2})over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT )
>\displaystyle>> QOR(λOR+ε2)δ1superscriptsubscript𝑄ORsuperscriptsubscript𝜆ORsubscript𝜀2subscript𝛿1\displaystyle Q_{\mathrm{OR}}^{\infty}(\lambda_{\mathrm{OR}}^{\infty}+% \varepsilon_{2})-\delta_{1}italic_Q start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT
=\displaystyle== q+δ1,𝑞subscript𝛿1\displaystyle q+\delta_{1},italic_q + italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ,

which is a contradiction. Thus we must have λ^L,ORλORsuperscriptsubscript^𝜆LORsuperscriptsubscript𝜆OR\hat{\lambda}_{\mathrm{L,OR}}^{\infty}\rightarrow\lambda_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_L , roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability. Furthermore, by (B.3) and (B.4),

λ^ORλOR in probability as m.superscriptsubscript^𝜆ORsuperscriptsubscript𝜆OR in probability as 𝑚\displaystyle\hat{\lambda}_{\mathrm{OR}}^{\infty}\rightarrow\lambda_{\mathrm{% OR}}^{\infty}\text{ in probability as }m\rightarrow\infty.over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability as italic_m → ∞ .

λ^rLISλORsuperscriptsubscript^𝜆rLISsuperscriptsubscript𝜆OR\hat{\lambda}_{\rm rLIS}^{\infty}\to\lambda_{\rm OR}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability can be shown in the same way. ∎

B.2 Proof of Lemma 2

Proof.

Define Md+(j,ϕ)=maxk,l=0,1,2,3Pϕ(Sj=ky1m,Sjd=l)superscriptsubscript𝑀𝑑𝑗italic-ϕsubscriptformulae-sequence𝑘𝑙0123subscript𝑃italic-ϕsubscript𝑆𝑗conditional𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙M_{d}^{+}(j,\phi)=\max_{k,l=0,1,2,3}P_{\phi}(S_{j}=k\mid y_{1}^{m},S_{j-d}=l)italic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) = roman_max start_POSTSUBSCRIPT italic_k , italic_l = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j - italic_d end_POSTSUBSCRIPT = italic_l ). Similarly, define Md(j,ϕ)=mink,l=0,1,2,3Pϕ(Sj=ky1m,Sjd=l)superscriptsubscript𝑀𝑑𝑗italic-ϕsubscriptformulae-sequence𝑘𝑙0123subscript𝑃italic-ϕsubscript𝑆𝑗conditional𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙M_{d}^{-}(j,\phi)=\min_{k,l=0,1,2,3}P_{\phi}(S_{j}=k\mid y_{1}^{m},S_{j-d}=l)italic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) = roman_min start_POSTSUBSCRIPT italic_k , italic_l = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j - italic_d end_POSTSUBSCRIPT = italic_l ). We need to show that

|Md+(j,ϕ)Md(j,ϕ)|i=jd+1j1{12τ0(y1i,y2i)}.superscriptsubscript𝑀𝑑𝑗italic-ϕsuperscriptsubscript𝑀𝑑𝑗italic-ϕsuperscriptsubscriptproduct𝑖𝑗𝑑1𝑗112subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle|M_{d}^{+}(j,\phi)-M_{d}^{-}(j,\phi)|\leq\prod_{i=j-d+1}^{j-1}\{1% -2\tau_{0}(y_{1i},y_{2i})\}.| italic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) - italic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) | ≤ ∏ start_POSTSUBSCRIPT italic_i = italic_j - italic_d + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT { 1 - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } . (B.5)

Since ε01/4subscript𝜀014\varepsilon_{0}\leq 1/4italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≤ 1 / 4 and ρ0(y1i,y2i)1subscript𝜌0subscript𝑦1𝑖subscript𝑦2𝑖1\rho_{0}(y_{1i},y_{2i})\geq 1italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) ≥ 1 with probability 1111, we have τ0(y1i,y2i)={1+ε02ρ0(y1i,y2i)}11/13subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖superscript1superscriptsubscript𝜀02subscript𝜌0subscript𝑦1𝑖subscript𝑦2𝑖1113\tau_{0}(y_{1i},y_{2i})=\{1+\varepsilon_{0}^{2}\rho_{0}(y_{1i},y_{2i})\}^{-1}% \leq 1/13italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) = { 1 + italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ≤ 1 / 13 with probability 1111. Thus 12τ0(y1i,y2i)>012subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖01-2\tau_{0}(y_{1i},y_{2i})>01 - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) > 0 with probability 1111 for any i=1,,m.𝑖1𝑚i=1,\ldots,m.italic_i = 1 , … , italic_m . We have

Pϕ(Sj=ky1m,Sjd=l)=subscript𝑃italic-ϕsubscript𝑆𝑗conditional𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙absent\displaystyle P_{\phi}(S_{j}=k\mid y_{1}^{m},S_{j-d}=l)=italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j - italic_d end_POSTSUBSCRIPT = italic_l ) = k=03Pϕ(Sj=k,Sjd+1=ky1m,Sjd=l)superscriptsubscriptsuperscript𝑘03subscript𝑃italic-ϕformulae-sequencesubscript𝑆𝑗𝑘subscript𝑆𝑗𝑑1conditionalsuperscript𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙\displaystyle\sum_{k^{\prime}=0}^{3}P_{\phi}(S_{j}=k,S_{j-d+1}=k^{\prime}\mid y% _{1}^{m},S_{j-d}=l)∑ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k , italic_S start_POSTSUBSCRIPT italic_j - italic_d + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j - italic_d end_POSTSUBSCRIPT = italic_l )
=\displaystyle== k=03Pϕ(Sj=ky1m,Sjd+1=k)Pϕ(Sjd+1=ky1m,Sjd=l).superscriptsubscriptsuperscript𝑘03subscript𝑃italic-ϕsubscript𝑆𝑗conditional𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑1superscript𝑘subscript𝑃italic-ϕsubscript𝑆𝑗𝑑1conditionalsuperscript𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙\displaystyle\sum_{k^{\prime}=0}^{3}P_{\phi}(S_{j}=k\mid y_{1}^{m},S_{j-d+1}=k% ^{\prime})P_{\phi}(S_{j-d+1}=k^{\prime}\mid y_{1}^{m},S_{j-d}=l).∑ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j - italic_d + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j - italic_d + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j - italic_d end_POSTSUBSCRIPT = italic_l ) .

Since Pϕ(Sjd+1=ky1m,Sjd=l)τ0(y1,jd+1,y2,jd+1)subscript𝑃italic-ϕsubscript𝑆𝑗𝑑1conditionalsuperscript𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙subscript𝜏0subscript𝑦1𝑗𝑑1subscript𝑦2𝑗𝑑1P_{\phi}(S_{j-d+1}=k^{\prime}\mid y_{1}^{m},S_{j-d}=l)\geq\tau_{0}(y_{1,j-d+1}% ,y_{2,j-d+1})italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j - italic_d + 1 end_POSTSUBSCRIPT = italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j - italic_d end_POSTSUBSCRIPT = italic_l ) ≥ italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j - italic_d + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j - italic_d + 1 end_POSTSUBSCRIPT ), we have

Md+(j,ϕ)superscriptsubscript𝑀𝑑𝑗italic-ϕabsent\displaystyle M_{d}^{+}(j,\phi)\leqitalic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) ≤ {1τ0(y1,jd+1,y2,jd+1)}Md1+(j,ϕ)+τ0(y1,jd+1,y2,jd+1)Md1(j,ϕ),1subscript𝜏0subscript𝑦1𝑗𝑑1subscript𝑦2𝑗𝑑1superscriptsubscript𝑀𝑑1𝑗italic-ϕsubscript𝜏0subscript𝑦1𝑗𝑑1subscript𝑦2𝑗𝑑1superscriptsubscript𝑀𝑑1𝑗italic-ϕ\displaystyle\{1-\tau_{0}(y_{1,j-d+1},y_{2,j-d+1})\}M_{d-1}^{+}(j,\phi)+\tau_{% 0}(y_{1,j-d+1},y_{2,j-d+1})M_{d-1}^{-}(j,\phi),{ 1 - italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j - italic_d + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j - italic_d + 1 end_POSTSUBSCRIPT ) } italic_M start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) + italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j - italic_d + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j - italic_d + 1 end_POSTSUBSCRIPT ) italic_M start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) ,

and similarly,

Md(j,ϕ)superscriptsubscript𝑀𝑑𝑗italic-ϕabsent\displaystyle M_{d}^{-}(j,\phi)\geqitalic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) ≥ {1τ0(y1,jd+1,y2,jd+1)}Md1(j,ϕ)+τ0(y1,jd+1,y2,jd+1)Md1+(j,ϕ).1subscript𝜏0subscript𝑦1𝑗𝑑1subscript𝑦2𝑗𝑑1superscriptsubscript𝑀𝑑1𝑗italic-ϕsubscript𝜏0subscript𝑦1𝑗𝑑1subscript𝑦2𝑗𝑑1superscriptsubscript𝑀𝑑1𝑗italic-ϕ\displaystyle\{1-\tau_{0}(y_{1,j-d+1},y_{2,j-d+1})\}M_{d-1}^{-}(j,\phi)+\tau_{% 0}(y_{1,j-d+1},y_{2,j-d+1})M_{d-1}^{+}(j,\phi).{ 1 - italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j - italic_d + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j - italic_d + 1 end_POSTSUBSCRIPT ) } italic_M start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) + italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j - italic_d + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j - italic_d + 1 end_POSTSUBSCRIPT ) italic_M start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) .

Therefore,

Md+(j,ϕ)Md(j,ϕ)superscriptsubscript𝑀𝑑𝑗italic-ϕsuperscriptsubscript𝑀𝑑𝑗italic-ϕabsent\displaystyle M_{d}^{+}(j,\phi)-M_{d}^{-}(j,\phi)\leqitalic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) - italic_M start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) ≤ {12τ0(y1,jd+1,y2,jd+1)}{Md1+(j,ϕ)Md1(j,ϕ)}12subscript𝜏0subscript𝑦1𝑗𝑑1subscript𝑦2𝑗𝑑1superscriptsubscript𝑀𝑑1𝑗italic-ϕsuperscriptsubscript𝑀𝑑1𝑗italic-ϕ\displaystyle\{1-2\tau_{0}(y_{1,j-d+1},y_{2,j-d+1})\}\{M_{d-1}^{+}(j,\phi)-M_{% d-1}^{-}(j,\phi)\}{ 1 - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 , italic_j - italic_d + 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 , italic_j - italic_d + 1 end_POSTSUBSCRIPT ) } { italic_M start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) - italic_M start_POSTSUBSCRIPT italic_d - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) }
\displaystyle\leq i=jd+1j1{12τ0(y1i,y2i)}{M1+(j,ϕ)M1(j,ϕ)}.superscriptsubscriptproduct𝑖𝑗𝑑1𝑗112subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖superscriptsubscript𝑀1𝑗italic-ϕsuperscriptsubscript𝑀1𝑗italic-ϕ\displaystyle\prod_{i=j-d+1}^{j-1}\{1-2\tau_{0}(y_{1i},y_{2i})\}\{M_{1}^{+}(j,% \phi)-M_{1}^{-}(j,\phi)\}.∏ start_POSTSUBSCRIPT italic_i = italic_j - italic_d + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT { 1 - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } { italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) - italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) } .

Since M1+(j,ϕ)M1(j,ϕ)1superscriptsubscript𝑀1𝑗italic-ϕsuperscriptsubscript𝑀1𝑗italic-ϕ1M_{1}^{+}(j,\phi)-M_{1}^{-}(j,\phi)\leq 1italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) - italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) ≤ 1, we know (B.5) is true. We have the similar definitions Nd+(j,ϕ)=maxk,l=0,1,2,3Pϕ(Sj=ky1m,Sj+d=l)superscriptsubscript𝑁𝑑𝑗italic-ϕsubscriptformulae-sequence𝑘𝑙0123subscript𝑃italic-ϕsubscript𝑆𝑗conditional𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙N_{d}^{+}(j,\phi)=\max_{k,l=0,1,2,3}P_{\phi}(S_{j}=k\mid y_{1}^{m},S_{j+d}=l)italic_N start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) = roman_max start_POSTSUBSCRIPT italic_k , italic_l = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j + italic_d end_POSTSUBSCRIPT = italic_l ) and Nd(j,ϕ)=mink,l=0,1,2,3Pϕ(Sj=ky1m,Sj+d=l)superscriptsubscript𝑁𝑑𝑗italic-ϕsubscriptformulae-sequence𝑘𝑙0123subscript𝑃italic-ϕsubscript𝑆𝑗conditional𝑘superscriptsubscript𝑦1𝑚subscript𝑆𝑗𝑑𝑙N_{d}^{-}(j,\phi)=\min_{k,l=0,1,2,3}P_{\phi}(S_{j}=k\mid y_{1}^{m},S_{j+d}=l)italic_N start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) = roman_min start_POSTSUBSCRIPT italic_k , italic_l = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_S start_POSTSUBSCRIPT italic_j + italic_d end_POSTSUBSCRIPT = italic_l ). We also have

|Nd+(j,ϕ)Nd(j,ϕ)|i=j+1j+d1{12τ0(y1i,y2i)}.superscriptsubscript𝑁𝑑𝑗italic-ϕsuperscriptsubscript𝑁𝑑𝑗italic-ϕsuperscriptsubscriptproduct𝑖𝑗1𝑗𝑑112subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle|N_{d}^{+}(j,\phi)-N_{d}^{-}(j,\phi)|\leq\prod_{i=j+1}^{j+d-1}\{1% -2\tau_{0}(y_{1i},y_{2i})\}.| italic_N start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) - italic_N start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) | ≤ ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j + italic_d - 1 end_POSTSUPERSCRIPT { 1 - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } . (B.6)

We move to the second step. Let L<m/2𝐿𝑚2L<m/2italic_L < italic_m / 2. For any j𝑗jitalic_j, let L1=1(jL)subscript𝐿11𝑗𝐿L_{1}=1\vee(j-L)italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 ∨ ( italic_j - italic_L ) and L2=m(j+L)subscript𝐿2𝑚𝑗𝐿L_{2}=m\wedge(j+L)italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_m ∧ ( italic_j + italic_L ). We claim that when L1>1subscript𝐿11L_{1}>1italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 1 and L2<msubscript𝐿2𝑚L_{2}<mitalic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_m,

|Pϕ(Sj{0,1,2}y1m)Pϕ(Sj{0,1,2}y)|\displaystyle|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi}(S_{j}\in\{0,1% ,2\}\mid y_{-\infty}^{\infty})|| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) |
<\displaystyle<< 3i=L1+1j1exp{2τ0(y1i,y2i)}+3i=j+1L21exp{2τ0(y1i,y2i)}.3superscriptsubscriptproduct𝑖subscript𝐿11𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖3superscriptsubscriptproduct𝑖𝑗1subscript𝐿212subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle 3\prod_{i=L_{1}+1}^{j-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}+3\prod% _{i=j+1}^{L_{2}-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}.3 ∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } + 3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } . (B.7)

We have

|Pϕ(Sj{0,1,2}y1m)Pϕ(Sj{0,1,2}y)|\displaystyle|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi}(S_{j}\in\{0,1% ,2\}\mid y_{-\infty}^{\infty})|| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) |
\displaystyle\leq |Pϕ(Sj{0,1,2}y1m)Pϕ(Sj{0,1,2}ym)|\displaystyle|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi}(S_{j}\in\{0,1% ,2\}\mid y_{-\infty}^{m})|| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) |
+|Pϕ(Sj{0,1,2}ym)Pϕ(Sj{0,1,2}y)|.\displaystyle+|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{-\infty}^{m})-P_{\phi}(S_{j}% \in\{0,1,2\}\mid y_{-\infty}^{\infty})|.+ | italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) | .

We just need to show

|Pϕ(Sj{0,1,2}y1m)Pϕ(Sj{0,1,2}ym)|\displaystyle|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi}(S_{j}\in\{0,1% ,2\}\mid y_{-\infty}^{m})|\leq| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) | ≤ 3i=L1+1j1exp{2τ0(y1i,y2i)}3superscriptsubscriptproduct𝑖subscript𝐿11𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle 3\prod_{i=L_{1}+1}^{j-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}3 ∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) }

and

|Pϕ(Sj{0,1,2}ym)Pϕ(Sj{0,1,2}y)|\displaystyle|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{-\infty}^{m})-P_{\phi}(S_{j}% \in\{0,1,2\}\mid y_{-\infty}^{\infty})|\leq| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) | ≤ 3i=j+1L21exp{2τ0(y1i,y2i)}.3superscriptsubscriptproduct𝑖𝑗1subscript𝐿212subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle 3\prod_{i=j+1}^{L_{2}-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}.3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } .

We have for k=0,1,2𝑘012k=0,1,2italic_k = 0 , 1 , 2,

|Pϕ(Sj=ky1m)Pϕ(Sj=kym)|\displaystyle|P_{\phi}(S_{j}=k\mid y_{1}^{m})-P_{\phi}(S_{j}=k\mid y_{-\infty}% ^{m})|| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) |
=\displaystyle== |l=03Pϕ(Sj=kSjL=l,y1m)Pϕ(SjL=ly1m)\displaystyle\bigg{|}\sum_{l=0}^{3}P_{\phi}(S_{j}=k\mid S_{j-L}=l,y_{1}^{m})P_% {\phi}(S_{j-L}=l\mid y_{1}^{m})| ∑ start_POSTSUBSCRIPT italic_l = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j - italic_L end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j - italic_L end_POSTSUBSCRIPT = italic_l ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT )
l=03Pϕ(Sj=kSjL=l,y1m)Pϕ(SjL=lym)|\displaystyle-\sum_{l^{\prime}=0}^{3}P_{\phi}(S_{j}=k\mid S_{j-L}=l^{\prime},y% _{1}^{m})P_{\phi}(S_{j-L}=l^{\prime}\mid y_{-\infty}^{m})\bigg{|}- ∑ start_POSTSUBSCRIPT italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j - italic_L end_POSTSUBSCRIPT = italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j - italic_L end_POSTSUBSCRIPT = italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) |
\displaystyle\leq maxl,l=0,1,2,3|Pϕ(Sj=kSjL=l,y1m)Pϕ(Sj=kSjL=l,y1m)|\displaystyle\max_{l,l^{\prime}=0,1,2,3}|P_{\phi}(S_{j}=k\mid S_{j-L}=l,y_{1}^% {m})-P_{\phi}(S_{j}=k\mid S_{j-L}=l^{\prime},y_{1}^{m})|roman_max start_POSTSUBSCRIPT italic_l , italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT | italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j - italic_L end_POSTSUBSCRIPT = italic_l , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_S start_POSTSUBSCRIPT italic_j - italic_L end_POSTSUBSCRIPT = italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) |
\displaystyle\leq ML+(j,ϕ)ML(j,ϕ)superscriptsubscript𝑀𝐿𝑗italic-ϕsuperscriptsubscript𝑀𝐿𝑗italic-ϕ\displaystyle M_{L}^{+}(j,\phi)-M_{L}^{-}(j,\phi)italic_M start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_j , italic_ϕ ) - italic_M start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_j , italic_ϕ )
\displaystyle\leq i=L1+1j1{12τ0(y1i,y2i)}superscriptsubscriptproduct𝑖subscript𝐿11𝑗112subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle\prod_{i=L_{1}+1}^{j-1}\{1-2\tau_{0}(y_{1i},y_{2i})\}∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT { 1 - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) }
\displaystyle\leq i=L1+1j1exp{2τ0(y1i,y2i)}.superscriptsubscriptproduct𝑖subscript𝐿11𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle\prod_{i=L_{1}+1}^{j-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}.∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } .

Then

|Pϕ(Sj{0,1,2}y1m)Pϕ(Sj{0,1,2}ym)|\displaystyle|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi}(S_{j}\in\{0,1% ,2\}\mid y_{-\infty}^{m})|| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) |
\displaystyle\leq k=02|Pϕ(Sj=ky1m)Pϕ(Sj=kym)|\displaystyle\sum_{k=0}^{2}|P_{\phi}(S_{j}=k\mid y_{1}^{m})-P_{\phi}(S_{j}=k% \mid y_{-\infty}^{m})|∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) |
\displaystyle\leq 3i=L1+1j1exp{2τ0(y1i,y2i)}.3superscriptsubscriptproduct𝑖subscript𝐿11𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle 3\prod_{i=L_{1}+1}^{j-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}.3 ∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } .

Similarly, we also have

|Pϕ(Sj{0,1,2}ym)Pϕ(Sj{0,1,2}y)|\displaystyle|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{-\infty}^{m})-P_{\phi}(S_{j}% \in\{0,1,2\}\mid y_{-\infty}^{\infty})|\leq| italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) | ≤ 3i=j+1L21exp{2τ0(y1i,y2i)}.3superscriptsubscriptproduct𝑖𝑗1subscript𝐿212subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle 3\prod_{i=j+1}^{L_{2}-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}.3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } .

Therefore, (B.2) is true. Then we consider the expectations.

Eϕ0|Pϕ(Sj{0,1,2}y1m)Pϕ(Sj{0,1,2}y)|\displaystyle E_{\phi_{0}}|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi}(% S_{j}\in\{0,1,2\}\mid y_{-\infty}^{\infty})|italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT | italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) |
\displaystyle\leq Eϕ0[3i=L1+1j1exp{2τ0(y1i,y2i)}+3i=j+1L21exp{2τ0(y1i,y2i)}]subscript𝐸subscriptitalic-ϕ0delimited-[]3superscriptsubscriptproduct𝑖subscript𝐿11𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖3superscriptsubscriptproduct𝑖𝑗1subscript𝐿212subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle E_{\phi_{0}}\left[3\prod_{i=L_{1}+1}^{j-1}\exp\{-2\tau_{0}(y_{1i% },y_{2i})\}+3\prod_{i=j+1}^{L_{2}-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}\right]italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ 3 ∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } + 3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } ]
=\displaystyle== Eϕ0{Eϕ0[3i=L1+1j1exp{2τ0(y1i,y2i)}+3i=j+1L21exp{2τ0(y1i,y2i)}|S1,,Sm]}subscript𝐸subscriptitalic-ϕ0subscript𝐸subscriptitalic-ϕ0delimited-[]3superscriptsubscriptproduct𝑖subscript𝐿11𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖conditional3superscriptsubscriptproduct𝑖𝑗1subscript𝐿212subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖subscript𝑆1subscript𝑆𝑚\displaystyle E_{\phi_{0}}\left\{E_{\phi_{0}}\left[3\prod_{i=L_{1}+1}^{j-1}% \exp\{-2\tau_{0}(y_{1i},y_{2i})\}+3\prod_{i=j+1}^{L_{2}-1}\exp\{-2\tau_{0}(y_{% 1i},y_{2i})\}\bigg{|}S_{1},\ldots,S_{m}\right]\right\}italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ 3 ∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } + 3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } | italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ] }
=\displaystyle== Eϕ0{3i=L1+1j1Eϕ0[exp{2τ0(y1i,y2i)}Si]+3i=j+1L21Eϕ0[exp{2τ0(y1i,y2i)}Si]}.subscript𝐸subscriptitalic-ϕ03superscriptsubscriptproduct𝑖subscript𝐿11𝑗1subscript𝐸subscriptitalic-ϕ0delimited-[]conditional2subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖subscript𝑆𝑖3superscriptsubscriptproduct𝑖𝑗1subscript𝐿21subscript𝐸subscriptitalic-ϕ0delimited-[]conditional2subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖subscript𝑆𝑖\displaystyle E_{\phi_{0}}\left\{3\prod_{i=L_{1}+1}^{j-1}E_{\phi_{0}}\left[% \exp\{-2\tau_{0}(y_{1i},y_{2i})\}\mid S_{i}\right]+3\prod_{i=j+1}^{L_{2}-1}E_{% \phi_{0}}\left[\exp\{-2\tau_{0}(y_{1i},y_{2i})\}\mid S_{i}\right]\right\}.italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT { 3 ∏ start_POSTSUBSCRIPT italic_i = italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } ∣ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] + 3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } ∣ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] } .

By (C5) and the construction of τ0(Y1j,Y2j)subscript𝜏0subscript𝑌1𝑗subscript𝑌2𝑗\tau_{0}(Y_{1j},Y_{2j})italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ), we have Pϕ0(τ0(Y1j,Y2j)>0Sj=k)=1subscript𝑃subscriptitalic-ϕ0subscript𝜏0subscript𝑌1𝑗subscript𝑌2𝑗conditional0subscript𝑆𝑗𝑘1P_{\phi_{0}}(\tau_{0}(Y_{1j},Y_{2j})>0\mid S_{j}=k)=1italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_Y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) > 0 ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ) = 1 for all k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3. Let

β0=maxk=0,1,2,3Eϕ0[exp{2τ0(y11,y21)}S1=k],subscript𝛽0subscript𝑘0123subscript𝐸subscriptitalic-ϕ0delimited-[]conditional2subscript𝜏0subscript𝑦11subscript𝑦21subscript𝑆1𝑘\displaystyle\beta_{0}=\max_{k=0,1,2,3}E_{\phi_{0}}\left[\exp\{-2\tau_{0}(y_{1% 1},y_{21})\}\mid S_{1}=k\right],italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT italic_k = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ) } ∣ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_k ] , (B.8)

then we have β0<1subscript𝛽01\beta_{0}<1italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < 1. Therefore, for some C0>0subscript𝐶00C_{0}>0italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0,

Eϕ0|Pϕ(Sj{0,1,2}y1m)Pϕ(Sj{0,1,2}y)|C0β0L.\displaystyle E_{\phi_{0}}|P_{\phi}(S_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi}(% S_{j}\in\{0,1,2\}\mid y_{-\infty}^{\infty})|\leq C_{0}\beta_{0}^{L}.italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT | italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) | ≤ italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT . (B.9)

By Lévy’s upward theorem [Williams, 1991], Pϕ(S0{0,1,2}(y1j,y2j)NN)T0subscript𝑃italic-ϕsubscript𝑆0conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑁𝑁superscriptsubscript𝑇0P_{\phi}(S_{0}\in\{0,1,2\}\mid(y_{1j},y_{2j})_{-N}^{N})\rightarrow T_{0}^{\infty}italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ) → italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT almost surely as N𝑁N\rightarrow\inftyitalic_N → ∞. Next, we show that G(q/2)=Pϕ(T0q/2)>0superscript𝐺𝑞2subscript𝑃italic-ϕsuperscriptsubscript𝑇0𝑞20G^{\infty}(q/2)=P_{\phi}(T_{0}^{\infty}\leq q/2)>0italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) = italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 ) > 0. Note that

Pϕ(T0q/2)subscript𝑃italic-ϕsuperscriptsubscript𝑇0𝑞2\displaystyle P_{\phi}(T_{0}^{\infty}\leq q/2)italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 )
=\displaystyle== Pϕ(Pϕ(Sj{0,1,2}(y1j,y2j))Pϕ(Sj=3(y1j,y2j))q/21q/2)subscript𝑃italic-ϕsubscript𝑃italic-ϕsubscript𝑆𝑗conditional012superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗subscript𝑃italic-ϕsubscript𝑆𝑗conditional3superscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗𝑞21𝑞2\displaystyle P_{\phi}\left(\frac{P_{\phi}(S_{j}\in\{0,1,2\}\mid(y_{1j},y_{2j}% )_{-\infty}^{\infty})}{P_{\phi}(S_{j}=3\mid(y_{1j},y_{2j})_{-\infty}^{\infty})% }\leq\frac{q/2}{1-q/2}\right)italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( divide start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 3 ∣ ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG ≤ divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG )
=\displaystyle== Pϕ(k=02πkPϕ((y1j,y2j)Sj=k)π3Pϕ((y1j,y2j)Sj=3)q/21q/2)subscript𝑃italic-ϕsuperscriptsubscript𝑘02subscript𝜋𝑘subscript𝑃italic-ϕconditionalsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗subscript𝑆𝑗𝑘subscript𝜋3subscript𝑃italic-ϕconditionalsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗subscript𝑆𝑗3𝑞21𝑞2\displaystyle P_{\phi}\left(\frac{\sum_{k=0}^{2}\pi_{k}P_{\phi}((y_{1j},y_{2j}% )_{-\infty}^{\infty}\mid S_{j}=k)}{\pi_{3}P_{\phi}((y_{1j},y_{2j})_{-\infty}^{% \infty}\mid S_{j}=3)}\leq\frac{q/2}{1-q/2}\right)italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_k ) end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 3 ) end_ARG ≤ divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG )
=\displaystyle== Pϕ(k=02l1,l2=0,1,2,3Pϕ((y1j,y2j)1S1=l1)al1kakl2Pϕ((y1j,y2j)1S1=l2)l1,l2=0,1,2,3Pϕ((y1j,y2j)1S1=l1)al13a3l2Pϕ((y1j,y2j)1S1=l2)\displaystyle P_{\phi}\bigg{(}\sum_{k=0}^{2}\frac{\sum_{l_{1},l_{2}=0,1,2,3}P_% {\phi}((y_{1j},y_{2j})_{-\infty}^{-1}\mid S_{-1}=l_{1})a_{l_{1}k}a_{kl_{2}}P_{% \phi}((y_{1j},y_{2j})_{1}^{\infty}\mid S_{1}=l_{2})}{\sum_{l_{1},l_{2}=0,1,2,3% }P_{\phi}((y_{1j},y_{2j})_{-\infty}^{-1}\mid S_{-1}=l_{1})a_{l_{1}3}a_{3l_{2}}% P_{\phi}((y_{1j},y_{2j})_{1}^{\infty}\mid S_{1}=l_{2})}italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG ∑ start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 3 italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG
πkf(k)(y10,y20)π3f(3)(y10,y20)q/21q/2).\displaystyle\quad\cdot\frac{\pi_{k}f^{(k)}(y_{10},y_{20})}{\pi_{3}f^{(3)}(y_{% 10},y_{20})}\leq\frac{q/2}{1-q/2}\bigg{)}.⋅ divide start_ARG italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT ( 3 ) end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT ) end_ARG ≤ divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG ) .

By (C2), ε0akl1subscript𝜀0subscript𝑎𝑘𝑙1\varepsilon_{0}\leq a_{kl}\leq 1italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≤ italic_a start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT ≤ 1 for all k,l=0,1,2,3formulae-sequence𝑘𝑙0123k,l=0,1,2,3italic_k , italic_l = 0 , 1 , 2 , 3. Then we have

ε02l1,l2=0,1,2,3Pϕ((y1j,y2j)1S1=l1)al1kakl2Pϕ((y1j,y2j)1S1=l2)l1,l2=0,1,2,3Pϕ((y1j,y2j)1S1=l1)al13a3l2Pϕ((y1j,y2j)1S1=l2)ε02.superscriptsubscript𝜀02subscriptformulae-sequencesubscript𝑙1subscript𝑙20123subscript𝑃italic-ϕconditionalsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗1subscript𝑆1subscript𝑙1subscript𝑎subscript𝑙1𝑘subscript𝑎𝑘subscript𝑙2subscript𝑃italic-ϕconditionalsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗1subscript𝑆1subscript𝑙2subscriptformulae-sequencesubscript𝑙1subscript𝑙20123subscript𝑃italic-ϕconditionalsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗1subscript𝑆1subscript𝑙1subscript𝑎subscript𝑙13subscript𝑎3subscript𝑙2subscript𝑃italic-ϕconditionalsuperscriptsubscriptsubscript𝑦1𝑗subscript𝑦2𝑗1subscript𝑆1subscript𝑙2superscriptsubscript𝜀02\displaystyle\varepsilon_{0}^{2}\leq\frac{\sum_{l_{1},l_{2}=0,1,2,3}P_{\phi}((% y_{1j},y_{2j})_{-\infty}^{-1}\mid S_{-1}=l_{1})a_{l_{1}k}a_{kl_{2}}P_{\phi}((y% _{1j},y_{2j})_{1}^{\infty}\mid S_{1}=l_{2})}{\sum_{l_{1},l_{2}=0,1,2,3}P_{\phi% }((y_{1j},y_{2j})_{-\infty}^{-1}\mid S_{-1}=l_{1})a_{l_{1}3}a_{3l_{2}}P_{\phi}% ((y_{1j},y_{2j})_{1}^{\infty}\mid S_{1}=l_{2})}\leq\varepsilon_{0}^{-2}.italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ divide start_ARG ∑ start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0 , 1 , 2 , 3 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_a start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 3 italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG ≤ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT .

Consequently,

Pϕ(T0q/2)subscript𝑃italic-ϕsuperscriptsubscript𝑇0𝑞2absent\displaystyle P_{\phi}(T_{0}^{\infty}\leq q/2)\geqitalic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 ) ≥ Pϕ{ε02(π0π31f1(y10)f2(y20)+π1π31f2(y20)+π2π31f1(y10))q/21q/2}subscript𝑃italic-ϕsuperscriptsubscript𝜀02subscript𝜋0subscript𝜋31subscript𝑓1subscript𝑦10subscript𝑓2subscript𝑦20subscript𝜋1subscript𝜋31subscript𝑓2subscript𝑦20subscript𝜋2subscript𝜋31subscript𝑓1subscript𝑦10𝑞21𝑞2\displaystyle P_{\phi}\left\{\varepsilon_{0}^{-2}\left(\frac{\pi_{0}}{\pi_{3}}% \frac{1}{f_{1}(y_{10})f_{2}(y_{20})}+\frac{\pi_{1}}{\pi_{3}}\frac{1}{f_{2}(y_{% 20})}+\frac{\pi_{2}}{\pi_{3}}\frac{1}{f_{1}(y_{10})}\right)\leq\frac{q/2}{1-q/% 2}\right\}italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT { italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ( divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG divide start_ARG 1 end_ARG start_ARG italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT ) end_ARG + divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG divide start_ARG 1 end_ARG start_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT ) end_ARG + divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG divide start_ARG 1 end_ARG start_ARG italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ) end_ARG ) ≤ divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG }
=\displaystyle== Pϕ{ε02q/21q/2f1(y10)f2(y20)π1π3f1(y10)π2π3f2(y20)π0π30}.subscript𝑃italic-ϕsuperscriptsubscript𝜀02𝑞21𝑞2subscript𝑓1subscript𝑦10subscript𝑓2subscript𝑦20subscript𝜋1subscript𝜋3subscript𝑓1subscript𝑦10subscript𝜋2subscript𝜋3subscript𝑓2subscript𝑦20subscript𝜋0subscript𝜋30\displaystyle P_{\phi}\left\{\varepsilon_{0}^{2}\frac{q/2}{1-q/2}f_{1}(y_{10})% f_{2}(y_{20})-\frac{\pi_{1}}{\pi_{3}}f_{1}(y_{10})-\frac{\pi_{2}}{\pi_{3}}f_{2% }(y_{20})-\frac{\pi_{0}}{\pi_{3}}\geq 0\right\}.italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT { italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG ≥ 0 } .

By (C2), limy0f1(y)>csubscript𝑦0subscript𝑓1𝑦𝑐\lim_{y\rightarrow 0}f_{1}(y)>croman_lim start_POSTSUBSCRIPT italic_y → 0 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y ) > italic_c and limy0f2(y)>csubscript𝑦0subscript𝑓2𝑦𝑐\lim_{y\rightarrow 0}f_{2}(y)>croman_lim start_POSTSUBSCRIPT italic_y → 0 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y ) > italic_c. Moreover, two roots of the quadratic equation

g2(x)=ε02q/21q/2x2π1+π2π3xπ0π3=0subscript𝑔2𝑥superscriptsubscript𝜀02𝑞21𝑞2superscript𝑥2subscript𝜋1subscript𝜋2subscript𝜋3𝑥subscript𝜋0subscript𝜋30\displaystyle g_{2}(x)=\varepsilon_{0}^{2}\frac{q/2}{1-q/2}x^{2}-\frac{\pi_{1}% +\pi_{2}}{\pi_{3}}x-\frac{\pi_{0}}{\pi_{3}}=0italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) = italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_x - divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG = 0

are

x0(1)=superscriptsubscript𝑥01absent\displaystyle x_{0}^{(1)}=italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT = π1+π2(π1+π2)2+4ε02π0π3q/(2q)2ε02π3q/(2q)<0 andsubscript𝜋1subscript𝜋2superscriptsubscript𝜋1subscript𝜋224superscriptsubscript𝜀02subscript𝜋0subscript𝜋3𝑞2𝑞2superscriptsubscript𝜀02subscript𝜋3𝑞2𝑞0 and\displaystyle\frac{\pi_{1}+\pi_{2}-\sqrt{(\pi_{1}+\pi_{2})^{2}+4\varepsilon_{0% }^{2}\pi_{0}\pi_{3}q/(2-q)}}{2\varepsilon_{0}^{2}\pi_{3}q/(2-q)}<0\text{ and }divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - square-root start_ARG ( italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_q / ( 2 - italic_q ) end_ARG end_ARG start_ARG 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_q / ( 2 - italic_q ) end_ARG < 0 and
x0(2)=superscriptsubscript𝑥02absent\displaystyle x_{0}^{(2)}=italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT = π1+π2+(π1+π2)2+4ε02π0π3q/(2q)2ε02π3q/(2q)>0.subscript𝜋1subscript𝜋2superscriptsubscript𝜋1subscript𝜋224superscriptsubscript𝜀02subscript𝜋0subscript𝜋3𝑞2𝑞2superscriptsubscript𝜀02subscript𝜋3𝑞2𝑞0\displaystyle\frac{\pi_{1}+\pi_{2}+\sqrt{(\pi_{1}+\pi_{2})^{2}+4\varepsilon_{0% }^{2}\pi_{0}\pi_{3}q/(2-q)}}{2\varepsilon_{0}^{2}\pi_{3}q/(2-q)}>0.divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + square-root start_ARG ( italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_q / ( 2 - italic_q ) end_ARG end_ARG start_ARG 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_q / ( 2 - italic_q ) end_ARG > 0 .

By (C2), for each k=0,1,2,3𝑘0123k=0,1,2,3italic_k = 0 , 1 , 2 , 3, ε0πk13ε0subscript𝜀0subscript𝜋𝑘13subscript𝜀0\varepsilon_{0}\leq\pi_{k}\leq 1-3\varepsilon_{0}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≤ italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ 1 - 3 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Thus

x0(2)12ε0+(12ε0)2+4ε03(13ε0)q/(2q)2ε03q/(2q)=c,superscriptsubscript𝑥0212subscript𝜀0superscript12subscript𝜀024superscriptsubscript𝜀0313subscript𝜀0𝑞2𝑞2superscriptsubscript𝜀03𝑞2𝑞𝑐\displaystyle x_{0}^{(2)}\leq\frac{1-2\varepsilon_{0}+\sqrt{(1-2\varepsilon_{0% })^{2}+4\varepsilon_{0}^{3}(1-3\varepsilon_{0})q/(2-q)}}{2\varepsilon_{0}^{3}q% /(2-q)}=c,italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT ≤ divide start_ARG 1 - 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + square-root start_ARG ( 1 - 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 4 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( 1 - 3 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_q / ( 2 - italic_q ) end_ARG end_ARG start_ARG 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_q / ( 2 - italic_q ) end_ARG = italic_c ,

where c𝑐citalic_c is defined in (C2). Thus g2(c)>0subscript𝑔2𝑐0g_{2}(c)>0italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_c ) > 0 and

ε02q/21q/2cπ1π3>superscriptsubscript𝜀02𝑞21𝑞2𝑐subscript𝜋1subscript𝜋3absent\displaystyle\varepsilon_{0}^{2}\frac{q/2}{1-q/2}c-\frac{\pi_{1}}{\pi_{3}}>italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_c - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG > 1c(π2π3c+π0π3)>0, and1𝑐subscript𝜋2subscript𝜋3𝑐subscript𝜋0subscript𝜋30 and\displaystyle\frac{1}{c}\left(\frac{\pi_{2}}{\pi_{3}}c+\frac{\pi_{0}}{\pi_{3}}% \right)>0,\text{ and }divide start_ARG 1 end_ARG start_ARG italic_c end_ARG ( divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_c + divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG ) > 0 , and
ε02q/21q/2cπ2π3>superscriptsubscript𝜀02𝑞21𝑞2𝑐subscript𝜋2subscript𝜋3absent\displaystyle\varepsilon_{0}^{2}\frac{q/2}{1-q/2}c-\frac{\pi_{2}}{\pi_{3}}>italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_c - divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG > 1c(π1π3c+π0π3)>0.1𝑐subscript𝜋1subscript𝜋3𝑐subscript𝜋0subscript𝜋30\displaystyle\frac{1}{c}\left(\frac{\pi_{1}}{\pi_{3}}c+\frac{\pi_{0}}{\pi_{3}}% \right)>0.divide start_ARG 1 end_ARG start_ARG italic_c end_ARG ( divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_c + divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG ) > 0 .

By (C2), limx10f1(x1)>csubscriptsubscript𝑥10subscript𝑓1subscript𝑥1𝑐\lim_{x_{1}\to 0}f_{1}(x_{1})>croman_lim start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT → 0 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) > italic_c and limx20f2(x2)>csubscriptsubscript𝑥20subscript𝑓2subscript𝑥2𝑐\lim_{x_{2}\to 0}f_{2}(x_{2})>croman_lim start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT → 0 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) > italic_c. Since f1,f2subscript𝑓1subscript𝑓2f_{1},f_{2}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are continuous, there exist u1,u2(0,1)subscript𝑢1subscript𝑢201u_{1},u_{2}\in(0,1)italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ ( 0 , 1 ) such that f1(x1)>csubscript𝑓1subscript𝑥1𝑐f_{1}(x_{1})>citalic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) > italic_c and f2(x2)>csubscript𝑓2subscript𝑥2𝑐f_{2}(x_{2})>citalic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) > italic_c whenever 0<x1<u10subscript𝑥1subscript𝑢10<x_{1}<u_{1}0 < italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and 0<x2<u20subscript𝑥2subscript𝑢20<x_{2}<u_{2}0 < italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Consequently, for 0<x1<u10subscript𝑥1subscript𝑢10<x_{1}<u_{1}0 < italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and 0<x2<u20subscript𝑥2subscript𝑢20<x_{2}<u_{2}0 < italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT

ε02q/21q/2f2(x2)π1π3>ε02q/21q/2cπ1π3>0.superscriptsubscript𝜀02𝑞21𝑞2subscript𝑓2subscript𝑥2subscript𝜋1subscript𝜋3superscriptsubscript𝜀02𝑞21𝑞2𝑐subscript𝜋1subscript𝜋30\displaystyle\varepsilon_{0}^{2}\frac{q/2}{1-q/2}f_{2}(x_{2})-\frac{\pi_{1}}{% \pi_{3}}>\varepsilon_{0}^{2}\frac{q/2}{1-q/2}c-\frac{\pi_{1}}{\pi_{3}}>0.italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG > italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_c - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG > 0 .

Therefore,

ε02q/21q/2f1(x1)f2(x2)π1π3f1(x1)π2π3f2(x2)π0π3superscriptsubscript𝜀02𝑞21𝑞2subscript𝑓1subscript𝑥1subscript𝑓2subscript𝑥2subscript𝜋1subscript𝜋3subscript𝑓1subscript𝑥1subscript𝜋2subscript𝜋3subscript𝑓2subscript𝑥2subscript𝜋0subscript𝜋3\displaystyle\varepsilon_{0}^{2}\frac{q/2}{1-q/2}f_{1}(x_{1})f_{2}(x_{2})-% \frac{\pi_{1}}{\pi_{3}}f_{1}(x_{1})-\frac{\pi_{2}}{\pi_{3}}f_{2}(x_{2})-\frac{% \pi_{0}}{\pi_{3}}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG
=\displaystyle== {ε02q/21q/2f2(x2)π1π3}f1(x1)π2π3f2(x2)π0π3superscriptsubscript𝜀02𝑞21𝑞2subscript𝑓2subscript𝑥2subscript𝜋1subscript𝜋3subscript𝑓1subscript𝑥1subscript𝜋2subscript𝜋3subscript𝑓2subscript𝑥2subscript𝜋0subscript𝜋3\displaystyle\left\{\varepsilon_{0}^{2}\frac{q/2}{1-q/2}f_{2}(x_{2})-\frac{\pi% _{1}}{\pi_{3}}\right\}f_{1}(x_{1})-\frac{\pi_{2}}{\pi_{3}}f_{2}(x_{2})-\frac{% \pi_{0}}{\pi_{3}}{ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG } italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG
\displaystyle\geq {ε02q/21q/2f2(x2)π1π3}cπ2π3f2(x2)π0π3superscriptsubscript𝜀02𝑞21𝑞2subscript𝑓2subscript𝑥2subscript𝜋1subscript𝜋3𝑐subscript𝜋2subscript𝜋3subscript𝑓2subscript𝑥2subscript𝜋0subscript𝜋3\displaystyle\left\{\varepsilon_{0}^{2}\frac{q/2}{1-q/2}f_{2}(x_{2})-\frac{\pi% _{1}}{\pi_{3}}\right\}c-\frac{\pi_{2}}{\pi_{3}}f_{2}(x_{2})-\frac{\pi_{0}}{\pi% _{3}}{ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG } italic_c - divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG
=\displaystyle== {ε02q/21q/2cπ2π3}f2(x2)π1π3cπ0π3superscriptsubscript𝜀02𝑞21𝑞2𝑐subscript𝜋2subscript𝜋3subscript𝑓2subscript𝑥2subscript𝜋1subscript𝜋3𝑐subscript𝜋0subscript𝜋3\displaystyle\left\{\varepsilon_{0}^{2}\frac{q/2}{1-q/2}c-\frac{\pi_{2}}{\pi_{% 3}}\right\}f_{2}(x_{2})-\frac{\pi_{1}}{\pi_{3}}c-\frac{\pi_{0}}{\pi_{3}}{ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_c - divide start_ARG italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG } italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_c - divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG
\displaystyle\geq ε02q/21q/2c2π1+π2π3cπ0π3=g2(c)>0.superscriptsubscript𝜀02𝑞21𝑞2superscript𝑐2subscript𝜋1subscript𝜋2subscript𝜋3𝑐subscript𝜋0subscript𝜋3subscript𝑔2𝑐0\displaystyle\varepsilon_{0}^{2}\frac{q/2}{1-q/2}c^{2}-\frac{\pi_{1}+\pi_{2}}{% \pi_{3}}c-\frac{\pi_{0}}{\pi_{3}}=g_{2}(c)>0.italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_q / 2 end_ARG start_ARG 1 - italic_q / 2 end_ARG italic_c start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_π start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG italic_c - divide start_ARG italic_π start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG = italic_g start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_c ) > 0 .

Therefore, we have

Pϕ(T0q/2)subscript𝑃italic-ϕsuperscriptsubscript𝑇0𝑞2absent\displaystyle P_{\phi}(T_{0}^{\infty}\leq q/2)\geqitalic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 ) ≥ Pϕ{Y10(0,u1),Y20(0,u2)}>0.subscript𝑃italic-ϕformulae-sequencesubscript𝑌100subscript𝑢1subscript𝑌200subscript𝑢20\displaystyle P_{\phi}\left\{Y_{10}\in(0,u_{1}),Y_{20}\in(0,u_{2})\right\}>0.italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT { italic_Y start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ∈ ( 0 , italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , italic_Y start_POSTSUBSCRIPT 20 end_POSTSUBSCRIPT ∈ ( 0 , italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) } > 0 .

and thus we can conclude that Pϕ(T0q/2)>0subscript𝑃italic-ϕsuperscriptsubscript𝑇0𝑞20P_{\phi}(T_{0}^{\infty}\leq q/2)>0italic_P start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 ) > 0. Finally, we show that R/mG(q/2)𝑅𝑚superscript𝐺𝑞2R/m\geq G^{\infty}(q/2)italic_R / italic_m ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) and R^/mG(q/2)^𝑅𝑚superscript𝐺𝑞2\hat{R}/m\geq G^{\infty}(q/2)over^ start_ARG italic_R end_ARG / italic_m ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) almost surely as m𝑚m\to\inftyitalic_m → ∞. We consider the case that not all hypotheses are rejected. Recall (A.12). The threshold λ^ORsubscript^𝜆OR\hat{\lambda}_{\rm OR}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT satisfies λ^ORqsubscript^𝜆OR𝑞\hat{\lambda}_{\rm OR}\geq qover^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ≥ italic_q with probability 1111. It suffices to show that m1j=1mI(Tjq)G(q/2)superscript𝑚1superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗𝑞superscript𝐺𝑞2m^{-1}\sum_{j=1}^{m}I(T_{j}\leq q)\geq G^{\infty}(q/2)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_q ) ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) almost surely as m𝑚m\to\inftyitalic_m → ∞. Take Lm=mκsubscript𝐿𝑚superscript𝑚𝜅L_{m}=m^{\kappa}italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT, with κ(0,1)𝜅01\kappa\in(0,1)italic_κ ∈ ( 0 , 1 ). Lmsubscript𝐿𝑚L_{m}italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT satisfies Lm<m/2subscript𝐿𝑚𝑚2L_{m}<m/2italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT < italic_m / 2 when m𝑚mitalic_m is large enough. For any j𝑗jitalic_j satisfying Lm+1<j<mLm1subscript𝐿𝑚1𝑗𝑚subscript𝐿𝑚1L_{m}+1<j<m-L_{m}-1italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 < italic_j < italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1, by (B.2), we have

|TjTj|=subscript𝑇𝑗superscriptsubscript𝑇𝑗absent\displaystyle|T_{j}-T_{j}^{\infty}|=| italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | = |Pϕ0(sj{0,1,2}y1m)Pϕ0(sj{0,1,2}y)|\displaystyle|P_{\phi_{0}}(s_{j}\in\{0,1,2\}\mid y_{1}^{m})-P_{\phi_{0}}(s_{j}% \in\{0,1,2\}\mid y_{-\infty}^{\infty})|| italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) - italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { 0 , 1 , 2 } ∣ italic_y start_POSTSUBSCRIPT - ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) |
<\displaystyle<< 3i=jLm+1j1exp{2τ0(y1i,y2i)}+3i=j+1j+Lm1exp{2τ0(y1i,y2i)}3superscriptsubscriptproduct𝑖𝑗subscript𝐿𝑚1𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖3superscriptsubscriptproduct𝑖𝑗1𝑗subscript𝐿𝑚12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖\displaystyle 3\prod_{i=j-L_{m}+1}^{j-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}+3% \prod_{i=j+1}^{j+L_{m}-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}3 ∏ start_POSTSUBSCRIPT italic_i = italic_j - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } + 3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j + italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) }

with probability 1111. Define dj((y1i,y2i)1m)=3i=jLm+1j1exp{2τ0(y1i,y2i)}+3i=j+1j+Lm1exp{2τ0(y1i,y2i)}subscript𝑑𝑗superscriptsubscriptsubscript𝑦1𝑖subscript𝑦2𝑖1𝑚3superscriptsubscriptproduct𝑖𝑗subscript𝐿𝑚1𝑗12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖3superscriptsubscriptproduct𝑖𝑗1𝑗subscript𝐿𝑚12subscript𝜏0subscript𝑦1𝑖subscript𝑦2𝑖d_{j}((y_{1i},y_{2i})_{1}^{m})=3\prod_{i=j-L_{m}+1}^{j-1}\exp\{-2\tau_{0}(y_{1% i},y_{2i})\}+3\prod_{i=j+1}^{j+L_{m}-1}\exp\{-2\tau_{0}(y_{1i},y_{2i})\}italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) = 3 ∏ start_POSTSUBSCRIPT italic_i = italic_j - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) } + 3 ∏ start_POSTSUBSCRIPT italic_i = italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j + italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp { - 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) }. Then dj((y1i,y2i)1m)subscript𝑑𝑗superscriptsubscriptsubscript𝑦1𝑖subscript𝑦2𝑖1𝑚d_{j}((y_{1i},y_{2i})_{1}^{m})italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( ( italic_y start_POSTSUBSCRIPT 1 italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) is ergodic. Thus Brikhoff’s ergodic theorem [Birkhoff, 1931] gives that

1m2Lm1j=Lm+1mLm1I(dj>q/2)Pϕ0(d1>q/2) in probability.1𝑚2subscript𝐿𝑚1superscriptsubscript𝑗subscript𝐿𝑚1𝑚subscript𝐿𝑚1𝐼subscript𝑑𝑗𝑞2subscript𝑃subscriptitalic-ϕ0subscript𝑑1𝑞2 in probability\displaystyle\frac{1}{m-2L_{m}-1}\sum_{j=L_{m}+1}^{m-L_{m}-1}I(d_{j}>q/2)% \rightarrow P_{\phi_{0}}(d_{1}>q/2)\text{ in probability}.divide start_ARG 1 end_ARG start_ARG italic_m - 2 italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 end_ARG ∑ start_POSTSUBSCRIPT italic_j = italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT italic_I ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_q / 2 ) → italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_q / 2 ) in probability .

Moreover, Eϕ0[dj]<C0β0Lmsubscript𝐸subscriptitalic-ϕ0delimited-[]subscript𝑑𝑗subscript𝐶0superscriptsubscript𝛽0subscript𝐿𝑚E_{\phi_{0}}[d_{j}]<C_{0}\beta_{0}^{L_{m}}italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ] < italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT by the construction of β0subscript𝛽0\beta_{0}italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in (B.8). Then Markov’s inequality gives

Pϕ0(d1>q/2)Eϕ0[dj]q/20 as Lm=mκ.subscript𝑃subscriptitalic-ϕ0subscript𝑑1𝑞2subscript𝐸subscriptitalic-ϕ0delimited-[]subscript𝑑𝑗𝑞20 as subscript𝐿𝑚superscript𝑚𝜅\displaystyle P_{\phi_{0}}(d_{1}>q/2)\leq\frac{E_{\phi_{0}}[d_{j}]}{q/2}% \rightarrow 0\text{ as }L_{m}=m^{\kappa}\rightarrow\infty.italic_P start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_q / 2 ) ≤ divide start_ARG italic_E start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ] end_ARG start_ARG italic_q / 2 end_ARG → 0 as italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT → ∞ .

Thus

1mj=1mI(|TjTj|>q/2)2Lm+1m+1mj=Lm+1mLm1I(dj>q/2)0 in probability1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗superscriptsubscript𝑇𝑗𝑞22subscript𝐿𝑚1𝑚1𝑚superscriptsubscript𝑗subscript𝐿𝑚1𝑚subscript𝐿𝑚1𝐼subscript𝑑𝑗𝑞20 in probability\displaystyle\frac{1}{m}\sum_{j=1}^{m}I\left(|T_{j}-T_{j}^{\infty}|>q/2\right)% \leq\frac{2L_{m}+1}{m}+\frac{1}{m}\sum_{j=L_{m}+1}^{m-L_{m}-1}I(d_{j}>q/2)% \rightarrow 0\text{ in probability}divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( | italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | > italic_q / 2 ) ≤ divide start_ARG 2 italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_ARG start_ARG italic_m end_ARG + divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT italic_I ( italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > italic_q / 2 ) → 0 in probability

as m𝑚m\to\inftyitalic_m → ∞. We use the property that I(Tjq)+I(|TjTj|>q/2)I(Tjq/2)𝐼subscript𝑇𝑗𝑞𝐼subscript𝑇𝑗superscriptsubscript𝑇𝑗𝑞2𝐼superscriptsubscript𝑇𝑗𝑞2I(T_{j}\leq q)+I(|T_{j}-T_{j}^{\infty}|>q/2)\geq I(T_{j}^{\infty}\leq q/2)italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_q ) + italic_I ( | italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | > italic_q / 2 ) ≥ italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 ). Then

1mj=1mI(Tjq)+1mj=1mI(|TjTj|>q/2)1mj=1mI(Tjq/2).1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗𝑞1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗superscriptsubscript𝑇𝑗𝑞21𝑚superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑇𝑗𝑞2\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq q)+\frac{1}{m}\sum_{j=1}^{m}% I(|T_{j}-T_{j}^{\infty}|>q/2)\geq\frac{1}{m}\sum_{j=1}^{m}I(T_{j}^{\infty}\leq q% /2).divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_q ) + divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( | italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | > italic_q / 2 ) ≥ divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 ) .

By Birkhoff’s ergodic theorem, we have

1mj=1mI(Tjq/2)G(q/2) almost surely as m.1𝑚superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑇𝑗𝑞2superscript𝐺𝑞2 almost surely as 𝑚\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}^{\infty}\leq q/2)\rightarrow G^{% \infty}(q/2)\text{ almost surely as }m\to\infty.divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_q / 2 ) → italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) almost surely as italic_m → ∞ .

Then we have m1j=1mI(Tjq)G(q/2)superscript𝑚1superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗𝑞superscript𝐺𝑞2m^{-1}\sum_{j=1}^{m}I(T_{j}\leq q)\geq G^{\infty}(q/2)italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_q ) ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) almost surely. We have shown that G(t)>0superscript𝐺𝑡0G^{\infty}(t)>0italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ) > 0 for any t(0,1)𝑡01t\in(0,1)italic_t ∈ ( 0 , 1 ). Therefore, m1j=1mI(Tjq)G(q/2)>0superscript𝑚1superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗𝑞superscript𝐺𝑞20m^{-1}\sum_{j=1}^{m}I(T_{j}\leq q)\geq G^{\infty}(q/2)>0italic_m start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_q ) ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) > 0 almost surely, which means R/mG(q/2)𝑅𝑚superscript𝐺𝑞2R/m\geq G^{\infty}(q/2)italic_R / italic_m ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) almost surely. We can use a similar argument to show that R^/mG(q/2)^𝑅𝑚superscript𝐺𝑞2\hat{R}/m\geq G^{\infty}(q/2)over^ start_ARG italic_R end_ARG / italic_m ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) almost surely. The details are omitted. ∎

B.3 Proof of Lemma 3

Proof.

Note that R𝑅R\rightarrow\inftyitalic_R → ∞ almost surely as m𝑚m\to\inftyitalic_m → ∞ as shown in Lemma 2. The rejection criteria in (2.9) implies that

1Rj=1RT(j)q<1R+1j=1R+1T(j).1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗𝑞1𝑅1superscriptsubscript𝑗1𝑅1subscript𝑇𝑗\frac{1}{R}\sum_{j=1}^{R}T_{(j)}\leq q<\frac{1}{R+1}\sum_{j=1}^{R+1}T_{(j)}.divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_q < divide start_ARG 1 end_ARG start_ARG italic_R + 1 end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R + 1 end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT .

Note that as m𝑚m\rightarrow\inftyitalic_m → ∞,

E|1Rj=1RT(j)1R+1j=1R+1T(j)|=E|j=1R(T(j)T(R+1))R(R+1)|E|1R+1|0.𝐸1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗1𝑅1superscriptsubscript𝑗1𝑅1subscript𝑇𝑗𝐸superscriptsubscript𝑗1𝑅subscript𝑇𝑗subscript𝑇𝑅1𝑅𝑅1𝐸1𝑅10\displaystyle E\left|\frac{1}{R}\sum_{j=1}^{R}T_{(j)}-\frac{1}{R+1}\sum_{j=1}^% {R+1}T_{(j)}\right|=E\left|\frac{\sum_{j=1}^{R}\left(T_{(j)}-T_{(R+1)}\right)}% {R(R+1)}\right|\leq E\left|\frac{1}{R+1}\right|\rightarrow 0.italic_E | divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R + 1 end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R + 1 end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | = italic_E | divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT ) end_ARG start_ARG italic_R ( italic_R + 1 ) end_ARG | ≤ italic_E | divide start_ARG 1 end_ARG start_ARG italic_R + 1 end_ARG | → 0 .

Since

00absent\displaystyle 0\leq0 ≤ |1Rj=1RT(j)q||1Rj=1RT(j)1R+1j=1R+1T(j)|,1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗𝑞1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗1𝑅1superscriptsubscript𝑗1𝑅1subscript𝑇𝑗\displaystyle\left|\frac{1}{R}\sum_{j=1}^{R}T_{(j)}-q\right|\leq\left|\frac{1}% {R}\sum_{j=1}^{R}T_{(j)}-\frac{1}{R+1}\sum_{j=1}^{R+1}T_{(j)}\right|,| divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_q | ≤ | divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R + 1 end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R + 1 end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | ,

we have

E|1Rj=1RT(j)q|𝐸1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗𝑞absent\displaystyle E\left|\frac{1}{R}\sum_{j=1}^{R}T_{(j)}-q\right|\rightarrowitalic_E | divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_q | → 0 as m.0 as 𝑚\displaystyle 0\text{ as }m\rightarrow\infty.0 as italic_m → ∞ . (B.10)

We can use the same approach to show

E|1R^j=1R^T^(j)q|𝐸1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗𝑞absent\displaystyle E\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-q% \right|\rightarrowitalic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_q | → 0 as m.0 as 𝑚\displaystyle 0\text{ as }m\rightarrow\infty.0 as italic_m → ∞ . (B.11)

Therefore, combining (B.10) and (B.11), we have

E|1R^j=1R^T^(j)1Rj=1RT(j)|0 as m.𝐸1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗0 as 𝑚E\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{1}{R}\sum_{j=1% }^{R}T_{(j)}\right|\to 0\text{ as }m\rightarrow\infty.italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | → 0 as italic_m → ∞ . (B.12)

We finish the proof by contradiction. Assume that limmE|R/R^1|=0subscript𝑚𝐸𝑅^𝑅10\lim_{m\rightarrow\infty}E|R/\hat{R}-1|=0roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | = 0 does not hold, where R𝑅Ritalic_R is the total number of rejections by (2.9) when the total number of hypotheses is m𝑚mitalic_m and R^^𝑅\hat{R}over^ start_ARG italic_R end_ARG is the total number of rejections by (3.3) when the total number of hypotheses is m𝑚mitalic_m. Then there is ε1>0subscript𝜀10\varepsilon_{1}>0italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0 such that, for any M>0𝑀0M>0italic_M > 0, there exists some mM𝑚𝑀m\geq Mitalic_m ≥ italic_M satisfying E|R/R^1|>ε1𝐸𝑅^𝑅1subscript𝜀1E|R/\hat{R}-1|>\varepsilon_{1}italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | > italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Since

E|R/R^1|=𝐸𝑅^𝑅1absent\displaystyle E|R/\hat{R}-1|=italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | = E{(1R/R^)I(R^>R)}+E{(R/R^1)I(R>R^)},𝐸1𝑅^𝑅𝐼^𝑅𝑅𝐸𝑅^𝑅1𝐼𝑅^𝑅\displaystyle E\{(1-R/\hat{R})I(\hat{R}>R)\}+E\{(R/\hat{R}-1)I(R>\hat{R})\},italic_E { ( 1 - italic_R / over^ start_ARG italic_R end_ARG ) italic_I ( over^ start_ARG italic_R end_ARG > italic_R ) } + italic_E { ( italic_R / over^ start_ARG italic_R end_ARG - 1 ) italic_I ( italic_R > over^ start_ARG italic_R end_ARG ) } ,

E|R/R^1|>ε1𝐸𝑅^𝑅1subscript𝜀1E|R/\hat{R}-1|>\varepsilon_{1}italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | > italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT implies that either (i) E{(1R/R^)I(R^>R)}>ε1/2𝐸1𝑅^𝑅𝐼^𝑅𝑅subscript𝜀12E\{(1-R/\hat{R})I(\hat{R}>R)\}>\varepsilon_{1}/2italic_E { ( 1 - italic_R / over^ start_ARG italic_R end_ARG ) italic_I ( over^ start_ARG italic_R end_ARG > italic_R ) } > italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2, or (ii) E{(R/R^1)I(R>R^)}>ε1/2𝐸𝑅^𝑅1𝐼𝑅^𝑅subscript𝜀12E\{(R/\hat{R}-1)I(R>\hat{R})\}>\varepsilon_{1}/2italic_E { ( italic_R / over^ start_ARG italic_R end_ARG - 1 ) italic_I ( italic_R > over^ start_ARG italic_R end_ARG ) } > italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2. We first consider the case that (i) is true. Then E{(1R/R^)I(R^>R)}>ε1/2𝐸1𝑅^𝑅𝐼^𝑅𝑅subscript𝜀12E\{(1-R/\hat{R})I(\hat{R}>R)\}>\varepsilon_{1}/2italic_E { ( 1 - italic_R / over^ start_ARG italic_R end_ARG ) italic_I ( over^ start_ARG italic_R end_ARG > italic_R ) } > italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 and therefore the event E1={R^>R}subscript𝐸1^𝑅𝑅E_{1}=\{\hat{R}>R\}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { over^ start_ARG italic_R end_ARG > italic_R } has positive probability. On the event E1subscript𝐸1E_{1}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, we have

|1R^j=1R^T^(j)1Rj=1RT(j)|1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{1}% {R}\sum_{j=1}^{R}T_{(j)}\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT |
=\displaystyle== |1R^j=1R^T^(j)1R^j=1R^T(j)+1R^j=1R^T(j)1Rj=1RT(j)|1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{1}% {\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}+\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(% j)}-\frac{1}{R}\sum_{j=1}^{R}T_{(j)}\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT |
=\displaystyle== |1R^j=1R^(T^(j)T(j))+1R^(j=1R+j=R+1R^)T(j)1Rj=1RT(j)|1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1𝑅superscriptsubscript𝑗𝑅1^𝑅subscript𝑇𝑗1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\left(\hat{T}_{(j)}-T_% {(j)}\right)+\frac{1}{\hat{R}}\left(\sum_{j=1}^{R}+\sum_{j=R+1}^{\hat{R}}% \right)T_{(j)}-\frac{1}{R}\sum_{j=1}^{R}T_{(j)}\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT ( over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ) + divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ( ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_j = italic_R + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT ) italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT |
=\displaystyle== |1R^j=1R^(T^(j)T(j))+1R^j=R+1R^T(j)(1RR^)1Rj=1RT(j)|1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗subscript𝑇𝑗1^𝑅superscriptsubscript𝑗𝑅1^𝑅subscript𝑇𝑗1𝑅^𝑅1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\left(\hat{T}_{(j)}-T_% {(j)}\right)+\frac{1}{\hat{R}}\sum_{j=R+1}^{\hat{R}}T_{(j)}-\left(1-\frac{R}{% \hat{R}}\right)\frac{1}{R}\sum_{j=1}^{R}T_{(j)}\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT ( over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ) + divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = italic_R + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - ( 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ) divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT |
\displaystyle\geq |1R^j=R+1R^T(j)(1RR^)1Rj=1RT(j)||1R^j=1R^(T^(j)T(j))|1^𝑅superscriptsubscript𝑗𝑅1^𝑅subscript𝑇𝑗1𝑅^𝑅1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗subscript𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=R+1}^{\hat{R}}T_{(j)}-\left(1-% \frac{R}{\hat{R}}\right)\frac{1}{R}\sum_{j=1}^{R}T_{(j)}\right|-\left|\frac{1}% {\hat{R}}\sum_{j=1}^{\hat{R}}\left(\hat{T}_{(j)}-T_{(j)}\right)\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = italic_R + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - ( 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ) divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | - | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT ( over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ) | (B.13)
\displaystyle\geq |1RR^||T(R+1)1Rj=1RT(j)||1R^j=1R^T^(j)1R^j=1R^T(j)|,1𝑅^𝑅subscript𝑇𝑅11𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗\displaystyle\left|1-\frac{R}{\hat{R}}\right|\left|T_{\left(R+1\right)}-\frac{% 1}{R}\sum_{j=1}^{R}T_{(j)}\right|-\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}% \hat{T}_{(j)}-\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}\right|,| 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG | | italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | - | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | , (B.14)

where (B.13) holds due to triangle inequality |a+b||b||a|𝑎𝑏𝑏𝑎|a+b|\geq|b|-|a|| italic_a + italic_b | ≥ | italic_b | - | italic_a | and (B.14) holds because

1R^j=R+1R^T(j)(1RR^)1Rj=1RT(j)1^𝑅superscriptsubscript𝑗𝑅1^𝑅subscript𝑇𝑗1𝑅^𝑅1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle\frac{1}{\hat{R}}\sum_{j=R+1}^{\hat{R}}T_{(j)}-\left(1-\frac{R}{% \hat{R}}\right)\frac{1}{R}\sum_{j=1}^{R}T_{(j)}divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = italic_R + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - ( 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ) divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT
\displaystyle\geq 1R^j=R+1R^T(R+1)(1RR^)1Rj=1RT(j)1^𝑅superscriptsubscript𝑗𝑅1^𝑅subscript𝑇𝑅11𝑅^𝑅1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle\frac{1}{\hat{R}}\sum_{j=R+1}^{\hat{R}}T_{(R+1)}-\left(1-\frac{R}% {\hat{R}}\right)\frac{1}{R}\sum_{j=1}^{R}T_{(j)}divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = italic_R + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - ( 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ) divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT
=\displaystyle== R^RR^T(R+1)(1RR^)1Rj=1RT(j)^𝑅𝑅^𝑅subscript𝑇𝑅11𝑅^𝑅1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle\frac{\hat{R}-R}{\hat{R}}T_{(R+1)}-\left(1-\frac{R}{\hat{R}}% \right)\frac{1}{R}\sum_{j=1}^{R}T_{(j)}divide start_ARG over^ start_ARG italic_R end_ARG - italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - ( 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ) divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT
=\displaystyle== (1RR^)(T(R+1)1Rj=1RT(j))0.1𝑅^𝑅subscript𝑇𝑅11𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗0\displaystyle\left(1-\frac{R}{\hat{R}}\right)\left(T_{(R+1)}-\frac{1}{R}\sum_{% j=1}^{R}T_{(j)}\right)\geq 0.( 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ) ( italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ) ≥ 0 .

We next show that |T(R+1)R1j=1RT(j)|subscript𝑇𝑅1superscript𝑅1superscriptsubscript𝑗1𝑅subscript𝑇𝑗|T_{(R+1)}-R^{-1}\sum_{j=1}^{R}T_{(j)}|| italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - italic_R start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | in (B.14) is positive with probability 1111. Since the event {R1j=1RT(j)q}superscript𝑅1superscriptsubscript𝑗1𝑅subscript𝑇𝑗𝑞\{R^{-1}\sum_{j=1}^{R}T_{(j)}\leq q\}{ italic_R start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_q } has probability 1111, it suffices to show that

T(R+1)subscript𝑇𝑅1absent\displaystyle T_{(R+1)}\geqitalic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT ≥ λOR with probability approaching 1.superscriptsubscript𝜆OR with probability approaching 1.\displaystyle\lambda_{\mathrm{OR}}^{\infty}\text{ with probability approaching% $1$.}italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT with probability approaching 1 . (B.15)
λOR>superscriptsubscript𝜆ORabsent\displaystyle\lambda_{\rm OR}^{\infty}>italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT > q.𝑞\displaystyle q.italic_q . (B.16)

Since λ^OR>qsubscript^𝜆OR𝑞\hat{\lambda}_{\mathrm{OR}}>qover^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT > italic_q with probability 1111, for 0<γ<10𝛾10<\gamma<10 < italic_γ < 1 and Q^ORsubscript^𝑄OR\hat{Q}_{\rm OR}over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT defined in (B.1), we have

Q^OR(λ^OR)=subscript^𝑄ORsubscript^𝜆ORabsent\displaystyle\hat{Q}_{\mathrm{OR}}(\hat{\lambda}_{\mathrm{OR}})=over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ( over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) = 1Rj=1mTjI(Tjλ^OR)1𝑅superscriptsubscript𝑗1𝑚subscript𝑇𝑗𝐼subscript𝑇𝑗subscript^𝜆OR\displaystyle\frac{1}{R}\sum_{j=1}^{m}T_{j}I(T_{j}\leq\hat{\lambda}_{\rm OR})divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT )
=\displaystyle== 1Rj=1mTjI(Tjγq)+1Rj=1mTjI(γq<Tjλ^OR)1𝑅superscriptsubscript𝑗1𝑚subscript𝑇𝑗𝐼subscript𝑇𝑗𝛾𝑞1𝑅superscriptsubscript𝑗1𝑚subscript𝑇𝑗𝐼𝛾𝑞subscript𝑇𝑗subscript^𝜆OR\displaystyle\frac{1}{R}\sum_{j=1}^{m}T_{j}I(T_{j}\leq\gamma q)+\frac{1}{R}% \sum_{j=1}^{m}T_{j}I(\gamma q<T_{j}\leq\hat{\lambda}_{\mathrm{OR}})divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_γ italic_q ) + divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_I ( italic_γ italic_q < italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT )
\displaystyle\leq γqj=1mI(Tjγq)j=1mI(Tjλ^OR)+λ^ORj=1mI(γq<Tjλ^OR)j=1mI(Tjλ^OR).𝛾𝑞superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗𝛾𝑞superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗subscript^𝜆ORsubscript^𝜆ORsuperscriptsubscript𝑗1𝑚𝐼𝛾𝑞subscript𝑇𝑗subscript^𝜆ORsuperscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗subscript^𝜆OR\displaystyle\gamma q\frac{\sum_{j=1}^{m}I(T_{j}\leq\gamma q)}{\sum_{j=1}^{m}I% (T_{j}\leq\hat{\lambda}_{\mathrm{OR}})}+\hat{\lambda}_{\mathrm{OR}}\frac{\sum_% {j=1}^{m}I(\gamma q<T_{j}\leq\hat{\lambda}_{\mathrm{OR}})}{\sum_{j=1}^{m}I(T_{% j}\leq\hat{\lambda}_{\mathrm{OR}})}.italic_γ italic_q divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_γ italic_q ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) end_ARG + over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_γ italic_q < italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) end_ARG . (B.17)

Lemma 1 shows that λ^ORλORsuperscriptsubscript^𝜆ORsuperscriptsubscript𝜆OR\hat{\lambda}_{\mathrm{OR}}^{\infty}\rightarrow\lambda_{\mathrm{OR}}^{\infty}over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability, and the construction of Tjsubscript𝑇𝑗T_{j}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT gives that λ^ORλ^OR0subscript^𝜆ORsuperscriptsubscript^𝜆OR0\hat{\lambda}_{\rm OR}-\hat{\lambda}_{\rm OR}^{\infty}\rightarrow 0over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT - over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → 0 in probability. Therefore,

λ^ORλOR in probability as m.subscript^𝜆ORsuperscriptsubscript𝜆OR in probability as 𝑚\displaystyle\hat{\lambda}_{\rm OR}\to\lambda_{\rm OR}^{\infty}\text{ in % probability as }m\to\infty.over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability as italic_m → ∞ . (B.18)

Similarly, we also have

λ^rLISλOR in probability as m.subscript^𝜆rLISsuperscriptsubscript𝜆OR in probability as 𝑚\displaystyle\hat{\lambda}_{\rm rLIS}\to\lambda_{\rm OR}^{\infty}\text{ in % probability as }m\to\infty.over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT in probability as italic_m → ∞ . (B.19)

Combining (B.18) and (B.19), we have

λ^ORλ^rLIS0 in probability as m.subscript^𝜆ORsubscript^𝜆rLIS0 in probability as 𝑚\displaystyle\hat{\lambda}_{\rm OR}-\hat{\lambda}_{\rm rLIS}\to 0\text{ in % probability as }m\to\infty.over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT - over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT → 0 in probability as italic_m → ∞ . (B.20)

By the rejection criteria (2.8) and (2.9), T(R+1)>λ^ORsubscript𝑇𝑅1subscript^𝜆ORT_{(R+1)}>\hat{\lambda}_{\rm OR}italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT > over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT with probability 1111. Thus (B.15) holds. Moreover, for any ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, P(|λ^ORλOR|>ϵ)0𝑃subscript^𝜆ORsuperscriptsubscript𝜆ORitalic-ϵ0P(|\hat{\lambda}_{\rm OR}-\lambda_{\rm OR}^{\infty}|>\epsilon)\rightarrow 0italic_P ( | over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT - italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | > italic_ϵ ) → 0 as m𝑚m\rightarrow\inftyitalic_m → ∞. Then on the event |λ^ORλOR|ϵsubscript^𝜆ORsuperscriptsubscript𝜆ORitalic-ϵ|\hat{\lambda}_{\rm OR}-\lambda_{\rm OR}^{\infty}|\leq\epsilon| over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT - italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | ≤ italic_ϵ,

1mj=1mI(Tjλ^OR)1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗subscript^𝜆ORabsent\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\hat{\lambda}_{\rm OR})\leqdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) ≤ 1mj=1mI(TjλOR+ϵ),1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗superscriptsubscript𝜆ORitalic-ϵ\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\lambda_{\rm OR}^{\infty}+% \epsilon),divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ϵ ) ,
1mj=1mI(Tjλ^OR)1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗subscript^𝜆ORabsent\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\hat{\lambda}_{\rm OR})\geqdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) ≥ 1mj=1mI(TjλORϵ).1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗superscriptsubscript𝜆ORitalic-ϵ\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\lambda_{\rm OR}^{\infty}-% \epsilon).divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_ϵ ) .

Birkhoff’s ergodic theorem [Birkhoff, 1931] gives that as m𝑚m\rightarrow\inftyitalic_m → ∞,

1mj=1mI(TjλOR+ϵ)1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗superscriptsubscript𝜆ORitalic-ϵabsent\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\lambda_{\rm OR}^{\infty}+% \epsilon)\rightarrowdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ϵ ) → G(λOR+ϵ) almost surely, andsuperscript𝐺superscriptsubscript𝜆ORitalic-ϵ almost surely, and\displaystyle G^{\infty}(\lambda_{\rm OR}^{\infty}+\epsilon)\text{ almost % surely, and }italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT + italic_ϵ ) almost surely, and
1mj=1mI(TjλORϵ)1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗superscriptsubscript𝜆ORitalic-ϵabsent\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\lambda_{\rm OR}^{\infty}-% \epsilon)\rightarrowdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_ϵ ) → G(λORϵ) almost surely.superscript𝐺superscriptsubscript𝜆ORitalic-ϵ almost surely.\displaystyle G^{\infty}(\lambda_{\rm OR}^{\infty}-\epsilon)\text{ almost % surely.}italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_ϵ ) almost surely.

When ϵitalic-ϵ\epsilonitalic_ϵ tends to 00, the continuity of Gsuperscript𝐺G^{\infty}italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT gives that as m𝑚m\rightarrow\inftyitalic_m → ∞,

1mj=1mI(Tjλ^OR)G(λOR) almost surely.1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗subscript^𝜆ORsuperscript𝐺superscriptsubscript𝜆OR almost surely.\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\hat{\lambda}_{\rm OR})% \rightarrow G^{\infty}(\lambda_{\rm OR}^{\infty})\text{ almost surely.}divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) → italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) almost surely. (B.21)

Similarly,

1mj=1mI(T^jλ^rLIS)G(λOR) almost surely.1𝑚superscriptsubscript𝑗1𝑚𝐼subscript^𝑇𝑗subscript^𝜆rLISsuperscript𝐺superscriptsubscript𝜆OR almost surely.\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(\hat{T}_{j}\leq\hat{\lambda}_{\rm rLIS% })\rightarrow G^{\infty}(\lambda_{\rm OR}^{\infty})\text{ almost surely.}divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_rLIS end_POSTSUBSCRIPT ) → italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) almost surely. (B.22)

Moreover, by Birkhoff’s ergodic theorem [Birkhoff, 1931], we have

1mj=1mI(Tjγq)1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇𝑗𝛾𝑞absent\displaystyle\frac{1}{m}\sum_{j=1}^{m}I(T_{j}\leq\gamma q)\rightarrowdivide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_γ italic_q ) → G(γq) almost surely.superscript𝐺𝛾𝑞 almost surely.\displaystyle G^{\infty}(\gamma q)\text{ almost surely.}italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) almost surely. (B.23)

Combining (B.17), (B.21) and (B.23), we have

Q^OR(λ^OR)subscript^𝑄ORsubscript^𝜆ORabsent\displaystyle\hat{Q}_{\mathrm{OR}}(\hat{\lambda}_{\mathrm{OR}})\leqover^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ( over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) ≤ γqG(γq)G(λOR)+λORG(λOR)G(γq)G(λOR),𝛾𝑞superscript𝐺𝛾𝑞superscript𝐺superscriptsubscript𝜆ORsuperscriptsubscript𝜆ORsuperscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝛾𝑞superscript𝐺superscriptsubscript𝜆OR\displaystyle\gamma q\frac{G^{\infty}(\gamma q)}{G^{\infty}(\lambda_{\mathrm{% OR}}^{\infty})}+\lambda_{\mathrm{OR}}^{\infty}\frac{G^{\infty}(\lambda_{% \mathrm{OR}}^{\infty})-G^{\infty}(\gamma q)}{G^{\infty}(\lambda_{\mathrm{OR}}^% {\infty})},italic_γ italic_q divide start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG + italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG ,

with probability approaching 1111. Recall that Q^OR(λ^OR)=R1j=1RT(j)=q+op(1)subscript^𝑄ORsubscript^𝜆ORsuperscript𝑅1superscriptsubscript𝑗1𝑅subscript𝑇𝑗𝑞subscript𝑜𝑝1\hat{Q}_{\mathrm{OR}}(\hat{\lambda}_{\mathrm{OR}})=R^{-1}\sum_{j=1}^{R}T_{(j)}% =q+o_{p}(1)over^ start_ARG italic_Q end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ( over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT ) = italic_R start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT = italic_q + italic_o start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 1 ). Thus for any γ(0,1)𝛾01\gamma\in(0,1)italic_γ ∈ ( 0 , 1 ),

q𝑞absent\displaystyle q\leqitalic_q ≤ γqG(γq)G(λOR)+λORG(λOR)G(γq)G(λOR)+o(1).𝛾𝑞superscript𝐺𝛾𝑞superscript𝐺superscriptsubscript𝜆ORsuperscriptsubscript𝜆ORsuperscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝛾𝑞superscript𝐺superscriptsubscript𝜆OR𝑜1\displaystyle\gamma q\frac{G^{\infty}(\gamma q)}{G^{\infty}(\lambda_{\mathrm{% OR}}^{\infty})}+\lambda_{\mathrm{OR}}^{\infty}\frac{G^{\infty}(\lambda_{% \mathrm{OR}}^{\infty})-G^{\infty}(\gamma q)}{G^{\infty}(\lambda_{\mathrm{OR}}^% {\infty})}+o(1).italic_γ italic_q divide start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG + italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT divide start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG + italic_o ( 1 ) .

Equivlently, we have

λORsuperscriptsubscript𝜆ORabsent\displaystyle\lambda_{\mathrm{OR}}^{\infty}\geqitalic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≥ supγ(0,1){q+q(1γ)G(γq)G(λOR)G(γq)}>q.subscriptsupremum𝛾01𝑞𝑞1𝛾superscript𝐺𝛾𝑞superscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝛾𝑞𝑞\displaystyle\sup_{\gamma\in(0,1)}\left\{q+\frac{q(1-\gamma)G^{\infty}(\gamma q% )}{G^{\infty}(\lambda_{\mathrm{OR}}^{\infty})-G^{\infty}(\gamma q)}\right\}>q.roman_sup start_POSTSUBSCRIPT italic_γ ∈ ( 0 , 1 ) end_POSTSUBSCRIPT { italic_q + divide start_ARG italic_q ( 1 - italic_γ ) italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) end_ARG } > italic_q .

Since Gsuperscript𝐺G^{\infty}italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT is strictly increasing in (0,α*)0subscript𝛼(0,\alpha_{*})( 0 , italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ) and α*>λORsubscript𝛼superscriptsubscript𝜆OR\alpha_{*}>\lambda_{\rm OR}^{\infty}italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT > italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT, we have G(γq)>0superscript𝐺𝛾𝑞0G^{\infty}(\gamma q)>0italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) > 0 and G(λOR)G(γq)>0superscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝛾𝑞0G^{\infty}(\lambda_{\mathrm{OR}}^{\infty})-G^{\infty}(\gamma q)>0italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_γ italic_q ) > 0. Therefore, (B.16) holds. By (B.10), (B.15) and (B.16), we have

E(T(R+1)1Rj=1RT(R))𝐸subscript𝑇𝑅11𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑅\displaystyle E\left(T_{(R+1)}-\frac{1}{R}\sum_{j=1}^{R}T_{(R)}\right)italic_E ( italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT )
=\displaystyle== E(T(R+1)λOR)+(λORq)+E(q1Rj=1RT(R))𝐸subscript𝑇𝑅1superscriptsubscript𝜆ORsuperscriptsubscript𝜆OR𝑞𝐸𝑞1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑅\displaystyle E\left(T_{(R+1)}-\lambda_{\rm OR}^{\infty}\right)+(\lambda_{\rm OR% }^{\infty}-q)+E\left(q-\frac{1}{R}\sum_{j=1}^{R}T_{(R)}\right)italic_E ( italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) + ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_q ) + italic_E ( italic_q - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT )
\displaystyle\geq λORq as m.superscriptsubscript𝜆OR𝑞 as 𝑚\displaystyle\lambda_{\rm OR}^{\infty}-q\text{ as }m\to\infty.italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_q as italic_m → ∞ .

It implies that

T(R+1)1Rj=1RT(R)λORq with probability approaching 1.subscript𝑇𝑅11𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑅subscript𝜆OR𝑞 with probability approaching 1\displaystyle T_{(R+1)}-\frac{1}{R}\sum_{j=1}^{R}T_{(R)}\geq\lambda_{\rm OR}-q% \text{ with probability approaching }1.italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT ≥ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT - italic_q with probability approaching 1 . (B.24)

Next, we show

E|1R^j=1R^T^(j)1R^j=1R^T(j)|0 as m.𝐸1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗0 as 𝑚\displaystyle E\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{% 1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}\right|\to 0\text{ as }m\to\infty.italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | → 0 as italic_m → ∞ . (B.25)

Note that

|1R^j=1R^T(j)1R^j=1R^T^(j)|1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗absent\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}-\frac{1}{\hat{% R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}\right|\leq| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | ≤ |1R^j=1R^T(j)1R^j=1R^T(j)|1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅superscriptsubscript𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}-\frac{1}{\hat{% R}}\sum_{j=1}^{\hat{R}}T_{(j)}^{\infty}\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT |
+\displaystyle++ |1R^j=1R^T(j)1R^j=1R^T^(j)|1^𝑅superscriptsubscript𝑗1^𝑅superscriptsubscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅superscriptsubscript^𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}^{\infty}-\frac% {1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}^{\infty}\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT |
+\displaystyle++ |1R^j=1R^T^(j)1R^j=1R^T^(j)|.1^𝑅superscriptsubscript𝑗1^𝑅superscriptsubscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}^{\infty}% -\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}\right|.| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | . (B.26)

Denote SR^={j:TjT(R^)}subscript𝑆^𝑅conditional-set𝑗subscript𝑇𝑗subscript𝑇^𝑅S_{\hat{R}}=\{j:T_{j}\leq T_{(\hat{R})}\}italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT = { italic_j : italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT } and SR^={j:TjT(R^)}superscriptsubscript𝑆^𝑅conditional-set𝑗superscriptsubscript𝑇𝑗superscriptsubscript𝑇^𝑅S_{\hat{R}}^{\infty}=\{j:T_{j}^{\infty}\leq T_{(\hat{R})}^{\infty}\}italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = { italic_j : italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT }. Since jSR^TjjSR^Tjsubscript𝑗subscript𝑆^𝑅subscript𝑇𝑗subscript𝑗superscriptsubscript𝑆^𝑅subscript𝑇𝑗\sum_{j\in S_{\hat{R}}}T_{j}\leq\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and jSR^TjjSR^Tjsubscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗subscript𝑗subscript𝑆^𝑅superscriptsubscript𝑇𝑗\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{\infty}\leq\sum_{j\in S_{\hat{R}}}T_{j}% ^{\infty}∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT, we have

jSR^TjjSR^TjjSR^TjjSR^TjjSR^TjjSR^Tj.subscript𝑗subscript𝑆^𝑅subscript𝑇𝑗subscript𝑗subscript𝑆^𝑅superscriptsubscript𝑇𝑗subscript𝑗subscript𝑆^𝑅subscript𝑇𝑗subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗subscript𝑗superscriptsubscript𝑆^𝑅subscript𝑇𝑗subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗\displaystyle\sum_{j\in S_{\hat{R}}}T_{j}-\sum_{j\in S_{\hat{R}}}T_{j}^{\infty% }\leq\sum_{j\in S_{\hat{R}}}T_{j}-\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{% \infty}\leq\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}-\sum_{j\in S_{\hat{R}}^{% \infty}}T_{j}^{\infty}.∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT .

Therefore,

|jSR^TjjSR^Tj||jSR^TjjSR^Tj|+|jSR^TjjSR^Tj|.subscript𝑗subscript𝑆^𝑅subscript𝑇𝑗subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗subscript𝑗subscript𝑆^𝑅subscript𝑇𝑗subscript𝑗subscript𝑆^𝑅superscriptsubscript𝑇𝑗subscript𝑗superscriptsubscript𝑆^𝑅subscript𝑇𝑗subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗\displaystyle\left|\sum_{j\in S_{\hat{R}}}T_{j}-\sum_{j\in S_{\hat{R}}^{\infty% }}T_{j}^{\infty}\right|\leq\left|\sum_{j\in S_{\hat{R}}}T_{j}-\sum_{j\in S_{% \hat{R}}}T_{j}^{\infty}\right|+\left|\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}-% \sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{\infty}\right|.| ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | ≤ | ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | + | ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | . (B.27)

For Lm=mκsubscript𝐿𝑚superscript𝑚𝜅L_{m}=m^{\kappa}italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT with κ(0,1)𝜅01\kappa\in(0,1)italic_κ ∈ ( 0 , 1 ), denote

S~R^=subscript~𝑆^𝑅absent\displaystyle\tilde{S}_{\hat{R}}=over~ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT = {jSR^:Lm+1<j<mLm1}, andconditional-set𝑗subscript𝑆^𝑅subscript𝐿𝑚1𝑗𝑚subscript𝐿𝑚1, and\displaystyle\{j\in S_{\hat{R}}:L_{m}+1<j<m-L_{m}-1\}\text{, and}{ italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT : italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 < italic_j < italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 } , and
S~R^=superscriptsubscript~𝑆^𝑅absent\displaystyle\tilde{S}_{\hat{R}}^{\infty}=over~ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = {jSR^:Lm+1<j<mLm1}.conditional-set𝑗superscriptsubscript𝑆^𝑅subscript𝐿𝑚1𝑗𝑚subscript𝐿𝑚1\displaystyle\{j\in S_{\hat{R}}^{\infty}:L_{m}+1<j<m-L_{m}-1\}.{ italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT : italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 < italic_j < italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 } .

By (B.9), E|TjTj|C0β0Lm𝐸subscript𝑇𝑗superscriptsubscript𝑇𝑗subscript𝐶0superscriptsubscript𝛽0subscript𝐿𝑚E|T_{j}-T_{j}^{\infty}|\leq C_{0}\beta_{0}^{L_{m}}italic_E | italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | ≤ italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT for β0(0,1)subscript𝛽001\beta_{0}\in(0,1)italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ ( 0 , 1 ) if Lm+1<j<mLm1subscript𝐿𝑚1𝑗𝑚subscript𝐿𝑚1L_{m}+1<j<m-L_{m}-1italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 < italic_j < italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1. Whenever jLm+1𝑗subscript𝐿𝑚1j\leq L_{m}+1italic_j ≤ italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 or jmLm1𝑗𝑚subscript𝐿𝑚1j\geq m-L_{m}-1italic_j ≥ italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1, |TjTj|1subscript𝑇𝑗superscriptsubscript𝑇𝑗1|T_{j}-T_{j}^{\infty}|\leq 1| italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | ≤ 1. Note that Lemma 2 shows that R^/mG(q/2)^𝑅𝑚superscript𝐺𝑞2\hat{R}/m\geq G^{\infty}(q/2)over^ start_ARG italic_R end_ARG / italic_m ≥ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) almost surely as m𝑚m\to\inftyitalic_m → ∞. Therefore, by (B.27),

E|1R^jSR^Tj1R^jSR^Tj|𝐸1^𝑅subscript𝑗subscript𝑆^𝑅subscript𝑇𝑗1^𝑅subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗absent\displaystyle E\left|\frac{1}{\hat{R}}\sum_{j\in S_{\hat{R}}}T_{j}-\frac{1}{% \hat{R}}\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{\infty}\right|\leqitalic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | ≤ 2E(2Lm+1R^)+E|1R^jS~R^(TjTj)|+E|1R^jS~R^(TjTj)|2𝐸2subscript𝐿𝑚1^𝑅𝐸1^𝑅subscript𝑗subscript~𝑆^𝑅subscript𝑇𝑗superscriptsubscript𝑇𝑗𝐸1^𝑅subscript𝑗superscriptsubscript~𝑆^𝑅subscript𝑇𝑗superscriptsubscript𝑇𝑗\displaystyle 2E\left(\frac{2L_{m}+1}{\hat{R}}\right)+E\left|\frac{1}{\hat{R}}% \sum_{j\in\tilde{S}_{\hat{R}}}(T_{j}-T_{j}^{\infty})\right|+E\left|\frac{1}{% \hat{R}}\sum_{j\in\tilde{S}_{\hat{R}}^{\infty}}(T_{j}-T_{j}^{\infty})\right|2 italic_E ( divide start_ARG 2 italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ) + italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ over~ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) | + italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ over~ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) |
\displaystyle\leq 2(2mκ+1)mG(q/2)+2maxLm+1<j<mLm1E|TjTj|22superscript𝑚𝜅1𝑚superscript𝐺𝑞22subscriptsubscript𝐿𝑚1𝑗𝑚subscript𝐿𝑚1𝐸subscript𝑇𝑗superscriptsubscript𝑇𝑗\displaystyle\frac{2(2m^{\kappa}+1)}{mG^{\infty}(q/2)}+2\max_{L_{m}+1<j<m-L_{m% }-1}E|T_{j}-T_{j}^{\infty}|divide start_ARG 2 ( 2 italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT + 1 ) end_ARG start_ARG italic_m italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) end_ARG + 2 roman_max start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 < italic_j < italic_m - italic_L start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT italic_E | italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT |
\displaystyle\leq 2(2mκ+1)mG(q/2)+2C0β0mκ0 as m.22superscript𝑚𝜅1𝑚superscript𝐺𝑞22subscript𝐶0superscriptsubscript𝛽0superscript𝑚𝜅0 as 𝑚\displaystyle\frac{2(2m^{\kappa}+1)}{mG^{\infty}(q/2)}+2C_{0}\beta_{0}^{m^{% \kappa}}\to 0\text{ as }m\to\infty.divide start_ARG 2 ( 2 italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT + 1 ) end_ARG start_ARG italic_m italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) end_ARG + 2 italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT → 0 as italic_m → ∞ . (B.28)

Similarly, for S^R^={j:T^jT^(R^)}subscript^𝑆^𝑅conditional-set𝑗subscript^𝑇𝑗subscript^𝑇^𝑅\hat{S}_{\hat{R}}=\{j:\hat{T}_{j}\leq\hat{T}_{(\hat{R})}\}over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT = { italic_j : over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT } and S^R^={j:T^jT^(R^)}superscriptsubscript^𝑆^𝑅conditional-set𝑗superscriptsubscript^𝑇𝑗superscriptsubscript^𝑇^𝑅\hat{S}_{\hat{R}}^{\infty}=\{j:\hat{T}_{j}^{\infty}\leq\hat{T}_{(\hat{R})}^{% \infty}\}over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = { italic_j : over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT }, we have

E|1R^jS^R^T^j1R^jS^R^T^j|2(2mκ+1)mG(q/2)+2C0β0mκ0 as m.𝐸1^𝑅subscript𝑗superscriptsubscript^𝑆^𝑅superscriptsubscript^𝑇𝑗1^𝑅subscript𝑗subscript^𝑆^𝑅subscript^𝑇𝑗22superscript𝑚𝜅1𝑚superscript𝐺𝑞22subscript𝐶0superscriptsubscript𝛽0superscript𝑚𝜅0 as 𝑚\displaystyle E\left|\frac{1}{\hat{R}}\sum_{j\in\hat{S}_{\hat{R}}^{\infty}}% \hat{T}_{j}^{\infty}-\frac{1}{\hat{R}}\sum_{j\in\hat{S}_{\hat{R}}}\hat{T}_{j}% \right|\leq\frac{2(2m^{\kappa}+1)}{mG^{\infty}(q/2)}+2C_{0}\beta_{0}^{m^{% \kappa}}\to 0\text{ as }m\to\infty.italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ divide start_ARG 2 ( 2 italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT + 1 ) end_ARG start_ARG italic_m italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q / 2 ) end_ARG + 2 italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT → 0 as italic_m → ∞ . (B.29)

Furthermore, denote SR^={j:TjT(R^)}superscriptsubscript𝑆^𝑅conditional-set𝑗superscriptsubscript𝑇𝑗superscriptsubscript𝑇^𝑅S_{\hat{R}}^{\infty}=\{j:T_{j}^{\infty}\leq T_{(\hat{R})}^{\infty}\}italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = { italic_j : italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } and S^R^={j:T^jT^(R^)}superscriptsubscript^𝑆^𝑅conditional-set𝑗superscriptsubscript^𝑇𝑗superscriptsubscript^𝑇^𝑅\hat{S}_{\hat{R}}^{\infty}=\{j:\hat{T}_{j}^{\infty}\leq\hat{T}_{(\hat{R})}^{% \infty}\}over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = { italic_j : over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT }. By (B.22), we have R^/mG(λOR)^𝑅𝑚superscript𝐺superscriptsubscript𝜆OR\hat{R}/m\to G^{\infty}(\lambda_{\rm OR}^{\infty})over^ start_ARG italic_R end_ARG / italic_m → italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) almost surely as m𝑚m\to\inftyitalic_m → ∞. By Birkhoff’s ergodic theorem [Birkhoff, 1931], we have T(R^)λORsuperscriptsubscript𝑇^𝑅superscriptsubscript𝜆ORT_{(\hat{R})}^{\infty}\to\lambda_{\rm OR}^{\infty}italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT almost surely as m𝑚m\to\inftyitalic_m → ∞. Note that

1R^jSR^Tj=1^𝑅subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗absent\displaystyle\frac{1}{\hat{R}}\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{\infty}=divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT = 1R^jSR^TjI(TjT(R^)).1^𝑅subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗𝐼superscriptsubscript𝑇𝑗superscriptsubscript𝑇^𝑅\displaystyle\frac{1}{\hat{R}}\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{\infty}I(% T_{j}^{\infty}\leq T_{(\hat{R})}^{\infty}).divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) .

Birkhoff’s ergodic theorem [Birkhoff, 1931] gives that

E{1R^jSR^TjI(TjT(R^))}𝐸1^𝑅subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗𝐼superscriptsubscript𝑇𝑗superscriptsubscript𝑇^𝑅absent\displaystyle E\left\{\frac{1}{\hat{R}}\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{% \infty}I(T_{j}^{\infty}\leq T_{(\hat{R})}^{\infty})\right\}\toitalic_E { divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) } → E{T1T1λOR}=1G(λOR)0λORxdG(x) as m.𝐸conditional-setsuperscriptsubscript𝑇1superscriptsubscript𝑇1superscriptsubscript𝜆OR1superscript𝐺superscriptsubscript𝜆ORsuperscriptsubscript0superscriptsubscript𝜆OR𝑥differential-dsuperscript𝐺𝑥 as 𝑚\displaystyle E\{T_{1}^{\infty}\mid T_{1}^{\infty}\leq\lambda_{\rm OR}^{\infty% }\}=\frac{1}{G^{\infty}(\lambda_{\rm OR}^{\infty})}\int_{0}^{\lambda_{\rm OR}^% {\infty}}x{\rm d}G^{\infty}(x)\text{ as }m\to\infty.italic_E { italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ∣ italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } = divide start_ARG 1 end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_x roman_d italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_x ) as italic_m → ∞ .

Similarly, we have T^(R^)λORsuperscriptsubscript^𝑇^𝑅superscriptsubscript𝜆OR\hat{T}_{(\hat{R})}^{\infty}\to\lambda_{\rm OR}^{\infty}over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT → italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT almost surely as m𝑚m\to\inftyitalic_m → ∞ by Birkhoff’s ergodit theorem [Birkhoff, 1931]. Therefore,

E{1R^jS^R^T^j}1G(λOR)0λORxdG(x) as m.𝐸1^𝑅subscript𝑗superscriptsubscript^𝑆^𝑅superscriptsubscript^𝑇𝑗1superscript𝐺superscriptsubscript𝜆ORsuperscriptsubscript0superscriptsubscript𝜆OR𝑥differential-dsuperscript𝐺𝑥 as 𝑚\displaystyle E\left\{\frac{1}{\hat{R}}\sum_{j\in\hat{S}_{\hat{R}}^{\infty}}% \hat{T}_{j}^{\infty}\right\}\to\frac{1}{G^{\infty}(\lambda_{\rm OR}^{\infty})}% \int_{0}^{\lambda_{\rm OR}^{\infty}}x{\rm d}G^{\infty}(x)\text{ as }m\to\infty.italic_E { divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT } → divide start_ARG 1 end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_x roman_d italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_x ) as italic_m → ∞ .

Therefore, we have

E|1R^jSR^Tj1R^jS^R^T^j|0 as m.𝐸1^𝑅subscript𝑗superscriptsubscript𝑆^𝑅superscriptsubscript𝑇𝑗1^𝑅subscript𝑗superscriptsubscript^𝑆^𝑅superscriptsubscript^𝑇𝑗0 as 𝑚\displaystyle E\left|\frac{1}{\hat{R}}\sum_{j\in S_{\hat{R}}^{\infty}}T_{j}^{% \infty}-\frac{1}{\hat{R}}\sum_{j\in\hat{S}_{\hat{R}}^{\infty}}\hat{T}_{j}^{% \infty}\right|\to 0\text{ as }m\to\infty.italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j ∈ over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT | → 0 as italic_m → ∞ . (B.30)

Combining (B.26), (B.28), (B.29) and (B.30), we have (B.25). Then by (B.14), (B.24) and (B.25), for any M>0𝑀0M>0italic_M > 0, there exists some mM𝑚𝑀m\geq Mitalic_m ≥ italic_M satisfying

E|1R^j=1R^T^(j)1Rj=1RT(j)|𝐸1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗\displaystyle E\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{% 1}{R}\sum_{j=1}^{R}T_{(j)}\right|italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT |
\displaystyle\geq E{|1RR^|I(R^>R)|T(R+1)1Rj=1RT(j)|}E|1R^j=1R^T^(j)1R^j=1R^T(j)|𝐸1𝑅^𝑅𝐼^𝑅𝑅subscript𝑇𝑅11𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗𝐸1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗\displaystyle E\left\{\left|1-\frac{R}{\hat{R}}\right|\cdot I(\hat{R}>R)\cdot% \bigg{|}T_{(R+1)}-\frac{1}{R}\sum_{j=1}^{R}T_{(j)}\bigg{|}\right\}-E\bigg{|}% \frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{1}{\hat{R}}\sum_{j=1}% ^{\hat{R}}T_{(j)}\bigg{|}italic_E { | 1 - divide start_ARG italic_R end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG | ⋅ italic_I ( over^ start_ARG italic_R end_ARG > italic_R ) ⋅ | italic_T start_POSTSUBSCRIPT ( italic_R + 1 ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | } - italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT |
>\displaystyle>> ε12|λORq|+o(1).subscript𝜀12superscriptsubscript𝜆OR𝑞𝑜1\displaystyle\frac{\varepsilon_{1}}{2}\left|\lambda_{\mathrm{OR}}^{\infty}-q% \right|+o(1).divide start_ARG italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG | italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT - italic_q | + italic_o ( 1 ) .

This is contradictionary to (B.12). Therefore, (i) does not hold. Now consider the case when (ii) is true. In this case, E{(R/R^1)I(R>R^)}>ε1/2𝐸𝑅^𝑅1𝐼𝑅^𝑅subscript𝜀12E\{(R/\hat{R}-1)I(R>\hat{R})\}>\varepsilon_{1}/2italic_E { ( italic_R / over^ start_ARG italic_R end_ARG - 1 ) italic_I ( italic_R > over^ start_ARG italic_R end_ARG ) } > italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 and therefore the event E2={R/R^>1+ε1/2}subscript𝐸2𝑅^𝑅1subscript𝜀12E_{2}=\{R/\hat{R}>1+\varepsilon_{1}/2\}italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_R / over^ start_ARG italic_R end_ARG > 1 + italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 } has positive probability. By (B.11) and (B.25), we have

(1/R^)j=1R^T(j)=q with probability approaching 1.1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗𝑞 with probability approaching 1\displaystyle(1/\hat{R})\sum_{j=1}^{\hat{R}}T_{(j)}=q\text{ with probability % approaching $1$}.( 1 / over^ start_ARG italic_R end_ARG ) ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT = italic_q with probability approaching 1 . (B.31)

Thus T(R^+1)qsubscript𝑇^𝑅1𝑞T_{(\hat{R}+1)}\geq qitalic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG + 1 ) end_POSTSUBSCRIPT ≥ italic_q with probability 1111. Then we can use a similar method as (B.14) and obtain that on the event E2subscript𝐸2E_{2}italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT,

|1Rj=1RT(j)1R^j=1R^T^(j)|=1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗absent\displaystyle\left|\frac{1}{R}\sum_{j=1}^{R}T_{(j)}-\frac{1}{\hat{R}}\sum_{j=1% }^{\hat{R}}\hat{T}_{(j)}\right|=| divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | = |1R^j=1R^(T^(j)T(j))+(1R^R)(1R^j=1R^T(j)1RR^j=R^+1RT(j))|1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗subscript𝑇𝑗1^𝑅𝑅1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗1𝑅^𝑅superscriptsubscript𝑗^𝑅1𝑅subscript𝑇𝑗\displaystyle\left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\left(\hat{T}_{(j)}-T_% {(j)}\right)+\left(1-\frac{\hat{R}}{R}\right)\left(\frac{1}{\hat{R}}\sum_{j=1}% ^{\hat{R}}T_{(j)}-\frac{1}{R-\hat{R}}\sum_{j=\hat{R}+1}^{R}T_{(j)}\right)\right|| divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT ( over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ) + ( 1 - divide start_ARG over^ start_ARG italic_R end_ARG end_ARG start_ARG italic_R end_ARG ) ( divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_R - over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = over^ start_ARG italic_R end_ARG + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ) |
\displaystyle\geq |1R^R||1RR^j=R^+1RT(j)1R^j=1R^T(j)||1R^j=1R^T^(j)1R^j=1R^T(j)|.1^𝑅𝑅1𝑅^𝑅superscriptsubscript𝑗^𝑅1𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗\displaystyle\left|1-\frac{\hat{R}}{R}\right|\left|\frac{1}{R-\hat{R}}\sum_{j=% \hat{R}+1}^{R}T_{(j)}-\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}\right|-% \left|\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{1}{\hat{R}}\sum% _{j=1}^{\hat{R}}T_{(j)}\right|.| 1 - divide start_ARG over^ start_ARG italic_R end_ARG end_ARG start_ARG italic_R end_ARG | | divide start_ARG 1 end_ARG start_ARG italic_R - over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = over^ start_ARG italic_R end_ARG + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | - | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | .

Let η(q,λOR),𝒮1={j:T(R^+1)T(j)η}formulae-sequence𝜂𝑞superscriptsubscript𝜆ORsubscript𝒮1conditional-set𝑗subscript𝑇^𝑅1subscript𝑇𝑗𝜂\eta\in\left(q,\lambda_{\mathrm{OR}}^{\infty}\right),\mathcal{S}_{1}=\left\{j:% T_{(\hat{R}+1)}\leq T_{(j)}\leq\eta\right\}italic_η ∈ ( italic_q , italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) , caligraphic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { italic_j : italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG + 1 ) end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_η } and 𝒮2={j:η<T(j)T(R)}subscript𝒮2conditional-set𝑗𝜂subscript𝑇𝑗subscript𝑇𝑅\mathcal{S}_{2}=\left\{j:\eta<T_{(j)}\leq T_{(R)}\right\}caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_j : italic_η < italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT }. We know |𝒮1|+|𝒮2|=RR^subscript𝒮1subscript𝒮2𝑅^𝑅|\mathcal{S}_{1}|+|\mathcal{S}_{2}|=R-\hat{R}| caligraphic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | + | caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | = italic_R - over^ start_ARG italic_R end_ARG, where |||\cdot|| ⋅ | denotes the cardinality of a set. Since T(R^+1)qsubscript𝑇^𝑅1𝑞T_{(\hat{R}+1)}\geq qitalic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG + 1 ) end_POSTSUBSCRIPT ≥ italic_q with probability 1111, we have

1RR^j=R^+1RT(j)=1𝑅^𝑅superscriptsubscript𝑗^𝑅1𝑅subscript𝑇𝑗absent\displaystyle\frac{1}{R-\hat{R}}\sum_{j=\hat{R}+1}^{R}T_{(j)}=divide start_ARG 1 end_ARG start_ARG italic_R - over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = over^ start_ARG italic_R end_ARG + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT = 1RR^(jS1T(j)+jS2T(j))1𝑅^𝑅subscript𝑗subscript𝑆1subscript𝑇𝑗subscript𝑗subscript𝑆2subscript𝑇𝑗\displaystyle\frac{1}{R-\hat{R}}\left(\sum_{j\in S_{1}}T_{(j)}+\sum_{j\in S_{2% }}T_{(j)}\right)divide start_ARG 1 end_ARG start_ARG italic_R - over^ start_ARG italic_R end_ARG end_ARG ( ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT )
\displaystyle\geq 1RR^(|𝒮1|q+|𝒮2|(ηq+q)+op(1))1𝑅^𝑅subscript𝒮1𝑞subscript𝒮2𝜂𝑞𝑞subscript𝑜𝑝1\displaystyle\frac{1}{R-\hat{R}}\bigg{(}|\mathcal{S}_{1}|q+|\mathcal{S}_{2}|(% \eta-q+q)+o_{p}(1)\bigg{)}divide start_ARG 1 end_ARG start_ARG italic_R - over^ start_ARG italic_R end_ARG end_ARG ( | caligraphic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_q + | caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ( italic_η - italic_q + italic_q ) + italic_o start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 1 ) )
=\displaystyle== q+|𝒮2|RR^(ηq)+op(1).𝑞subscript𝒮2𝑅^𝑅𝜂𝑞subscript𝑜𝑝1\displaystyle q+\frac{\left|\mathcal{S}_{2}\right|}{R-\hat{R}}(\eta-q)+o_{p}(1).italic_q + divide start_ARG | caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | end_ARG start_ARG italic_R - over^ start_ARG italic_R end_ARG end_ARG ( italic_η - italic_q ) + italic_o start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 1 ) .

We apply the ergodic theorem [Birkhoff, 1931] and continuity of Gsuperscript𝐺G^{\infty}italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT to obtain

1m|𝒮2|=1𝑚subscript𝒮2absent\displaystyle\frac{1}{m}\left|\mathcal{S}_{2}\right|=divide start_ARG 1 end_ARG start_ARG italic_m end_ARG | caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | = 1mj=1mI(η<TjT(R))G(λOR)G(η) almost surely1𝑚superscriptsubscript𝑗1𝑚𝐼𝜂subscript𝑇𝑗subscript𝑇𝑅superscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝜂 almost surely\displaystyle\frac{1}{m}\sum_{j=1}^{m}I\left(\eta<T_{j}\leq T_{(R)}\right)% \rightarrow G^{\infty}\left(\lambda_{\mathrm{OR}}^{\infty}\right)-G^{\infty}(% \eta)\text{ almost surely}divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_η < italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT ) → italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_η ) almost surely

as m.𝑚m\to\infty.italic_m → ∞ . Since T(R)λORsubscript𝑇𝑅superscriptsubscript𝜆ORT_{(R)}\leq\lambda_{\rm OR}^{\infty}italic_T start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT with probability 1111 and T(R^+1)qsubscript𝑇^𝑅1𝑞T_{(\hat{R}+1)}\geq qitalic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG + 1 ) end_POSTSUBSCRIPT ≥ italic_q with probability, we have

1m(RR^)=1mj=1mI(T(R^+1)TjT(R))G(λOR)G(q) almost surely1𝑚𝑅^𝑅1𝑚superscriptsubscript𝑗1𝑚𝐼subscript𝑇^𝑅1subscript𝑇𝑗subscript𝑇𝑅superscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝑞 almost surely\frac{1}{m}\left(R-\hat{R}\right)=\frac{1}{m}\sum_{j=1}^{m}I\left(T_{(\hat{R}+% 1)}\leq T_{j}\leq T_{(R)}\right)\leq G^{\infty}\left(\lambda_{\mathrm{OR}}^{% \infty}\right)-G^{\infty}(q)\text{ almost surely}divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ( italic_R - over^ start_ARG italic_R end_ARG ) = divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I ( italic_T start_POSTSUBSCRIPT ( over^ start_ARG italic_R end_ARG + 1 ) end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT ( italic_R ) end_POSTSUBSCRIPT ) ≤ italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q ) almost surely

as m.𝑚m\to\infty.italic_m → ∞ . Since |𝒮2|/(RR^)1subscript𝒮2𝑅^𝑅1|\mathcal{S}_{2}|/(R-\hat{R})\leq 1| caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | / ( italic_R - over^ start_ARG italic_R end_ARG ) ≤ 1, the continuous mapping theorem gives that

1RR^j=R^+1RT(j)q+G(λOR)G(η)G(λOR)G(q)(ηq) almost surely1𝑅^𝑅superscriptsubscript𝑗^𝑅1𝑅subscript𝑇𝑗𝑞superscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝜂superscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝑞𝜂𝑞 almost surely\displaystyle\frac{1}{R-\hat{R}}\sum_{j=\hat{R}+1}^{R}T_{(j)}\geq q+\frac{G^{% \infty}\left(\lambda_{\mathrm{OR}}^{\infty}\right)-G^{\infty}(\eta)}{G^{\infty% }\left(\lambda_{\mathrm{OR}}^{\infty}\right)-G^{\infty}(q)}(\eta-q)\text{ % almost surely}divide start_ARG 1 end_ARG start_ARG italic_R - over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = over^ start_ARG italic_R end_ARG + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≥ italic_q + divide start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_η ) end_ARG start_ARG italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_q ) end_ARG ( italic_η - italic_q ) almost surely (B.32)

as m.𝑚m\to\infty.italic_m → ∞ . Denote ν0=[{G(λOR)G(η)}/{G(λOR)G(q)}](ηq)subscript𝜈0delimited-[]superscript𝐺superscriptsubscript𝜆ORsuperscript𝐺𝜂𝐺superscriptsubscript𝜆OR𝐺𝑞𝜂𝑞\nu_{0}=\left[\left\{G^{\infty}\left(\lambda_{\mathrm{OR}}^{\infty}\right)-G^{% \infty}(\eta)\right\}/\left\{G\left(\lambda_{\mathrm{OR}}^{\infty}\right)-G(q)% \right\}\right](\eta-q)italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ { italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_η ) } / { italic_G ( italic_λ start_POSTSUBSCRIPT roman_OR end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) - italic_G ( italic_q ) } ] ( italic_η - italic_q ). Note that G(t)superscript𝐺𝑡G^{\infty}(t)italic_G start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_t ), the culmulative distribution function of Tjsuperscriptsubscript𝑇𝑗T_{j}^{\infty}italic_T start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT, is strictly increasing in t𝑡titalic_t over the interval (0,α*)0subscript𝛼\left(0,\alpha_{*}\right)( 0 , italic_α start_POSTSUBSCRIPT * end_POSTSUBSCRIPT ). It implies that ν0>0subscript𝜈00\nu_{0}>0italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0. Hence by (B.31) and (B.32), we have

|1Rj=1RT(j)1R^j=1R^T^(j)||1R^R|ν0|1R^j=1R^T^(j)1R^j=1R^T(j)|.1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅𝑅subscript𝜈01^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗\displaystyle\left|\frac{1}{R}\sum_{j=1}^{R}T_{(j)}-\frac{1}{\hat{R}}\sum_{j=1% }^{\hat{R}}\hat{T}_{(j)}\right|\geq\left|1-\frac{\hat{R}}{R}\right|\nu_{0}-% \bigg{|}\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac{1}{\hat{R}}% \sum_{j=1}^{\hat{R}}T_{(j)}\bigg{|}.| divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | ≥ | 1 - divide start_ARG over^ start_ARG italic_R end_ARG end_ARG start_ARG italic_R end_ARG | italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | . (B.33)

By (B.25), we take expectations on both sides of (B.33) to get

E|1Rj=1RT(j)1R^j=1R^T^(j)|𝐸1𝑅superscriptsubscript𝑗1𝑅subscript𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗absent\displaystyle E\left|\frac{1}{R}\sum_{j=1}^{R}T_{(j)}-\frac{1}{\hat{R}}\sum_{j% =1}^{\hat{R}}\hat{T}_{(j)}\right|\geqitalic_E | divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT | ≥ E{|1R^R|I(E2)}ν0E|1R^j=1R^T^(j)1R^j=1R^T(j)|𝐸1^𝑅𝑅𝐼subscript𝐸2subscript𝜈0𝐸1^𝑅superscriptsubscript𝑗1^𝑅subscript^𝑇𝑗1^𝑅superscriptsubscript𝑗1^𝑅subscript𝑇𝑗\displaystyle E\left\{\left|1-\frac{\hat{R}}{R}\right|\cdot I(E_{2})\right\}% \cdot\nu_{0}-E\bigg{|}\frac{1}{\hat{R}}\sum_{j=1}^{\hat{R}}\hat{T}_{(j)}-\frac% {1}{\hat{R}}\sum_{j=1}^{\hat{R}}T_{(j)}\bigg{|}italic_E { | 1 - divide start_ARG over^ start_ARG italic_R end_ARG end_ARG start_ARG italic_R end_ARG | ⋅ italic_I ( italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) } ⋅ italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_E | divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT over^ start_ARG italic_T end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_R end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG end_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT |
=\displaystyle== E{1R^/RE2}P(E2)ν0+o(1)𝐸conditional-set1^𝑅𝑅subscript𝐸2𝑃subscript𝐸2subscript𝜈0𝑜1\displaystyle E\left\{1-\hat{R}/R\mid E_{2}\right\}\cdot P(E_{2})\cdot\nu_{0}+% o(1)italic_E { 1 - over^ start_ARG italic_R end_ARG / italic_R ∣ italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } ⋅ italic_P ( italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ⋅ italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_o ( 1 )
\displaystyle\geq ν0ε1/21+ε1/2P(E2)+o(1)>0.subscript𝜈0subscript𝜀121subscript𝜀12𝑃subscript𝐸2𝑜10\displaystyle\frac{\nu_{0}\varepsilon_{1}/2}{1+\varepsilon_{1}/2}\cdot P(E_{2}% )+o(1)>0.divide start_ARG italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG start_ARG 1 + italic_ε start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 2 end_ARG ⋅ italic_P ( italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + italic_o ( 1 ) > 0 .

The result is contradictory to (B.12). Therefore, (ii) does not hold either. We have shown that neither (i) nor (ii) holds, which implies that limmE|R/R^1|=0subscript𝑚𝐸𝑅^𝑅10\lim_{m\rightarrow\infty}E|R/\hat{R}-1|=0roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT italic_E | italic_R / over^ start_ARG italic_R end_ARG - 1 | = 0. Similarly, we can obtain that limmE|V/V^1|=0subscript𝑚𝐸𝑉^𝑉10\lim_{m\rightarrow\infty}E|V/\hat{V}-1|=0roman_lim start_POSTSUBSCRIPT italic_m → ∞ end_POSTSUBSCRIPT italic_E | italic_V / over^ start_ARG italic_V end_ARG - 1 | = 0. The details are omitted. ∎

Appendix C Competing methods

We compare the FDR and power of our method with several replicability analysis methods, including ad hoc BH, MaxP [Benjamini et al., 2009] radjust method [Bogomolov and Heller, 2018], JUMP [Lyu et al., 2023] and STAREG [Li et al., 2023].

C.1 The ad hoc BH method

BH [Benjamini and Hochberg, 1995] is the most popular multiple testing procedure that conservatively controls the false discovery rate for m𝑚mitalic_m independent or positively dependent tests. In study i,i=1,2formulae-sequence𝑖𝑖12i,\ i=1,2italic_i , italic_i = 1 , 2, the BH procedure proceeds as follows.

  • Step 1. Let yi(1)yi(2)yi(m)subscript𝑦𝑖1subscript𝑦𝑖2subscript𝑦𝑖𝑚y_{i(1)}\leq y_{i(2)}\leq\dots\leq y_{i(m)}italic_y start_POSTSUBSCRIPT italic_i ( 1 ) end_POSTSUBSCRIPT ≤ italic_y start_POSTSUBSCRIPT italic_i ( 2 ) end_POSTSUBSCRIPT ≤ ⋯ ≤ italic_y start_POSTSUBSCRIPT italic_i ( italic_m ) end_POSTSUBSCRIPT be the ordered p𝑝pitalic_p-values, and denote by Hi(j)subscript𝐻𝑖𝑗H_{i(j)}italic_H start_POSTSUBSCRIPT italic_i ( italic_j ) end_POSTSUBSCRIPT the corresponding hypothesis;

  • Step 2. Find the largest k𝑘kitalic_k such that yi(k)kq/msubscript𝑦𝑖𝑘𝑘𝑞𝑚y_{i(k)}\leq kq/mitalic_y start_POSTSUBSCRIPT italic_i ( italic_k ) end_POSTSUBSCRIPT ≤ italic_k italic_q / italic_m, i.e., k^=max{1km:yi(k)kq/m}^𝑘maxconditional-set1𝑘𝑚subscript𝑦𝑖𝑘𝑘𝑞𝑚\hat{k}=\mbox{max}\{1\leq k\leq m:y_{i(k)}\leq kq/m\}over^ start_ARG italic_k end_ARG = max { 1 ≤ italic_k ≤ italic_m : italic_y start_POSTSUBSCRIPT italic_i ( italic_k ) end_POSTSUBSCRIPT ≤ italic_k italic_q / italic_m }, and k^=0^𝑘0\hat{k}=0over^ start_ARG italic_k end_ARG = 0 if the set is empty;

  • Step 3. Reject Hi(j),j=1,,k^formulae-sequencesubscript𝐻𝑖𝑗𝑗1^𝑘H_{i(j)},j=1,\dots,\hat{k}italic_H start_POSTSUBSCRIPT italic_i ( italic_j ) end_POSTSUBSCRIPT , italic_j = 1 , … , over^ start_ARG italic_k end_ARG.

The ad hoc BH method for replicability analysis identifies SNPs rejected by both studies as replicable SNPs.

C.2 The MaxP method

Define the maximum of p𝑝pitalic_p-values as

yjmax=max(y1j,y2j),j=1,,m.formulae-sequencesuperscriptsubscript𝑦𝑗maxsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚y_{j}^{\max}=\mbox{max}(y_{1j},y_{2j}),j=1,\dots,m.italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT = max ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , italic_j = 1 , … , italic_m .

Note that yjmaxsuperscriptsubscript𝑦𝑗y_{j}^{\max}italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT follows a super-uniform distribution under the replicability null. The MaxP method directly applies BH [Benjamini and Hochberg, 1995] to yjmax,j=1,,mformulae-sequencesuperscriptsubscript𝑦𝑗𝑗1𝑚y_{j}^{\max},j=1,\dots,mitalic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT , italic_j = 1 , … , italic_m for false discovery rate control.

C.3 The radjust procedure

The radjust procedure [Bogomolov and Heller, 2018] works as follows.

  • Step 1. For a pre-specified false discovery rate level q𝑞qitalic_q, compute

    R=max[r:j𝒮1𝒮2I{(y1j,y2j)(rq2|𝒮2|,rq2|𝒮1|)}=r],R=\mbox{max}\left[r:\sum_{j\in\mathcal{S}_{1}\cap\mathcal{S}_{2}}I\left\{(y_{1% j},y_{2j})\leq\left(\frac{rq}{2|\mathcal{S}_{2}|},\frac{rq}{2|\mathcal{S}_{1}|% }\right)\right\}=r\right],italic_R = max [ italic_r : ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∩ caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I { ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) ≤ ( divide start_ARG italic_r italic_q end_ARG start_ARG 2 | caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | end_ARG , divide start_ARG italic_r italic_q end_ARG start_ARG 2 | caligraphic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | end_ARG ) } = italic_r ] ,

    where 𝒮isubscript𝒮𝑖\mathcal{S}_{i}caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the set of features pre-selected in study i𝑖iitalic_i for i=1,2𝑖12i=1,2italic_i = 1 , 2. By default, it selects features with p𝑝pitalic_p-values less than or equal to q/2𝑞2q/2italic_q / 2.

  • Step 2. Reject features with indices in the set

    ={j:(y1j,y2j)(Rq2|𝒮2|,Rq2|S1|),j𝒮1𝒮2}.conditional-set𝑗formulae-sequencesubscript𝑦1𝑗subscript𝑦2𝑗𝑅𝑞2subscript𝒮2𝑅𝑞2subscript𝑆1𝑗subscript𝒮1subscript𝒮2\mathcal{R}=\left\{j:(y_{1j},y_{2j})\leq\left(\frac{Rq}{2|\mathcal{S}_{2}|},% \frac{Rq}{2|S_{1}|}\right),j\in\mathcal{S}_{1}\cap\mathcal{S}_{2}\right\}.caligraphic_R = { italic_j : ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) ≤ ( divide start_ARG italic_R italic_q end_ARG start_ARG 2 | caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | end_ARG , divide start_ARG italic_R italic_q end_ARG start_ARG 2 | italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | end_ARG ) , italic_j ∈ caligraphic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∩ caligraphic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } .

In this paper, we implement an adaptive version of the radjust procedure Bogomolov and Heller [2018] in the simulations, which first estimates the fractions of true null hypotheses among the pre-selected features. The fractions in the two studies are estimated as follows.

π^0(1)=1+j𝒮2,qI(y1j>q)|𝒮2,q|(1q),π^0(2)=1+j𝒮1,qI(y2j>q)|𝒮1,q|(1q),formulae-sequencesuperscriptsubscript^𝜋011subscript𝑗subscript𝒮2𝑞𝐼subscript𝑦1𝑗𝑞subscript𝒮2𝑞1𝑞superscriptsubscript^𝜋021subscript𝑗subscript𝒮1𝑞𝐼subscript𝑦2𝑗𝑞subscript𝒮1𝑞1𝑞\hat{\pi}_{0}^{(1)}=\frac{1+\sum_{j\in\mathcal{S}_{2,q}}I(y_{1j}>q)}{|\mathcal% {S}_{2,q}|(1-q)},\quad\hat{\pi}_{0}^{(2)}=\frac{1+\sum_{j\in\mathcal{S}_{1,q}}% I(y_{2j}>q)}{|\mathcal{S}_{1,q}|(1-q)},over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT = divide start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_S start_POSTSUBSCRIPT 2 , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT > italic_q ) end_ARG start_ARG | caligraphic_S start_POSTSUBSCRIPT 2 , italic_q end_POSTSUBSCRIPT | ( 1 - italic_q ) end_ARG , over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT = divide start_ARG 1 + ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_S start_POSTSUBSCRIPT 1 , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT > italic_q ) end_ARG start_ARG | caligraphic_S start_POSTSUBSCRIPT 1 , italic_q end_POSTSUBSCRIPT | ( 1 - italic_q ) end_ARG , (C.1)

where 𝒮i,q=𝒮i{1jm:yijq},i=1,2formulae-sequencesubscript𝒮𝑖𝑞subscript𝒮𝑖conditional-set1𝑗𝑚subscript𝑦𝑖𝑗𝑞𝑖12\mathcal{S}_{i,q}=\mathcal{S}_{i}\cap\{1\leq j\leq m:y_{ij}\leq q\},\ i=1,2caligraphic_S start_POSTSUBSCRIPT italic_i , italic_q end_POSTSUBSCRIPT = caligraphic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ { 1 ≤ italic_j ≤ italic_m : italic_y start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≤ italic_q } , italic_i = 1 , 2. The adaptive procedure with a nominal false discovery rate level q𝑞qitalic_q works as follows.

  • Step 1. Compute π^0(1)superscriptsubscript^𝜋01\hat{\pi}_{0}^{(1)}over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT and π^0(2)superscriptsubscript^𝜋02\hat{\pi}_{0}^{(2)}over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT using (C.1). Let

    R=max[r:j𝒮1,q𝒮2,qI{(y1j,y2j)(rq2|𝒮2,q|π^0(1),rq2|𝒮1,q|π^0(2))}=r],R=\mbox{max}\left[r:\sum_{j\in\mathcal{S}_{1,q}\cap\mathcal{S}_{2,q}}I\left\{(% y_{1j},y_{2j})\leq\left(\frac{rq}{2|\mathcal{S}_{2,q}|\hat{\pi}_{0}^{(1)}},% \frac{rq}{2|\mathcal{S}_{1,q}|\hat{\pi}_{0}^{(2)}}\right)\right\}=r\right],italic_R = max [ italic_r : ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_S start_POSTSUBSCRIPT 1 , italic_q end_POSTSUBSCRIPT ∩ caligraphic_S start_POSTSUBSCRIPT 2 , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I { ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) ≤ ( divide start_ARG italic_r italic_q end_ARG start_ARG 2 | caligraphic_S start_POSTSUBSCRIPT 2 , italic_q end_POSTSUBSCRIPT | over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT end_ARG , divide start_ARG italic_r italic_q end_ARG start_ARG 2 | caligraphic_S start_POSTSUBSCRIPT 1 , italic_q end_POSTSUBSCRIPT | over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT end_ARG ) } = italic_r ] ,
  • Step 2. Reject features with indices in the set

    ={j:(y1j,y2j)(Rq2|𝒮2,q|π^0(1),Rq2|S1,q|π^0(2)),j𝒮1,q𝒮2,q}.conditional-set𝑗formulae-sequencesubscript𝑦1𝑗subscript𝑦2𝑗𝑅𝑞2subscript𝒮2𝑞superscriptsubscript^𝜋01𝑅𝑞2subscript𝑆1𝑞superscriptsubscript^𝜋02𝑗subscript𝒮1𝑞subscript𝒮2𝑞\mathcal{R}=\left\{j:(y_{1j},y_{2j})\leq\left(\frac{Rq}{2|\mathcal{S}_{2,q}|% \hat{\pi}_{0}^{(1)}},\frac{Rq}{2|S_{1,q}|\hat{\pi}_{0}^{(2)}}\right),j\in% \mathcal{S}_{1,q}\cap\mathcal{S}_{2,q}\right\}.caligraphic_R = { italic_j : ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) ≤ ( divide start_ARG italic_R italic_q end_ARG start_ARG 2 | caligraphic_S start_POSTSUBSCRIPT 2 , italic_q end_POSTSUBSCRIPT | over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT end_ARG , divide start_ARG italic_R italic_q end_ARG start_ARG 2 | italic_S start_POSTSUBSCRIPT 1 , italic_q end_POSTSUBSCRIPT | over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT end_ARG ) , italic_j ∈ caligraphic_S start_POSTSUBSCRIPT 1 , italic_q end_POSTSUBSCRIPT ∩ caligraphic_S start_POSTSUBSCRIPT 2 , italic_q end_POSTSUBSCRIPT } .

C.4 The JUMP method

The JUMP method [Lyu et al., 2023] works on the maximum of p𝑝pitalic_p-values across two studies. Define

yjmax=max(y1j,y2j),j=1,,m.formulae-sequencesuperscriptsubscript𝑦𝑗maxsubscript𝑦1𝑗subscript𝑦2𝑗𝑗1𝑚y_{j}^{\max}=\mbox{max}(y_{1j},y_{2j}),j=1,\dots,m.italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT = max ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , italic_j = 1 , … , italic_m .

Let sj=(θ1j,θ2j),j=1,,mformulae-sequencesubscript𝑠𝑗subscript𝜃1𝑗subscript𝜃2𝑗𝑗1𝑚s_{j}=(\theta_{1j},\theta_{2j}),\ j=1,\dots,mitalic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , italic_j = 1 , … , italic_m denote the inferred association status of single nucleotide polymorphisms across two studies. Then sj{(0,0),(0,1),(1,0),(1,1)}subscript𝑠𝑗00011011s_{j}\in\{(0,0),(0,1),(1,0),(1,1)\}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { ( 0 , 0 ) , ( 0 , 1 ) , ( 1 , 0 ) , ( 1 , 1 ) } with (sj=(k,l))=ξklsubscript𝑠𝑗𝑘𝑙subscript𝜉𝑘𝑙\mathbb{P}(s_{j}=(k,l))=\xi_{kl}blackboard_P ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_k , italic_l ) ) = italic_ξ start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT for k,l=0,1formulae-sequence𝑘𝑙01k,l=0,1italic_k , italic_l = 0 , 1 and k,lξkl=1subscript𝑘𝑙subscript𝜉𝑘𝑙1\sum_{k,l}\xi_{kl}=1∑ start_POSTSUBSCRIPT italic_k , italic_l end_POSTSUBSCRIPT italic_ξ start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT = 1. It can be shown that

P(yjmaxtH0j is true)𝑃superscriptsubscript𝑦𝑗conditional𝑡subscript𝐻0𝑗 is true\displaystyle P\left(y_{j}^{\max}\leq t\mid H_{0j}\text{ is true}\right)italic_P ( italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≤ italic_t ∣ italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT is true )
=\displaystyle== ξ00(yjmaxtτj=(0,0))ξ00+ξ01+ξ10+ξ01(yjmaxtτj=(0,1))ξ00+ξ01+ξ10+ξ10(yjmaxtτj=(1,0))ξ00+ξ01+ξ10subscript𝜉00superscriptsubscript𝑦𝑗conditional𝑡subscript𝜏𝑗00subscript𝜉00subscript𝜉01subscript𝜉10subscript𝜉01superscriptsubscript𝑦𝑗conditional𝑡subscript𝜏𝑗01subscript𝜉00subscript𝜉01subscript𝜉10subscript𝜉10superscriptsubscript𝑦𝑗conditional𝑡subscript𝜏𝑗10subscript𝜉00subscript𝜉01subscript𝜉10\displaystyle\frac{\xi_{00}\mathbb{P}(y_{j}^{\max}\leq t\mid\tau_{j}=(0,0))}{% \xi_{00}+\xi_{01}+\xi_{10}}+\frac{\xi_{01}\mathbb{P}(y_{j}^{\max}\leq t\mid% \tau_{j}=(0,1))}{\xi_{00}+\xi_{01}+\xi_{10}}+\frac{\xi_{10}\mathbb{P}(y_{j}^{% \max}\leq t\mid\tau_{j}=(1,0))}{\xi_{00}+\xi_{01}+\xi_{10}}divide start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT blackboard_P ( italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≤ italic_t ∣ italic_τ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( 0 , 0 ) ) end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT blackboard_P ( italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≤ italic_t ∣ italic_τ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( 0 , 1 ) ) end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_ARG + divide start_ARG italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT blackboard_P ( italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≤ italic_t ∣ italic_τ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( 1 , 0 ) ) end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_ARG
\displaystyle\leq ξ00t2+(ξ01+ξ10)tξ00+ξ01+ξ10t,subscript𝜉00superscript𝑡2subscript𝜉01subscript𝜉10𝑡subscript𝜉00subscript𝜉01subscript𝜉10𝑡\displaystyle\frac{\xi_{00}t^{2}+(\xi_{01}+\xi_{10})t}{\xi_{00}+\xi_{01}+\xi_{% 10}}\leq t,divide start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ) italic_t end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_ARG ≤ italic_t ,

which means that yjmaxsuperscriptsubscript𝑦𝑗y_{j}^{\max}italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT follows a super-uniform distribution under the replicability null. Denote

G(t)=ξ00t2+(ξ01+ξ10)tξ00+ξ01+ξ10.𝐺𝑡subscript𝜉00superscript𝑡2subscript𝜉01subscript𝜉10𝑡subscript𝜉00subscript𝜉01subscript𝜉10G(t)=\frac{\xi_{00}t^{2}+(\xi_{01}+\xi_{10})t}{\xi_{00}+\xi_{01}+\xi_{10}}.italic_G ( italic_t ) = divide start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ) italic_t end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT end_ARG .

For a given threshold t(0,1)𝑡01t\in(0,1)italic_t ∈ ( 0 , 1 ), a conservative estimate of the false discovery rate is obtained by

FDR*(t)=m(ξ00+ξ01+ξ10)G(t)j=1mI{yjmaxt}1.superscriptFDR𝑡𝑚subscript𝜉00subscript𝜉01subscript𝜉10𝐺𝑡superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑦𝑗𝑡1\mbox{FDR}^{*}(t)=\frac{m(\xi_{00}+\xi_{01}+\xi_{10})G(t)}{\sum_{j=1}^{m}I\{y_% {j}^{\max}\leq t\}\vee 1}.FDR start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_t ) = divide start_ARG italic_m ( italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ) italic_G ( italic_t ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≤ italic_t } ∨ 1 end_ARG .

Following Storey [2002], Storey et al. [2004], the proportion of null hypotheses in study i𝑖iitalic_i can be estimated by

π^0(i)(λi)=j=1mI{yijλi}m(1λi),i=1,2.formulae-sequencesuperscriptsubscript^𝜋0𝑖subscript𝜆𝑖superscriptsubscript𝑗1𝑚𝐼subscript𝑦𝑖𝑗subscript𝜆𝑖𝑚1subscript𝜆𝑖𝑖12\hat{\pi}_{0}^{(i)}(\lambda_{i})=\frac{\sum_{j=1}^{m}I\{y_{ij}\geq\lambda_{i}% \}}{m(1-\lambda_{i})},\quad i=1,2.over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ( italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { italic_y start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≥ italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } end_ARG start_ARG italic_m ( 1 - italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG , italic_i = 1 , 2 .

Similarly, ξ00subscript𝜉00\xi_{00}italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT is estimated by

ξ^00(λ3)=j=1mI{y1jλ3,y2jλ3}m(1λ3)2,subscript^𝜉00subscript𝜆3superscriptsubscript𝑗1𝑚𝐼formulae-sequencesubscript𝑦1𝑗subscript𝜆3subscript𝑦2𝑗subscript𝜆3𝑚superscript1subscript𝜆32\hat{\xi}_{00}(\lambda_{3})=\frac{\sum_{j=1}^{m}I\{y_{1j}\geq\lambda_{3},y_{2j% }\geq\lambda_{3}\}}{m(1-\lambda_{3})^{2}},over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT ( italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ≥ italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ≥ italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT } end_ARG start_ARG italic_m ( 1 - italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ,

where λ1,λ2subscript𝜆1subscript𝜆2\lambda_{1},\lambda_{2}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and λ3subscript𝜆3\lambda_{3}italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are tuning parameters that can be selected by using the smoothing method provided in Storey and Tibshirani [2003]. Then we have

ξ^01=π^0(1)ξ^00,ξ^10=π^0(2)ξ^00.formulae-sequencesubscript^𝜉01superscriptsubscript^𝜋01subscript^𝜉00subscript^𝜉10superscriptsubscript^𝜋02subscript^𝜉00\hat{\xi}_{01}=\hat{\pi}_{0}^{(1)}-\hat{\xi}_{00},\quad\hat{\xi}_{10}=\hat{\pi% }_{0}^{(2)}-\hat{\xi}_{00}.over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT = over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT - over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT , over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT = over^ start_ARG italic_π end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT - over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT .

With these estimates, we have a plug-in estimate of false discovery rate,

FDR^*(t)=m(ξ^00t2+ξ^01t+ξ^10t)j=1mI{yjmaxt}1.superscript^FDR𝑡𝑚subscript^𝜉00superscript𝑡2subscript^𝜉01𝑡subscript^𝜉10𝑡superscriptsubscript𝑗1𝑚𝐼superscriptsubscript𝑦𝑗𝑡1\widehat{\mbox{FDR}}^{*}(t)=\frac{m(\hat{\xi}_{00}t^{2}+\hat{\xi}_{01}t+\hat{% \xi}_{10}t)}{\sum_{j=1}^{m}I\{y_{j}^{\max}\leq t\}\vee 1}.over^ start_ARG FDR end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_t ) = divide start_ARG italic_m ( over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_t + over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_t ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { italic_y start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≤ italic_t } ∨ 1 end_ARG .

The JUMP method works as follows.

  • Step 1. Let y(1)maxy(m)maxsuperscriptsubscript𝑦1superscriptsubscript𝑦𝑚y_{(1)}^{\max}\leq\dots\leq y_{(m)}^{\max}italic_y start_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ≤ ⋯ ≤ italic_y start_POSTSUBSCRIPT ( italic_m ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT be the ordered maximum of p𝑝pitalic_p-values and denote by H0(j)subscript𝐻0𝑗H_{0(j)}italic_H start_POSTSUBSCRIPT 0 ( italic_j ) end_POSTSUBSCRIPT the corresponding hypothesis;

  • Step 2. Find the largest k𝑘kitalic_k such that the estimated false discovery rate is controlled, where

    k^=max{1km:FDR^*(y(k)max)q};^𝑘:1𝑘𝑚superscript^FDRsuperscriptsubscript𝑦𝑘𝑞\hat{k}=\max\{1\leq k\leq m:\widehat{\mbox{FDR}}^{*}(y_{(k)}^{\max})\leq q\};over^ start_ARG italic_k end_ARG = roman_max { 1 ≤ italic_k ≤ italic_m : over^ start_ARG FDR end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT ( italic_k ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_max end_POSTSUPERSCRIPT ) ≤ italic_q } ;
  • Step 3. Reject H0(j),subscript𝐻0𝑗H_{0(j)},italic_H start_POSTSUBSCRIPT 0 ( italic_j ) end_POSTSUBSCRIPT , j=1,,k^𝑗1^𝑘j=1,\dots,\hat{k}italic_j = 1 , … , over^ start_ARG italic_k end_ARG.

C.5 The STAREG method

Let sj=(θ1j,θ2j),j=1,,mformulae-sequencesubscript𝑠𝑗subscript𝜃1𝑗subscript𝜃2𝑗𝑗1𝑚s_{j}=(\theta_{1j},\theta_{2j}),\ j=1,\dots,mitalic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) , italic_j = 1 , … , italic_m denote the inferred association status of single nucleotide polymorphisms across two studies. Then sj{(0,0),(0,1),(1,0),(1,1)}subscript𝑠𝑗00011011s_{j}\in\{(0,0),(0,1),(1,0),(1,1)\}italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ { ( 0 , 0 ) , ( 0 , 1 ) , ( 1 , 0 ) , ( 1 , 1 ) } with P(sj=(k,l))=ξkl𝑃subscript𝑠𝑗𝑘𝑙subscript𝜉𝑘𝑙P(s_{j}=(k,l))=\xi_{kl}italic_P ( italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_k , italic_l ) ) = italic_ξ start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT for k,l=0,1formulae-sequence𝑘𝑙01k,l=0,1italic_k , italic_l = 0 , 1 and k,lξkl=1subscript𝑘𝑙subscript𝜉𝑘𝑙1\sum_{k,l}\xi_{kl}=1∑ start_POSTSUBSCRIPT italic_k , italic_l end_POSTSUBSCRIPT italic_ξ start_POSTSUBSCRIPT italic_k italic_l end_POSTSUBSCRIPT = 1. Assume a mixture model for p𝑝pitalic_p-values in the two studies. Specifically,

y1jsubscript𝑦1𝑗\displaystyle y_{1j}italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT θ1j(1θ1j)f0+θ1jf1,\displaystyle\mid\theta_{1j}\sim(1-\theta_{1j})f_{0}+\theta_{1j}f_{1},∣ italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ∼ ( 1 - italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ,
y2jsubscript𝑦2𝑗\displaystyle y_{2j}italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT θ2j(1θ2j)f0+θ2jf2,j=1,,m,\displaystyle\mid\theta_{2j}\sim(1-\theta_{2j})f_{0}+\theta_{2j}f_{2},\quad j=% 1,\dots,m,∣ italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ∼ ( 1 - italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_j = 1 , … , italic_m ,

where f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the density function of p𝑝pitalic_p-values under the null, f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT denote the non-null density functions for study 1 and study 2, respectively. Then the local false discovery rate (Lfdr) is defined as the posterior probability of being replicability null given data. We have

Lfdrj(y1j,y2j)subscriptLfdr𝑗subscript𝑦1𝑗subscript𝑦2𝑗\displaystyle\mbox{Lfdr}_{j}(y_{1j},y_{2j})Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) :=1(θ1j=θ2j=1y1j,y2j)assignabsent1subscript𝜃1𝑗subscript𝜃2𝑗conditional1subscript𝑦1𝑗subscript𝑦2𝑗\displaystyle:=1-\mathbb{P}(\theta_{1j}=\theta_{2j}=1\mid y_{1j},y_{2j}):= 1 - blackboard_P ( italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT = 1 ∣ italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT )
=ξ00f0(y1j)f0(y2j)+ξ01f0(y1j)f2(y2j)+ξ10f1(y1j)f0(y2j)ξ00f0(y1j)f0(y2j)+ξ01f0(y1j)f2(y2j)+ξ10f1(y1j)f0(y2j)+ξ11f1(y1j)f2(y2j).absentsubscript𝜉00subscript𝑓0subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript𝜉01subscript𝑓0subscript𝑦1𝑗subscript𝑓2subscript𝑦2𝑗subscript𝜉10subscript𝑓1subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript𝜉00subscript𝑓0subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript𝜉01subscript𝑓0subscript𝑦1𝑗subscript𝑓2subscript𝑦2𝑗subscript𝜉10subscript𝑓1subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript𝜉11subscript𝑓1subscript𝑦1𝑗subscript𝑓2subscript𝑦2𝑗\displaystyle=\frac{\xi_{00}f_{0}(y_{1j})f_{0}(y_{2j})+\xi_{01}f_{0}(y_{1j})f_% {2}(y_{2j})+\xi_{10}f_{1}(y_{1j})f_{0}(y_{2j})}{\xi_{00}f_{0}(y_{1j})f_{0}(y_{% 2j})+\xi_{01}f_{0}(y_{1j})f_{2}(y_{2j})+\xi_{10}f_{1}(y_{1j})f_{0}(y_{2j})+\xi% _{11}f_{1}(y_{1j})f_{2}(y_{2j})}.= divide start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + italic_ξ start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) end_ARG .

Assume the monotone likelihood ratio condition [Sun and Cai, 2007, Cao et al., 2013]:

f1(x)/f0(x) and f2(x)/f0(x) are non-increasing in x.subscript𝑓1𝑥subscript𝑓0𝑥 and subscript𝑓2𝑥subscript𝑓0𝑥 are non-increasing in 𝑥\displaystyle f_{1}(x)/f_{0}(x)\text{ and }f_{2}(x)/f_{0}(x)\text{ are non-% increasing in }x.italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_x ) / italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) and italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_x ) / italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) are non-increasing in italic_x . (C.2)

We have that LfdrjsubscriptLfdr𝑗\mbox{Lfdr}_{j}Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is monotonically non-decreasing in (y1j,y2j)subscript𝑦1𝑗subscript𝑦2𝑗(y_{1j},y_{2j})( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ). The rejection rule based on LfdrjsubscriptLfdr𝑗\mbox{Lfdr}_{j}Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT to test the replicability null is δj=I{Lfdrjλ}subscript𝛿𝑗𝐼subscriptLfdr𝑗𝜆\delta_{j}=I\{\mbox{Lfdr}_{j}\leq\lambda\}italic_δ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_I { Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ }, where λ𝜆\lambdaitalic_λ is a threshold to be determined. We write the total number of discoveries as R(λ)=j=1mI{Lfdrjλ}𝑅𝜆superscriptsubscript𝑗1𝑚𝐼subscriptLfdr𝑗𝜆R(\lambda)=\sum_{j=1}^{m}I\{\mbox{Lfdr}_{j}\leq\lambda\}italic_R ( italic_λ ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ }, and the number of false discoveries as V(λ)=j=1mI{Lfdrjλ}(1θ1jθ2j)𝑉𝜆superscriptsubscript𝑗1𝑚𝐼subscriptLfdr𝑗𝜆1subscript𝜃1𝑗subscript𝜃2𝑗V(\lambda)=\sum_{j=1}^{m}I\{\mbox{Lfdr}_{j}\leq\lambda\}(1-\theta_{1j}\theta_{% 2j})italic_V ( italic_λ ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ } ( 1 - italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ). In the oracle case that we know (ξ00,ξ01,ξ10,ξ11,f1,f2)subscript𝜉00subscript𝜉01subscript𝜉10subscript𝜉11subscript𝑓1subscript𝑓2(\xi_{00},\xi_{01},\xi_{10},\xi_{11},f_{1},f_{2})( italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), define

λm=sup{λ[0,1]:j=1mLfdrjI{Lfdrjλ}j=1mI{Lfdrjλ}q}.subscript𝜆𝑚supremumconditional-set𝜆01superscriptsubscript𝑗1𝑚subscriptLfdr𝑗𝐼subscriptLfdr𝑗𝜆superscriptsubscript𝑗1𝑚𝐼subscriptLfdr𝑗𝜆𝑞\lambda_{m}=\sup\left\{\lambda\in[0,1]:\frac{\sum_{j=1}^{m}\mbox{Lfdr}_{j}I\{% \mbox{Lfdr}_{j}\leq\lambda\}}{\sum_{j=1}^{m}I\{\mbox{Lfdr}_{j}\leq\lambda\}}% \leq q\right\}.italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = roman_sup { italic_λ ∈ [ 0 , 1 ] : divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_I { Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ } end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ } end_ARG ≤ italic_q } .

Reject H0jsubscript𝐻0𝑗H_{0j}italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT if LfdrjλmsubscriptLfdr𝑗subscript𝜆𝑚\mbox{Lfdr}_{j}\leq\lambda_{m}Lfdr start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT. Then the FDR is asymptotically controlled at level q𝑞qitalic_q.

Assume f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT follows a standard uniform distribution. Let 𝒚1={y1j}j=1msubscript𝒚1superscriptsubscriptsubscript𝑦1𝑗𝑗1𝑚\bm{y}_{1}=\{y_{1j}\}_{j=1}^{m}bold_italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and 𝒚2={y2j}j=1msubscript𝒚2superscriptsubscriptsubscript𝑦2𝑗𝑗1𝑚\bm{y}_{2}=\{y_{2j}\}_{j=1}^{m}bold_italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT denote p𝑝pitalic_p-values from study 1 and study 2, respectively. Denote 𝜽1={θ1j}j=1msubscript𝜽1superscriptsubscriptsubscript𝜃1𝑗𝑗1𝑚\bm{\theta}_{1}=\{\theta_{1j}\}_{j=1}^{m}bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and 𝜽2={θ2j}j=1m.subscript𝜽2superscriptsubscriptsubscript𝜃2𝑗𝑗1𝑚\bm{\theta}_{2}=\{\theta_{2j}\}_{j=1}^{m}.bold_italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT . The unknown parameters and functions are estimated by maximizing the following log-likelihood function

l(𝒚1,𝒚2,𝜽1,𝜽2)=𝑙subscript𝒚1subscript𝒚2subscript𝜽1subscript𝜽2absent\displaystyle l(\bm{y}_{1},\bm{y}_{2},\bm{\theta}_{1},\bm{\theta}_{2})=italic_l ( bold_italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = j=1m[log{(1θ1j)f0(y1j)+θ1jf1(y1j)}+log{(1θ2j)f0(y2j)+θ2jf2(y2j)}\displaystyle\sum_{j=1}^{m}\big{[}\log\{(1-\theta_{1j})f_{0}(y_{1j})+\theta_{1% j}f_{1}(y_{1j})\}+\log\{(1-\theta_{2j})f_{0}(y_{2j})+\theta_{2j}f_{2}(y_{2j})\}∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT [ roman_log { ( 1 - italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) + italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) } + roman_log { ( 1 - italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) }
+θ1j(1θ2j)logξ10+(1θ1j)θ2jlogξ01+(1θ1j)(1θ2j)logξ00subscript𝜃1𝑗1subscript𝜃2𝑗subscript𝜉101subscript𝜃1𝑗subscript𝜃2𝑗subscript𝜉011subscript𝜃1𝑗1subscript𝜃2𝑗subscript𝜉00\displaystyle+\theta_{1j}(1-\theta_{2j})\log\xi_{10}+(1-\theta_{1j})\theta_{2j% }\log\xi_{01}+(1-\theta_{1j})(1-\theta_{2j})\log\xi_{00}+ italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) roman_log italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT + ( 1 - italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT roman_log italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT + ( 1 - italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) ( 1 - italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) roman_log italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT
+θ1jθ2jlogξ11],\displaystyle+\theta_{1j}\theta_{2j}\log\xi_{11}\big{]},+ italic_θ start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT roman_log italic_ξ start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT ] ,

where 𝜽1subscript𝜽1\bm{\theta}_{1}bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and 𝜽2subscript𝜽2\bm{\theta}_{2}bold_italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are latent variables. For scalable computation, we utilize EM algorithm [Dempster et al., 1977] in combination of pool-adjacent-violator-algorithm [Robertson et al., 1988] to efficiently estimate the unknowns (ξ00,ξ01,ξ10,ξ11,f1,f2)subscript𝜉00subscript𝜉01subscript𝜉10subscript𝜉11subscript𝑓1subscript𝑓2(\xi_{00},\xi_{01},\xi_{10},\xi_{11},f_{1},f_{2})( italic_ξ start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) incorporating the monotonic constraint (C.2) for f1subscript𝑓1f_{1}italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and f2subscript𝑓2f_{2}italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. With the estimates (ξ^00,ξ^01,ξ^10,ξ^11,f^1,f^2)subscript^𝜉00subscript^𝜉01subscript^𝜉10subscript^𝜉11subscript^𝑓1subscript^𝑓2(\hat{\xi}_{00},\hat{\xi}_{01},\hat{\xi}_{10},\hat{\xi}_{11},\hat{f}_{1},\hat{% f}_{2})( over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT , over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT , over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT , over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), we obtain the estimated Lfdr as follows.

Lfdr^j=ξ^00f0(y1j)f0(y2j)+ξ^01f0(y1j)f^2(y2j)+ξ^10f^1(y1j)f0(y2j)ξ^00f0(y1j)f0(y2j)+ξ^01f0(y1j)f^2(y2j)+ξ^10f^1(y1j)f0(y2j)+ξ^11f^1(y1j)f^2(y2j).subscript^Lfdr𝑗subscript^𝜉00subscript𝑓0subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript^𝜉01subscript𝑓0subscript𝑦1𝑗subscript^𝑓2subscript𝑦2𝑗subscript^𝜉10subscript^𝑓1subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript^𝜉00subscript𝑓0subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript^𝜉01subscript𝑓0subscript𝑦1𝑗subscript^𝑓2subscript𝑦2𝑗subscript^𝜉10subscript^𝑓1subscript𝑦1𝑗subscript𝑓0subscript𝑦2𝑗subscript^𝜉11subscript^𝑓1subscript𝑦1𝑗subscript^𝑓2subscript𝑦2𝑗\widehat{\text{Lfdr}}_{j}=\frac{\hat{\xi}_{00}f_{0}(y_{1j})f_{0}(y_{2j})+\hat{% \xi}_{01}f_{0}(y_{1j})\hat{f}_{2}(y_{2j})+\hat{\xi}_{10}\hat{f}_{1}(y_{1j})f_{% 0}(y_{2j})}{\hat{\xi}_{00}f_{0}(y_{1j})f_{0}(y_{2j})+\hat{\xi}_{01}f_{0}(y_{1j% })\hat{f}_{2}(y_{2j})+\hat{\xi}_{10}\hat{f}_{1}(y_{1j})f_{0}(y_{2j})+\hat{\xi}% _{11}\hat{f}_{1}(y_{1j})\hat{f}_{2}(y_{2j})}.over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = divide start_ARG over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) end_ARG start_ARG over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 00 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 01 end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) + over^ start_ARG italic_ξ end_ARG start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 1 italic_j end_POSTSUBSCRIPT ) over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT 2 italic_j end_POSTSUBSCRIPT ) end_ARG .

An estimate of λmsubscript𝜆𝑚\lambda_{m}italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is

λ^m=sup{λ[0,1]:j=1mLfdr^jI{Lfdr^jλ}j=1mI{Lfdr^jλ}q}.subscript^𝜆𝑚supremumconditional-set𝜆01superscriptsubscript𝑗1𝑚subscript^Lfdr𝑗𝐼subscript^Lfdr𝑗𝜆superscriptsubscript𝑗1𝑚𝐼subscript^Lfdr𝑗𝜆𝑞\hat{\lambda}_{m}=\sup\left\{\lambda\in[0,1]:\frac{\sum_{j=1}^{m}\widehat{% \mbox{Lfdr}}_{j}I\{\widehat{\mbox{Lfdr}}_{j}\leq\lambda\}}{\sum_{j=1}^{m}I\{% \widehat{\mbox{Lfdr}}_{j}\leq\lambda\}}\leq q\right\}.over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = roman_sup { italic_λ ∈ [ 0 , 1 ] : divide start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_I { over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ } end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_I { over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_λ } end_ARG ≤ italic_q } .

The replicability null hypothesis H0jsubscript𝐻0𝑗H_{0j}italic_H start_POSTSUBSCRIPT 0 italic_j end_POSTSUBSCRIPT is rejected if Lfdr^jλ^m.subscript^Lfdr𝑗subscript^𝜆𝑚\widehat{\mbox{Lfdr}}_{j}\leq\hat{\lambda}_{m}.over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT . This is equivalent to the step-up procedure [Sun and Cai, 2007]: let Lfdr^(1)Lfdr^(m)subscript^Lfdr1subscript^Lfdr𝑚\widehat{\mbox{Lfdr}}_{(1)}\leq\ldots\leq\widehat{\mbox{Lfdr}}_{(m)}over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT ≤ … ≤ over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT ( italic_m ) end_POSTSUBSCRIPT be the order statistics of {Lfdr^j}j=1msuperscriptsubscriptsubscript^Lfdr𝑗𝑗1𝑚\{\widehat{\mbox{Lfdr}}_{j}\}_{j=1}^{m}{ over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and denote by H0(1),,H0(m)subscript𝐻01subscript𝐻0𝑚H_{0(1)},\ldots,H_{0(m)}italic_H start_POSTSUBSCRIPT 0 ( 1 ) end_POSTSUBSCRIPT , … , italic_H start_POSTSUBSCRIPT 0 ( italic_m ) end_POSTSUBSCRIPT the corresponding ordered hypotheses, the procedure works as follows.

Find k^:=max{1km:1kj=1kLfdr^(j)q},andassign^𝑘:1𝑘𝑚1𝑘superscriptsubscript𝑗1𝑘subscript^Lfdr𝑗𝑞and\displaystyle\hat{k}:=\max\left\{1\leq k\leq m:\frac{1}{k}\sum_{j=1}^{k}% \widehat{\mbox{Lfdr}}_{(j)}\leq q\right\},\ \mbox{and}over^ start_ARG italic_k end_ARG := roman_max { 1 ≤ italic_k ≤ italic_m : divide start_ARG 1 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT over^ start_ARG Lfdr end_ARG start_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT ≤ italic_q } , and
rejectH0(j),j=1,,k^.formulae-sequencerejectsubscript𝐻0𝑗𝑗1^𝑘\displaystyle\mbox{reject}\ H_{0(j)},\ \ j=1,\ldots,\hat{k}.reject italic_H start_POSTSUBSCRIPT 0 ( italic_j ) end_POSTSUBSCRIPT , italic_j = 1 , … , over^ start_ARG italic_k end_ARG .