\tnotetext

[1]This document is the result of the research project funded by the National Science Foundation [DMS 2311005].

[ orcid=0009-0009-3331-6523]

\credit

Conceptualization, Formal Analysis, Methodology, Software, Writing - original draft and revision

1]organization=Texas A&M University, addressline=TAMU, postcode=77843, postcodesep=, city=College Station, country=USA

[orcid=0000-0003-1672-3118] \creditMethodology, Software

[orcid=0000-0002-8659-4772] \cormark[1]

\credit

Conceptualization, Formal Analysis, Methodology, Software, Writing - original draft and revision, Supervision

Bayes Factors Based on Test Statistics and Non-Local Moment Prior Densities

Saptati Datta [    Rachael Shudde    Valen E. Johnson [email protected]
Abstract

We describe Bayes factors based on z, t, χ2superscript𝜒2\chi^{2}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, and F statistics when non-local moment prior distributions are used to define alternative hypotheses. The non-local alternative prior distributions are centered on standardized effects. The prior densities include a dispersion parameter that can be used to model prior precision and the variation of effect sizes across replicated experiments. We examine the convergence rates of Bayes factors under true null and true alternative hypotheses and show how these Bayes factors can be used to construct Bayes factor functions. An example illustrates the application of resulting Bayes factors to psychological experiments.

keywords:
Bayes factor function \sepNon-local prior density \sepNormal-moment density \sepReplicated Design

1 Introduction

Bayes factors based on test statistics and first-order non-local prior densities were used in Johnson et al. (2023) (hereafter J23) to define Bayes factor functions (BFFs). The first-order non-local prior densities used to define those Bayes factors contained a single scale parameter that determined the mode of the non-local prior densities used to define the alternative hypotheses (Johnson and Rossell, 2010). BFFs were defined as the mapping of these prior modes to Bayes factors. The use of first-order moment prior densities allowed J23 to obtain closed-form expressions for Bayes factors based on common test statistics, including z𝑧zitalic_z, t𝑡titalic_t, F𝐹Fitalic_F, and χ2superscript𝜒2\chi^{2}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT statistics.

In this article we extend the results of J23 by providing closed-form expressions for Bayes factors based on test statistics (BFBOTs) (Johnson, 2005) and alternative hypotheses defined using non-local moment prior densities of arbitrary order. This extension enables the incorporation of subjective prior knowledge regarding the precision of prior estimates of non-null effect sizes. Moreover, it provides a potential mechanism for modeling variation in effect sizes across replicated experiments.

2 BFBOTs and moment prior alternatives

J23 defined Bayes factors using two categories of prior densities: normal-moment prior densities and gamma prior densities. A normal-moment density on a parameter λ𝜆\lambdaitalic_λ, given hyperparameters (τ2,r)superscript𝜏2𝑟(\tau^{2},r)( italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ), can be expressed as

j(λτ2,r)𝑗conditional𝜆superscript𝜏2𝑟\displaystyle j(\lambda\mid\tau^{2},r)italic_j ( italic_λ ∣ italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) =\displaystyle== (λ2)r(2τ2)r+12Γ(r+12)exp(λ22τ2),superscriptsuperscript𝜆2𝑟superscript2superscript𝜏2𝑟12Γ𝑟12superscript𝜆22superscript𝜏2\displaystyle\frac{(\lambda^{2})^{r}}{(2\tau^{2})^{r+\frac{1}{2}}\Gamma\left(r% +\frac{1}{2}\right)}\exp\left(-\frac{\lambda^{2}}{2\tau^{2}}\right),divide start_ARG ( italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT end_ARG start_ARG ( 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT roman_Γ ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) end_ARG roman_exp ( - divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) , (1)
<λ<,τ,r>0.formulae-sequence𝜆𝜏𝑟0\displaystyle-\infty<\lambda<\infty,\quad\tau,r>0.- ∞ < italic_λ < ∞ , italic_τ , italic_r > 0 .

J23 derived Bayes factors based on t𝑡titalic_t and z𝑧zitalic_z statistics by imposing normal-moment prior densities with r=1𝑟1r=1italic_r = 1 on the non-centrality parameters of the test statistics under the alternative hypothesis. We extend these results for r0𝑟0r\geq 0italic_r ≥ 0.

BFBOTs for χk2subscriptsuperscript𝜒2𝑘\chi^{2}_{k}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and Fk,msubscript𝐹𝑘𝑚F_{k,m}italic_F start_POSTSUBSCRIPT italic_k , italic_m end_POSTSUBSCRIPT test statistics were defined by assuming the non-centrality parameter λ𝜆\lambdaitalic_λ for each distribution under the alternative hypothesis followed gamma distributions with parameters k/2+1𝑘21k/2+1italic_k / 2 + 1 and 1/(2τ2)12superscript𝜏21/(2\tau^{2})1 / ( 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), i.e., G[k/2+1,1/(2τ2)]𝐺𝑘2112superscript𝜏2G[k/2+1,1/(2\tau^{2})]italic_G [ italic_k / 2 + 1 , 1 / ( 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ]. We generalize these results by extending the class of gamma priors to include G[k/2+r,1/(2τ2)]𝐺𝑘2𝑟12superscript𝜏2G[k/2+r,1/(2\tau^{2})]italic_G [ italic_k / 2 + italic_r , 1 / ( 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ] distributions for r,τ>0𝑟𝜏0r,\tau>0italic_r , italic_τ > 0.

For one-sided tests with positive non-centrality parameters, the prior densities j+(μτ2,r)superscript𝑗conditional𝜇superscript𝜏2𝑟j^{+}(\mu\mid\tau^{2},r)italic_j start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_μ ∣ italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) and j(μτ2,r)superscript𝑗conditional𝜇superscript𝜏2𝑟j^{-}(\mu\mid\tau^{2},r)italic_j start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT ( italic_μ ∣ italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) are defined similarly to the expression in (1), but they are constrained to the positive or negative real line, respectively. This constraint depends on whether the alternative hypothesis necessitates that λ>0𝜆0\lambda>0italic_λ > 0 or λ<0𝜆0\lambda<0italic_λ < 0.

For simplicity we adopt the notation used in J23 and define

  • N(a,b)𝑁𝑎𝑏N(a,b)italic_N ( italic_a , italic_b ) as a normal distribution with mean a𝑎aitalic_a and variance b𝑏bitalic_b.

  • Tν(λ)subscript𝑇𝜈𝜆T_{\nu}(\lambda)italic_T start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( italic_λ ) as a t𝑡titalic_t distribution with ν𝜈\nuitalic_ν degrees of freedom and non-centrality parameter λ𝜆\lambdaitalic_λ.

  • χν2(λ)subscriptsuperscript𝜒2𝜈𝜆\chi^{2}_{\nu}(\lambda)italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( italic_λ ) as a χ2superscript𝜒2\chi^{2}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT distribution with ν𝜈\nuitalic_ν degrees of freedom and non-centrality parameter λ𝜆\lambdaitalic_λ.

  • Fk,m(λ)subscript𝐹𝑘𝑚𝜆F_{k,m}(\lambda)italic_F start_POSTSUBSCRIPT italic_k , italic_m end_POSTSUBSCRIPT ( italic_λ ) as an F𝐹Fitalic_F distribution with (k,m)𝑘𝑚(k,m)( italic_k , italic_m ) degrees of freedom and non-centrality parameter λ𝜆\lambdaitalic_λ.

  • G(α,λ)𝐺𝛼𝜆G(\alpha,\lambda)italic_G ( italic_α , italic_λ ) as a gamma distribution with shape parameter α𝛼\alphaitalic_α and rate parameter λ𝜆\lambdaitalic_λ.

  • J(τ2,r)𝐽superscript𝜏2𝑟J(\tau^{2},r)italic_J ( italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) as a normal-moment distribution of order r𝑟ritalic_r and rate parameter τ2superscript𝜏2\tau^{2}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and density given in (1).

In addition, we let F11(a,b;z)subscriptsubscript𝐹11𝑎𝑏𝑧{{}_{1}}F_{1}(a,b;z)start_FLOATSUBSCRIPT 1 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_a , italic_b ; italic_z ) denote the confluent hypergeometric function and F12(a,b,c;z)subscriptsubscript𝐹12𝑎𝑏𝑐𝑧{{}_{2}}F_{1}(a,b,c;z)start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_a , italic_b , italic_c ; italic_z ) the Gaussian hypergeometric function.

Table 1: Summary of theorems describing Bayes factors based on test statistics. The constants a𝑎aitalic_a, b𝑏bitalic_b and c𝑐citalic_c are defined as a=(1+τ2)(r+12)𝑎superscript1superscript𝜏2𝑟12a=\left(1+\tau^{2}\right)^{-\left(r+{\frac{1}{2}}\right)}italic_a = ( 1 + italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT, b=(1+τ2)(k2+r)𝑏superscript1superscript𝜏2𝑘2𝑟b=\left(1+\tau^{2}\right)^{-\left(\frac{k}{2}+r\right)}italic_b = ( 1 + italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r ) end_POSTSUPERSCRIPT, and c=[Γ(ν2+1)Γ(r+1)]/[Γ(ν+12)Γ(r+12)]𝑐delimited-[]Γ𝜈21Γ𝑟1delimited-[]Γ𝜈12Γ𝑟12c=\left[\Gamma(\frac{\nu}{2}+1)\,\Gamma(r+1)\right]/\left[\Gamma(\frac{\nu+1}{% 2})\,{\Gamma(r+{\frac{1}{2}})}\right]italic_c = [ roman_Γ ( divide start_ARG italic_ν end_ARG start_ARG 2 end_ARG + 1 ) roman_Γ ( italic_r + 1 ) ] / [ roman_Γ ( divide start_ARG italic_ν + 1 end_ARG start_ARG 2 end_ARG ) roman_Γ ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) ].
Test H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT Prior BF10𝐵subscript𝐹10BF_{10}italic_B italic_F start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT
Two-sided z zN(0,1)similar-to𝑧𝑁01z\sim N(0,1)italic_z ∼ italic_N ( 0 , 1 ) zN(λ,1)similar-to𝑧𝑁𝜆1z\sim N(\lambda,1)italic_z ∼ italic_N ( italic_λ , 1 ) J(λ|τ2,r)𝐽conditional𝜆superscript𝜏2𝑟J(\lambda{\,|\,}{\tau^{2}},r)italic_J ( italic_λ | italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) aF1(r+12,12;τ2z22(1+τ2))𝑎subscript𝐹1𝑟1212superscript𝜏2superscript𝑧221superscript𝜏2a\,F_{1}\left(r+{\frac{1}{2}},{\frac{1}{2}};\frac{\tau^{2}z^{2}}{2(1+\tau^{2})% }\right)italic_a italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ; divide start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_z start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 ( 1 + italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG )
One-sided z zN(0,1)similar-to𝑧𝑁01z\sim N(0,1)italic_z ∼ italic_N ( 0 , 1 ) zN(λ,1)similar-to𝑧𝑁𝜆1z\sim N(\lambda,1)italic_z ∼ italic_N ( italic_λ , 1 ) J+(λ|τ2,r)superscript𝐽conditional𝜆superscript𝜏2𝑟J^{+}(\lambda{\,|\,}{\tau^{2}},r)italic_J start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_λ | italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) a[F11(r+12,12;y2)+2yΓ(r+1)Γ(r+12)F11(r+12,32;y2)]𝑎delimited-[]subscriptsubscript𝐹11𝑟1212superscript𝑦22𝑦Γ𝑟1Γ𝑟12subscriptsubscript𝐹11𝑟1232superscript𝑦2a\,\left[{{}_{1}}F_{1}\left(r+{\frac{1}{2}},{\frac{1}{2}};y^{2}\right)+2y\frac% {\Gamma(r+1)}{\Gamma(r+{\frac{1}{2}})}{{}_{1}}F_{1}\left(r+{\frac{1}{2}},\frac% {3}{2};y^{2}\right)\right]italic_a [ start_FLOATSUBSCRIPT 1 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ; italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + 2 italic_y divide start_ARG roman_Γ ( italic_r + 1 ) end_ARG start_ARG roman_Γ ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) end_ARG start_FLOATSUBSCRIPT 1 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 3 end_ARG start_ARG 2 end_ARG ; italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ],
with y=τz(2+2τ2)12𝑦𝜏𝑧superscript22superscript𝜏212y=\tau z(2+2\tau^{2})^{-{\frac{1}{2}}}italic_y = italic_τ italic_z ( 2 + 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT
Two-sided t𝑡titalic_t tTν(0,1)similar-to𝑡subscript𝑇𝜈01t\sim T_{\nu}(0,1)italic_t ∼ italic_T start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( 0 , 1 ) tTν(λ,1)similar-to𝑡subscript𝑇𝜈𝜆1t\sim T_{\nu}(\lambda,1)italic_t ∼ italic_T start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( italic_λ , 1 ) J(λ|τ2,r)𝐽conditional𝜆superscript𝜏2𝑟J(\lambda{\,|\,}{\tau^{2}},r)italic_J ( italic_λ | italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) a[F12(ν+12,r+12,12;y2)+cyF12(ν2+1,r+1,32;y2)]𝑎delimited-[]subscriptsubscript𝐹12𝜈12𝑟1212superscript𝑦2𝑐𝑦subscriptsubscript𝐹12𝜈21𝑟132superscript𝑦2a\,\left[{{}_{2}}F_{1}\left(\frac{\nu+1}{2},r+{\frac{1}{2}},{\frac{1}{2}};y^{2% }\right)+cy\ \,{{}_{2}}F_{1}\left(\frac{\nu}{2}+1,r+1,\frac{3}{2};y^{2}\right)\right]italic_a [ start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( divide start_ARG italic_ν + 1 end_ARG start_ARG 2 end_ARG , italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ; italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + italic_c italic_y start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( divide start_ARG italic_ν end_ARG start_ARG 2 end_ARG + 1 , italic_r + 1 , divide start_ARG 3 end_ARG start_ARG 2 end_ARG ; italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ]
with y=τt[(ν+t2)(1+τ2)]12𝑦𝜏𝑡superscriptdelimited-[]𝜈superscript𝑡21superscript𝜏212y=\tau t\,[(\nu+t^{2})(1+\tau^{2})]^{-{\frac{1}{2}}}italic_y = italic_τ italic_t [ ( italic_ν + italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ( 1 + italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ] start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT
One-sided t𝑡titalic_t tTν(0,1)similar-to𝑡subscript𝑇𝜈01t\sim T_{\nu}(0,1)italic_t ∼ italic_T start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( 0 , 1 ) tTν(λ,1)similar-to𝑡subscript𝑇𝜈𝜆1t\sim T_{\nu}(\lambda,1)italic_t ∼ italic_T start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( italic_λ , 1 ) J+(λ|τ2,r)superscript𝐽conditional𝜆superscript𝜏2𝑟J^{+}(\lambda{\,|\,}{\tau^{2}},r)italic_J start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_λ | italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) a[F12(ν+12,r+12,12;y2)+2cyF12(ν2+1,r+1,32;y2)]𝑎delimited-[]subscriptsubscript𝐹12𝜈12𝑟1212superscript𝑦22𝑐𝑦subscriptsubscript𝐹12𝜈21𝑟132superscript𝑦2a\,\left[{{}_{2}}F_{1}\left(\frac{\nu+1}{2},r+{\frac{1}{2}},{\frac{1}{2}};y^{2% }\right)+2cy\ \,{{}_{2}}F_{1}\left(\frac{\nu}{2}+1,r+1,\frac{3}{2};y^{2}\right% )\right]italic_a [ start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( divide start_ARG italic_ν + 1 end_ARG start_ARG 2 end_ARG , italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ; italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + 2 italic_c italic_y start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( divide start_ARG italic_ν end_ARG start_ARG 2 end_ARG + 1 , italic_r + 1 , divide start_ARG 3 end_ARG start_ARG 2 end_ARG ; italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ]
with y=τt[(ν+t2)(1+τ2)]12𝑦𝜏𝑡superscriptdelimited-[]𝜈superscript𝑡21superscript𝜏212y=\tau t\,[(\nu+t^{2})(1+\tau^{2})]^{-{\frac{1}{2}}}italic_y = italic_τ italic_t [ ( italic_ν + italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ( 1 + italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ] start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT
χk2subscriptsuperscript𝜒2𝑘\chi^{2}_{k}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT hχk2(0)similar-tosubscriptsuperscript𝜒2𝑘0h\sim\chi^{2}_{k}(0)italic_h ∼ italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( 0 ) hχk2(λ)similar-tosuperscriptsubscript𝜒𝑘2𝜆h\sim\chi_{k}^{2}(\lambda)italic_h ∼ italic_χ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_λ ) G(λ|k2+r,12τ2)𝐺conditional𝜆𝑘2𝑟12superscript𝜏2G\left(\lambda{\,|\,}\frac{k}{2}+r,\frac{1}{2\tau^{2}}\right)italic_G ( italic_λ | divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r , divide start_ARG 1 end_ARG start_ARG 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) bF11(k2+r,k2;τ2h2(1+τ2))𝑏subscriptsubscript𝐹11𝑘2𝑟𝑘2superscript𝜏221superscript𝜏2b\ \,{{}_{1}}F_{1}\left(\frac{k}{2}+r,\frac{k}{2};\frac{\tau^{2}h}{2(1+\tau^{2% })}\right)italic_b start_FLOATSUBSCRIPT 1 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r , divide start_ARG italic_k end_ARG start_ARG 2 end_ARG ; divide start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_h end_ARG start_ARG 2 ( 1 + italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG )
Fk,msubscript𝐹𝑘𝑚F_{k,m}italic_F start_POSTSUBSCRIPT italic_k , italic_m end_POSTSUBSCRIPT fFk,m(0)similar-to𝑓subscript𝐹𝑘𝑚0f\sim F_{k,m}(0)italic_f ∼ italic_F start_POSTSUBSCRIPT italic_k , italic_m end_POSTSUBSCRIPT ( 0 ) fFk,m(λ)similar-to𝑓subscript𝐹𝑘𝑚𝜆f\sim F_{k,m}(\lambda)italic_f ∼ italic_F start_POSTSUBSCRIPT italic_k , italic_m end_POSTSUBSCRIPT ( italic_λ ) G(λ|k2+r,12τ2)𝐺conditional𝜆𝑘2𝑟12superscript𝜏2G\left(\lambda{\,|\,}\frac{k}{2}+r,\frac{1}{2\tau^{2}}\right)italic_G ( italic_λ | divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r , divide start_ARG 1 end_ARG start_ARG 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) bF12(k2+r,k+m2,k2;kτ2f(1+τ2)(m+kf))𝑏subscriptsubscript𝐹12𝑘2𝑟𝑘𝑚2𝑘2𝑘superscript𝜏2𝑓1superscript𝜏2𝑚𝑘𝑓b\ \,{{}_{2}}F_{1}\left(\frac{k}{2}+r,\frac{k+m}{2},\frac{k}{2};\frac{k\tau^{2% }f}{(1+\tau^{2})(m+kf)}\right)italic_b start_FLOATSUBSCRIPT 2 end_FLOATSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r , divide start_ARG italic_k + italic_m end_ARG start_ARG 2 end_ARG , divide start_ARG italic_k end_ARG start_ARG 2 end_ARG ; divide start_ARG italic_k italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_f end_ARG start_ARG ( 1 + italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ( italic_m + italic_k italic_f ) end_ARG )

Using this notation, the main results of this article are provided in Table 1, which provides explicit expressions for Bayes factors based on z𝑧zitalic_z, t𝑡titalic_t, χ2superscript𝜒2\chi^{2}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and F𝐹Fitalic_F tests. These expressions are justified by theorems provided in the Supplemental Material.

In common applications of these tests, the convergence rates for the resulting Bayes factors may be summarized as follows.

  1. 1.

    Under true alternative hypotheses (i.e., λ0𝜆0\lambda\neq 0italic_λ ≠ 0), Bayes factors in favor of null hypotheses decrease exponentially fast to 0 as the sample size n𝑛n\rightarrow\inftyitalic_n → ∞.

  2. 2.

    Under true null hypotheses and certain regularity conditions, Bayes factors in favor of alternative hypotheses often decrease at rate Op(nr12)subscript𝑂𝑝superscript𝑛𝑟12O_{p}(n^{-r-{\frac{1}{2}}})italic_O start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_n start_POSTSUPERSCRIPT - italic_r - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) for z𝑧zitalic_z and t𝑡titalic_t tests and Op(nrk2)subscript𝑂𝑝superscript𝑛𝑟𝑘2O_{p}(n^{-r-\frac{k}{2}})italic_O start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_n start_POSTSUPERSCRIPT - italic_r - divide start_ARG italic_k end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) for χk2superscriptsubscript𝜒𝑘2\chi_{k}^{2}italic_χ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and Fk,msubscript𝐹𝑘𝑚F_{k,m}italic_F start_POSTSUBSCRIPT italic_k , italic_m end_POSTSUBSCRIPT tests.

Sufficient conditions for achieving these rates are provided in the Supplemental Material.

The improved convergence rates obtained for r>1𝑟1r>1italic_r > 1 provide a partial motivation for generalizing the class of prior densities considered in J23. The non-local alternative prior densities proposed there correspond to setting r=1𝑟1r=1italic_r = 1 in Table 1. The improvement in convergence rates for true null hypotheses can be attributed to the more rapid descent of the non-local prior densities to 0 as the non-centrality parameters converge to 00.

3 Selection of prior hyperparameters

We now describe strategies for specifying (τ2,r)superscript𝜏2𝑟(\tau^{2},r)( italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) to define either a Bayes factor or to construct a BFF. We assume that the non-centrality parameter λ𝜆\lambdaitalic_λ of the test statistic can be expressed as a function of ω𝜔\omegaitalic_ω, a standardized effect. That is, λ=ψ(ω)𝜆𝜓𝜔\lambda=\psi(\omega)italic_λ = italic_ψ ( italic_ω ). For example, in a z𝑧zitalic_z test that a normal mean μ𝜇\muitalic_μ is 0, the non-centrality parameter satisfies λ=ψ(ω)=nω𝜆𝜓𝜔𝑛𝜔\lambda=\psi(\omega)=\sqrt{n}\omegaitalic_λ = italic_ψ ( italic_ω ) = square-root start_ARG italic_n end_ARG italic_ω, where ω=μ/σ𝜔𝜇𝜎\omega=\mu/\sigmaitalic_ω = italic_μ / italic_σ and n𝑛nitalic_n is the sample size.

Our recommendation for setting (τ2,r)superscript𝜏2𝑟(\tau^{2},r)( italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) is to fix τ2superscript𝜏2\tau^{2}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT conditionally on r𝑟ritalic_r, and then set τ2=τω,r2superscript𝜏2subscriptsuperscript𝜏2𝜔𝑟\tau^{2}=\tau^{2}_{\omega,r}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω , italic_r end_POSTSUBSCRIPT so that the prior mode on the non-centrality parameter equals ψ(ω)𝜓superscript𝜔\psi(\omega^{*})italic_ψ ( italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), where ωsuperscript𝜔\omega^{*}italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT represents a hypothesized value of ω𝜔\omegaitalic_ω. That is, we define τω,r2superscriptsubscript𝜏superscript𝜔𝑟2\tau_{\omega^{*},r}^{2}italic_τ start_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT such that

ψ(ω)=argmax𝜆π(λ|τω,r2,r).𝜓superscript𝜔𝜆argmax𝜋conditional𝜆subscriptsuperscript𝜏2superscript𝜔𝑟𝑟\psi(\omega^{*})=\underset{\lambda}{\operatorname{argmax}}\ \pi(\lambda|\tau^{% 2}_{\omega^{*},r},r).italic_ψ ( italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = underitalic_λ start_ARG roman_argmax end_ARG italic_π ( italic_λ | italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_r end_POSTSUBSCRIPT , italic_r ) . (2)

Here, π𝜋\piitalic_π denotes the prior density on λ𝜆\lambdaitalic_λ under the alternative hypothesis. This constraint places the prior mode for λ𝜆\lambdaitalic_λ at the specified value of ψ(ω)𝜓superscript𝜔\psi(\omega^{*})italic_ψ ( italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ). In constructing a BFF, ωsuperscript𝜔\omega^{*}italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is varied over a plausible range of values for the standardized effect.

Like many default Bayesian testing methods, criteria for setting the scale of the prior density used to define alternative hypotheses remains a topic of active research (e.g., Zellner, 1986; Doucet et al., 2002; Liang et al., 2008; Rouder et al., 2009; Consonni et al., 2018; Pramanik and Johnson, 2023). For the purposes of this article, we simply examine the sensitivity of Bayes factors and BFFs to the choice of r𝑟ritalic_r. Methods for estimating r𝑟ritalic_r from published findings of similar studies or in replicated designs are currently under investigation.

To illustrate this strategy for setting τ2superscript𝜏2\tau^{2}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for given r𝑟ritalic_r, consider a t𝑡titalic_t test of a null hypothesis H0:μ=0:subscript𝐻0𝜇0H_{0}:\mu=0italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_μ = 0 based on a random sample x1,,xnsubscript𝑥1subscript𝑥𝑛x_{1},\dots,x_{n}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, where xiN(μ,σ)similar-tosubscript𝑥𝑖𝑁𝜇𝜎x_{i}\sim N(\mu,\sigma)italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∼ italic_N ( italic_μ , italic_σ ) and σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is unknown. For this test, t=nx¯/s𝑡𝑛¯𝑥𝑠t=\sqrt{n}\bar{x}/sitalic_t = square-root start_ARG italic_n end_ARG over¯ start_ARG italic_x end_ARG / italic_s, where s2superscript𝑠2s^{2}italic_s start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is the usual unbiased estimate of σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The distribution of t𝑡titalic_t is

tμ,σTν(nμσ),similar-toconditional𝑡𝜇𝜎subscript𝑇𝜈𝑛𝜇𝜎t\mid\mu,\sigma\sim T_{\nu}\left(\frac{\sqrt{n}\mu}{\sigma}\right),italic_t ∣ italic_μ , italic_σ ∼ italic_T start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( divide start_ARG square-root start_ARG italic_n end_ARG italic_μ end_ARG start_ARG italic_σ end_ARG ) , (3)

where ν=n1𝜈𝑛1\nu=n-1italic_ν = italic_n - 1. Under the null hypothesis, tTν(0)similar-to𝑡subscript𝑇𝜈0t\sim T_{\nu}(0)italic_t ∼ italic_T start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT ( 0 ). The non-centrality parameter of the t𝑡titalic_t distribution under the alternative hypothesis is λ=nω𝜆𝑛𝜔\lambda=\sqrt{n}\omegaitalic_λ = square-root start_ARG italic_n end_ARG italic_ω, where we define ω=μ/σ𝜔𝜇𝜎\omega=\mu/\sigmaitalic_ω = italic_μ / italic_σ as the standardized effect.

The prior distribution recommended for λ𝜆\lambdaitalic_λ in Table 1 is a normal-moment prior, J(τ2,r)𝐽superscript𝜏2𝑟J(\tau^{2},r)italic_J ( italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ). The modes of this density are ±2rτplus-or-minus2𝑟𝜏\pm\sqrt{2r}\tau± square-root start_ARG 2 italic_r end_ARG italic_τ. To define a Bayes factor given a value ωsuperscript𝜔\omega^{*}italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, we select τ2superscript𝜏2\tau^{2}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT so that the modes of the prior occur at nω𝑛superscript𝜔\sqrt{n}\omega^{*}square-root start_ARG italic_n end_ARG italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. That is, we equate nω=2rτ𝑛superscript𝜔2𝑟𝜏\sqrt{n}\omega^{*}=\sqrt{2r}\tausquare-root start_ARG italic_n end_ARG italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = square-root start_ARG 2 italic_r end_ARG italic_τ and define τω,r2=nω2/(2r)subscriptsuperscript𝜏2superscript𝜔𝑟𝑛superscriptsuperscript𝜔22𝑟\tau^{2}_{\omega^{*},r}=n{\omega^{*}}^{2}/(2r)italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_r end_POSTSUBSCRIPT = italic_n italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 2 italic_r ).

A similar procedure can be used to set τω,r2subscriptsuperscript𝜏2superscript𝜔𝑟\tau^{2}_{\omega^{*},r}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_r end_POSTSUBSCRIPT for other test statistics. Table 2 lists values of τω,r2subscriptsuperscript𝜏2superscript𝜔𝑟\tau^{2}_{\omega^{*},r}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_r end_POSTSUBSCRIPT for several common statistical tests. This table generalizes the values provided in Table 1 of J23 for the case r=1𝑟1r=1italic_r = 1.

Figure 1 illustrates the effect of varying r𝑟ritalic_r so that the mode of the prior density on an effect size remains constant. It shows that the prior dispersion around the mode decreases as r𝑟ritalic_r increases.

Refer to caption
Figure 1: Normal moment prior densities with varying r𝑟ritalic_r
Table 2: Default choices of τω,rsubscript𝜏𝜔𝑟\tau_{\omega,r}italic_τ start_POSTSUBSCRIPT italic_ω , italic_r end_POSTSUBSCRIPT
Test Statistic Standardized Effect (ω𝜔\omegaitalic_ω) τω,r2subscriptsuperscript𝜏2𝜔𝑟\tau^{2}_{\omega,r}italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω , italic_r end_POSTSUBSCRIPT
1-sample z nx¯σ𝑛¯𝑥𝜎\frac{\sqrt{n}\bar{x}}{\sigma}divide start_ARG square-root start_ARG italic_n end_ARG over¯ start_ARG italic_x end_ARG end_ARG start_ARG italic_σ end_ARG μσ𝜇𝜎\frac{\mu}{\sigma}divide start_ARG italic_μ end_ARG start_ARG italic_σ end_ARG nω22r𝑛superscript𝜔22𝑟\frac{n\omega^{2}}{2r}divide start_ARG italic_n italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_r end_ARG
1-sample t nx¯s𝑛¯𝑥𝑠\frac{\sqrt{n}\bar{x}}{s}divide start_ARG square-root start_ARG italic_n end_ARG over¯ start_ARG italic_x end_ARG end_ARG start_ARG italic_s end_ARG μσ𝜇𝜎\frac{\mu}{\sigma}divide start_ARG italic_μ end_ARG start_ARG italic_σ end_ARG nω22r𝑛superscript𝜔22𝑟\frac{n\omega^{2}}{2r}divide start_ARG italic_n italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_r end_ARG
2-sample z n1n2(x¯1x¯2)σn1+n2subscript𝑛1subscript𝑛2subscript¯𝑥1subscript¯𝑥2𝜎subscript𝑛1subscript𝑛2\frac{\sqrt{n_{1}n_{2}}(\bar{x}_{1}-\bar{x}_{2})}{\sigma\sqrt{n_{1}+n_{2}}}divide start_ARG square-root start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ( over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_σ square-root start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_ARG μ1μ2σsubscript𝜇1subscript𝜇2𝜎\frac{\mu_{1}-\mu_{2}}{\sigma}divide start_ARG italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_σ end_ARG n1n2ω22r(n1+n2)subscript𝑛1subscript𝑛2superscript𝜔22𝑟subscript𝑛1subscript𝑛2\frac{n_{1}n_{2}\omega^{2}}{2r(n_{1}+n_{2})}divide start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_r ( italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG
2-sample t n1n2(x¯1x¯2)sn1+n2subscript𝑛1subscript𝑛2subscript¯𝑥1subscript¯𝑥2𝑠subscript𝑛1subscript𝑛2\frac{\sqrt{n_{1}n_{2}}(\bar{x}_{1}-\bar{x}_{2})}{s\sqrt{n_{1}+n_{2}}}divide start_ARG square-root start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ( over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_s square-root start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_ARG μ1μ2σsubscript𝜇1subscript𝜇2𝜎\frac{\mu_{1}-\mu_{2}}{\sigma}divide start_ARG italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_σ end_ARG n1n2ω22r(n1+n2)subscript𝑛1subscript𝑛2superscript𝜔22𝑟subscript𝑛1subscript𝑛2\frac{n_{1}n_{2}\omega^{2}}{2r(n_{1}+n_{2})}divide start_ARG italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_r ( italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG
Multinomial/Poisson χν2=i=1k(ninfi(θ^))2nfi(θ^)subscriptsuperscript𝜒2𝜈superscriptsubscript𝑖1𝑘superscriptsubscript𝑛𝑖𝑛subscript𝑓𝑖^𝜃2𝑛subscript𝑓𝑖^𝜃\chi^{2}_{\nu}=\sum\limits_{i=1}^{k}\frac{(n_{i}-nf_{i}(\hat{\theta}))^{2}}{nf% _{i}(\hat{\theta})}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT divide start_ARG ( italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_n italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_θ end_ARG ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_n italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over^ start_ARG italic_θ end_ARG ) end_ARG (pifi(θ)fi(θ))k×1subscriptsubscript𝑝𝑖subscript𝑓𝑖𝜃subscript𝑓𝑖𝜃𝑘1\left(\frac{p_{i}-f_{i}(\theta)}{\sqrt{f_{i}(\theta)}}\right)_{k\times 1}( divide start_ARG italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ ) end_ARG start_ARG square-root start_ARG italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ ) end_ARG end_ARG ) start_POSTSUBSCRIPT italic_k × 1 end_POSTSUBSCRIPT nωω2(k2+r1)=nkω~22(k2+r1)𝑛superscript𝜔𝜔2𝑘2𝑟1𝑛𝑘superscript~𝜔22𝑘2𝑟1\frac{n\omega^{\prime}\omega}{2(\frac{k}{2}+r-1)}=\frac{nk\tilde{\omega}^{2}}{% 2(\frac{k}{2}+r-1)}divide start_ARG italic_n italic_ω start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_ω end_ARG start_ARG 2 ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r - 1 ) end_ARG = divide start_ARG italic_n italic_k over~ start_ARG italic_ω end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r - 1 ) end_ARG
Linear model Fk,np=(RSS0RSS1)/k[(RSS1)/(np)]subscript𝐹𝑘𝑛𝑝𝑅𝑆subscript𝑆0𝑅𝑆subscript𝑆1𝑘delimited-[]𝑅𝑆subscript𝑆1𝑛𝑝F_{k,n-p}=\frac{(RSS_{0}-RSS_{1})/k}{[(RSS_{1})/(n-p)]}italic_F start_POSTSUBSCRIPT italic_k , italic_n - italic_p end_POSTSUBSCRIPT = divide start_ARG ( italic_R italic_S italic_S start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_R italic_S italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) / italic_k end_ARG start_ARG [ ( italic_R italic_S italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) / ( italic_n - italic_p ) ] end_ARG 𝐋1(𝐀𝜷𝐚)σsuperscript𝐋1𝐀𝜷𝐚𝜎\frac{\mathbf{L}^{-1}(\mathbf{A}\boldsymbol{\beta}-\mathbf{a})}{\sigma}divide start_ARG bold_L start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_A bold_italic_β - bold_a ) end_ARG start_ARG italic_σ end_ARG nωω2(k2+r1)=nkω~24(k2+r1)𝑛superscript𝜔𝜔2𝑘2𝑟1𝑛𝑘superscript~𝜔24𝑘2𝑟1\frac{n\omega^{\prime}\omega}{2(\frac{k}{2}+r-1)}=\frac{nk\tilde{\omega}^{2}}{% 4(\frac{k}{2}+r-1)}divide start_ARG italic_n italic_ω start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_ω end_ARG start_ARG 2 ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r - 1 ) end_ARG = divide start_ARG italic_n italic_k over~ start_ARG italic_ω end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 4 ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r - 1 ) end_ARG
Likelihood Ratio χk2=2log[l(θr0,θs^)l(θ^)]subscriptsuperscript𝜒2𝑘2𝑙subscript𝜃𝑟0^subscript𝜃𝑠𝑙^𝜃\chi^{2}_{k}=-2\log\left[\frac{l(\theta_{r0},\hat{\theta_{s}})}{l(\hat{\theta}% )}\right]italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = - 2 roman_log [ divide start_ARG italic_l ( italic_θ start_POSTSUBSCRIPT italic_r 0 end_POSTSUBSCRIPT , over^ start_ARG italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG ) end_ARG start_ARG italic_l ( over^ start_ARG italic_θ end_ARG ) end_ARG ] 𝐋1(θrθr0)superscript𝐋1subscript𝜃𝑟subscript𝜃𝑟0{\bf L}^{-1}(\theta_{r}-\theta_{r0})bold_L start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT italic_r 0 end_POSTSUBSCRIPT ) nωω2(k2+r1)=nkω~22(k2+r1)𝑛superscript𝜔𝜔2𝑘2𝑟1𝑛𝑘superscript~𝜔22𝑘2𝑟1\frac{n\omega^{\prime}\omega}{2(\frac{k}{2}+r-1)}=\frac{nk\tilde{\omega}^{2}}{% 2(\frac{k}{2}+r-1)}divide start_ARG italic_n italic_ω start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_ω end_ARG start_ARG 2 ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r - 1 ) end_ARG = divide start_ARG italic_n italic_k over~ start_ARG italic_ω end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 ( divide start_ARG italic_k end_ARG start_ARG 2 end_ARG + italic_r - 1 ) end_ARG

4 Examples

To demonstrate the application of the Bayes factors described in Table 1 and the procedure for setting (τ2,r)superscript𝜏2𝑟(\tau^{2},r)( italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_r ) described in Section 3, we re-analyzed an experiment from the Many Labs 3 project (Ebersole et al., 2016).

The experiment measured the effect on response time for subjects performing the Stroop task (Ebersole et al., 2016). This effect is among the strongest and most widely replicated effects in experimental psychology. The Stroop task requires subjects to identify the color of the type of printed words. This task takes longer when there is discordance between the type’s color and the color’s name. For example, responding red takes longer than blue. After preprocessing the data to account for unusually long response times and incorrect answers (Greenwald et al., 2003), the authors “calculated the average response time for all correct responses separately for congruent and incongruent trials" and then “replaced response latencies for trials with errors using the mean of correct responses in that condition plus 600 ms.” They then “recomputed the means for congruent and incongruent trials overall” and used the difference between these two means divided by the standard deviation of all correct trials regardless of condition” to construct paired t𝑡titalic_t statistics.

Twenty replications of this experiment were replicated in the Many Labs experiment. For illustration, we begin by analyzing results from the first experimental site where the t𝑡titalic_t statistic was 9.38 on 83838383 degrees of freedom (Table 3).

The null hypothesis in this study is that the mean difference in response times for the congruous and incongruous conditions is 0. Under the alternative hypothesis, we assume that the population mean difference in response times is μ𝜇\muitalic_μ and that the observational variance is σ𝜎\sigmaitalic_σ, and let ω=μ/σ𝜔𝜇𝜎\omega=\mu/\sigmaitalic_ω = italic_μ / italic_σ denote the standardized effect size. Given ω>0𝜔0\omega>0italic_ω > 0, we assume that the non-centrality parameters of the distributions of the t𝑡titalic_t statistic, λ𝜆\lambdaitalic_λ, is drawn from a normal-moment distribution,

J+(r+12,12τω,r2),superscript𝐽𝑟1212subscriptsuperscript𝜏2𝜔𝑟J^{+}\left(r+\frac{1}{2},\frac{1}{2\tau^{2}_{\omega,r}}\right),italic_J start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ( italic_r + divide start_ARG 1 end_ARG start_ARG 2 end_ARG , divide start_ARG 1 end_ARG start_ARG 2 italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω , italic_r end_POSTSUBSCRIPT end_ARG ) , (4)

where, from Table 2, we set

τω,r2=nω22r.subscriptsuperscript𝜏2𝜔𝑟𝑛superscript𝜔22𝑟\tau^{2}_{\omega,r}=\frac{n\omega^{2}}{2r}.italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ω , italic_r end_POSTSUBSCRIPT = divide start_ARG italic_n italic_ω start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_r end_ARG . (5)

Figure 2 displays the plot of BFFs obtained for r=1,5,10𝑟1510r=1,5,10italic_r = 1 , 5 , 10, and 15151515 using the theoretical results from Tables 1 and 2.

Several aspects of this figure merit comment. As expected, the BFF curves reflect more evidence in favor of alternative hypotheses corresponding to ω1𝜔1\omega\approx 1italic_ω ≈ 1 as r𝑟ritalic_r increases. This happens because alternative hypotheses concentrate more of their prior mass on the hypothesized values of ω𝜔\omegaitalic_ω as r𝑟ritalic_r increases. Indeed, as r𝑟r\rightarrow\inftyitalic_r → ∞, all priors converge to a point mass on the hypothesized value of ω𝜔\omegaitalic_ω, and the BFF curves reduce to a plot of the log-likelihood ratio (based on the test statistic). Nonetheless, all BFF curves reflect very strong evidence for alternative hypotheses defined by priors centered on standardized effect sizes greater than about 0.05, and the curves for r=5,10,15𝑟51015r=5,10,15italic_r = 5 , 10 , 15 are relatively insensitive to the choice of r𝑟ritalic_r within this range.

Refer to caption
Figure 2: Bayes factor functions for Stroop test for varying r𝑟ritalic_r.

We now consider how estimates of r𝑟ritalic_r might be used to construct BFFs when data from replicated sites is available. In this case, data from 20 Many Labs consortium sites are available; t𝑡titalic_t statistics from the 20 sites are listed in Table 3. For simplicity, we took a naive empirical Bayes approach to estimate r𝑟ritalic_r (see Supplemental Materials), leading to an estimate of r=9.99𝑟9.99r=9.99italic_r = 9.99. This value of r𝑟ritalic_r suggests that standardized effects were relatively consistent across sites.

Figure 3 provides the BFF based on the empirical Bayes estimate of r𝑟ritalic_r and the t𝑡titalic_t statistics from all 20 sites. For comparison, the BFF for r=1𝑟1r=1italic_r = 1 is also displayed. In this case, using the empirical Bayes estimate of r𝑟ritalic_r had a moderate effect in increasing the BFF near its maximum of ω=0.89𝜔0.89\omega=0.89italic_ω = 0.89. With 20 experimental sites each exhibiting very strong evidence for a Stroop effect, the evidence against the null hypothesis is overwhelming except for alternative hypotheses concentrating prior mass on ω𝜔\omegaitalic_ω near 0.

Table 3: T statistics and degrees-of-freedom for the Stroop task [t-statistic (degrees of freedom)].
9.38(83) 9.85(118) 7.36(43) 11.62(90)
7.85(95) 12.56(317) 11.01(123) 10.15(130)
13.52(157) 10.14(100) 8.90(116) 10.37(141)
11.68(177) 9.11(118) 16.97(241) 8.82(136)
8.46(88) 5.93(80) 12.17(193) 9.37(94)
Refer to caption
Figure 3: The combined BFF using the MOM estimator of r𝑟ritalic_r for the 20 sites in the test set is depicted by the solid black line. The combined BFF using r=1𝑟1r=1italic_r = 1 is shown by the black dashed line, while the BFFs for each individual study (using r=1𝑟1r=1italic_r = 1) are illustrated by the colored lines.

5 Discussion

Bayes factors based on test statistics present compelling advantages over standard methods for calculating Bayes factors. They eliminate computational challenges that arise when Bayes factors are computed from complex statistical models, and they reduce subjectivity when defining prior distributions on model parameters. By indexing Bayes factors based on test statistics according to standardized effects, BFFs eliminate much of the subjectivity associated with specifying a single alternative model. They also provide users with a simple representation of the statistical evidence in favor of alternative hypotheses centered on a range of effect sizes.

The BFFs proposed in J23 lack flexibility in allowing scientists to incorporate the precision of their estimates of effect sizes into Bayesian hypothesis tests. This article addresses this shortcoming by expanding the class of prior distributions used in defining BFFs. In particular, it provides analytic expressions for Bayes factors based on test statistics in conjunction with more general classes of prior distributions and illustrates how BFFs can be constructed from these broader classes. Importantly, we demonstrate how the scale parameters of the prior distributions can be linked so that the modes of prior distributions are located at hypothesized effect sizes.

J23 demonstrated how test statistics from replicated experiments could be combined to generate an aggregated BFF. This article’s results will permit that methodology to account for the dispersion of effect sizes across replications of similar experiments. Efficient and coherent procedures for incorporating such information are currently under investigation.

The BFF package (https://CRAN.R-project.org/package=BFF), available from the CRAN depository, provides R functions to compute the BFFs reported in this article.

Acknowledgment

The authors acknowledge support from the National Science Foundation, NSF DMS 2311005.

References

  • Consonni et al. [2018] Guido Consonni, Dimitris Fouskakis, Brunero Liseo, and Ioannis Ntzoufras. Prior Distributions for Objective Bayesian Analysis. Bayesian Analysis, 13(2):627 – 679, 2018. 10.1214/18-BA1103. URL https://doi.org/10.1214/18-BA1103.
  • Doucet et al. [2002] A. Doucet, S.J. Godsill, and C.P. Robert. Marginal maximum a posteriori estimation using Markov chain Monte Carlo. Statistics and Computing, 12:77–84, 2002.
  • Ebersole et al. [2016] Charles R. Ebersole, Olivia E. Atherton, Aimee L. Belanger, Hayley M. Skulborstad, Jill M. Allen, Jonathan B. Banks, Erica Baranski, Michael J. Bernstein, Diane B.V. Bonfiglio, Leanne Boucher, Elizabeth R. Brown, Nancy I. Budiman, Athena H. Cairo, Colin A. Capaldi, Christopher R. Chartier, Joanne M. Chung, David C. Cicero, Jennifer A. Coleman, John G. Conway, William E. Davis, Thierry Devos, Melody M. Fletcher, Komi German, Jon E. Grahe, Anthony D. Hermann, Joshua A. Hicks, Nathan Honeycutt, Brandon Humphrey, Matthew Janus, David J. Johnson, Jennifer A. Joy-Gaba, Hannah Juzeler, Ashley Keres, Diana Kinney, Jacqeline Kirshenbaum, Richard A. Klein, Richard E. Lucas, Christopher J.N. Lustgraaf, Daniel Martin, Madhavi Menon, Mitchell Metzger, Jaclyn M. Moloney, Patrick J. Morse, Radmila Prislin, Timothy Razza, Daniel E. Re, Nicholas O. Rule, Donald F. Sacco, Kyle Sauerberger, Emily Shrider, Megan Shultz, Courtney Siemsen, Karin Sobocko, R. Weylin Sternglanz, Amy Summerville, Konstantin O. Tskhay, Zack van Allen, Leigh Ann Vaughn, Ryan J. Walker, Ashley Weinberg, John Paul Wilson, James H. Wirth, Jessica Wortman, and Brian A. Nosek. Many labs 3: Evaluating participant pool quality across the academic semester via replication. Journal of Experimental Social Psychology, 67:68–82, 2016. ISSN 0022-1031. https://doi.org/10.1016/j.jesp.2015.10.012. URL https://www.sciencedirect.com/science/article/pii/S0022103115300123. Special Issue: Confirmatory.
  • Greenwald et al. [2003] A.G. Greenwald, B.A. Nosek, and M.R. Banaji. Understanding and using the implicit association test: I. an improved scoring algorithm. J Pers Soc Psychol, 2003. 10.1037/0022-3514.85.2.197.
  • Johnson [2005] Valen E. Johnson. Bayes factors based on test statistics. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 67(5):689–701, 2005. ISSN 13697412, 14679868. URL http://www.jstor.org/stable/3647614.
  • Johnson and Rossell [2010] Valen E. Johnson and David Rossell. On the use of non-local prior densities in bayesian hypothesis tests. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 72(2):143–170, 2010. ISSN 13697412, 14679868. URL http://www.jstor.org/stable/40541581.
  • Johnson et al. [2023] Valen E. Johnson, Sandipan Pramanik, and Rachael Shudde. Bayes factor functions for reporting outcomes of hypothesis tests. Proceedings of the National Academy of Sciences, 2023. URL https://doi.org/10.48550/arXiv.2210.00049.
  • Liang et al. [2008] Feng Liang, Rui Paulo, German Molina, Merlise A Clyde, and Jim O Berger. Mixtures of g priors for bayesian variable selection. Journal of the American Statistical Association, 103(481):410–423, 2008. 10.1198/016214507000001337. URL https://doi.org/10.1198/016214507000001337.
  • Pramanik and Johnson [2023] Sandipan Pramanik and Valen Johnson. Efficient alternatives for bayesian hypothesis tests in psychology. Psychological Methods, 2023. URL https://doi.org/10.1037/met0000482.
  • Rouder et al. [2009] J.N. Rouder, P.L. Speckman, D. Sun, and R.D. Morey. Bayesian t tests for accepting and rejecting the null hypothesis. Psychonomic Bulletin & Review, 16:225–237, 2009. 10.3758/PBR.16.2.225.
  • Zellner [1986] A Zellner. On Assessing Prior Distributions and Bayesian Regression Analysis with g Prior Distributions. Bayesian Inference and Decision Techniques: Essays in Honor of Bruno de Finetti. Studies in Bayesian Econometrics and Statistics. Vol. 6. New York: Elsevier, 1986. ISBN 78-0-444-87712-3.