License: arXiv.org perpetual non-exclusive license
arXiv:2402.00108v1 [astro-ph.GA] 31 Jan 2024

The Escape Velocity Profile of the Milky Way from Gaia DR3

Cian Roche Department of Physics and MIT Kavli Institute for Astrophysics and Space Research,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
Lina Necib Department of Physics and MIT Kavli Institute for Astrophysics and Space Research,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
The NSF AI Institute for Artificial Intelligence and Fundamental Interactions,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
Tongyan Lin Department of Physics, University of California San Diego, La Jolla, CA 92093, USA Xiaowei Ou Department of Physics and MIT Kavli Institute for Astrophysics and Space Research,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
Tri Nguyen Department of Physics and MIT Kavli Institute for Astrophysics and Space Research,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
The NSF AI Institute for Artificial Intelligence and Fundamental Interactions,
77 Massachusetts Avenue, Cambridge, MA 02139, USA
Abstract

The escape velocity profile of the Milky Way offers a crucial and independent measurement of its underlying mass distribution and dark matter properties. Using a sample of stars from the third data release of Gaia with 6D kinematics and strict quality cuts, we obtain an escape velocity profile of the Milky Way from 4kpc4kpc4\,\rm{kpc}4 roman_kpc to 11kpc11kpc11\,\rm{kpc}11 roman_kpc in Galactocentric radius. To infer the escape velocity in radial bins, we model the tail of the stellar speed distribution with both traditional power law models and a new functional form that we introduce. While power law models tend to rely on extrapolation to high speeds, we find our new functional form gives the most faithful representation of the observed distribution. Using this for the escape velocity profile, we constrain the properties of the Milky Way’s dark matter halo modeled as a Navarro-Frenck-White profile. Combined with constraints from the circular velocity at the solar position, we obtain a concentration and mass of c200cDM=13.94.3+6.2superscriptsubscript𝑐200cDMsubscriptsuperscript13.96.24.3c_{200\rm{c}}^{\rm{DM}}=13.9^{+6.2}_{-4.3}italic_c start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT = 13.9 start_POSTSUPERSCRIPT + 6.2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 4.3 end_POSTSUBSCRIPT and M200cDM=0.550.14+0.15×1012MsuperscriptsubscriptM200cDMsubscriptsuperscript0.550.150.14superscript1012subscriptMdirect-product\rm{M}_{200\rm{c}}^{\rm{DM}}=0.55^{+0.15}_{-0.14}\times 10^{12}\,M_{\odot}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT = 0.55 start_POSTSUPERSCRIPT + 0.15 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.14 end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT. This corresponds to a total Milky Way mass of M200c=0.640.14+0.15×1012MsubscriptM200csubscriptsuperscript0.640.150.14superscript1012subscriptMdirect-product\rm{M}_{200\rm{c}}=0.64^{+0.15}_{-0.14}\times 10^{12}\,M_{\odot}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT = 0.64 start_POSTSUPERSCRIPT + 0.15 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.14 end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT, which is on the low end of the historic range of the Galaxy’s mass, but in line with other recent estimates.

software: Python (Van Rossum & Drake, 2009), numpy (Harris et al., 2020), scipy (Virtanen et al., 2020), astropy (Astropy Collaboration et al., 2013, 2018), jupyter (Kluyver et al., 2016), corner.py (Foreman-Mackey, 2016), emcee (Foreman-Mackey et al., 2013), Galpy (Bovy, 2015). The implementation of the affine-invariant Markov chain Monte Carlo algorithm of Goodman & Weare (2010) in the julia programming language (Bezanson et al., 2017) can be found at https://github.com/CianMRoche/MCJulia.jl which is forked from https://github.com/mktranstrum/MCJulia.jl.

1 Introduction

The earliest hints of the existence of Dark Matter (DM) came from the dynamics of stars and galaxies (Zwicky, 1933; Rubin & Ford, 1970). These observations provided the first evidence that galaxies are surrounded by DM halos extending more than an order of magnitude past the visible matter (Ostriker et al., 1974; Einasto et al., 1974). In the Milky Way, the DM halo dominates the mass of the Galaxy, and is expected to be similar-to\sim 1-2 orders of magnitudes larger in mass than the baryonic (stars and gas) component. However, many of its detailed properties have thus far not been well understood. Insight into both the total mass and density profile of the DM halo is extremely valuable for galactic dynamics and studies of DM, informing our understanding of baryonic feedback (Tollet et al., 2016; Lazar et al., 2020), the particle nature of DM (Spergel & Steinhardt, 2000; Tulin & Yu, 2018; Lisanti et al., 2019; Nadler et al., 2019, 2021), and indirect detection of DM (Cirelli et al., 2011; Slatyer, 2018; Rinchiuso et al., 2021).

Many methods have been used to measure the MW mass, and to a lesser extent the density profile of DM: the circular velocity of the MW (Eilers et al., 2019; Ou et al., 2023; Jiao et al., 2023; Wang et al., 2023; Zhou et al., 2023), the population and motion of satellites (Barber et al., 2013; Cautun et al., 2014; Patel et al., 2018; Callingham et al., 2019; Sohn et al., 2018; Watkins et al., 2019; Fritz et al., 2020), stellar streams (Gibbons et al., 2014; Küpper et al., 2015; Dierickx & Loeb, 2017), and the escape velocity of the MW (Piffl et al., 2014; Monari et al., 2018; Deason et al., 2019; Koppelman & Helmi, 2021; Necib & Lin, 2022a). The escape velocity, in particular, gives a measure of the gravitational potential at a given location, and can be used to constrain the DM potential when combined with models of the baryonic mass. It is typically extracted by modeling the tail of the stellar speed distribution. Due to a lack of data, earlier studies focused on the local escape velocity in the solar neighborhood (Leonard & Tremaine, 1990; Smith et al., 2007). In this paper, we produce precise measurements of the escape velocity over a range of Galactocentric radii, in order to obtain an escape velocity profile and thus inform the DM potential over a wider range. We revisit prior power law models for the tail of the speed distribution, and also introduce a new, more robust functional form.

Our analysis is made possible with the significant improvements in the third data release from Gaia (Gaia DR3). The Gaia space mission (Gaia Collaboration et al., 2016a, b, 2018, 2021) has revolutionized the field of MW astronomy, from studies of streams (Malhan et al., 2018; Price-Whelan & Bonaca, 2018; Helmi, 2020; Chandra et al., 2023), dust (Green et al., 2019; Lallement et al., 2019; Leike et al., 2020), disk resonances (Kawata et al., 2018; Trick et al., 2021), to the discovery of previously unknown merging events (Helmi et al., 2018; Belokurov et al., 2018; Necib et al., 2020; Naidu et al., 2020a), and more. The latest data release of the Gaia mission includes positions, parallax, proper motions, and radial velocities of an astounding 33 million stars (Gaia Collaboration et al., 2023; Katz et al., 2022). The six-dimensional features of this large dataset allow us to finally probe detailed dynamics of the MW at a few kpc from the solar position, enabling more precise and reliable determinations of the escape velocity profile. Gaia DR3, in particular, has a significant advantage over DR2 in that it includes many more sources at large distances from the Sun with line-of-sight velocities.

Even with excellent data, estimating the escape velocity crucially relies on modeling the tail of the stellar speed distribution. Leonard & Tremaine (1990) introduced a power-law model, with a distribution given by

g(vvesc,k)(vescv)k,v[vmin,vesc]formulae-sequenceproportional-to𝑔conditional𝑣subscript𝑣esc𝑘superscriptsubscript𝑣esc𝑣𝑘𝑣subscript𝑣minsubscript𝑣escg(v\mid v_{\rm{esc}},k)\propto(v_{\rm{esc}}-v)^{k},\quad v\in[v_{\rm{min}},v_{% \rm{esc}}]italic_g ( italic_v ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k ) ∝ ( italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT - italic_v ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT , italic_v ∈ [ italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ] (1)

where vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT is the escape velocity, k𝑘kitalic_k is the slope of the power law, and vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT is a lower speed cutoff which reflects the fact that we only model the tail of the speed distribution for stars at a given Galactocentric radius. The overall normalization is set such that vminvesc𝑑vg(v)=1superscriptsubscriptsubscript𝑣minsubscript𝑣escdifferential-d𝑣𝑔𝑣1\int_{v_{\rm{min}}}^{v_{\rm{esc}}}dv\,g(v)=1∫ start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_d italic_v italic_g ( italic_v ) = 1. This functional form has been the basis of most of the escape velocity measurements in the MW for the following decades. However, escape velocity estimates based upon this modeling (Smith et al., 2007; Piffl et al., 2014; Monari et al., 2018; Deason et al., 2019; Koppelman & Helmi, 2021; Necib & Lin, 2022a) face various challenges such as small sample sizes, degeneracy of parameters, systematic uncertainties in model selection, and the choice of strong prior distributions on the fitting parameters.

The functional form in Eq. 1 exhibits a strong degeneracy between the two parameters vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and k𝑘kitalic_k, leading to large uncertainties on the escape velocity estimates; this degeneracy can be exacerbated by small sample sizes or if the underlying distribution deviates from a single power law. To reduce such uncertainties on vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, previous work in the literature adopted an array of either theoretically or numerically (based on cosmological simulations) motivated priors on the slope k𝑘kitalic_k. Such choices affect the final measurement of the escape velocity, and ideally one would require a model that best fits the data without the need to adopt strong priors.

Refer to caption
Figure 1: High-speed and high-quality sample of stars in Gaia DR3 with 6D kinematics, and the derived escape velocity profile at 68%percent6868\%68 % (boxes) and 95%percent9595\%95 % (whiskers) confidence intervals. Medians marked with a horizontal line. Escape velocities shown here are obtained by binning the data into 1kpc1kpc1\,\rm{kpc}1 roman_kpc bins and fitting the speed distributions above vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT with the stretched exponential power law model introduced in this paper. Contours correspond to the logarithm of the number of stars at that speed and Galactocentric radius, labelled N.

Motivated by the discovery of the Gaia-Sausage-Enceladus (GSE) (Helmi et al., 2018; Belokurov et al., 2018) merger that has contributed a large fraction of the ex-situ stars in the MW (see e.g. Necib et al., 2019; Naidu et al., 2020b), and arguing for the need to address the issues above, Necib & Lin (2022b) adopted a two-component power law, generalizing Eq. 1 but with two different slopes k1subscript𝑘1k_{1}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and k2subscript𝑘2k_{2}italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT,

g(vvesc,k1,k2)proportional-to𝑔conditional𝑣subscript𝑣escsubscript𝑘1subscript𝑘2absent\displaystyle g(v\mid v_{\rm{esc}},k_{1},k_{2})\proptoitalic_g ( italic_v ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∝ (1fS)g(vvesc,k1)1subscript𝑓𝑆𝑔conditional𝑣subscript𝑣escsubscript𝑘1\displaystyle(1-f_{S})g(v\mid v_{\rm{esc}},k_{1})( 1 - italic_f start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) italic_g ( italic_v ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) (2)
+fSg(vvesc,k2),v[vmin,vesc]subscript𝑓𝑆𝑔conditional𝑣subscript𝑣escsubscript𝑘2𝑣subscript𝑣minsubscript𝑣esc\displaystyle+f_{S}g(v\mid v_{\rm{esc}},k_{2}),\quad v\in[v_{\rm{min}},v_{\rm{% esc}}]+ italic_f start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT italic_g ( italic_v ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , italic_v ∈ [ italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ]

where fSsubscript𝑓𝑆f_{S}italic_f start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT is an additional parameter that controls the relative fraction of the power laws. Based on this new functional form, Necib & Lin (2022a) computed the escape velocity of the MW at the position of the Sun using the early Gaia data release 3 (eDR3) (Gaia Collaboration et al., 2021).

The two-component model in Necib & Lin (2022b) provided a more general functional form, allowing for a better fit to data containing substructure. Indeed, Necib & Lin (2022a) showed using the Akaike information criterion (AIC; Akaike (1974)), that the two-component model is preferred at lower vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, but that at higher vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT it was not distinguishable from a single component. This affirms the presence of a deviation from the single power law away from vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. The model further addressed the degeneracy and the issue of strong priors on the slope k𝑘kitalic_k: with the modified distribution, it was possible to obtain a good fit to the data and robust measurement of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT with wide priors on the slopes.

In this work, we aim at further investigating models for the tail of the speed distribution, adapting to the improved quality and increased statistics of the data in Gaia DR3. We reconsider the single power law and two-component power law models, and obtain the first two-component power law escape velocity profile using this extended dataset. With the improved dataset, we find that there remains a number of issues in using these power-law models: they can predict a high vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT relative to the fastest stars in the dataset, results can be sensitive to the choice of vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, and some parameters correlated with the escape velocity can remain unconverged even with wide priors. Motivated by these challenges, we introduce a new functional form that allows for a steeper rise in the speed distribution away from vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, while keeping a power law cutoff near vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. We find that this new form most closely tracks the fastest stars and gives the most stable results. With each of these models, we obtain independent escape velocity measurements in 1 kpc-wide bins from 4 kpc to 11 kpc in Galactocentric radius. We use the resulting escape velocity profile to fit the DM density profile, and combine this with a circular velocity measurement from Eilers et al. (2019) to obtain an updated measurement of the MW mass.

The remainder of this paper is organized as follows: In Section 2, we present the selection criteria for the high-quality sample of Gaia DR3 data used to obtain our escape velocity profile and Milky Way halo constraints. In Section 3 we describe the general modeling approach for the tail of the stellar speed distribution, and in Section 4 we present the different models for this speed distribution. These models include the previously studied one and two-component power laws, as well as the new model we introduce in this work, the “stretched exponential power law.” In Section 5, we show the results of applying these three models to the Gaia DR3 data, presenting the escape velocity as a function of the radial distance and validating the modeling. Finally in Section 6, we use the escape velocity data to constrain the DM density profile and total mass of the Milky Way.

2 Data

To evaluate the escape velocity of the Milky Way, we use the third catalog from the Gaia space mission (Gaia DR3) (Gaia Collaboration et al., 2016a, 2023), which contains 33similar-toabsent33\sim 33∼ 33 million stars with 6D kinematics, an increase of about a factor of five from the previous release of Gaia eDR3 (Gaia Collaboration et al., 2021). As we are interested in modeling the tail of the speed distribution, we extract a high-quality sample of stars from this catalog in order to avoid stars with contaminated radial velocities and/or unphysical parallax measurements. Such contaminants could bias escape velocity estimates toward higher values since these analyses are sensitive to the fastest star in the sample (see e.g. Koppelman & Helmi (2021)).

To obtain a high quality sample, we follow the selection criteria of Marchetti et al. (2022), imposing quality cuts on parallax measurements, renormalized unit weight error (RUWE), number of transits used to calculate the radial velocity of the source, and signal-to-noise ratio. We do however demand higher-quality signal to noise ratios of 10101010 (as opposed to 5555) in line with Katz et al. (2022), which found a large number of spurious radial velocity measurements near the Solar position at low signal-to-noise, and parallaxes with at most 10%percent1010\%10 % error (as opposed to 20%percent2020\%20 %) to reduce the contamination of stars across radial bins. For parallax measurements, we apply a zero-point correction according to Lindegren et al. (2021) and choose not to include those stars for which a zero-point-corrected and positive parallax is unavailable. Note that an alternative implementation of the cut on parallax percentage errors would be a “bin membership” cut whereby a star only be included in the analysis if it is a member of its hosting radial bin at some chosen confidence level. In the present analysis, such a cut would be too restrictive and reduce statistics away from the solar position due to the limitations of parallax measurements. This may lead to some leakage of stars near bin edges across radial bins, but since the escape velocity is not expected to vary significantly over distances 0.5kpcless-than-or-similar-toabsent0.5kpc\lesssim 0.5\,\rm{kpc}≲ 0.5 roman_kpc, we do not expect this to appreciably bias results as is shown in Fig. A1, in which we repeated our analysis with a shift in the binning.

We also require that the derived percentage error on the speed be less than 1%percent11\%1 % since large errors can significantly modify the shape of the tail of the distribution. In some works such as Piffl et al. (2014); Monari et al. (2018), only the retrograde stars in the sample are modeled to avoid disk contamination, but following Necib & Lin (2022a) we can also remove disk stars via high minimum speed cuts, here not considering stars with speeds below 300kms1300kmsuperscripts1300\,\rm{km\,s^{-1}}300 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, or by choosing more flexible models that can account for the presence of additional velocity components.

The cuts are summarized as follows:

  • -

    RUWE 1.4absent1.4\leq 1.4≤ 1.4

  • -

    RV_NB_TRANSITS 10absent10\geq 10≥ 10

  • -

    RV_EXPECTED_SIG_TO_NOISE 10absent10\geq 10≥ 10

  • -

    Positive zero-point-corrected parallax ϖϖzp>0italic-ϖsubscriptitalic-ϖ𝑧𝑝0\varpi-\varpi_{zp}>0italic_ϖ - italic_ϖ start_POSTSUBSCRIPT italic_z italic_p end_POSTSUBSCRIPT > 0

  • -

    Parallax error: σϖ/(ϖϖzp)10%subscript𝜎italic-ϖitalic-ϖsubscriptitalic-ϖ𝑧𝑝percent10\sigma_{\varpi}/(\varpi-\varpi_{zp})\leq 10\%italic_σ start_POSTSUBSCRIPT italic_ϖ end_POSTSUBSCRIPT / ( italic_ϖ - italic_ϖ start_POSTSUBSCRIPT italic_z italic_p end_POSTSUBSCRIPT ) ≤ 10 %

  • -

    Minimum speed: v300kms1𝑣300kmsuperscripts1v\geq 300\,\rm{km\,s^{-1}}italic_v ≥ 300 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT

  • -

    Speed error: σv/v1%subscript𝜎𝑣𝑣percent1\sigma_{v}/v\leq 1\%italic_σ start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT / italic_v ≤ 1 %

The data processing pipeline111Pipeline and data products are available at https://doi.org/10.5281/zenodo.8088365. used to calculate the stellar kinematics and associated uncertainties in Galactocentric coordinates is similar to that of Necib & Lin (2022a), with the addition of the zero-point correction of parallaxes and stricter quality cuts. In particular, we use sources with parallax errors of 10%percent1010\%10 % or less (rather than 20%percent2020\%20 %) to reflect the finer radial binning used in this analysis (1kpc1kpc1\,\rm{kpc}1 roman_kpc bins as opposed to 2kpc2kpc2\,\rm{kpc}2 roman_kpc), and percentage speed errors of at most 1%percent11\%1 % (rather than 5%percent55\%5 %) because speed measurement errors can dominate the shape of the high-speed tail; these choices are enabled by the improved statistics of Gaia DR3. The Solar position used in this analysis is x=8.122kpcsubscript𝑥direct-product8.122kpcx_{\odot}=-8.122\,\rm{kpc}italic_x start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT = - 8.122 roman_kpc (GRAVITY Collaboration et al., 2018), y=0kpcsubscript𝑦direct-product0kpcy_{\odot}=0\,\rm{kpc}italic_y start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT = 0 roman_kpc, and z=0.0208kpcsubscript𝑧direct-product0.0208kpcz_{\odot}=0.0208\,\rm{kpc}italic_z start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT = 0.0208 roman_kpc (Bennett & Bovy, 2018) in galactic coordinates. For the Solar peculiar velocity vector we use v=(12.9,245.6,7.78)kms1subscript𝑣direct-product12.9245.67.78kmsuperscripts1v_{\odot}=(12.9,245.6,7.78)\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT = ( 12.9 , 245.6 , 7.78 ) roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (Drimmel & Poggio, 2018).

The cuts we utilize result in one of the highest-quality 6D kinematics datasets used for the escape velocity measurements to date. It is also the largest, containing 12,3171231712,31712 , 317 stars above a speed of 300kms1300kmsuperscripts1300\,\rm{km\,s^{-1}}300 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, compared to the 3,93239323,9323 , 932 within 1kpc1kpc1\,\rm{kpc}1 roman_kpc of the solar position of Necib & Lin (2022a), and the 2,06720672,0672 , 067 in the 6D kinematics sample of Koppelman & Helmi (2021). We represent this selection of stars in Gaia DR3 in Figure 1, along with one of the escape velocity profiles obtained in this paper, discussed in detail in Sec. 5.

In Figure 1, one can note some general features of the data. First, there is a slight overdensity of high velocity stars near the solar position. This excess may be due to a larger number of spurious radial velocity measurements in this region (Katz et al., 2022), as this feature appears primarily close to the solar position.

Second, the speed of the fastest stars generally decreases from 4.5similar-toabsent4.5\sim 4.5∼ 4.5 to 12kpc12kpc12\,\rm{kpc}12 roman_kpc, partly as a result of the increasing Galactic potential. At Galactocentric radii smaller than 4kpc4kpc4\,\rm{kpc}4 roman_kpc, however, there is a decrease in statistics resulting from parallax quality cuts, leading to fewer high speed stars. In general, the number of observed stars in each radial bin is modulated by the Gaia selection function, which is typically modeled as a function of magnitude and sky position (Everall & Boubert, 2021), or a combined metric of both parameters such as the median magnitude of a patch of sky (Cantat-Gaudin et al., 2023). Assuming no significant correlations between completeness and speed, we expect the shape of the speed distribution in a given radial bin is not strongly biased by the selection function. However, even if the selection function impacts the detailed shape of the distribution, our goal is to extract the cutoff of the speed distribution and not its exact shape.

One can also note over and under-densities in the contours from 78kpc78kpc7-8\,\rm{kpc}7 - 8 roman_kpc at speeds close to and below 300kms1300kmsuperscripts1300\,\rm{km\,s^{-1}}300 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. The source of this feature is the resonant band structure of the speed distribution of the Milky Way (Antoja et al., 2018), and in fact there are other parallel bands which are simply not visible in this limited data sample. Over and under-densities at certain Galactocentric radii cannot be modeled by simple power law distributions, and thus fits to single power laws could lead to biased escape velocity estimates. This already suggests the need for models that are robust to the presence of such features.

3 Methods

In this Section, we present our approach to modeling the tail of the stellar speed distribution, the likelihood for a given set of observed data (speeds of stars in one bin of Galactocentric radius), and the method for obtaining posterior distributions of the model parameters. The speed distribution will be modeled as the combination of one or more “bound component(s)” corresponding to kinematic structures that are gravitationally bound to the Galaxy, and an outlier distribution which models both unbound and potentially mismeasured stars. Here we focus on the methodology, while Section 4 will discuss the different parametric models including the single power law Eq. 1, the 2-component power law Eq. 2 and the newly introduced “stretched exponential power law.”

3.1 Bound component

We first describe the parametric modeling of a single bound component, i.e. an approximately isotropic distribution of stars bound to the Milky Way. Let us assume that in a given bin of Galactocentric radius, the tail of the speed distribution for these bound stars is described by g(vθ)𝑔conditional𝑣𝜃g(v\mid\mathbf{\theta})italic_g ( italic_v ∣ italic_θ ) (see Eq. 1), where θ𝜃\mathbf{\theta}italic_θ is a vector of model parameters including the escape velocity vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, and g𝑔gitalic_g is defined on [0,vesc]0subscript𝑣esc[0,v_{\mathrm{esc}}][ 0 , italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ] as we expect the distribution of bound stars to be zero at speeds higher than vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT (Leonard & Tremaine, 1990). In the next section, we will consider different functional forms for g𝑔gitalic_g with various motivations.

In Koppelman & Helmi (2021) and Necib & Lin (2022b), the probability of observing a star labelled α𝛼\alphaitalic_α with a speed vαsubscript𝑣𝛼v_{\alpha}italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT and a Gaussian measurement uncertainty σαsubscript𝜎𝛼\sigma_{\alpha}italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT is given by the convolution

pα(vαθ)=Cα(θ)0vesc𝑑ve(vvα)22σα22πσα2g(vθ)subscript𝑝𝛼conditionalsubscript𝑣𝛼𝜃subscript𝐶𝛼𝜃superscriptsubscript0subscript𝑣escdifferential-d𝑣superscript𝑒superscript𝑣subscript𝑣𝛼22superscriptsubscript𝜎𝛼22𝜋superscriptsubscript𝜎𝛼2𝑔conditional𝑣𝜃p_{\alpha}\left(v_{\alpha}\mid\mathbf{\theta}\right)=C_{\alpha}(\mathbf{\theta% })\int_{0}^{v_{\mathrm{esc}}}dv\,\frac{e^{-\frac{\left(v-v_{\alpha}\right)^{2}% }{2\sigma_{\alpha}^{2}}}}{\sqrt{2\pi\sigma_{\alpha}^{2}}}\,g(v\mid\mathbf{% \theta})italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_θ ) = italic_C start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_d italic_v divide start_ARG italic_e start_POSTSUPERSCRIPT - divide start_ARG ( italic_v - italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG square-root start_ARG 2 italic_π italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG italic_g ( italic_v ∣ italic_θ ) (3)

where Cα(θ)subscript𝐶𝛼𝜃C_{\alpha}(\theta)italic_C start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_θ ) is a normalization constant obtained by integrating over the data region [vmin,]subscript𝑣min[v_{\mathrm{min}},\infty][ italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , ∞ ] such that

vmin𝑑vαpα(vαθ)=1.superscriptsubscriptsubscript𝑣mindifferential-dsubscript𝑣𝛼subscript𝑝𝛼conditionalsubscript𝑣𝛼𝜃1\int_{v_{\mathrm{min}}}^{\infty}dv_{\alpha}\,p_{\alpha}\left(v_{\alpha}\mid% \mathbf{\theta}\right)=1\,.∫ start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_d italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_θ ) = 1 . (4)

However, in this work we utilize a strict quality cut on stellar speed uncertainties σvα/vα1%subscript𝜎subscript𝑣𝛼subscript𝑣𝛼percent1\sigma_{v_{\alpha}}/v_{\alpha}\leq 1\%italic_σ start_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT end_POSTSUBSCRIPT / italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ≤ 1 %. In the limit of vanishing relative uncertainty, the probability distribution Eq. 3 reduces to

pα(vαθ)=g(vαθ)subscript𝑝𝛼conditionalsubscript𝑣𝛼𝜃𝑔conditionalsubscript𝑣𝛼𝜃p_{\alpha}(v_{\alpha}\mid\theta)=g(v_{\alpha}\mid\mathbf{\theta})italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_θ ) = italic_g ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_θ ) (5)

with the appropriate normalization according to Eq. 4, where the upper limit becomes vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT since g𝑔gitalic_g has support on [0,vesc]0subscript𝑣esc[0,v_{\mathrm{esc}}][ 0 , italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ]. We utilize the approximation Eq. 5 in the present analysis.

3.2 Outliers

Following Williams et al. (2017) and Necib & Lin (2022b), we include a wide Gaussian outlier distribution, corresponding to unbound stars that are not modeled by the distribution g(vθ)𝑔conditional𝑣𝜃g(v\mid\mathbf{\theta})italic_g ( italic_v ∣ italic_θ ) or stars with potentially mismeasured speeds. The probability for a star α𝛼\alphaitalic_α to be drawn from this outlier distribution is modeled as

pαout(vασout)=Aexp(vα22[σout2+σα2])superscriptsubscript𝑝𝛼outconditionalsubscript𝑣𝛼subscript𝜎out𝐴superscriptsubscript𝑣𝛼22delimited-[]superscriptsubscript𝜎out2superscriptsubscript𝜎𝛼2p_{\alpha}^{\mathrm{out}}\left(v_{\alpha}\mid\sigma_{\mathrm{out}}\right)=A% \exp\left(-\frac{v_{\alpha}^{2}}{2\left[\sigma_{\mathrm{out}}^{2}+\sigma_{% \alpha}^{2}\right]}\right)italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_out end_POSTSUPERSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT ) = italic_A roman_exp ( - divide start_ARG italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 [ italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] end_ARG ) (6)

where the dispersion of the outlier distribution σoutsubscript𝜎out\sigma_{\mathrm{out}}italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT is treated as an additional model parameter. We add the measurement uncertainty in quadrature to the dispersion of the underlying outlier distribution as these quantities are independent, however σασoutmuch-less-thansubscript𝜎𝛼subscript𝜎out\sigma_{\alpha}\ll\sigma_{\mathrm{out}}italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ≪ italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT so this addition has little impact. Furthermore, due to the high quality cuts imposed in Sec. 2, we expect a small degree of outlier contamination in our sample. We again normalize this distribution over the data region [vmin,]subscript𝑣min[v_{\mathrm{min}},\infty][ italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , ∞ ], finding

A1=π2σout2+σα2erfc(vmin2σout2+σα2).superscript𝐴1𝜋2superscriptsubscript𝜎out2superscriptsubscript𝜎𝛼2erfcsubscript𝑣2superscriptsubscript𝜎out2superscriptsubscript𝜎𝛼2A^{-1}=\sqrt{\frac{\pi}{2}}\sqrt{\sigma_{\mathrm{out}}^{2}+\sigma_{\alpha}^{2}% }\,\mathrm{erfc}\left(\frac{v_{\min}}{\sqrt{2}}\sqrt{\sigma_{\mathrm{out}}^{2}% +\sigma_{\alpha}^{2}}\right).italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = square-root start_ARG divide start_ARG italic_π end_ARG start_ARG 2 end_ARG end_ARG square-root start_ARG italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG roman_erfc ( divide start_ARG italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG 2 end_ARG end_ARG square-root start_ARG italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) . (7)

3.3 Parameter estimation

For a bound kinematic structure modeled by the distribution g(vθ)𝑔conditional𝑣𝜃g(v\mid\mathbf{\theta})italic_g ( italic_v ∣ italic_θ ) and an outlier component modeled by Eq. 6, the likelihood for a star α𝛼\alphaitalic_α with speed vαsubscript𝑣𝛼v_{\alpha}italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT and measurement uncertainty σαsubscript𝜎𝛼\sigma_{\alpha}italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT to be drawn from the combined distribution is

α=(1fout)pα(vαθ)+foutpαout(vασout)subscript𝛼1subscript𝑓outsubscript𝑝𝛼conditionalsubscript𝑣𝛼𝜃subscript𝑓outsuperscriptsubscript𝑝𝛼outconditionalsubscript𝑣𝛼subscript𝜎out\mathcal{L}_{\alpha}=(1-f_{\mathrm{out}})\,p_{\alpha}\left(v_{\alpha}\mid% \mathbf{\theta}\right)+f_{\mathrm{out}}\,p_{\alpha}^{\mathrm{out}}\left(v_{% \alpha}\mid\sigma_{\mathrm{out}}\right)caligraphic_L start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT = ( 1 - italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT ) italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_θ ) + italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_out end_POSTSUPERSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT ) (8)

where foutsubscript𝑓outf_{\mathrm{out}}italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT is the fraction of the integrated distribution contributed by the outlier model, and the probability pαsubscript𝑝𝛼p_{\alpha}italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT implicitly depends on the model choice g(vθ)𝑔conditional𝑣𝜃g(v\mid\mathbf{\theta})italic_g ( italic_v ∣ italic_θ ). Had we instead wished to model a combination of Q𝑄Qitalic_Q different bound components each with speed distribution gi(vθi)subscript𝑔𝑖conditional𝑣subscript𝜃𝑖g_{i}(v\mid\mathbf{\theta}_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_v ∣ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and a parameter fisubscript𝑓𝑖f_{i}italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT describing the fractional contribution of that component to the total distribution, then the likelihood for a single star would take the form

αQ=(1fout)[i=1Qfipαi(vαθi)]+foutpαout(vασout)superscriptsubscript𝛼𝑄1subscript𝑓outdelimited-[]superscriptsubscript𝑖1𝑄subscript𝑓𝑖superscriptsubscript𝑝𝛼𝑖conditionalsubscript𝑣𝛼subscript𝜃𝑖subscript𝑓outsuperscriptsubscript𝑝𝛼outconditionalsubscript𝑣𝛼subscript𝜎out\mathcal{L}_{\alpha}^{Q}=(1-f_{\mathrm{out}})\,\left[\sum_{i=1}^{Q}f_{i}\,p_{% \alpha}^{i}\left(v_{\alpha}\mid\mathbf{\theta}_{i}\right)\right]+f_{\mathrm{% out}}\,p_{\alpha}^{\mathrm{out}}\left(v_{\alpha}\mid\sigma_{\mathrm{out}}\right)caligraphic_L start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT = ( 1 - italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT ) [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ] + italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_out end_POSTSUPERSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT ) (9)

where pαisuperscriptsubscript𝑝𝛼𝑖p_{\alpha}^{i}italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT denotes the probability distribution in Eq. 3, reducing to gi(vθi)subscript𝑔𝑖conditional𝑣subscript𝜃𝑖g_{i}(v\mid\mathbf{\theta}_{i})italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_v ∣ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) in the limit of vanishing relative speed uncertainties. In this notation, the 2 and 3-component power law models of Necib & Lin (2022b, a) would have Q=2𝑄2Q=2italic_Q = 2 or 3333 and each gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT described by the single power law Eq. 1 with unique slope kisubscript𝑘𝑖k_{i}italic_k start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT but a common escape velocity vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT.

For a set of N𝑁Nitalic_N stars observed with speeds and Gaussian uncertainties on those speeds {(vα,σα)}α=1,,Nsubscriptsubscript𝑣𝛼subscript𝜎𝛼𝛼1𝑁\{(v_{\alpha},\sigma_{\alpha})\}_{\alpha=1,\dots,N}{ ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ) } start_POSTSUBSCRIPT italic_α = 1 , … , italic_N end_POSTSUBSCRIPT, and a chosen set of model parameters θ𝜃\mathbf{\theta}italic_θ (or {θi}i1,,Qsubscriptsubscript𝜃𝑖𝑖1𝑄\{\theta_{i}\}_{i\in 1,\dots,Q}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ 1 , … , italic_Q end_POSTSUBSCRIPT), the log-likelihood would be

log=α=1Nlogα.superscriptsubscript𝛼1𝑁subscript𝛼\log\mathcal{L}=\sum_{\alpha=1}^{N}\log\mathcal{L}_{\alpha}.roman_log caligraphic_L = ∑ start_POSTSUBSCRIPT italic_α = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_log caligraphic_L start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT . (10)

The goal of this analysis is to estimate the posterior distribution of the parameters σout,foutsubscript𝜎outsubscript𝑓out\sigma_{\rm{out}},f_{\rm{out}}italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT, and θ𝜃\mathbf{\theta}italic_θ (or {θi}i1Qsubscriptsubscript𝜃𝑖𝑖1𝑄\{\theta_{i}\}_{i\in 1\dots Q}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i ∈ 1 … italic_Q end_POSTSUBSCRIPT), focusing in particular on the parameter vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. In practice we utilize an implementation222Code is provided at https://github.com/CianMRoche/MCJulia.jl, and is a modified version of an earlier implementation available at https://github.com/mktranstrum/MCJulia.jl. of the affine-invariant Markov chain Monte Carlo algorithm of Goodman & Weare (2010) since it is more efficient in skewed parameter spaces, and work in the julia programming language (Bezanson et al., 2017), primarily to reduce computational time.

3.4 Model selection

The above parameter estimation procedure will be performed for different models of the bound component(s), each detailed in the next section, and we will compare the goodness of fit of these models in part via the Akaike information criterion (AIC; Akaike (1974)), defined as

AIC=2s2log(max)AIC2s2subscriptmax\rm{AIC}=2s-2\log(\mathcal{L}_{\rm{max}})roman_AIC = 2 roman_s - 2 roman_log ( caligraphic_L start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) (11)

where s𝑠sitalic_s is the number of model parameters (for one bound component, it is the number of elements in the vector θ𝜃\mathbf{\theta}italic_θ plus an additional 2 outlier parameters) and log(max)subscriptmax\log(\mathcal{L}_{\rm{max}})roman_log ( caligraphic_L start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) is the maximum of the log-likelihood of the model for a given set of data. To compare models A𝐴Aitalic_A and B𝐵Bitalic_B one may calculate the AICAIC\rm{AIC}roman_AIC for each model. Then ΔAICAICBAICAΔAICsubscriptAICBsubscriptAICA\Delta\rm{AIC}\equiv\rm{AIC}_{B}-\rm{AIC}_{A}roman_Δ roman_AIC ≡ roman_AIC start_POSTSUBSCRIPT roman_B end_POSTSUBSCRIPT - roman_AIC start_POSTSUBSCRIPT roman_A end_POSTSUBSCRIPT positive implies model A𝐴Aitalic_A is preferred, and conversely ΔAICΔAIC\Delta\rm{AIC}roman_Δ roman_AIC negative implies model B𝐵Bitalic_B is preferred.

4 Modeling the Tail

We now examine the different models g(vθ)𝑔conditional𝑣𝜃g(v\mid\mathbf{\theta})italic_g ( italic_v ∣ italic_θ ) that have been used in the literature to describe bound kinematic components in the tail of the local speed distribution. We summarize the limitations and prior choices of the single power law model in Sec. 4.1, then turn to recent developments with multiple power law components in Sec. 4.2. These ameliorate some of the challenges in using single component power laws, but still have remaining limitations, as discussed in Sec. 4.3. We then introduce an alternative more robust functional form which addresses these issues in Sec. 4.4. We summarize the priors used for all models in this paper in Sec. 4.5.

4.1 Single power law (1PL)

The model proposed by Leonard & Tremaine (1990) is that of a single power law,333Note that when originally proposed, this model was not used with an outlier distribution. such that the tail of the speed distribution of a collection of stars at similar Galactocentric radius follows

g(vvesc,k)(vescv)k.proportional-to𝑔conditional𝑣subscript𝑣esc𝑘superscriptsubscript𝑣esc𝑣𝑘g(v\mid v_{\mathrm{esc}},k)\propto(v_{\mathrm{esc}}-v)^{k}.italic_g ( italic_v ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k ) ∝ ( italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT - italic_v ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT . (12)

This distribution can be understood in the following way: Assuming that Jeans theorem (Jeans, 1915) holds, and that the velocities are isotropic, the asymptotic distribution function of the speeds of the stars is g(vϵ,k)ϵkproportional-to𝑔conditional𝑣italic-ϵ𝑘superscriptitalic-ϵ𝑘g(v\mid\epsilon,k)\propto\epsilon^{k}italic_g ( italic_v ∣ italic_ϵ , italic_k ) ∝ italic_ϵ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT for some index k𝑘kitalic_k where ϵ=(Φ+v2/2)italic-ϵΦsuperscript𝑣22\epsilon=-(\Phi+v^{2}/2)italic_ϵ = - ( roman_Φ + italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 ) is the total (potential + kinetic) energy (Kochanek, 1996; Smith et al., 2007). We can identify ΦΦ\Phiroman_Φ with vesc2/2superscriptsubscript𝑣esc22-v_{\rm{esc}}^{2}/2- italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 yielding

g(vvesc,k)(vesc2v2)k.proportional-to𝑔conditional𝑣subscript𝑣esc𝑘superscriptsuperscriptsubscript𝑣esc2superscript𝑣2𝑘g(v\mid v_{\mathrm{esc}},k)\propto(v_{\mathrm{esc}}^{2}-v^{2})^{k}.italic_g ( italic_v ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k ) ∝ ( italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_v start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT . (13)

The behavior of this distribution for v𝑣vitalic_v close to vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT is then approximated by Eq. 12. Therefore, this single power law model of Leonard & Tremaine (1990) can be understood as the vvesc𝑣subscript𝑣escv\rightarrow v_{\rm{esc}}italic_v → italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT limit of the speed distribution of an isotropic system for which Jeans’ theorem holds. Eq. 12 is the distribution if both radial (line-of-sight) and tangential (on-sky) velocities are known. If only radial velocities are used, the power law is modified to kr=k+1subscript𝑘𝑟𝑘1k_{r}=k+1italic_k start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = italic_k + 1, an approach taken in Piffl et al. (2014). If only tangential velocities are known, then kt=k+12subscript𝑘𝑡𝑘12k_{t}=k+\frac{1}{2}italic_k start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_k + divide start_ARG 1 end_ARG start_ARG 2 end_ARG (Monari et al., 2018; Koppelman & Helmi, 2021).

For analyses using the power-law model, vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT is highly correlated with the slope parameter k𝑘kitalic_k, leading to large uncertainties on the resulting escape velocity estimates. To avoid this, priors of various ranges and shapes have been placed on k𝑘kitalic_k with various motivations, which we now summarize briefly.

Leonard & Tremaine (1990) argues for k[1,2]𝑘12k\in[1,2]italic_k ∈ [ 1 , 2 ], with k=1𝑘1k=1italic_k = 1 for a collisionally relaxed system and k=1.5𝑘1.5k=1.5italic_k = 1.5 for an isolated system that has undergone violent relaxation, and Kochanek (1996) adopts k[0.5,2.5]𝑘0.52.5k\in[0.5,2.5]italic_k ∈ [ 0.5 , 2.5 ] to widen the previously proposed prior range. There have also been many prior ranges motivated by different suites of cosmological simulations: Smith et al. (2007) adopts k[2.7,4.7]𝑘2.74.7k\in[2.7,4.7]italic_k ∈ [ 2.7 , 4.7 ] based on simulations outlined in Abadi et al. (2006), Piffl et al. (2014) and Monari et al. (2018) argue for k[2.3,3.7]𝑘2.33.7k\in[2.3,3.7]italic_k ∈ [ 2.3 , 3.7 ] based on the Aquarius suite (Springel et al., 2008; Scannapieco et al., 2009), and Deason et al. (2019) investigates the signature of GSE-like mergers in the Auriga suite (Grand et al., 2017) and concludes that an appropriate range is k[1,2.5]𝑘12.5k\in[1,2.5]italic_k ∈ [ 1 , 2.5 ].

Furthermore, the shape of the prior distribution for both vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and k𝑘kitalic_k varies across these works. For example a uniform prior in log(vesc)subscript𝑣esc\log(v_{\rm{esc}})roman_log ( italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ) is often (but not always) chosen to favor lower values of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, and Smith et al. (2007) adopts priors in vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and k𝑘kitalic_k derived from Jeffreys’ rules (Jeffreys, 1961). In Koppelman & Helmi (2021) a local value of k𝑘kitalic_k is obtained and its posterior treated as a prior distribution for determining vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT at other Galactocentric radii. Since the escape velocity is highly correlated with the k𝑘kitalic_k parameter, the chosen prior distribution can strongly influence the escape velocity estimate.

Refer to caption
Figure 2: Shape comparison of the single power law model (1PL), two-component power law model (2PL) and the stretched exponential power law model (SEPL). All models shown have vesc=500kms1subscript𝑣esc500kmsuperscripts1v_{\rm{esc}}=500\,\rm{km}\,\rm{s^{-1}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT = 500 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. In the 1PL model, the power law index k𝑘kitalic_k describes both the steepness of the distribution away from vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and the shape of the cutoff near vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, leading to highly correlated fits for vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and k𝑘kitalic_k. The 2PL model mitigates this with the introduction of an additional power law-component with index kSsubscript𝑘𝑆k_{S}italic_k start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT. In the SEPL model, the parameter β𝛽\betaitalic_β controls the exponential rise of the distribution away from vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT while the behavior near vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT is a power law with index γ=1𝛾1\gamma=1italic_γ = 1.

In these works, and as described in Sec. 3, it has also been necessary to set some lower bound on the speed vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT in order to model only the tail of the distribution. vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT is typically chosen close to 300kms1300kmsuperscripts1300\,\rm{km\,s^{-1}}300 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (but lower in the pre-Gaia era due to limited statistics) to balance the statistical constraining power of the sample and the fact that Eq. 12 is expected to model the tail of the distribution only. The escape velocities obtained by these analyses are typically of the order 500kms1500kmsuperscripts1500\,\rm{km\,s^{-1}}500 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, and it is not clear that Eq. 12 should hold over such a large range of speeds, due to the presence of kinematic substructure, disk contamination, and the fact that this expression is obtained perturbatively in a neighborhood of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. Some of these issues were addressed in Koppelman & Helmi (2021) and Necib & Lin (2022a) by implementing aggressive cuts on disk stars, and furthermore in Necib & Lin (2022a) by repeating the modeling at different vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, where it was demonstrated that the 1PL modeling was not stable to the choice of vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT.

Since there exist different choices for priors on k𝑘kitalic_k and for vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT which affect the resulting vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, the 1PL model may be inappropriate for robust estimation of escape velocities. When strict priors on k𝑘kitalic_k are not employed, the resulting escape velocities are highly uncertain due to the strong parameter correlations. As a result, alternative models have been investigated; this is the subject of the following Section.

4.2 Multiple power law components

The Milky Way contains kinematic substructure such as the GSE (Helmi et al., 2018; Belokurov et al., 2018), and this substructure comprises a significant portion of the speed distribution (Necib et al., 2019). It was demonstrated in Grand et al. (2019); Necib & Lin (2022b) that the presence of kinematic substructures such as the GSE may significantly bias the determination of the escape velocity, in particular if using the single power law model (Eq. 12) with a low vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT such as 300kms1300kmsuperscripts1300\,\rm{km\,s^{-1}}300 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.

Motivated by these considerations, Necib & Lin (2022b) proposed to model the local speed distribution as a sum of multiple bound kinematic components, each described by Eq. 12 with independent slope but common vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, allowing for more robust estimates of the escape velocity. Via Eq. 9, the likelihood for a single star α𝛼\alphaitalic_α assuming a sum of two power law components with common vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT takes the form

α2=superscriptsubscript𝛼2absent\displaystyle\mathcal{L}_{\alpha}^{2}=caligraphic_L start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = (1fout)[(1fS)pα(vαvesc,k)\displaystyle\,(1-f_{\mathrm{out}})\,\big{[}(1-f_{S})\,p_{\alpha}\left(v_{% \alpha}\mid v_{\mathrm{esc}},k\right)( 1 - italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT ) [ ( 1 - italic_f start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k ) (14)
+fSpα(vαvesc,kS)]+foutpαout(vασout)\displaystyle+f_{S}\,p_{\alpha}\left(v_{\alpha}\mid v_{\mathrm{esc}},k_{S}% \right)\big{]}+f_{\mathrm{out}}\,p_{\alpha}^{\mathrm{out}}\left(v_{\alpha}\mid% \sigma_{\mathrm{out}}\right)+ italic_f start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) ] + italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_out end_POSTSUPERSCRIPT ( italic_v start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ∣ italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT )

where each pαsubscript𝑝𝛼p_{\alpha}italic_p start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT implicitly uses the simple power law model Eq. 12. kSsubscript𝑘𝑆k_{S}italic_k start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT and fSsubscript𝑓𝑆f_{S}italic_f start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT are the power law slope and fractional contribution of the substructure component, respectively. Hereafter we will often refer to the single power law and 2-component power law models as “1PL” and “2PL”, respectively.

Applying this approach to the Gaia eDR3 dataset, Necib & Lin (2022a) found that at low vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT (320kms1similar-toabsent320kmsuperscripts1\sim 320\,\rm{km\,s^{-1}}∼ 320 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT) a two-component model was largely favored over a single bound component, whereas at vmin360kms1greater-than-or-equivalent-tosubscript𝑣min360kmsuperscripts1v_{\rm{min}}\gtrsim 360\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ≳ 360 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT the single component model was slightly preferred, but the different models were not meaningfully distinguishable via the AIC. The three-component model was also found not to be distinguishable from the two-component, and as a result in this article we consider at most 2 bound components.

4.3 Limitations of the 2PL model

While Necib & Lin (2022a) found that the 2PL model provided a better fit to the Gaia eDR3 data, it remained the case that high values of k𝑘kitalic_k were preferred for one of the bound components, and even with a high prior limit of k=20𝑘20k=20italic_k = 20 the marginal distribution for k𝑘kitalic_k remained unconverged. The 3-component model also exhibited the same convergence issues, suggesting that the source of the problem is a systematic effect of the modeling close to vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, where there is a steeply rising distribution of stars.

In Fig. 2, we illustrate the 1PL and 2PL models for fixed vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. In particular, we show how the various slope parameters affect the resulting speed distributions. Here one can see that the 1PL model would be heavily constrained by a steep distribution close to vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT. This would favor high k𝑘kitalic_k and thus strongly inform the vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT estimate. The 2PL model takes a step towards describing these separate parts of the distribution near vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT and vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT by allowing for different power law behaviors. However, its flexibility saturates at about k15similar-to-or-equals𝑘15k\simeq 15italic_k ≃ 15, indicating that continuing to increase the upper prior limit further will not be sufficient.

Assuming the problem lies only in disk contamination or the inability to model substructure, one approach would be to increase vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT until this steeply rising feature disappears or until the distribution is indistinguishable from the 1PL fit, as made precise by the AIC (Eq. 11). However, if we wish to apply this analysis across many Galactocentric radii, this would require the tuning of vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT in each radial bin based on some arbitrary convergence or stability criterion. One could replace the vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT cut by only using the fastest N𝑁Nitalic_N stars in each radial bin as in Koppelman & Helmi (2021), but the choice of an appropriate N𝑁Nitalic_N is again challenging. As discussed earlier, the number of stars in each radial bin is affected by the selection function and quality cuts such as parallax error, with far fewer stars in the 45kpc45kpc4-5\,\rm{kpc}4 - 5 roman_kpc bin compared to the 78kpc78kpc7-8\,\rm{kpc}7 - 8 roman_kpc bin, for example. Since the statistics vary greatly across Galactocentric radii, N𝑁Nitalic_N would again require a tuning in each bin.

Even with a high vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, both the 1PL and 2PL models can estimate potentially unreliable escape velocities past the data region (i.e. much faster than the fastest star observed) depending on the data features. For example, it is observed in Koppelman & Helmi (2021) that beyond the solar position, escape velocity estimates based on the 1PL model curiously rise with Galactocentric radius, contrary to the expected shape in realistic potentials. The speed distributions are reported to become steeper with Galactocentric radius, and since vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT is positively correlated with the slope k𝑘kitalic_k, this leads to higher vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. This feature suggests some radially-dependent effect in the shape of the distribution which is not well-modeled by the single power law and is not accounted for by the statistical uncertainties of parameter estimation.

Given these limitations, it is thus desirable to develop a model that can both closely track the cutoff of a continuous kinematic distribution (i.e., the fastest non-outlier star, accounting for measurement errors) and account for the steeply rising distribution at lower speeds, without tuning vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT.

4.4 Stretched exponential power law (SEPL)

We now introduce a new functional form, motivated by the challenges of fitting the data to a single or two-component power law model. In both cases, the data tends to be fit by increasingly high power law slopes k𝑘kitalic_k unless a prior range or strict vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT are set. These steep power laws can lead to unreliable extrapolation past the data region and large vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT estimates. The fundamental reason is that the steepness of the distribution is correlated with the shape of the cutoff near vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT in power law distributions. This indicates the need for a model where the steep rise in the distribution away from vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT is not tied to the shape of the cutoff close to vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT.

We use these facts as motivation to introduce a model that grows exponentially away from vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, but maintains the characteristics of a power law close to vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. This is similar to the motivation for the 2PL model, but allows for a steeper rise at lower speeds. A general distribution satisfying these requirements is

g(vθ)(vescv)γexp[(vescvτ)β].proportional-to𝑔conditional𝑣𝜃superscriptsubscript𝑣esc𝑣𝛾superscriptsubscript𝑣esc𝑣𝜏𝛽g(v\mid\mathbf{\theta})\propto(v_{\mathrm{esc}}-v)^{\gamma}\exp\left[\left(% \frac{v_{\mathrm{esc}}-v}{\tau}\right)^{\beta}\right].italic_g ( italic_v ∣ italic_θ ) ∝ ( italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT - italic_v ) start_POSTSUPERSCRIPT italic_γ end_POSTSUPERSCRIPT roman_exp [ ( divide start_ARG italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT - italic_v end_ARG start_ARG italic_τ end_ARG ) start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT ] . (15)

Close to vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, this distribution rises as a stretched exponential with index β𝛽\betaitalic_β and scale τ𝜏\tauitalic_τ. Very close to vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, this distribution gives a power law with index γ𝛾\gammaitalic_γ, with lower γ𝛾\gammaitalic_γ giving a more sharp cutoff at vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. Since we wish to avoid extrapolation of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT beyond the data region, we find that choosing a low γ=1𝛾1\gamma=1italic_γ = 1 gives results for vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT that most closely track the fastest non-outlier star in the data region. We refer to this model with γ=1𝛾1\gamma=1italic_γ = 1 as a “stretched exponential power law”444The stretched exponential usually is written with a negative sign in the exponent, but we wish to model an exponential rise toward vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT away from vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, and so we make use of this name in a slightly non-standard way. (SEPL), with the form

g(vθ)(vescv)exp[(vescvτ)β],proportional-to𝑔conditional𝑣𝜃subscript𝑣esc𝑣superscriptsubscript𝑣esc𝑣𝜏𝛽g(v\mid\mathbf{\theta})\propto(v_{\mathrm{esc}}-v)\exp\left[\left(\frac{v_{% \mathrm{esc}}-v}{\tau}\right)^{\beta}\right],italic_g ( italic_v ∣ italic_θ ) ∝ ( italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT - italic_v ) roman_exp [ ( divide start_ARG italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT - italic_v end_ARG start_ARG italic_τ end_ARG ) start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT ] , (16)

where vesc,β,subscript𝑣esc𝛽v_{\mathrm{esc}},\beta,italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT , italic_β , and the scale τ𝜏\tauitalic_τ are treated as free parameters. Here β𝛽\betaitalic_β and τ𝜏\tauitalic_τ have limited impact on the precise shape of the cutoff at speeds very close to vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, but allow for greater flexibility in describing the data at speeds where statistics are greatest. The choice of γ=1𝛾1\gamma=1italic_γ = 1 in Eq. 16 is further validated in Sec. 5 by repeating the analysis for γ{1,2,3}𝛾123\gamma\in\{1,2,3\}italic_γ ∈ { 1 , 2 , 3 } and additionally with γ𝛾\gammaitalic_γ as a free parameter with a wide uniform prior.

We demonstrate the shape of the SEPL model and compare it to the power law models in Fig. 2. Here we show a fixed scale τ𝜏\tauitalic_τ, although when fitting we leave this scale as a free parameter. The SEPL model mimics a valuable feature of the 2-component model, which is to describe the tail as the transition between two distinct behaviors (here exponential to power law), but more readily accounts for the steep rise, and ideally further reduces the correlation of any improperly converged parameters with vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT.

4.5 Priors

Since the choice of priors in the power law models is a subject of significant uncertainty, we adopt wide uniform priors on all parameters of the bound component models, and in line with Necib & Lin (2022b), we choose uniform priors on the logarithm of the outlier distribution parameters. A summary is given in Table 1. The prior limits of the power law models are also chosen in line with Necib & Lin (2022b), where we maintain the choice that k𝑘kitalic_k should label the behavior close to vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT in all cases, but remain agnostic to the interpretation of each component.

The prior limits in the case of the SEPL model are chosen such that any combination yields a distribution that can be reliably normalized according to Eq. 4; the exponential nature can cause numerical overflow issues in some parts of the parameter space, though this does not represent any unphysical nature of the modeling. As a result, note that the maximum of the uniform escape velocity prior is lower than that of the power law models, but this upper limit is still significantly higher than existing estimates of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and is never informative for the resulting posterior distributions in this paper.

Table 1: Prior distributions for the power law and stretched exponential power law models. The notation 𝒰(a,b)𝒰𝑎𝑏\mathcal{U}(a,b)caligraphic_U ( italic_a , italic_b ) refers to a uniform prior distribution from a𝑎aitalic_a to b𝑏bitalic_b and 𝒰log(c,d)subscript𝒰log𝑐𝑑\mathcal{U}_{\,\rm{log}}(c,d)caligraphic_U start_POSTSUBSCRIPT roman_log end_POSTSUBSCRIPT ( italic_c , italic_d ) refers to a uniform prior in the log of that parameter from c𝑐citalic_c to d𝑑ditalic_d.
Model Parameter Prior
1-component vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT 𝒰(0,1000)𝒰01000\mathcal{U}(0,1000)caligraphic_U ( 0 , 1000 )
power law (1PL) k𝑘kitalic_k 𝒰(0.1,20.0)𝒰0.120.0\mathcal{U}(0.1,20.0)caligraphic_U ( 0.1 , 20.0 )
2-component vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT 𝒰(0,1000)𝒰01000\mathcal{U}(0,1000)caligraphic_U ( 0 , 1000 )
power law (2PL) k𝑘kitalic_k 𝒰(0.1,20.0)𝒰0.120.0\mathcal{U}(0.1,20.0)caligraphic_U ( 0.1 , 20.0 )
kSsubscript𝑘𝑆k_{S}italic_k start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT 𝒰(0.1,k)𝒰0.1𝑘\mathcal{U}(0.1,k)caligraphic_U ( 0.1 , italic_k )
fSsubscript𝑓𝑆f_{S}italic_f start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT 𝒰(0,1)𝒰01\mathcal{U}(0,1)caligraphic_U ( 0 , 1 )
Stretched exponential vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT 𝒰(0,700)𝒰0700\mathcal{U}(0,700)caligraphic_U ( 0 , 700 )
power law (SEPL) β𝛽\betaitalic_β 𝒰(1,3.5)𝒰13.5\mathcal{U}(1,3.5)caligraphic_U ( 1 , 3.5 )
τ𝜏\tauitalic_τ 𝒰(80,1000)𝒰801000\mathcal{U}(80,1000)caligraphic_U ( 80 , 1000 )
Outlier model foutsubscript𝑓outf_{\rm{out}}italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT 𝒰log(106,1)subscript𝒰logsuperscript1061\mathcal{U}_{\,\rm{log}}(10^{-6},1)caligraphic_U start_POSTSUBSCRIPT roman_log end_POSTSUBSCRIPT ( 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT , 1 )
σoutsubscript𝜎out\sigma_{\rm{out}}italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT 𝒰log(600,3000)subscript𝒰log6003000\mathcal{U}_{\,\rm{log}}(600,3000)caligraphic_U start_POSTSUBSCRIPT roman_log end_POSTSUBSCRIPT ( 600 , 3000 )

5 Escape velocity results

Refer to caption
Figure 3: Comparison of fits to high-speed distribution of stars using the 1PL, 2PL and SEPL models, shown for a representative radial bin of 89kpc89kpc8-9\,\rm{kpc}8 - 9 roman_kpc and at a vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT of 310kms1310kmsuperscripts1310\,\rm{km\,s^{-1}}310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. The fits at all Galactocentric radii can be found in the appendix. The arrow in the 1PL panel indicates that the escape velocity and its 68%percent6868\%68 % confidence interval are very large and not visible on these axes. The vertical position of the vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT markers does not encode any information.

We now apply the fitting procedure outlined in Sec. 3 to the high-quality DR3 dataset of Sec. 2, using the 1PL, 2PL and SEPL models of Sec. 4. We perform the fit for each model in 1kpc1kpc1\rm{kpc}1 roman_k roman_p roman_c radial bins from 411kpc411kpc4-11\,\rm{kpc}4 - 11 roman_kpc.

In Figure 3, we show the data and the best-fit speed distribution for each model, taking a representative radial bin of 89kpc89kpc8-9\,\rm{kpc}8 - 9 roman_kpc and vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT of 310kms1310kmsuperscripts1310\,\rm{km\,s^{-1}}310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. The fits at all Galactocentric radii can be found in the Appendix, Fig. A2. The corner plots corresponding to the fits at 89kpc89kpc8-9\,\rm{kpc}8 - 9 roman_kpc are also available in the Appendix (Figs. A3, A4, A5).

In Figure 3, it can also be seen that the 1PL model results in a best-fit vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT which is well beyond the fastest star in the sample. Here the shape of the distribution at lower speeds is dictating the behavior of the high-speed tail, with an unreliable extrapolation of the distribution’s shape to speeds where there is no data. The best fits for the 2PL model perform better, since the additional model parameters allow for more flexibility in modeling both the lower-speed and fastest stars. However, the 2PL model can also yield a vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT estimate far from the data region, relying again on strong assumptions about the shape of the distribution at high speeds. This effect is most pronounced for the power law models at large Galactocentric radius, as seen in Fig. A2. Finally, the SEPL fits exhibit sharper cutoffs near vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, while simultaneously describing the steep rise of the distribution close to vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT. The SEPL escape velocity estimate thus relies less on extrapolation of the speed distribution, and is very close to the observed cutoff in the distribution. The advantages of using the SEPL are discussed in more detail in Sec. 5.1.

Refer to caption
Figure 4: (Top) The escape velocity profile of the high-quality Gaia DR3 sample as obtained via the single power law (1PL), 2-component power law (2PL) and stretched exponential power law (SEPL) models, each at vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. (Bottom left) The stability of each model’s escape velocity estimate with increasing vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, shown here for the representative radial bin of 78kpc78kpc7-8\,\rm{kpc}7 - 8 roman_kpc. (Bottom right) Repeating the analysis with the SEPL model using different power law exponents γ𝛾\gammaitalic_γ at high-speed extreme, including the case in which γ𝛾\gammaitalic_γ is treated as a fit parameter. All bands correspond to 68%percent6868\%68 % confidence intervals.

These features are also apparent when comparing the escape velocity profile of each model, as in the top panel of Fig. 4. The 1PL model confidently overestimates the escape velocities, and increases unphysically with Galactocentric radius from 69kpc69kpc6-9\,\rm{kpc}6 - 9 roman_kpc. The 2PL model estimates lower escape velocities and relies less on an unreliable extrapolation beyond the data region, at least at small Galactocentric radius. However, at and beyond the Solar position, the 2PL fit produces similar results to the 1PL model and the escape velocity estimates become poorly constrained. The SEPL fitting typically results in the lowest estimates for the escape velocity, as the distribution can exhibit a sharper cutoff at the edge of the data region. Unlike the profiles of the power law models, the SEPL profile is consistent with monotonically decreasing with radius, which is a necessary feature of a physical escape velocity profile. The SEPL escape velocities are most uncertain when it is challenging to distinguish between bound and outlier distributions as in the 56kpc56kpc5-6\,\rm{kpc}5 - 6 roman_kpc bin, and when statistics degrade as at high Galactocentric radius.

5.1 Advantages of the SEPL model

Using the AIC (Eq. 11) to compare the different models reveals that the SEPL and 2PL are not statistically distinguishable (|ΔAIC|5less-than-or-similar-toΔAIC5|\Delta\rm{AIC}|\lesssim 5| roman_Δ roman_AIC | ≲ 5) and that the 1PL is strongly disfavored (|ΔAIC|10100similar-to-or-equalsΔAIC10100|\Delta\rm{AIC}|\simeq 10-100| roman_Δ roman_AIC | ≃ 10 - 100). Despite the fact that the SEPL and 2PL have similar measures of goodness-of-fit, there are a number of reasons to favor the SEPL as our fiducial result. We discuss below the advantages of using the SEPL model, but note that in our final results for MW halo properties we will also consider vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT obtained with 2PL model to illustrate the model dependence.

As discussed above, the SEPL model gives results for vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT that rely less on extrapolation beyond the data region. This is preferable to the overestimation seen in the power law models (on the order of hundreds of kms1kmsuperscripts1\rm{km\,s^{-1}}roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT at high Galactocentric radius) as it is an approach informed more by the data than assumptions about the tail shape. In light of this discussion, one might ask whether it is appropriate use the speed of the fastest star as an estimator for vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. The difficulty of this approach is that it does not account for outliers or statistical fluctuations near the tail, and degrades significantly in low statistics regimes such as at high Galactocentric radius. Using the SEPL model with a cutoff in the distribution accounts for these effects, and results in larger uncertainties on vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT in radial bins with limited statistics or outliers.

In addition, using the SEPL model mitigates one of the issues seen in power law models: the power law slope that governs the low-speed behavior, k𝑘kitalic_k, is often both unconverged and correlated with vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. In the Appendix, Fig. A6 shows the degree to which the model parameters of the 1PL, 2PL and SEPL are correlated with vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT as a function of Galactocentric radius. In both the 1PL and 2PL models, k𝑘kitalic_k exhibits a high correlation coefficient with vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, and it can be seen in the representative corner plots Fig. A3 and A4 that this parameter is unconverged even with a large prior range. This suggests that the marginal distribution for vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT is in fact not reliable in this case, as it is correlated with an unconverged parameter. We find the β𝛽\betaitalic_β parameter in the SEPL model is sometimes similarly unconverged, in part because the upper prior limit is set to avoid numerical overflow. However, it is uncorrelated with the escape velocity and thus does not pose the same problem as the power law models. There is one radial bin (56kpc56kpc5-6\,\rm{kpc}5 - 6 roman_kpc) in which there is meaningful correlation between β𝛽\betaitalic_β and vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, but this is accompanied by a large uncertainty on the vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT estimate of the SEPL, and so does not significantly bias results of DM halo parameter fitting in Sec. 6.

The lower left panel of Fig. 4 shows the result of increasing vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT for the three models considered in this paper. For the power law models, the results for vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT become highly uncertain at high vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT as the statistics of the sample degrade, with both models estimating an escape velocity 150kms1similar-toabsent150kmsuperscripts1\sim 150\,\rm{km\,s^{-1}}∼ 150 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT above the SEPL and low-vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT 2PL measurements. We find that the SEPL model gives results that are more stable to the choice of vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT than the power law models. This can be explained by the greater flexibility in the SEPL for describing the rise toward vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, which better describes the features observed in the data. Using the AIC to compare the models, the SEPL and 2PL results are indistinguishable at all vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT while all three models (1PL, 2PL, SEPL) become indistinguishable for high vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, consistent with expectations given the limited statistics.

We also consider the more general form of the SEPL, Eq. 15, with various choices of the power law index γ𝛾\gammaitalic_γ. The lower right panel of Fig. 4 shows fit results with different fixed γ=2,3𝛾23\gamma=2,3italic_γ = 2 , 3 as well as with γ𝛾\gammaitalic_γ as a free parameter. Larger γ𝛾\gammaitalic_γ values give distributions with shallower cutoffs in the tail, which tend to result in higher vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT values. The SEPL with γ𝛾\gammaitalic_γ a free parameter reproduces the behavior of the 2PL model close to vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, in particular rising and becoming more uncertain past the solar position. This feature could be due to the reduced number of stars above vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT in these bins, and the “overshooting” issue of the power law models discussed in Section 4.3. We use γ=1𝛾1\gamma=1italic_γ = 1 since it results in the lowest values of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT in this comparison, consistent with our desire to avoid extrapolating beyond the data region. Further reducing γ𝛾\gammaitalic_γ below 1 to give an even sharper cutoff would not substantially impact our results, since there are limited statistics to distinguish such low values of γ𝛾\gammaitalic_γ. For example, comparing the γ=1𝛾1\gamma=1italic_γ = 1 and γ=2𝛾2\gamma=2italic_γ = 2 cases we see these profiles are already largely consistent within 1σ1𝜎1\sigma1 italic_σ, but with the γ=2𝛾2\gamma=2italic_γ = 2 case systematically above γ=1𝛾1\gamma=1italic_γ = 1.

Lastly, we test the consistency of the escape velocity profile with an alternative radial binning (but the same bin width), offset by 0.5kpc0.5kpc0.5\,\rm{kpc}0.5 roman_kpc from the fiducial bins. This is shown in the Appendix Fig. A1 for both 2PL and SEPL models. The offset escape velocity profiles exhibit the same features as the fiducial profiles, confirming that bin membership uncertainty has a small effect on the resulting escape velocity results.

Refer to caption
Figure 5: (Top) stretched exponential power law (SEPL) and 2-component power law (2PL) escape velocity profiles along with comparable results from the literature. Bands and error bars correspond to 68%percent6868\%68 % confidence intervals. Horizontal error bars for isolated results represent the Galactocentric radius range of sources used for that result. The escape velocities shown from the literature do not include any corrections motivated by simulations. (Bottom) Best fit of the NFW dark matter halo to the escape velocity profile, using SEPL escape velocity profile and assuming a known baryonic content of the Milky Way. The NFW halo parameters M200cDMsuperscriptsubscriptM200cDM\rm{M}_{200c}^{\rm{DM}}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT and c200cDMsuperscriptsubscript𝑐200𝑐DMc_{200c}^{\rm{DM}}italic_c start_POSTSUBSCRIPT 200 italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT represent the virial mass and concentration of the dark matter halo. The SEPL profile is shown at 68%percent6868\%68 % (boxes) and 95%percent9595\%95 % (whiskers) confidence intervals. The baryon-only and NFW-only profiles show the escape velocity if the MW consisted of only those components, respectively.

5.2 Comparison with previous work

For our final results, we consider the 2PL and SEPL models with the fiducial radial bins, and a default power law index of γ=1𝛾1\gamma=1italic_γ = 1 for SEPL. We take vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT as the default because this maintains a large sample size while still remaining above the typically chosen 300kms1300kmsuperscripts1300\,\rm{km\,s^{-1}}300 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT to avoid disk contamination. Furthermore, as discussed above, the power law models perform less well at high vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT given the lower statistics.

In Figure 5, we present the comparison of the 2PL and SEPL escape velocity profiles with individual measurements and profiles from the literature. Our results are largely consistent with previous work obtaining vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT near the solar radius. For the Necib & Lin (2022a) result, the escape velocity is consistent within 1σ1𝜎1\sigma1 italic_σ for the SEPL measurement and 1.5σsimilar-toabsent1.5𝜎\sim 1.5\sigma∼ 1.5 italic_σ for the 2PL measurement, but the escape velocity found in this work is larger by 2030kms12030kmsuperscripts1~{}20-30\,\rm{km\,s^{-1}}20 - 30 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. This is likely higher due to the larger statistics of Gaia DR3 as compared to eDR3, allowing for stricter quality cuts and thus a more reliable discrimination between outlier and bound distributions, in addition to speed distributions which are filled closer to the escape velocity.

Compared to the measurements of Deason et al. (2019) and Koppelman & Helmi (2021) near the solar radius, our vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT estimates are consistent. Note that the values cannot be compared directly, since the results of Deason et al. (2019) and Koppelman & Helmi (2021) are obtained using much wider radius bins (horizontal bars), compared to our results in 1 kpc bins. The vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT parameter is known to be correlated with the fastest star in the distribution being fit, and if using a wide radial bin, one will likely preferentially fit the high-speed stars which are expected to come from the smallest radii. Examining the horizontal error bars of Fig. 5, it is plausible that the Deason et al. (2019) and Koppelman & Helmi (2021) measurements are consistent with 2PL or SEPL measurements at the lower radii of 6similar-toabsent6\sim 6∼ 6 kpc and 7similar-toabsent7\sim 7∼ 7 kpc, respectively.

Turning to the escape velocity profile, the key feature of the SEPL escape velocity profile is that it trends toward lower vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT at large Galactocentric radii, consistent with expectation. This is in contrast to fits with power law models, which exhibit a significant rise at and beyond the solar position. Our results with the 2PL model are similar to the profile of Koppelman & Helmi (2021), which exhibits approximately the same rise in vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT at large Galactocentric radii. Note that in Koppelman & Helmi (2021), in order to obtain what we label as the “inferred profile” (Fig. 5), first a fit is performed using stars within 2kpc2kpc2\,\rm{kpc}2 roman_kpc of the solar position. Then using the posterior for k𝑘kitalic_k in this local fit as a prior on k𝑘kitalic_k, escape velocity fits are obtained in 0.5kpc0.5kpc0.5\,\rm{kpc}0.5 roman_kpc bins from 411kpc411kpc4-11\,\rm{kpc}4 - 11 roman_kpc, giving an escape velocity profile assuming the power law slope is consistent across Galactocentric radii. We also show the profile of Monari et al. (2018), which was obtained using Gaia DR2 with independent fits to vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and k𝑘kitalic_k in radial bins, using a bootstrap method to account for uncertainties in bin membership and speed. It does not exhibit a significant trend due to the large uncertainties resulting from the relatively small statistics of Gaia DR2, but also gives high vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT estimates at large Galactocentric radii. These features suggest that power law modeling does not reliably describe the high-speed tail, at least at larger radii.

6 Milky Way Halo Constraints

Table 2: Chosen values for the modeling of different Milky Way components, for the purpose of obtaining a mass profile from an escape velocity profile. The notation 𝒰(a,b)𝒰𝑎𝑏\mathcal{U}(a,b)caligraphic_U ( italic_a , italic_b ) refers to a uniform prior distribution from a𝑎aitalic_a to b𝑏bitalic_b.
Component Model Parameter Symbol Value
Bulge Plummer profile Mass MbulgesubscriptMbulge\rm{M}_{\text{bulge}}roman_M start_POSTSUBSCRIPT bulge end_POSTSUBSCRIPT 1.067×1010M1.067superscript1010subscriptMdirect-product1.067\times 10^{10}\,\rm{M}_{\odot}1.067 × 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT
(Plummer, 1911) Scale radius b𝑏bitalic_b 0.3kpc0.3kpc0.3\,\rm{kpc}0.3 roman_kpc
Thin disk Miyamoto-Nagai profile Mass Mthin disksubscriptMthin disk\rm{M}_{\text{thin disk}}roman_M start_POSTSUBSCRIPT thin disk end_POSTSUBSCRIPT 3.944×1010M3.944superscript1010subscriptMdirect-product3.944\times 10^{10}\,\rm{M}_{\odot}3.944 × 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT
(Miyamoto & Nagai, 1975) Scale radius rthin disksubscript𝑟thin diskr_{\text{thin disk}}italic_r start_POSTSUBSCRIPT thin disk end_POSTSUBSCRIPT 5.3kpc5.3kpc5.3\,\rm{kpc}5.3 roman_kpc
Scale height zthin disksubscript𝑧thin diskz_{\text{thin disk}}italic_z start_POSTSUBSCRIPT thin disk end_POSTSUBSCRIPT 0.25kpc0.25kpc0.25\,\rm{kpc}0.25 roman_kpc
Thick disk Miyamoto-Nagai profile Mass Mthick disksubscriptMthick disk\rm{M}_{\text{thick disk}}roman_M start_POSTSUBSCRIPT thick disk end_POSTSUBSCRIPT 3.944×1010M3.944superscript1010subscriptMdirect-product3.944\times 10^{10}\,\rm{M}_{\odot}3.944 × 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT
(Miyamoto & Nagai, 1975) Scale radius rthick disksubscript𝑟thick diskr_{\text{thick disk}}italic_r start_POSTSUBSCRIPT thick disk end_POSTSUBSCRIPT 2.6kpc2.6kpc2.6\,\rm{kpc}2.6 roman_kpc
Scale height zthick disksubscript𝑧thick diskz_{\text{thick disk}}italic_z start_POSTSUBSCRIPT thick disk end_POSTSUBSCRIPT 0.8kpc0.8kpc0.8\,\rm{kpc}0.8 roman_kpc
DM halo NFW profile Mass M200cDMsuperscriptsubscriptM200cDM\rm{M}_{200c}^{\rm{DM}}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT 𝒰(1010,1013)M𝒰superscript1010superscript1013subscriptMdirect-product\mathcal{U}(10^{10},10^{13})\rm{M}_{\odot}caligraphic_U ( 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 13 end_POSTSUPERSCRIPT ) roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT
(Navarro et al., 1996) Concentration c200cDMsuperscriptsubscript𝑐200𝑐DMc_{200c}^{\rm{DM}}italic_c start_POSTSUBSCRIPT 200 italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT 𝒰(0.1,50.0)𝒰0.150.0\mathcal{U}(0.1,50.0)caligraphic_U ( 0.1 , 50.0 )

6.1 From escape velocity to dark matter

The escape velocity profile of an isolated, finite, and spherical mass distribution is related to its potential Φ(|r|)Φ𝑟\Phi(|\vec{r}|)roman_Φ ( | over→ start_ARG italic_r end_ARG | ) via vesc(|r|)=2Φ(|r|)subscript𝑣esc𝑟2Φ𝑟v_{\rm{esc}}(|\vec{r}|)=\sqrt{2\Phi(|\vec{r}|)}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ( | over→ start_ARG italic_r end_ARG | ) = square-root start_ARG 2 roman_Φ ( | over→ start_ARG italic_r end_ARG | ) end_ARG. In practice, however, the Milky Way is neither isolated nor spherical. It is therefore necessary to choose a distance at which we consider a star to be unbound, the choice of which is somewhat arbitrary. To remain consistent with the literature (Deason et al., 2019; Necib & Lin, 2022a), we choose 2R200c2subscript𝑅200c2R_{200\rm{c}}2 italic_R start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT as this limiting radius, where R200csubscript𝑅200cR_{200\rm{c}}italic_R start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT is the radius within which the galaxy’s mean density is 200 times the critical density of the universe

ρc=3H28πG,subscript𝜌c3superscript𝐻28𝜋𝐺\rho_{\rm{c}}=\frac{3H^{2}}{8\pi G},italic_ρ start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT = divide start_ARG 3 italic_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 8 italic_π italic_G end_ARG , (17)

where H𝐻Hitalic_H is the Hubble constant, and G𝐺Gitalic_G is the Newton’s constant of gravity. In this work, we adopt a Hubble constant of H=70kms1Mpc1𝐻70kmsuperscripts1superscriptMpc1H=70\,\rm{km\,s^{-1}\,Mpc^{-1}}italic_H = 70 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (although slightly different from recent measurements (Planck Collaboration et al., 2016), this value is chosen for consistency with prior analyses). With this assumption, the escape velocity is related to the gravitational potential via

vesc(r)=2|Φ(r)Φ(2R200c)|subscript𝑣esc𝑟2Φ𝑟Φ2subscript𝑅200cv_{\rm{esc}}(r)=\sqrt{2|\Phi(r)-\Phi(2R_{200\rm{c}})|}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ( italic_r ) = square-root start_ARG 2 | roman_Φ ( italic_r ) - roman_Φ ( 2 italic_R start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT ) | end_ARG (18)

which we evaluate with cylindrical radius (r𝑟ritalic_r) in the plane of the disk for axisymmetric potentials. One can then obtain the relationship between the potential and density profile via Poisson’s equation, establishing the link between vesc(r)subscript𝑣esc𝑟v_{\rm{esc}}(r)italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT ( italic_r ) and ρ(r)𝜌𝑟\rho(r)italic_ρ ( italic_r ).

To recover the density profile of the DM, it is necessary to model the density profiles of each of the Milky Way components, namely the bulge, thin disk, thick disk, and DM halo. In this work, we adopt model I of Pouliasis, E. et al. (2017), which we show in Table 2, to stay consistent with previous studies (Deason et al., 2019; Necib & Lin, 2022a).

We consider the Navarro-Frenck-White (NFW) profile (Navarro et al., 1996, 1997) for the DM component, with density profile given by

ρ(r)=ρ0rRs(1+rRs)2,𝜌𝑟subscript𝜌0𝑟subscript𝑅𝑠superscript1𝑟subscript𝑅𝑠2\rho(r)=\frac{\rho_{0}}{\frac{r}{R_{s}}\left(1+\frac{r}{R_{s}}\right)^{2}},italic_ρ ( italic_r ) = divide start_ARG italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG divide start_ARG italic_r end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG ( 1 + divide start_ARG italic_r end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (19)

where ρ0subscript𝜌0\rho_{0}italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the normalization and Rssubscript𝑅𝑠R_{s}italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is the scale radius. This profile can be formulated in terms of the equivalent pair of parameters M200cDMsuperscriptsubscriptM200cDM\rm{M}_{200\rm{c}}^{\rm{DM}}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT (DM mass within a sphere of radius R200csubscript𝑅200cR_{200\rm{c}}italic_R start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT) and concentration c200cDM=R200c/Rssuperscriptsubscript𝑐200cDMsubscript𝑅200csubscript𝑅𝑠c_{200\rm{c}}^{\rm{DM}}=R_{200\rm{c}}/R_{s}italic_c start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT = italic_R start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT / italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. The relation between these parameters is given by integrating the density profile to obtain

M200cDM=0R200c4πr2ρ(r)𝑑r=4πρ0Rs3[ln(1+c)c1+c]superscriptsubscriptM200cDMsuperscriptsubscript0subscript𝑅200c4𝜋superscript𝑟2𝜌𝑟differential-d𝑟4𝜋subscript𝜌0superscriptsubscript𝑅𝑠3delimited-[]1𝑐𝑐1𝑐\displaystyle\begin{split}{\rm{M}}_{200\rm{c}}^{\rm{DM}}&=\int_{0}^{R_{200{\rm% {c}}}}4\pi r^{2}\rho(r)dr\\ &=4\pi\rho_{0}R_{s}^{3}\left[\ln(1+c)-\frac{c}{1+c}\right]\end{split}start_ROW start_CELL roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT end_CELL start_CELL = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT end_POSTSUPERSCRIPT 4 italic_π italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ρ ( italic_r ) italic_d italic_r end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = 4 italic_π italic_ρ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT [ roman_ln ( 1 + italic_c ) - divide start_ARG italic_c end_ARG start_ARG 1 + italic_c end_ARG ] end_CELL end_ROW (20)

wherein c𝑐citalic_c is used as a shorthand for c200cDMsuperscriptsubscript𝑐200cDMc_{200\rm{c}}^{\rm{DM}}italic_c start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT. The priors on M200cDMsuperscriptsubscriptM200cDM\rm{M}_{200\rm{c}}^{\rm{DM}}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT and c200cDMsuperscriptsubscript𝑐200cDMc_{200\rm{c}}^{\rm{DM}}italic_c start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT used in fitting are shown in Table 2.

6.2 Milky Way Halo Parameters

We now obtain the halo parameters for the Milky Way by simultaneously fitting the entire escape velocity profile for a given speed distribution model, such as the SEPL or 2PL. We use the full vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT posterior at each Galactocentric radius to evaluate the likelihood of a given set of DM halo parameters at that radius, and assume that each measurement in the profile is independent such that the total likelihood is the product of the individual evaluations of the posteriors. The theoretical escape velocity values at each radius for a given M200cDMsuperscriptsubscriptM200cDM\rm{M}_{200\rm{c}}^{\rm{DM}}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT and c200cDMsuperscriptsubscript𝑐200cDMc_{200\rm{c}}^{\rm{DM}}italic_c start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT are obtained by modeling the galaxy as the combination of this NFW halo and the baryonic components shown in Table 2, performed using galpy (Bovy, 2015).

We perform the fits using emcee (Foreman-Mackey et al., 2013) which is also an affine-invariant Markov Chain Monte Carlo sampler. The best fit to the SEPL escape velocity profile is shown in Fig. 5, and the shaded band is the 68% confidence interval. We also show the decomposition of the best fit profile into the baryon-only and NFW-only components. Constraints on the NFW parameter space due to the vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT measurements of both the SEPL and 2PL models are shown in Fig. 6. It can be seen that both models yield consistent constraints, but those of the 2PL model are more uncertain and biased toward slightly higher masses, likely due to overestimation and the large uncertainty on vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT beyond the solar position for this model.

Refer to caption
Figure 6: Left: Constraints on the parameter space of the NFW halo of the Milky Way, as obtained via the escape velocity measurements of both the two-component power law (2PL) and stretched exponential power law (SEPL) models. Right: Combined constraints on the Milky Way NFW halo parameters due to the SEPL escape velocity constraints and the circular velocity constraint at the solar position of Eilers et al. (2019). Contours correspond to 68%percent6868\%68 % and 95%percent9595\%95 % confidence.

To obtain stronger constraints on the NFW parameter space, previous works have combined escape velocity constraints with those of circular velocity measurements (Piffl et al., 2014; Monari et al., 2018; Deason et al., 2019; Koppelman & Helmi, 2021; Necib & Lin, 2022a). This is because the escape velocity probes the large-scale mass of the Milky Way, whereas circular velocity measurements are sensitive to the enclosed mass at that position. With the same baryonic model from Table 2, in line with Necib & Lin (2022a) we also include the circular velocity measurement at the solar position of Eilers et al. (2019),555More recent studies, such as Ou et al. (2023), have produced new measurements of the circular velocity based on Gaia DR3. However, we adopt the older study of Eilers et al. (2019) for two reasons: First, it is consistent with Necib & Lin (2022a), enabling a more direct comparison, and second, recent studies have shown a significant decline in the rotation curve at large Galactocentric radii that remains to be addressed. We try to avoid such a result affecting our measurements until a full understanding of the new circular velocity curves is established. namely vcirc(r)=23010+10kms1subscript𝑣circsubscript𝑟direct-productsubscriptsuperscript2301010kmsuperscripts1v_{\rm{circ}}(r_{\odot})=230^{+10}_{-10}\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_circ end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT ) = 230 start_POSTSUPERSCRIPT + 10 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 10 end_POSTSUBSCRIPT roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Our combined vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and vcircsubscript𝑣circv_{\rm{circ}}italic_v start_POSTSUBSCRIPT roman_circ end_POSTSUBSCRIPT constraints on the NFW parameter space are shown in Fig. 6. We obtain a final mass estimate for the DM halo of M200cDM=0.550.14+0.15×1012MsuperscriptsubscriptM200cDMsubscriptsuperscript0.550.150.14superscript1012subscriptMdirect-product\rm{M}_{200\rm{c}}^{\rm{DM}}=0.55^{+0.15}_{-0.14}\times 10^{12}\,M_{\odot}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_DM end_POSTSUPERSCRIPT = 0.55 start_POSTSUPERSCRIPT + 0.15 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.14 end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT, which corresponds to a total Milky Way mass of M200c=0.640.14+0.15×1012MsubscriptM200csubscriptsuperscript0.640.150.14superscript1012subscriptMdirect-product\rm{M}_{200\rm{c}}=0.64^{+0.15}_{-0.14}\times 10^{12}\,M_{\odot}roman_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT = 0.64 start_POSTSUPERSCRIPT + 0.15 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.14 end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT with this baryonic model.

A comparison of Milky Way mass estimates obtained by fitting both escape velocity and local circular velocity measurements is shown in Fig. 7. Here it can be seen that the mass estimates obtained via the SEPL modeling and 2PL modeling are consistent, though the 2PL results are less constrained. Furthermore, these results are consistent with those of Necib & Lin (2022a) which were obtained via a 2PL measurement of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT within 2 kpc of the solar position using Gaia eDR3 data. The results are also consistent with the uncorrected Koppelman & Helmi (2021) mass estimate; the correction is motivated by simulations and amounts to increasing all escape velocity estimates by 10%percent1010\%10 %. Our results support the trend of analyses in recent years based on escape velocity modeling, which tend to find a lighter Milky Way of mass 0.51×1012Msimilar-toabsent0.51superscript1012subscript𝑀direct-product\sim 0.5-1\times 10^{12}\,M_{\odot}∼ 0.5 - 1 × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT. This is likely due to the “overshooting” of the 1PL inflating vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT measurements in earlier work.

7 Conclusions

In this paper, we have measured an escape velocity profile of the Milky Way from 411kpc411kpc4-11\,\rm{kpc}4 - 11 roman_kpc using a sample of high-speed stars from Gaia DR3 with 6D kinematics and strict quality cuts. Following Necib & Lin (2022b), we considered single and multi-component power laws to model the tail of the stellar speed distribution and extract escape velocities. Similar to Necib & Lin (2022a), we found that a single power law fit does not provide a good fit to the tail and that it often significantly overestimates the escape velocity compared to the fastest star in the data. The multi-component power law model is motivated by the presence of kinematic substructure, and provides a much better fit. However, with the high quality data of Gaia DR3 and a large range of radii, we found even the two-component power law tends to systematically predict higher vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT. This is particularly noticeable at positions beyond the solar radius, where the power law models give a rising escape velocity profile. Furthermore the vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT result is highly correlated with the slope of the distribution at low speeds, which is often an unconverged parameter in the fits.

Motivated by the above issues with the power law models, we introduced an empirical model which we call the “stretched exponential power law” (SEPL). This model was motivated by the same features that multi-component power laws aimed to capture, such as a different behavior near vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and near vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT. However, in this case, there is more freedom for a steep rise in the distribution near vminsubscript𝑣minv_{\rm{min}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT, while the profile near vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT remains that of a power law. The escape velocity profile obtained using this new model is less susceptible to the issue of overestimating vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT, and gives a falling profile past the solar position, consistent with expectation. This model is empirically motivated, and future work may improve this modeling as data quality and statistics improve or by studying its application to simulations.

Using the resulting escape velocity profile, we constrained the DM halo parameters of the Milky Way, in particular its virial mass, by assuming a model of the baryonic content consistent with the literature. By combining constraints with a complementary circular velocity measurement, we find a total virial mass for the Milky Way of M200c=0.640.14+0.15×1012Msubscript𝑀200csubscriptsuperscript0.640.150.14superscript1012subscriptMdirect-productM_{200\rm{c}}=0.64^{+0.15}_{-0.14}\times 10^{12}\,\rm{M}_{\odot}italic_M start_POSTSUBSCRIPT 200 roman_c end_POSTSUBSCRIPT = 0.64 start_POSTSUPERSCRIPT + 0.15 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT - 0.14 end_POSTSUBSCRIPT × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT using the SEPL model for speed distributions. We obtain consistent but less constrained estimates using the 2PL model. This Milky Way mass is found to be consistent with recent measurements in the literature using power law models, albeit on the lighter end, following the recent trend with such analyses. This trend may result from the improvement in data as well as more modeling techniques that account for substructure in the speed distribution.

Refer to caption
Figure 7: Milky Way total mass estimates obtained via escape velocity modeling along with circular velocity constraints. We compare results from the literature with those of this work. All error bars correspond to 68%percent6868\%68 % confidence intervals, and arrows imply corrections to the inferred escape velocities motivated by simulations.

Apart from the escape velocity, the mass of the Milky Way has been inferred by numerous methods, each resulting in estimates distributed roughly between 0.52×1012M0.52superscript1012subscriptMdirect-product0.5-2\times 10^{12}\,\rm{M}_{\odot}0.5 - 2 × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT over the past decade (Wang et al., 2020). Masses inferred via escape velocities have constituted some of the larger estimates in this population in the past; however, as mentioned above, recent estimates trend below 1012Msuperscript1012subscriptMdirect-product10^{12}\,\rm{M}_{\odot}10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT. Interestingly, some measurements of masses inferred via other probes such as the rotation curve of the inner Milky Way (Ou et al., 2023; Jiao et al., 2023; Zhou et al., 2023) and comparing the Milky Way satellite population to simulations (Barber et al., 2013; Cautun et al., 2014; Patel et al., 2018) have also trended downward in the past decade. This sits in contrast to recent Galaxy mass estimates obtained from modeling the orbital dynamics of globular clusters (Sohn et al., 2018; Watkins et al., 2019) and satellite dwarfs (Fritz et al., 2020), which suggest an intermediate-mass Milky Way (1.4×1012Msimilar-toabsent1.4superscript1012subscriptMdirect-product\sim 1.4\times 10^{12}\,\rm{M_{\odot}}∼ 1.4 × 10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT). Other probes such as phase space distribution modeling (Li et al., 2020; Callingham et al., 2019) exhibit no significant trend (see Wang et al. (2020) for a review of masses obtained via each method). These results together paint a picture of a Milky Way whose mass is close to 1012Msuperscript1012subscript𝑀direct-product10^{12}\,M_{\odot}10 start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT, although with some disagreement to within a factor 2less-than-or-similar-toabsent2\lesssim 2≲ 2. As observations continue to rapidly improve, a unified understanding of these different facets of the Milky Way’s dynamics will be essential to ultimately pinning down the characteristics and history of our Galaxy.

Acknowledgements

This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement. TL acknowledges support from the US Department of Energy Office of Science under Award No. DE-SC0022104. The authors also thank Philip Harris for valuable discussions.

References

  • Abadi et al. (2006) Abadi, M. G., Navarro, J. F., & Steinmetz, M. 2006, Monthly Notices of the Royal Astronomical Society, 365, 747, doi: 10.1111/j.1365-2966.2005.09789.x
  • Akaike (1974) Akaike, H. 1974, IEEE Transactions on Automatic Control, 19, 716
  • Antoja et al. (2018) Antoja, T., Helmi, A., Romero-Gómez, M., et al. 2018, Nature, 561, 360, doi: 10.1038/s41586-018-0510-7
  • Astropy Collaboration et al. (2013) Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, 558, A33, doi: 10.1051/0004-6361/201322068
  • Astropy Collaboration et al. (2018) Astropy Collaboration, Price-Whelan, A. M., Sipőcz, B. M., et al. 2018, AJ, 156, 123, doi: 10.3847/1538-3881/aabc4f
  • Barber et al. (2013) Barber, C., Starkenburg, E., Navarro, J. F., McConnachie, A. W., & Fattahi, A. 2013, Monthly Notices of the Royal Astronomical Society, 437, 959, doi: 10.1093/mnras/stt1959
  • Belokurov et al. (2018) Belokurov, V., Erkal, D., Evans, N. W., Koposov, S. E., & Deason, A. J. 2018, MNRAS, 478, 611, doi: 10.1093/mnras/sty982
  • Bennett & Bovy (2018) Bennett, M., & Bovy, J. 2018, Monthly Notices of the Royal Astronomical Society, 482, 1417, doi: 10.1093/mnras/sty2813
  • Bezanson et al. (2017) Bezanson, J., Edelman, A., Karpinski, S., & Shah, V. B. 2017, SIAM review, 59, 65. https://doi.org/10.1137/141000671
  • Bovy (2015) Bovy, J. 2015, The Astrophysical Journal Supplement Series, 216, 29, doi: 10.1088/0067-0049/216/2/29
  • Callingham et al. (2019) Callingham, T. M., Cautun, M., Deason, A. J., et al. 2019, Monthly Notices of the Royal Astronomical Society, 484, 5453, doi: 10.1093/mnras/stz365
  • Cantat-Gaudin et al. (2023) Cantat-Gaudin, T., Fouesneau, M., Rix, H.-W., et al. 2023, A&A, 669, A55, doi: 10.1051/0004-6361/202244784
  • Cautun et al. (2014) Cautun, M., Frenk, C. S., van de Weygaert, R., Hellwing, W. A., & Jones, B. J. T. 2014, Monthly Notices of the Royal Astronomical Society, 445, 2049, doi: 10.1093/mnras/stu1849
  • Chandra et al. (2023) Chandra, V., Naidu, R. P., Conroy, C., et al. 2023, ApJ, 951, 26, doi: 10.3847/1538-4357/accf13
  • Cirelli et al. (2011) Cirelli, M., Corcella, G., Hektor, A., et al. 2011, JCAP, 03, 051, doi: 10.1088/1475-7516/2012/10/E01
  • Deason et al. (2019) Deason, A. J., Fattahi, A., Belokurov, V., et al. 2019, Monthly Notices of the Royal Astronomical Society, 485, 3514
  • Dierickx & Loeb (2017) Dierickx, M. I. P., & Loeb, A. 2017, The Astrophysical Journal, 847, 42, doi: 10.3847/1538-4357/aa8767
  • Drimmel & Poggio (2018) Drimmel, R., & Poggio, E. 2018, Research Notes of the AAS, 2, 210, doi: 10.3847/2515-5172/aaef8b
  • Eilers et al. (2019) Eilers, A.-C., Hogg, D. W., Rix, H.-W., & Ness, M. K. 2019, ApJ, 871, 120, doi: 10.3847/1538-4357/aaf648
  • Einasto et al. (1974) Einasto, J., Kaasik, A., & Saar, E. 1974, Nature, 250, 309, doi: 10.1038/250309a0
  • Everall & Boubert (2021) Everall, A., & Boubert, D. 2021, Monthly Notices of the Royal Astronomical Society, 509, 6205, doi: 10.1093/mnras/stab3262
  • Foreman-Mackey (2016) Foreman-Mackey, D. 2016, The Journal of Open Source Software, 1, 24, doi: 10.21105/joss.00024
  • Foreman-Mackey et al. (2013) Foreman-Mackey, D., Hogg, D. W., Lang, D., & Goodman, J. 2013, PASP, 125, 306, doi: 10.1086/670067
  • Fritz et al. (2020) Fritz, T. K., Di Cintio, A., Battaglia, G., Brook, C., & Taibi, S. 2020, Monthly Notices of the Royal Astronomical Society, 494, 5178, doi: 10.1093/mnras/staa1040
  • Gaia Collaboration et al. (2016a) Gaia Collaboration, Prusti, T., de Bruijne, J. H. J., et al. 2016a, A&A, 595, A1, doi: 10.1051/0004-6361/201629272
  • Gaia Collaboration et al. (2016b) Gaia Collaboration, Brown, A. G. A., Vallenari, A., et al. 2016b, A&A, 595, A2, doi: 10.1051/0004-6361/201629512
  • Gaia Collaboration et al. (2018) —. 2018, A&A, 616, A1, doi: 10.1051/0004-6361/201833051
  • Gaia Collaboration et al. (2021) —. 2021, A&A, 649, A1, doi: 10.1051/0004-6361/202039657
  • Gaia Collaboration et al. (2023) Gaia Collaboration, Vallenari, A., Brown, A. G. A., et al. 2023, A&A, 674, A1, doi: 10.1051/0004-6361/202243940
  • Gibbons et al. (2014) Gibbons, S. L. J., Belokurov, V., & Evans, N. W. 2014, Monthly Notices of the Royal Astronomical Society, 445, 3788, doi: 10.1093/mnras/stu1986
  • Goodman & Weare (2010) Goodman, J., & Weare, J. 2010, Communications in Applied Mathematics and Computational Science, 5, 65, doi: 10.2140/camcos.2010.5.65
  • Grand et al. (2019) Grand, R. J. J., Deason, A. J., White, S. D. M., et al. 2019, MNRAS, 487, L72, doi: 10.1093/mnrasl/slz092
  • Grand et al. (2017) Grand, R. J. J., Gómez, F. A., Marinacci, F., et al. 2017, MNRAS, 467, 179, doi: 10.1093/mnras/stx071
  • GRAVITY Collaboration et al. (2018) GRAVITY Collaboration, Abuter, R., Amorim, A., et al. 2018, A&A, 615, L15, doi: 10.1051/0004-6361/201833718
  • Green et al. (2019) Green, G. M., Schlafly, E., Zucker, C., Speagle, J. S., & Finkbeiner, D. 2019, ApJ, 887, 93, doi: 10.3847/1538-4357/ab5362
  • Harris et al. (2020) Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357, doi: 10.1038/s41586-020-2649-2
  • Helmi (2020) Helmi, A. 2020, ARA&A, 58, 205, doi: 10.1146/annurev-astro-032620-021917
  • Helmi et al. (2018) Helmi, A., Babusiaux, C., Koppelman, H. H., et al. 2018, Nature, 563, 85, doi: 10.1038/s41586-018-0625-x
  • Jeans (1915) Jeans, J. H. 1915, Monthly Notices of the Royal Astronomical Society, 76, 70, doi: 10.1093/mnras/76.2.70
  • Jeffreys (1961) Jeffreys, H. 1961, Oxford Univ. Press
  • Jiao et al. (2023) Jiao, Y., Hammer, F., Wang, H., et al. 2023, arXiv e-prints, arXiv:2309.00048, doi: 10.48550/arXiv.2309.00048
  • Katz et al. (2022) Katz, D., Sartoretti, P., Guerrier, A., et al. 2022, arXiv e-prints, arXiv:2206.05902, doi: 10.48550/arXiv.2206.05902
  • Kawata et al. (2018) Kawata, D., Baba, J., Ciucǎ, I., et al. 2018, MNRAS, 479, L108, doi: 10.1093/mnrasl/sly107
  • Kluyver et al. (2016) Kluyver, T., Ragan-Kelley, B., Pérez, F., et al. 2016, in Positioning and Power in Academic Publishing: Players, Agents and Agendas, ed. F. Loizides & B. Schmidt, IOS Press, 87 – 90
  • Kochanek (1996) Kochanek, C. S. 1996, ApJ, 457, 228, doi: 10.1086/176724
  • Koppelman & Helmi (2021) Koppelman, & Helmi. 2021, A&A, 649, A136, doi: 10.1051/0004-6361/202038777
  • Küpper et al. (2015) Küpper, A. H. W., Balbinot, E., Bonaca, A., et al. 2015, The Astrophysical Journal, 803, 80, doi: 10.1088/0004-637X/803/2/80
  • Lallement et al. (2019) Lallement, R., Babusiaux, C., Vergely, J. L., et al. 2019, A&A, 625, A135, doi: 10.1051/0004-6361/201834695
  • Lazar et al. (2020) Lazar, A., Bullock, J. S., Boylan-Kolchin, M., et al. 2020, MNRAS, 497, 2393, doi: 10.1093/mnras/staa2101
  • Leike et al. (2020) Leike, R. H., Glatzle, M., & Enßlin, T. A. 2020, A&A, 639, A138, doi: 10.1051/0004-6361/202038169
  • Leonard & Tremaine (1990) Leonard, P. J. T., & Tremaine, S. 1990, ApJ, 353, 486, doi: 10.1086/168638
  • Li et al. (2020) Li, Z.-Z., Qian, Y.-Z., Han, J., et al. 2020, The Astrophysical Journal, 894, 10, doi: 10.3847/1538-4357/ab84f0
  • Lindegren et al. (2021) Lindegren, L., Bastian, U., Biermann, M., et al. 2021, A&A, 649, A4, doi: 10.1051/0004-6361/202039653
  • Lisanti et al. (2019) Lisanti, M., Moschella, M., Outmezguine, N. J., & Slone, O. 2019, Phys. Rev. D, 100, 083009, doi: 10.1103/PhysRevD.100.083009
  • Malhan et al. (2018) Malhan, K., Ibata, R. A., & Martin, N. F. 2018, MNRAS, 481, 3442, doi: 10.1093/mnras/sty2474
  • Marchetti et al. (2022) Marchetti, T., Evans, F. A., & Rossi, E. M. 2022, Monthly Notices of the Royal Astronomical Society, 515, 767, doi: 10.1093/mnras/stac1777
  • Miyamoto & Nagai (1975) Miyamoto, M., & Nagai, R. 1975, PASJ, 27, 533
  • Monari et al. (2018) Monari, G., Famaey, B., Carrillo, I., et al. 2018, A&A, 616, L9, doi: 10.1051/0004-6361/201833748
  • Nadler et al. (2019) Nadler, E. O., Gluscevic, V., Boddy, K. K., & Wechsler, R. H. 2019, Astrophys. J. Lett., 878, 32, doi: 10.3847/2041-8213/ab1eb2
  • Nadler et al. (2021) Nadler, E. O., et al. 2021, Phys. Rev. Lett., 126, 091101, doi: 10.1103/PhysRevLett.126.091101
  • Naidu et al. (2020a) Naidu, R. P., Conroy, C., Bonaca, A., et al. 2020a, ApJ, 901, 48, doi: 10.3847/1538-4357/abaef4
  • Naidu et al. (2020b) —. 2020b, ApJ, 901, 48, doi: 10.3847/1538-4357/abaef4
  • Navarro et al. (1996) Navarro, J. F., Frenk, C. S., & White, S. D. M. 1996, ApJ, 462, 563, doi: 10.1086/177173
  • Navarro et al. (1997) —. 1997, ApJ, 490, 493, doi: 10.1086/304888
  • Necib & Lin (2022a) Necib, L., & Lin, T. 2022a, ApJ, 926, 189, doi: 10.3847/1538-4357/ac4244
  • Necib & Lin (2022b) —. 2022b, ApJ, 926, 188, doi: 10.3847/1538-4357/ac4243
  • Necib et al. (2019) Necib, L., Lisanti, M., & Belokurov, V. 2019, ApJ, 874, 3, doi: 10.3847/1538-4357/ab095b
  • Necib et al. (2020) Necib, L., Ostdiek, B., Lisanti, M., et al. 2020, Nature Astronomy, 4, 1078, doi: 10.1038/s41550-020-1131-2
  • Ostriker et al. (1974) Ostriker, J. P., Peebles, P. J. E., & Yahil, A. 1974, ApJ, 193, L1, doi: 10.1086/181617
  • Ou et al. (2023) Ou, X., Eilers, A.-C., Necib, L., & Frebel, A. 2023, arXiv e-prints, arXiv:2303.12838, doi: 10.48550/arXiv.2303.12838
  • Patel et al. (2018) Patel, E., Besla, G., Mandel, K., & Sohn, S. T. 2018, The Astrophysical Journal, 857, 78, doi: 10.3847/1538-4357/aab78f
  • Piffl et al. (2014) Piffl, T., Binney, J., McMillan, P. J., et al. 2014, MNRAS, 445, 3133, doi: 10.1093/mnras/stu1948
  • Planck Collaboration et al. (2016) Planck Collaboration, Ade, P. A. R., Aghanim, N., et al. 2016, A&A, 594, A13, doi: 10.1051/0004-6361/201525830
  • Plummer (1911) Plummer, H. C. 1911, MNRAS, 71, 460, doi: 10.1093/mnras/71.5.460
  • Pouliasis, E. et al. (2017) Pouliasis, E., Di Matteo, P., & Haywood, M. 2017, A&A, 598, A66, doi: 10.1051/0004-6361/201527346
  • Price-Whelan & Bonaca (2018) Price-Whelan, A. M., & Bonaca, A. 2018, ApJ, 863, L20, doi: 10.3847/2041-8213/aad7b5
  • Rinchiuso et al. (2021) Rinchiuso, L., Macias, O., Moulin, E., Rodd, N. L., & Slatyer, T. R. 2021, Phys. Rev. D, 103, 023011, doi: 10.1103/PhysRevD.103.023011
  • Rubin & Ford (1970) Rubin, V. C., & Ford, W. Kent, J. 1970, ApJ, 159, 379, doi: 10.1086/150317
  • Scannapieco et al. (2009) Scannapieco, C., White, S. D. M., Springel, V., & Tissera, P. B. 2009, Monthly Notices of the Royal Astronomical Society, 396, 696, doi: 10.1111/j.1365-2966.2009.14764.x
  • Slatyer (2018) Slatyer, T. R. 2018, in Theoretical Advanced Study Institute in Elementary Particle Physics: Anticipating the Next Discoveries in Particle Physics, 297–353, doi: 10.1142/9789813233348_0005
  • Smith et al. (2007) Smith, M. C., Ruchti, G. R., Helmi, A., et al. 2007, Monthly Notices of the Royal Astronomical Society, 379, 755
  • Sohn et al. (2018) Sohn, S. T., Watkins, L. L., Fardal, M. A., et al. 2018, The Astrophysical Journal, 862, 52, doi: 10.3847/1538-4357/aacd0b
  • Spergel & Steinhardt (2000) Spergel, D. N., & Steinhardt, P. J. 2000, Phys. Rev. Lett., 84, 3760, doi: 10.1103/PhysRevLett.84.3760
  • Springel et al. (2008) Springel, V., Wang, J., Vogelsberger, M., et al. 2008, MNRAS, 391, 1685, doi: 10.1111/j.1365-2966.2008.14066.x
  • Tollet et al. (2016) Tollet, E., Macciò, A. V., Dutton, A. A., et al. 2016, MNRAS, 456, 3542, doi: 10.1093/mnras/stv2856
  • Trick et al. (2021) Trick, W. H., Fragkoudi, F., Hunt, J. A. S., Mackereth, J. T., & White, S. D. M. 2021, MNRAS, 500, 2645, doi: 10.1093/mnras/staa3317
  • Tulin & Yu (2018) Tulin, S., & Yu, H.-B. 2018, Phys. Rept., 730, 1, doi: 10.1016/j.physrep.2017.11.004
  • Van Rossum & Drake (2009) Van Rossum, G., & Drake, F. L. 2009, Python 3 Reference Manual (Scotts Valley, CA: CreateSpace)
  • Virtanen et al. (2020) Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nature Methods, 17, 261, doi: 10.1038/s41592-019-0686-2
  • Wang et al. (2023) Wang, H.-F., Chrobáková, Ž., López-Corredoira, M., & Sylos Labini, F. 2023, ApJ, 942, 12, doi: 10.3847/1538-4357/aca27c
  • Wang et al. (2020) Wang, W., Han, J., Cautun, M., Li, Z., & Ishigaki, M. N. 2020, Science China Physics, Mechanics & Astronomy, 63, 109801, doi: 10.1007/s11433-019-1541-6
  • Watkins et al. (2019) Watkins, L. L., van der Marel, R. P., Sohn, S. T., & Evans, N. W. 2019, ApJ, 873, 118, doi: 10.3847/1538-4357/ab089f
  • Williams et al. (2017) Williams, A. A., Belokurov, V., Casey, A. R., & Evans, N. W. 2017, Monthly Notices of the Royal Astronomical Society, 468, 2359, doi: 10.1093/mnras/stx508
  • Zhou et al. (2023) Zhou, Y., Li, X., Huang, Y., & Zhang, H. 2023, ApJ, 946, 73, doi: 10.3847/1538-4357/acadd9
  • Zwicky (1933) Zwicky, F. 1933, Helvetica Physica Acta, 6, 110

The Figures in this Appendix are organized as follows: Fig. A1 contains a test of an alternative radial binning to produce 2PL and SEPL escape velocity profiles. Fig. A2 is a collection of the speed distributions at all Galactocentric radii considered here and the corresponding 1PL, 2PL and SEPL fits. Fig. A3, A4 and A5 are the corner plots corresponding to the fits shown in Fig. 3. Fig. A6 shows the correlation between best-fit model parameters for the 1PL, 2PL and SEPL models and the escape velocity as a function of galactocentric radius. The correlation is calculated as the absolute value of the Pearson product-moment correlation coefficient using the numpy function corrcoef (Harris et al., 2020).

Refer to caption
Figure A1: Stretched exponential power law (SEPL) and two-component power law (2PL) escape velocity fits to Gaia DR3 data from 411kpc411kpc4-11\,\rm{kpc}4 - 11 roman_kpc in the fiducial 1kpc1kpc1\,\rm{kpc}1 roman_kpc-wide radial bins (circles, error bars) and in bins offset by 0.5kpc0.5kpc0.5\,\rm{kpc}0.5 roman_kpc (squares, bands), both with vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Error bars and bands represent 68%percent6868\%68 % confidence intervals.
Refer to caption
Figure A2: Escape velocity fits using the power law (1PL), two-component power law (2PL) and stretched exponential power law (SEPL) at vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT at all Galactocentric radii considered in this work. All bands and error bars represent 68%percent6868\%68 % confidence intervals. Vertical position of vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT markers do not encode any information.
Refer to caption
Refer to caption
Figure A3: Single power law (1PL) fit to Gaia DR3 data in the 89kpc89kpc8-9\,\rm{kpc}8 - 9 roman_kpc radial bin and with vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (left) and vmin=400kms1subscript𝑣min400kmsuperscripts1v_{\rm{min}}=400\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 400 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (right). We do not show the outlier fraction foutsubscript𝑓outf_{\rm{out}}italic_f start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT and distribution width σoutsubscript𝜎out\sigma_{\rm{out}}italic_σ start_POSTSUBSCRIPT roman_out end_POSTSUBSCRIPT, as the outlier distribution is consistent with zero in this radial bin.
Refer to caption
Figure A4: Two-component power law (2PL) fit to Gaia DR3 data in the 89kpc89kpc8-9\,\rm{kpc}8 - 9 roman_kpc radial bin and with vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Outlier distribution parameters not shown as the outlier distribution is consistent with 0 in this bin.
Refer to caption
Figure A5: Stretched exponential power law (SEPL) fit to Gaia DR3 data in the 89kpc89kpc8-9\,\rm{kpc}8 - 9 roman_kpc radial bin and with vmin=310kms1subscript𝑣min310kmsuperscripts1v_{\rm{min}}=310\,\rm{km\,s^{-1}}italic_v start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 310 roman_km roman_s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Outlier distribution parameters not shown as the outlier distribution is consistent with 0 in this bin.
Refer to caption
Figure A6: Correlations between vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT and other model parameters. First row is the 1-component power law (1PL) correlations, next 3 rows are 2-component power law (2PL), and final two rows are correlations with the stretched exponential (SEPL) model parameters. Outlier model correlations not shown, as they exhibit no significant difference across models. Black outlines indicate parameters which are often not properly converged, and their correlations with the parameter of interest, vescsubscript𝑣escv_{\rm{esc}}italic_v start_POSTSUBSCRIPT roman_esc end_POSTSUBSCRIPT.