The Power of High Precision Broadband Photometry: Tracing the Milky Way Density Profile with Blue Horizontal Branch Stars in the Dark Energy Survey

Fengqing Yu (余枫青) Department of Computer Science, University of Toronto, 40 St. George Street, Canada Data Sciences Institute, University of Toronto, 17th Floor, Ontario Power Building, 700 University Ave, Toronto, ON M5G 1Z5, Canada Ting S. Li David A. Dunlap Department of Astronomy & Astrophysics, University of Toronto, 50 St George Street, Toronto ON M5S 3H4, Canada Dunlap Institute for Astronomy & Astrophysics, University of Toronto, 50 St George Street, Toronto, ON M5S 3H4, Canada Data Sciences Institute, University of Toronto, 17th Floor, Ontario Power Building, 700 University Ave, Toronto, ON M5G 1Z5, Canada Joshua S. Speagle (沈佳士) Department of Statistical Sciences, University of Toronto, 9th Floor, Ontario Power Building, 700 University Ave, Toronto, ON M5G 1Z5, Canada David A. Dunlap Department of Astronomy & Astrophysics, University of Toronto, 50 St George Street, Toronto ON M5S 3H4, Canada Dunlap Institute for Astronomy & Astrophysics, University of Toronto, 50 St George Street, Toronto, ON M5S 3H4, Canada Data Sciences Institute, University of Toronto, 17th Floor, Ontario Power Building, 700 University Ave, Toronto, ON M5G 1Z5, Canada Gustavo E. Medina David A. Dunlap Department of Astronomy & Astrophysics, University of Toronto, 50 St George Street, Toronto ON M5S 3H4, Canada Dunlap Institute for Astronomy & Astrophysics, University of Toronto, 50 St George Street, Toronto, ON M5S 3H4, Canada Sergey E. Koposov Institute for Astronomy, University of Edinburgh, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ, UK Institute of Astronomy, University of Cambridge, Madingley Road, Cambridge CB3 0HA, UK Kavli Institute for Cosmology, University of Cambridge, Madingley Road, Cambridge CB3 0HA, UK Joss Bland-Hawthorn Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, NSW 2006, Australia Centre of Excellence for All-Sky Astrophysics in Three Dimensions (ASTRO 3D), Australia Lara R. Cullinane Department of Physics and Astronomy, Johns Hopkins University, 3400 N. Charles St, Baltimore, MD 21218, USA Gwendolyn M. Eadie David A. Dunlap Department of Astronomy & Astrophysics, University of Toronto, 50 St George Street, Toronto ON M5S 3H4, Canada Department of Statistical Sciences, University of Toronto, 9th Floor, Ontario Power Building, 700 University Ave, Toronto, ON M5G 1Z5, Canada Data Sciences Institute, University of Toronto, 17th Floor, Ontario Power Building, 700 University Ave, Toronto, ON M5G 1Z5, Canada Denis Erkal Department of Physics, University of Surrey, Guildford GU2 7XH, UK Geraint F. Lewis Sydney Institute for Astronomy, School of Physics, A28, The University of Sydney, NSW 2006, Australia Guilherme Limberg Universidade de São Paulo, IAG, Departamento de Astronomia, SP 05508-090, São Paulo, Brazil Daniel B. Zucker School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW 2109, Australia Macquarie University Research Centre for Astrophysics and Space Technologies, Sydney, NSW 2109, Australia Ting S. Li [email protected]

Abstract

Blue Horizontal Branch (BHB) stars, excellent distant tracers for probing the Milky Way’s halo density profile, are distinguished in the $(g-r)_{0}$ vs $(i-z)_{0}$ color space from another class of stars, blue straggler stars (BSs). We develop a Bayesian mixture model to classify BHB stars using high-precision photometry data from the Dark Energy Survey Data Release 2 (DES DR2). We select $\sim 2100$ highly-probable BHBs based on their $griz$ photometry and the associated uncertainties, and use these stars to map the stellar halo over the Galactocentric radial range $20\lesssim R\lesssim 70$ kpc. After excluding known stellar overdensities, we find that the number density $n_{\star}$ of BHBs can be represented by a power law density profile $n_{\star}\propto R^{-\alpha}$ with an index of $\alpha=4.28_{-0.12}^{+0.13}$ , consistent with existing literature values. In addition, we examine the impact of systematic errors and the spatial inhomogeneity on the fitted density profile. Our work demonstrates the effectiveness of high-precision $griz$ photometry in selecting BHB stars. The upcoming photometric survey from the Rubin Observatory, expected to reach depths 2-3 magnitudes greater than DES during its 10-year mission, will enable us to investigate the density profile of the Milky Way’s halo out to the virial radius, unravelling the complex processes of formation and evolution in our Galaxy.

1 Introduction

The Milky Way halo contains fundamental information about the evolution history of our Galaxy and nature of dark matter (Helmi, 2008). It is generally believed that the Milky Way is formed via hierarchical formation, where small stellar systems such as dwarf galaxies are merged and assembled (Fukushima et al., 2019; Searle & Zinn, 1978). Hence, the stellar halo preserves a fossil record of the Galaxy’s formation history and past accretion events (Helmi, 2020). Moreover, the stellar halo can provide insight into the structure of the dark matter halo through stellar dynamics (Gerhard, 2012). In upcoming surveys, it may even be possible to measure the Galaxy’s change in mass with cosmic time by using the kinematics of a smoothly distributed halo population (Sharma et al., 2023), assuming such a population exists.

The Milky Way stellar halo can be directly probed with tracer populations such as red giant-branch (RGB) stars, RR Lyrae (RRL) stars, and blue horizontal-branch (BHB) stars. These stars are bright, and can be observed even in the very periphery of the Milky Way. Among them, $\mathrm{BHB}$ stars are frequently used because their absolute magnitudes (and thus distances) are relatively straightforward to calibrate (Preston et al., 1991; Fukushima et al., 2018). The stellar density profile of the halo $n_{\star}(R)$ is often fitted with an inverse power law with index $\alpha$ , such that $n_{\star}\propto R^{-\alpha}$ , where $R$ is the Galactocentric radius. Deason et al. (2011a) mapped BHB using Sloan Digital Sky Survey Data Release 8 from 1 - 40 kpc with a broken power-law model; they found an inner slope $\alpha_{\mathrm{in}}=2.3$ and $\alpha_{\mathrm{in}}=4.6$ with the break radius $\approx$ 27 kpc. Deason et al. (2018) subsequently mapped $\mathrm{BHB}$ stars starting at 50 kpc using Hyper Suprime-Cam (HSC) photometry; they measured a slope of 4 when excluding the Sagittarius (Sgr) stream, consistent with the power law from smaller distances.

On the other hand, Thomas et al. (2018) showed an inner slope of $\alpha\approx 4.2$ and a shallower outer slope of $\alpha\approx 3.2$ beyond a radius of about 40 kpc by mapping $\mathrm{BHB}$ stars in the Canada-France Imaging Survey I. More recently, addressing the dichotomy in literature values of the single breaking radius, Han et al. (2022) showed a doubly broken power law with break radii at 12 kpc and 28 kpc using the H3 Survey (Conroy et al., 2019). Because of the discrepancies in the slope and break radius, the stellar halo density profile continues to be an active area of research.

Refer to caption — Figure 1: Photometry from DES DR1 (top left), DECaLS DR9 (top right), and DES DR2 (bottom left) cross-matched with ${S}^{5}$ survey targets, with target classes of BHB and BS as classified by ${S}^{5}$ surface gravity and temperature measurements (see Section 2). The bottom right panel uses the same photometry as the bottom left, but classified according to the photometric mixture model described in Section 3. The improved data quality of DES DR2 relative to DES DR1 and DECaLS DR9 is clearly visible, with both BHB and BS sequences becoming tighter and their separation more apparent. The high quality photometry in DES DR2 is one of the key factors that enables our mixture model to accurately recover the underlying BHB and BS populations from photometric colors alone. We note that we do not include $i-$ band photometry here for comparison since DECaLS DR9 does not include $i-$ band data.

While BHBs are incredibly useful, finding them in photometric data can be challenging. To select BHBs from photometric data, the main obstacle lies in the removal of other contaminants that have similar colors and magnitudes to BHBs, such as blue straggler stars (BSs), white dwarfs (WDs), and quasi-stellar objects (QSOs). Among them, the removal of BSs remains the most difficult as these are much closer to BHBs in the color-color space than other contaminants. Previous works have largely relied on the $u-g$ color to distinguish BHBs and BSs (Yanny et al., 2000; Deason et al., 2011a). Deason et al. (2018) used a combination of $g,r,i,z$ photometry from HSC to identify BHB stars. Using the BHB and BS classifications based on surface gravity and effective temperature from the Southern Stellar Stream Spectroscopic Survey ( ${S}^{5}$ ) and photometry from the Dark Energy Survey Data Release 1 (DES DR1; Abbott et al. 2018), Li et al. (2019) identified a distinct and separable sequence for BHB and BS stars in $(i-z)$ vs $(g-r)$ color-color space. However, this separation becomes more difficult for fainter stars because of the increase in uncertainties, and the decreased fraction of BHBs at fainter magnitudes.

In this study, we exploit the BHB/BS separation seen in Li et al. (2019) and present a statistical model to classify BHB stars using photometric data from Dark Energy Survey Data Release 2 (DES DR2; Abbott et al., 2021), incorporating photometric uncertainties. Using BHB candidates selected from our model, we measure the Galactic stellar halo density profile $n_{\star}(R)$ and compare its slope with literature values.

In Section 2, we present an overview of the photometric data and preprocessing methods used in this study. In Section 3, we describe our statistical model that predicts the probability of a star being a BHB based on its colors and associated uncertainties. In Section 4, we use the selected $\mathrm{BHB}$ candidates to derive the Milky Way density profile. In Section 5, we discuss our findings and implications, and we conclude in Section 6.

2 Data

2.1 High-quality Photometry

High-precision photometry is required for teasing out the BHB signal from the contaminants for a large dataset. Here, we consider the photometry from the Dark Energy Camera (DECam; Flaugher et al., 2015), given its wide sky coverage, large telescope aperture, and the clear BHB/BS sequence separation identified by Li et al. (2019). We utilize two DECam-based surveys, the Dark Energy Survey (DES) and the DECam Legacy Survey (DECaLS), and compare the results from the public data releases of these two surveys. As part of the evaluation, we use the BHB/BS classification based on the measurements from ${S}^{5}$ . Throughout the paper, the subscript 0 indicates our use of extinction-corrected photometry; this correction is made possible using the dust map derived by Schlegel et al. (1998).

The DES is a wide-field optical/near-infrared imaging survey that contains 400 million astronomical objects (Dark Energy Survey Collaboration et al., 2016), covering $\sim$ 5000 deg ${}^{2}$ of the South Galactic Cap region and obtaining photometry in $g,r,i,z,Y$ bands. In this comparison, we include both DES DR1 and DES DR2, the former of which is also used by Li et al. (2019).

We also exploit the DECaLS survey, which forms part of the DESI Legacy Imaging Surveys (Dey et al., 2019). The latter mapped 14,000 deg ${}^{2}$ of the extragalactic sky visible from the northern hemisphere in three optical bands ( $g,r$ , and $z$ ). DECaLS data release 9 (DECaLS DR9) incorporates the DES imaging, which is used in the comparison here.

2.2 Spectroscopic Crossmatching

${S}^{5}$ (Li et al., 2019) is a spectroscopic survey with an initial focus on identifying stream member stars within the footprint of the DES, and an eventual goal of mapping the entire Southern sky. Its first public data release ( ${S}^{5}$ DR1; Li & S5 Collaboration, 2021) includes a total of $\sim 31000$ stars, most of which are concentrated on streams within the DES footprint. In ${S}^{5}$ DR1, the survey measures stellar parameters by fitting interpolated stellar atmosphere models to the IR spectra in the Calcium Triplet region. As shown in Figure 11 of Li et al. (2019), the stellar parameters of surface gravity and effective temperature from ${S}^{5}$ DR1 can effectively provide a distinction between BHB stars and BS stars, with BS stars having higher surface gravity at the same temperature.

We cross-match stars in ${S}^{5}$ DR1 with DES DR1, DES DR2, and DECaLS DR9 respectively. Following Li et al. (2019), We restrict our selection to stars with effective temperature $6000\mathrm{K}<T_{\mathrm{eff}}<10000\mathrm{K}$ and surface gravity $2.5<\log\,g<6$ . We then apply photometric cuts of $16<g_{0}<19$ , $-0.35<(g-r)_{0}<-0.05$ and $-0.5<(i-z)_{0}<0.1$ . To ensure reliable stellar parameters, we require the stars in ${S}^{5}$ to have high signal-to-noise ratio (S/N) spectra (sn_1700d $>5$ ) and small uncertainties on surface gravity (logg_std $<0.5$ ). Furthermore, we require that all $g,r,i,z$ photometry are available in DES DR2. RR Lyrae stars are removed by cross-matching with the RR Lyrae catalog from Gaia Data Release 3 (Gaia Collaboration et al., 2023). After the cut, a total of 365 stars remain. We adopt the same criterion defined by Li et al. (2019) for BHB and BS classification in $\log\,g-T_{\mathrm{eff}}$ space. In Figure 1, we depict the stars in $(r-z)_{0}$ vs $(g-r)_{0}$ color-color space¹¹1Note the $i$ -band is not used here because DECaLS DR9 does not contain $i$ -band photometry, with photometry from DES DR1, DECaLS DR9, and DES DR2 respectively.

Compared to both DES DR1 and DECaLS DR9, DES DR2 shows tighter sequences of BHB and BS stars and a clearer separation. The improvement in photometric calibration in DES DR2 enables our mixture model described in Section 3 to differentiate the BHB and BS sequences with photometry alone. This is an important result.

2.3 Data Preparation

Our ultimate goal is to develop a model which allows a purely photometric selection of BHB stars, since this allows us to select a much larger sample of these stars than those observed by spectroscopic surveys. We build our model with DES DR2. We first apply a magnitude cut between 15 mag and 24 mag for $(g,r,i,z)$ photometry. Next, high-quality photometry is selected by using flags_g,r,i,z $<4$ . We use extended_class_coadd $\leq 1$ to select only objects highly likely to be stars. To select blue stars, we apply color cuts of $-0.35<(g-r)_{0}<-0.05$ (as motivated by ${S}^{5}$ DR1 x DES DR2 in Figure 1) and $-0.2<(i-z)_{0}<0.05$ . One additional color cut, $(r-z)_{0}\geq-0.17+(g-r)_{0}\times 0.09$ , is used to remove outliers likely to be WDs and QSOs, as shown in Figure 2.

The stars are then separated into different magnitude bins, based on $g_{0}$ band magnitude. We show stars at the selected magnitude ranges in Figure 3. From left to right, as the magnitude increases and stars become fainter, the BHB sequence and BS sequence merge together and a clear separation is no longer observed. This phenomenon is mostly due to the increase in photometric uncertainty at fainter magnitudes. While uncertainties in both $(i-z)_{0}$ and $(g-r)_{0}$ increase as the magnitude increases, uncertainties in $(i-z)_{0}$ are approximately 3 times larger than those in $(g-r)_{0}$ for stars with BHB-like colors. At $g_{0}\approx 21$ , the mean uncertainty for $(i-z)_{0}$ reaches 0.035, but the mean uncertainty for $(g-r)_{0}$ is only around 0.012. Thus, we enforce an additional magnitude cut of $16<g_{0}<21$ . In total, we selected 46031 sources for our mixture model described in Section 3.

3 Mixture Model

In this section, we describe a mixture model that predicts the probability of a star being a BHB, dependent on the $g_{0}$ magnitude, $(g-r)_{0}$ and $(i-z)_{0}$ color.

3.1 General Form of the Likelihood

We assume that our observed data can be classified into three categories: BHBs, BSs, and other contaminating sources (which we will refer to as “outliers”). Each of these groups occupies fractions $f_{\rm BHB}$ , $f_{\rm BS}$ and $f_{\rm out}$ in our dataset, respectively, subject to the constraint that $f_{\rm BHB}+f_{\rm BS}+f_{\rm out}=1$ . We assume that the BHB and BS stars lie along sequences in $(g-r)_{0}$ and $(i-z)_{0}$ color-color space centered around central ridgelines $\mu_{\rm BHB}$ and $\mu_{\rm BS}$ , with some amount of intrinsic scatter $\sigma_{\rm BHB}$ and $\sigma_{\rm BS}$ respectively, while outliers are drawn from an alternate distribution which we take to be uniform between some lower and upper color bounds $c_{\rm min}$ and $c_{\rm max}$ . Finally, we assume our observed colors and magnitudes have uncertainties $\sigma_{\rm obs}$ which are accurately estimated by the photometric pipelines.

This gives us the following statistical model. First, we sample the object class $Y$ based on the fractions of objects in each group given some extra data values $D$ (e.g., magnitudes):

\displaystyle Y|D\sim{\rm Categorical}(f_{\rm BHB},f_{\rm BS},f_{\rm out})

(1)

Then, given the result, we assign one of three different color-color distributions to the intrinsic colors $X$ :

$\displaystyle X\|Y={\rm BHB},D$	$\displaystyle\sim{\rm Normal}(\mu_{\rm BHB},\sigma_{\rm BHB})$	(2)
$\displaystyle X\|Y={\rm BS},D$	$\displaystyle\sim{\rm Normal}(\mu_{\rm BS},\sigma_{\rm BS})$	(3)
$\displaystyle X\|Y={\rm Outlier},D$	$\displaystyle\sim{\rm Uniform}(c_{\rm min},c_{\rm max})$	(4)

For example, if the star is categorized as a BS, then the data is assumed to be distributed as normal with mean $\mu_{\mathrm{BS}}$ and standard deviation $\sigma_{\mathrm{BS}}$ .

Finally, we add in observational uncertainties to get the distribution of the observed colors $X_{\rm obs}$ :

X_{\rm obs}|X\sim{\rm Normal}(X,\sigma_{\rm obs})

(5)

Given a collection of $n$ independent and identically distributed (iid) observations $\{X_{{\rm obs},i}\}_{i=1}^{n}$ and combining all our parameters that characterize the detailed relationships in our model into the vector $\boldsymbol{\theta}$ , this gives us the total (combined) log-likelihood:

\ln P_{\rm tot}(\boldsymbol{\theta})=\sum_{i=1}^{n}\ln P(X_{{\rm obs},i}|X_{i}% ,Y_{i},D_{i},\boldsymbol{\theta}).

(6)

The individual likelihood for each object can be written as a sum of the class-dependent log-likelihoods over $j=\{{\rm BHB},{\rm BS},{\rm Outlier}\}$ :

P(X_{{\rm obs},i}|X_{i},Y_{i},D_{i},\boldsymbol{\theta})=\\ \sum_{j}\mathbbm{1}({Y_{i}=j})P_{j}(X_{{\rm obs},i}|X_{i},D_{i},\boldsymbol{% \theta})

(7)

where $\mathbbm{1}({Y_{i}=j})$ is the indicator function which gives $1$ when $Y_{i}=j$ and $0$ otherwise. $P_{j}(\cdot)$ is the probability distribution for $X$ within each class (either Normal or Uniform).

Of course, in practice the particular class $Y_{i}$ for object $i$ is unknown. Therefore, we marginalize over all possible classes, which allows us to replace our indicator function with the expected fraction of objects in each class:

P(X_{{\rm obs},i}|X_{i},D_{i},\boldsymbol{\theta})=\\ \sum_{j}f_{j}(D_{i})\,P_{j}(X_{{\rm obs},i}|X_{i},D_{i},\boldsymbol{\theta})

(8)

In addition, the true color $X_{i}$ of a given object is not known. Thus, We must also marginalize over all possible colors given our observations. Since it is not guaranteed to obtain an analytic solution, we will write out the full likelihood below:

P(X_{{\rm obs},i}|D_{i},\boldsymbol{\theta})=\\ \sum_{j}f_{j}(D_{i})\int P(X_{{\rm obs},i}|X_{i},D_{i},\boldsymbol{\theta})P_{% j}(X_{i}|D_{i},\boldsymbol{\theta})\,{\rm d}X_{i}

(9)

This gives us a tractable form of the likelihood to evaluate, where:

•

$P(X_{{\rm obs},i}|X_{i},D_{i},\boldsymbol{\theta})$ is the noise distribution of the observed colors given the true colors, which we assume follow a Normal distribution with unknown mean and known measurement uncertainties $\sigma_{\rm obs}$ .
•

$P_{j}(X_{i}|D_{i},\boldsymbol{\theta})$ describes the intrinsic color-color relationships for BHB and BS stars as well as outliers, which we take to also be Normal distributions centered around some mean ridgeline $\mu_{j}$ with some intrinsic scatter $\sigma_{\rm int}$ (for BHB and BS classes) or Uniform within some color boundaries (for outliers).
•

The integral over all possible intrinsic colors $X_{i}$ is the explicit way in which we marginalize over the unknown true colors for a given object.

3.2 Detailed Model Implementation

To determine the final functional form for our model, we fit individual models over objects from distinct magnitudes bins over $g_{0}=(16,21)$ with a width of 0.5 mag, as well as across the entire $g_{0}$ magnitude range. After exploring a number of different models with varying complexity, we find a fixed third-order polynomial can accurately describe the ridgeline for the BHB and BS populations at all magnitudes. We take this to be a model for $(i-z)_{0}$ color as a function of $(g-r)_{0}$ color, giving us four parameters each:

	$\displaystyle\mu_{\rm BHB}\equiv(i-z)_{0,{\rm BHB}}$	$\displaystyle=\sum_{k=0}^{3}a_{k}[(g-r)_{0}]^{k}$		(10)
	$\displaystyle\mu_{\rm BS}\equiv(i-z)_{0,{\rm BS}}$	$\displaystyle=\sum_{k=0}^{3}b_{k}[(g-r)_{0}]^{k}$		(11)

We also confirm that the data are fully consistent with constant intrinsic scatter terms $\sigma_{\rm BHB}$ and $\sigma_{\rm BS}$ across all magnitudes, giving us two additional parameters. We find the probability of having a source belong to each group can be described using a linear function of $g_{0}$ magnitude, giving us four additional parameters:

	$\displaystyle f_{\rm out}(g_{0})$	$\displaystyle=c_{0}+c_{1}\times g_{0}$		(12)
	$\displaystyle f_{\rm BHB}(g_{0})$	$\displaystyle=(1-f_{\rm out}(g_{0}))\times(d_{0}+d_{1}\times g_{0})$		(13)

with the probability of being in the BS class determined via:

f_{\rm BS}(g_{0})=1-f_{\rm BHB}(g_{0})-f_{\rm out}(g_{0})

(14)

This gives us a total of 14 unique parameters in our model to parametrize the distribution of BHB, BS, and outlier sources.

To marginalize over the observational uncertainties in color, we solve the integral outlined in equation (9) in two parts. First, we note that the distribution of $(i-z)_{0}$ color at a given $(g-r)_{0}$ color is either Normal (BS or BHB) or uniform (outlier). The integral of a product of two normal distributions is analytic and gives a new normal distribution with a standard deviation of $\sqrt{\sigma_{(i-z)_{0}}^{2}+\sigma_{\rm BHB}^{2}}$ . This leaves us with just the integral over $(g-r)_{0}$ . For each object, we define a grid of 50 points in $(g-r)_{0}$ evenly spaced between $\pm 5\sigma_{(g_{r})_{0}}$ , compute the integral over $(i-z)_{0}$ given each $(g-r)_{0}$ , and then compute the remaining 1-D integral using a simple Riemann sum over the associated $(g-r)_{0}$ likelihood. For the uniform case, the integral always gives a constant value that does not depend on the input parameters (and therefore can simply be ignored).

Table 1: Uniform prior range, best-sampled results with the maximum likelihood, and uncertainties of the 14 parameters in the photometric mixture model.

Parameter	Uniform Prior Range	Best Fit²²2Samples with the maximum likelihood	95 $\%$ Credible Interval
$a_{3}$	$(-0.2,0.2)$	$-$ 0.0088	$(-0.0019,0.0020)$
$a_{2}$	$(-1.0,1.0)$	0.438	$(-0.037,0.037)$
$a_{1}$	$(-10,10)$	2.68	$(-0.21,0.21)$
$a_{0}$	$(-25,25)$	6.88	$(-0.36,0.36)$
$\sigma_{\mathrm{BHB}}$	$(0,0.015)$	0.0049	$(-0.0001,0.0001)$
$b_{3}$	$(-0.2,0.2)$	$-$ 0.0512	$(-0.0018,0.0018)$
$b_{2}$	$(-1.0,1.0)$	0.357	$(-0.037,0.037)$
$b_{1}$	$(-10,10)$	1.54	$(-0.22,0.22)$
$b_{0}$	$(-25,25)$	3.74	$(-0.40,0.39)$
$\sigma_{\mathrm{BS}}$	$(0,0.015)$	0.0107	$(-0.0002,0.0002)$
$c_{1}$	$(-3.5,3.5)$	0.149	$(-0.015,0.017)$
$c_{0}$	$(-0.6,0.6)$	-0.0071	$(-0.0008,0.0007)$
$d_{1}$	$(-3.5,3.5)$	3.022	$(-0.070,0.072)$
$d_{0}$	$(-0.6,0.6)$	-0.1450	$(-0.0037,0.0036)$

We assume broad, uniform priors on all parameters as listed in Table 1. We sample from posterior using the dynesty Nested Sampling package (Speagle, 2020; Koposov et al., 2023) version 1.2.3 in Dynamic Nested Sampling mode (Skilling, 2004, 2006; Higson et al., 2019) under default settings that used multi-ellipsoidal bounds and uniform sampling (Feroz et al., 2009).

3.3 Results

The result of the sampling process is shown in the bottom left panel of Figure 4. The corner plot shows the 1D and 2D marginal posterior distributions of the 14 parameters in our photometric mixture model. There is some covariance among $a_{3},a_{2},a_{1},a_{0}$ . As those parameters collectively describe the position of the BHB ridgeline, they are expected to be dependent on each other. This holds true for parameters $b_{3},b_{2},b_{1},b_{0}$ as well. In addition, the coefficients of the polynomial that models the outlier ratio should also be dependent on each other, corresponding to the observation of covariance in $c_{1},c_{0}$ . This applies to the pair $d_{1},d_{0}$ as well. Other than those pairs, the rest of the parameters should not be dependent on each other, and indeed there is no evidence of covariance shown in the plot.

Using the best-fit parameters in Table 1, the probability of each star being a BHB is calculated via

p_{\mathrm{BHB}}=\frac{f_{\mathrm{BHB}}L_{\mathrm{BHB}}}{f_{\mathrm{BHB}}L_{% \mathrm{BHB}}+f_{\mathrm{BS}}L_{\mathrm{BS}}+f_{\mathrm{outlier}}L_{\mathrm{% outlier}}}

(15)

where $L_{j}$ is the corresponding likelihood described in Section 3.1. $p_{\mathrm{BS}}$ and $p_{\mathrm{outlier}}$ are defined similarly.

In the top right of Figure 4, we show a visualization of the predicted class probabilities of the stars used in our model in color-color space. Stars classified as BHBs, BSs, and outliers are shown in blue circles, purple squares, and red stars, respectively. The predicted class is defined to be the one that shows the highest posterior-marginalized probability, and the color is used to represent the probability (one of $p_{\mathrm{BHB}},p_{\mathrm{BS}},p_{\mathrm{outlier}})$ . The blue solid line and purple solid line show the ridgelines $\mu_{\mathrm{BHB}},\mu_{\mathrm{BS}}$ , calculated from the best-fit parameters in Table 1. The shaded regions with dashed lines on their boundaries along each ridgeline show the intrinsic scatters $\sigma_{\rm int}$ for each sequence. Furthermore, the predicted ratios of BHB stars (blue), BS stars (purple), and outliers (red) as a function of magnitude are shown in the middle right of Figure 4.

Using the mixture model, we predict the probabilities of being a BHB star from 16 mag to 21 mag. Figure 5 shows stars from selected magnitude bins. The model distinguishes BHB versus BS easily at brighter magnitudes $(g_{0}<19)$ with a clear decision boundary. At fainter magnitudes, we see there are substantially fewer BHBs, because of the larger uncertainties and decreasing ratio of BHB versus BS at these fainter magnitudes.

3.4 Validation from cross-matching with ${S}^{5}$ Data

We apply our mixture model to ${S}^{5}$ x DES DR2 data to compute the BHB probability and use a threshold cutoff on the probability of $p_{\mathrm{BHB}}\geq 0.5$ to distinguish between BHB and BS stars. The resulting predictions are shown in the bottom right panel of Figure 1. It matches the prediction from the spectroscopic parameters in ${S}^{5}$ well (see the bottom left panel), with recall 0.962, precision 0.953, and false positive rate 0.047, suggesting the robust performance of our model.

Using the predicted BHB probabilities, we select the BHB candidates from DES DR2 and use them for subsequent analysis to derive the density profile of the Milky Way.

4 Density Profile

4.1 Calibration to Distance and Selection of BHBs

We enforce a cut $p_{\mathrm{BHB}}\geq 0.5$ to select the BHB stars. The distances of BHBs can be calibrated in a straightforward manner. We employ the relations between absolute magnitude $M_{g}$ and $(g-r)_{0}$ color from Belokurov & Koposov (2016),

M_{g}=0.398-0.392(g-r)_{0}+2.729((g-r)_{0})^{2}\\ +29.1128((g-r)_{0})^{3}+113.569((g-r)_{0})^{4}.

(16)

See also a similar relation from Deason et al. (2011b). We then use $g_{0}-M_{g}$ to obtain the distance modulus, which is subsequently converted to heliocentric distance ( $d$ ).

To characterize the underlying density profile of the halo itself, we need to remove stars associated with substructures that will likely result in overdensities. Known substructures in the Milky Way halo which are in the vicinity of our sample include Sculptor, Fornax, the Sgr stream, the Large Magellanic Cloud (LMC). Accordingly, stars in regions close to these substructures are excluded from our density profile study. In addition, we also exclude the region near an external galaxy NGC 300 as there is an excess of blue stars in the DES DR2 photometry in this region. For Fornax and Sculptor, we use a 3-deg radius cut centered at $(\mathrm{RA},\mathrm{Dec})=(39.9583^{\circ},-34.4997^{\circ})$ and $(15.0183^{\circ},-33.7186^{\circ})$ , respectively. For the Sgr stream, we first convert the Galactic coordinate system to the heliocentric spherical coordinate system defined by the orbit of the Sagittarius dwarf galaxy (Majewski et al., 2003), with Sgr longitude-like angle $\Lambda$ and Sgr latitude-like angle $B$ . Stars within the region $|B|\leq 20^{\circ}$ are removed. For the LMC, we remove stars within the region $\mathrm{Dec}\leq-50^{\circ},50^{\circ}\leq\mathrm{RA}\leq 150^{\circ}$ . For NGC 300, we use a 0.5-deg radius cut centered at $(\mathrm{RA},\mathrm{Dec})=(13.7229^{\circ},-37.6844^{\circ})$ .

We show the radial distribution of the selected BHB stars in Figure 6, both before and after the removal of these overdensities. In order of increasing heliocentric distance, overdensities associated with the Sgr stream, the LMC, and Sculptor are visible in the full sample. We also present a view of the on-sky location of our BHB candidates, before and after removal of overdensities, in Figure 7. We note that the removal of overdensities is not exhaustive, and there are numerous stars from substructures which remain in the sample. We discuss these further in section 5.

To establish an accurate density profile, it is important to account for potential partial selection effects for the BHB sample, especially at the faint and bright extremes. We assume that the BHB sample in DES is 100% complete in the magnitude range of $16<g_{0}<20.5$ . Since the absolute magnitude of BHBs $M_{g}$ is a function of $g-r$ color, we compute the maximum distance for a BHB at $g_{\mathrm{min}}=16$ in the color range of $-0.35<(g-r)_{0}<-0.05$ as the lower limit of our complete sample, $d_{\mathrm{min}}=13.05$ kpc. Similarly, We use the minimum distance for a BHB at $g_{\mathrm{max}}=20.5$ in the same color range as the upper limit of our complete sample, $d_{\mathrm{max}}=68.37$ kpc. We consider our sample to be complete between these lower and upper limits, and thus model the density profile only across this region. We additionally exclude stars with Galactocentric distances $R<$ 20 kpc, as we notice those stars are likely highly incomplete, and do not model the density profile in this region. We obtain a total of 2103 BHBs after these selections, and we show histograms (blue) with respect to heliocentric distance (d) and Galactocentric distances ( $R$ ) in the left and right panels of Figure 8 respectively.

4.2 Inhomogeneous Poisson Point Process Model

As the selected BHB candidates are influenced by 1) the spatial coverage of the DES survey and 2) the limits on Galactocentric distances ( $R$ ), they are not a true representation of the underlying stellar distributions. Hence when fitting the expected stellar distribution, we must consider these two selection effects. Below we present the details of modeling the stellar density profile based on the selection effects, and we model each star using an inhomogeneous Poisson point process (IPPP). This method has been successfully employed in previous studies conducted by Bovy et al. (2012), Xue et al. (2015), and Han et al. (2022), among others.

We define the Poisson intensity function in the Galactic coordinate system $(l,b,d)$ . The rate of finding a star can be written as

\lambda(l,b,d)=A\times(R+R_{\mathrm{smooth}})^{-\alpha}\times\\ |\mathbf{J}(l,b,d|R)|\times\mathbf{S_{\mathrm{distance}}}(d,R)\times\mathbf{S_% {\mathrm{sky}}}(l,b)

(17)

where

•

$R$ is the Galactocentric radius, defined by $R=\sqrt{R_{0}^{2}+d^{2}-2R_{0}d\cdot\cos(l)\cdot\cos(d)}$ and $R_{0}$ is the distance from the Sun to the Galactic Center, taken to be 8.3 kpc (Gillessen et al., 2017).
•

$R_{\mathrm{smooth}}$ is a smoothing radius to allow for integration down to 0, and it is set to 1 kpc.
•

$\alpha$ is the slope for the power law. We do not look for a power law with a breaking radius near 20 kpc as this is close to our lower limit in distance (Medina et al., 2018; Watkins et al., 2009).
•

$\mathbf{S_{\mathrm{distance}}}(d,R)$ and $\mathbf{S_{\mathrm{sky}}}(l,b)$ are the distance selection function and spatial selection function based on the DES footprint, respectively.
•

$|\mathbf{J}(l,b,d|R)|$ is the Jacobian term to account for coordinate transformation and $|\mathbf{J}(l,b,d|R)|=d^{2}cos(b)$
•

$A$ is a normalization coefficient.

We define our selection functions to be simple binary indicators as a function of $R$ , $d$ , $l$ , and $b$ , with

\mathbf{S_{\mathrm{distance}}}(d,R)=\begin{cases}1&\text{if 20 kpc }\leq R% \text{ and }d_{\mathrm{min}}<d<d_{\mathrm{max}}\\ 0&\text{else}\end{cases}

(18)

and

\mathbf{S_{\mathrm{sky}}}(l,b)=\mathbf{S}(\mathrm{RA},\mathrm{Dec})=\begin{% cases}1&\text{if }-60^{\circ}\leq\mathrm{RA}\leq 100^{\circ}\\ \text{ and }&\text{ }-70^{\circ}\leq\mathrm{Dec}\leq 10^{\circ}\\ 0&\text{else}\end{cases}

(19)

which approximately constrains us to within the coverage of the DES footprint. Intuitively, $\lambda\cdot(dl\cdot db\cdot dd)$ can be interpreted as the infinitesimal probability of a star existing in an infinitesimal volume region $(dl,db,dd)$ located at position $(l,b,d)$ .

With the rate parameter $\lambda$ , the log-likelihood of an IPPP can be written as

	$\displaystyle\log L$	$\displaystyle=\sum\log L(l,b,d\|\boldsymbol{\phi})$
		$\displaystyle=-\Lambda+\sum\log\lambda(l,b,d\|\boldsymbol{\phi})$		(20)

where $\boldsymbol{\phi}$ represents the set of parameters $\alpha$ and $A$ . $\Lambda$ is the integral of $\lambda$ , and

\Lambda=A\,\int_{d_{\mathrm{min}}}^{d_{\mathrm{max}}}\int_{0}^{2\pi}\int_{-% \frac{\pi}{2}}^{\frac{\pi}{2}}d^{2}\,\cos(b)\,(R+R_{\mathrm{smooth}})^{-\alpha% }\times\\ \mathbf{S_{\mathrm{distance}}}(d,R)\,\mathbf{S_{\mathrm{sky}}}(l,b)\,{\rm d}b% \,{\rm d}l\,{\rm d}d

(21)

As the integral $\Lambda$ is analytically intractable, we approximate it using the trapezoidal rule by integrating over grids of $l,b,d$ . To speed up the computation, we specify a grid of $\alpha$ values ( $\Lambda$ is a function of $\alpha$ ) and pre-computed the integrals at each $\alpha$ . We then use interpolation from scipy (Virtanen et al., 2020) on the $\alpha$ grid such that during the actual sampling process the integral can be easily obtained by inputting $\alpha$ . We again assume uniform priors on the two parameters as listed in Table 2. Similarly to Section 3, we sample from the posterior using the dynesty Nested Sampling package with the default settings.

Table 2: Uniform prior range, samples that has the maximum likelihood (best-fit), and uncertainties of the 2 parameters in the density model.

Parameter	Range	Fit	Interval	Interval
	Prior	Best	68 $\%$ Credible	95 $\%$ Credible
$\alpha$	(2, 7)	4.28	( $-$ 0.07, 0.07)	( $-$ 0.12,0.13)
$\log A$	(1,16)	10.97	( $-$ 0.25, 0.24)	( $-$ 0.44, 0.47)

4.3 Results

We find $\alpha$ that maximizes the likelihood and its $95\%$ $(2\sigma)$ credible interval to be $4.28_{-0.12}^{+0.13}$ and normalization factor A to be $10.97_{-0.44}^{+0.47}$ . In Figure 9, the corner plot shows the 2D and 1D marginal posterior probability distributions of the two parameters of our density profile estimated with dynesty. We see that the power-law index $\alpha$ and normalization coefficient $A$ are well-constrained and positively correlated. This is expected given the constant stellar count $N(\rm{BHB})$ .

4.4 Mock Data

As a verification, we generate mock data from our best-fit parameters (that maximize the likelihood) and compare the predicted stellar distribution with the actual distribution. To generate the predicted stellar distribution, we construct a grid of $l,b,d$ values. For each point $(l,b,d)$ , we compute the density function $\lambda$ with the two selection functions $\mathbf{S}$ , and record the distances $d,R$ . Then, we use the trapezoidal rule over the grid to compute the integral of densities $\lambda$ . We normalize this integral so it is equal to the total number of BHBs. After normalization, the previously computed density $\lambda$ at different distances $d,R$ will approximate the predicted stellar count. In Figure 8, we compare the predicted stellar count (shown in red) to the observed distribution (shown in blue) for both the heliocentric distances $d$ (left panel) and Galactocentric distances $R$ (right panel). The distributions show good agreement, indicating that our model captures the stellar density distribution of the data well.

5 Discussion

5.1 Systematic Error of Hard-Thresholding

During the selection process of BHBs, we use a hard probability threshold cutoff to select sources with $p_{\mathrm{BHB}}\geq 0.5$ . Hard thresholding is likely to introduce artificial contamination to our BHB sample. Although a threshold of 0.5 appears to yield high accuracy for bright stars (shown by testing on S5 $\times$ DES DR2 in Figure 1), it is likely to overestimate or underestimate the star count in the fainter range due to the increased uncertainty. Thus, it is essential to investigate the systematic error as a result of applying different threshold values.

To address this, we generate 28 evenly-spaced (spacing=0.25) probability threshold values ranging from $p_{\mathrm{threshold}}=0.3$ to $1.0$ . Under each threshold, we select BHB candidates and conduct the same analysis in Section 4 to estimate A and $\alpha$ . The results are presented in Figure 10, with the BHB count and estimate of the power law index $\alpha$ displayed in the left and right panels, respectively. The normalization coefficient $A$ is completely degenerate with $\alpha$ (they collectively define the number of stars) and is thus omitted. As the threshold increases, the power-law index increases. This matches our expectation since a stricter threshold cutoff will result in fewer BHBs selected in the fainter magnitude, corresponding to a steeper decline in the density, and equivalently, a larger power law index.

Notably, the plot reveals some small jumps in BHB count and power-law index as the threshold becomes stricter (at $p_{\mathrm{threshold}}\sim 0.9)$ . We hypothesize this is likely due to the local overdensities in the halo, so that a specific threshold cutoff might accidentally remove a group of stars. To visualize this, we calculate the average BHB probability with respect to $g_{0}$ magnitude (as a proxy for heliocentric distance) and Galactocentric distance $R$ . After selecting stars with $p_{\mathrm{BHB}}\geq 0.3$ , which is the lowest threshold in Figure 10, we separate these candidates into different groups based on both $g_{0}$ band photometry and Galactocentric distance $R$ . Within each group, the average BHB probability is computed, shown as the value in each 2D bin in Figure 11. We see that the average BHB probability generally decreases as distance increases, or when the magnitude becomes larger. However, it is worth noting that the trend is not smooth with respect to magnitude and distance. There are some bins that have higher average BHB probabilities than their neighbours. A hard probability threshold cutoff may accidentally remove these small overdensities, resulting in the bumps seen in Figure 10.

The variations in the threshold value exert a non-negligible influence on the fitted parameters. This experiment assesses the systematic error and demonstrates that depending on the method, the true power-law index should fall between $4.2$ to $4.5$ .

Incorporating the probabilities as weights in the density model fitting process could potentially enhance the results. To this end, we explore another method where we drop the threshold cutoff completely and use the probabilities as weights in the log-likelihood for the sampling process instead.

To compare with $N(\mathrm{BHB})$ from the thresholding approach, we sum up the BHB probabilities over all the sources (that possibly include BHBs, BSs, and outliers), shown as the orange star in the left panel of Figure 10. To compare with $\alpha$ , we use these BHB probabilities to define a weighted log-likelihood function. The best-fit $\alpha$ that maximizes this likelihood is shown in orange in the right panel of Figure 10. The weighted log-likelihood function multiplies the log-likelihood $\log\lambda$ of each source by its BHB probability. Using this approach, both the summed probability and power law index seem to align with N(BHB) and $\alpha$ from the 0.5 threshold cutoff. However, for the weighted log-likelihood approach, after we remove the log in the likelihood, multiplying by the weight effectively exponentiates the likelihood value of each star by its respective BHB probability, which lacks a clear physical meaning. Therefore, we include it for reference purposes but do not use it in our analysis.

5.2 Sky Variation

The stellar distribution can be influenced by the particular region of the sky under investigation. To understand the impact of different regions of the sky on the resulting density profile, we partition the DES footprint into three distinct patches, as illustrated in the left panel of Figure 12. We apply the same analysis in Section 4 to each patch and find the power law indexes of those patches range from 4.05 to 5.12, presented in the right panel of Figure 12. The variation of power law indexes among different regions suggests that the Milky Way halo is spatially inhomogeneous, and that it is important to consider the location and coverage of a survey when fitting a density profile. In Section 5.5, we further discuss the implication of the sky variation on a flattened or triaxial halo density profile and suggestions for future investigations.

5.3 Search for Substructures

It is worth highlighting that the removal of substructures is not exhaustive during the data processing phase. As an example, when we select stars at specific distances of stellar streams discovered Shipp et al. (2018), we are able to recover the presence of several previously discovered thinner streams, including Turranburra, Willka Yaku, Chenab, and Elqui, shown in Figure 13. Our catalog thus assumes importance to complement the existing stellar stream catalog and proffer candidates for new streams.

In addition, more diffused and massive streams like Palca/Cetus can be vaguely seen in Figure 6 and 7 around RA $\sim 30^{\circ}$ . Inevitably, these substructures may also have some impact on the halo density measurement, which may also impact the sky variation discussed in Section 5.2. However, given the large number of streams in the survey area, it is not possible to remove all stream members.

5.4 Comparison with Existing Literature

We compare our estimated density profile power-law index $\alpha$ with existing literature in Figure 14, including the measurements using photometric data from the DES (thus the same sky coverage), but with different tracers - RR Lyrae stars (Stringer et al., 2021) and main sequence turn-off (MSTO) stars (Pieres et al., 2020). Our power law index is consistent with previous findings at the same Galactocentric distances $R$ .

Our investigations on different threshold cutoffs and different regions of the sky provide a good explanation for the variability in the existing literature. Models with different probability threshold cutoffs are analogous to the different methods used by other researchers, and we explain the variation for our model quantitatively by assessing the systematic error. The experiment on the different sky patches illustrates the dependence of the fitted density profile on the location and coverage of different surveys.

5.5 Limitations and Future Studies

There are a number of ways we can expand upon the current study. Firstly, the analysis is conducted using stars within a distance range of 20 kpc to 70 kpc due to the conservative selection of high-probable stars. With future surveys with higher depth and better precision, we can map out more distant regions of the halo.

Secondly, although our density model incorporates the fundamental characteristics of the stellar halo, it is limited by assuming a perfect spheroidal shape centered at the Galactic center, with an identical decline in all directions. However, the literature suggests that the true shape (and orientation) of the halo is more complex. Several previous studies have claimed, for instance, that the halo is oblate (Olling & Merrifield, 2000; Sesar et al., 2011; Deason et al., 2011; Bowden et al., 2015; Thomas et al., 2018), prolate (Helmi, 2004; Banerjee & Jog, 2011; Bowden et al., 2016; Fukushima et al., 2018), and triaxial (Law & Majewski, 2010; Deg & Widrow, 2013; Iorio & Belokurov, 2019), although spherical distributions (Fellhauer et al., 2006; Smith et al., 2009; Das et al., 2023; Palau & Miralda-Escudé, 2023) and models where the halo flattening varies with Galactocentric radius or line of sight (e.g., Vera-Ciro & Helmi, 2013; Hernitschek et al., 2018) have also been considered. Additionally, it has been proposed that the halo likely exhibits a misalignment with respect to the galactic disk (e.g. Han et al., 2022), as a result of its accretion history (e.g., Prada et al., 2019; Dillamore et al., 2022). Furthermore, the halo density of the studied region may also be impacted by the wake induced by the infall of LMC (Belokurov et al., 2019; Conroy et al., 2021). Indeed, the variation in $\alpha$ seen in Figure 12 and Section 5.2 may already imply that the density profile is more complicated than spherical, as significantly different declines in stellar densities are observed at similar Galactic latitudes. For future work, it would be advantageous to consider more flexible models that can accommodate such complexities.

5.5.1 Parametrization of the BHB/BS Mixture Model

In Section 3 we parametrize our model that the ratio of BHB and BS is purely a function of $g_{0}$ . However, given that BHB and BS sequences represent two stellar populations with distinct evolutionary trajectories and distributions in the MW, this model might be too simplified.

First, the ratio of the two populations is not expected to be constant as a function of $(g-r)_{0}$ . For this, we experiment with a parametric form involving both $(g-r)_{0}$ and $g_{0}$ in the Appendix A. As detailed in the Appendix, the resulting number of BHB stars as well as the power law index $\alpha$ with the new model shows little difference compared to the simpler model (i.e. without $(g-r)_{0}$ dependency).

Moreover, BHB stars are intrinsically brighter than BS stars, and thus locate much further than BS stars at the same apparent magnitude. Hence we also expect to see a wide range of observed BHB to BS ratios across different lines of sight, in particular, as a function of Galactic latitude where BS is likely dominated by the disk stars. However, since our model considers the sky as a whole, averaging out these variations, the differences in ratios along various lines of sight become less significant. For future research, it would be beneficial to model them as separate populations and parameterize densities of BHB and BS as functions of both distance and location on the sky (in Galactic latitude and longitude).

6 Conclusions

In this study, we have developed a mixture model that predicts the probability of a star being a BHB based on its $g_{0}$ band magnitude, $(g-r)_{0}$ and $(i-z)_{0}$ photometry using DES DR2. Our study demonstrates that, even in the absence of the $u-$ band, we can distinguish between BHB and BS sequences through precise $griz$ photometry. We identify $\sim 2100$ highly probable BHB candidates in the Southern Hemisphere, and investigate the stellar halo within a distance range of 20 to 70 kpc. After excluding stars in the area associated with major known substructures, we observe a smooth decline in the stellar density, with a power law index $\alpha=4.28_{-0.12}^{+0.13}$ , consistent with existing literature values. By drawing connections to our assessment of systematic error in threshold cutoffs and sky locations, we argue that the variations in current power law indexes in the literature can be largely associated with (a) different methodologies used to derive the density profiles, and (b) the inherent spatial inhomogeneity of the halo. We provide the entire catalog (which contains 46031 sources) with computed $p_{\mathrm{BHB}}$ using our model in Appendix B. We hope this catalog will be useful for future research on the Galactic halo.

To gain a more comprehensive understanding of the stellar halo, future studies should focus on using more flexible models and probing more distant stars in the halo. Such a BHB sample would not only help us study the stellar density profile of the Milky Way halo, but would also help identify old substructures such as stellar streams. The latter will contribute to our understanding of the Milky Way’s accretion history.

With forthcoming photometric surveys, like the Rubin Observatory’s Legacy Survey of Space and Time (LSST), we anticipate not only extending our reach to greater distances but also expanding our observational scope across a broader expanse of the sky. This will enable us to investigate the density profile of the halo out to the virial radius of the MW and explore potential spatial inhomogeneity more comprehensively. Overall, this study provides insights into the properties of the stellar halo and sets the stage for future investigations that aim to unravel the complex formation and evolution processes of our Galaxy.

The authors would like to thank the Summer Undergraduate Data Science (SUDS) Opportunities Program at the Data Science Institute at the University of Toronto for providing the funding and opportunity to enable this project. T.S.L. acknowledges financial support from Natural Sciences and Engineering Research Council of Canada (NSERC) through grant RGPIN-2022-04794. S.K. acknowledges support from Science & Technology Facilities Council (STFC) (grant ST/Y001001/1). G.L. acknowledges FAPESP (proc. 2021/10429-0). G.M.E. acknowledges financial support from NSERC through grant RGPIN-2020-04554. The authors would like to thank Alex Drlica-Wagner, Andrew Pace, Adriano Pieres for their helpful comments. This paper includes data obtained with the Anglo-Australian Telescope in Australia. We acknowledge the traditional owners of the land on which the AAT stands, the Gamilaraay people, and pay our respects to elders past and present. For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission. This research has made use of the SIMBAD database, operated at CDS, Strasbourg, France (Wenger et al., 2000). This research has made use of NASA’s Astrophysics Data System Bibliographic Services. This paper made use of the Whole Sky Database (wsdb) created by Sergey Koposov and maintained at the Institute of Astronomy, Cambridge by Sergey Koposov, Vasily Belokurov and Wyn Evans with financial support from the Science & Technology Facilities Council (STFC) and the European Research Council (ERC). This project used public archival data from the Dark Energy Survey (DES). Funding for the DES Projects has been provided by the U.S. Department of Energy, the U.S. National Science Foundation, the Ministry of Science and Education of Spain, the Science and Technology Facilities Council of the United Kingdom, the Higher Education Funding Council for England, the National Center for Supercomputing Applications at the University of Illinois at Urbana-Champaign, the Kavli Institute of Cosmological Physics at the University of Chicago, the Center for Cosmology and Astro-Particle Physics at the Ohio State University, the Mitchell Institute for Fundamental Physics and Astronomy at Texas A&M University, Financiadora de Estudos e Projetos, Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro, Conselho Nacional de Desenvolvimento Científico e Tecnológico and the Ministério da Ciência, Tecnologia e Inovação, the Deutsche Forschungsgemeinschaft, and the Collaborating Institutions in the Dark Energy Survey. The Collaborating Institutions are Argonne National Laboratory, the University of California at Santa Cruz, the University of Cambridge, Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas-Madrid, the University of Chicago, University College London, the DES-Brazil Consortium, the University of Edinburgh, the Eidgenössische Technische Hochschule (ETH) Zürich, Fermi National Accelerator Laboratory, the University of Illinois at Urbana-Champaign, the Institut de Ciències de l’Espai (IEEC/CSIC), the Institut de Física d’Altes Energies, Lawrence Berkeley National Laboratory, the Ludwig-Maximilians Universität München and the associated Excellence Cluster Universe, the University of Michigan, the National Optical Astronomy Observatory, the University of Nottingham, The Ohio State University, the OzDES Membership Consortium, the University of Pennsylvania, the University of Portsmouth, SLAC National Accelerator Laboratory, Stanford University, the University of Sussex, and Texas A&M University. Based in part on observations at Cerro Tololo Inter-American Observatory, National Optical Astronomy Observatory, which is operated by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation. The Legacy Surveys consist of three individual and complementary projects: the Dark Energy Camera Legacy Survey (DECaLS; NOAO Proposal ID # 2014B-0404; PIs: David Schlegel and Arjun Dey), the Beijing-Arizona Sky Survey (BASS; NOAO Proposal ID # 2015A-0801; PIs: Zhou Xu and Xiaohui Fan), and the Mayall z-band Legacy Survey (MzLS; NOAO Proposal ID # 2016A-0453; PI: Arjun Dey). DECaLS, BASS and MzLS together include data obtained, respectively, at the Blanco telescope, Cerro Tololo Inter-American Observatory, National Optical Astronomy Observatory (NOAO); the Bok telescope, Steward Observatory, University of Arizona; and the Mayall telescope, Kitt Peak National Observatory, NOAO. The Legacy Surveys project is honored to be permitted to conduct astronomical research on Iolkam Du’ag (Kitt Peak), a mountain with particular significance to the Tohono O’odham Nation. NOAO is operated by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation. The Legacy Survey team makes use of data products from the Near-Earth Object Wide-field Infrared Survey Explorer (NEOWISE), which is a project of the Jet Propulsion Laboratory/California Institute of Technology. NEOWISE is funded by the National Aeronautics and Space Administration. The Legacy Surveys imaging of the DESI footprint is supported by the Director, Office of Science, Office of High Energy Physics of the U.S. Department of Energy under Contract No. DE-AC02-05CH1123, by the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility under the same contract; and by the U.S. National Science Foundation, Division of Astronomical Sciences under Contract No. AST-0950945 to NOAO. Software: numpy (van der Walt et al., 2011), scipy (Virtanen et al., 2020), matplotlib (Hunter, 2007), astropy (Astropy Collaboration et al., 2013, 2018, 2022), dynesty (Speagle, 2020; Koposov et al., 2023),

Appendix A BHB ratio as a function of both color and magnitude

We explore a parametric form of BHB ratio defined with $(g-r)_{0}$ and $g_{0}$ . The statistical model for this parametric form is defined similarly as the previous (see Section 3.1), except that we describe the probability of having a source belong to the BHB group using a polynomial function of $g_{0}$ magnitude and $(g-r)_{0}$ color (vs equation 13, which is defined solely on $g_{0}$ ). We keep the probability of having a source belong to the outlier group as a linear function of $g_{0}$ . This gives us five parameters:

f_{\rm BHB}(g_{0})=(1-f_{\rm out}(g_{0}))\times[d_{0}+d_{1}\times g_{0}\\ +d_{2}\times(g-r)_{0}+d_{3}\times g_{0}\times(g-r)_{0}+d_{4}\times(g-r)_{0}^{2}]

(A1)

with the probability of being in the BS class determined via:

f_{\rm BS}(g_{0})=1-f_{\rm BHB}(g_{0})-f_{\rm out}(g_{0})

(A2)

Hence we have a total of 17 unique parameters in our model to parametrize the distribution of BHB, BS, and outlier sources.

We conduct the same sampling process to find the parameters that maximize the likelihood, shown in Figure 15. We also plot the probability of a source being each of the group, shown in Figure 16. We then find the number of BHBs following the same processing discussed in Section 4. The number of BHB is found to be 2111, and the power law index $\alpha$ is $4.27^{+0.13}_{-0.13}$ , shown in Figure 17 and Figure 18, respectively. Both of them show little difference from the previous fit.

Appendix B BHB catalog

Table 3: A catalog with BHB probability

p_{\mathrm{BHB}}

computed from the mixture model described in Section 3 using

griz

photometry from DES DR2. Distance is the heliocentric distance, computed assuming the sources are BHBs. A total of 46031 sources are included. Only the first ten lines are shown. The complete table is available online in a machine readable format.

Coadd Object ID	RA	Dec	$g_{0}$	$r_{0}$	$i_{0}$	$z_{0}$	distance	$p_{\mathrm{BHB}}$
994953737	345.969096	-42.549333	20.89	21.08	21.26	21.35	118.9	0.00
999753919	346.598197	-42.483844	19.32	19.54	19.73	19.78	56.18	0.95
999777071	346.818066	-42.739140	19.98	20.14	20.3	20.33	79.45	0.97
995168807	348.283024	-43.375280	17.19	17.50	17.74	17.84	17.53	0.98
998800330	347.896323	-43.028549	17.66	17.89	18.09	18.19	25.66	0.00
1001308502	349.326354	-43.156899	17.41	17.73	17.93	18.07	18.91	0.00
998795299	348.005177	-42.971495	20.24	20.36	20.50	20.60	90.63	0.00
998786086	347.917693	-42.875529	18.55	18.69	18.85	18.93	41.12	0.00
998773023	347.836941	-42.732092	17.52	17.66	17.82	17.89	25.72	0.00
1001283704	348.961490	-42.916494	18.60	18.65	18.75	18.80	43.20	0.00

Table 3 shows a compiled catalog from Section 2.3, which contains 46031 sources, with $p_{\mathrm{BHB}}$ computed by our mixture model described in Section 3 and heliocentric distance $d$ computed assuming the sources are BHBs using Equation 16. We also provide Coadd Object ID, RA, Dec, and $griz$ photometry, obtained from DES DR2 for each source.

References

Abbott et al. (2018) Abbott, T. M. C., Abdalla, F. B., Allam, S., et al. 2018, ApJS, 239, 18, doi: 10.3847/1538-4365/aae9f0
Abbott et al. (2021) Abbott, T. M. C., Adamów, M., Aguena, M., et al. 2021, ApJS, 255, 20, doi: 10.3847/1538-4365/ac00b3
Astropy Collaboration et al. (2013) Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, 558, A33, doi: 10.1051/0004-6361/201322068
Astropy Collaboration et al. (2018) Astropy Collaboration, Price-Whelan, A. M., Sipőcz, B. M., et al. 2018, AJ, 156, 123, doi: 10.3847/1538-3881/aabc4f
Astropy Collaboration et al. (2022) Astropy Collaboration, Price-Whelan, A. M., Lim, P. L., et al. 2022, ApJ, 935, 167, doi: 10.3847/1538-4357/ac7c74
Banerjee & Jog (2011) Banerjee, A., & Jog, C. J. 2011, ApJ, 732, L8, doi: 10.1088/2041-8205/732/1/L8
Belokurov et al. (2019) Belokurov, V., Deason, A. J., Erkal, D., et al. 2019, MNRAS, 488, L47, doi: 10.1093/mnrasl/slz101
Belokurov & Koposov (2016) Belokurov, V., & Koposov, S. E. 2016, MNRAS, 456, 602, doi: 10.1093/mnras/stv2688
Bovy et al. (2012) Bovy, J., Rix, H.-W., Liu, C., et al. 2012, The Astrophysical Journal, 753, 148, doi: 10.1088/0004-637X/753/2/148
Bowden et al. (2015) Bowden, A., Belokurov, V., & Evans, N. W. 2015, MNRAS, 449, 1391, doi: 10.1093/mnras/stv285
Bowden et al. (2016) Bowden, A., Evans, N. W., & Williams, A. A. 2016, MNRAS, 460, 329, doi: 10.1093/mnras/stw994
Conroy et al. (2021) Conroy, C., Naidu, R. P., Garavito-Camargo, N., et al. 2021, Nature, 592, 534, doi: 10.1038/s41586-021-03385-7
Conroy et al. (2019) Conroy, C., Bonaca, A., Cargile, P., et al. 2019, ApJ, 883, 107, doi: 10.3847/1538-4357/ab38b8
Dark Energy Survey Collaboration et al. (2016) Dark Energy Survey Collaboration, Abbott, T., Abdalla, F. B., et al. 2016, MNRAS, 460, 1270, doi: 10.1093/mnras/stw641
Das et al. (2023) Das, M., Ianjamasimanana, R., McGaugh, S. S., Schombert, J., & Dwarakanath, K. S. 2023, ApJ, 946, L8, doi: 10.3847/2041-8213/acc10e
Deason et al. (2011a) Deason, A. J., Belokurov, V., & Evans, N. W. 2011a, MNRAS, 416, 2903, doi: 10.1111/j.1365-2966.2011.19237.x
Deason et al. (2011b) —. 2011b, MNRAS, 416, 2903, doi: 10.1111/j.1365-2966.2011.19237.x
Deason et al. (2011) Deason, A. J., Belokurov, V., & Evans, N. W. 2011, Monthly Notices of the Royal Astronomical Society, 416, 2903, doi: 10.1111/j.1365-2966.2011.19237.x
Deason et al. (2018) Deason, A. J., Belokurov, V., & Koposov, S. E. 2018, ApJ, 852, 118, doi: 10.3847/1538-4357/aa9d19
Deg & Widrow (2013) Deg, N., & Widrow, L. 2013, MNRAS, 428, 912, doi: 10.1093/mnras/sts089
Dey et al. (2019) Dey, A., Schlegel, D. J., Lang, D., et al. 2019, AJ, 157, 168, doi: 10.3847/1538-3881/ab089d
Dillamore et al. (2022) Dillamore, A. M., Belokurov, V., Font, A. S., & McCarthy, I. G. 2022, MNRAS, 513, 1867, doi: 10.1093/mnras/stac1038
Fellhauer et al. (2006) Fellhauer, M., Belokurov, V., Evans, N. W., et al. 2006, ApJ, 651, 167, doi: 10.1086/507128
Feroz et al. (2009) Feroz, F., Hobson, M. P., & Bridges, M. 2009, MNRAS, 398, 1601, doi: 10.1111/j.1365-2966.2009.14548.x
Flaugher et al. (2015) Flaugher, B., Diehl, H. T., Honscheid, K., et al. 2015, AJ, 150, 150, doi: 10.1088/0004-6256/150/5/150
Fukushima et al. (2018) Fukushima, T., Chiba, M., Homma, D., et al. 2018, Publications of the Astronomical Society of Japan, Volume 70, Issue 4, id.69, 70, 69, doi: 10.1093/pasj/psy060
Fukushima et al. (2019) Fukushima, T., Chiba, M., Tanaka, M., et al. 2019, Publications of the Astronomical Society of Japan, 71, 72, doi: 10.1093/pasj/psz052
Gaia Collaboration et al. (2023) Gaia Collaboration, Vallenari, A., Brown, A. G. A., et al. 2023, A&A, 674, A1, doi: 10.1051/0004-6361/202243940
Gerhard (2012) Gerhard, O. 2012, Proceedings of the International Astronomical Union, 8, 211, doi: 10.1017/S174392131300481X
Gillessen et al. (2017) Gillessen, S., Plewa, P. M., Eisenhauer, F., et al. 2017, ApJ, 837, 30, doi: 10.3847/1538-4357/aa5c41
Han et al. (2022) Han, J. J., Conroy, C., Johnson, B. D., et al. 2022, The Astronomical Journal, 164, 249, doi: 10.3847/1538-3881/ac97e9
Helmi (2004) Helmi, A. 2004, MNRAS, 351, 643, doi: 10.1111/j.1365-2966.2004.07812.x
Helmi (2008) Helmi, A. 2008, The Astronomy and Astrophysics Review, 15, 145, doi: 10.1007/s00159-008-0009-6
Helmi (2020) Helmi, A. 2020, ARA&A, 58, 205, doi: 10.1146/annurev-astro-032620-021917
Hernitschek et al. (2018) Hernitschek, N., Cohen, J. G., Rix, H.-W., et al. 2018, The Astrophysical Journal, 859, 31, doi: 10.3847/1538-4357/aabfbb
Higson et al. (2019) Higson, E., Handley, W., Hobson, M., & Lasenby, A. 2019, Statistics and Computing, 29, 891, doi: 10.1007/s11222-018-9844-0
Hunter (2007) Hunter, J. D. 2007, Computing in Science and Engineering, 9, 90, doi: 10.1109/MCSE.2007.55
Iorio & Belokurov (2019) Iorio, G., & Belokurov, V. 2019, MNRAS, 482, 3868, doi: 10.1093/mnras/sty2806
Iorio et al. (2018) Iorio, G., Belokurov, V., Erkal, D., et al. 2018, Monthly Notices of the Royal Astronomical Society, 474, 2142, doi: 10.1093/mnras/stx2819
Koposov et al. (2023) Koposov, S., Speagle, J., Barbary, K., et al. 2023, joshspeagle/dynesty: v2.1.2, Zenodo, doi: 10.5281/zenodo.7995596
Law & Majewski (2010) Law, D. R., & Majewski, S. R. 2010, ApJ, 718, 1128, doi: 10.1088/0004-637X/718/2/1128
Li & S5 Collaboration (2021) Li, T., & S5 Collaboration. 2021, Southern Stellar Stream Spectroscopic Survey: The First Public Data Release, Data Release 1, Zenodo, doi: 10.5281/zenodo.4695135
Li et al. (2019) Li, T. S., Koposov, S. E., Zucker, D. B., et al. 2019, MNRAS, 490, 3508, doi: 10.1093/mnras/stz2731
Majewski et al. (2003) Majewski, S. R., Skrutskie, M. F., Weinberg, M. D., & Ostheimer, J. C. 2003, ApJ, 599, 1082, doi: 10.1086/379504
Medina et al. (2018) Medina, G. E., Muñoz, R. R., Vivas, A. K., et al. 2018, The Astrophysical Journal, 855, 43, doi: 10.3847/1538-4357/aaad02
Olling & Merrifield (2000) Olling, R. P., & Merrifield, M. R. 2000, MNRAS, 311, 361, doi: 10.1046/j.1365-8711.2000.03053.x
Palau & Miralda-Escudé (2023) Palau, C. G., & Miralda-Escudé, J. 2023, MNRAS, 524, 2124, doi: 10.1093/mnras/stad1930
Pieres et al. (2020) Pieres, A., Girardi, L., Balbinot, E., et al. 2020, MNRAS, 497, 1547, doi: 10.1093/mnras/staa1980
Prada et al. (2019) Prada, J., Forero-Romero, J. E., Grand, R. J. J., Pakmor, R., & Springel, V. 2019, MNRAS, 490, 4877, doi: 10.1093/mnras/stz2873
Preston et al. (1991) Preston, G. W., Shectman, S. A., & Beers, T. C. 1991, ApJ, 375, 121, doi: 10.1086/170175
Schlegel et al. (1998) Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525, doi: 10.1086/305772
Searle & Zinn (1978) Searle, L., & Zinn, R. 1978, The Astrophysical Journal, 225, 357, doi: 10.1086/156499
Sesar et al. (2011) Sesar, B., Jurić, M., & Ivezić, Ž. 2011, ApJ, 731, 4, doi: 10.1088/0004-637X/731/1/4
Sharma et al. (2023) Sharma, S., Bland-Hawthorn, J., Silk, J., & Boehm, C. 2023, MNRAS, 521, 4074, doi: 10.1093/mnras/stad721
Shipp et al. (2018) Shipp, N., Drlica-Wagner, A., Balbinot, E., et al. 2018, ApJ, 862, 114, doi: 10.3847/1538-4357/aacdab
Skilling (2004) Skilling, J. 2004, in American Institute of Physics Conference Series, Vol. 735, Bayesian Inference and Maximum Entropy Methods in Science and Engineering: 24th International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, ed. R. Fischer, R. Preuss, & U. V. Toussaint, 395–405, doi: 10.1063/1.1835238
Skilling (2006) Skilling, J. 2006, Bayesian Analysis, 1, 833 , doi: 10.1214/06-BA127
Smith et al. (2009) Smith, M. C., Evans, N. W., & An, J. H. 2009, ApJ, 698, 1110, doi: 10.1088/0004-637X/698/2/1110
Speagle (2020) Speagle, J. S. 2020, MNRAS, 493, 3132, doi: 10.1093/mnras/staa278
Starkenburg et al. (2019) Starkenburg, E., Youakim, K., Martin, N., et al. 2019, Monthly Notices of the Royal Astronomical Society, 490, 5757, doi: 10.1093/mnras/stz2935
Stringer et al. (2021) Stringer, K. M., Drlica-Wagner, A., Macri, L., et al. 2021, ApJ, 911, 109, doi: 10.3847/1538-4357/abe873
Thomas et al. (2018) Thomas, G. F., McConnachie, A. W., Ibata, R. A., et al. 2018, Monthly Notices of the Royal Astronomical Society, 481, 5223, doi: 10.1093/mnras/sty2604
van der Walt et al. (2011) van der Walt, S., Colbert, S. C., & Varoquaux, G. 2011, Computing in Science and Engineering, 13, 22, doi: 10.1109/MCSE.2011.37
Vera-Ciro & Helmi (2013) Vera-Ciro, C., & Helmi, A. 2013, ApJ, 773, L4, doi: 10.1088/2041-8205/773/1/L4
Virtanen et al. (2020) Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, Nature Methods, 17, 261, doi: 10.1038/s41592-019-0686-2
Watkins et al. (2009) Watkins, L. L., Evans, N. W., Belokurov, V., et al. 2009, Monthly Notices of the Royal Astronomical Society, 398, 1757, doi: 10.1111/j.1365-2966.2009.15242.x
Wenger et al. (2000) Wenger, M., Ochsenbein, F., Egret, D., et al. 2000, A&AS, 143, 9, doi: 10.1051/aas:2000332
Xue et al. (2015) Xue, X.-X., Rix, H.-W., Ma, Z., et al. 2015, The Astrophysical Journal, 809, 144, doi: 10.1088/0004-637X/809/2/144
Yanny et al. (2000) Yanny, B., Newberg, H. J., Kent, S., et al. 2000, ApJ, 540, 825, doi: 10.1086/309386

The Power of High Precision Broadband Photometry: Tracing the Milky Way Density Profile with Blue Horizontal Branch Stars in the Dark Energy Survey

Abstract

1 Introduction

2 Data

2.1 High-quality Photometry

2.2 Spectroscopic Crossmatching

2.3 Data Preparation

3 Mixture Model

3.1 General Form of the Likelihood

3.2 Detailed Model Implementation

3.3 Results

3.4 Validation from cross-matching with S5superscript𝑆5{S}^{5}italic_S start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT Data

4 Density Profile

4.1 Calibration to Distance and Selection of BHBs

4.2 Inhomogeneous Poisson Point Process Model

4.3 Results

4.4 Mock Data

5 Discussion

5.1 Systematic Error of Hard-Thresholding

5.2 Sky Variation

5.3 Search for Substructures

5.4 Comparison with Existing Literature

5.5 Limitations and Future Studies

5.5.1 Parametrization of the BHB/BS Mixture Model

6 Conclusions

Appendix A BHB ratio as a function of both color and magnitude

Appendix B BHB catalog

References

3.4 Validation from cross-matching with ${S}^{5}$ Data