Abstract
Observations have revealed massive () galaxies that were already dead when the universe was only ∼2 Gyr. Given the short time before these galaxies were quenched, their past histories and quenching mechanism(s) are of particular interest. In this paper, we study star formation histories (SFHs) of 24 massive galaxies at 1.6 < z < 2.5. A deep slitless spectroscopy and imaging data set collected from multiple Hubble Space Telescope surveys allows robust determination of their spectral energy distributions and SFHs with no functional assumption on their forms. We find that most of our massive galaxies had formed >50% of their extant masses by ∼1.5 Gyr before the time of observed redshifts, with a trend where more massive galaxies form earlier. Their stellar-phase metallicities are already compatible with those of local early-type galaxies, with a median value of and scatter of ∼0.15 dex. In combination with the reconstructed SFHs, we reveal their rapid metallicity evolution from z ∼ 5.5 to ∼2.2 at a rate of ∼0.2 dex Gyr−1 in . Interestingly, the inferred stellar-phase metallicities are, when compared at half-mass time, ∼0.25 dex higher than observed gas-phase metallicities of star-forming galaxies. While systematic uncertainties remain, this may imply that these quenched galaxies have continued low-level star formation, rather than abruptly terminating their star formation activity, and kept enhancing their metallicity until recently.
Export citation and abstract BibTeX RIS
1. Introduction
In the local universe, early-type galaxies dominate the massive end of the galaxy mass function, (Cole et al. 2001; Bell et al. 2003). Those galaxies consist of old and chemically enriched stellar populations, indicating that most of their star formation activities ended (Kauffmann et al. 2003; Thomas et al. 2003, 2010; Gallazzi et al. 2005; Treu et al. 2005b). In fact, observations have revealed that some galaxies are already massive and passively evolving at z ≳ 2 (Cimatti et al. 2004; Daddi et al. 2005; van Dokkum et al. 2008; Kriek et al. 2009; Belli et al. 2014; Straatman et al. 2014; Marsan et al. 2015; Glazebrook et al. 2017). Given the short time since the big bang and their stellar mass, their earlier star formation must be extremely intense, followed by a rapid cessation of their star formation activity, which we here refer to as quenching.10
However, these episodes still remain observationally indirect. What were their star formation histories (SFHs) like? How and why did they stop forming stars, especially at the peak time of the cosmic star formation? Are they already enriched in metallicity as local counterparts, or do any post-quenching processes play key roles over the following 10 Gyr? These are the central questions we aim to answer in this series of papers. In this first paper, we focus on their SFHs.
A number of studies have investigated SFHs of massive galaxies using different approaches. For example, observations of high-redshift galaxies provide an analogy to their past properties, especially when they were actively forming stars. While sufficient valuable information can be obtained from high-z populations (e.g., star formation rate (SFR), number density, metallicity; e.g., Hamann & Ferland 1999; Tacconi et al. 2008; Toft et al. 2014), it is limited by its rather indirect aspect, where connecting different objects at different epochs may introduce systematic uncertainties (e.g., Wellons et al. 2015; Torrey et al. 2017).
Another approach is based on the archeological information of local galaxies, or fossil record (Thomas et al. 2003, 2010; Heavens et al. 2004; McDermid et al. 2015). Detailed information about their stellar population (e.g., age and chemical abundances) provides their past histories, characterizing their formation redshift to . This information from the local galaxies is, however, limited up to several Gyr with current observing facilities (e.g., Worthey 1994; see also Conroy 2013), which is short for exploring the SFHs of galaxies that are already dead at z ∼ 2.
To explore evolution histories in the earlier epoch, we need a method that combines some of the virtues of both approaches, that is, the archeological study of high-z galaxies. For example, Kelson et al. (2014) performed spectral energy distribution (SED) modeling of galaxies at z ∼ 1 by using low-resolution spectra and broadband photometry and reconstructed their SFHs back to z ∼ 1.5 (also Dressler et al. 2016, 2018). Chauke et al. (2018) recently attempted a similar approach to galaxies at but with higher-resolution spectra taken with a ground-based spectrograph and successfully revealed their formation histories back to z ∼ 2.5. In the current study, we target galaxies at higher redshift, aiming at earlier evolution histories up to their formation redshift from the fossil record obtained with the low spectral resolution yet high-sensitivity Hubble Space Telescope (HST) spectrophotometric data set.11
Stellar metallicity is another key parameter that provides further details of physical mechanisms. In particular, since both the cosmic metallicity and individual gas-phase metallicity are still pristine at these redshifts (e.g., Erb et al. 2006; Maiolino et al. 2008; Lehner et al. 2016), the enrichment process within such massive systems has to be substantial to explain the observed solar/supersolar metallicity of lower-redshift galaxies (Onodera et al. 2012; Choi et al. 2014; Gallazzi et al. 2014; Lonoce et al. 2015), whereas the process is highly dependent on SFHs (e.g., Peng et al. 2015).
With such demands, we here improve our previous methodology of SED modeling, which is free from functional forms of SFHs, by increasing the flexibility in metallicity. We collect 24 massive quenched galaxies at z ∼ 2 that have deep WFC3/G102 and G141 grism spectra coverage in their rest-frame 4000 Å. The combination of grism spectra and wide broadband photometry (0.2–8.0 μm by HST and Spitzer) provides a unique opportunity to constrain not only age but also metallicity from the entire SED shape.
We proceed as follows. In Section 2, we describe the data used in this study and their reduction process. In Section 3, we introduce our method for the SED modeling. In Section 4, we show the results. We discuss our results and interpretation in Section 5 and close in Section 6. Further details, including intensive simulation tests of SED modeling and comparison with functional SFHs, are presented in the Appendices. Throughout the text, magnitudes are quoted in the AB system (Oke & Gunn 1983; Fukugita et al. 1996); Ωm = 0.27, ΩΛ = 0.73, for the cosmological parameters; and Z⊙ = 0.0142 (Asplund et al. 2009) for the solar metallicity.
2. Data
To achieve our goal of constraining galaxy SFHs, it is essential to cover the wavelength range surrounding 4000 Å, where the spectral features are most age-sensitive, with sufficiently deep spectra and rest-frame NUV-optical-NIR wavelength with broadband photometry. Therefore, we limit ourselves to only fields where deep HST grism data are available, which are MACS 1149.6+2223 (hereafter M1149) and the GOODS-North/South (GDN/GDS). To collect the initial photometric sample galaxies, we use publicly available photometric catalogs. We use HST/WFC3 G102 and G141 grism data (that covers λobs ∼ 8000–17000 Å) taken in various surveys in these fields.
2.1. Initial Photometric Sample
M1149 is a sightline of a massive cluster of galaxies at z = 0.544. The data were taken in CLASH (Postman et al. 2012), Hubble Frontier Fields (Lotz et al. 2017), GLASS (Schmidt et al. 2014; Treu et al. 2015), and the SN Refsdal follow-up campaigns (Kelly et al. 2015, 2016). The combination of its gravitational magnification power and those very deep observations provides a unique opportunity among other clusters of GLASS. We use the photometric catalog used in Morishita et al. (2017, 2018), which consists of HST photometry taken in all HST surveys above, as well as Spitzer IRAC (3.6 + 4.5 μm; PIs: T. Soifer and P. Capak) and ground-based KS-band imaging (Brammer et al. 2016) and publicly available spectroscopic redshifts by GLASS (Schmidt et al. 2014).
For GDN and GDS, we use the publicly available catalog by 3DHST (Skelton et al. 2014). The photometric catalog consists of fluxes, spectroscopic/photometric redshifts, rest-frame colors, and stellar mass based on data taken in CANDELS and 3DHST (Grogin et al. 2011; Koekemoer et al. 2011; van Dokkum et al. 2013a; Skelton et al. 2014; Momcheva et al. 2016), as well as photometric fluxes obtained in ground-based surveys. We use HST photometry for optical/near-IR range, ground-based KS-band photometry, and Spitzer IRAC photometry in this study. Other ground-based fluxes listed in the 3DHST catalog, most of which are in the optical range, are not used, as the deep HST photometry is sufficient to constrain optical SEDs.
Broadband fluxes of HST are measured in a fixed aperture (r = 07) and then scaled to the total flux by multiplying C = fAUTO/faper, where fAUTO is the AUTO flux of SExtractor (Bertin & Arnouts 1996), as in Skelton et al. (2014) and Morishita et al. (2017).
From these photometric catalogs, we choose those that satisfy , 1.6 < z < 3.3, and . We also apply (U − V)rest > 1 mag to select quiescent galaxies. By setting a slightly bluer color for U − V than in the literature (∼1.4 mag), our sample also contains quenching galaxies. With the criteria, we found 17, 51, and 73 galaxies in M1149, GDN, and GDS, respectively, as an initial photometric sample (Figure 1).
2.2. HST Grism Spectrum
In M1149, the grism data were taken through GLASS (Schmidt et al. 2014; Treu et al. 2015). GLASS is a spectroscopic survey with HST/WFC3 G102 and G141 grisms (10 and 4 orbits, respectively). In addition, we supplement with the follow-up HST-GO/DDT campaign (Proposal ID 14041; PI: P. Kelly) of the multiply imaged supernova SN Refsdal (Kelly et al. 2015, 2016), which adds another 30 orbits of G141 data to the original GLASS observation.
In GDN/GDS, we retrieve the public data through MAST. In addition to the 3DHST data (van Dokkum et al. 2013b; Momcheva et al. 2016) that cover entire CANDELS's GDN/GDS fields, we add those taken in FIGS (13779; PI: S. Malhotra Pirzkal et al. 2017), CLEAR (14227; PI: C. Papovich, also Estrada-Carpenter et al. 2018), and other follow-ups (12099 and 12461, PI: A. Riess; 12190, A. Koekemoer; 13420, PI: G. Barro; 13871, PI: P. Oesch).
We extract 1D spectra from all fields in a consistent way by using the latest version of Grizli (Brammer 2018). During the extraction, the code automatically models neighboring objects, which are flagged in pre-provided SExtractor segmentation maps, and produces clean spectra for a target galaxy. The clean, optimal extracted spectra from each position angle (PA) are then stacked in a refined wavelength grid of 45 Å pixel–1. The pixel scale is slightly finer than the Nyquist sampling of the G141 grism, since we have many samplings over different orbits, each of which slightly shifts in the dispersion direction. Each spectrum is convolved with the image of the source to match the morphological difference in different PAs.
For the aperture correction of broadband photometry, we match the pseudo-broadband flux extracted from grism spectra by convolving with the corresponding filters (F140W/F105W) to the observed broadband flux (Section 3.2).
In addition to the random uncertainty in flux, we also estimate the uncertainty associated with the stacking of different PAs by following Onodera et al. (2015), and we integrate this to the random noise in quadrature for conservative estimates. The uncertainty accounts of ∼20% of the random uncertainty. Signal-to-noise ratios (S/Ns) of the final 1D spectrum range up to ∼50. Median values of each spectral element are S/N ∼ 18 at 4200–5000 Å and ∼4 at 3400–3800 Å (Table 1).
Table 1. Summary of Physical Parameters
Obj. ID | R.A. | Decl. | zgrism | AV | U − V | V − J | S/Na | tG102b | tG141b | ||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
(deg) | (deg) | (M⊙) | (Z⊙)c | (Gyr) | (mag) | (mag) | (mag) | Blue | Red | (s) | (s) | ||
MACS J1149.6+2223d | |||||||||||||
00141 | 1.77403e+02 | 2.24185e+01 | 4.2 | 24.5 | 9529 | 75987 | |||||||
00227 | 1.77407e+02 | 2.24162e+01 | 6.8 | 17.4 | 19758 | 80399 | |||||||
GOODS-North | |||||||||||||
06215 | 1.89029e+02 | 6.21726e+01 | 2.8 | 8.5 | 5011 | 6117 | |||||||
07276 | 1.89306e+02 | 6.21791e+01 | 6.1 | 14.1 | 5011 | 5011 | |||||||
11470 | 1.89066e+02 | 6.21987e+01 | 3.0 | 20.1 | 10023 | 8723 | |||||||
17599 | 1.89121e+02 | 6.22289e+01 | 3.5 | 20.6 | 33482 | 39394 | |||||||
17735 | 1.89061e+02 | 6.22290e+01 | 3.8 | 23.5 | 5011 | 8123 | |||||||
19341 | 1.89087e+02 | 6.22367e+01 | 4.1 | 31.1 | 5011 | 38494 | |||||||
19850 | 1.89090e+02 | 6.22392e+01 | 2.1 | 22.2 | 5011 | 38494 | |||||||
22774 | 1.89128e+02 | 6.22537e+01 | 2.2 | 18.4 | 4811 | 31476 | |||||||
23006 | 1.89351e+02 | 6.22547e+01 | 2.8 | 8.3 | 5011 | 4911 | |||||||
23249 | 1.89064e+02 | 6.22560e+01 | 2.9 | 7.7 | 5011 | 15841 | |||||||
24033 | 1.89115e+02 | 6.22594e+01 | 3.2 | 29.2 | 4811 | 39494 | |||||||
33780 | 1.89202e+02 | 6.23172e+01 | 2.9 | 14.8 | 33282 | 14635 | |||||||
GOODS-South | |||||||||||||
09704 | 5.32857e+01 | -2.78641e+01 | 12.5 | 17.4 | 98073 | 4711 | |||||||
23073 | 5.31231e+01 | -2.78034e+01 | 5.4 | 11.9 | 0 | 9423 | |||||||
24569 | 5.31588e+01 | -2.77972e+01 | 1.0 | 22.6 | 103246 | 23358 | |||||||
31397 | 5.31410e+01 | -2.77667e+01 | 5.9 | 32.3 | 0 | 21552 | |||||||
39364 | 5.30628e+01 | -2.77265e+01 | 8.9 | 23.6 | 27270 | 8923 | |||||||
41021 | 5.31874e+01 | -2.77192e+01 | 4.9 | 15.7 | 0 | 4711 | |||||||
41148 | 5.31279e+01 | -2.77189e+01 | 3.4 | 10.6 | 23058 | 4611 | |||||||
41520 | 5.31527e+01 | -2.77163e+01 | 4.7 | 10.2 | 27470 | 4711 | |||||||
43005 | 5.31085e+01 | -2.77101e+01 | 3.9 | 13.1 | 23058 | 9323 | |||||||
43114 | 5.30624e+01 | -2.77069e+01 | 25.2 | 49.4 | 27270 | 7720 |
Notes.
aAverage S/Ns of grism spectral element measured at blue (3400–3800 Å) and red (4200–5000 Å) wavelength ranges. bTotal exposure time in G102 and G141 observations. cZ⊙ = 0.0142 (Asplund et al. 2009). dStellar masses are corrected for magnifications by the foreground cluster.Download table as: ASCIITypeset image
2.3. Additional Photometric Data
In addition to photometric fluxes collected in 3DHST, we add WFC3/UVIS photometry. The rest-frame UV coverage is important to constrain SEDs with/without the UV upturn, which depends on metallicity (e.g., Yi et al. 1997; Treu et al. 2005a). We use WFC3/UVIS images from the HDUV legacy survey (Oesch et al. 2018) that cover parts of the GDN/GDS fields with the F275W and F336W filters. The UVIS images in the GDS consist of the previous data taken in the UDF by the UVUDF team (Teplitz et al. 2013). We run SExtractor on the public imaging data to conduct photometry and use the flux measured in a fixed aperture of 072 diameter.
3. Spectrophotometric SED Fitting
3.1. Basic Templates
Our SED fitting method (Grism SED Fitter, gsf12 ) is based on the canonical template fitting, where the best-fit parameters are determined by minimizing the residuals of observed and model SEDs. One major difference from most other works is the way we construct the model templates. The SED templates are often constructed with a functional form for SFHs, such as the exponential declining model, , where A, T0, and τ are free parameters. However, it is known that such a simplification may not represent real galaxy SFHs by observations (e.g., Pacifici et al. 2016; Iyer & Gawiser 2017) and simulations (e.g., Diemer et al. 2017). As such, here we avoid any functional forms and adopt an alternative method to generate model templates. The core of the method is to find the best combination of amplitudes, {ai}, for a set of composite stellar population templates of different ages, {ti}, that matches the data, as previously performed by Morishita et al. (2018). This type of SED modeling has been used in previous studies (Heavens et al. 2004; Cid Fernandes et al. 2005; Panter et al. 2007; Tojeiro et al. 2007; Kelson et al. 2014; Dressler et al. 2018), some of which demonstrated its strength and validity with intensive simulation tests.
To generate the template with different parameters, we use the flexible stellar population synthesis code (FSPS; Conroy et al. 2009; Conroy & Gunn 2010; Foreman-Mackey et al. 2014) to generate ith templates with ages of ti, based on MIST isochrones (Choi et al. 2016) and the MILES stellar library. As found by Morishita et al. (2018), different isochrones may return different results, in addition to a systematic difference in assumed metallicities.
We set the number of age "pixels" to 10, with [0.01, 0.03, 0.1, 0.3, 0.5, 0.7, 1.0, 1.5, 2.0, 3.0] Gyr (Figure 2), doubling the number from those adopted in Morishita et al. (2018). While we set the equal width of the template in lognormal space (∼0.5) following previous studies (e.g., Cid Fernandes et al. 2005), we added extra bins at intermediate age, where most of our galaxies are located, to increase the flexibility of SFHs.
Download figure:
Standard image High-resolution imageThe template is generated by assuming a short constant SFR within each bin width (∼30 Myr), rather than a simple stellar population (SSP). The reason we do not adopt the SSP model is that, while it is simple, it is unrealistic for real galaxies. Changing the width of constant star formation in each bin would result in a minor but systematic shift in reconstructed SFHs. The uncertainty in bin width is considered in calculation of parameters (e.g., age) by randomly fluctuating values within the width.
We also set the metallicity of each age pixel as a free parameter in a range of , as opposed to one global value in Morishita et al. (2018). While the determination of metallicity at each age pixel (i.e., metallicity histories of individual galaxies) is more challenging (see Appendix A), this gives extra flexibility in fitting templates and allows a reasonable estimate of uncertainty in SFHs (Section 5.3).
It is noted that metallicity-sensitive lines (such as Fe and Mg) are not measured at our spectral resolution. Our method instead relies on the entire spectral shape, with grism spectra and wide broadband photometry, that spans from the NUV, to the optical (that are sensitive to age), to the NIR (to metallicity) wavelength range (see Figure 2).
Templates generated with MIST are uniformly set to the solar-scaled abundance (Asplund et al. 2009; i.e., [α/Fe] = 0). It is noted that galaxies at high z may have an α-enhanced chemical composition (e.g., Onodera et al. 2015; Kriek et al. 2016), as found in local early-type galaxies (e.g., Thomas et al. 2005; Walcher et al. 2015). In fact, enhancement of the α-element has a similar effect as that of iron in UV and NIR continuum slopes (e.g., Vazdekis et al. 2015), while our low-resolution spectra cannot capture a detailed difference in each absorption line (i.e., Lick indices), and both abundances are degenerated in our total metallicity measurement, .13 As such, our total metallicity should remain similar to those with, e.g., α-enhanced templates (see also Walcher et al. 2009).
We assume a Salpeter (1955) initial mass function and Calzetti et al. (2000) dust law, where the dust attenuation, AV, is a global parameter that is applied to all age templates equally. Redshift is set as a free parameter at this step but within the 3σ range estimated in the previous step. In sum, the fits have 10 × 2 + 1 + 1 = 22 free parameters.
The degree of freedom of our fitting is worth noting. The number of spectral data points for each of our galaxies is >200 (with ∼16 for broadband photometric data points), where the spectral element is set to 45 Å in this study. Considering the correlation due to morphology (which is ∼104 Å for the mean size of our galaxies, ) and the large number of parameters, our spectra still have ∼100 independent data points.
Our updated method here has a few advantages over Morishita et al. (2018). First, it is more flexible than an a priori assumption of the SFH and robust to systematic bias in derived parameters (e.g., Wuyts et al. 2012). Second, it is flexible to a complex shape of SFHs, such as those with multiple bursts and sudden declines (e.g., Boquien et al. 2014). An SFH with multiple peaks cannot be reproduced by the exponential declining model (see Section 4). Third, it is flexible to the metallicity evolution. Methods with functional forms often have a fixed metallicity over the entire history. In our method, each template at a different age has a flexibility in metallicity as a free parameter that also provides metallicity enrichment histories, though the uncertainty in each age is typically large for most of our data sets in this study (Appendix A).
3.2. SED Parameter Exploration
The combination of templates is controlled by changing each amplitude, ai, as free parameters during the fit. A challenging part is the large number of parameters (over a dozen, compared to parameters with functional SFHs), which could be trapped in local minima. To sufficiently, yet efficiently, explore the parameter space, we adopt the Markov chain Monte Carlo (MCMC) method.
The fitting process of gsf is twofold: (1) initial redshift determination based on visual inspection of absorption lines and (2) MCMC realization to estimate the probability distribution for all parameters.
First, gsf determines the redshift by fitting the model templates to the observed grism spectrum. At this point, it only generates model templates in the wavelength range of grism spectra to minimize the computational cost. The templates are convolved to the resolution of the spectra with a Moffat function derived from the observed source morphology for spectra. It searches the best-fit redshift by minimizing χ2, as well as visual inspection, to avoid catastrophic errors. During the visual inspection, we rely on the major absorption lines in the observed range (i.e., Hδ, Hγ, Hβ), and thus those without clear absorption features (i.e., low S/N) are discarded here. At this step, gsf also determines the scale of the G102/G141 spectra so that each matches to the broadband photometry in F105W and F140W at a given template. The added scale for our sample is small () thanks to the accurate sky-background estimation in Grizli.
Then, gsf generates a template library at the redshift determined in the previous step and fits SEDs at the entire wavelength range. Here gsf fits the observed spectra and broadband photometry simultaneously by using emcee (Foreman-Mackey et al. 2013), as in Morishita et al. (2018). Redshift is also explored at this step by shifting and refining the template wavelength grid at a proposed redshift of each MCMC step. Emission lines, if detected, are masked during the fit. Those lines are modeled with a Gaussian function after subtracting the best-fit SED template to estimate the line flux and equivalent width (EW; Section 4.2).
We set the number of walkers to 100 and the number of realizations, Nmc, to 105. We adopt a uniform prior for each parameter over the parameter ranges, , , and AV/mag . The effect of the amplitude prior (i.e., SFH; see Figure 9) should be minimal due to the wide constrained range, as seen in the simulation in Appendix A. While some previous studies set a prior in metallicity histories from the local mass–metallicity relation (e.g., Pacifici et al. 2016; Leja et al. 2018; i.e., increasing metallicity as a function of time), we find that our result reproduces this behavior without such priors. However, the age–metallicity degeneracy could also mimic this trend. The test using an artificial SED data set in Appendix A revealed that, while the trend is fairly reproduced, nonnegligible scatter occurs in derived metallicity at each age pixel, and the metallicity histories of individual galaxies remain less promising (see also below).
During the fit, we let emcee run with the parallel tempering sampling, with ntemp = 5. With this, emcee samples the parameter spaces but with ntemp samplers in parallel. Each sampler has a different value for the temperature parameter in a Metropolis–Hastings step; i.e., a higher temperature makes a larger step in a parameter space. With this, the sampler suffers less from the local minima. We note, however, that a sufficient number of MCMC realizations (which is inferred from a simulation test for our case) is necessary to estimate reliable uncertainties. Otherwise, the derived uncertainties would become inappropriately small, with possible biases in the best-fit values (see Cid Fernandes 2018).
The first half realization of the sampled chain is discarded to avoid biased results from initial input values. We take the 50th and 16th/84th percentiles of marginalized distributions as the best fit and uncertainty range.
To check the reliability of the SED parameters and reconstructed histories, we conducted a simulation test with a mock data set (Appendix A). From the test, we found that the scatter in the reproduced amplitude and metallicity at each age bin is ∼0.5 and ∼0.25 dex, respectively. To account for this, we add the estimated scatter to the reconstructed star formation and metallicity histories, which are also propagated to other SED parameters (Section 4.3).
4. Results
4.1. 24 Galaxies as the Final Sample
In Figure 1, we summarize the basic parameters of the final sample galaxies. The final sample consists of 2, 12, and 10 galaxies from M1149, GDN, and GDS, after the rest of the initial samples are visually discarded because of poor redshift fitting quality (Section 3.2). Due to the partial coverage of the grism observations, as well as random contamination from neighboring galaxies, our samples have nonuniform exposure times in the grism observation and S/N (Table 1). The two galaxies in M1149 are those previously reported by Morishita et al. (2018).
In panels (a) and (b) of Figure 1, we plot the distribution of our sample galaxies in redshift–stellar mass/F140W–magnitude spaces, respectively. Due to the increasing sensitivity of the G141 grism with wavelength, galaxies at higher redshift are fainter in F140W than those at lower redshift. In (c), we show the UVJ color–color diagram for diagnosing galaxy quiescence (e.g., Williams et al. 2009). While most of the sample locates within the passive category, there are six galaxies that fall below the passive/star-forming boundary in U − V. These galaxies are in transition between star-forming () and passive (), akin to the green valley (or "quenching") galaxies in the local universe (Kauffmann et al. 2003; Schawinski et al. 2014).
The characteristic age, which is about the time since a system reached its half-mass, of each galaxy represents the mass-weighted value from the reconstructed SFH,
where ti is the median age, ai is the best-fit amplitude, and Ψi is the mass-to-light ratio of the ith template. An estimated error for each amplitude from the simulation test (Appendix A) is added in quadrature. The typical error in mass-weighted age is ∼0.2 dex.
The derived parameters are summarized in Table 1. It is noted that the estimated errors in stellar mass are larger (∼0.2 dex) than those listed in the original catalogs ( dex; Skelton et al. 2014). This is due to the fact that our fitting accounts for an additional uncertainty originated in the flexibility of SFHs. While one can implement this by, e.g., repeating a SED fitting analysis with different SFHs (Wuyts et al. 2012; Morishita et al. 2015), the stellar-mass measurement in the original catalog (as well as many others) is based on one functional form for SFHs; thus, the quoted errors are solely from photometric error and redshift (see also Appendix B).
4.2. Diagnostics from the SED Shape
In the left panels of Figure 3, we show the SEDs of the sample galaxies with the best-fit templates. Our galaxies are well characterized with k + a (Dressler & Gunn 1983) and quiescent spectra, as is expected from the sample selection and redshift range. Deep spectra successfully capture the spectral features of these types of galaxies, such as absorption lines and the 4000 Å/Balmer break. The wide broadband coverage well captures the spectral features, such as a blue UV slope from a young population (∼1 Gyr) and near-IR excess from an old population ( Gyr), that is consistent with the derived mass-weighted age (T*; see Table 1).
Six galaxies have moderately detected (∼1.5σ) weak emission lines, such as [O ii], Hδ, Hγ, and Hβ, which is a signature of ongoing star formation. We first fit each emission line with a Gaussian after subtracting the best-fit spectrum. The EW is then measured with the total flux from the Gaussian fit and the best-fit template as a continuum. For the [O ii] line, we use Equation (3) in Kennicutt (1998) to estimate the SFRs. For the Hβ line, we use a recombination coefficient in Case B (Osterbrock 1989) and then Equation (2) in Kennicutt (1998). These weak emission lines indicate a specific SFR (SFR/M*) of yr−1. While the detection is tentative, the low level of star formation activity is also observed in previous findings (e.g., Belli et al. 2017), possibly providing more detailed pictures of quenching mechanisms (Section 5). Two of the emission-detected galaxies (IDs 19341 and 19850) have strong [O iii] lines (4959 + 5007 Å) with a relatively weak Hβ line, suggesting the existence of active galactic nuclei (AGNs). While and N ii are beyond our wavelength coverage, the line ratio of the and [O iii] lines ( and 0.13 ± 0.05) implies that these galaxies are AGNs in the mass-excitation diagram (Juneau et al. 2014).
On the other hand, we find six galaxies that consist of very old populations with mass-weighted ages and dominate the high-mass end of our sample. While such massive galaxies are rare (n ∼ 3 × 10−5 Mpc−3; Muzzin et al. 2013; Tomczak et al. 2014), it is also true that some ancient galaxies that formed a long time ago have more chances to experience ex situ processes, e.g., merger and gas accretion, especially at this high redshift. Such ancient galaxies would be smuggled into the younger population and become indistinguishable when seen with, e.g., a light-weighted age. Our reconstructed SFHs have the ability to investigate this.
4.3. SFHs
In the right three columns of Figure 3, we show the reconstructed SFHs, mass accumulation histories, and metallicity enrichment histories for our sample galaxies. We reconstruct SFRs in each time bin by dividing the amount of stellar mass formed (including the lost mass by the time of observation) by the bin length. Thus, the SFRs at each bin represent its average values over time (∼30 Myr for the youngest template to ∼1 Gyr for the oldest one). It is also noted that the derived SFR cannot distinguish between in situ (stars formed in the system) or ex situ (those obtained via mergers).
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageSome galaxies are worth highlighting. For example, ID 43114, the most massive galaxy in our sample (), had already formed about 50% of the final mass at (∼2 Gyr ago). The galaxy was at low star formation activity for ∼1.5 Gyr and then started active star formation (∼1000 M⊙ yr−1) at z ∼ 2.5, ∼300 Myr ago. The significant star formation is comparably as high as that of submillimeter galaxies at this redshift (Younger et al. 2007; Tacconi et al. 2008). In fact, its morphology shows a tidal feature with two close objects at the outer part, suggesting a recent (major) merger. Its star formation activity seen in the reconstructed SFH is consistent with a typical merger timescale at this redshift (Lotz et al. 2011; Snyder et al. 2017). Interestingly, the dust attenuation of this galaxy is relatively low (AV ∼ 0.3 mag) compared to typical submillimeter galaxies and starburst galaxies at this redshift (AV > 3 mag; Riechers et al. 2013; Toft et al. 2014), suggesting that post-processes might have cleared a large amount of dust.
Other galaxies with clearly disturbed morphology (e.g., IDs 09701 and 00141) also show recent intense star formation at before tobs that may provide an independent constraint on the merger timescale and induced star formation activity from follow-up kinematical studies.
Interestingly, many of our galaxies show extended star formation activity to ∼0.3 Gyr before their observed redshifts. This differs from the previous understanding of massive early-type galaxies, whose star formation activity was believed to decline rapidly or become truncated after forming the bulk of the stars in a very short time. While our sample size here is too small to generalize this (and is also possibly biased toward compact morphology), it is curious to see how previously adopted functional form SFHs behave toward such an extended feature, as well as other features like dual-peak SFHs.
We repeat the same analysis but with functional forms of SFHs to compare with our SFHs. In Figure 3, we show the best-fit SFHs obtained with a functional form for the SFH τ model with . In addition to τ and t0, we allocate one parameter for metallicity and one for dust attenuation. In most cases, it is clear that SFHs derived from gsf cannot be reproduced by the τ model. For example, the τ model cannot capture the dual peaks observed in some of our galaxies (e.g., ID 43114). The τ model also fails to capture extended star formation at both the young and old age sides. This is due to the fact that the functional SFH is light-weighted, where the best-fit parameters are more sensitive to differential amounts of light. Such a qualitative discrepancy in fact results in a quantitative discrepancy in the best-fit parameters, with χ2/ν systematically larger than gsf (see also Carnall et al. 2018, who argued limitation by a functionally defined SFH). Appendix B summarizes the comparison of the τ model, as well as results with the delayed τ model.
The metallicity histories shown in Figure 3 represent mass-weighted accumulated metallicity,
where i covers older age pixels than j. Individual metallicity values at each age pixel often suffer from large uncertainty (∼0.25 dex; Appendix A), while this is partially attributed to a small amount of light in those age pixels (i.e., small ai). We therefore avoid discussing metallicity enrichment histories of individual galaxies and instead focus on (more robust) total metallicity in this study (Section 5.3). It is noted, however, that having parameters for metallicity at each age bin allows flexibility in fitting and a more reasonable estimate (i.e., larger error bars) in SFHs and SED parameters.
5. Discussion
5.1. Timescale of Star Formation
In Figure 4, we summarize the mass accumulation histories as a function of lookback time from the observed redshift. Most of our sample galaxies formed >50% of their extant mass by ∼1.5 Gyr prior to the observation (z ∼ 2.5–5, depending on the observed redshift), which is quantitatively consistent with recent studies at similar and higher redshift (Domínguez Sánchez et al. 2016; Estrada-Carpenter et al. 2018; Schreiber et al. 2018; Belli et al. 2019).
Download figure:
Standard image High-resolution imageWe estimate the half-mass time, t50 (lookback time from the observed redshift), from individual SFHs in the right panel of Figure 4. It is estimated in each step of the MCMC, and thus its uncertainty represents those in SFHs and individual age bin widths. For some galaxies, we only estimate the lower limit, as >50% of stellar mass is in the oldest template. Higher-resolution spectra by, e.g., the James Webb Space Telescope are required to reveal ancient histories at a higher time resolution.
Still, we see a trend where more massive galaxies form earlier, known as downsizing (Cowie et al. 1996; Heavens et al. 2004; Treu et al. 2005b), with a linear fit of Gyr−1 . The measured standard deviation (∼0.16 Gyr) is comparable to the redshift range of our galaxies ( Gyr). The fact that the downsizing trend exists in the early time of the universe provides hints about the galaxy evolution at even earlier epochs, when they were star-forming galaxies, and how observed luminous galaxies form (Zitrin et al. 2015; Oesch et al. 2016) in relation to, e.g., their environments (Harikane et al. 2019).
5.2. Stellar Mass–Metallicity Relation at z ∼ 2
The stellar mass–metallicity relation is a key diagnostic of galaxies' chemical and mass maturation histories. The relation encodes the coevolution of stellar mass and chemical enrichment among galaxies and provides an independent clue to the past evolution of SFHs. The relation is known to hold from the local universe (Gallazzi et al. 2005; Panter et al. 2008; González Delgado et al. 2014) in a wide range of mass (Kirby et al. 2013), up to z ∼ 1 (Choi et al. 2014; Gallazzi et al. 2014; Leethochawalit et al. 2018). Beyond the redshift, however, it is still studied with a small sample of galaxies and is not clear (Onodera et al. 2012, 2015; Kriek et al. 2016; Morishita et al. 2018). While the relation in gas-phase metallicity at may suggest that the relation may remain universal to a higher redshift (Tremonti et al. 2004; Kewley & Ellison 2008; Maiolino et al. 2008; Mannucci et al. 2010; Yabe et al. 2014; Zahid et al. 2014; Onodera et al. 2016; Wang et al. 2017), the observed scatter is still large due to a selection bias, different tracer of metallicity, and different physical state of gas-phase metallicity (e.g., Andrews & Martini 2013). Here we overview the relation at z > 1.6 for the first time.
In Figure 5, we show the distribution of our galaxies in the stellar mass–metallicity plane. The metallicity here is a mass-weighted value, as for the age (Equation (1)). While no significant mass dependency of the metallicity is observed, this is because our galaxies occupy the high-mass end and do not span a wide mass range. In fact, the flattening behavior at a high-mass range is consistent with the relation at lower redshift (Gallazzi et al. 2005, 2014). The observed metallicity is significantly high (median of ) and tight (scatter of ∼0.15–0.25 dex around the median), which is comparable to the observed scatter in the local relation.
Download figure:
Standard image High-resolution imageWhile challenging, it is still of particular interest to compare our metallicity measurement with the local relation. To do this, we need to calibrate the absolute value of metallicity. Although Gallazzi et al.'s (2005) stellar-phase metallicity measurement is based on the total metallicity, as in this study (as opposed to an element abundance–based measurement), there is a systematic difference due to the adopted isochrones (MIST versus Padova), each of which has a different definition of solar metallicity (Z⊙ = 0.0142 versus 0.0190). We correct this by applying a +0.15 dex offset to the Gallazzi et al. (2005) measurement.
Another systematics is the α-abundance of the template. While the templates used in these studies are set to the solar composition (Choi et al. 2016), it is not clear, due to the low resolution of our spectra, if this metallicity is enhanced by α-elements, despite its completely different origin from iron (e.g., Thomas et al. 2005). In fact, Leethochawalit et al. (2018) recently found an ∼0.16 dex decrease in the iron abundance of massive galaxies at z ∼ 0.4 compared to the local value. Given the time of the universe (when iron was relatively deficient) and short timescale of star formation for our galaxies (where α-elements are enhanced), the high values may represent the α-enhancement, as is the case for the galaxy (Onodera et al. 2015; Kriek et al. 2016). However, due to the definition of the total metallicity used here and Gallazzi et al. (2005), [Z/H] ∼ [Fe/H] + 0.94 [α/Fe] (Thomas et al. 2003), changing the template to the α-enhanced ones should result in minor differences in this comparison.
While keeping these systematics in mind, we find that most of our galaxies are already on the local relation, with a median measured for the entire mass range of . The scatter around the median is revealed to be small (). This implies that chemical enrichment of at least some massive galaxies have already been completed within the first ∼3 Gyr of the universe. We revisit this in the following section.
Two galaxies fall below the median relation (IDs 00227 and 24569). The former, which was reported and discussed in Morishita et al. (2018), shows rather extended SFHs with small metallicity values over the entire history. According to its undisturbed morphology and gradual mass increase, seen in Figure 3, accretion of low-mass satellites (i.e., metal-poor) or late-time star formation triggered by the infall of pristine gas may explain the observed properties, rather than more dramatic episode involving, e.g., major merger. Detailed investigation of its inner structure and chemical composition at a higher angular resolution would provide further insight into its enrichment evolution (e.g., Abramson et al. 2018; Wang et al. 2018).
The other galaxy, on the other hand, shows a rapid assembly of mass. Since the galaxy formed a large fraction (∼50%) of its current mass at , its low metallicity is consistent with the cosmic metal enrichment (e.g., Lehner et al. 2016), as well as an observed rapid decrease in gas-phase metallicity at a given mass (Troncoso et al. 2014; Onodera et al. 2016). Given the time left to z ∼ 0, metal-poor galaxies like ID 24569 would possibly be enriched in metallicity by, e.g., mergers and recycled gas and may sneak into the local average population (see discussion in Morishita et al. 2018).
It is noted that the systematic uncertainty in α-enhancement would not explain the small value in , as both iron and α-abundances need to be significantly low.
5.3. Redshift Evolution of Stellar-phase Metallicity
While our mass–metallicity relation indicates that galaxies are already enriched to the value at present day (Figure 5), their origin and observed scatter, especially those with small metallicity, are yet to be investigated. We investigate the redshift evolution of total metallicity as a population by considering the formation time (zform), which is derived with the mass-weighted age and observed redshift, (i.e., lookback time to the half-mass time).
In Figure 6, we show the distribution of metallicity as a function of Tform. A clear correlation between Tform and observed metallicity is observed. A linear regression reveals a slope of dex Gyr−1, with a standard deviation of ∼0.15 dex. Our result shows the metallicity enrichment happening in this class of massive galaxies, whose metallicity already reaches the local value at z ∼ 3, for the first time.
Download figure:
Standard image High-resolution imageOne may suspect that this is an artifact from the age–metallicity degeneracy. While some galaxies show a weak correlation between the two parameters, the degree of correlation is much smaller than the quoted error bars in Figure 6. Our simulation test also revealed that the total metallicity/age are reproduced reliably enough for the observed trend (see Appendix A).
Interestingly, the linear regression suggests that the metallicity even exceeds the local value of the most massive galaxies in Gallazzi et al. (2005) by ∼0.1 dex. As noted before, our sample galaxies are biased to compact, high-density galaxies due to the high-S/N requirement for the SED fitting. While compact massive galaxies are rare at z ∼ 0 (Taylor et al. 2010; but see also Poggianti et al. 2013), following events such as minor mergers/second bursts in the following ∼10 Gyr would resolve the tension. We discuss this in the following section.
In Figure 6, we also plot gas-phase metallicity measurements of star-forming galaxies in a similar redshift range for comparison. We use the formula derived in Maiolino et al. (2008) at the same mass (). We match the gas-phase metallicity measurement to the stellar metallicity at z ∼ 0. While comparing the absolute values of two different metallicities is extremely challenging due to a number of uncertainties in each measurement (e.g., Sánchez et al. 2017; Bian et al. 2018), the matching process is reasonable for our purpose here, i.e., comparison of relative differences at z ∼ 0 and 2.
It is interesting to find an offset of ∼0.2–0.3 dex between those metallicity measurements in our redshift range that may give us a clue to how massive quenched galaxies enrich metallicity in this redshift. Assuming those star-forming galaxies are star-forming counterparts of our passive galaxies, the observed gap has to be resolved between the half-mass time (i.e., T* ago) and the time when they are observed as quenched galaxies.
One possible explanation is the continuation of low-level star formation activity in a closed box, or "strangulation" (Larson et al. 1980; Peng et al. 2015). Peng et al. (2015) demonstrated in their chemical evolution model that the observed offset in metallicity of local massive star-forming and passive galaxies (∼0.1 dex) can be explained by strangulation. Rapid cessation of star formation by AGN/stellar feedback would instead reproduce a similar metallicity for the two populations. Recent observations, in fact, revealed low star formation activity (e.g., Belli et al. 2017; Gobat et al. 2017), as well as remaining gas (Gobat et al. 2018, but also Sargent et al. 2015; Bezanson et al. 2019), in massive quiescent galaxies at z ∼ 2, implying the continuation of star formation after they have formed a large amount of stars, or quenched. In addition, the closed-box enrichment seems to be a good agreement with the observed individual SFHs (Figure 3), independently supporting our speculation here.
However, more detailed chemical modelings would be required to reach a conclusion. For example, the observed gap may also be attributed to the dilution of gas-phase metallicity by infalling pristine gas, while there is no gas infall in quenched galaxies due to virial shock heating (e.g., Birnboim & Dekel 2003). If this is the case, it is suggested that galaxy quenching may be largely caused by termination of gas infall (e.g., Feldmann & Mayer 2015), while it is not likely that cutting the gas supply would result in extended SFHs as we observe here. A sophisticated chemical modeling with a panchromatic data set, including gas-mass measurements, would be required for further understanding.
5.4. Following Evolution to z ∼ 0
We have found that our galaxies are already enriched in metallicity, located on the local mass–metallicity relation. Given the amount of mass and its quiescence, it is within our interests to investigate how these galaxies will evolve to the local population. While it is challenging to predict their exact descendant population at low redshift (as described in the introduction), it is still worth describing their possible paths and mechanisms.
In particular, many members of our sample are compact, high-density galaxies ( possibly due to the selection bias toward high S/Ns). Compact galaxies at these redshifts are often debated in terms of size evolution, where the observed size is ∼3–5 times smaller than galaxies at z ∼ 0 with similar masses (Trujillo et al. 2007; Morishita et al. 2014; van der Wel et al. 2014). While there is still much debate as to whether (all of) these galaxies would follow such a significant size evolution (Newman et al. 2012; Nipoti et al. 2012; Poggianti et al. 2013; Belli et al. 2017), minor merger is a popular mechanism that can efficiently increase their sizes (e.g., Hopkins et al. 2009; Naab et al. 2009; Oser et al. 2010; van Dokkum et al. 2015; Morishita & Ichikawa 2016).
The scenario appears to consistently work for our result of metallicity, where accretion of low-mass galaxies (which are less metal-enriched, as expected from the mass–metallicity relation) would dilute the system's metallicity to the consistent value. For example, approximately five minor mergers, with 1/10 the mass of the host and metallicity inferred from the relation at z ∼ 2, would lower the host metallicity by ∼0.1 dex, being consistent with the local value. The metallicity gradient observed at z ∼ 0 (e.g., González Delgado et al. 2014; Martín-Navarro et al. 2018) is independent evidence that such high-z metal-rich galaxies would become cores while the accreted component locates the outer part of local massive galaxies. The integrated metallicity is instead an average value of the whole system; thus, metallicities observed in the local relation should be lower than those observed at higher redshifts.
Infall of metal-poor gas associated with minor merging satellites (e.g., Torrey et al. 2012) or direct infall from the cosmic web (Dekel & Birnboim 2006) would also dilute the system's total metallicity by inducing the second burst. While it is not clear if the scenario reproduces the observed metallicity gradient at z ∼ 0, there is a large fraction of early-type galaxies that show evidence of ongoing star formation at intermediate redshift (e.g., Treu et al. 2005a; Kaviraj et al. 2011). Spatially resolved studies of such second burst galaxies will shed light on how these different scenarios contribute to the evolutionary path of massive galaxies at high redshift to the local counterpart.
6. Summary
We reconstructed SFHs of 24 massive, passively evolving galaxies at z ∼ 2. Our new SED modeling with gsf simultaneously fit slitless spectroscopic and photometric data taken from multiple surveys, with no functional assumption for SFHs. Our main findings are as follows.
- 1.Our massive galaxies have already formed >50% of their current mass by ∼1.5 Gyr prior to the epoch of observation, with a downsizing trend where more massive galaxies evolve earlier.
- 2.The SFHs reconstructed by gsf show a more extended feature than what is obtained with a τ-model fitting for most of the sample galaxies, indicating low-level star formation activity until recently, rather than abrupt cessation.
- 3.The stellar-phase metallicities of most of our galaxies are already compatible with local values, indicating a rapid metallicity enrichment associated with the early stellar-mass formation.
- 4.By using the reconstructed SFHs and inferred metallicity, we revealed a rapid metallicity enrichment of this class of massive galaxies at a rate of ∼0.2 dex Gyr−1 in from z ∼ 5.5 to 2.2.
- 5.While systematic uncertainties remain, the observed gap between the stellar- and gas-phase metallicities can be explained by continuation of a low level of star formation in quiescent galaxies and/or dilution of gas-phase metallicity due to the inflow of pristine gas to star-forming galaxies. The former scenario is consistent with the finding from individual SFHs.
We thank the anonymous referee for reading the manuscript carefully and providing constructive comments. We thank Marco Chiaberge, Colin Norman, Kartheik Iyer, Sara Ellison, and Susan Kassin for fruitful discussion. We thank Benedikt Diemer for providing star formation histories extracted from the Illustris simulation. We thank Pascal Oesch for providing HDUV data prior to the public release. Support for GLASS (HST-GO-13459) was provided by NASA through a grant from the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-26555. Support for this work is provided by NASA through a Spitzer award issued by JPL/Caltech, HST-AR-13235 and HST-GO-13177. M.T. acknowledges the support provided by the Australian Research Council Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D) through project No. CE170100013.
Software: Astropy (Muna et al. 2016), emcee (Foreman-Mackey et al. 2013), lmfit (Newville et al. 2017), Grizli (Brammer 2018), python-fsps (Conroy et al. 2009; Conroy & Gunn 2010; Foreman-Mackey et al. 2014).
Appendix A: Mock Simulation of SED Fitting
We test the fidelity of galaxy SFHs and other parameters with our SED fitting method.
A1. Simulation Setup
In this study, to explore the parameter spaces we are interested in (i.e., quenched galaxies at z ∼ 2), we set parameters as follows: redshift mass-weighted age T*/Gyr peaked at ∼1.5 Gyr; dust attenuation AV/mag with a flat distribution; and metallicity peaked at dex (top panels of Figure 7). The amplitudes of each template (ai; i.e., SFHs) are randomly assigned. The input SFHs are shown in Figure 8.
Download figure:
Standard image High-resolution imageDownload figure:
Standard image High-resolution imageThe mock SEDs are generated via FSPS (Conroy et al. 2009; Conroy & Gunn 2010) with the assigned parameters. While we provide SFHs at the same time resolution as the fitting templates, this turns out only a small effect, thanks to a sufficient number of age bins (see also Appendix C for the result with higher-resolution SFHs).
Broadband photometry is then extracted by convolving the mock SEDs with filter response curves. For grism spectra, we convolve the mock template with the observed line spread function (modeled with a Gaussian), which takes into account the morphology/instrumental convolution. The error of each spectral element and broadband photometry is randomly assigned based on the observed uncertainty of 24 galaxies in this study (∼5–20 pixel–1 at Table 1).
No emission lines are added, since our focus is the quenched/old galaxy population. In total, 600 mock sets of G102/G141 spectra+broadband photometry are prepared from the template generated with random sets of parameters. We follow the same fitting method as in the main text.
A2. Result of Global Parameters
Figure 7 shows the offset of output and input values (Δy = yout − yin) as a function of output value for the major parameters: stellar mass, mass-weighted age, dust attenuation, and metallicity of the mock galaxies. By taking output values (rather than input ones) in the x-axis, it is possible to infer the false-positive fraction at a given output (i.e., observed) value and also implement the scatter to the observed values for a more comprehensive estimate of the uncertainty.
We find excellent agreement in stellar mass with a scatter of ∼0.05 dex and moderate agreement in mass-weighted age, dust attenuation, and mass-weighted metallicity with scatters of ∼0.11, 0.14, and 0.13 around the median values. Median offsets are small for most of the parameter ranges. Mass-weighted age shows a negative slope, underestimating for ∼0.15 dex at . However, most of our sample galaxies dominate higher values, with a median of / Gyr ∼ 0.2 (Figure 4), where the bias in parameters is small; thus, we do not correct the offset for our galaxies in the main text.
A3. Result of SFHs
In Figure 9, we summarize the offsets of the output and input values for amplitude and metallicity at each age pixel to show the fidelity of SFHs and metallicity enrichment histories from all mock galaxies used here. While the offset and scatter may depend on parameter sets with different combinations, this suggests that SFHs can be determined with ∼0.5 dex accuracy.
Download figure:
Standard image High-resolution imageMetallicity shows a large scatter in reproduced values, with a standard deviation of ∼0.3 dex. Given the parameter range assigned for metallicity (), we conclude that determination of metallicity at each age pixel is challenging with the current data set. It is noted that this does not mean that reproduced total metallicity has comparable uncertainty, since part of the scatter can be attributed to the age pixel where the total contribution of light is small. Total metallicity, which is light- or mass-weighted, should remain less scattered ( dex), as shown in the main text and Section A.2.
A4. Age–Metallicity Degeneracy
Degeneracy between age and metallicity is notoriously known as one of the difficult aspects when modeling accurate SEDs from photometric data (Worthey 1994). The degeneracy is, however, resolved once one obtains information at both the optical and NIR wavelength ranges simultaneously (Figure 2; also de Jong 1996; Smail et al. 2001; Choi et al. 2016).
To see if this is the case for our data set, we first show in the left panel of Figure 10 a rest-frame NIR color– diagram, both of which are available with our data in this study. Rest-frame NIR color (H − K) and the strength of 4000 Å( Balogh et al. 1999) are calculated with templates used for fitting, which are convolved to a comparable resolution of grism data (R ∼ 150), including the convolution effect by source morphology. As we see in the figure, age and metallicity are nearly orthogonal in most of the parameter range, meaning the age and metallicity can be well separated from those measurements. The only exception is for old ( Gyr) and solar/supersolar metallicity, where the relation of the two measurements becomes less orthogonal. Due to this, the metallicities of our fitting typically have larger uncertainties for old populations.
Download figure:
Standard image High-resolution imageOne may notice that some parameters are not distinguished, especially by the reddening effect of dust (e.g., t = 0.3 Gyr with , AV = 0 versus t = 0.3 Gyr with , AV = 1.0 mag). However, we stress that our SED fitting is not relying on any specific colors or indicators but rather on all spectrophotometric information over the wide wavelength range. As an example, we show two spectral templates in the right panel of Figure 10 that are located at a similar position in the –color space. Despite this, the two templates are clearly distinguishable at the rest-frame optical-to-NIR wavelength range, where sufficient photometric data points are available in this study. Also, the photometric error of each flux measurement may affect the SED parameters, but the uncertainty is properly implemented in our fitting framework using MCMC and reflected in the uncertainty range of the posterior.
We also investigate the age–metallicity degeneracy with our mock data set. In Figure 11, we show the distribution of the offset in total mass-weighted age and metallicity for our mock galaxies. The distribution is symmetric in both axes, whereas the distribution will follow a negative slope if these parameters are degenerated. The distribution is scattered for ∼0.2 dex, which is consistent with those found in Section A2.
Download figure:
Standard image High-resolution imageFrom both tests here, we conclude that the data set used in this study can resolve the age–metallicity degeneracy for our moderately old galaxies ( Gyr), but star formation and metallicity histories become less certain beyond .
Appendix B: Comparison of SED Parameters Obtained with Functional SFHs
In Section 4.3, we see that our reconstructed SFHs capture the detail features of individual galaxy SFHs that are often missed with functional forms. While the deviation is clear in these comparisons, it is yet to be investigated how a different assumption of SFH results in SED parameters. Here we compare the best-fit parameters between two types of SFHs.
In Figure 12, we compare the goodness of fit, χ2/ν, and major parameters from SED fitting, i.e., stellar mass, light-weighted age and metallicity, and dust attenuation. These values are compared between those reproduced by gsf (main text) and functional ones. First, the goodness of fit is better with gsf for most of our galaxies, which is reasonable given the flexibility of its modeling. We also find that the τ model () is more sensitive to the light from young stellar populations, where the model systematically underestimates the system's ages for ∼0.3 Gyr, on average (top panels). The discrepancy propagates to other parameters, where we find overestimated dust attenuation (∼0.3 mag) and largely underestimated metallicity.
Download figure:
Standard image High-resolution imageThe discrepancy in age and dust becomes slightly smaller when the delayed-τ model () is used (bottom panels of Figure 12). This is shown in the goodness of fit, where the delayed-τ model results in smaller values of (see also Pacifici et al. 2016). This is partly attributed to its rising slope in SFHs, which makes the age slightly older and cancels out the discrepancy in other parameters. A large discrepancy in metallicity, however, still remains.
Despite this, it is interesting that the reproduced stellar masses are very consistent with each other, with only ∼0.1 dex scatter in among our sample. While it is challenging to comprehensively understand this agreement due to a large number of parameters here, this is partly due to the sufficient wavelength coverage of the rest-frame near-IR region, where the light from low-mass stars dominates and is less sensitive to other parameters (e.g., Bell et al. 2003). The contribution by other parameters (i.e., age/metallicity/dust) is canceled out due to partial degeneracy at a given form of SFH. However, this does not necessarily mean that the typical error in stellar mass remains comparably small. As shown in the main text and Appendix A, the stellar-mass measurement with gsf is accompanied by an ∼0.2 dex uncertainty that mainly originates from, e.g., systematics in estimating accurate SFHs and metallicity enrichment histories. (Therefore, the estimated uncertainty in stellar mass with functional form SFHs is much smaller here.) For this reason, we conclude that the stellar-mass measurement remains at least at ∼0.15 dex accuracy for our galaxies, and possibly for other types of galaxies, since the uncertainty comes from its assumption of SFHs and SED modeling. The best-fit parameters derived with the two functional SFHs are summarized in Table 2.
Table 2. Summary of Physical Parameters with Different SFHs
gsfa | Exponential Model | Delayed Exponential Model | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Obj. ID | χ2/ν | b | b | AV | χ2/ν | AV | |||||||
(Z⊙) | (Gyr) | (M⊙) | (Z⊙) | (Gyr) | (mag) | (M⊙) | (Z⊙) | (Gyr) | (mag) | ||||
00141 | 2.41 | 2.43 | 2.47 | ||||||||||
00227 | 1.75 | 3.52 | 3.95 | ||||||||||
06215 | 1.59 | 1.77 | 1.65 | ||||||||||
07276 | 1.54 | 2.58 | 2.50 | ||||||||||
09704 | 1.74 | 3.34 | 2.72 | ||||||||||
11470 | 1.88 | 2.65 | 5.59 | ||||||||||
17599 | 1.49 | 3.69 | 2.74 | ||||||||||
17735 | 1.70 | 6.68 | 2.24 | ||||||||||
19341 | 4.37 | 20.66 | 4.79 | ||||||||||
19850 | 1.87 | 4.63 | 2.43 | ||||||||||
22774 | 1.19 | 2.11 | 1.59 | ||||||||||
23006 | 1.06 | 1.52 | 1.34 | ||||||||||
23073 | 1.19 | 3.62 | 2.66 | ||||||||||
23249 | 1.48 | 1.52 | 1.45 | ||||||||||
24033 | 2.18 | 20.81 | 5.73 | ||||||||||
24569 | 2.66 | 6.03 | 5.16 | ||||||||||
31397 | 2.81 | 8.22 | 4.89 | ||||||||||
33780 | 1.72 | 2.95 | 1.81 | ||||||||||
39364 | 3.98 | 6.42 | 7.11 | ||||||||||
41021 | 2.86 | 3.22 | 3.00 | ||||||||||
41148 | 2.36 | 3.65 | 3.77 | ||||||||||
41520 | 1.94 | 2.88 | 2.71 | ||||||||||
43005 | 2.73 | 3.27 | 4.81 | ||||||||||
43114 | 8.08 | 12.98 | 17.93 |
Notes.
aStellar mass and dust attenuation for gsf are listed in Table 1. bLight-weighted metallicity and age, to match those derived with functional SFHs.Download table as: ASCIITypeset image
Appendix C: Simulation with Realistic SFHs
While our test with randomly generated SFHs provides a general idea of the goodness of gsf, there is still concern for how a specific type of SFH affects the output results. In particular, the random SFHs do not fully investigate the SFHs of quenched galaxies, i.e., the target galaxies of our study. Upon such a demand, here we repeat a similar fitting analysis as in Appendix A but with SFHs taken from a cosmological simulation, which gives us an idea of how well quenched galaxies are reconstructed within our framework.
In Figure 13, we compare the input and output histories for 10 galaxies. The set of galaxies is selected from the Illustris simulation (Nelson et al. 2015), with a similar mass to our galaxies () and quenched at the time of observation (SFR < 1 M⊙ yr−1 at z ∼ 2). The SFHs and metallicity enrichment histories are provided to the FSPS to synthesize the SEDs. From the generated SEDs, we extract fluxes corresponding to our grism elements (convolved with the morphology of one of our sources) and broadband filters. We then add noise with a conservative value of at 4200–5000 Å (and 15 as a supplemental test). We set the observed redshift uniformly to z = 2 and AV = 0.5 mag for the sake of simplicity, but this hardly affects the conclusion here.
Download figure:
Standard image High-resolution imageIn general, the posterior captures the feature of SFHs—the peak time of SFR and its length—and gives fairly good estimates of SED parameters: stellar mass, mass-weighted age and metallicity, and dust attenuation (bottom panel of Figure 13). There is a trend that gsf underestimates star formation in the oldest bin. This leads to underestimation in the mass-weighted age for old galaxies with , but this is only for a small amount (<0.1 dex). The offset seen in output metallicity is also small (∼0.1 dex) and hardly changes our conclusion in the main text, as the measurements in the main text quote much larger uncertainties from the analysis in Appendix A.
One caveat is that dust attenuation is overestimated for ∼0.2 mag (and stellar mass is overestimated for ∼0.1 dex accordingly). However, this should be considered as a result of this specific type of SFH (and SED), as we see fairly good reproduction of the parameter for random SFHs in Figure 7. The offset becomes smaller by increasing the input S/N to 15 but is not completely dismissed.
Footnotes
- 10
The term may refer to different phenomena in different contexts. For example, one may also refer to keeping star formation at very low levels after an initial decline, or the (rapid) decline of star formation itself (see Man & Belli 2018, for a recent review). We use the term to describe any decline of galaxy star formation activity regardless of speed.
- 11
Belli et al. (2019) recently presented SFHs of 1.5 < z < 2.5 galaxies reconstructed with ground-based spectroscopic data. None of their sample galaxies overlaps with ours in this study.
- 12
The code is available at https://github.com/mtakahiro/gsf.
- 13