Improving Photometric Redshift Estimates with Training Sample Augmentation

Moskowitz, Irene; Gawiser, Eric; Crenshaw, John Franklin; Andrews, Brett H.; Malz, Alex I.; Schmidt, Samuel; Collaboration, The LSST Dark Energy Science

doi:10.3847/2041-8213/ad4039

Astrophysics > Instrumentation and Methods for Astrophysics

arXiv:2402.15551 (astro-ph)

[Submitted on 23 Feb 2024 (v1), last revised 14 May 2024 (this version, v2)]

Title:Improving Photometric Redshift Estimates with Training Sample Augmentation

Authors:Irene Moskowitz, Eric Gawiser, John Franklin Crenshaw, Brett H. Andrews, Alex I. Malz, Samuel Schmidt, The LSST Dark Energy Science Collaboration

View PDF HTML (experimental)

Abstract:Large imaging surveys will rely on photometric redshifts (photo-z's), which are typically estimated through machine learning methods. Currently planned spectroscopic surveys will not be deep enough to produce a representative training sample for LSST, so we seek methods to improve the photo-z estimates that arise from non-representative training samples. Spectroscopic training samples for photo-z's are biased towards redder, brighter galaxies, which also tend to be at lower redshift than the typical galaxy observed by LSST, leading to poor photo-z estimates with outlier fractions nearly 4 times larger than for a representative training sample. In this paper, we apply the concept of training sample augmentation, where we augment simulated non-representative training samples with simulated galaxies possessing otherwise unrepresented features. When we select simulated galaxies with (g-z) color, i-band magnitude and redshift outside the range of the original training sample, we are able to reduce the outlier fraction of the photo-z estimates for simulated LSST data by nearly 50% and the normalized median absolute deviation (NMAD) by 56%. When compared to a fully representative training sample, augmentation can recover nearly 70% of the degradation in the outlier fraction and 80% of the degradation in NMAD. Training sample augmentation is a simple and effective way to improve training samples for photo-z's without requiring additional spectroscopic samples.

Comments:	11 pages, 4 figures, published in ApJ Letters
Subjects:	Instrumentation and Methods for Astrophysics (astro-ph.IM); Cosmology and Nongalactic Astrophysics (astro-ph.CO)
Cite as:	arXiv:2402.15551 [astro-ph.IM]
	(or arXiv:2402.15551v2 [astro-ph.IM] for this version)
	https://doi.org/10.48550/arXiv.2402.15551
Journal reference:	ApJL 967 L6 (2024)
Related DOI:	https://doi.org/10.3847/2041-8213/ad4039

Submission history

From: Irene Moskowitz [view email]
[v1] Fri, 23 Feb 2024 17:03:02 UTC (1,551 KB)
[v2] Tue, 14 May 2024 14:09:33 UTC (1,555 KB)

Astrophysics > Instrumentation and Methods for Astrophysics

Title:Improving Photometric Redshift Estimates with Training Sample Augmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Astrophysics > Instrumentation and Methods for Astrophysics

Title:Improving Photometric Redshift Estimates with Training Sample Augmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators