A General Framework for Abstention Under Label Shift

Alexandari, Amr M.; Kundaje, Anshul; Shrikumar, Avanti

Statistics > Machine Learning

arXiv:1802.07024 (stat)

[Submitted on 20 Feb 2018 (v1), last revised 19 Jun 2022 (this version, v5)]

Title:A General Framework for Abstention Under Label Shift

Authors:Amr M. Alexandari, Anshul Kundaje, Avanti Shrikumar

View PDF

Abstract:In safety-critical applications of machine learning, it is often important to abstain from making predictions on low confidence examples. Standard abstention methods tend to be focused on optimizing top-k accuracy, but in many applications, accuracy is not the metric of interest. Further, label shift (a shift in class proportions between training time and prediction time) is ubiquitous in practical settings, and existing abstention methods do not handle label shift well. In this work, we present a general framework for abstention that can be applied to optimize any metric of interest, that is adaptable to label shift at test time, and that works out-of-the-box with any classifier that can be calibrated. Our approach leverages recent reports that calibrated probability estimates can be used as a proxy for the true class labels, thereby allowing us to estimate the change in an arbitrary metric if an example were abstained on. We present computationally efficient algorithms under our framework to optimize sensitivity at a target specificity, auROC, and the weighted Cohen's Kappa, and introduce a novel strong baseline based on JS divergence from prior class probabilities. Experiments on synthetic, biological, and clinical data support our findings.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1802.07024 [stat.ML]
	(or arXiv:1802.07024v5 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1802.07024

Submission history

From: Amr Alexandari [view email]
[v1] Tue, 20 Feb 2018 09:24:49 UTC (809 KB)
[v2] Sat, 19 May 2018 00:21:34 UTC (1,480 KB)
[v3] Fri, 14 Sep 2018 05:11:25 UTC (1,481 KB)
[v4] Wed, 30 Oct 2019 06:02:57 UTC (250 KB)
[v5] Sun, 19 Jun 2022 04:32:20 UTC (381 KB)

Statistics > Machine Learning

Title:A General Framework for Abstention Under Label Shift

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:A General Framework for Abstention Under Label Shift

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators