Recovering from Biased Data: Can Fairness Constraints Improve Accuracy?

Blum, Avrim; Stangl, Kevin

Computer Science > Machine Learning

arXiv:1912.01094 (cs)

[Submitted on 2 Dec 2019 (v1), last revised 22 Aug 2024 (this version, v2)]

Title:Recovering from Biased Data: Can Fairness Constraints Improve Accuracy?

Authors:Avrim Blum, Kevin Stangl

View PDF HTML (experimental)

Abstract:Multiple fairness constraints have been proposed in the literature, motivated by a range of concerns about how demographic groups might be treated unfairly by machine learning classifiers. In this work we consider a different motivation; learning from biased training data. We posit several ways in which training data may be biased, including having a more noisy or negatively biased labeling process on members of a disadvantaged group, or a decreased prevalence of positive or negative examples from the disadvantaged group, or both.
Given such biased training data, Empirical Risk Minimization (ERM) may produce a classifier that not only is biased but also has suboptimal accuracy on the true data distribution. We examine the ability of fairness-constrained ERM to correct this problem. In particular, we find that the Equal Opportunity fairness constraint (Hardt, Price, and Srebro 2016) combined with ERM will provably recover the Bayes Optimal Classifier under a range of bias models. We also consider other recovery methods including reweighting the training data, Equalized Odds, and Demographic Parity. These theoretical results provide additional motivation for considering fairness interventions even if an actor cares primarily about accuracy.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1912.01094 [cs.LG]
	(or arXiv:1912.01094v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.01094

Submission history

From: Kevin Matthew Stangl [view email]
[v1] Mon, 2 Dec 2019 22:00:14 UTC (160 KB)
[v2] Thu, 22 Aug 2024 02:33:28 UTC (7,618 KB)

Computer Science > Machine Learning

Title:Recovering from Biased Data: Can Fairness Constraints Improve Accuracy?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Recovering from Biased Data: Can Fairness Constraints Improve Accuracy?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators