Group Fairness in Bandit Arm Selection

Schumann, Candice; Lang, Zhi; Mattei, Nicholas; Dickerson, John P.

Computer Science > Machine Learning

arXiv:1912.03802 (cs)

[Submitted on 9 Dec 2019 (v1), last revised 15 Feb 2022 (this version, v3)]

Title:Group Fairness in Bandit Arm Selection

Authors:Candice Schumann, Zhi Lang, Nicholas Mattei, John P. Dickerson

View PDF

Abstract:We propose a novel formulation of group fairness with biased feedback in the contextual multi-armed bandit (CMAB) setting. In the CMAB setting, a sequential decision maker must, at each time step, choose an arm to pull from a finite set of arms after observing some context for each of the potential arm pulls. In our model, arms are partitioned into two or more sensitive groups based on some protected feature(s) (e.g., age, race, or socio-economic status). Initial rewards received from pulling an arm may be distorted due to some unknown societal or measurement bias. We assume that in reality these groups are equal despite the biased feedback received by the agent. To alleviate this, we learn a societal bias term which can be used to both find the source of bias and to potentially fix the problem outside of the algorithm. We provide a novel algorithm that can accommodate this notion of fairness for an arbitrary number of groups, and provide a theoretical bound on the regret for our algorithm. We validate our algorithm using synthetic data and two real-world datasets for intervention settings wherein we want to allocate resources fairly across groups.

Comments:	Accepted to AAMAS 2022
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1912.03802 [cs.LG]
	(or arXiv:1912.03802v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.03802

Submission history

From: John Dickerson [view email]
[v1] Mon, 9 Dec 2019 01:02:35 UTC (887 KB)
[v2] Tue, 18 Feb 2020 17:26:41 UTC (1,121 KB)
[v3] Tue, 15 Feb 2022 21:15:27 UTC (976 KB)

Computer Science > Machine Learning

Title:Group Fairness in Bandit Arm Selection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Group Fairness in Bandit Arm Selection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators