Indexability is Not Enough for Whittle: Improved, Near-Optimal Algorithms for Restless Bandits

Ghosh, Abheek; Nagaraj, Dheeraj; Jain, Manish; Tambe, Milind

arXiv:2211.00112 (cs)

[Submitted on 31 Oct 2022 (v1), last revised 28 Feb 2023 (this version, v2)]

Abstract: We study the problem of planning restless multi-armed bandits (RMABs) with multiple actions. This is a popular model for multi-agent systems with applications like multi-channel communication, monitoring and machine maintenance tasks, and healthcare. Whittle index policies, which are based on Lagrangian relaxations, are widely used in these settings due to their simplicity and near-optimality under certain conditions. In this work, we first show that Whittle index policies can fail in simple and practically relevant RMAB settings, even when the RMABs are indexable. We discuss why the optimality guarantees fail and why asymptotic optimality may not translate well to practically relevant planning horizons.
We then propose an alternate planning algorithm based on the mean-field method, which can provably and efficiently obtain near-optimal policies with a large number of arms, without the stringent structural assumptions required by the Whittle index policies. This borrows ideas from existing research with some improvements: our approach is hyper-parameter free, and we provide an improved non-asymptotic analysis which has: (a) no requirement for exogenous hyper-parameters and tighter polynomial dependence on known problem parameters; (b) high probability bounds which show that the reward of the policy is reliable; and (c) matching sub-optimality lower bounds for this algorithm with respect to the number of arms, thus demonstrating the tightness of our bounds. Our extensive experimental analysis shows that the mean-field approach matches or outperforms other baselines.

Comments:	21 pages; AAMAS'23 version with appendix
Subjects:	Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2211.00112 [cs.MA]
	(or arXiv:2211.00112v2 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.2211.00112

Submission history

From: Abheek Ghosh
[v1] Mon, 31 Oct 2022 19:35:15 UTC (953 KB)
[v2] Tue, 28 Feb 2023 18:30:57 UTC (1,094 KB)
