Regret of Queueing Bandits

Krishnasamy, Subhashini; Sen, Rajat; Johari, Ramesh; Shakkottai, Sanjay

Computer Science > Systems and Control

arXiv:1604.06377v2 (cs)

[Submitted on 21 Apr 2016 (v1), revised 13 Jun 2016 (this version, v2), latest version 21 Nov 2019 (v4)]

Title:Regret of Queueing Bandits

Authors:Subhashini Krishnasamy, Rajat Sen, Ramesh Johari, Sanjay Shakkottai

View PDF

Abstract:We consider a variant of the multiarmed bandit problem where jobs queue for service, and service rates of different servers may be unknown. We study algorithms that minimize queue-regret: the (expected) difference between the queue-lengths obtained by the algorithm, and those obtained by a "genie"-aided matching algorithm that knows exact service rates. A naive view of this problem would suggest that queue-regret should grow logarithmically: since queue-regret cannot be larger than classical regret, results for the standard MAB problem give algorithms that ensure queue-regret increases no more than logarithmically in time. Our paper shows surprisingly more complex behavior. In particular, the naive intuition is correct as long as the bandit algorithm's queues have relatively long regenerative cycles: in this case queue-regret is similar to cumulative regret, and scales (essentially) logarithmically. However, we show that this "early stage" of the queueing bandit eventually gives way to a "late stage", where the optimal queue-regret scaling is $O(1/t)$. We demonstrate an algorithm that (order-wise) achieves this asymptotic queue-regret, and also exhibits close to optimal switching time from the early stage to the late stage.

Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:1604.06377 [cs.SY]
	(or arXiv:1604.06377v2 [cs.SY] for this version)
	https://doi.org/10.48550/arXiv.1604.06377

Submission history

From: Subhashini Krishnasamy [view email]
[v1] Thu, 21 Apr 2016 16:43:27 UTC (600 KB)
[v2] Mon, 13 Jun 2016 18:37:51 UTC (762 KB)
[v3] Mon, 8 Oct 2018 01:11:54 UTC (3,580 KB)
[v4] Thu, 21 Nov 2019 22:18:22 UTC (3,580 KB)

Computer Science > Systems and Control

Title:Regret of Queueing Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Systems and Control

Title:Regret of Queueing Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators