Regret of Queueing Bandits

Krishnasamy, Subhashini; Sen, Rajat; Johari, Ramesh; Shakkottai, Sanjay

Abstract: This paper studies multi-armed bandit algorithms for queueing systems where system parameters, such as service rates, may be unknown. This is a fundamental issue in many queueing systems, including crowdsourcing systems and wireless networks. We consider a system with multiple queues and servers, where the servers have queue-dependent, but unknown, service rates. At each time, a matching must be chosen between queues and servers. We focus on the goal of keeping the queue sizes small, and in particular aim to design algorithms that minimize queue-regret: the (expected) difference between the queue-lengths obtained by the algorithm, and those obtained by a "genie"-aided matching algorithm that knows exact server statistics. A naive view of this problem would suggest that queue-regret should grow logarithmically: since a queue is an integrator of service rates, classical regret analysis implies there exist algorithms that ensure queue-regret increases no more than logarithmically in time.
Our paper shows surprisingly more complex behavior. In particular, the naive intuition is correct as long as the bandit algorithm's queues have relatively long regenerative cycles: in this case queue-regret is similar to cumulative regret, and scales (essentially) logarithmically. However, we show that this "early stage" of the queueing bandit eventually gives way to a "late stage", where the optimal queue-regret scaling is $O(1/t)$. We demonstrate an algorithm that (order-wise) achieves this asymptotic queue-regret, and also exhibits close to optimal switching time from the early stage to the late stage.
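
The definition of queue-regret above can be made concrete with a small simulation. The following is a minimal sketch under stated assumptions, not the algorithm from the paper: it uses a single queue with Bernoulli arrivals and Bernoulli service, a standard UCB1 rule as a stand-in learner, and lets the learner observe the service outcome in every slot, even when the queue is empty. The function names `simulate_queue_gap` and `estimate_queue_regret` and all parameter values are hypothetical. Both systems share the same arrivals and the same per-server random draws, so the gap reflects only the choice of server.

```python
import math
import random


def simulate_queue_gap(arrival_rate, service_rates, horizon, seed=0):
    """Return the per-slot gap Q_alg(t) - Q_genie(t) for one sample path.

    Hypothetical setup (not from the paper): one queue, Bernoulli arrivals,
    Bernoulli service, and a UCB1 learner that observes the service outcome
    in every slot, even when the queue is empty.
    """
    rng = random.Random(seed)
    k = len(service_rates)
    best = max(range(k), key=lambda i: service_rates[i])  # genie's server
    pulls = [0] * k
    successes = [0] * k
    q_alg = 0
    q_genie = 0
    gaps = []
    for t in range(1, horizon + 1):
        arrival = rng.random() < arrival_rate
        # One uniform draw per server, shared by both systems (coupling).
        draws = [rng.random() for _ in range(k)]

        if t <= k:
            arm = t - 1  # try every server once before using the index
        else:
            arm = max(
                range(k),
                key=lambda i: successes[i] / pulls[i]
                + math.sqrt(2.0 * math.log(t) / pulls[i]),
            )

        served_alg = draws[arm] < service_rates[arm]
        served_genie = draws[best] < service_rates[best]
        pulls[arm] += 1
        successes[arm] += served_alg

        # Queue update: add the arrival, remove at most one job, floor at zero.
        q_alg = max(q_alg + arrival - served_alg, 0)
        q_genie = max(q_genie + arrival - served_genie, 0)
        gaps.append(q_alg - q_genie)
    return gaps


def estimate_queue_regret(arrival_rate, service_rates, horizon, runs=200):
    """Average the gap over independent runs to estimate queue-regret."""
    totals = [0] * horizon
    for seed in range(runs):
        gaps = simulate_queue_gap(arrival_rate, service_rates, horizon, seed)
        for t, gap in enumerate(gaps):
            totals[t] += gap
    return [total / runs for total in totals]
```

Calling, for example, `estimate_queue_regret(0.3, [0.5, 0.8], 2000)` returns one averaged gap per time slot, which could be plotted against `t` and compared with the early-stage and late-stage behavior the abstract describes. The sketch makes no claim to reproduce the paper's scaling results.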

Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:1604.06377 [cs.SY]
	(or arXiv:1604.06377v1 [cs.SY] for this version)
	https://doi.org/10.48550/arXiv.1604.06377
