Non-Stationary Latent Auto-Regressive Bandits

Trella, Anna L.; Dempsey, Walter; Doshi-Velez, Finale; Murphy, Susan A.

Computer Science > Machine Learning

arXiv:2402.03110 (cs)

[Submitted on 5 Feb 2024 (v1), last revised 12 Aug 2024 (this version, v2)]

Title:Non-Stationary Latent Auto-Regressive Bandits

Authors:Anna L. Trella, Walter Dempsey, Finale Doshi-Velez, Susan A. Murphy

View PDF HTML (experimental)

Abstract:We consider the stochastic multi-armed bandit problem with non-stationary rewards. We present a novel formulation of non-stationarity in the environment where changes in the mean reward of the arms over time are due to some unknown, latent, auto-regressive (AR) state of order $k$. We call this new environment the latent AR bandit. Different forms of the latent AR bandit appear in many real-world settings, especially in emerging scientific fields such as behavioral health or education where there are few mechanistic models of the environment. If the AR order $k$ is known, we propose an algorithm that achieves $\tilde{O}(k\sqrt{T})$ regret in this setting. Empirically, our algorithm outperforms standard UCB across multiple non-stationary environments, even if $k$ is mis-specified.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.03110 [cs.LG]
	(or arXiv:2402.03110v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.03110

Submission history

From: Anna Trella [view email]
[v1] Mon, 5 Feb 2024 15:38:01 UTC (5,698 KB)
[v2] Mon, 12 Aug 2024 16:58:54 UTC (5,698 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-02

Change to browse by:

cs
cs.AI

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Non-Stationary Latent Auto-Regressive Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Non-Stationary Latent Auto-Regressive Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators