Deep Exploration via Randomized Value Functions

Osband, Ian; Van Roy, Benjamin; Russo, Daniel; Wen, Zheng

Statistics > Machine Learning

arXiv:1703.07608 (stat)

[Submitted on 22 Mar 2017 (v1), last revised 23 Sep 2019 (this version, v5)]

Title:Deep Exploration via Randomized Value Functions

Authors:Ian Osband, Benjamin Van Roy, Daniel Russo, Zheng Wen

View PDF

Abstract:We study the use of randomized value functions to guide deep exploration in reinforcement learning. This offers an elegant means for synthesizing statistically and computationally efficient exploration with common practical approaches to value function learning. We present several reinforcement learning algorithms that leverage randomized value functions and demonstrate their efficacy through computational studies. We also prove a regret bound that establishes statistical efficiency with a tabular representation.

Comments:	Accepted for publication in Journal of Machine Learning Research 2019
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1703.07608 [stat.ML]
	(or arXiv:1703.07608v5 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1703.07608

Submission history

From: Ian Osband [view email]
[v1] Wed, 22 Mar 2017 11:53:53 UTC (2,577 KB)
[v2] Tue, 5 Jun 2018 17:13:06 UTC (1,705 KB)
[v3] Wed, 6 Jun 2018 09:17:37 UTC (1,872 KB)
[v4] Mon, 4 Mar 2019 23:48:32 UTC (1,329 KB)
[v5] Mon, 23 Sep 2019 18:29:02 UTC (1,923 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.ML

< prev | next >

new | recent | 2017-03

Change to browse by:

cs
cs.AI
cs.LG
stat

References & Citations

export BibTeX citation

Statistics > Machine Learning

Title:Deep Exploration via Randomized Value Functions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Deep Exploration via Randomized Value Functions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators