Budgeted Reinforcement Learning in Continuous State Space

Carrara, Nicolas; Leurent, Edouard; Laroche, Romain; Urvoy, Tanguy; Maillard, Odalric-Ambrym; Pietquin, Olivier

arXiv:1903.01004 (cs)

[Submitted on 3 Mar 2019 (v1), last revised 27 May 2019 (this version, v3)]

Abstract: A Budgeted Markov Decision Process (BMDP) is an extension of a Markov Decision Process to critical applications requiring safety constraints. It relies on a notion of risk implemented in the shape of a cost signal constrained to lie below an adjustable threshold. So far, BMDPs could only be solved in the case of finite state spaces with known dynamics. This work extends the state of the art to environments with continuous state spaces and unknown dynamics. We show that the solution to a BMDP is a fixed point of a novel Budgeted Bellman Optimality operator. This observation allows us to introduce natural extensions of Deep Reinforcement Learning algorithms to address large-scale BMDPs. We validate our approach on two simulated applications: spoken dialogue and autonomous driving.
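
As a reading aid, the constrained problem described in the abstract can be sketched as below. This is one common way to write a cost-constrained objective, not a formula quoted from the abstract, and every symbol is an assumption introduced for illustration: $\pi$ is a policy, $\gamma \in [0, 1)$ a discount factor, $r_t$ the reward at step $t$, $c_t$ the cost signal at step $t$, and $\beta$ the adjustable threshold.

```latex
\max_{\pi} \; \mathbb{E}_{\pi}\!\left[ \sum_{t=0}^{\infty} \gamma^{t} r_t \right]
\quad \text{subject to} \quad
\mathbb{E}_{\pi}\!\left[ \sum_{t=0}^{\infty} \gamma^{t} c_t \right] \le \beta
```

Under this reading, changing $\beta$ trades expected reward against tolerated risk, which is what makes the threshold adjustable rather than fixed.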

Comments:	N. Carrara and E. Leurent have contributed equally
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1903.01004 [cs.LG]
	(or arXiv:1903.01004v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1903.01004

Submission history

From: Edouard Leurent
[v1] Sun, 3 Mar 2019 22:24:01 UTC (1,012 KB)
[v2] Wed, 6 Mar 2019 17:37:51 UTC (1,012 KB)
[v3] Mon, 27 May 2019 21:50:33 UTC (1,297 KB)
