Formal Language Constraints for Markov Decision Processes

Quint, Eleanor; Xu, Dong; Flint, Samuel; Scott, Stephen; Dwyer, Matthew

Computer Science > Machine Learning

arXiv:1910.01074 (cs)

[Submitted on 2 Oct 2019 (v1), last revised 13 Oct 2020 (this version, v3)]

Title:Formal Language Constraints for Markov Decision Processes

Authors:Eleanor Quint, Dong Xu, Samuel Flint, Stephen Scott, Matthew Dwyer

View PDF

Abstract:In order to satisfy safety conditions, an agent may be constrained from acting freely. A safe controller can be designed a priori if an environment is well understood, but not when learning is employed. In particular, reinforcement learned (RL) controllers require exploration, which can be hazardous in safety critical situations. We study the benefits of giving structure to the constraints of a constrained Markov decision process by specifying them in formal languages as a step towards using safety methods from software engineering and controller synthesis. We instantiate these constraints as finite automata to efficiently recognise constraint violations. Constraint states are then used to augment the underlying MDP state and to learn a dense cost function, easing the problem of quickly learning joint MDP/constraint dynamics. We empirically evaluate the effect of these methods on training a variety of RL algorithms over several constraints specified in Safety Gym, MuJoCo, and Atari environments.

Comments:	NeurIPS 2019 Workshop on Safety and Robustness in Decision Making
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1910.01074 [cs.LG]
	(or arXiv:1910.01074v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1910.01074

Submission history

From: Eleanor Quint [view email]
[v1] Wed, 2 Oct 2019 16:45:23 UTC (166 KB)
[v2] Fri, 9 Oct 2020 22:38:36 UTC (855 KB)
[v3] Tue, 13 Oct 2020 18:00:26 UTC (855 KB)

Computer Science > Machine Learning

Title:Formal Language Constraints for Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Formal Language Constraints for Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators