Investigating Generalisation in Continuous Deep Reinforcement Learning

Zhao, Chenyang; Sigaud, Olivier; Stulp, Freek; Hospedales, Timothy M.

arXiv:1902.07015 (cs)

[Submitted on 19 Feb 2019 (v1), last revised 20 Feb 2019 (this version, v2)]

Abstract: Deep Reinforcement Learning has shown great success in a variety of control tasks. However, it is unclear how close we are to the vision of putting Deep RL into practice to solve real-world problems. In particular, common practice in the field is to train policies on largely deterministic simulators and to evaluate algorithms through training performance alone, without a train/test distinction to ensure models generalise and are not overfitted. Moreover, it is not standard practice to check for generalisation under domain shift, although robustness to such system change between training and testing would be necessary for real-world Deep RL control, for example, in robotics. In this paper we study these issues by first characterising the sources of uncertainty that provide generalisation challenges in Deep RL. We then provide a new benchmark and thorough empirical evaluation of generalisation challenges for state-of-the-art Deep RL methods. In particular, we show that, if generalisation is the goal, then the common practice of evaluating algorithms based on their training performance leads to the wrong conclusions about algorithm choice. Finally, we evaluate several techniques for improving generalisation and draw conclusions about the most robust techniques to date.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1902.07015 [cs.LG]
	(or arXiv:1902.07015v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1902.07015

Submission history

From: Olivier Sigaud
[v1] Tue, 19 Feb 2019 12:20:36 UTC (2,030 KB)
[v2] Wed, 20 Feb 2019 16:19:07 UTC (2,030 KB)
