How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?

Vuong, Quan; Vikram, Sharad; Su, Hao; Gao, Sicun; Christensen, Henrik I.

Computer Science > Machine Learning

arXiv:1903.11774 (cs)

[Submitted on 28 Mar 2019]

Title:How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?

Authors:Quan Vuong, Sharad Vikram, Hao Su, Sicun Gao, Henrik I. Christensen

View PDF

Abstract:Recently, reinforcement learning (RL) algorithms have demonstrated remarkable success in learning complicated behaviors from minimally processed input. However, most of this success is limited to simulation. While there are promising successes in applying RL algorithms directly on real systems, their performance on more complex systems remains bottle-necked by the relative data inefficiency of RL algorithms. Domain randomization is a promising direction of research that has demonstrated impressive results using RL algorithms to control real robots. At a high level, domain randomization works by training a policy on a distribution of environmental conditions in simulation. If the environments are diverse enough, then the policy trained on this distribution will plausibly generalize to the real world. A human-specified design choice in domain randomization is the form and parameters of the distribution of simulated environments. It is unclear how to the best pick the form and parameters of this distribution and prior work uses hand-tuned distributions. This extended abstract demonstrates that the choice of the distribution plays a major role in the performance of the trained policies in the real world and that the parameter of this distribution can be optimized to maximize the performance of the trained policies in the real world

Comments:	2-page extended abstract
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1903.11774 [cs.LG]
	(or arXiv:1903.11774v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1903.11774

Submission history

From: Quan Vuong [view email]
[v1] Thu, 28 Mar 2019 03:24:44 UTC (151 KB)

Computer Science > Machine Learning

Title:How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators