Hindsight Experience Replay

Andrychowicz, Marcin; Wolski, Filip; Ray, Alex; Schneider, Jonas; Fong, Rachel; Welinder, Peter; McGrew, Bob; Tobin, Josh; Abbeel, Pieter; Zaremba, Wojciech

Computer Science > Machine Learning

arXiv:1707.01495 (cs)

[Submitted on 5 Jul 2017 (v1), last revised 23 Feb 2018 (this version, v3)]

Title:Hindsight Experience Replay

Authors:Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel, Wojciech Zaremba

View PDF

Abstract:Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning (RL). We present a novel technique called Hindsight Experience Replay which allows sample-efficient learning from rewards which are sparse and binary and therefore avoid the need for complicated reward engineering. It can be combined with an arbitrary off-policy RL algorithm and may be seen as a form of implicit curriculum.
We demonstrate our approach on the task of manipulating objects with a robotic arm. In particular, we run experiments on three different tasks: pushing, sliding, and pick-and-place, in each case using only binary rewards indicating whether or not the task is completed. Our ablation studies show that Hindsight Experience Replay is a crucial ingredient which makes training possible in these challenging environments. We show that our policies trained on a physics simulation can be deployed on a physical robot and successfully complete the task.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
Cite as:	arXiv:1707.01495 [cs.LG]
	(or arXiv:1707.01495v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1707.01495

Submission history

From: Marcin Andrychowicz [view email]
[v1] Wed, 5 Jul 2017 17:55:53 UTC (1,023 KB)
[v2] Mon, 10 Jul 2017 18:35:33 UTC (1,023 KB)
[v3] Fri, 23 Feb 2018 10:04:20 UTC (1,238 KB)

Computer Science > Machine Learning

Title:Hindsight Experience Replay

Submission history

Access Paper:

References & Citations

4 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hindsight Experience Replay

Submission history

Access Paper:

References & Citations

4 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators