Dynamic Experience Replay

Luo, Jieliang; Li, Hui

Computer Science > Artificial Intelligence

arXiv:2003.02372 (cs)

[Submitted on 4 Mar 2020]

Title:Dynamic Experience Replay

Authors:Jieliang Luo, Hui Li

View PDF

Abstract:We present a novel technique called Dynamic Experience Replay (DER) that allows Reinforcement Learning (RL) algorithms to use experience replay samples not only from human demonstrations but also successful transitions generated by RL agents during training and therefore improve training efficiency. It can be combined with an arbitrary off-policy RL algorithm, such as DDPG or DQN, and their distributed versions. We build upon Ape-X DDPG and demonstrate our approach on robotic tight-fitting joint assembly tasks, based on force/torque and Cartesian pose observations. In particular, we run experiments on two different tasks: peg-in-hole and lap-joint. In each case, we compare different replay buffer structures and how DER affects them. Our ablation studies show that Dynamic Experience Replay is a crucial ingredient that either largely shortens the training time in these challenging environments or solves the tasks that the vanilla Ape-X DDPG cannot solve. We also show that our policies learned purely in simulation can be deployed successfully on the real robot. The video presenting our experiments is available at this https URL

Comments:	10 pages, 5 figures, presented at 2019 Conference on Robot Learning (CoRL)
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2003.02372 [cs.AI]
	(or arXiv:2003.02372v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2003.02372
Journal reference:	PMLR 100:1191-1200, 2020

Submission history

From: Jieliang Luo [view email]
[v1] Wed, 4 Mar 2020 23:46:45 UTC (5,721 KB)

Computer Science > Artificial Intelligence

Title:Dynamic Experience Replay

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Dynamic Experience Replay

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators