Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning

Lin, Yijiong; Huang, Jiancong; Zimmer, Matthieu; Guan, Yisheng; Rojas, Juan; Weng, Paul

doi:10.1109/LRA.2020.3013937

Computer Science > Robotics

arXiv:1909.10707 (cs)

[Submitted on 24 Sep 2019 (v1), last revised 5 Jul 2020 (this version, v6)]

Title:Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning

Authors:Yijiong Lin, Jiancong Huang, Matthieu Zimmer, Yisheng Guan, Juan Rojas, Paul Weng

View PDF

Abstract:Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but its current application to robotics is currently hindered by high sample requirements. To alleviate this issue, we propose to exploit the symmetries present in robotic tasks. Intuitively, symmetries from observed trajectories define transformations that leave the space of feasible RL trajectories invariant and can be used to generate new feasible trajectories, which could be used for training. Based on this data augmentation idea, we formulate a general framework, called Invariant Transform Experience Replay that we present with two techniques: (i) Kaleidoscope Experience Replay exploits reflectional symmetries and (ii) Goal-augmented Experience Replay which takes advantage of lax goal definitions. In the Fetch tasks from OpenAI Gym, our experimental results show significant increases in learning rates and success rates. Particularly, we attain a 13, 3, and 5 times speedup in the pushing, sliding, and pick-and-place tasks respectively in the multi-goal setting. Performance gains are also observed in similar tasks with obstacles and we successfully deployed a trained policy on a real Baxter robot. Our work demonstrates that invariant transformations on RL trajectories are a promising methodology to speed up learning in deep RL.

Comments:	8 pages, 11 figures, additional 3 pages for appendix. IEEE Robotics and Automation Letters (RAL), 2020. Also in: Intelligent Robots and Systems (IROS)
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1909.10707 [cs.RO]
	(or arXiv:1909.10707v6 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1909.10707
Journal reference:	IEEE Robotics and Automation Letters, Volume: 5, Issue: 4, p. 6615-6622, Oct. 2020
Related DOI:	https://doi.org/10.1109/LRA.2020.3013937

Submission history

From: Yijiong Lin [view email]
[v1] Tue, 24 Sep 2019 04:34:58 UTC (5,749 KB)
[v2] Thu, 10 Oct 2019 07:31:08 UTC (2,836 KB)
[v3] Sat, 14 Dec 2019 02:42:27 UTC (5,748 KB)
[v4] Thu, 12 Mar 2020 09:27:44 UTC (6,569 KB)
[v5] Tue, 9 Jun 2020 03:51:26 UTC (8,819 KB)
[v6] Sun, 5 Jul 2020 03:19:06 UTC (9,044 KB)

Computer Science > Robotics

Title:Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators