High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards

Ploeger, Kai; Lutter, Michael; Peters, Jan

Computer Science > Robotics

arXiv:2010.13483 (cs)

[Submitted on 26 Oct 2020 (v1), last revised 31 Oct 2020 (this version, v3)]

Title:High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards

Authors:Kai Ploeger, Michael Lutter, Jan Peters

View PDF

Abstract:Robots that can learn in the physical world will be important to en-able robots to escape their stiff and pre-programmed movements. For dynamic high-acceleration tasks, such as juggling, learning in the real-world is particularly challenging as one must push the limits of the robot and its actuation without harming the system, amplifying the necessity of sample efficiency and safety for robot learning algorithms. In contrast to prior work which mainly focuses on the learning algorithm, we propose a learning system, that directly incorporates these requirements in the design of the policy representation, initialization, and optimization. We demonstrate that this system enables the high-speed Barrett WAM manipulator to learn juggling two balls from 56 minutes of experience with a binary reward signal. The final policy juggles continuously for up to 33 minutes or about 4500 repeated catches. The videos documenting the learning process and the evaluation can be found at this https URL

Comments:	Published at Conference on Robot Learning (CoRL) 2020
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2010.13483 [cs.RO]
	(or arXiv:2010.13483v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2010.13483

Submission history

From: Michael Lutter [view email]
[v1] Mon, 26 Oct 2020 11:13:47 UTC (5,682 KB)
[v2] Wed, 28 Oct 2020 17:41:39 UTC (5,682 KB)
[v3] Sat, 31 Oct 2020 18:15:45 UTC (5,682 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.RO

< prev | next >

new | recent | 2020-10

Change to browse by:

cs
cs.LG
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Kai Ploeger
Michael Lutter
Jan Peters

export BibTeX citation

Computer Science > Robotics

Title:High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators