Bundled Gradients through Contact via Randomized Smoothing

Suh, H. J. Terry; Pang, Tao; Tedrake, Russ

Computer Science > Robotics

arXiv:2109.05143 (cs)

[Submitted on 11 Sep 2021 (v1), last revised 22 Jan 2022 (this version, v3)]

Title:Bundled Gradients through Contact via Randomized Smoothing

Authors:H.J. Terry Suh, Tao Pang, Russ Tedrake

View PDF

Abstract:The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap, and how might we use the answer to improve gradient-based methods? We believe a stochastic formulation of dynamics is one crucial ingredient. We use tools from randomized smoothing to analyze sampling-based approximations of the gradient, and formalize such approximations through the gradient bundle. We show that using the gradient bundle in lieu of the gradient mitigates fast-changing gradients of non-smooth contact dynamics modeled by the implicit time-stepping, or the penalty method. Finally, we apply the gradient bundle to optimal control using iLQR, introducing a novel algorithm which improves convergence over using exact gradients. Combining our algorithm with a convex implicit time-stepping formulation of contact, we show that we can tractably tackle planning-through-contact problems in manipulation.

Comments:	The first two authors contributed equally. Accepted to Robotics and Automation Letters (RA-L)
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2109.05143 [cs.RO]
	(or arXiv:2109.05143v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2109.05143

Submission history

From: Hyung Ju Suh [view email]
[v1] Sat, 11 Sep 2021 00:03:28 UTC (11,080 KB)
[v2] Tue, 14 Sep 2021 03:19:00 UTC (9,045 KB)
[v3] Sat, 22 Jan 2022 03:55:01 UTC (4,628 KB)

Computer Science > Robotics

Title:Bundled Gradients through Contact via Randomized Smoothing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Bundled Gradients through Contact via Randomized Smoothing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators