An Ergodic Measure for Active Learning From Equilibrium

Abraham, Ian; Prabhakar, Ahalya; Murphey, Todd D.

Computer Science > Robotics

arXiv:2006.03552 (cs)

[Submitted on 5 Jun 2020 (v1), last revised 7 Dec 2020 (this version, v2)]

Title:An Ergodic Measure for Active Learning From Equilibrium

Authors:Ian Abraham, Ahalya Prabhakar, Todd D. Murphey

View PDF

Abstract:This paper develops KL-Ergodic Exploration from Equilibrium ($\text{KL-E}^3$), a method for robotic systems to integrate stability into actively generating informative measurements through ergodic exploration. Ergodic exploration enables robotic systems to indirectly sample from informative spatial distributions globally, avoiding local optima, and without the need to evaluate the derivatives of the distribution against the robot dynamics. Using hybrid systems theory, we derive a controller that allows a robot to exploit equilibrium policies (i.e., policies that solve a task) while allowing the robot to explore and generate informative data using an ergodic measure that can extend to high-dimensional states. We show that our method is able to maintain Lyapunov attractiveness with respect to the equilibrium task while actively generating data for learning tasks such, as Bayesian optimization, model learning, and off-policy reinforcement learning. In each example, we show that our proposed method is capable of generating an informative distribution of data while synthesizing smooth control signals. We illustrate these examples using simulated systems and provide simplification of our method for real-time online learning in robotic systems.

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2006.03552 [cs.RO]
	(or arXiv:2006.03552v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2006.03552

Submission history

From: Ian Abraham [view email]
[v1] Fri, 5 Jun 2020 16:58:55 UTC (2,761 KB)
[v2] Mon, 7 Dec 2020 18:49:20 UTC (2,749 KB)

Computer Science > Robotics

Title:An Ergodic Measure for Active Learning From Equilibrium

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:An Ergodic Measure for Active Learning From Equilibrium

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators