Reinforcement learning for port-Hamiltonian systems

Sprangers, Olivier; Lopes, Gabriel A. D.; Babuska, Robert

doi:10.1109/TCYB.2014.2343194

arXiv:1212.5524 (cs)

[Submitted on 21 Dec 2012 (v1), last revised 22 Aug 2013 (this version, v2)]

Abstract: Passivity-based control (PBC) for port-Hamiltonian systems provides an intuitive way of achieving stabilization by rendering a system passive with respect to a desired storage function. However, in most instances the control law is obtained without any performance considerations, and it has to be calculated by solving a complex partial differential equation (PDE). To address these issues, we introduce a reinforcement learning approach into the energy-balancing passivity-based control (EB-PBC) method, a form of PBC in which the closed-loop energy is equal to the difference between the stored and supplied energies. We propose a technique to parameterize EB-PBC that preserves the system's PDE matching conditions, does not require the specification of a global desired Hamiltonian, includes performance criteria, and is robust to extra non-linearities such as control input saturation. The parameters of the control law are found using actor-critic reinforcement learning, which enables the learning of near-optimal control policies that satisfy a desired closed-loop energy landscape. The advantages are that near-optimal controllers can be generated using standard energy-shaping techniques and that the learned solutions can be interpreted in terms of energy shaping and damping injection, which makes it possible to numerically assess stability using passivity theory. From the reinforcement learning perspective, our proposal allows the class of port-Hamiltonian systems to be incorporated in the actor-critic framework, speeding up the learning thanks to the resulting parameterization of the policy. The method has been successfully applied to the pendulum swing-up problem in simulations and real-life experiments.
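
For readers unfamiliar with the terminology, the energy-balancing statement in the abstract can be written out as follows. This is a sketch in the standard notation of the PBC literature, not necessarily the notation of the paper itself: the symbols J, R, g, H, the state-feedback term \beta, and the constant \kappa are assumptions introduced here for illustration.

```latex
% Input-state-output port-Hamiltonian system (assumed standard form),
% with J(x) = -J(x)^T (interconnection) and R(x) = R(x)^T >= 0 (damping):
\dot{x} = \bigl(J(x) - R(x)\bigr)\,\nabla_x H(x) + g(x)\,u,
\qquad
y = g(x)^{\top}\,\nabla_x H(x)

% Energy balance along trajectories: stored = supplied - dissipated
H\bigl(x(t)\bigr) - H\bigl(x(0)\bigr)
  = \int_0^t u(s)^{\top} y(s)\,\mathrm{d}s
  - \int_0^t \nabla_x H^{\top} R\,\nabla_x H\,\mathrm{d}s

% Energy-balancing PBC: with the feedback u = \beta(x), the closed-loop
% energy H_d is the stored energy minus the energy supplied by the controller
H_d\bigl(x(t)\bigr)
  = H\bigl(x(t)\bigr) - \int_0^t \beta\bigl(x(s)\bigr)^{\top} y(s)\,\mathrm{d}s + \kappa
```

Under these assumptions, choosing \beta so that H_d has a minimum at the desired equilibrium is the energy-shaping step, and any additional feedback that increases dissipation is the damping-injection step mentioned in the abstract.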

Comments:	submitted
Subjects:	Systems and Control (eess.SY); Machine Learning (cs.LG)
Cite as:	arXiv:1212.5524 [cs.SY]
	(or arXiv:1212.5524v2 [cs.SY] for this version)
 	https://doi.org/10.48550/arXiv.1212.5524
Journal reference:	IEEE Transactions on Cybernetics, Volume 45, Issue 5, May 2015
Related DOI:	https://doi.org/10.1109/TCYB.2014.2343194

Submission history

From: Gabriel Lopes
[v1] Fri, 21 Dec 2012 16:57:28 UTC (213 KB)
[v2] Thu, 22 Aug 2013 16:16:31 UTC (263 KB)
