Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

Cui, Yuxiang; Lin, Longzhong; Huang, Xiaolong; Zhang, Dongkun; Wang, Yue; Xiong, Rong

Computer Science > Robotics

arXiv:2109.07760 (cs)

[Submitted on 16 Sep 2021]

Title:Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

Authors:Yuxiang Cui, Longzhong Lin, Xiaolong Huang, Dongkun Zhang, Yue Wang, Rong Xiong

View PDF

Abstract:Safety is of great importance in multi-robot navigation problems. In this paper, we propose a control barrier function (CBF) based optimizer that ensures robot safety with both high probability and flexibility, using only sensor measurement. The optimizer takes action commands from the policy network as initial values and then provides refinement to drive the potentially dangerous ones back into safe regions. With the help of a deep transition model that predicts the evolution of surrounding dynamics and the consequences of different actions, the CBF module can guide the optimization in a reasonable time horizon. We also present a novel joint training framework that improves the cooperation between the Reinforcement Learning (RL) based policy and the CBF-based optimizer both in training and inference procedures by utilizing reward feedback from the CBF module. We observe that the policy using our method can achieve a higher success rate while maintaining the safety of multiple robots in significantly fewer episodes compared with other methods. Experiments are conducted in multiple scenarios both in simulation and the real world, the results demonstrate the effectiveness of our method in maintaining the safety of multi-robot navigation. Code is available at \url{this https URL

Comments:	7 pages, 7 figures. conference
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2109.07760 [cs.RO]
	(or arXiv:2109.07760v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2109.07760

Submission history

From: Yuxiang Cui [view email]
[v1] Thu, 16 Sep 2021 07:16:39 UTC (5,726 KB)

Computer Science > Robotics

Title:Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators