Bi-directional Value Learning for Risk-aware Planning Under Uncertainty: Extended Version

Kim, Sung-Kyun; Thakker, Rohan; Agha-mohammadi, Ali-akbar

doi:10.1109/LRA.2019.2903259

Computer Science > Robotics

arXiv:1902.05698 (cs)

[Submitted on 15 Feb 2019 (v1), last revised 6 Apr 2019 (this version, v2)]

Title:Bi-directional Value Learning for Risk-aware Planning Under Uncertainty: Extended Version

Authors:Sung-Kyun Kim, Rohan Thakker, Ali-akbar Agha-mohammadi

View PDF

Abstract:Decision-making under uncertainty is a crucial ability for autonomous systems. In its most general form, this problem can be formulated as a Partially Observable Markov Decision Process (POMDP). The solution policy of a POMDP can be implicitly encoded as a value function. In partially observable settings, the value function is typically learned via forward simulation of the system evolution. Focusing on accurate and long-range risk assessment, we propose a novel method, where the value function is learned in different phases via a bi-directional search in belief space. A backward value learning process provides a long-range and risk-aware base policy. A forward value learning process ensures local optimality and updates the policy via forward simulations. We consider a class of scalable and continuous-space rover navigation problems (RNP) to assess the safety, scalability, and optimality of the proposed algorithm. The results demonstrate the capabilities of the proposed algorithm in evaluating long-range risk/safety of the planner while addressing continuous problems with long planning horizons.

Comments:	Copyright 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:1902.05698 [cs.RO]
	(or arXiv:1902.05698v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1902.05698
Related DOI:	https://doi.org/10.1109/LRA.2019.2903259

Submission history

From: Sung-Kyun Kim [view email]
[v1] Fri, 15 Feb 2019 06:07:57 UTC (1,660 KB)
[v2] Sat, 6 Apr 2019 20:53:16 UTC (1,677 KB)

Computer Science > Robotics

Title:Bi-directional Value Learning for Risk-aware Planning Under Uncertainty: Extended Version

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Bi-directional Value Learning for Risk-aware Planning Under Uncertainty: Extended Version

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators