Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals

Dhiman, Vikas; Banerjee, Shurjo; Siskind, Jeffrey M.; Corso, Jason J.

Computer Science > Machine Learning

arXiv:1809.09318 (cs)

[Submitted on 25 Sep 2018 (v1), last revised 4 Jan 2019 (this version, v4)]

Title:Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals

Authors:Vikas Dhiman, Shurjo Banerjee, Jeffrey M. Siskind, Jason J. Corso

View PDF

Abstract:Consider mutli-goal tasks that involve static environments and dynamic goals. Examples of such tasks, such as goal-directed navigation and pick-and-place in robotics, abound. Two types of Reinforcement Learning (RL) algorithms are used for such tasks: model-free or model-based. Each of these approaches has limitations. Model-free RL struggles to transfer learned information when the goal location changes, but achieves high asymptotic accuracy in single goal tasks. Model-based RL can transfer learned information to new goal locations by retaining the explicitly learned state-dynamics, but is limited by the fact that small errors in modelling these dynamics accumulate over long-term planning. In this work, we improve upon the limitations of model-free RL in multi-goal domains. We do this by adapting the Floyd-Warshall algorithm for RL and call the adaptation Floyd-Warshall RL (FWRL). The proposed algorithm learns a goal-conditioned action-value function by constraining the value of the optimal path between any two states to be greater than or equal to the value of paths via intermediary states. Experimentally, we show that FWRL is more sample-efficient and learns higher reward strategies in multi-goal tasks as compared to Q-learning, model-based RL and other relevant baselines in a tabular domain.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1809.09318 [cs.LG]
	(or arXiv:1809.09318v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1809.09318

Submission history

From: Vikas Dhiman [view email]
[v1] Tue, 25 Sep 2018 05:09:32 UTC (127 KB)
[v2] Tue, 2 Oct 2018 20:23:32 UTC (127 KB)
[v3] Thu, 4 Oct 2018 18:33:20 UTC (128 KB)
[v4] Fri, 4 Jan 2019 20:53:40 UTC (129 KB)

Computer Science > Machine Learning

Title:Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators