Context-Aware Policy Reuse

Li, Siyuan; Gu, Fangda; Zhu, Guangxiang; Zhang, Chongjie

Computer Science > Artificial Intelligence

arXiv:1806.03793 (cs)

[Submitted on 11 Jun 2018 (v1), last revised 8 Mar 2019 (this version, v4)]

Title:Context-Aware Policy Reuse

Authors:Siyuan Li, Fangda Gu, Guangxiang Zhu, Chongjie Zhang

View PDF

Abstract:Transfer learning can greatly speed up reinforcement learning for a new task by leveraging policies of relevant tasks.
Existing works of policy reuse either focus on only selecting a single best source policy for transfer without considering contexts, or cannot guarantee to learn an optimal policy for a target task.
To improve transfer efficiency and guarantee optimality, we develop a novel policy reuse method, called Context-Aware Policy reuSe (CAPS), that enables multi-policy transfer. Our method learns when and which source policy is best for reuse, as well as when to terminate its reuse. CAPS provides theoretical guarantees in convergence and optimality for both source policy selection and target task learning. Empirical results on a grid-based navigation domain and the Pygame Learning Environment demonstrate that CAPS significantly outperforms other state-of-the-art policy reuse methods.

Comments:	Camera-ready version for AAMAS 2019
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1806.03793 [cs.AI]
	(or arXiv:1806.03793v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1806.03793

Submission history

From: Siyuan Li [view email]
[v1] Mon, 11 Jun 2018 03:37:43 UTC (3,972 KB)
[v2] Thu, 14 Jun 2018 02:53:52 UTC (1 KB) (withdrawn)
[v3] Thu, 28 Jun 2018 11:01:33 UTC (4,116 KB)
[v4] Fri, 8 Mar 2019 14:13:36 UTC (2,272 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2018-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Siyuan Li
Fangda Gu
Guangxiang Zhu
Chongjie Zhang

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Context-Aware Policy Reuse

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Context-Aware Policy Reuse

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators