SOAC: The Soft Option Actor-Critic Architecture

Li, Chenghao; Ma, Xiaoteng; Zhang, Chongjie; Yang, Jun; Xia, Li; Zhao, Qianchuan

arXiv:2006.14363 (cs)

[Submitted on 25 Jun 2020]

Abstract: The option framework has shown great promise by automatically extracting temporally extended sub-tasks from a long-horizon task. Methods have been proposed for concurrently learning low-level intra-option policies and a high-level option-selection policy. However, existing methods typically suffer from two major challenges: ineffective exploration and unstable updates. In this paper, we present a novel and stable off-policy approach that builds on the maximum entropy model to address these challenges. Our approach introduces an information-theoretic intrinsic reward that encourages the identification of diverse and effective options. Meanwhile, we utilize a probabilistic inference model to simplify the optimization problem to fitting optimal trajectories. Experimental results demonstrate that our approach significantly outperforms prior on-policy and off-policy methods on a range of MuJoCo benchmark tasks while still providing benefits for transfer learning. In these tasks, our approach learns a diverse set of options, each of whose state-action space has strong coherence.

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2006.14363 [cs.AI]
	(or arXiv:2006.14363v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2006.14363

Submission history

From: Chenghao Li
[v1] Thu, 25 Jun 2020 13:06:59 UTC (1,006 KB)
