IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

Luo, Michael; Yao, Jiahao; Liaw, Richard; Liang, Eric; Stoica, Ion

Computer Science > Machine Learning

arXiv:1912.00167v2 (cs)

[Submitted on 30 Nov 2019 (v1), revised 11 Dec 2019 (this version, v2), latest version 23 Jan 2020 (v3)]

Title:IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

Authors:Michael Luo, Jiahao Yao, Richard Liaw, Eric Liang, Ion Stoica

View PDF

Abstract:The practical usage of reinforcement learning agents is often bottlenecked by the duration of training time. To accelerate training, practitioners often turn to distributed reinforcement learning architectures to parallelize and accelerate the training process. However, modern methods for scalable reinforcement learning (RL) often tradeoff between the throughput of samples that an RL agent can learn from (sample throughput) and the quality of learning from each sample (sample efficiency). In these scalable RL architectures, as one increases sample throughput (i.e. increasing parallelization in IMPALA), sample efficiency drops significantly. To address this, we propose a new distributed reinforcement learning algorithm, IMPACT. IMPACT extends IMPALA with three changes: a target network for stabilizing the surrogate objective, a circular buffer, and truncated importance sampling. In discrete action-space environments, we show that IMPACT attains higher reward and, simultaneously, achieves up to 30% decrease in training wall-time than that of IMPALA. For continuous control environments, IMPACT trains faster than existing scalable agents while preserving the sample efficiency of synchronous PPO.

Comments:	14 pages, 10 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1912.00167 [cs.LG]
	(or arXiv:1912.00167v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.00167

Submission history

From: Michael Luo [view email]
[v1] Sat, 30 Nov 2019 09:44:19 UTC (4,174 KB)
[v2] Wed, 11 Dec 2019 09:23:15 UTC (4,174 KB)
[v3] Thu, 23 Jan 2020 07:30:51 UTC (4,174 KB)

Computer Science > Machine Learning

Title:IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators