Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments

Zheng, Yan; Hao, Jianye; Zhang, Zongzhang

Computer Science > Multiagent Systems

arXiv:1802.08534 (cs)

[Submitted on 23 Feb 2018 (v1), last revised 14 Apr 2018 (this version, v2)]

Title:Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments

Authors:Yan Zheng, Jianye Hao, Zongzhang Zhang

View PDF

Abstract:Recently, multiagent deep reinforcement learning (DRL) has received increasingly wide attention. Existing multiagent DRL algorithms are inefficient when facing with the non-stationarity due to agents update their policies simultaneously in stochastic cooperative environments. This paper extends the recently proposed weighted double estimator to the multiagent domain and propose a multiagent DRL framework, named weighted double deep Q-network (WDDQN). By utilizing the weighted double estimator and the deep neural network, WDDQN can not only reduce the bias effectively but also be extended to scenarios with raw visual inputs. To achieve efficient cooperation in the multiagent domain, we introduce the lenient reward network and the scheduled replay strategy. Experiments show that the WDDQN outperforms the existing DRL and multiaent DRL algorithms, i.e., double DQN and lenient Q-learning, in terms of the average reward and the convergence rate in stochastic cooperative environments.

Comments:	8 pages, 7 figures
Subjects:	Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1802.08534 [cs.MA]
	(or arXiv:1802.08534v2 [cs.MA] for this version)
	https://doi.org/10.48550/arXiv.1802.08534

Submission history

From: Yan Zheng [view email]
[v1] Fri, 23 Feb 2018 14:03:22 UTC (1,614 KB)
[v2] Sat, 14 Apr 2018 16:34:29 UTC (1,102 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.MA

< prev | next >

new | recent | 2018-02

Change to browse by:

cs
cs.AI
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yan Zheng
Jianye Hao
Zongzhang Zhang

export BibTeX citation

Computer Science > Multiagent Systems

Title:Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multiagent Systems

Title:Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators