Showing 1–1 of 1 results for author: Freelan, D

Search v0.5.6 released 2020-02-24

arXiv:1804.09817 [pdf, other]

cs.AI

Multiagent Soft Q-Learning

Authors: Ermo Wei, Drew Wicke, David Freelan, Sean Luke

Abstract: Policy gradient methods are often applied to reinforcement learning in continuous multiagent games. These methods perform local search in the joint-action space, and as we show, they are susceptable to a game-theoretic pathology known as relative overgeneralization. To resolve this issue, we propose Multiagent Soft Q-learning, which can be seen as the analogue of applying Q-learning to continuous… ▽ More Policy gradient methods are often applied to reinforcement learning in continuous multiagent games. These methods perform local search in the joint-action space, and as we show, they are susceptable to a game-theoretic pathology known as relative overgeneralization. To resolve this issue, we propose Multiagent Soft Q-learning, which can be seen as the analogue of applying Q-learning to continuous controls. We compare our method to MADDPG, a state-of-the-art approach, and show that our method achieves better coordination in multiagent cooperative tasks, converging to better local optima in the joint action space. △ Less

Submitted 25 April, 2018; originally announced April 2018.

Comments: Accepted in AAAI 18 Spring Symposium

Search v0.5.6 released 2020-02-24