Skip to main content

Showing 1–1 of 1 results for author: Freelan, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:1804.09817  [pdf, other

    cs.AI

    Multiagent Soft Q-Learning

    Authors: Ermo Wei, Drew Wicke, David Freelan, Sean Luke

    Abstract: Policy gradient methods are often applied to reinforcement learning in continuous multiagent games. These methods perform local search in the joint-action space, and as we show, they are susceptable to a game-theoretic pathology known as relative overgeneralization. To resolve this issue, we propose Multiagent Soft Q-learning, which can be seen as the analogue of applying Q-learning to continuous… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: Accepted in AAAI 18 Spring Symposium