Learning to Cooperate via Policy Search

Peshkin, Leonid; Kim, Kee-Eung; Meuleau, Nicolas; Kaelbling, Leslie Pack

arXiv:1408.1484 (cs)

[Submitted on 7 Aug 2014]

Abstract: Cooperative games are those in which both agents share the same payoff structure. Value-based reinforcement-learning algorithms, such as variants of Q-learning, have been applied to learning cooperative games, but they only apply when the game state is completely observable to both agents. Policy search methods are a reasonable alternative to value-based methods for partially observable environments. In this paper, we provide a gradient-based distributed policy-search method for cooperative games and compare the notion of local optimum to that of Nash equilibrium. We demonstrate the effectiveness of this method experimentally in a small, partially observable simulated soccer domain.
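
To make the idea of distributed gradient-based policy search concrete, here is a minimal sketch. It is not the paper's algorithm or its soccer domain. It assumes a REINFORCE-style likelihood-ratio update, a stateless two-action game, and a payoff table (`PAYOFF`) invented for illustration. The names `softmax`, `sample`, and `train` are likewise hypothetical. Each agent keeps its own parameters, observes only its own action and the common reward, and climbs the gradient of the shared expected payoff.

```python
import math
import random

# Hypothetical common payoff: both agents receive PAYOFF[a1][a2].
# (0, 0) and (1, 1) are both Nash equilibria; (0, 0) pays more.
PAYOFF = [[1.0, 0.0],
          [0.0, 0.5]]


def softmax(prefs):
    """Turn action preferences into a probability distribution."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def sample(probs, rng):
    """Draw an action index according to probs."""
    r = rng.random()
    cumulative = 0.0
    for action, p in enumerate(probs):
        cumulative += p
        if r < cumulative:
            return action
    return len(probs) - 1


def train(episodes=5000, lr=0.1, seed=0):
    """Run independent likelihood-ratio updates for two agents."""
    rng = random.Random(seed)
    # One private parameter vector per agent; neither reads the other's.
    thetas = [[0.0, 0.0], [0.0, 0.0]]
    for _ in range(episodes):
        policies = [softmax(t) for t in thetas]
        actions = [sample(p, rng) for p in policies]
        reward = PAYOFF[actions[0]][actions[1]]  # shared by both agents
        for theta, probs, action in zip(thetas, policies, actions):
            for a in range(len(theta)):
                # Derivative of log pi(action) w.r.t. theta[a] for softmax.
                grad_log = (1.0 if a == action else 0.0) - probs[a]
                theta[a] += lr * reward * grad_log
    return [softmax(t) for t in thetas]


if __name__ == "__main__":
    for i, policy in enumerate(train()):
        print(f"agent {i}: {[round(p, 3) for p in policy]}")
```

In this toy game, a run may settle on either coordinated action pair. Both are equilibria, so the example also hints at why a local optimum of the gradient method need not be the best joint policy.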

Comments:	Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)
Subjects:	Artificial Intelligence (cs.AI)
Report number:	UAI-P-2000-PG-489-496
Cite as:	arXiv:1408.1484 [cs.AI]
	(or arXiv:1408.1484v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1408.1484

Submission history

From: Leonid Peshkin [via AUAI proxy]
[v1] Thu, 7 Aug 2014 06:25:37 UTC (389 KB)
