Towards Sample Efficient Agents through Algorithmic Alignment

Li, Mingxuan; Littman, Michael L.

arXiv:2008.03229 (cs)

[Submitted on 7 Aug 2020 (v1), last revised 21 Oct 2021 (this version, v5)]

Abstract: In this work, we propose and explore the Deep Graph Value Network (DeepGV) as a promising method to work around sample complexity in deep reinforcement-learning agents using a message-passing mechanism. The main idea is that the agent should be guided by structured non-neural-network algorithms such as dynamic programming. According to recent advances in algorithmic alignment, neural networks with structured computation procedures can be trained efficiently. We demonstrate the potential of graph neural networks in supporting sample-efficient learning by showing that the Deep Graph Value Network can outperform unstructured baselines by a large margin in solving Markov Decision Processes (MDPs). We believe this would open up a new avenue for structured agent design. See this https URL for the code.
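The abstract names dynamic programming as the kind of structured algorithm a message-passing network can align with. As a rough illustration of that correspondence, and not the authors' DeepGV architecture, the sketch below writes classical value iteration as rounds of message passing over a state graph: each state receives one message per outgoing edge (reward plus discounted successor value) and aggregates them with a max. The toy chain MDP, the function name `message_passing_value_iteration`, and the `Edges` type are all hypothetical and chosen only for this example.

```python
from typing import Dict, List, Tuple

# edges[s] lists (next_state, reward) pairs, one per available action.
# Transitions are assumed deterministic to keep the sketch short.
Edges = Dict[int, List[Tuple[int, float]]]


def message_passing_value_iteration(
    edges: Edges, gamma: float = 0.9, rounds: int = 50
) -> Dict[int, float]:
    """Run value iteration as synchronous message passing on a state graph."""
    values = {s: 0.0 for s in edges}
    for _ in range(rounds):
        new_values = {}
        for s, outgoing in edges.items():
            # Message along each edge: immediate reward plus the
            # discounted value of the successor state.
            messages = [r + gamma * values[s2] for s2, r in outgoing]
            # Aggregation step: max over messages (Bellman optimality backup).
            # States with no outgoing edges are treated as terminal.
            new_values[s] = max(messages) if messages else 0.0
        values = new_values
    return values


# Hypothetical 4-state chain: reaching state 3 from state 2 pays reward 1.
CHAIN: Edges = {
    0: [(1, 0.0)],
    1: [(0, 0.0), (2, 0.0)],
    2: [(1, 0.0), (3, 1.0)],
    3: [],  # terminal state
}

if __name__ == "__main__":
    print(message_passing_value_iteration(CHAIN))
```

In a learned variant, the hand-written message and aggregation steps would be replaced by trainable functions. The structural match between the two is what the algorithmic-alignment argument in the abstract relies on.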

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2008.03229 [cs.AI]
	(or arXiv:2008.03229v5 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2008.03229

Submission history

From: Mingxuan Li
[v1] Fri, 7 Aug 2020 15:44:32 UTC (408 KB)
[v2] Tue, 8 Sep 2020 15:55:33 UTC (408 KB)
[v3] Fri, 18 Sep 2020 11:31:00 UTC (408 KB)
[v4] Mon, 7 Dec 2020 14:19:22 UTC (411 KB)
[v5] Thu, 21 Oct 2021 09:28:04 UTC (411 KB)
