Combining imagination and heuristics to learn strategies that generalize

Peterson, Erik J; Müyesser, Necati Alp; Verstynen, Timothy; Dunovan, Kyle

Computer Science > Artificial Intelligence

arXiv:1809.03406 (cs)

[Submitted on 10 Sep 2018 (v1), last revised 11 Jun 2020 (this version, v2)]

Title:Combining imagination and heuristics to learn strategies that generalize

Authors:Erik J Peterson, Necati Alp Müyesser, Timothy Verstynen, Kyle Dunovan

View PDF

Abstract:Deep reinforcement learning can match or exceed human performance in stable contexts, but with minor changes to the environment artificial networks, unlike humans, often cannot adapt. Humans rely on a combination of heuristics to simplify computational load and imagination to extend experiential learning to new and more challenging environments. Motivated by theories of the hierarchical organization of the human prefrontal networks, we have developed a model of hierarchical reinforcement learning that combines both heuristics and imagination into a stumbler-strategist network. We test performance of this network using Wythoff's game, a gridworld environment with a known optimal strategy. We show that a heuristic labeling of each position as hot or cold, combined with imagined play, both accelerates learning and promotes transfer to novel games, while also improving model interpretability.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1809.03406 [cs.AI]
	(or arXiv:1809.03406v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1809.03406

Submission history

From: Erik Peterson [view email]
[v1] Mon, 10 Sep 2018 15:43:57 UTC (2,478 KB)
[v2] Thu, 11 Jun 2020 20:40:35 UTC (5,160 KB)

Computer Science > Artificial Intelligence

Title:Combining imagination and heuristics to learn strategies that generalize

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Combining imagination and heuristics to learn strategies that generalize

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators