Modelling Agent Policies with Interpretable Imitation Learning

Bewley, Tom; Lawry, Jonathan; Richards, Arthur

arXiv:2006.11309 (cs)

[Submitted on 19 Jun 2020]

Abstract: As we deploy autonomous agents in safety-critical domains, it becomes important to develop an understanding of their internal mechanisms and representations. We outline an approach to imitation learning for reverse-engineering black box agent policies in MDP environments, yielding simplified, interpretable models in the form of decision trees. As part of this process, we explicitly model and learn agents' latent state representations by selecting from a large space of candidate features constructed from the Markov state. We present initial promising results from an implementation in a multi-agent traffic environment.
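
The idea in the abstract (fit a decision tree to a black box policy's observed state-action pairs, choosing splits from candidate features built from the Markov state) can be illustrated with a toy sketch. This is not the authors' implementation: the two-dimensional state, the `black_box_policy` function, the `CANDIDATES` feature set, and the greedy Gini-based splitting are all assumptions made purely for illustration.

```python
from collections import Counter
from itertools import product

# Hypothetical black box policy: it acts on a latent feature (the gap y - x)
# that is not itself a component of the Markov state (x, y).
def black_box_policy(state):
    x, y = state
    return "brake" if y - x < 3 else "go"

# Hypothetical candidate features constructed from the Markov state.
CANDIDATES = {
    "x": lambda s: s[0],
    "y": lambda s: s[1],
    "x+y": lambda s: s[0] + s[1],
    "y-x": lambda s: s[1] - s[0],
}

def gini(labels):
    """Gini impurity of a non-empty list of action labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(states, actions):
    """Return (impurity, feature name, threshold) of the best split, or None."""
    best = None
    for name, f in CANDIDATES.items():
        values = sorted({f(s) for s in states})
        # Candidate thresholds are midpoints between adjacent distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [a for s, a in zip(states, actions) if f(s) < t]
            right = [a for s, a in zip(states, actions) if f(s) >= t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(actions)
            if best is None or score < best[0]:
                best = (score, name, t)
    return best

def fit(states, actions, depth=2):
    """Greedily grow a tree; leaves are action labels, nodes are tuples."""
    majority = Counter(actions).most_common(1)[0][0]
    if depth == 0 or len(set(actions)) == 1:
        return majority
    split = best_split(states, actions)
    if split is None:
        return majority
    _, name, t = split
    f = CANDIDATES[name]
    left = [(s, a) for s, a in zip(states, actions) if f(s) < t]
    right = [(s, a) for s, a in zip(states, actions) if f(s) >= t]
    return (name, t, fit(*zip(*left), depth - 1), fit(*zip(*right), depth - 1))

def predict(tree, state):
    """Walk the tree from the root until a leaf (an action label) is reached."""
    while isinstance(tree, tuple):
        name, t, left, right = tree
        tree = left if CANDIDATES[name](state) < t else right
    return tree

# Observe the black box on every state in a small grid, then imitate it.
states = list(product(range(10), repeat=2))
actions = [black_box_policy(s) for s in states]
tree = fit(states, actions)
accuracy = sum(predict(tree, s) == a for s, a in zip(states, actions)) / len(states)
print(tree, accuracy)
```

In this toy setting the learned tree splits on the `y-x` feature, so the model both reproduces the policy's actions and exposes the latent feature the policy appears to rely on. A real traffic environment would need a far larger candidate feature space and held-out evaluation.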

Comments:	6 pages, 3 figures; under review for the 1st TAILOR Workshop, due to take place 29-30 August 2020 in Santiago de Compostela
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2006.11309 [cs.AI]
	(or arXiv:2006.11309v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2006.11309

Submission history

From: Tom Bewley
[v1] Fri, 19 Jun 2020 18:19:08 UTC (1,928 KB)
