Evolution of Q Values for Deep Q Learning in Stable Baselines

Andrews, Matthew; Dibek, Cemil; Palyutina, Karina

arXiv:2004.11766 (cs)

[Submitted on 24 Apr 2020]

Abstract: We investigate the evolution of the Q values in the implementation of Deep Q Learning (DQL) in the Stable Baselines library. Stable Baselines incorporates the latest Reinforcement Learning techniques and achieves superhuman performance in many game environments. However, for some simple non-game environments, the DQL implementation in Stable Baselines can struggle to find the correct actions. In this paper we aim to understand the types of environment in which this suboptimal behavior can occur, and we investigate the corresponding evolution of the Q values for individual states.
We compare a smart TrafficLight environment (where performance is poor) with the OpenAI Gym FrozenLake environment (where performance is perfect). We observe that DQL struggles with TrafficLight because actions are reversible, and hence the Q values of the different actions in a given state are closer together than in FrozenLake. We then investigate the evolution of the Q values using a recent decomposition technique of Achiam et al. We observe that for TrafficLight, the function approximation error and the complex relationships between the states lead to a situation where some Q values meander far from optimal.
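
The claim that reversible actions bring the Q values in a state closer together can be illustrated with a toy example. The sketch below is not the paper's TrafficLight or FrozenLake environment and does not use Stable Baselines; it is a hypothetical five-state chain, with an assumed discount factor of 0.9, solved exactly by tabular value iteration. In one variant the "wrong" action only steps back one state (reversible); in the other it ends the episode with no reward (irreversible, loosely analogous to falling into a hole). All names (`reversible_step`, `irreversible_step`, `optimal_q`, `action_gap`) are made up for this illustration.

```python
GAMMA = 0.9        # discount factor (assumed for illustration)
GOAL = 5           # states 0..4 are non-terminal; entering state 5 ends the episode
LEFT, RIGHT = 0, 1


def move_right(s):
    """Step towards the goal; reward 1 is paid on entering the goal state."""
    s2 = s + 1
    return (s2, 1.0, True) if s2 == GOAL else (s2, 0.0, False)


def reversible_step(s, a):
    """LEFT only moves one state back, so a mistake can be undone later."""
    if a == RIGHT:
        return move_right(s)
    return max(s - 1, 0), 0.0, False


def irreversible_step(s, a):
    """LEFT ends the episode with zero reward, so a mistake is final."""
    if a == RIGHT:
        return move_right(s)
    return s, 0.0, True


def optimal_q(step, n_states=GOAL, n_actions=2, gamma=GAMMA, sweeps=200):
    """Tabular value iteration for a deterministic MDP given by step(s, a)."""
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(sweeps):
        for s in range(n_states):
            for a in range(n_actions):
                s2, r, done = step(s, a)
                # Bellman optimality backup; no bootstrap from terminal states.
                q[s][a] = r if done else r + gamma * max(q[s2])
    return q


def action_gap(q_row):
    """Difference between the best and the worst Q value in one state."""
    return max(q_row) - min(q_row)


q_rev = optimal_q(reversible_step)
q_irr = optimal_q(irreversible_step)
for s in range(GOAL):
    print(s, round(action_gap(q_rev[s]), 4), round(action_gap(q_irr[s]), 4))
```

In this toy chain, for every state s >= 1 the reversible gap equals (1 - GAMMA**2) times the optimal state value, while the irreversible gap equals the full optimal state value. A function approximator therefore needs to be roughly five times more accurate (for GAMMA = 0.9) to rank the two actions correctly in the reversible case, which is one way to read the abstract's explanation.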

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2004.11766 [cs.LG]
	(or arXiv:2004.11766v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2004.11766

Submission history

From: Cemil Dibek
[v1] Fri, 24 Apr 2020 14:13:46 UTC (1,200 KB)
