Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages

Kuutti, Sampo; Bowden, Richard; Fallah, Saber

doi:10.3390/s21062032

Computer Science > Machine Learning

arXiv:2103.09726 (cs)

[Submitted on 17 Mar 2021]

Title:Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages

Authors:Sampo Kuutti, Richard Bowden, Saber Fallah

View PDF

Abstract:The use of neural networks and reinforcement learning has become increasingly popular in autonomous vehicle control. However, the opaqueness of the resulting control policies presents a significant barrier to deploying neural network-based control in autonomous vehicles. In this paper, we present a reinforcement learning based approach to autonomous vehicle longitudinal control, where the rule-based safety cages provide enhanced safety for the vehicle as well as weak supervision to the reinforcement learning agent. By guiding the agent to meaningful states and actions, this weak supervision improves the convergence during training and enhances the safety of the final trained policy. This rule-based supervisory controller has the further advantage of being fully interpretable, thereby enabling traditional validation and verification approaches to ensure the safety of the vehicle. We compare models with and without safety cages, as well as models with optimal and constrained model parameters, and show that the weak supervision consistently improves the safety of exploration, speed of convergence, and model performance. Additionally, we show that when the model parameters are constrained or sub-optimal, the safety cages can enable a model to learn a safe driving policy even when the model could not be trained to drive through reinforcement learning alone.

Comments:	Published in Sensors
Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2103.09726 [cs.LG]
	(or arXiv:2103.09726v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2103.09726
Journal reference:	Sensors 2021, 21, 2032
Related DOI:	https://doi.org/10.3390/s21062032

Submission history

From: Sampo Kuutti [view email]
[v1] Wed, 17 Mar 2021 15:30:36 UTC (1,741 KB)

Computer Science > Machine Learning

Title:Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Weakly Supervised Reinforcement Learning for Autonomous Highway Driving via Virtual Safety Cages

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators