Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

Lehman, Joel; Chen, Jay; Clune, Jeff; Stanley, Kenneth O.

Computer Science > Neural and Evolutionary Computing

arXiv:1712.06563 (cs)

[Submitted on 18 Dec 2017 (v1), last revised 1 May 2018 (this version, v3)]

Title:Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

Authors:Joel Lehman, Jay Chen, Jeff Clune, Kenneth O. Stanley

View PDF

Abstract:While neuroevolution (evolving neural networks) has a successful track record across a variety of domains from reinforcement learning to artificial life, it is rarely applied to large, deep neural networks. A central reason is that while random mutation generally works in low dimensions, a random perturbation of thousands or millions of weights is likely to break existing functionality, providing no learning signal even if some individual weight changes were beneficial. This paper proposes a solution by introducing a family of safe mutation (SM) operators that aim within the mutation operator itself to find a degree of change that does not alter network behavior too much, but still facilitates exploration. Importantly, these SM operators do not require any additional interactions with the environment. The most effective SM variant capitalizes on the intriguing opportunity to scale the degree of mutation of each individual weight according to the sensitivity of the network's outputs to that weight, which requires computing the gradient of outputs with respect to the weights (instead of the gradient of error, as in conventional deep learning). This safe mutation through gradients (SM-G) operator dramatically increases the ability of a simple genetic algorithm-based neuroevolution method to find solutions in high-dimensional domains that require deep and/or recurrent neural networks (which tend to be particularly brittle to mutation), including domains that require processing raw pixels. By improving our ability to evolve deep neural networks, this new safer approach to mutation expands the scope of domains amenable to neuroevolution.

Subjects:	Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1712.06563 [cs.NE]
	(or arXiv:1712.06563v3 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1712.06563

Submission history

From: Joel Lehman [view email]
[v1] Mon, 18 Dec 2017 18:16:51 UTC (1,476 KB)
[v2] Wed, 17 Jan 2018 18:45:26 UTC (1,862 KB)
[v3] Tue, 1 May 2018 20:18:32 UTC (2,661 KB)

Computer Science > Neural and Evolutionary Computing

Title:Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Safe Mutations for Deep and Recurrent Neural Networks through Output Gradients

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators