Training recurrent networks online without backtracking

Ollivier, Yann; Tallec, Corentin; Charpiat, Guillaume

Computer Science > Neural and Evolutionary Computing

arXiv:1507.07680 (cs)

[Submitted on 28 Jul 2015 (v1), last revised 20 Nov 2015 (this version, v2)]

Title:Training recurrent networks online without backtracking

Authors:Yann Ollivier, Corentin Tallec, Guillaume Charpiat

View PDF

Abstract:We introduce the "NoBackTrack" algorithm to train the parameters of dynamical systems such as recurrent neural networks. This algorithm works in an online, memoryless setting, thus requiring no backpropagation through time, and is scalable, avoiding the large computational and memory cost of maintaining the full gradient of the current state with respect to the parameters.
The algorithm essentially maintains, at each time, a single search direction in parameter space. The evolution of this search direction is partly stochastic and is constructed in such a way to provide, at every time, an unbiased random estimate of the gradient of the loss function with respect to the parameters. Because the gradient estimate is unbiased, on average over time the parameter is updated as it should.
The resulting gradient estimate can then be fed to a lightweight Kalman-like filter to yield an improved algorithm. For recurrent neural networks, the resulting algorithms scale linearly with the number of parameters.
Small-scale experiments confirm the suitability of the approach, showing that the stochastic approximation of the gradient introduced in the algorithm is not detrimental to learning. In particular, the Kalman-like version of NoBackTrack is superior to backpropagation through time (BPTT) when the time span of dependencies in the data is longer than the truncation span for BPTT.

Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1507.07680 [cs.NE]
	(or arXiv:1507.07680v2 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1507.07680

Submission history

From: Yann Ollivier [view email]
[v1] Tue, 28 Jul 2015 08:26:50 UTC (36 KB)
[v2] Fri, 20 Nov 2015 22:29:38 UTC (405 KB)

Computer Science > Neural and Evolutionary Computing

Title:Training recurrent networks online without backtracking

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Training recurrent networks online without backtracking

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators