Diagonal Rescaling For Neural Networks

Lafond, Jean; Vasilache, Nicolas; Bottou, Léon

Computer Science > Machine Learning

arXiv:1705.09319 (cs)

[Submitted on 25 May 2017]

Title:Diagonal Rescaling For Neural Networks

Authors:Jean Lafond, Nicolas Vasilache, Léon Bottou

View PDF

Abstract:We define a second-order neural network stochastic gradient training algorithm whose block-diagonal structure effectively amounts to normalizing the unit activations. Investigating why this algorithm lacks in robustness then reveals two interesting insights. The first insight suggests a new way to scale the stepsizes, clarifying popular algorithms such as RMSProp as well as old neural network tricks such as fanin stepsize scaling. The second insight stresses the practical importance of dealing with fast changes of the curvature of the cost.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1705.09319 [cs.LG]
	(or arXiv:1705.09319v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1705.09319

Submission history

From: Léon Bottou [view email]
[v1] Thu, 25 May 2017 18:33:24 UTC (448 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2017-05

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jean Lafond
Nicolas Vasilache
Léon Bottou

export BibTeX citation

Computer Science > Machine Learning

Title:Diagonal Rescaling For Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Diagonal Rescaling For Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators