ADADELTA: An Adaptive Learning Rate Method

Zeiler, Matthew D.

Computer Science > Machine Learning

arXiv:1212.5701 (cs)

[Submitted on 22 Dec 2012]

Title:ADADELTA: An Adaptive Learning Rate Method

Authors:Matthew D. Zeiler

View PDF

Abstract:We present a novel per-dimension learning rate method for gradient descent called ADADELTA. The method dynamically adapts over time using only first order information and has minimal computational overhead beyond vanilla stochastic gradient descent. The method requires no manual tuning of a learning rate and appears robust to noisy gradient information, different model architecture choices, various data modalities and selection of hyperparameters. We show promising results compared to other methods on the MNIST digit classification task using a single machine and on a large scale voice dataset in a distributed cluster environment.

Comments:	6 pages
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1212.5701 [cs.LG]
	(or arXiv:1212.5701v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1212.5701

Submission history

From: Matthew Zeiler [view email]
[v1] Sat, 22 Dec 2012 15:46:49 UTC (266 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2012-12

Change to browse by:

References & Citations

9 blog links

(what is this?)

DBLP - CS Bibliography

listing | bibtex

Matthew D. Zeiler

export BibTeX citation

Computer Science > Machine Learning

Title:ADADELTA: An Adaptive Learning Rate Method

Submission history

Access Paper:

References & Citations

9 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ADADELTA: An Adaptive Learning Rate Method

Submission history

Access Paper:

References & Citations

9 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators