SGD and Hogwild! Convergence Without the Bounded Gradients Assumption

Nguyen, Lam M.; Nguyen, Phuong Ha; van Dijk, Marten; Richtárik, Peter; Scheinberg, Katya; Takáč, Martin

Mathematics > Optimization and Control

arXiv:1802.03801 (math)

[Submitted on 11 Feb 2018 (v1), last revised 8 Jun 2018 (this version, v2)]

Title:SGD and Hogwild! Convergence Without the Bounded Gradients Assumption

Authors:Lam M. Nguyen, Phuong Ha Nguyen, Marten van Dijk, Peter Richtárik, Katya Scheinberg, Martin Takáč

View PDF

Abstract:Stochastic gradient descent (SGD) is the optimization algorithm of choice in many machine learning applications such as regularized empirical risk minimization and training deep neural networks. The classical convergence analysis of SGD is carried out under the assumption that the norm of the stochastic gradient is uniformly bounded. While this might hold for some loss functions, it is always violated for cases where the objective function is strongly convex. In (Bottou et al.,2016), a new analysis of convergence of SGD is performed under the assumption that stochastic gradients are bounded with respect to the true gradient norm. Here we show that for stochastic problems arising in machine learning such bound always holds; and we also propose an alternative convergence analysis of SGD with diminishing learning rate regime, which results in more relaxed conditions than those in (Bottou et al.,2016). We then move on the asynchronous parallel setting, and prove convergence of Hogwild! algorithm in the same regime, obtaining the first convergence results for this method in the case of diminished learning rate.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1802.03801 [math.OC]
	(or arXiv:1802.03801v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1802.03801
Journal reference:	Proceedings of the 35th International Conference on Machine Learning, PMLR 80:3747-3755, 2018

Submission history

From: Lam Nguyen [view email]
[v1] Sun, 11 Feb 2018 19:35:46 UTC (1,385 KB)
[v2] Fri, 8 Jun 2018 04:23:40 UTC (1,388 KB)

Mathematics > Optimization and Control

Title:SGD and Hogwild! Convergence Without the Bounded Gradients Assumption

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:SGD and Hogwild! Convergence Without the Bounded Gradients Assumption

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators