A Unified Convergence Analysis for Shuffling-Type Gradient Methods

Nguyen, Lam M.; Tran-Dinh, Quoc; Phan, Dzung T.; Nguyen, Phuong Ha; van Dijk, Marten

Mathematics > Optimization and Control

arXiv:2002.08246 (math)

[Submitted on 19 Feb 2020 (v1), last revised 20 Sep 2021 (this version, v2)]

Title:A Unified Convergence Analysis for Shuffling-Type Gradient Methods

Authors:Lam M. Nguyen, Quoc Tran-Dinh, Dzung T. Phan, Phuong Ha Nguyen, Marten van Dijk

View PDF

Abstract:In this paper, we propose a unified convergence analysis for a class of generic shuffling-type gradient methods for solving finite-sum optimization problems. Our analysis works with any sampling without replacement strategy and covers many known variants such as randomized reshuffling, deterministic or randomized single permutation, and cyclic and incremental gradient schemes. We focus on two different settings: strongly convex and nonconvex problems, but also discuss the non-strongly convex case. Our main contribution consists of new non-asymptotic and asymptotic convergence rates for a wide class of shuffling-type gradient methods in both nonconvex and convex settings. We also study uniformly randomized shuffling variants with different learning rates and model assumptions. While our rate in the nonconvex case is new and significantly improved over existing works under standard assumptions, the rate on the strongly convex one matches the existing best-known rates prior to this paper up to a constant factor without imposing a bounded gradient condition. Finally, we empirically illustrate our theoretical results via two numerical examples: nonconvex logistic regression and neural network training examples. As byproducts, our results suggest some appropriate choices for diminishing learning rates in certain shuffling variants.

Comments:	Journal of Machine Learning Research, 2021
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2002.08246 [math.OC]
	(or arXiv:2002.08246v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2002.08246

Submission history

From: Lam Nguyen [view email]
[v1] Wed, 19 Feb 2020 15:45:41 UTC (4,944 KB)
[v2] Mon, 20 Sep 2021 00:44:31 UTC (661 KB)

Mathematics > Optimization and Control

Title:A Unified Convergence Analysis for Shuffling-Type Gradient Methods

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:A Unified Convergence Analysis for Shuffling-Type Gradient Methods

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators