Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry

Chen, Ziyi; Zhou, Yi; Xu, Tengyu; Liang, Yingbin

arXiv:2102.04653 (math)

[Submitted on 9 Feb 2021 (v1), last revised 17 Feb 2021 (this version, v2)]

Abstract: The gradient descent-ascent (GDA) algorithm has been widely applied to solve minimax optimization problems. To obtain convergent policy parameters in minimax optimization, it is important that GDA generates convergent variable sequences rather than merely convergent sequences of function values or gradient norms. However, the variable convergence of GDA has been proved only under convexity geometries, and it is not yet understood for general nonconvex minimax optimization. This paper fills this gap by studying the convergence of a more general proximal-GDA for regularized nonconvex-strongly-concave minimax optimization. Specifically, we show that proximal-GDA admits a novel Lyapunov function, which monotonically decreases in the minimax optimization process and drives the variable sequence to a critical point. By leveraging this Lyapunov function and the KŁ geometry that parameterizes the local geometries of general nonconvex functions, we formally establish the variable convergence of proximal-GDA to a critical point $x^*$, i.e., $x_t\to x^*, y_t\to y^*(x^*)$. Furthermore, over the full spectrum of the KŁ-parameterized geometry, we show that proximal-GDA achieves different types of convergence rates, ranging from sublinear convergence up to finite-step convergence, depending on the geometry associated with the KŁ parameter. This is the first theoretical result on variable convergence for nonconvex minimax optimization.
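To make the proximal-GDA iteration concrete, the following is a minimal sketch on a toy problem. It is not taken from the paper. The objective $f(x,y) = \sin(x)\,y - \tfrac{1}{2}y^2$ (nonconvex in $x$, strongly concave in $y$), the regularizer $g(x) = \lambda|x|$, the step sizes, and the choice to use the updated $x_{t+1}$ in the ascent step are all assumptions made for illustration. For this toy problem $y^*(x) = \sin(x)$, and $x^* = 0$ is a critical point of $\tfrac{1}{2}\sin^2(x) + \lambda|x|$.

```python
import math


def soft_threshold(v, tau):
    """Proximal map of tau * |.|: shrink v toward zero by tau."""
    if v > tau:
        return v - tau
    if v < -tau:
        return v + tau
    return 0.0


def proximal_gda(x0, y0, eta_x=0.05, eta_y=0.5, lam=0.1, num_iters=2000):
    """Illustrative proximal-GDA loop for the toy problem
    min_x max_y sin(x) * y - 0.5 * y**2 + lam * |x|.

    All names and default values here are hypothetical choices for this
    sketch, not parameters prescribed by the paper.
    """
    x, y = x0, y0
    for _ in range(num_iters):
        # Proximal gradient descent step on x (gradient of f in x is cos(x) * y).
        grad_x = math.cos(x) * y
        x = soft_threshold(x - eta_x * grad_x, eta_x * lam)
        # Gradient ascent step on y at the updated x (assumed ordering);
        # there is no regularizer on y here, so the proximal map is the identity.
        grad_y = math.sin(x) - y
        y = y + eta_y * grad_y
    return x, y


x_final, y_final = proximal_gda(1.0, 0.0)
print(x_final, y_final)
```

Starting from $(x_0, y_0) = (1, 0)$, the iterates themselves (not only function values) settle at $x^* = 0$ and $y^*(x^*) = \sin(0) = 0$, which is the kind of variable convergence the abstract refers to. This single run illustrates the behavior on one toy instance and does not verify the paper's general result.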

Comments:	To appear in ICLR 2021
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
Cite as:	arXiv:2102.04653 [math.OC]
	(or arXiv:2102.04653v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2102.04653

Submission history

From: Ziyi Chen
[v1] Tue, 9 Feb 2021 05:35:53 UTC (715 KB)
[v2] Wed, 17 Feb 2021 16:51:36 UTC (753 KB)