On Linear Stability of SGD and Input-Smoothness of Neural Networks

Ma, Chao; Ying, Lexing

Computer Science > Machine Learning

arXiv:2105.13462 (cs)

[Submitted on 27 May 2021 (v1), last revised 28 Nov 2021 (this version, v2)]

Title:On Linear Stability of SGD and Input-Smoothness of Neural Networks

Authors:Chao Ma, Lexing Ying

View PDF

Abstract:The multiplicative structure of parameters and input data in the first layer of neural networks is explored to build connection between the landscape of the loss function with respect to parameters and the landscape of the model function with respect to input data. By this connection, it is shown that flat minima regularize the gradient of the model function, which explains the good generalization performance of flat minima. Then, we go beyond the flatness and consider high-order moments of the gradient noise, and show that Stochastic Gradient Descent (SGD) tends to impose constraints on these moments by a linear stability analysis of SGD around global minima. Together with the multiplicative structure, we identify the Sobolev regularization effect of SGD, i.e. SGD regularizes the Sobolev seminorms of the model function with respect to the input data. Finally, bounds for generalization error and adversarial robustness are provided for solutions found by SGD under assumptions of the data distribution.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2105.13462 [cs.LG]
	(or arXiv:2105.13462v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2105.13462

Submission history

From: Chao Ma [view email]
[v1] Thu, 27 May 2021 21:49:21 UTC (88 KB)
[v2] Sun, 28 Nov 2021 19:29:48 UTC (263 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Chao Ma
Lexing Ying

export BibTeX citation

Computer Science > Machine Learning

Title:On Linear Stability of SGD and Input-Smoothness of Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Linear Stability of SGD and Input-Smoothness of Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators