Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization

Bombari, Simone; Amani, Mohammad Hossein; Mondelli, Marco

arXiv:2205.10217 (stat)

[Submitted on 20 May 2022 (v1), last revised 21 May 2023 (this version, v3)]

Abstract: The Neural Tangent Kernel (NTK) has emerged as a powerful tool to provide memorization, optimization and generalization guarantees in deep neural networks. A line of work has studied the NTK spectrum for two-layer and deep networks with at least one layer with $\Omega(N)$ neurons, $N$ being the number of training samples. Furthermore, there is increasing evidence suggesting that deep networks with sub-linear layer widths are powerful memorizers and optimizers, as long as the number of parameters exceeds the number of samples. Thus, a natural open question is whether the NTK is well conditioned in such a challenging sub-linear setup. In this paper, we answer this question in the affirmative. Our key technical contribution is a lower bound on the smallest NTK eigenvalue for deep networks with the minimum possible over-parameterization: the number of parameters is roughly $\Omega(N)$ and, hence, the number of neurons is as few as $\Omega(\sqrt{N})$. To showcase the applicability of our NTK bounds, we provide two results concerning memorization capacity and optimization guarantees for gradient descent training.
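
The parameter counting behind "minimum over-parameterization" can be sketched in standard NTK notation. The symbols $f_\theta$, $J$, $K$, $p$, $L$ and $n$ below are assumptions introduced for illustration (a scalar-output network with $p$ parameters and $L$ hidden layers of equal width $n$); they are not taken from the abstract, and the paper's own definitions may differ in detail.

$$
J = \left[ \frac{\partial f_\theta(x_i)}{\partial \theta} \right]_{i=1}^{N} \in \mathbb{R}^{N \times p},
\qquad
K = J J^\top \in \mathbb{R}^{N \times N},
\qquad
\operatorname{rank}(K) \le \min(N, p).
$$

Under these assumptions, $\lambda_{\min}(K) > 0$ is only possible when $p \ge N$, which is why $\Omega(N)$ parameters is the least one can hope for. Each pair of consecutive hidden layers of width $n$ contributes on the order of $n^2$ weights, so at constant depth

$$
p \approx L\, n^2 = \Omega(N) \quad \Longleftrightarrow \quad n = \Omega(\sqrt{N}).
$$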

Comments:	Aligned with the published NeurIPS 2022 version
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2205.10217 [stat.ML]
	(or arXiv:2205.10217v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2205.10217

Submission history

From: Simone Bombari
[v1] Fri, 20 May 2022 14:50:24 UTC (937 KB)
[v2] Sun, 24 Jul 2022 17:45:52 UTC (87 KB)
[v3] Sun, 21 May 2023 07:02:36 UTC (87 KB)
