Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics

Jiang, Chunheng; Pedapati, Tejaswini; Chen, Pin-Yu; Sun, Yizhou; Gao, Jianxi

Computer Science > Machine Learning

arXiv:2201.04194 (cs)

[Submitted on 11 Jan 2022 (v1), last revised 14 Jan 2022 (this version, v2)]

Title:Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics

Authors:Chunheng Jiang, Tejaswini Pedapati, Pin-Yu Chen, Yizhou Sun, Jianxi Gao

View PDF

Abstract:Efficient model selection for identifying a suitable pre-trained neural network to a downstream task is a fundamental yet challenging task in deep learning. Current practice requires expensive computational costs in model training for performance prediction. In this paper, we propose a novel framework for neural network selection by analyzing the governing dynamics over synaptic connections (edges) during training. Our framework is built on the fact that back-propagation during neural network training is equivalent to the dynamical evolution of synaptic connections. Therefore, a converged neural network is associated with an equilibrium state of a networked system composed of those edges. To this end, we construct a network mapping $\phi$, converting a neural network $G_A$ to a directed line graph $G_B$ that is defined on those edges in $G_A$. Next, we derive a neural capacitance metric $\beta_{\rm eff}$ as a predictive measure universally capturing the generalization capability of $G_A$ on the downstream task using only a handful of early training results. We carried out extensive experiments using 17 popular pre-trained ImageNet models and five benchmark datasets, including CIFAR10, CIFAR100, SVHN, Fashion MNIST and Birds, to evaluate the fine-tuning performance of our framework. Our neural capacitance metric is shown to be a powerful indicator for model selection based only on early training results and is more efficient than state-of-the-art methods.

Comments:	19 pages, 7 figures, neural architecture search, mean-field
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2201.04194 [cs.LG]
	(or arXiv:2201.04194v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2201.04194

Submission history

From: Chunheng Jiang [view email]
[v1] Tue, 11 Jan 2022 20:53:15 UTC (22,712 KB)
[v2] Fri, 14 Jan 2022 21:18:24 UTC (22,713 KB)

Computer Science > Machine Learning

Title:Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators