Learning Versatile Neural Architectures by Propagating Network Codes

Ding, Mingyu; Huo, Yuqi; Lu, Haoyu; Yang, Linjie; Wang, Zhe; Lu, Zhiwu; Wang, Jingdong; Luo, Ping

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.13253 (cs)

[Submitted on 24 Mar 2021 (v1), last revised 17 Feb 2022 (this version, v2)]

Title:Learning Versatile Neural Architectures by Propagating Network Codes

Authors:Mingyu Ding, Yuqi Huo, Haoyu Lu, Linjie Yang, Zhe Wang, Zhiwu Lu, Jingdong Wang, Ping Luo

View PDF

Abstract:This work explores how to design a single neural network capable of adapting to multiple heterogeneous vision tasks, such as image segmentation, 3D detection, and video recognition. This goal is challenging because both network architecture search (NAS) spaces and methods in different tasks are inconsistent. We solve this challenge from both sides. We first introduce a unified design space for multiple tasks and build a multitask NAS benchmark (NAS-Bench-MR) on many widely used datasets, including ImageNet, Cityscapes, KITTI, and HMDB51. We further propose Network Coding Propagation (NCP), which back-propagates gradients of neural predictors to directly update architecture codes along the desired gradient directions to solve various tasks. In this way, optimal architecture configurations can be found by NCP in our large search space in seconds.
Unlike prior arts of NAS that typically focus on a single task, NCP has several unique benefits. (1) NCP transforms architecture optimization from data-driven to architecture-driven, enabling joint search an architecture among multitasks with different data distributions. (2) NCP learns from network codes but not original data, enabling it to update the architecture efficiently across datasets. (3) In addition to our NAS-Bench-MR, NCP performs well on other NAS benchmarks, such as NAS-Bench-201. (4) Thorough studies of NCP on inter-, cross-, and intra-tasks highlight the importance of cross-task neural architecture design, i.e., multitask neural architectures and architecture transferring between different tasks. Code is available at this https URL.

Comments:	ICLR 2022. Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2103.13253 [cs.CV]
	(or arXiv:2103.13253v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.13253

Submission history

From: Mingyu Ding [view email]
[v1] Wed, 24 Mar 2021 15:20:38 UTC (2,645 KB)
[v2] Thu, 17 Feb 2022 15:16:17 UTC (2,393 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Versatile Neural Architectures by Propagating Network Codes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning Versatile Neural Architectures by Propagating Network Codes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators