Data-Driven Compression of Convolutional Neural Networks

Pahwa, Ramit; Arivazhagan, Manoj Ghuhan; Garg, Ankur; Krishnamoorthy, Siddarth; Saxena, Rohit; Choudhary, Sunav

Computer Science > Machine Learning

arXiv:1911.12740 (cs)

[Submitted on 28 Nov 2019]

Title:Data-Driven Compression of Convolutional Neural Networks

Authors:Ramit Pahwa, Manoj Ghuhan Arivazhagan, Ankur Garg, Siddarth Krishnamoorthy, Rohit Saxena, Sunav Choudhary

View PDF

Abstract:Deploying trained convolutional neural networks (CNNs) to mobile devices is a challenging task because of the simultaneous requirements of the deployed model to be fast, lightweight and accurate. Designing and training a CNN architecture that does well on all three metrics is highly non-trivial and can be very time-consuming if done by hand. One way to solve this problem is to compress the trained CNN models before deploying to mobile devices. This work asks and answers three questions on compressing CNN models automatically: a) How to control the trade-off between speed, memory and accuracy during model compression? b) In practice, a deployed model may not see all classes and/or may not need to produce all class labels. Can this fact be used to improve the trade-off? c) How to scale the compression algorithm to execute within a reasonable amount of time for many deployments? The paper demonstrates that a model compression algorithm utilizing reinforcement learning with architecture search and knowledge distillation can answer these questions in the affirmative. Experimental results are provided for current state-of-the-art CNN model families for image feature extraction like VGG and ResNet with CIFAR datasets.

Comments:	17 pages, 10 tables, 1 figure
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1911.12740 [cs.LG]
	(or arXiv:1911.12740v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.12740

Submission history

From: Sunav Choudhary [view email]
[v1] Thu, 28 Nov 2019 15:03:59 UTC (236 KB)

Computer Science > Machine Learning

Title:Data-Driven Compression of Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Data-Driven Compression of Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators