Accelerating Convolutional Neural Networks via Activation Map Compression

Georgiadis, Georgios

Computer Science > Computer Vision and Pattern Recognition

arXiv:1812.04056 (cs)

[Submitted on 10 Dec 2018 (v1), last revised 27 Mar 2019 (this version, v2)]

Title:Accelerating Convolutional Neural Networks via Activation Map Compression

Authors:Georgios Georgiadis

View PDF

Abstract:The deep learning revolution brought us an extensive array of neural network architectures that achieve state-of-the-art performance in a wide variety of Computer Vision tasks including among others, classification, detection and segmentation. In parallel, we have also been observing an unprecedented demand in computational and memory requirements, rendering the efficient use of neural networks in low-powered devices virtually unattainable. Towards this end, we propose a three-stage compression and acceleration pipeline that sparsifies, quantizes and entropy encodes activation maps of Convolutional Neural Networks. Sparsification increases the representational power of activation maps leading to both acceleration of inference and higher model accuracy. Inception-V3 and MobileNet-V1 can be accelerated by as much as $1.6\times$ with an increase in accuracy of $0.38\%$ and $0.54\%$ on the ImageNet and CIFAR-10 datasets respectively. Quantizing and entropy coding the sparser activation maps lead to higher compression over the baseline, reducing the memory cost of the network execution. Inception-V3 and MobileNet-V1 activation maps, quantized to $16$ bits, are compressed by as much as $6\times$ with an increase in accuracy of $0.36\%$ and $0.55\%$ respectively.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1812.04056 [cs.CV]
	(or arXiv:1812.04056v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.04056

Submission history

From: Georgios Georgiadis [view email]
[v1] Mon, 10 Dec 2018 19:50:44 UTC (1,813 KB)
[v2] Wed, 27 Mar 2019 17:42:08 UTC (1,814 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Accelerating Convolutional Neural Networks via Activation Map Compression

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Accelerating Convolutional Neural Networks via Activation Map Compression

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators