Adaptive Loss-aware Quantization for Multi-bit Networks

Qu, Zhongnan; Zhou, Zimu; Cheng, Yun; Thiele, Lothar

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.08883 (cs)

[Submitted on 18 Dec 2019 (v1), last revised 4 Jul 2020 (this version, v4)]

Title:Adaptive Loss-aware Quantization for Multi-bit Networks

Authors:Zhongnan Qu, Zimu Zhou, Yun Cheng, Lothar Thiele

View PDF

Abstract:We investigate the compression of deep neural networks by quantizing their weights and activations into multiple binary bases, known as multi-bit networks (MBNs), which accelerate the inference and reduce the storage for the deployment on low-resource mobile and embedded platforms. We propose Adaptive Loss-aware Quantization (ALQ), a new MBN quantization pipeline that is able to achieve an average bitwidth below one-bit without notable loss in inference accuracy. Unlike previous MBN quantization solutions that train a quantizer by minimizing the error to reconstruct full precision weights, ALQ directly minimizes the quantization-induced error on the loss function involving neither gradient approximation nor full precision maintenance. ALQ also exploits strategies including adaptive bitwidth, smooth bitwidth reduction, and iterative trained quantization to allow a smaller network size without loss in accuracy. Experiment results on popular image datasets show that ALQ outperforms state-of-the-art compressed networks in terms of both storage and accuracy. Code is available at this https URL

Comments:	To appear in CVPR 2020; Code available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1912.08883 [cs.CV]
	(or arXiv:1912.08883v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.08883

Submission history

From: Zhongnan Qu [view email]
[v1] Wed, 18 Dec 2019 20:48:29 UTC (60 KB)
[v2] Mon, 9 Mar 2020 17:11:11 UTC (74 KB)
[v3] Sat, 6 Jun 2020 22:31:11 UTC (74 KB)
[v4] Sat, 4 Jul 2020 20:24:41 UTC (74 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Adaptive Loss-aware Quantization for Multi-bit Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Adaptive Loss-aware Quantization for Multi-bit Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators