Learning Sparse Low-Precision Neural Networks With Learnable Regularization

Choi, Yoojin; El-Khamy, Mostafa; Lee, Jungwon

doi:10.1109/ACCESS.2020.2996936

arXiv:1809.00095 (cs)

[Submitted on 1 Sep 2018 (v1), last revised 24 May 2020 (this version, v2)]

Abstract: We consider learning deep neural networks (DNNs) that consist of low-precision weights and activations for efficient inference with fixed-point operations. In training low-precision networks, gradient descent in the backward pass is performed with high-precision weights, while quantized low-precision weights and activations are used in the forward pass to calculate the loss function for training. Thus, the gradient descent becomes suboptimal, and accuracy loss follows. In order to reduce the mismatch between the forward and backward passes, we utilize mean squared quantization error (MSQE) regularization. In particular, we propose using a learnable regularization coefficient with the MSQE regularizer to reinforce the convergence of high-precision weights to their quantized values. We also investigate how partial L2 regularization can be employed for weight pruning in a similar manner. Finally, combining weight pruning, quantization, and entropy coding, we establish a low-precision DNN compression pipeline. In our experiments, the proposed method yields low-precision MobileNet and ShuffleNet models on ImageNet classification with state-of-the-art compression ratios of 7.13 and 6.79, respectively. Moreover, we examine our method on image super-resolution networks to produce 8-bit low-precision models with negligible performance loss.
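
The MSQE regularizer with a learnable coefficient described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the uniform symmetric quantizer, the parameterization of the coefficient as exp(log_lam), the penalty term -alpha * log(lam) that keeps the coefficient from collapsing to zero, and all function names are assumptions made for illustration only.

```python
import numpy as np

def quantize(w, step, bits):
    """Uniform symmetric quantizer (assumed): round to the nearest multiple
    of `step`, then clip to the signed `bits`-bit integer range."""
    qmin, qmax = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    return step * np.clip(np.round(w / step), qmin, qmax)

def msqe(w, step, bits):
    """Mean squared quantization error between weights and their quantized values."""
    return float(np.mean((w - quantize(w, step, bits)) ** 2))

def regularized_step(w, log_lam, task_grad, step, bits, lr=0.1, alpha=1.0):
    """One gradient-descent step on the hypothetical objective
        task_loss + lam * MSQE(w) - alpha * log(lam),  with lam = exp(log_lam).
    The quantized values are treated as constants when differentiating."""
    lam = np.exp(log_lam)
    q = quantize(w, step, bits)
    grad_w = task_grad + lam * 2.0 * (w - q) / w.size       # d/dw of lam * MSQE
    grad_log_lam = lam * np.mean((w - q) ** 2) - alpha      # d/d(log_lam)
    return w - lr * grad_w, log_lam - lr * grad_log_lam

# Toy run with a zero task gradient, so only the regularizer acts on the weights.
w = np.array([0.1, -0.3, 0.6, 0.9])
log_lam = 0.0
before = msqe(w, 0.25, 4)
for _ in range(10):
    w, log_lam = regularized_step(w, log_lam, np.zeros_like(w), 0.25, 4)
after = msqe(w, 0.25, 4)
print(before, after, np.exp(log_lam))
```

In this toy run the quantization error shrinks while the coefficient grows, which mirrors the idea of pulling high-precision weights toward their quantized values with increasing strength. In real training the task gradient would come from the loss computed with quantized weights and activations, and the learning rate would need to stay small enough for the growing coefficient not to destabilize the weight update.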

Comments:	IEEE Access
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1809.00095 [cs.CV]
	(or arXiv:1809.00095v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1809.00095
Related DOI:	https://doi.org/10.1109/ACCESS.2020.2996936

Submission history

From: Yoojin Choi
[v1] Sat, 1 Sep 2018 01:28:21 UTC (741 KB)
[v2] Sun, 24 May 2020 00:41:54 UTC (1,168 KB)
