Parameter Compression of Recurrent Neural Networks and Degradation of Short-term Memory

Cox, Jonathan A.

arXiv:1612.00891 (cs)

[Submitted on 2 Dec 2016 (v1), last revised 24 Feb 2017 (this version, v2)]

Abstract: The significant computational costs of deploying neural networks in large-scale or resource-constrained environments, such as data centers and mobile devices, have spurred interest in model compression, which can reduce both arithmetic operations and storage memory. Several techniques have been proposed for reducing or compressing the parameters of feed-forward and convolutional neural networks, but less is understood about the effect of parameter compression on recurrent neural networks (RNNs). In particular, the extent to which the recurrent parameters can be compressed, and the impact of that compression on short-term memory performance, is not well understood. In this paper, we study the effect of complexity reduction, through singular value decomposition rank reduction, on RNN and minimal gated recurrent unit (MGRU) networks for several tasks. We show that considerable rank reduction is possible when compressing recurrent weights, even without fine-tuning. Furthermore, we propose a perturbation model for the effect of general perturbations, such as compression, on the recurrent parameters of RNNs. The model is tested against a noiseless memorization experiment that elucidates the short-term memory performance. In this way, we demonstrate that the effect of compressing recurrent parameters depends on the degree of temporal coherence present in the data and task. This work can guide on-the-fly RNN compression for novel environments or tasks, and provides insight for applying RNN compression in low-power devices, such as hearing aids.
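The rank reduction named in the abstract can be illustrated with a minimal sketch. This is not the paper's code: the plain tanh RNN, the random Gaussian weights (stand-ins for trained parameters), and the names `truncate_rank` and `final_state` are all assumptions made for illustration. It truncates the SVD of a recurrent weight matrix and measures how far the hidden state drifts when the compressed matrix is used without fine-tuning.

```python
import numpy as np

def truncate_rank(W, rank):
    """Best rank-`rank` approximation of W via truncated SVD, plus its two factors."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]   # (n, rank): left singular vectors scaled by singular values
    B = Vt[:rank, :]             # (rank, n): right singular vectors
    return A @ B, A, B

def final_state(W_rec, W_in, inputs):
    """Run a plain tanh RNN from a zero state and return the last hidden state."""
    h = np.zeros(W_rec.shape[0])
    for x in inputs:
        h = np.tanh(W_rec @ h + W_in @ x)
    return h

# Hypothetical sizes and random weights; a real study would use trained parameters.
rng = np.random.default_rng(0)
n, d, T, rank = 64, 8, 20, 16
W_rec = rng.normal(scale=1.0 / np.sqrt(n), size=(n, n))
W_in = rng.normal(scale=1.0 / np.sqrt(d), size=(n, d))
inputs = rng.normal(size=(T, d))

W_low, A, B = truncate_rank(W_rec, rank)

# Relative Frobenius error of the compressed recurrent matrix.
rel_err = np.linalg.norm(W_rec - W_low) / np.linalg.norm(W_rec)
# Distance between final hidden states with and without compression.
drift = np.linalg.norm(final_state(W_rec, W_in, inputs) - final_state(W_low, W_in, inputs))

print(f"recurrent parameters: {W_rec.size} -> {A.size + B.size}")
print(f"relative weight error: {rel_err:.3f}, hidden-state drift: {drift:.3f}")
```

Storing the factors `A` and `B` instead of the full matrix reduces the recurrent parameter count from n² to 2·n·rank, which only pays off when rank < n/2. A random matrix has a nearly flat singular value spectrum, so the error here is likely pessimistic compared with trained weights.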

Comments:	Accepted to IJCNN 2017. Final camera-ready paper.
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1612.00891 [cs.CV]
	(or arXiv:1612.00891v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1612.00891

Submission history

From: Jonathan Cox
[v1] Fri, 2 Dec 2016 23:11:10 UTC (1,940 KB)
[v2] Fri, 24 Feb 2017 18:22:30 UTC (1,954 KB)
