SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

Ren, Ao; Li, Ji; Li, Zhe; Ding, Caiwen; Qian, Xuehai; Qiu, Qinru; Yuan, Bo; Wang, Yanzhi

Computer Science > Computer Vision and Pattern Recognition

arXiv:1611.05939 (cs)

[Submitted on 18 Nov 2016 (v1), last revised 31 Jan 2017 (this version, v2)]

Title:SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

Authors:Ao Ren, Ji Li, Zhe Li, Caiwen Ding, Xuehai Qian, Qinru Qiu, Bo Yuan, Yanzhi Wang

View PDF

Abstract:With recent advancing of Internet of Things (IoTs), it becomes very attractive to implement the deep convolutional neural networks (DCNNs) onto embedded/portable systems. Presently, executing the software-based DCNNs requires high-performance server clusters in practice, restricting their widespread deployment on the mobile devices. To overcome this issue, considerable research efforts have been conducted in the context of developing highly-parallel and specific DCNN hardware, utilizing GPGPUs, FPGAs, and ASICs. Stochastic Computing (SC), which uses bit-stream to represent a number within [-1, 1] by counting the number of ones in the bit-stream, has a high potential for implementing DCNNs with high scalability and ultra-low hardware footprint. Since multiplications and additions can be calculated using AND gates and multiplexers in SC, significant reductions in power/energy and hardware footprint can be achieved compared to the conventional binary arithmetic implementations. The tremendous savings in power (energy) and hardware resources bring about immense design space for enhancing scalability and robustness for hardware DCNNs. This paper presents the first comprehensive design and optimization framework of SC-based DCNNs (SC-DCNNs). We first present the optimal designs of function blocks that perform the basic operations, i.e., inner product, pooling, and activation function. Then we propose the optimal design of four types of combinations of basic function blocks, named feature extraction blocks, which are in charge of extracting features from input feature maps. Besides, weight storage methods are investigated to reduce the area and power/energy consumption for storing weights. Finally, the whole SC-DCNN implementation is optimized, with feature extraction blocks carefully selected, to minimize area and power/energy consumption while maintaining a high network accuracy level.

Comments:	This paper is accepted by 22nd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2017
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1611.05939 [cs.CV]
	(or arXiv:1611.05939v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1611.05939

Submission history

From: Zhe Li [view email]
[v1] Fri, 18 Nov 2016 01:11:17 UTC (6,670 KB)
[v2] Tue, 31 Jan 2017 16:19:46 UTC (14,131 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators