Sparse Deep Neural Network Exact Solutions

Kepner, Jeremy; Gadepally, Vijay; Jananthan, Hayden; Milechin, Lauren; Samsi, Sid

doi:10.1109/HPEC.2018.8547742

Computer Science > Machine Learning

arXiv:1807.03165 (cs)

[Submitted on 6 Jul 2018]

Title:Sparse Deep Neural Network Exact Solutions

Authors:Jeremy Kepner, Vijay Gadepally, Hayden Jananthan, Lauren Milechin, Sid Samsi

View PDF

Abstract:Deep neural networks (DNNs) have emerged as key enablers of machine learning. Applying larger DNNs to more diverse applications is an important challenge. The computations performed during DNN training and inference are dominated by operations on the weight matrices describing the DNN. As DNNs incorporate more layers and more neurons per layers, these weight matrices may be required to be sparse because of memory limitations. Sparse DNNs are one possible approach, but the underlying theory is in the early stages of development and presents a number of challenges, including determining the accuracy of inference and selecting nonzero weights for training. Associative array algebra has been developed by the big data community to combine and extend database, matrix, and graph/network concepts for use in large, sparse data problems. Applying this mathematics to DNNs simplifies the formulation of DNN mathematics and reveals that DNNs are linear over oscillating semirings. This work uses associative array DNNs to construct exact solutions and corresponding perturbation models to the rectified linear unit (ReLU) DNN equations that can be used to construct test vectors for sparse DNN implementations over various precisions. These solutions can be used for DNN verification, theoretical explorations of DNN properties, and a starting point for the challenge of sparse training.

Comments:	8 pages, 10 figures, accepted to IEEE HPEC 2018. arXiv admin note: text overlap with arXiv:1708.02937
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Machine Learning (stat.ML)
Cite as:	arXiv:1807.03165 [cs.LG]
	(or arXiv:1807.03165v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1807.03165
Related DOI:	https://doi.org/10.1109/HPEC.2018.8547742

Submission history

From: Jeremy Kepner [view email]
[v1] Fri, 6 Jul 2018 00:47:12 UTC (1,886 KB)

Computer Science > Machine Learning

Title:Sparse Deep Neural Network Exact Solutions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sparse Deep Neural Network Exact Solutions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators