Hello Edge: Keyword Spotting on Microcontrollers

Zhang, Yundong; Suda, Naveen; Lai, Liangzhen; Chandra, Vikas

Computer Science > Sound

arXiv:1711.07128v1 (cs)

[Submitted on 20 Nov 2017 (this version), latest version 14 Feb 2018 (v3)]

Title:Hello Edge: Keyword Spotting on Microcontrollers

Authors:Yundong Zhang, Naveen Suda, Liangzhen Lai, Vikas Chandra

View PDF

Abstract:Keyword spotting (KWS) is a critical component for enabling speech based user interactions on smart devices. It requires real-time response and high accuracy for good user experience. Recently, neural networks have become an attractive choice for KWS architecture because of their superior accuracy compared to traditional speech processing algorithms. Due to its always-on nature, KWS application has highly constrained power budget and typically runs on tiny microcontrollers with limited memory and compute capability. The design of neural network architecture for KWS must consider these constraints. In this work, we perform neural network architecture evaluation and exploration for running KWS on resource-constrained microcontrollers. We train various neural network architectures for keyword spotting published in literature to compare their accuracy and memory/compute requirements. We show that it is possible to optimize these neural network architectures to fit within the memory and compute constraints of microcontrollers without sacrificing accuracy. We further explore the depthwise separable convolutional neural network (DS-CNN) and compare it against other neural network architectures. DS-CNN achieves an accuracy of 95.4%, which is ~10% higher than the DNN model with similar number of parameters.

Subjects:	Sound (cs.SD); Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1711.07128 [cs.SD]
	(or arXiv:1711.07128v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.1711.07128

Submission history

From: Liangzhen Lai [view email]
[v1] Mon, 20 Nov 2017 03:19:03 UTC (1,786 KB)
[v2] Wed, 13 Dec 2017 23:54:52 UTC (1,786 KB)
[v3] Wed, 14 Feb 2018 19:24:55 UTC (842 KB)

Computer Science > Sound

Title:Hello Edge: Keyword Spotting on Microcontrollers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Hello Edge: Keyword Spotting on Microcontrollers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators