Extremely Low Footprint End-to-End ASR System for Smart Device

Gao, Zhifu; Yao, Yiwu; Zhang, Shiliang; Yang, Jun; Lei, Ming; McLoughlin, Ian

arXiv:2104.05784v1 (cs)

[Submitted on 6 Apr 2021 (this version), latest version 7 Jul 2021 (v5)]

Abstract: Recently, end-to-end (E2E) speech recognition has become popular, since it can integrate the acoustic, pronunciation and language models into a single neural network and can outperform conventional models. Among E2E approaches, attention-based models, e.g., Transformer, have emerged as superior. E2E models have opened the door to deploying ASR on smart devices; however, they still suffer from a large number of model parameters. This work proposes an extremely low footprint E2E ASR system for smart devices that aims to satisfy resource constraints without sacrificing recognition accuracy. We adopt cross-layer weight sharing to improve parameter efficiency. We further exploit model compression methods, including sparsification and quantization, to reduce memory storage and boost decoding efficiency on smart devices. We have evaluated our approach on the public AISHELL-1 and AISHELL-2 benchmarks. On the AISHELL-2 task, the proposed method achieves more than 10x compression (model size from 248MB to 24MB) while suffering only a small performance loss (CER from 6.49% to 6.92%).
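
The reported figures can be checked with a little arithmetic. The sketch below computes the compression ratio and CER change from the numbers in the abstract, and adds two hypothetical helpers (`stack_params`, `size_mb`) that illustrate, under simplifying assumptions, why cross-layer weight sharing, sparsification and quantization each shrink a model. The helpers are not the authors' implementation, and they ignore real-world overheads such as sparse index storage.

```python
# Figures reported in the abstract (AISHELL-2 task).
baseline_mb, compressed_mb = 248, 24
baseline_cer, compressed_cer = 6.49, 6.92

compression_ratio = baseline_mb / compressed_mb                # about 10.3x
absolute_cer_increase = compressed_cer - baseline_cer          # about 0.43 points
relative_cer_increase = absolute_cer_increase / baseline_cer   # about 6.6 percent


def stack_params(num_layers, params_per_layer, shared):
    """Parameter count of a stack of identical layers (hypothetical helper).

    With cross-layer weight sharing, one set of weights is reused by
    every layer, so the count no longer grows with depth.
    """
    return params_per_layer if shared else num_layers * params_per_layer


def size_mb(num_params, bits_per_weight, density=1.0):
    """Approximate storage in MB (hypothetical helper).

    Assumes only non-zero weights are stored (density = fraction kept
    after sparsification) and ignores sparse-index overhead.
    """
    return num_params * density * bits_per_weight / 8 / 1e6


print(f"compression: {compression_ratio:.1f}x")
print(f"CER increase: {absolute_cer_increase:.2f} absolute, "
      f"{relative_cer_increase:.1%} relative")
```

For example, under these assumptions a model with one million 32-bit weights occupies about 4 MB, while keeping half the weights at 8 bits each brings it to about 0.5 MB (`size_mb(1_000_000, 8, density=0.5)`).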

Comments:	5 pages, 2 figures, submitted to INTERSPEECH2021
Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2104.05784 [cs.SD]
	(or arXiv:2104.05784v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2104.05784

Submission history

From: Zhifu Gao
[v1] Tue, 6 Apr 2021 12:44:12 UTC (584 KB)
[v2] Mon, 26 Apr 2021 02:53:03 UTC (585 KB)
[v3] Mon, 7 Jun 2021 02:48:21 UTC (534 KB)
[v4] Tue, 6 Jul 2021 08:08:04 UTC (992 KB)
[v5] Wed, 7 Jul 2021 03:26:09 UTC (1,029 KB)
