[PDF][PDF] Garbage modeling for on-device speech recognition.
User interactions with mobile devices increasingly depend on voice as a primary input
modality. Due to the disadvantages of sending audio across potentially spotty network
connections for speech recognition, in recent years there has been growing attention to
performing recognition on-device. The limited computational resources, however, typically
require additional model constraints. In this work, we explore the task of on-device utterance
verification, wherein the recognizer must transcribe an utterance if it is in a target set or reject …
modality. Due to the disadvantages of sending audio across potentially spotty network
connections for speech recognition, in recent years there has been growing attention to
performing recognition on-device. The limited computational resources, however, typically
require additional model constraints. In this work, we explore the task of on-device utterance
verification, wherein the recognizer must transcribe an utterance if it is in a target set or reject …
Garbage modeling for on-device speech recognition
User interactions with mobile devices increasingly depend on voice as a primary input
modality. Due to the disadvantages of sending audio across potentially spotty network
connections for speech recognition, in recent years there has been growing attention to
performing recognition on-device. The limited computational resources, however, typically
require additional model constraints. In this work, we explore the task of on-device utterance
verification, wherein the recognizer must transcribe an utterance if it is in a target set or reject …
modality. Due to the disadvantages of sending audio across potentially spotty network
connections for speech recognition, in recent years there has been growing attention to
performing recognition on-device. The limited computational resources, however, typically
require additional model constraints. In this work, we explore the task of on-device utterance
verification, wherein the recognizer must transcribe an utterance if it is in a target set or reject …
Showing the best results for this search. See all results