Improving End-to-End Speech Recognition with Policy Learning.

AllVideos Books News Images Maps Shopping

Improving End-to-End Speech Recognition with Policy Learning

Dec 19, 2017 · We show that joint training improves relative performance by 4% to 13% for our end-to-end model as compared to the same model learned through ...

Scholarly articles for Improving End-to-End Speech Recognition with Policy Learning.

scholar.google.com › citations

Improving end-to-end speech recognition with policy …
Zhou · Cited by 49

Improving End-to-End Speech Recognition with Policy Learning

ieeexplore.ieee.org › iel7

ABSTRACT. Connectionist temporal classification (CTC) is widely used for maximum likelihood learning in end-to-end speech recog- nition models.

WO/2019/084228 IMPROVING END-TO-END SPEECH ...

patentscope.wipo.int › search › detail

A deep end-to-end speech recognition model is provided. Multi-objective learning criteria are used to train the model on training data comprising speech ...

Improving End-to-End Speech Recognition with Policy Learning

www.semanticscholar.org › paper › Impr...

It is shown that joint training improves relative performance by 4% to 13% for the end-to-end model as compared to the same model learned through maximum ...

Improving End-to-End Speech Recognition with Policy Learning ...

dl.acm.org › abs › ICASSP.2018.8462361

We show that joint training improves relative performance by 4% to 13% for our end-to-end model as compared to the same model learned through maximum likelihood ...

Improving end-to-end Speech Recognition Models - Salesforce

www.salesforce.com › blog › improving-...

Dec 14, 2017 · We show that the performance of the end-to-end speech models can be improved significantly by performing proper regularization and adjustment to the training ...

Improving End-to-End Speech Recognition with Policy Learning

www.researchgate.net › publication › 32...

We propose keeping track of the decisions that the system has made, and using them to constrain the system's future behavior in the dialogue. In this way, we ...

Improving End-to-End Speech Processing by Efficient Text Data ...

arxiv.org › cs

Oct 9, 2023 · We propose Latent Synthesis (LaSyn), an efficient textual data utilization framework for E2E speech processing models.

Awesome End-to-End Speech Recognition - GitHub

github.com › charlesliucn › awesome-en...

A list of End-to-End speech recognition, including papers, codes and other materials - charlesliucn/awesome-end2end-speech-recognition.

Improved training strategies for end-to-end speech recognition in ...

www.amazon.science › publications › im...

Training endto-end (E2E) speech recognition models without careful attention to such data results in sub-optimal performance as models prioritize learning wake- ...