Training Multi-Task Adversarial Network for Extracting Noise-Robust Speaker Embedding

Zhou, Jianfeng; Jiang, Tao; Li, Lin; Hong, Qingyang; Wang, Zhe; Xia, Bingyin

arXiv:1811.09355 (cs)

[Submitted on 23 Nov 2018 (v1), last revised 12 May 2019 (this version, v2)]

Abstract: Achieving robust speaker recognition in noisy environments remains a challenging task. Motivated by the promising performance of multi-task training in a variety of image processing tasks, we explore the potential of multi-task adversarial training for learning a noise-robust speaker embedding. We present a novel framework consisting of three components: an encoder that extracts a noise-robust speaker embedding, a classifier that identifies the speaker, and a discriminator that predicts the noise type from the speaker embedding. In addition, we propose a training strategy that uses the training accuracy as an indicator to stabilize the multi-class adversarial optimization process. We conduct experiments on English and Mandarin corpora, and the results demonstrate that the proposed multi-task adversarial training method greatly outperforms methods without adversarial training in noisy environments. Furthermore, the experiments indicate that our method also improves speaker verification performance in the clean condition.
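The abstract names the three components and an accuracy-based training indicator but gives no formulas, so the following is only an illustrative sketch of one common way such objectives are combined, not the authors' implementation. It assumes the encoder minimizes the speaker classification loss while maximizing the discriminator's noise-type loss, and that the discriminator's training accuracy decides which component is updated next. The names `encoder_objective`, `choose_update`, `adv_weight`, and the thresholds `low` and `high` are hypothetical and do not come from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, target):
    """Negative log-probability assigned to the target class."""
    return -math.log(softmax(logits)[target])


def encoder_objective(speaker_logits, speaker_id, noise_logits, noise_id,
                      adv_weight=1.0):
    """Hypothetical encoder loss (assumption, not from the paper).

    Lower speaker loss is rewarded; lower noise-type loss is penalized,
    so the embedding is pushed to carry little noise-type information.
    """
    speaker_loss = cross_entropy(speaker_logits, speaker_id)
    noise_loss = cross_entropy(noise_logits, noise_id)
    return speaker_loss - adv_weight * noise_loss


def choose_update(disc_train_acc, low=0.4, high=0.8):
    """Hypothetical accuracy-gated schedule (thresholds are assumptions).

    A weak discriminator is trained alone, a strong one is frozen while
    the encoder catches up, and otherwise both are updated.
    """
    if disc_train_acc < low:
        return "discriminator"
    if disc_train_acc > high:
        return "encoder"
    return "both"
```

In this sketch, a call such as `choose_update(0.9)` would select the encoder for the next update, on the reasoning that the discriminator is already accurate enough to give a useful adversarial signal. The paper should be consulted for the actual losses and the actual rule tied to training accuracy.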

Comments:	Accepted by ICASSP 2019
Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1811.09355 [cs.SD]
	(or arXiv:1811.09355v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.1811.09355

Submission history

From: Jianfeng Zhou
[v1] Fri, 23 Nov 2018 04:08:15 UTC (69 KB)
[v2] Sun, 12 May 2019 07:26:57 UTC (137 KB)
