A Comparison of Metric Learning Loss Functions for End-To-End Speaker Verification

Coria, Juan M.; Bredin, Hervé; Ghannay, Sahar; Rosset, Sophie

arXiv:2003.14021 (cs)

[Submitted on 31 Mar 2020]

Abstract: Despite the growing popularity of metric learning approaches, very little work has attempted to perform a fair comparison of these techniques for speaker verification. We try to fill this gap and compare several metric learning loss functions in a systematic manner on the VoxCeleb dataset. The first family of loss functions is derived from the cross entropy loss (usually used for supervised classification) and includes the congenerous cosine loss, the additive angular margin loss, and the center loss. The second family of loss functions focuses on the similarity between training samples and includes the contrastive loss and the triplet loss. We show that the additive angular margin loss function outperforms all other loss functions in the study, while learning more robust representations. Based on a combination of SincNet trainable features and the x-vector architecture, the network used in this paper brings us a step closer to a really-end-to-end speaker verification system, when combined with the additive angular margin loss, while still being competitive with the x-vector baseline. In the spirit of reproducible research, we also release open source Python code for reproducing our results, and share pretrained PyTorch models on this http URL that can be used either directly or after fine-tuning.
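
To make the two families of loss functions named in the abstract more concrete, here is a minimal pure-Python sketch of one loss from each: an additive angular margin loss (cross-entropy family) and a triplet loss (similarity family). This is not the authors' released code. The function names, the margin and scale defaults, and the use of cosine distance in the triplet loss are all assumptions made for illustration, and the sketch operates on a single sample with plain lists rather than batched tensors.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors given as lists."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def aam_loss(embedding, class_weights, target, margin=0.2, scale=30.0):
    """Additive angular margin loss for one sample (illustrative sketch).

    class_weights holds one prototype vector per speaker. The margin is
    added to the angle between the embedding and the target prototype
    before the scaled softmax cross entropy is computed. The defaults
    are assumed values, not taken from the paper. The case where
    angle + margin exceeds pi is not handled here.
    """
    logits = []
    for j, w in enumerate(class_weights):
        cos_theta = max(-1.0, min(1.0, cosine(embedding, w)))
        theta = math.acos(cos_theta)
        if j == target:
            theta += margin  # penalize the target class angle
        logits.append(scale * math.cos(theta))
    # Numerically stable log-sum-exp, then cross entropy for the target.
    top = max(logits)
    log_sum = top + math.log(sum(math.exp(l - top) for l in logits))
    return log_sum - logits[target]


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Triplet loss with cosine distance (the distance choice is assumed).

    The loss is zero once the negative is farther from the anchor than
    the positive by at least the margin.
    """
    d_pos = 1.0 - cosine(anchor, positive)
    d_neg = 1.0 - cosine(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)
```

With the same embedding and prototypes, a larger margin yields a larger additive angular margin loss, which is what pushes embeddings of the same speaker closer to their prototype during training. The triplet loss, by contrast, needs no class prototypes and only compares samples with each other.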

Subjects:	Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
Cite as:	arXiv:2003.14021 [cs.LG]
	(or arXiv:2003.14021v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2003.14021

Submission history

From: Juan Manuel Coria
[v1] Tue, 31 Mar 2020 08:36:07 UTC (366 KB)
