Optimal Condition Training for Target Source Separation

Tzinis, Efthymios; Wichern, Gordon; Smaragdis, Paris; Roux, Jonathan Le

Computer Science > Sound

arXiv:2211.05927 (cs)

[Submitted on 11 Nov 2022]

Title:Optimal Condition Training for Target Source Separation

Authors:Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux

View PDF

Abstract:Recent research has shown remarkable performance in leveraging multiple extraneous conditional and non-mutually exclusive semantic concepts for sound source separation, allowing the flexibility to extract a given target source based on multiple different queries. In this work, we propose a new optimal condition training (OCT) method for single-channel target source separation, based on greedy parameter updates using the highest performing condition among equivalent conditions associated with a given target source. Our experiments show that the complementary information carried by the diverse semantic concepts significantly helps to disentangle and isolate sources of interest much more efficiently compared to single-conditioned models. Moreover, we propose a variation of OCT with condition refinement, in which an initial conditional vector is adapted to the given mixture and transformed to a more amenable representation for target source extraction. We showcase the effectiveness of OCT on diverse source separation experiments where it improves upon permutation invariant models with oracle assignment and obtains state-of-the-art performance in the more challenging task of text-based source separation, outperforming even dedicated text-only conditioned models.

Comments:	Submitted to ICASSP 2023
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2211.05927 [cs.SD]
	(or arXiv:2211.05927v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2211.05927

Submission history

From: Efthymios Tzinis [view email]
[v1] Fri, 11 Nov 2022 00:04:55 UTC (935 KB)

Computer Science > Sound

Title:Optimal Condition Training for Target Source Separation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Optimal Condition Training for Target Source Separation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators