Change your singer: a transfer learning generative adversarial framework for song to song conversion

Daher, Rema; Zein, Mohammad Kassem; Zini, Julia El; Awad, Mariette; Asmar, Daniel

Computer Science > Machine Learning

arXiv:1911.02933 (cs)

[Submitted on 7 Nov 2019 (v1), last revised 30 Jan 2020 (this version, v2)]

Title:Change your singer: a transfer learning generative adversarial framework for song to song conversion

Authors:Rema Daher, Mohammad Kassem Zein, Julia El Zini, Mariette Awad, Daniel Asmar

View PDF

Abstract:Have you ever wondered how a song might sound if performed by a different artist? In this work, we propose SCM-GAN, an end-to-end non-parallel song conversion system powered by generative adversarial and transfer learning that allows users to listen to a selected target singer singing any song. SCM-GAN first separates songs into vocals and instrumental music using a U-Net network, then converts the vocal segments to the target singer using advanced CycleGAN-VC, before merging the converted vocals with their corresponding background music. SCM-GAN is first initialized with feature representations learned from a state-of-the-art voice-to-voice conversion and then trained on a dataset of non-parallel songs. Furthermore, SCM-GAN is evaluated against a set of metrics including global variance GV and modulation spectra MS on the 24 Mel-cepstral coefficients (MCEPs). Transfer learning improves the GV by 35% and the MS by 13% on average. A subjective comparison is conducted to test the user satisfaction with the quality and the naturalness of the conversion. Results show above par similarity between SCM-GAN's output and the target (70\% on average) as well as great naturalness of the converted songs.

Subjects:	Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
Cite as:	arXiv:1911.02933 [cs.LG]
	(or arXiv:1911.02933v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1911.02933

Submission history

From: Mohammad Kassem Zein [view email]
[v1] Thu, 7 Nov 2019 14:32:43 UTC (2,698 KB)
[v2] Thu, 30 Jan 2020 19:03:39 UTC (8,070 KB)

Computer Science > Machine Learning

Title:Change your singer: a transfer learning generative adversarial framework for song to song conversion

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Change your singer: a transfer learning generative adversarial framework for song to song conversion

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators