Conditioning Trick for Training Stable GANs

Esmaeilpour, Mohammad; Sallo, Raymel Alfonso; St-Georges, Olivier; Cardinal, Patrick; Koerich, Alessandro Lameiras

Computer Science > Sound

arXiv:2010.05844 (cs)

[Submitted on 12 Oct 2020]

Title:Conditioning Trick for Training Stable GANs

Authors:Mohammad Esmaeilpour, Raymel Alfonso Sallo, Olivier St-Georges, Patrick Cardinal, Alessandro Lameiras Koerich

View PDF

Abstract:In this paper we propose a conditioning trick, called difference departure from normality, applied on the generator network in response to instability issues during GAN training. We force the generator to get closer to the departure from normality function of real samples computed in the spectral domain of Schur decomposition. This binding makes the generator amenable to truncation and does not limit exploring all the possible modes. We slightly modify the BigGAN architecture incorporating residual network for synthesizing 2D representations of audio signals which enables reconstructing high quality sounds with some preserved phase information. Additionally, the proposed conditional training scenario makes a trade-off between fidelity and variety for the generated spectrograms. The experimental results on UrbanSound8k and ESC-50 environmental sound datasets and the Mozilla common voice dataset have shown that the proposed GAN configuration with the conditioning trick remarkably outperforms baseline architectures, according to three objective metrics: inception score, Frechet inception distance, and signal-to-noise ratio.

Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2010.05844 [cs.SD]
	(or arXiv:2010.05844v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2010.05844

Submission history

From: Alessandro Lameiras Koerich [view email]
[v1] Mon, 12 Oct 2020 16:50:22 UTC (15,587 KB)

Computer Science > Sound

Title:Conditioning Trick for Training Stable GANs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Conditioning Trick for Training Stable GANs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators