BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis

Yi, Zili; Chen, Zhiqin; Cai, Hao; Mao, Wendong; Gong, Minglun; Zhang, Hao

Computer Science > Computer Vision and Pattern Recognition

arXiv:1803.08467 (cs)

[Submitted on 22 Mar 2018 (v1), last revised 4 Aug 2020 (this version, v5)]

Title:BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis

Authors:Zili Yi, Zhiqin Chen, Hao Cai, Wendong Mao, Minglun Gong, Hao Zhang

View PDF

Abstract:We introduce BSD-GAN, a novel multi-branch and scale-disentangled training method which enables unconditional Generative Adversarial Networks (GANs) to learn image representations at multiple scales, benefiting a wide range of generation and editing tasks. The key feature of BSD-GAN is that it is trained in multiple branches, progressively covering both the breadth and depth of the network, as resolutions of the training images increase to reveal finer-scale features. Specifically, each noise vector, as input to the generator network of BSD-GAN, is deliberately split into several sub-vectors, each corresponding to, and is trained to learn, image representations at a particular scale. During training, we progressively "de-freeze" the sub-vectors, one at a time, as a new set of higher-resolution images is employed for training and more network layers are added. A consequence of such an explicit sub-vector designation is that we can directly manipulate and even combine latent (sub-vector) codes which model different feature this http URL experiments demonstrate the effectiveness of our training method in scale-disentangled learning of image representations and synthesis of novel image contents, without any extra labels and without compromising quality of the synthesized high-resolution images. We further demonstrate several image generation and manipulation applications enabled or improved by BSD-GAN. Source codes are available at this https URL.

Comments:	12 pages, 20 figures, accepted to IEEE Transaction on Image Processing
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1803.08467 [cs.CV]
	(or arXiv:1803.08467v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1803.08467

Submission history

From: Zili Yi [view email]
[v1] Thu, 22 Mar 2018 17:07:32 UTC (12,702 KB)
[v2] Wed, 28 Nov 2018 20:14:26 UTC (22,382 KB)
[v3] Tue, 21 Jul 2020 01:31:51 UTC (55,696 KB)
[v4] Sat, 25 Jul 2020 07:04:23 UTC (55,696 KB)
[v5] Tue, 4 Aug 2020 02:17:13 UTC (17,745 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators