Positional Encoding as Spatial Inductive Bias in GANs

Xu, Rui; Wang, Xintao; Chen, Kai; Zhou, Bolei; Loy, Chen Change

Computer Science > Computer Vision and Pattern Recognition

arXiv:2012.05217 (cs)

[Submitted on 9 Dec 2020]

Title:Positional Encoding as Spatial Inductive Bias in GANs

Authors:Rui Xu, Xintao Wang, Kai Chen, Bolei Zhou, Chen Change Loy

View PDF

Abstract:SinGAN shows impressive capability in learning internal patch distribution despite its limited effective receptive field. We are interested in knowing how such a translation-invariant convolutional generator could capture the global structure with just a spatially i.i.d. input. In this work, taking SinGAN and StyleGAN2 as examples, we show that such capability, to a large extent, is brought by the implicit positional encoding when using zero padding in the generators. Such positional encoding is indispensable for generating images with high fidelity. The same phenomenon is observed in other generative architectures such as DCGAN and PGGAN. We further show that zero padding leads to an unbalanced spatial bias with a vague relation between locations. To offer a better spatial inductive bias, we investigate alternative positional encodings and analyze their effects. Based on a more flexible positional encoding explicitly, we propose a new multi-scale training strategy and demonstrate its effectiveness in the state-of-the-art unconditional generator StyleGAN2. Besides, the explicit spatial inductive bias substantially improve SinGAN for more versatile image manipulation.

Comments:	paper with appendix, project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2012.05217 [cs.CV]
	(or arXiv:2012.05217v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2012.05217

Submission history

From: Rui Xu [view email]
[v1] Wed, 9 Dec 2020 18:27:16 UTC (33,691 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2020-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Rui Xu
Xintao Wang
Kai Chen
Bolei Zhou
Chen Change Loy

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Positional Encoding as Spatial Inductive Bias in GANs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Positional Encoding as Spatial Inductive Bias in GANs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators