Picture that Sketch: Photorealistic Image Generation from Abstract Sketches

Koley, Subhadeep; Bhunia, Ayan Kumar; Sain, Aneeshan; Chowdhury, Pinaki Nath; Xiang, Tao; Song, Yi-Zhe

arXiv:2303.11162 (cs)

[Submitted on 20 Mar 2023 (v1), last revised 30 Mar 2023 (this version, v2)]

Abstract: Given an abstract, deformed, ordinary sketch from untrained amateurs like you and me, this paper turns it into a photorealistic image - just like those shown in Fig. 1(a), all non-cherry-picked. We differ significantly from prior art in that we do not dictate an edgemap-like sketch to start with, but aim to work with abstract free-hand human sketches. In doing so, we essentially democratise the sketch-to-photo pipeline, "picturing" a sketch regardless of how well you sketch. Our contribution at the outset is a decoupled encoder-decoder training paradigm, where the decoder is a StyleGAN trained on photos only. Importantly, this ensures that generated results are always photorealistic. The rest is then centred on how best to deal with the abstraction gap between sketch and photo. For that, we propose an autoregressive sketch mapper, trained on sketch-photo pairs, that maps a sketch to the StyleGAN latent space. We further introduce specific designs to tackle the abstract nature of human sketches, including a fine-grained discriminative loss built on a trained sketch-photo retrieval model, and a partial-aware sketch augmentation strategy. Finally, we showcase a few downstream tasks that our generation model enables; among them, we show how fine-grained sketch-based image retrieval, a well-studied problem in the sketch community, can be reduced to an image (generated) to image retrieval task, surpassing the state of the art. We put forward generated results in the supplementary for everyone to scrutinise.
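
The abstract names a partial-aware sketch augmentation strategy but does not spell out how it works. As a rough illustration only, the sketch below assumes a sketch is stored as an ordered list of strokes and simulates a partially drawn sketch by keeping a random leading fraction of them. The function names (`partial_sketch`, `augment`), the `min_ratio` parameter, and the stroke representation are all hypothetical and are not taken from the paper.

```python
import math
import random
from typing import List, Tuple

# Assumed representation (hypothetical): a stroke is a list of (x, y) points,
# and a sketch is a list of strokes in drawing order.
Point = Tuple[float, float]
Stroke = List[Point]


def partial_sketch(strokes: List[Stroke], keep_ratio: float) -> List[Stroke]:
    """Return the leading fraction of strokes, keeping at least one if any exist."""
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    n_keep = max(1, math.ceil(len(strokes) * keep_ratio))
    return strokes[:n_keep]


def augment(strokes: List[Stroke], rng: random.Random,
            min_ratio: float = 0.3) -> List[Stroke]:
    """Sample a keep ratio in [min_ratio, 1] and return that partial sketch."""
    ratio = rng.uniform(min_ratio, 1.0)
    return partial_sketch(strokes, ratio)


if __name__ == "__main__":
    sketch = [
        [(0.0, 0.0), (1.0, 1.0)],
        [(1.0, 1.0), (2.0, 0.0)],
        [(2.0, 0.0), (3.0, 1.0)],
        [(3.0, 1.0), (4.0, 0.0)],
    ]
    partial = augment(sketch, random.Random(0))
    print(f"kept {len(partial)} of {len(sketch)} strokes")
```

Under these assumptions, each training sketch could be paired with the same target photo at several completion levels, which is one plausible way to expose a sketch mapper to incomplete input.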

Comments:	Accepted in CVPR 2023. Project page available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.11162 [cs.CV]
	(or arXiv:2303.11162v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.11162

Submission history

From: Subhadeep Koley
[v1] Mon, 20 Mar 2023 14:49:03 UTC (32,711 KB)
[v2] Thu, 30 Mar 2023 15:10:20 UTC (32,711 KB)
