Separating Style and Content for Generalized Style Transfer

Zhang, Yexun; Zhang, Ya; Cai, Wenbin; Chang, Jie

Computer Science > Computer Vision and Pattern Recognition

arXiv:1711.06454 (cs)

[Submitted on 17 Nov 2017 (v1), last revised 23 Sep 2018 (this version, v6)]

Title:Separating Style and Content for Generalized Style Transfer

Authors:Yexun Zhang, Ya Zhang, Wenbin Cai, Jie Chang

View PDF

Abstract:Neural style transfer has drawn broad attention in recent years. However, most existing methods aim to explicitly model the transformation between different styles, and the learned model is thus not generalizable to new styles. We here attempt to separate the representations for styles and contents, and propose a generalized style transfer network consisting of style encoder, content encoder, mixer and decoder. The style encoder and content encoder are used to extract the style and content factors from the style reference images and content reference images, respectively. The mixer employs a bilinear model to integrate the above two factors and finally feeds it into a decoder to generate images with target style and content. To separate the style features and content features, we leverage the conditional dependence of styles and contents given an image. During training, the encoder network learns to extract styles and contents from two sets of reference images in limited size, one with shared style and the other with shared content. This learning framework allows simultaneous style transfer among multiple styles and can be deemed as a special `multi-task' learning scenario. The encoders are expected to capture the underlying features for different styles and contents which is generalizable to new styles and contents. For validation, we applied the proposed algorithm to the Chinese Typeface transfer problem. Extensive experiment results on character generation have demonstrated the effectiveness and robustness of our method.

Comments:	Accepted by CVPR2018
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1711.06454 [cs.CV]
	(or arXiv:1711.06454v6 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1711.06454

Submission history

From: Yexun Zhang [view email]
[v1] Fri, 17 Nov 2017 08:26:12 UTC (5,319 KB)
[v2] Sat, 17 Mar 2018 12:21:57 UTC (9,540 KB)
[v3] Thu, 22 Mar 2018 03:21:37 UTC (9,541 KB)
[v4] Tue, 27 Mar 2018 10:40:45 UTC (9,542 KB)
[v5] Fri, 30 Mar 2018 02:09:03 UTC (9,543 KB)
[v6] Sun, 23 Sep 2018 10:36:52 UTC (9,479 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Separating Style and Content for Generalized Style Transfer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Separating Style and Content for Generalized Style Transfer

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators