
CS485/585

Deep Generative Networks


Bilkent University
Content
• Reconstruct an image
• Manipulate an image
• Find interpretable directions in the latent space
Image2StyleGAN
• The goal is to edit an existing photograph with StyleGAN.
• StyleGAN generates novel faces.
• First, we need to find the latent code that can generate an existing photograph.
Latent Space Embedding
• Learn an encoder.
– Fast.
– Limited ability to generalize beyond the training dataset.
• Select a random initial latent code and optimize it using gradient descent.
– Slow.
– Adapts to novel images.
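The optimization route can be sketched in a few lines. This is a minimal toy in which a fixed random linear map stands in for StyleGAN's pretrained generator (purely an assumption for illustration; the real generator is a deep convolutional network, but the loop has the same shape):

```python
import numpy as np

rng = np.random.default_rng(0)
# Toy stand-in for the pretrained generator: a fixed linear map
# from a 512-d latent code to "pixels".
G = rng.standard_normal((1024, 512)) / np.sqrt(512)

target = G @ rng.standard_normal(512)  # pretend this is the photograph

w = rng.standard_normal(512)  # random initial latent code
lr = 0.1
losses = []
for _ in range(200):
    residual = G @ w - target
    losses.append(float(residual @ residual))  # L2 reconstruction loss
    grad = 2.0 * G.T @ residual                # backprop through the generator
    w -= lr * grad                             # gradient-descent step on w

# losses[-1] should be far smaller than losses[0]
```

In practice the loss combines a per-pixel term with a perceptual term, and the optimizer is typically Adam rather than plain gradient descent.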
Image2StyleGAN

Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?, ICCV 2019
Image2StyleGAN
• Start with a random latent code and backpropagate the target loss.
Loss Functions
• The low-level similarity between two images is measured in pixel space with L1/L2 loss functions.
• Per-pixel losses do not capture perceptual differences between the output and target image.
– Two identical images offset from each other by one pixel yield a high per-pixel loss, despite their high perceptual similarity.
• Perceptual Loss

Perceptual Losses for Real-Time Style Transfer and Super-Resolution, J. Johnson, A. Alahi, L. Fei-Fei, ECCV 2016
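The one-pixel-offset failure case can be checked numerically. Here a random array stands in for an image (a hypothetical toy, but the metric behaves the same way):

```python
import numpy as np

rng = np.random.default_rng(0)
img = rng.random((64, 64))          # toy grayscale "image"
shifted = np.roll(img, 1, axis=1)   # identical content, 1-pixel offset

l2_same = np.mean((img - img) ** 2)       # zero, as expected
l2_shift = np.mean((img - shifted) ** 2)  # large, despite identical content
```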
Perceptual Loss
• Perceptual loss functions are based not on differences between pixels but on differences between high-level image feature representations extracted from pretrained convolutional neural networks.
• Key insight: convolutional neural networks pretrained for image classification have already learned to encode the perceptual and semantic information we would like to measure in our loss functions.

Perceptual Loss

Let φj(x) be the activations of the jth layer of the network φ when processing the image x; if j is a convolutional layer, then φj(x) is a feature map of shape Cj × Hj × Wj.
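As a sketch of the feature reconstruction loss, the toy φ below is a fixed bank of 3×3 convolutions standing in for a layer of the pretrained VGG-16 (an assumption for illustration only):

```python
import numpy as np

def phi(x, kernels):
    """Toy stand-in for phi_j: a bank of 3x3 convolutions producing a
    C_j x H_j x W_j feature map (the real phi is a pretrained VGG-16)."""
    C = len(kernels)
    H, W = x.shape[0] - 2, x.shape[1] - 2
    feats = np.empty((C, H, W))
    for c in range(C):
        for i in range(H):
            for j in range(W):
                feats[c, i, j] = np.sum(x[i:i + 3, j:j + 3] * kernels[c])
    return feats

rng = np.random.default_rng(0)
kernels = rng.standard_normal((4, 3, 3))
x = rng.random((16, 16))                      # target image
y = x + 0.01 * rng.standard_normal((16, 16))  # slightly perturbed output

fx, fy = phi(x, kernels), phi(y, kernels)
# Feature reconstruction loss: squared L2 distance between the two
# feature maps, normalized by C_j * H_j * W_j.
feat_loss = np.sum((fx - fy) ** 2) / fx.size
```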

Perceptual Loss

Optimization finds an image y′ that minimizes the feature reconstruction loss for several layers j of the pretrained VGG-16 loss network φ. When reconstructing from higher layers, image content and overall spatial structure are preserved, but color, texture, and exact shape are not.

Which Latent Space to Choose?

1) The initial latent space Z.
2) The intermediate latent space W.
3) The extended latent space W+: a concatenation of 18 different 512-dimensional w vectors, one for each layer of the StyleGAN architecture that can receive input via AdaIN.
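The three spaces differ in shape and in how much freedom the optimizer gets. A quick sketch of the dimensions (assuming the 18-layer, 1024×1024 StyleGAN):

```python
import numpy as np

rng = np.random.default_rng(0)

z = rng.standard_normal(512)   # 1) initial latent space Z
w = rng.standard_normal(512)   # 2) intermediate space W: one shared vector
                               #    produced by the mapping network
w_plus = np.tile(w, (18, 1))   # 3) extended space W+: one 512-d w per AdaIN
                               #    layer; initialized shared, but each row
                               #    can be optimized independently
```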

Image2StyleGAN - Morphing

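Morphing between two embedded faces reduces to linear interpolation of their latent codes, with each interpolated code decoded by the generator. A minimal sketch:

```python
import numpy as np

def morph(w_a, w_b, alpha):
    # Linear interpolation between two embedded codes; decoding each
    # interpolated code with the generator yields one morph frame.
    return (1.0 - alpha) * w_a + alpha * w_b

rng = np.random.default_rng(0)
w_a = rng.standard_normal((18, 512))  # W+ code of the first face
w_b = rng.standard_normal((18, 512))  # W+ code of the second face
frames = [morph(w_a, w_b, a) for a in np.linspace(0.0, 1.0, 8)]
```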
Image2StyleGAN – Expression Transfer

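Expression transfer can be sketched as a linear edit in W+: add the latent difference between an expressive and a neutral embedding of a source face to the target embedding (λ below is a hypothetical edit-strength parameter):

```python
import numpy as np

rng = np.random.default_rng(0)
w_target = rng.standard_normal((18, 512))   # embedded target face
w_neutral = rng.standard_normal((18, 512))  # embedded source, neutral
w_expr = rng.standard_normal((18, 512))     # embedded source, with expression

lam = 1.0  # edit strength (hypothetical value)
w_edit = w_target + lam * (w_expr - w_neutral)  # transfer the expression
```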
Image2StyleGAN – Style Transfer

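Style transfer in this setting can be sketched as a layer-wise crossover of the two embedded W+ codes: coarse layers keep the content code, fine layers take the style code (the split layer below is a hypothetical choice):

```python
import numpy as np

rng = np.random.default_rng(0)
w_content = rng.standard_normal((18, 512))  # embedded content image
w_style = rng.standard_normal((18, 512))    # embedded style image

split = 9  # hypothetical crossover layer
w_mix = np.vstack([w_content[:split], w_style[split:]])
```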
In-Domain GAN Inversion

In-Domain GAN Inversion for Real Image Editing, ECCV 2020



In-Domain GAN Inversion
• Inversion methods typically focus on reconstructing the target image by pixel values.
• Is that enough? What if the inverted code is not aligned with the semantic domain of the latent space?


In-Domain GAN Inversion
• Inversion methods typically focus on reconstructing the target image by pixel values, yet fail to land the inverted code in the semantic domain of the original latent space.
• As a result, the reconstructed image cannot well support semantic editing through varying the inverted code.
• To solve this problem, the in-domain GAN inversion approach not only faithfully reconstructs the input image but also ensures that the inverted code is semantically meaningful for editing.


In-Domain GAN Inversion
• First, train a domain-guided encoder.
• Then, refine the code by optimization.


In-Domain GAN Inversion
• It is hard to learn a perfect reverse mapping with an encoder alone, due to its limited representation capability.
• Therefore, even though the inverted code from the domain-guided encoder can reconstruct the input image well with the pretrained generator, and is itself semantically meaningful, we still need to refine the code so that it better fits the target image at the pixel level.


In-Domain GAN Inversion
• The domain-guided encoder provides a good starting point, which keeps the code from getting stuck in a poor local minimum.
• The encoder is also used as a regularizer during optimization, to preserve the latent code within the semantic domain of the generator.


In-Domain GAN Inversion

[29] Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? In: ICCV (2019)
[36] Generative Visual Manipulation on the Natural Image Manifold. In: ECCV (2016)

High-fidelity Image Inversion

High-Fidelity GAN Inversion for Image Attribute Editing, CVPR 2022




Next Class – finding directions

