CycleGAN: Learning to Translate Images (Without Paired Training Data)
by Sarah Wolf
Image-to-image translation is the task of transforming an image from one domain (e.g.,
images of zebras), to another (e.g., images of horses). Ideally, other features of the
image — anything not directly related to either domain, such as the background —
should stay recognizably the same. As we might imagine, a good image-to-image
translation system could have an almost unlimited number of applications. Changing art
styles, going from sketch to photo, or changing the season of the landscape in a photo
are just a few examples.
Examples of paired and unpaired data. *Image taken from the paper.
While there has been a great deal of research into this task, most of it has utilized
supervised training, where we have access to (x, y) pairs of corresponding images from
the two domains we want to learn to translate between. CycleGAN was introduced in the
now well-known 2017 paper out of Berkeley, Unpaired Image-to-Image Translation
using Cycle-Consistent Adversarial Networks. It was interesting because it did not
require paired training data — while an x and y set of images are still required, they do
not need to directly correspond to each other. In other words, if you wanted to translate
between sketches and photos, you still need to train on a bunch of sketches and a bunch
of photos, but the sketches would not need to be of the exact photos in your dataset.
Since paired data is harder to find in most domains, and not even possible in some, the
unsupervised training capabilities of CycleGAN are quite useful.
A CycleGAN consists of two generators and two discriminators. We call one generator G,
and have it convert images from the X domain to the Y domain. The other generator is
called F, and converts images from Y to X.
Both G and F are generators that take an image from one domain and translate it to another. G maps from X to
Y, whereas F goes in the opposite direction, mapping Y to X.
Each generator has a corresponding discriminator, which attempts to tell apart its
synthesized images from real ones.
One discriminator provides adversarial training for G, and the other does the same for F.
If you are familiar with GANs, the adversarial loss should come as no surprise. Both
generators are attempting to “fool” their corresponding discriminator into being less
able to distinguish their generated images from the real versions. We use the least
squares loss (found by Mao et al. to be more effective than the typical log-likelihood loss)
to capture this.
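As a concrete sketch of the least squares adversarial loss, here is a minimal PyTorch version (the function names are my own, not from the paper; d_real and d_fake are the discriminator's outputs on real and generated images):

```python
import torch

def lsgan_discriminator_loss(d_real, d_fake):
    # Least squares GAN loss for a discriminator: push scores on
    # real images toward 1 and scores on generated images toward 0.
    return torch.mean((d_real - 1) ** 2) + torch.mean(d_fake ** 2)

def lsgan_generator_loss(d_fake):
    # The generator tries to make the discriminator score its
    # outputs as real, i.e. close to 1.
    return torch.mean((d_fake - 1) ** 2)
```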
However, the adversarial loss alone is not sufficient to produce good images, as it leaves
the model under-constrained. It enforces that the generated output be of the appropriate
domain, but does not enforce that the input and output are recognizably the same. For
example, a generator that output an image y that was an excellent example of that
domain, but looked nothing like x, would do well by the standard of the adversarial loss,
despite not giving us what we really want.
The cycle consistency loss addresses this issue. It relies on the expectation that if you
convert an image to the other domain and back again, by successively feeding it through
both generators, you should get back something similar to what you put in. It enforces
that F(G(x)) ≈ x and G(F(y)) ≈ y.
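In code, this term is just an L1 distance between each image and its round-trip reconstruction. A minimal sketch, assuming real_x and real_y are batches of images and G and F are the two generators:

```python
import torch

def cycle_consistency_loss(G, F, real_x, real_y):
    # Round-trip each batch through both generators and penalize
    # the L1 distance to the original images.
    reconstructed_x = F(G(real_x))  # X -> Y -> X
    reconstructed_y = G(F(real_y))  # Y -> X -> Y
    return (torch.mean(torch.abs(reconstructed_x - real_x)) +
            torch.mean(torch.abs(reconstructed_y - real_y)))
```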
We can create the full objective function by putting these loss terms together, and
weighting the cycle consistency loss by a hyperparameter λ. The paper suggests setting λ = 10.
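Written out, the full objective from the paper takes the following form (with the least squares variant substituted into the two GAN terms in practice):

```latex
\mathcal{L}_{\text{cyc}}(G, F) =
    \mathbb{E}_{x}\big[\lVert F(G(x)) - x \rVert_1\big]
  + \mathbb{E}_{y}\big[\lVert G(F(y)) - y \rVert_1\big]

\mathcal{L}(G, F, D_X, D_Y) =
    \mathcal{L}_{\text{GAN}}(G, D_Y, X, Y)
  + \mathcal{L}_{\text{GAN}}(F, D_X, Y, X)
  + \lambda \, \mathcal{L}_{\text{cyc}}(G, F)
```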
Generator Architecture
Each CycleGAN generator has three sections: an encoder, a transformer, and a decoder.
The input image is fed directly into the encoder, which shrinks the representation size
while increasing the number of channels. The encoder is composed of three convolution
layers. The resulting activation is then passed to the transformer, a series of six residual
blocks. It is then expanded again by the decoder, which uses two transpose convolutions
to enlarge the representation size, and one output layer to produce the final image in
RGB.
You can see the details in the figure below. Please note that each layer is followed by an
instance normalization and a ReLU layer, but these have been omitted from the diagram for simplicity.
An architecture for a CycleGAN generator. As you can see above, the representation size shrinks in the
encoder phase, stays constant in the transformer phase, and expands again in the decoder phase. The
representation size that each layer outputs is listed below it, in terms of the input image size, k. On each layer
is listed the number of filters, the size of those filters, and the stride. Each layer is followed by an instance
normalization and ReLU activation.
Since the generators are fully convolutional, they can handle arbitrarily large inputs once trained.
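To make the structure concrete, here is a minimal PyTorch sketch of such a generator. The exact filter counts, paddings, and the Tanh output layer are assumptions based on common CycleGAN implementations rather than a verbatim copy of the figure; consult the paper's appendix for the official configuration:

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.InstanceNorm2d(channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.InstanceNorm2d(channels),
        )

    def forward(self, x):
        return x + self.block(x)  # skip connection around the block

class Generator(nn.Module):
    def __init__(self):
        super().__init__()

        def conv(in_c, out_c, kernel, stride, pad):
            return [nn.Conv2d(in_c, out_c, kernel, stride, pad),
                    nn.InstanceNorm2d(out_c),
                    nn.ReLU(inplace=True)]

        def upconv(in_c, out_c):
            return [nn.ConvTranspose2d(in_c, out_c, 3, stride=2,
                                       padding=1, output_padding=1),
                    nn.InstanceNorm2d(out_c),
                    nn.ReLU(inplace=True)]

        layers = []
        # Encoder: three convolutions shrink the representation
        # while increasing the number of channels.
        layers += conv(3, 64, 7, 1, 3)
        layers += conv(64, 128, 3, 2, 1)
        layers += conv(128, 256, 3, 2, 1)
        # Transformer: six residual blocks at constant size.
        layers += [ResidualBlock(256) for _ in range(6)]
        # Decoder: two transpose convolutions expand the
        # representation back to the input size.
        layers += upconv(256, 128)
        layers += upconv(128, 64)
        # Output layer produces the final 3-channel RGB image.
        layers += [nn.Conv2d(64, 3, 7, 1, 3), nn.Tanh()]
        self.model = nn.Sequential(*layers)

    def forward(self, x):
        return self.model(x)
```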
Discriminator Architecture
The discriminators are PatchGANs: fully convolutional neural networks that look at a
“patch” of the input image and output the probability of that patch being “real”. This is
more computationally efficient than looking at the entire input image, and it is
also more effective: it lets the discriminator focus on surface-level features
like texture, which are often exactly what changes in an image translation task.
If you’ve read about other image-to-image translation systems, you may already be
familiar with PatchGAN. By the time of the CycleGAN paper, a version of PatchGAN had
already been successfully used in paired image-to-image translation by Isola et al. in
Image-to-Image Translation with Conditional Adversarial Nets.
An example architecture for a PatchGAN discriminator. The PatchGAN is a fully convolutional network that takes
in an image and produces a matrix of probabilities, each referring to the probability of the corresponding
“patch” of the image being “real” (as opposed to generated). The representation size that each layer outputs is
listed below it, in terms of the input image size, k. On each layer is listed the number of filters, the size of those
filters, and the stride.
As you can see in the example architecture above, the PatchGAN halves the
representation size and doubles the number of channels until the desired output size is
reached. In this case, it was most effective to have the PatchGAN evaluate 70x70
patches of the input.
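Here is a minimal PyTorch sketch of a 70x70 PatchGAN along these lines. The 4x4 kernels and the LeakyReLU activations follow common CycleGAN implementations rather than the figure above, so treat them as assumptions:

```python
import torch.nn as nn

class PatchDiscriminator(nn.Module):
    def __init__(self):
        super().__init__()

        def block(in_c, out_c, stride, norm=True):
            # Each stride-2 block halves the spatial size and
            # doubles the channel count.
            layers = [nn.Conv2d(in_c, out_c, kernel_size=4,
                                stride=stride, padding=1)]
            if norm:
                layers.append(nn.InstanceNorm2d(out_c))
            layers.append(nn.LeakyReLU(0.2, inplace=True))
            return layers

        self.model = nn.Sequential(
            *block(3, 64, stride=2, norm=False),
            *block(64, 128, stride=2),
            *block(128, 256, stride=2),
            *block(256, 512, stride=1),
            # Final convolution outputs one score per patch rather
            # than a single scalar for the whole image.
            nn.Conv2d(512, 1, kernel_size=4, stride=1, padding=1),
        )

    def forward(self, x):
        return self.model(x)  # grid of per-patch realness scores
```

Each value in the output grid is trained against the least squares loss from earlier, so the discriminator judges many overlapping patches of the image at once.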
The training approach was fairly typical for an image-to-image translation task. The
Adam optimizer, a common variant of gradient descent, was used to make training more
stable and efficient. The learning rate was set to 0.0002 for the first half of training, and
then linearly reduced to zero over the remaining iterations. The batch size was set to 1,
which is why we refer to instance normalization, rather than batch normalization, in the
architecture diagrams above.
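Putting those details together, a hypothetical training setup might look like the following sketch. It reuses the Generator and PatchDiscriminator classes sketched above; the epoch count and Adam betas are assumptions borrowed from common CycleGAN implementations, not stated in this post:

```python
import itertools
import torch

G, F = Generator(), Generator()                    # X -> Y and Y -> X
D_X, D_Y = PatchDiscriminator(), PatchDiscriminator()

total_epochs = 200  # assumed; not stated in this post

# One optimizer for both generators, one for both discriminators.
g_optimizer = torch.optim.Adam(
    itertools.chain(G.parameters(), F.parameters()),
    lr=2e-4, betas=(0.5, 0.999))
d_optimizer = torch.optim.Adam(
    itertools.chain(D_X.parameters(), D_Y.parameters()),
    lr=2e-4, betas=(0.5, 0.999))

def lr_lambda(epoch):
    # Hold the learning rate constant for the first half of
    # training, then decay it linearly toward zero.
    half = total_epochs // 2
    return 1.0 if epoch < half else 1.0 - (epoch - half) / half

g_scheduler = torch.optim.lr_scheduler.LambdaLR(g_optimizer, lr_lambda)
d_scheduler = torch.optim.lr_scheduler.LambdaLR(d_optimizer, lr_lambda)
```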
CycleGAN can be used for collection style transfer, where the entire works of an artist are used to train the
model. *Image taken from the paper.
A very unimpressive attempt at a cat-to-dog image translation. Don’t try to use a CycleGAN for this. *Image
taken from the paper.
Translations on the training data often look substantially better than those done on test
data.
Conclusion
Thanks for reading! I hope this was a useful overview. If you would like to see more
implementation details, there are some great public implementations out there you can
refer to. Please leave a comment if you have questions, corrections, or suggestions for
improving this post.