Lecture 4: GAN (b)
• Associate Professor
• Electrical and Computer Engineering
• Newark College of Engineering
• New Jersey Institute of Technology
• https://tao-han-njit.netlify.app
Slides are based on Prof. Hung-yi Lee’s Machine Learning courses at National Taiwan University
$G^{*} = \arg\min_{G} \mathrm{Div}(P_G, P_{data})$, where $\mathrm{Div}(P_G, P_{data})$ is replaced by $\max_{D} V(D, G)$ in practice.
JS divergence (binary classifier) is not suitable
• In most cases, $P_G$ and $P_{data}$ do not overlap.
• 1. The nature of the data: both $P_{data}$ and $P_G$ are low-dimensional manifolds in a high-dimensional space, so their overlap is negligible.
• 2. Sampling: even if $P_{data}$ and $P_G$ do overlap, with too few samples the discriminator can still separate the two sets of samples perfectly.
What is the problem of JS divergence (binary classifier)?
JS divergence is always $\log 2$ whenever the two distributions do not overlap, so all non-overlapping generators look equally bad:
$JS(P_{G_0}, P_{data}) = \log 2, \quad JS(P_{G_1}, P_{data}) = \log 2, \quad \dots, \quad JS(P_{G_{100}}, P_{data}) = 0$
[Figure: standard GAN training loop. G maps input vectors to images; D is fixed while G is updated so that D outputs 1 on the generated images.]
What is the problem of JS divergence (binary classifier)?
[Figure: $P_{G_0}, P_{G_1}, \dots, P_{G_{100}}$ move progressively closer to $P_{data}$ (distances $d_0, d_1, \dots$), yet the JS divergence stays at $\log 2$ until the distributions overlap.]
Wasserstein distance
• Consider one distribution $P$ as a pile of earth and the other distribution $Q$ as the target.
• The Wasserstein distance is the average distance the earth mover has to move the earth.
• In the simplest case, if all the mass of $P$ sits a distance $d$ away from $Q$, then $W(P, Q) = d$.
Wasserstein distance
Different moving plans give different average distances (some smaller, some larger). The Wasserstein distance is defined by the moving plan with the smallest average distance.
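As a minimal illustration (not part of the original slides), the 1-D Wasserstein distance between two sets of samples can be computed with SciPy; the sample arrays below are made-up placeholders:

```python
# Minimal illustration: 1-D Wasserstein (earth mover's) distance between samples.
import numpy as np
from scipy.stats import wasserstein_distance

p_samples = np.random.normal(loc=0.0, scale=1.0, size=10_000)  # "pile of earth" P
q_samples = np.random.normal(loc=5.0, scale=1.0, size=10_000)  # target Q

# Average distance the earth has to be moved; here approximately 5.
print(wasserstein_distance(p_samples, q_samples))
```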
WGAN
Evaluate the Wasserstein distance between $P_{data}$ and $P_G$:
$W(P_{data}, P_G) = \max_{D \in \text{1-Lipschitz}} \left\{ E_{y \sim P_{data}}[D(y)] - E_{y \sim P_G}[D(y)] \right\}$
$D$ has to be smooth (1-Lipschitz). Without this constraint, the training of $D$ does not converge: $D$ simply pushes $D(y)$ toward $+\infty$ on real examples and $-\infty$ on generated ones. Keeping $D$ smooth prevents $D(y)$ from blowing up to $\pm\infty$.
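A minimal sketch of one critic (discriminator) update in the original weight-clipping WGAN, assuming PyTorch; `G`, `D`, `opt_D`, `z_dim`, and `real_batch` are placeholder names, not from the slides:

```python
import torch

def critic_step(D, G, real_batch, z_dim, opt_D, clip=0.01):
    z = torch.randn(real_batch.size(0), z_dim)
    fake_batch = G(z).detach()                       # fix G while updating D
    # Estimate E[D(real)] - E[D(fake)] and maximize it (minimize its negative).
    loss_D = -(D(real_batch).mean() - D(fake_batch).mean())
    opt_D.zero_grad()
    loss_D.backward()
    opt_D.step()
    # Crude 1-Lipschitz enforcement: clip every weight of D to [-clip, clip].
    for p in D.parameters():
        p.data.clamp_(-clip, clip)
    return -loss_D.item()                            # current Wasserstein estimate
```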
GAN is still challenging …
• The generator and the discriminator need to match each other: the generator produces fake images to fool the discriminator, while the discriminator learns to tell generated images from real ones. If either one stops improving, the other cannot improve.
Evaluation of Generation
Quality of Image
• Human evaluation is expensive (and sometimes unfair/unstable).
• How can we evaluate the quality of generated images automatically?
Feed a generated image $y$ into an off-the-shelf image classifier (e.g., Inception net, VGG) to obtain the class distribution $P(c|y)$. A concentrated distribution means higher visual quality.
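A minimal sketch of this idea, assuming torchvision's pretrained Inception-v3 as the off-the-shelf classifier; `images` is a placeholder batch of appropriately resized and normalized tensors. Low entropy of $P(c|y)$ means a more concentrated distribution, i.e., higher estimated quality:

```python
import torch
from torchvision.models import inception_v3, Inception_V3_Weights

classifier = inception_v3(weights=Inception_V3_Weights.DEFAULT).eval()

@torch.no_grad()
def class_entropy(images):
    probs = torch.softmax(classifier(images), dim=1)            # P(c|y) per image
    # Lower entropy = more concentrated P(c|y) = higher visual quality (by this proxy).
    return -(probs * probs.clamp_min(1e-12).log()).sum(dim=1)
```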
Diversity - Mode Collapse
The generated data collapse onto only a few modes of the real data distribution: individual samples look fine, but they all come from a small region of the data.
Diversity - Mode Dropping
The generated distribution covers only part of the real data distribution, and the part that is covered can shift from the generator at iteration t to the generator at iteration t+1 (example: BEGAN on CelebA).
Diversity
Feed every generated image $y^n$ into the CNN classifier to get $P(c|y^n)$, then average over all $N$ images:
$P(c) = \frac{1}{N} \sum_{n} P(c|y^n)$
A flat (close to uniform) averaged distribution $P(c)$ means large diversity; a peaked one means the generator keeps producing the same classes.
Inception Score (IS):
Good quality (each $P(c|y)$ is concentrated) and large diversity (the average $P(c)$ is flat) → large IS.
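For reference, a minimal sketch of the Inception Score computation on precomputed class probabilities (assumes NumPy; `probs` is a placeholder N x C array whose rows are $P(c|y^n)$ from the classifier):

```python
import numpy as np

def inception_score(probs, eps=1e-12):
    p_c = probs.mean(axis=0, keepdims=True)                       # P(c): average over images
    # IS = exp( E_y[ KL( P(c|y) || P(c) ) ] ): large when each P(c|y) is sharp and P(c) is flat.
    kl = (probs * (np.log(probs + eps) - np.log(p_c + eps))).sum(axis=1)
    return float(np.exp(kl.mean()))
```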
Fréchet Inception Distance (FID)
Feed real and generated images into a CNN and take the representation at the layer just before the softmax (blue points: generated images). Treat each set of representations as a Gaussian; FID is the Fréchet distance between the two Gaussians. Smaller is better.
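A minimal sketch of the FID formula on precomputed CNN features (assumes NumPy/SciPy; `real_feats` and `fake_feats` are placeholder N x d arrays taken from the layer before the softmax):

```python
import numpy as np
from scipy import linalg

def fid(real_feats, fake_feats):
    mu_r, mu_f = real_feats.mean(0), fake_feats.mean(0)
    cov_r = np.cov(real_feats, rowvar=False)
    cov_f = np.cov(fake_feats, rowvar=False)
    # Frechet distance between two Gaussians:
    # ||mu_r - mu_f||^2 + Tr(cov_r + cov_f - 2 (cov_r cov_f)^(1/2))
    covmean = linalg.sqrtm(cov_r @ cov_f).real       # drop tiny imaginary parts from numerics
    return float(((mu_r - mu_f) ** 2).sum() + np.trace(cov_r + cov_f - 2 * covmean))
```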
We also do not want the generator to just memorize: if the generated data are the same as the real data, or simply flipped real data, the quality and FID scores look great, yet nothing new has been generated. (https://arxiv.org/pdf/1511.01844.pdf)
Conditional Generation
Text-to-image
The condition $x$ is a text description (e.g., "red eyes", "black hair", "yellow hair"); the generator outputs an image $y$ that matches the description (e.g., "red hair, green eyes", "blue hair, red eyes").
Conditional GAN
The generator $G$ takes the condition $x$ (e.g., "red eyes") and a vector $z$ sampled from a normal distribution, and produces an image $y = G(x, z)$.
https://arxiv.org/abs/1605.05396
Conditional GAN (better discriminator)
The discriminator $D$ takes both the condition $x$ and the image $y$ and outputs a scalar that is high only when $y$ is realistic AND $x$ and $y$ are matched.
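A minimal sketch of the discriminator loss this implies, assuming PyTorch; `D(x, y)` returns a logit, and `real_y` / `fake_y` / `mismatched_y` (a real image paired with the wrong condition) are placeholder batches:

```python
import torch
import torch.nn.functional as F

def cgan_discriminator_loss(D, x, real_y, fake_y, mismatched_y):
    ones = torch.ones(x.size(0), 1)
    zeros = torch.zeros(x.size(0), 1)
    loss_real = F.binary_cross_entropy_with_logits(D(x, real_y), ones)            # matched, real
    loss_fake = F.binary_cross_entropy_with_logits(D(x, fake_y), zeros)           # generated image
    loss_mismatch = F.binary_cross_entropy_with_logits(D(x, mismatched_y), zeros) # real image, wrong x
    return loss_real + loss_fake + loss_mismatch
```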
Conditional GAN
$G$ takes the condition $x$ and noise $z$ and outputs $y = G(x, z)$.
Conditional GAN
$G$ produces an image from $x$ and $z$; $D$ outputs a scalar.
https://arxiv.org/abs/1808.04108
Conditional GAN
The condition $x$ can also be a sound, e.g., "a dog barking sound", and $G$ generates a matching image. Training data can be collected from videos, which provide paired audio and frames.
Conditional GAN
Talking Head Generation
https://arxiv.org/abs/1905.08233
Conditional GAN
Video-to-Video Synthesis
https://github.com/NVIDIA/vid2vid
Learning from Unpaired Data
Learning from Unpaired Data
We want a deep network that maps $x$ to $y$, but the training data are unpaired: we only have examples $x^1, x^3, x^5, x^7, x^9$ from the input domain and $y^2, y^4, y^6, y^8, y^{10}$ from the output domain, with no $(x, y)$ correspondences.
Learning from Unpaired Data: Image Style Transfer
The network maps an image in domain $\mathcal{X}$ to an image in domain $\mathcal{Y}$, trained only with unpaired collections of images from the two domains.
Cycle GAN
A generator $G_{\mathcal{X}\rightarrow\mathcal{Y}}$ takes an image from domain $\mathcal{X}$ and makes it similar to domain $\mathcal{Y}$; a discriminator $D_{\mathcal{Y}}$ outputs a scalar indicating whether its input image belongs to domain $\mathcal{Y}$ or not.
Cycle GAN
The problem: $G_{\mathcal{X}\rightarrow\mathcal{Y}}$ could simply ignore its input and output any image that looks like domain $\mathcal{Y}$. That fools $D_{\mathcal{Y}}$ but has nothing to do with the input image.
Cycle GAN: Cycle Consistency
A second generator $G_{\mathcal{Y}\rightarrow\mathcal{X}}$ maps the output back to domain $\mathcal{X}$, and the reconstruction should be as close as possible to the original input. If $G_{\mathcal{X}\rightarrow\mathcal{Y}}$ ignored its input, the intermediate image would lack the information needed for this reconstruction.
Cycle GAN
With cycle consistency, the output of $G_{\mathcal{X}\rightarrow\mathcal{Y}}$ must stay "related" to the input so that $G_{\mathcal{Y}\rightarrow\mathcal{X}}$ can reconstruct it, while $D_{\mathcal{Y}}$ still checks that the intermediate image belongs to domain $\mathcal{Y}$.
Cycle GAN (both directions)
Train the cycle in both directions at once: $G_{\mathcal{X}\rightarrow\mathcal{Y}}$ followed by $G_{\mathcal{Y}\rightarrow\mathcal{X}}$ with $D_{\mathcal{Y}}$, and $G_{\mathcal{Y}\rightarrow\mathcal{X}}$ followed by $G_{\mathcal{X}\rightarrow\mathcal{Y}}$ with $D_{\mathcal{X}}$ (a scalar indicating whether an image belongs to domain $\mathcal{X}$), each with its own cycle-consistency constraint.
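A minimal sketch of the combined generator objective, assuming PyTorch and a least-squares adversarial term; `G_xy`, `G_yx`, `D_x`, `D_y`, `real_x`, `real_y`, and the weight `lam` are placeholder names:

```python
import torch
import torch.nn.functional as F

def cyclegan_generator_loss(G_xy, G_yx, D_x, D_y, real_x, real_y, lam=10.0):
    fake_y = G_xy(real_x)                    # X -> Y
    fake_x = G_yx(real_y)                    # Y -> X
    # Adversarial terms: make each translated image look like the target domain.
    adv = ((D_y(fake_y) - 1) ** 2).mean() + ((D_x(fake_x) - 1) ** 2).mean()
    # Cycle-consistency terms: translating back should reconstruct the original.
    cyc = F.l1_loss(G_yx(fake_y), real_x) + F.l1_loss(G_xy(fake_x), real_y)
    return adv + lam * cyc
```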
Concluding Remarks
• Wasserstein distance and WGAN
• Evaluation of Generation
• Conditional Generation
• Learning from Unpaired Data