100% found this document useful (1 vote)

282 views34 pages

GANppt

The document discusses recent advances in generative adversarial networks (GANs) for computer vision applications. It provides an overview of GANs, including how they work by pitting a generator network against a discriminator network. It describes different types of GANs such as DCGANs, CGANs, CycleGANs, and SeqGANs. DCGANs apply convolutional networks to GANs for image generation. CGANs add conditional information to GANs. CycleGANs perform image-to-image translation without paired examples. SeqGANs use reinforcement learning to generate discrete sequential data like text.

Uploaded by

Sreejith PB

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

282 views34 pages

GANppt

Uploaded by

Sreejith PB

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 34

Recent Advances of Generative Adversarial

Networks in Computer Vision

SREEJITH PB (PKD16IT053)
Guided By
Sibily Joseph and Joby NJ
Asst. Professors
Department of Computer Science and Engineering

GOVERNMENT ENGINNERING COLLEGE, SREEKRISHNAPURAM

September 2019

GEC SREEKRISHNAPURAM GAN 1 / 34

CONTENTS

• Introduction
• System Overview
• Types of GAN
• Applications
• Conclusion
• References

GEC SREEKRISHNAPURAM GAN 2 / 34

Introduction

GEC SREEKRISHNAPURAM GAN 3 / 34

Introduction

• Generative Adversarial Network (GAN), a generative approach

proposed by Goodfellow in 2014 has become one of the most
discussed topics in machine learning
• Generative Adversarial Network can
• Generate high quality images
• Generate high quality audios and videos
• Generate images from text
• Convert images from one domain to another(Image translation)
• etc.
• Different types of GANs are available now for various
application.

GEC SREEKRISHNAPURAM GAN 4 / 34

Introduction

GEC SREEKRISHNAPURAM GAN 5 / 34

System Overview
• Generative adversarial networks (GANs) are deep neural net
architectures comprised of two networks Generator(D) and
Discriminator, pitting one against the other (thus the
adversarial)
• Working of GAN

.
GEC SREEKRISHNAPURAM GAN 6 / 34
System Overview

• The Generator takes in random noise and returns an image.

• This generated image is fed into the Discriminator alongside a
stream of images taken from the actual data set.
• The Discriminator takes in both real and fake images and
returns probabilities, a number between 0 and 1, with 1
representing a prediction of authenticity and 0 representing
fake
• The entities/adversaries are in constant battle as
one(generator) tries to fool the other(discriminator),
while the other tries not to be fooled.

GEC SREEKRISHNAPURAM GAN 7 / 34

System Overview

Two Feedback Loops:

• The Discriminator is in a feedback loop with the ground truth
of the images (are they real or fake)
• The Generator is in a feedback loop with the Discriminator
(did the Discriminator label it real or fake, regardless of the
truth)

GEC SREEKRISHNAPURAM GAN 8 / 34

System Overview
Discriminator vs Generator

GEC SREEKRISHNAPURAM GAN 9 / 34

System Overview

GEC SREEKRISHNAPURAM GAN 10 / 34

System Overview
Loss Function In Discriminative Model
Loss function in Discriminative Model is a regular cross entropy
loss function associated with a binary classifier.

P can be represented as D(x); ie, Probability estimated by

Discriminator D that image X is real image.

GEC SREEKRISHNAPURAM GAN 11 / 34

System Overview

Applying Gradient descent algorithm for minimizing the loss

function
the equation becomes

Loss Function In Generative Model

GEC SREEKRISHNAPURAM GAN 12 / 34

System Overview

GEC SREEKRISHNAPURAM GAN 13 / 34

System Overview

Advantages Over VAE

• GAN belongs to the type of non-parametric production-based
modeling methods, which does not require prior approximate
distributions of training
• GAN works on the whole image and takes less time to
generate samples by directly using global information

GEC SREEKRISHNAPURAM GAN 14 / 34

System Overview

GAN Problems
• Non-convergence:The model parameters oscillate, destabilize
and never converge.
• Mode collapse:The generator collapses which produces limited
varieties of samples.
• Diminished gradient: the discriminator gets too successful
that the generator gradient vanishes and learns nothing.
• Unbalance between the generator and discriminator causing
overfitting
• Highly sensitive to the hyperparameter selections.

GEC SREEKRISHNAPURAM GAN 15 / 34

Types Of GAN

1.DCGAN(Deep Convolutional GAN)

• The generator and discriminator of simple GAN is a simple
fully connected network
generator=Sequential([
Dense(128,inputshape=(100,)),
LeakyReLu(alpha-0.01),
Dense(784),
Activation(’tanh’),
],name=’generator’)
• But in DCGAN Discriminator is a Convolutional Nueral
Network (CNN) and Generator is Transposed Convolutional
Network(Deconvolutional network)
• ie DCGAN will be more fit for the image/video data than a
Simple GAN

GEC SREEKRISHNAPURAM GAN 16 / 34

Types Of GAN(DCGAN cont..)

Similarities Of Neural Networks And CNN

• Both Nueral Network and CNN have learn able weights and
biases.
• In both networks nueron receives some input,perform a dot
product follows it up with a non linear function like
RELU(Rectified Linear Unit)
Main problems with fully connected layers
• Number of weights needed for the nueral network is large
• Networks with large number of parameters faces several
problems.
• slower training time
• chances of overfitting
• etc..

GEC SREEKRISHNAPURAM GAN 17 / 34

Types Of GAN(DCGAN cont..)
Convolutional Neural Network(CNN)
• In CNN the main image matrix is reduced to a matrix of lower
dimension in the first layer through an operation called
Convolution
eg:an image of 64x64xx3 can be reduced to 1x1x10 following
subsequent operation.

Figure: Architecture of Convolutional Neural Network

GEC SREEKRISHNAPURAM GAN 18 / 34

Types Of GAN(DCGAN cont..)
Convolutional Layer

GEC SREEKRISHNAPURAM GAN 19 / 34

Types Of GAN(DCGAN cont..)
Max pooling

Figure: Max Pooling

GEC SREEKRISHNAPURAM GAN 20 / 34

Types Of GAN(DCGAN cont..)

Figure: Discriminator
GEC SREEKRISHNAPURAM GAN 21 / 34
Types Of GAN(DCGAN cont..)

Figure: Generator
GEC SREEKRISHNAPURAM GAN 22 / 34
Types Of GAN
2.CGAN(Conditional GAN)
• when the data set is complex or large-scale, it is difficult for
GAN to control generated result.
• Conditional GANs (CGANs) are an extension of the GANs
model.
• In CGAN the Generator and Discriminator both receive some
additional conditioning input information(y). This could be
the class of the current image or some other property.

NOTE: CGANs have one disadvantage. CGANs are not strictly

unsupervised and we need some kind of labels for them to work
GEC SREEKRISHNAPURAM GAN 23 / 34
Types Of GAN
3.CYCLE GAN
• The CycleGAN is an extension of the GAN architecture that
involves the simultaneous training of two generator models
and two discriminator models.
• The CycleGAN is a technique that involves the automatic
training of image-to-image translation models without paired
examples.
• The models are trained in an unsupervised manner using a
collection of images from the source and target domain that
do not need to be related in any way.

GEC SREEKRISHNAPURAM GAN 24 / 34

Types Of GAN (CYCLEGAN cont...)

GEC SREEKRISHNAPURAM GAN 25 / 34

Types Of GAN

4.SEQGAN(Sequential GAN)
• In sequential data (text, speech, etc), there are some
limitations in applying the exact same concepts of GAN.
These limitations arise mainly due to the sequential and
discrete nature of the data.
• This is the image representation of a random matrix (M)

GEC SREEKRISHNAPURAM GAN 26 / 34

Types Of GAN(SEQGAN cont...)

• This is the image representation of M+0.08

• .But in case of a text ,Suppose that the word computer is

represented by the real-valued vector v = [0.11143, -0.97712,
0.445216 .., 0.7221240]. Now, v + 0.08 is another vector
which need not necessarily represent some word in the
vocabulary.
• eg:”penguin”+0.001==¿”ostrich”

GEC SREEKRISHNAPURAM GAN 27 / 34

Types Of GAN(SEQGAN cont...)

• To overcome,Goodfellow( father of GAN )recommended to

use Reinforcement learning to train GAN to generate discrete
tokens.
• SeqGan(Sequence Generative Adversarial Nets) Using
Reinforcement Learning to combat the non-differentiability
issue in text GANs.

GEC SREEKRISHNAPURAM GAN 28 / 34

Types Of GAN(SEQGAN cont...)

• The generator is treated as an RL agent.

• previous tokens are the states (stored in the hidden states)
and the action is the next token to generate.

• The discriminator is fed with both real and synthetic data to

local the difference.
• To evaluate some partial sequence, they use another
generator.

GEC SREEKRISHNAPURAM GAN 29 / 34

Types Of GAN(SEQGAN cont...)

• Finally completing the sentence .ie completing the action it

will get some rewards(in this case from the discriminator) how
good the sentence is?
• For picking the right action from the particular state using the
concept of policy.
• For optimizing the policy gradient methods are used.

GEC SREEKRISHNAPURAM GAN 30 / 34

Applications
Different types of GAN and its applications:

GEC SREEKRISHNAPURAM GAN 31 / 34

Conclusion

Conclusion:
GANs are one of the new state of the art neural networks which
can be used to do many things.There is a lot of active research in
the field to apply GANs for language tasks, to improve their
stability and ease of training, and so on. They are already being
applied in industry for a variety of applications ranging from
interactive image editing, 3D shape estimation, drug discovery,
semi-supervised learning to robotics etc.

GEC SREEKRISHNAPURAM GAN 32 / 34

References

GEC SREEKRISHNAPURAM GAN 33 / 34

THANK YOU

GEC SREEKRISHNAPURAM GAN 34 / 34

CIGRE Technical Brochure 939 - Analysis of AC Transformer Reliability, September 2024
100% (1)
CIGRE Technical Brochure 939 - Analysis of AC Transformer Reliability, September 2024
109 pages
Lesson 5 Deep Neural Net Optimization Tuning Interpretability
100% (1)
Lesson 5 Deep Neural Net Optimization Tuning Interpretability
105 pages
Top 100 Interview Questions On Machine Learning
100% (1)
Top 100 Interview Questions On Machine Learning
155 pages
Generative Adversial Network
No ratings yet
Generative Adversial Network
21 pages
Generative Adversarial Networks (GANs)
No ratings yet
Generative Adversarial Networks (GANs)
51 pages
Deep Learning PPT Full Notes
No ratings yet
Deep Learning PPT Full Notes
105 pages
Generative Adversarial Networks Review 1-06-08-1.edit
No ratings yet
Generative Adversarial Networks Review 1-06-08-1.edit
24 pages
Yugandar - Generative AI Architect
No ratings yet
Yugandar - Generative AI Architect
8 pages
Generative AI With Large Language Models AWS & DeepLearning
No ratings yet
Generative AI With Large Language Models AWS & DeepLearning
96 pages
The Effect of Time Schedule On The Students'
No ratings yet
The Effect of Time Schedule On The Students'
32 pages
NLP and Generative AI Syllabus - 2025
No ratings yet
NLP and Generative AI Syllabus - 2025
5 pages
Mathematics of Generative AI
No ratings yet
Mathematics of Generative AI
22 pages
(Advances in Computer Vision and Pattern Recognition) Ke Gu, Hongyan Liu, Chengxu Zhou - Quality Assessment of Visual Content-Springer (2022)
No ratings yet
(Advances in Computer Vision and Pattern Recognition) Ke Gu, Hongyan Liu, Chengxu Zhou - Quality Assessment of Visual Content-Springer (2022)
256 pages
Machine Learning
100% (1)
Machine Learning
189 pages
Deep Learning RNN
100% (1)
Deep Learning RNN
53 pages
Knowledge Graphs V Vector Databases and When Not To Use Them!
No ratings yet
Knowledge Graphs V Vector Databases and When Not To Use Them!
3 pages
Deep Learning Unit 1
No ratings yet
Deep Learning Unit 1
32 pages
Neural Networks PDF
No ratings yet
Neural Networks PDF
89 pages
Unit 4 Deeplearning
No ratings yet
Unit 4 Deeplearning
41 pages
Machine Learning GenAI Roadma
No ratings yet
Machine Learning GenAI Roadma
36 pages
A Review On Large Language Models Architectures Ap
No ratings yet
A Review On Large Language Models Architectures Ap
31 pages
GenAI Unit1 3
No ratings yet
GenAI Unit1 3
31 pages
Machine Learning
No ratings yet
Machine Learning
20 pages
MACHINE LEARNING 1-5 (Ai &DS)
100% (1)
MACHINE LEARNING 1-5 (Ai &DS)
60 pages
Introduction To Machine Learning PDF
100% (1)
Introduction To Machine Learning PDF
17 pages
AI Lab Manual Version 1.3
100% (1)
AI Lab Manual Version 1.3
123 pages
Imaging Brain Function With EEG
100% (4)
Imaging Brain Function With EEG
266 pages
Generative AI
No ratings yet
Generative AI
2 pages
Lec16 - Autoencoders
No ratings yet
Lec16 - Autoencoders
18 pages
Large Language Model (LLM) 1
100% (1)
Large Language Model (LLM) 1
17 pages
A Survey of Evolution of Image Captioning PDF
No ratings yet
A Survey of Evolution of Image Captioning PDF
18 pages
Deep Learning (MODULE-3)
No ratings yet
Deep Learning (MODULE-3)
85 pages
Advanced Deep Learning Questions - ChatGPT
No ratings yet
Advanced Deep Learning Questions - ChatGPT
13 pages
Deep Learning Interview Questions
No ratings yet
Deep Learning Interview Questions
17 pages
Deep Learning Questions
50% (2)
Deep Learning Questions
51 pages
DL Lab Manual
No ratings yet
DL Lab Manual
65 pages
22 Selected Top Papers On Deep Learning
No ratings yet
22 Selected Top Papers On Deep Learning
393 pages
W3schools: CSS Reference
No ratings yet
W3schools: CSS Reference
21 pages
Deploy A Machine Learning Model Using Flask - Towards Data Science
No ratings yet
Deploy A Machine Learning Model Using Flask - Towards Data Science
12 pages
Deep Learning Handout
100% (1)
Deep Learning Handout
6 pages
Unit-V Deep Learning Techniques
100% (1)
Unit-V Deep Learning Techniques
31 pages
Deep Learning Unit-1 Finals
No ratings yet
Deep Learning Unit-1 Finals
23 pages
Unit 2
No ratings yet
Unit 2
112 pages
Brief Introduction To GenAI
No ratings yet
Brief Introduction To GenAI
1 page
Anitha S. Pillai and Roberto Tedesco - Machine Learning and Deep Learning in Natural Language Processing-CRC Press (2024)
100% (2)
Anitha S. Pillai and Roberto Tedesco - Machine Learning and Deep Learning in Natural Language Processing-CRC Press (2024)
245 pages
Deep Learning 2017 Lecture7GAN
No ratings yet
Deep Learning 2017 Lecture7GAN
62 pages
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
No ratings yet
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
7 pages
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
100% (1)
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
73 pages
Backpropagation
No ratings yet
Backpropagation
7 pages
Design A Machine Learning System
No ratings yet
Design A Machine Learning System
9 pages
Introduction To Neural Networks
No ratings yet
Introduction To Neural Networks
51 pages
SUpport Vector Machine
No ratings yet
SUpport Vector Machine
28 pages
Dropout Vs Pruning
No ratings yet
Dropout Vs Pruning
2 pages
542 315 Word2vec
No ratings yet
542 315 Word2vec
20 pages
PyTorch Workflow Fundamentals
No ratings yet
PyTorch Workflow Fundamentals
1 page
Day 1
No ratings yet
Day 1
32 pages
8 Machine Learning Algorithms in Python
100% (3)
8 Machine Learning Algorithms in Python
16 pages
Modeling and Design of Plate Heat Exchanger
No ratings yet
Modeling and Design of Plate Heat Exchanger
33 pages
Variation
No ratings yet
Variation
20 pages
Ebook Deep Learning Objective Type Questions
No ratings yet
Ebook Deep Learning Objective Type Questions
102 pages
Langchain PDF Reader
100% (1)
Langchain PDF Reader
15 pages
10 Evani Generative AI Champion
No ratings yet
10 Evani Generative AI Champion
39 pages
Liar by Isaac Asimov 2
No ratings yet
Liar by Isaac Asimov 2
16 pages
Ad Tepa - Pessi by Qamar Ali
No ratings yet
Ad Tepa - Pessi by Qamar Ali
3 pages
Javascript Tutorial
No ratings yet
Javascript Tutorial
30 pages
Minimal Representations of Orientation Homogeneous Transformations
No ratings yet
Minimal Representations of Orientation Homogeneous Transformations
14 pages
2006-09 Lodgeroom
100% (1)
2006-09 Lodgeroom
25 pages
Evaluating Risks of Construction-Induced Building Damage For Large Underground Construction Projects
No ratings yet
Evaluating Risks of Construction-Induced Building Damage For Large Underground Construction Projects
28 pages
Machine Learning Handouts
No ratings yet
Machine Learning Handouts
110 pages
ML - 8
No ratings yet
ML - 8
70 pages
Unit 4
No ratings yet
Unit 4
27 pages
Lucky Name Numerology Calculator - Is Your Name Fortunate
No ratings yet
Lucky Name Numerology Calculator - Is Your Name Fortunate
2 pages
Mathematics For Economics: Euncheol Shin
No ratings yet
Mathematics For Economics: Euncheol Shin
14 pages
All BlueJ Program
No ratings yet
All BlueJ Program
14 pages
ORF 544 Week 5 Derivative Free Stochastic Optimization VFA and DLA
No ratings yet
ORF 544 Week 5 Derivative Free Stochastic Optimization VFA and DLA
123 pages
Jurnal Ekonomi Mikro
No ratings yet
Jurnal Ekonomi Mikro
26 pages
Assignment-1 QT
No ratings yet
Assignment-1 QT
3 pages
Improved Serially Concatenated Convolution Turbo Code (SCCTC) Using Chicken Swarm Optimization
No ratings yet
Improved Serially Concatenated Convolution Turbo Code (SCCTC) Using Chicken Swarm Optimization
6 pages
Permutations and Combination
No ratings yet
Permutations and Combination
26 pages
CS 1101 Unit 4
No ratings yet
CS 1101 Unit 4
3 pages
Learn To Submit HTML Data To Mysql Database Using PHP: Programming-Tutorials/)
No ratings yet
Learn To Submit HTML Data To Mysql Database Using PHP: Programming-Tutorials/)
35 pages
Porous Media in Openfoam: Chalmers Spring 2009
No ratings yet
Porous Media in Openfoam: Chalmers Spring 2009
14 pages
Symplectic Geometry
No ratings yet
Symplectic Geometry
21 pages
Unit-II - ADS - IMP QP
No ratings yet
Unit-II - ADS - IMP QP
3 pages
Mrsptu Syllabus
No ratings yet
Mrsptu Syllabus
14 pages
Math Mentals G2
No ratings yet
Math Mentals G2
12 pages
Formulating and Solving LPs Using Excel Solver
No ratings yet
Formulating and Solving LPs Using Excel Solver
10 pages
Automated Doctor Appointment and Patient Prescription Management System
No ratings yet
Automated Doctor Appointment and Patient Prescription Management System
21 pages
The Multiple Classical Linear Regression Model (CLRM) : Specification and Assumptions
No ratings yet
The Multiple Classical Linear Regression Model (CLRM) : Specification and Assumptions
19 pages
Artigo SEPOPE - Redes Neurais - Ingles
No ratings yet
Artigo SEPOPE - Redes Neurais - Ingles
12 pages
Search On Codescracker Search: F T G L y
No ratings yet
Search On Codescracker Search: F T G L y
6 pages
HTML Basics: Search On Codescracker Search
No ratings yet
HTML Basics: Search On Codescracker Search
6 pages
C Programs
No ratings yet
C Programs
3 pages
Clarification Finalsem PDF
No ratings yet
Clarification Finalsem PDF
1 page
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
César Pérez López
No ratings yet
Image Segmentation: Unlocking Insights through Pixel Precision
From Everand
Image Segmentation: Unlocking Insights through Pixel Precision
Fouad Sabry
No ratings yet
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
From Everand
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
Fouad Sabry
No ratings yet

GANppt

Uploaded by

GANppt

Uploaded by

Recent Advances of Generative Adversarial

Networks in Computer Vision

GOVERNMENT ENGINNERING COLLEGE, SREEKRISHNAPURAM

GEC SREEKRISHNAPURAM GAN 1 / 34

GEC SREEKRISHNAPURAM GAN 2 / 34

GEC SREEKRISHNAPURAM GAN 3 / 34

• Generative Adversarial Network (GAN), a generative approach

GEC SREEKRISHNAPURAM GAN 4 / 34

GEC SREEKRISHNAPURAM GAN 5 / 34

• The Generator takes in random noise and returns an image.

GEC SREEKRISHNAPURAM GAN 7 / 34

Two Feedback Loops:

GEC SREEKRISHNAPURAM GAN 8 / 34

GEC SREEKRISHNAPURAM GAN 9 / 34

GEC SREEKRISHNAPURAM GAN 10 / 34

P can be represented as D(x); ie, Probability estimated by

GEC SREEKRISHNAPURAM GAN 11 / 34

Applying Gradient descent algorithm for minimizing the loss

Loss Function In Generative Model

GEC SREEKRISHNAPURAM GAN 12 / 34

GEC SREEKRISHNAPURAM GAN 13 / 34

Advantages Over VAE

GEC SREEKRISHNAPURAM GAN 14 / 34

GEC SREEKRISHNAPURAM GAN 15 / 34

1.DCGAN(Deep Convolutional GAN)

GEC SREEKRISHNAPURAM GAN 16 / 34

Similarities Of Neural Networks And CNN

GEC SREEKRISHNAPURAM GAN 17 / 34

Figure: Architecture of Convolutional Neural Network

GEC SREEKRISHNAPURAM GAN 18 / 34

GEC SREEKRISHNAPURAM GAN 19 / 34

Figure: Max Pooling

GEC SREEKRISHNAPURAM GAN 20 / 34

NOTE: CGANs have one disadvantage. CGANs are not strictly

GEC SREEKRISHNAPURAM GAN 24 / 34

GEC SREEKRISHNAPURAM GAN 25 / 34

GEC SREEKRISHNAPURAM GAN 26 / 34

• This is the image representation of M+0.08

• .But in case of a text ,Suppose that the word computer is

GEC SREEKRISHNAPURAM GAN 27 / 34

• To overcome,Goodfellow( father of GAN )recommended to

GEC SREEKRISHNAPURAM GAN 28 / 34

• The generator is treated as an RL agent.

• The discriminator is fed with both real and synthetic data to

GEC SREEKRISHNAPURAM GAN 29 / 34

• Finally completing the sentence .ie completing the action it

GEC SREEKRISHNAPURAM GAN 30 / 34

GEC SREEKRISHNAPURAM GAN 31 / 34

GEC SREEKRISHNAPURAM GAN 32 / 34

GEC SREEKRISHNAPURAM GAN 33 / 34

GEC SREEKRISHNAPURAM GAN 34 / 34

You might also like