
Variational AutoEncoder

Muhammad Atif Tahir


Introduction
• Two significant contributions in deep learning based generative
models in recent years:
• Variational Autoencoders
• Generative Adversarial Networks (GANs)
Variational AutoEncoders
• The variational autoencoder was proposed in 2013 by Diederik P. Kingma and
Max Welling at Google and Qualcomm

• Variational Autoencoders (VAEs) are fascinating models that combine
Bayesian statistics with deep neural networks; they wear many hats and
bridge many different worlds

• In other words, a variational autoencoder (VAE) provides a probabilistic
way of describing an observation in latent space

• Thus, rather than building an encoder that outputs a single value to
describe each latent state attribute, a VAE formulates the encoder to describe
a probability distribution for each latent attribute
D. P. Kingma and M. Welling, "Auto-Encoding Variational Bayes," arXiv preprint arXiv:1312.6114,
2013. Cited by 32,776 (as of 5/2/24)
Variational AutoEncoders
VAEs can be viewed as, or used in:

• Deep neural networks
• Bayesian statistical machines
• Latent variable models
• Maximum likelihood estimators
• Dimensionality reducers
• Generative models
AutoEncoder vs Variational AutoEncoder
Variational AutoEncoder architecture diagram
• The encoder will take each input image
• Encode it to two vectors that together define a multivariate normal
distribution in the latent space
• Some Notations
• z_mean: The mean point of the distribution
• z_log_var: The logarithm of the variance of each dimension
• A point z is sampled from the distribution defined by these values using the
following equation:
• z = z_mean + z_sigma * epsilon
where z_sigma = exp(z_log_var * 0.5) and epsilon ~ N(0, I)
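
A minimal sketch of this sampling step (plain NumPy, with the encoder outputs passed in as arrays with made-up values) might look like:

```python
import numpy as np

def sample_z(z_mean, z_log_var, rng=None):
    """Draw z = z_mean + z_sigma * epsilon with epsilon ~ N(0, I)."""
    rng = rng or np.random.default_rng()
    z_sigma = np.exp(0.5 * z_log_var)                 # std dev recovered from the log-variance
    epsilon = rng.standard_normal(np.shape(z_mean))   # epsilon ~ N(0, I)
    return z_mean + z_sigma * epsilon

# Example with hypothetical encoder outputs for a 2-dimensional latent space:
z = sample_z(np.array([0.3, -1.2]), np.array([0.1, 0.4]))
```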
AutoEncoders versus Variational AutoEncoders
Variational AutoEncoder architecture diagram
• The VAE modifies the standard autoencoder by modeling a distribution as the
encoder output, as opposed to a single brittle vector of numbers

• It then samples from this distribution during the forward pass and uses the
reparameterization trick to allow backpropagation through the sampling step

• Both the reparameterization trick and variational inference had been around
longer, but gained greater popularity through their application to VAEs
Reparameterization Trick
• Rather than sampling directly from a normal distribution with
parameters z_mean and z_log_var, epsilon is sampled from a
standard normal distribution and the sample is then adjusted to
have the correct mean and variance
• This is known as the reparameterization trick, and it’s important as it
means gradients can backpropagate freely through the layer
• By keeping all of the randomness of the layer contained within the
variable epsilon, the partial derivative of the layer output with respect
to its input can be shown to be deterministic (i.e., independent of the
random epsilon)
• This is essential for backpropagation through the layer to be possible
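
A small PyTorch sketch (values made up) showing that, with the trick, gradients flow back to z_mean and z_log_var:

```python
import torch

# Hypothetical encoder outputs for one example (latent_dim = 2); values are made up.
z_mean = torch.tensor([0.3, -1.2], requires_grad=True)
z_log_var = torch.tensor([0.1, 0.4], requires_grad=True)

# Reparameterization: all randomness lives in epsilon, so z is a
# deterministic, differentiable function of z_mean and z_log_var.
epsilon = torch.randn_like(z_mean)               # epsilon ~ N(0, I)
z = z_mean + torch.exp(0.5 * z_log_var) * epsilon

# Any downstream loss can now backpropagate through the sampling step.
loss = (z ** 2).sum()
loss.backward()
print(z_mean.grad, z_log_var.grad)               # well-defined gradients
```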
VAE network with and without the reparameterization trick. 𝜙 represents the
distribution the network is trying to learn
Loss Function
• The traditional autoencoder loss consisted only of the reconstruction loss
between images and their attempted copies after being passed
through the encoder and decoder
• For the variational autoencoder, a KL divergence term is added as an extra
component
• The sum is taken over all the dimensions in the latent space
• kl_loss is minimized to 0 when z_mean = 0 and z_log_var = 0 for all
dimensions. As these two terms start to differ from 0, kl_loss increases
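
The KL term itself, for a diagonal Gaussian encoder measured against a standard normal prior (the setting the bullets above describe), is commonly written in the closed form below; this sketch uses NumPy and the slide's z_mean / z_log_var naming:

```python
import numpy as np

def kl_loss(z_mean, z_log_var):
    """KL divergence between N(z_mean, exp(z_log_var)) and a standard normal prior,
    summed over all latent dimensions (standard closed form, assumed here)."""
    return -0.5 * np.sum(1 + z_log_var - np.square(z_mean) - np.exp(z_log_var))

# Consistent with the bullets above: the loss is zero when z_mean = z_log_var = 0.
print(kl_loss(np.zeros(2), np.zeros(2)))   # -0.0 (i.e., zero)
```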
• Focusing only on the reconstruction loss does allow
us to separate out the classes (in this case,
MNIST digits), which should give our decoder
model the ability to reproduce the original
handwritten digits, but there is an uneven
distribution of data within the latent space
• In other words, there are areas in latent space
which don't represent any of our observed data

• Focusing only on the KL loss, on the other hand, means we end up describing
every observation using the same unit Gaussian
• This effectively treats every observation as having the same characteristics;
in other words, we've failed to describe the original data

• However, when the two terms are optimized simultaneously
• The latent state for an observation is described with distributions close
to the prior, deviating when necessary to capture the salient
features of the input

Summary

• The encoder-decoder architecture lies at the heart of Variational
Autoencoders (VAEs), distinguishing them from traditional autoencoders

• The encoder network takes raw input data and transforms it into a
probability distribution within the latent space

• The latent code generated by the encoder is a probabilistic encoding,
allowing the VAE to express not just a single point in the latent space
but a distribution of potential representations
Summary (continued)
• The decoder network, in turn, takes a sampled point from the latent
distribution and reconstructs it back into data space
• During training, the model refines both the encoder and decoder
parameters to minimize the reconstruction loss – the disparity between
the input data and the decoded output
• The goal is not just to achieve accurate reconstruction but also to
regularize the latent space, ensuring that it conforms to a specified
distribution
• The process involves a delicate balance between two essential
components: the reconstruction loss and the regularization term, often
represented by the Kullback-Leibler divergence
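
A sketch of how the two components might be combined in practice (the MSE reconstruction term and the beta weight are assumptions; beta = 1 gives the standard objective described above):

```python
import torch
import torch.nn.functional as F

def vae_loss(x, x_recon, z_mean, z_log_var, beta=1.0):
    """Reconstruction term plus KL regularization (a sketch; MSE is used here,
    but binary cross-entropy is also common for image data)."""
    recon_loss = F.mse_loss(x_recon, x, reduction="sum")   # disparity between input and output
    kl_loss = -0.5 * torch.sum(1 + z_log_var - z_mean.pow(2) - z_log_var.exp())
    return recon_loss + beta * kl_loss                     # beta weights the regularization term
```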
Summary (continued)
• The reconstruction loss compels the model to accurately reconstruct
the input, while the regularization term encourages the latent space to
adhere to the chosen distribution, preventing overfitting and promoting
generalization

• By iteratively adjusting these parameters during training, the VAE
learns to encode input data into a meaningful latent space
representation. This optimized latent code encapsulates the underlying
features and structures of the data, facilitating precise reconstruction

• The probabilistic nature of the latent space also enables the generation
of novel samples by drawing random points from the learned
distribution
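
A sketch of this generation step, using a stand-in (untrained) decoder network purely for illustration; in practice the decoder comes from a trained VAE:

```python
import torch
import torch.nn as nn

latent_dim, data_dim = 2, 784   # hypothetical sizes (e.g., flattened MNIST images)

# Stand-in decoder network; its weights would normally come from training the VAE.
decoder = nn.Sequential(
    nn.Linear(latent_dim, 256), nn.ReLU(),
    nn.Linear(256, data_dim), nn.Sigmoid(),
)

z = torch.randn(16, latent_dim)   # draw 16 random points from the N(0, I) prior
new_samples = decoder(z)          # decode them back into data space as novel samples
print(new_samples.shape)          # torch.Size([16, 784])
```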
References
• https://www.linkedin.com/pulse/understanding-variational-autoencoders-vaes-how-useful-raja
• https://www.analyticsvidhya.com/blog/2021/04/generate-your-own-dataset-using-gan/
• https://www.geeksforgeeks.org/variational-autoencoders/
• https://medium.com/retina-ai-health-inc/variational-inference-derivation-of-the-variational-autoencoder-vae-loss-function-a-true-story-3543a3dc67ee
• https://towardsdatascience.com/reparameterization-trick-126062cfd3c3
