L20: Generative Models

The document discusses generative models, specifically focusing on Variational Autoencoders (VAEs) and their foundational concepts such as the manifold hypothesis, Bayesian inference, and the evidence lower bound (ELBO). It explains the training process of VAEs, including the reparameterization trick and applications like image segmentation, denoising, and super-resolution. Additionally, it touches on Generative Adversarial Networks (GANs) and their competitive training framework.


Generative Models:

Variational Autoencoders

Foundations of Data Analysis

April 28, 2022


These are not real people

Karras et al., CVPR 2020, and thispersondoesnotexist.com




Manifold Hypothesis
Real data lie near a lower-dimensional manifold M

Deep Generative Models

Input: z ∈ R^d, with z ∼ N(0, I)
Output: x = g(z) ∈ R^D, where g = g_L ◦ g_{L−1} ◦ ··· ◦ g_1 and d ≪ D
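As a concrete illustration of the mapping above, here is a minimal PyTorch sketch of a generator g built as a composition of simple layers; the layer sizes and latent/data dimensions are illustrative assumptions, not values from the slides.

```python
import torch
import torch.nn as nn

d, D = 16, 784  # latent and data dimensions (illustrative assumptions)

# g = g_L ∘ ··· ∘ g_1: a composition of simple differentiable maps from R^d to R^D
g = nn.Sequential(
    nn.Linear(d, 128), nn.ReLU(),
    nn.Linear(128, 256), nn.ReLU(),
    nn.Linear(256, D),
)

z = torch.randn(8, d)   # a batch of latent samples z ~ N(0, I)
x = g(z)                # generated points in R^D
print(x.shape)          # torch.Size([8, 784])
```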
Generative Models as Immersed Manifolds

g maps the latent space Z ⊂ R^d onto the manifold M ⊂ R^D:

z ∈ R^d  →  x = g(z) ∈ R^D,   g = g_L ◦ g_{L−1} ◦ ··· ◦ g_1

For g(Z) to be an immersed manifold:

1. g should be differentiable
2. Jacobian matrix, Dg, should be full rank

Shao, Kumar, Fletcher, The Riemannian Geometry of Deep Generative Models, DiffCVML 2018.
Talking about this paper:

Diederik Kingma and Max Welling, Auto-Encoding Variational Bayes, In International Conference on Learning Representations (ICLR), 2014.
Autoencoders
Input x ∈ R^D  →  Latent Space z ∈ R^d  →  Output x′ ∈ R^D,   d ≪ D
Autoencoders

- Linear activation functions give you PCA

- Training (a minimal code sketch follows this list):
  1. Given data x, feed forward to the output x′
  2. Compute a loss, e.g., L(x, x′) = ‖x − x′‖²
  3. Backpropagate the loss gradient to update the weights

- Not a generative model!
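A minimal PyTorch sketch of this training procedure; the fully connected encoder/decoder, dimensions, batch, and optimizer are illustrative assumptions rather than details from the slides.

```python
import torch
import torch.nn as nn

D, d = 784, 16  # data and latent dimensions (illustrative)

encoder = nn.Sequential(nn.Linear(D, 128), nn.ReLU(), nn.Linear(128, d))
decoder = nn.Sequential(nn.Linear(d, 128), nn.ReLU(), nn.Linear(128, D))

params = list(encoder.parameters()) + list(decoder.parameters())
optimizer = torch.optim.Adam(params, lr=1e-3)

x = torch.rand(32, D)            # a batch of data (placeholder for real inputs)

# 1. Feed forward to the reconstruction x'
z = encoder(x)
x_prime = decoder(z)

# 2. Reconstruction loss L(x, x') = ||x - x'||^2, averaged over the batch
loss = ((x - x_prime) ** 2).sum(dim=1).mean()

# 3. Backpropagate the loss gradient and update the weights
optimizer.zero_grad()
loss.backward()
optimizer.step()
```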
Variational Autoencoders
Input x ∈ R^D  →  Latent Space z ∼ N(µ, σ²)  →  Output x′ ∈ R^D
Generative Models

Sample a new x in two steps:

1. Prior: p(z)
2. Generator: pθ(x | z)

Now the analogy to the “encoder” is:

Posterior: p(z | x)
Bayesian Inference

Posterior via Bayes’ Rule:

p(z | x) = pθ(x | z) p(z) / p(x)
         = pθ(x | z) p(z) / ∫ pθ(x | z) p(z) dz

Integral in denominator is (usually) intractable!


Kullback-Leibler Divergence

DKL(q‖p) = −∫ q(z) log( p(z) / q(z) ) dz
         = Eq[ −log(p/q) ]

The expected extra information needed to describe samples from q when using p in place of q
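As a small sanity check (my own illustrative example, not from the slides), the Monte Carlo form Eq[−log(p/q)] can be compared against the closed-form KL divergence between two 1-D Gaussians using PyTorch's distributions module:

```python
import torch
from torch.distributions import Normal, kl_divergence

q = Normal(loc=0.0, scale=1.0)   # q(z)
p = Normal(loc=1.0, scale=2.0)   # p(z)

# Monte Carlo estimate: D_KL(q || p) = E_q[-log(p(z)/q(z))] = E_q[log q(z) - log p(z)]
z = q.sample((100_000,))
kl_mc = (q.log_prob(z) - p.log_prob(z)).mean()

# Closed-form KL divergence for comparison
kl_exact = kl_divergence(q, p)

print(kl_mc.item(), kl_exact.item())  # the two values should agree closely
```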


Variational Inference

Approximate the intractable posterior p(z | x) with a manageable distribution q(z)

Minimize the KL divergence: DKL(q(z)‖p(z | x))


Evidence Lower Bound (ELBO)
DKL(q(z)‖p(z | x))
  = Eq[ −log( p(z | x) / q(z) ) ]
  = Eq[ −log( p(z, x) / (q(z) p(x)) ) ]
  = Eq[ −log p(z, x) + log q(z) + log p(x) ]
  = −Eq[log p(z, x)] + Eq[log q(z)] + log p(x)

Rearranging:

log p(x) = DKL(q(z)‖p(z | x)) + L[q(z)]

ELBO: L[q(z)] = Eq[log p(z, x)] − Eq[log q(z)]

Since DKL ≥ 0, the ELBO L[q(z)] is a lower bound on the log evidence log p(x), and maximizing it over q is equivalent to minimizing DKL(q(z)‖p(z | x)).
Variational Autoencoder

Encoder Network: qφ(z | x)        Decoder Network: pθ(x | z)

Maximize the ELBO:

L(θ, φ, x) = Eqφ[log pθ(x, z) − log qφ(z | x)]


VAE ELBO

L(θ, φ, x) = Eqφ[log pθ(x, z) − log qφ(z | x)]
           = Eqφ[log pθ(z) + log pθ(x | z) − log qφ(z | x)]
           = Eqφ[ log( pθ(z) / qφ(z | x) ) + log pθ(x | z) ]
           = −DKL(qφ(z | x)‖pθ(z)) + Eqφ[log pθ(x | z)]

Problem: the gradient ∇φ Eqφ[log pθ(x | z)] is intractable!

Use a Monte Carlo approximation, sampling z^(s) ∼ qφ(z | x):

∇φ Eqφ[log pθ(x | z)] ≈ (1/S) Σ_{s=1}^{S} log pθ(x | z^(s)) ∇φ log qφ(z^(s) | x)
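A minimal sketch of this score-function (log-derivative) estimator in PyTorch; the toy log_p_x_given_z below is a hypothetical stand-in for the decoder likelihood log pθ(x | z), since the slides do not fix a particular likelihood.

```python
import torch
from torch.distributions import Normal

d = 2
x = torch.tensor([0.5, -1.0])

# Encoder parameters phi (here just mu and log-sigma directly, for illustration)
mu = torch.zeros(d, requires_grad=True)
log_sigma = torch.zeros(d, requires_grad=True)

def log_p_x_given_z(z):
    # Hypothetical stand-in for the decoder log-likelihood log p_theta(x | z)
    return -((x - z) ** 2).sum(dim=-1)

q = Normal(mu, log_sigma.exp())

S = 1000
z = q.sample((S,))                  # z^(s) ~ q_phi(z | x); no gradient through sampling
log_q = q.log_prob(z).sum(dim=-1)   # log q_phi(z^(s) | x)
f = log_p_x_given_z(z).detach()     # treated as a constant weight

# Surrogate whose gradient w.r.t. (mu, log_sigma) equals
# (1/S) * sum_s f(z^(s)) * grad_phi log q_phi(z^(s) | x)
surrogate = (f * log_q).mean()
surrogate.backward()

print(mu.grad, log_sigma.grad)      # Monte Carlo estimate of the gradient of this ELBO term
```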
Reparameterization Trick

What about the other term?

−DKL(qφ(z | x)‖pθ(z))

This term says the encoder, qφ(z | x), should make the code z look like the prior distribution.

Instead of encoding z directly, encode the parameters of a normal distribution, N(µ, σ²).
Reparameterization Trick

qφ(z_j | x^(i)) = N(µ_j^(i), σ_j^2(i))
pθ(z) = N(0, I)

The KL divergence between these two is:

DKL(qφ(z | x^(i))‖pθ(z)) = −(1/2) Σ_{j=1}^{d} ( 1 + log(σ_j^2(i)) − (µ_j^(i))² − σ_j^2(i) )
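Putting the pieces together, here is a minimal VAE training-step sketch in PyTorch that samples z via the reparameterization z = µ + σ ⊙ ε with ε ∼ N(0, I) (so gradients can flow to the encoder) and uses the closed-form KL term above; the architecture, Bernoulli likelihood, and dimensions are illustrative assumptions, not specifics from the slides.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

D, d = 784, 16  # illustrative data/latent dimensions

encoder   = nn.Sequential(nn.Linear(D, 256), nn.ReLU())
to_mu     = nn.Linear(256, d)   # encoder head for mu
to_logvar = nn.Linear(256, d)   # encoder head for log(sigma^2)
decoder   = nn.Sequential(nn.Linear(d, 256), nn.ReLU(), nn.Linear(256, D))

params = (list(encoder.parameters()) + list(to_mu.parameters())
          + list(to_logvar.parameters()) + list(decoder.parameters()))
optimizer = torch.optim.Adam(params, lr=1e-3)

x = torch.rand(32, D)            # a batch of inputs in [0, 1] (placeholder data)

# Encoder: q_phi(z | x) = N(mu, sigma^2)
h = encoder(x)
mu, logvar = to_mu(h), to_logvar(h)

# Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I)
eps = torch.randn_like(mu)
z = mu + torch.exp(0.5 * logvar) * eps

# Decoder term E_q[log p_theta(x | z)], approximated with one sample,
# here under a Bernoulli likelihood (binary cross-entropy)
logits = decoder(z)
recon = -F.binary_cross_entropy_with_logits(logits, x, reduction="none").sum(dim=1)

# Closed-form KL term: D_KL(q_phi(z | x) || N(0, I))
kl = -0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(dim=1)

# Negative ELBO, averaged over the batch (minimized by the optimizer)
loss = (kl - recon).mean()
optimizer.zero_grad()
loss.backward()
optimizer.step()
```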
Results from Kingma & Welling
Why Do Variational?

Example trained on MNIST:

Panels: Autoencoder (reconstruction loss) | KL divergence only | VAE (KL + recon. loss)

From: this webpage


Applications of Autoencoder / VAE Models
Image-to-Image Networks
Instead of trying to reconstruct the original input:
1. Encode input: z = encode(x)
2. Decode derived output: y = decode(z) (see the sketch below)

Example: Image Segmentation
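A minimal sketch of this encode/decode pattern for a derived output such as a segmentation map; the convolutional layers, class count, and cross-entropy loss are illustrative assumptions, not from the slides.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Generic encoder/decoder; for segmentation the decoder outputs per-pixel class scores
encoder = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
decoder = nn.Sequential(nn.Upsample(scale_factor=2), nn.Conv2d(16, 4, 3, padding=1))

x = torch.rand(8, 3, 64, 64)               # input images
target = torch.randint(0, 4, (8, 64, 64))  # per-pixel class labels (placeholder)

z = encoder(x)          # 1. Encode input
y = decoder(z)          # 2. Decode a *derived* output (here, segmentation logits)

# The loss compares y to the derived target, not to the original input x
loss = F.cross_entropy(y, target)
loss.backward()
```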


Image Denoising
Learn mapping from noisy inputs → clean outputs

Hales et al., JMRI 2020


Image Super-resolution

Learn mapping from low-res inputs → hi-res outputs

From: this webpage


Image Colorization

Input x ∈ R^D  →  Latent Space z ∈ R^d  →  Output y ∈ R^D
Generative Adversarial Networks (GANs)
Generative Adversarial Network

Random Noise z ∼ N(0, I)  →  Generator Network G(z)  →  Fake Data
Fake Data + Real Data  →  Discriminator Network D(x)  →  Real / Fake (0 / 1)
GAN Game Theory

GAN training is framed as a competition where:


1. Discriminator is trying to maximize its reward
2. Generator is trying to minimize it

min_G max_D V(D, G)

V(D, G) = E_{x∼p(x)}[log D(x)] + E_{z∼N(0,I)}[log(1 − D(G(z)))]
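A minimal sketch of one alternating training step under this objective, assuming PyTorch with small fully connected G and D; the architectures, optimizers, and placeholder data are illustrative assumptions.

```python
import torch
import torch.nn as nn

d, D_dim = 16, 784  # latent and data dimensions (illustrative)

G = nn.Sequential(nn.Linear(d, 128), nn.ReLU(), nn.Linear(128, D_dim))
D = nn.Sequential(nn.Linear(D_dim, 128), nn.ReLU(), nn.Linear(128, 1), nn.Sigmoid())

opt_D = torch.optim.Adam(D.parameters(), lr=2e-4)
opt_G = torch.optim.Adam(G.parameters(), lr=2e-4)

x_real = torch.rand(32, D_dim)   # a batch of real data (placeholder)
z = torch.randn(32, d)           # z ~ N(0, I)
eps = 1e-8                       # numerical safety inside the logs

# Discriminator step: maximize V(D, G), i.e. minimize -V
x_fake = G(z).detach()           # don't backprop into G here
loss_D = -(torch.log(D(x_real) + eps).mean()
           + torch.log(1 - D(x_fake) + eps).mean())
opt_D.zero_grad()
loss_D.backward()
opt_D.step()

# Generator step: minimize V(D, G), which reduces to minimizing E_z[log(1 - D(G(z)))]
loss_G = torch.log(1 - D(G(z)) + eps).mean()
opt_G.zero_grad()
loss_G.backward()
opt_G.step()
```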


GAN Training Algorithm
Original GAN Faces (2014)

Goodfellow et al., NeurIPS 2014
