Deep Generative Models
https://arxiv.org/pdf/2108.02774v1
• Realistic samples
• Artwork, super-resolution, colorization, customization
• Data augmentation
• Robust model training, bias & fairness
Supervised Learning
[Figure: shallow learning vs. deep learning]
https://www.mlguru.ai/Learn/concepts-deep-learning
Unsupervised Learning
[Figure: unlabeled data projected onto principal components PC1 and PC2]
https://arxiv.org/pdf/2308.04395
Self-Supervised Learning
• Supervisory signals
• Generating pseudo labels from the input data
• Example: masked patches are used as labels in masked autoencoders for reconstruction (a minimal sketch follows this list)
• Pretext tasks
• Learning meaningful context-aware representations of the data with a given task
• Example: contrastive learning, which pulls together augmented views of the same image and pushes apart views of different images
• Transferable representations
• Using the learned representations for downstream tasks
• Example: fine-tuning the encoder for classification or segmentation
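To make the masked-autoencoder example above concrete, here is a minimal PyTorch-style sketch of turning unlabeled images into (input, pseudo-label) pairs by masking patches; the patch size, mask ratio, and tensor shapes are illustrative assumptions, not values from a specific paper.

```python
import torch

def make_mae_targets(images, patch=16, mask_ratio=0.75):
    """Split images into patches and hide a random subset from the encoder.

    The masked patches themselves become the regression targets,
    so no human labels are needed (illustrative sketch).
    """
    B, C, H, W = images.shape
    # Cut into non-overlapping patches and flatten each patch into a vector
    patches = images.unfold(2, patch, patch).unfold(3, patch, patch)
    patches = patches.permute(0, 2, 3, 1, 4, 5).reshape(B, -1, C * patch * patch)
    num_patches = patches.shape[1]
    num_masked = int(mask_ratio * num_patches)
    # Random per-image mask: True = hidden from the encoder
    idx = torch.rand(B, num_patches).argsort(dim=1)
    mask = torch.zeros(B, num_patches, dtype=torch.bool)
    mask.scatter_(1, idx[:, :num_masked], True)
    targets = patches[mask]    # pseudo labels: pixels of the masked patches
    visible = patches[~mask]   # what the encoder actually sees
    return visible, targets, mask

# usage: visible patches go to the encoder, the decoder predicts `targets`
x = torch.randn(8, 3, 224, 224)
visible, targets, mask = make_mae_targets(x)
```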
• Supervised pre-training
• Uses labeled data
• Learns task-specific representations
• Unsupervised pre-training
• Uses unlabeled data
• Learns generic representations
• New task
• What are the strategies for training?
• What factors to consider?
• New task
• Large enough data & resources → training from scratch; with limited data, fine-tuning a pretrained model is the usual alternative (both options are sketched below)
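A minimal sketch of the two options, assuming torchvision is available; the choice of ResNet-18 and a 10-class head is purely illustrative.

```python
import torch.nn as nn
from torchvision import models

# Option 1: fine-tune a pretrained encoder (limited data/compute).
# Freeze the generic representations and train only a new task head.
model = models.resnet18(weights="IMAGENET1K_V1")
for p in model.parameters():
    p.requires_grad = False
model.fc = nn.Linear(model.fc.in_features, 10)  # new head, trainable by default

# Option 2: train from scratch (large enough data & resources).
scratch = models.resnet18(weights=None, num_classes=10)
```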
• Explicit models
• Learn a model that explicitly defines and estimates density pmodel(x)
• Example: VAEs, denoising diffusion models (DDMs)
• Implicit models
• Learn a model that samples from pmodel(x) w/o explicitly defining it
• Example: GANs
https://link.springer.com/chapter/10.1007/978-3-031-72744-3_19
Autoencoders
https://lilianweng.github.io/posts/2018-08-12-vae/
Variational Autoencoders
https://lilianweng.github.io/posts/2018-08-12-vae/
• Training
• Use an encoder/inference network qɸ(z|x) = N(µɸ(x), ∑ɸ(x)) that approximates the intractable posterior pθ(z|x)
• Use maximum likelihood estimation to fit the model parameters θ and ɸ: maximizing the ELBO lower-bounds log pθ(x) and simultaneously minimizes the KL divergence between qɸ(z|x) and the true posterior pθ(z|x) (the decomposition is written out below)
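Written out, this is the standard VAE identity (stated here for completeness, since the slide's figure is not reproduced): the KL term on the right is non-negative, so maximizing the ELBO both pushes up a lower bound on the data likelihood and pulls qɸ(z|x) toward the true posterior.

```latex
\log p_\theta(x)
  = \underbrace{\mathbb{E}_{z \sim q_\phi(z|x)}\!\big[\log p_\theta(x, z) - \log q_\phi(z|x)\big]}_{\text{ELBO (maximize)}}
  + \underbrace{D_{\mathrm{KL}}\big(q_\phi(z|x)\,\|\,p_\theta(z|x)\big)}_{\ge 0\ \text{(minimized as the bound tightens)}}
```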
https://link.springer.com/book/10.1007/978-3-030-93158-2
VAEs in Practice
• Training
• Data likelihood (evidence) lower bound is tractable
• Maximize log pθ(x) ≥ ELBO = Ez∼qɸ(z|x)[log pθ(x|z)] − Ez∼qɸ(z|x)[log(qɸ(z|x) / pθ(z))]
• Equivalently, minimize Reconstruction loss(x ; x′) + KLD(N(µɸ(x), ∑ɸ(x)) ‖ N(0, I)) (the closed form of the KLD term is given below)
https://lilianweng.github.io/posts/2018-08-12-vae/
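The KLD term above has a simple closed form when, as here, both distributions are Gaussian with diagonal covariance (standard result, stated for reference):

```latex
\mathrm{KLD}\big(\mathcal{N}(\mu, \mathrm{diag}(\sigma^2)) \,\|\, \mathcal{N}(0, I)\big)
  = \tfrac{1}{2} \sum_{j=1}^{d} \big(\sigma_j^2 + \mu_j^2 - 1 - \log \sigma_j^2\big)
```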
VAEs in Practice
• Training
• The encoder learns to output 𝜇 and 𝜎 for each input data point
• A latent vector 𝑧 is sampled for reconstruction, using the reparameterization trick
𝑧 = 𝜇 + 𝜎 ⊙ ϵ,  ϵ ∼ N(0, I)
• Generation
• Sample latent 𝑧 ∼ N(0, I)
• Pass 𝑧 through the decoder to produce a new sample (a minimal code sketch of training and generation follows)
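A minimal PyTorch-style sketch of this recipe; the MLP architecture, the MSE reconstruction loss, and all dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    def __init__(self, x_dim=784, z_dim=16, h_dim=256):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
        self.mu = nn.Linear(h_dim, z_dim)       # encoder head for μ
        self.logvar = nn.Linear(h_dim, z_dim)   # encoder head for log σ²
        self.dec = nn.Sequential(nn.Linear(z_dim, h_dim), nn.ReLU(),
                                 nn.Linear(h_dim, x_dim))

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        std = torch.exp(0.5 * logvar)
        eps = torch.randn_like(std)
        z = mu + std * eps                       # reparameterization trick
        return self.dec(z), mu, logvar

def loss_fn(x, x_rec, mu, logvar):
    recon = F.mse_loss(x_rec, x, reduction="sum")                   # reconstruction loss
    kld = 0.5 * torch.sum(mu.pow(2) + logvar.exp() - 1 - logvar)    # KLD to N(0, I)
    return recon + kld                                              # negative ELBO (up to constants)

# Training step (illustrative): x_rec, mu, logvar = vae(x); loss_fn(x, x_rec, mu, logvar).backward()
# Generation: z = torch.randn(64, 16); samples = vae.dec(z)
```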
Reparameterization Trick
• [Figure from the reference below: red nodes are non-differentiable sampling operations, blue nodes are loss layers]
• Backpropagation can be applied to the reparameterized network on the right, where the noise ϵ ∼ N(0, I) enters as an input and 𝑧 = 𝜇 + 𝜎 ⊙ ϵ is differentiable with respect to 𝜇 and 𝜎
https://arxiv.org/pdf/1606.05908v3
Summary
• Pros
• Generalization -> VAEs can generate diverse images because they explicitly model the data density
• Interpretability -> VAE latent representations can be used for interpretability
• Cons
• Quality -> VAEs tend to generate smoother, blurrier, less detailed images
• Data -> VAEs require data diverse enough to span the entire distribution
• Dimensionality -> It is not clear how to choose the latent dimension
• Optimization -> The ELBO enforces an information bottleneck at the latent variables, making the optimization prone to bad local minima
Generative Adversarial Networks (GANs)
• How it works
• Two networks: a generator that maps random noise to samples, and a discriminator that classifies samples as real or fake
• Iteration: both networks continuously update their strategies over time
• Learning: the generator learns to fool the discriminator, while the discriminator becomes better at detecting fakes, mimicking the feedback loop seen in game theory (a minimal training-loop sketch follows)
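A minimal sketch of that feedback loop with the standard binary cross-entropy losses; the MLP generator/discriminator, learning rates, and data shapes are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

z_dim, x_dim = 64, 784
G = nn.Sequential(nn.Linear(z_dim, 256), nn.ReLU(), nn.Linear(256, x_dim))      # generator
D = nn.Sequential(nn.Linear(x_dim, 256), nn.LeakyReLU(0.2), nn.Linear(256, 1))  # discriminator (logit)
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)

def train_step(real):
    b = real.size(0)
    ones, zeros = torch.ones(b, 1), torch.zeros(b, 1)

    # Discriminator step: get better at telling real from fake
    fake = G(torch.randn(b, z_dim)).detach()
    d_loss = (F.binary_cross_entropy_with_logits(D(real), ones) +
              F.binary_cross_entropy_with_logits(D(fake), zeros))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()

    # Generator step: learn to fool the discriminator
    fake = G(torch.randn(b, z_dim))
    g_loss = F.binary_cross_entropy_with_logits(D(fake), ones)
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
    return d_loss.item(), g_loss.item()

# usage (illustrative real batch): losses = train_step(torch.rand(32, x_dim))
```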
GANs in Practice
https://newsletter.theaiedge.io/p/how-generative-adversarial-networks
Summary
• Pros
• Quality -> GANs can generate high-quality, sharp images
• Utility -> Adversarial concepts can be used to improve the generation process
• Cons
• Training instability -> Jointly training two networks can result in mode collapse
• Bias and fairness -> GANs can reflect the biases present in the training data
• Interpretability -> GANs are implicit models and difficult to interpret or explain
Appendix
• Bayes’ rule
• Kullback-Leibler divergence
• Jensen’s inequality
• Linear function (E[f(X)] = f(E[X]))
• Convex function (E[f(X)] ≥ f(E[X]))
• Concave function (E[f(X)] ≤ f(E[X]))
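For reference, the standard formulas behind these appendix items, written out:

```latex
% Bayes' rule
p(z \mid x) = \frac{p(x \mid z)\, p(z)}{p(x)}

% Kullback-Leibler divergence between q and p (non-negative, zero iff q = p)
D_{\mathrm{KL}}(q \,\|\, p) = \mathbb{E}_{z \sim q}\!\left[ \log \frac{q(z)}{p(z)} \right]

% Jensen's inequality: for convex f, f(\mathbb{E}[X]) \le \mathbb{E}[f(X)];
% the inequality reverses for concave f and becomes equality for linear f.
```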
Thank you!
Any questions?