Lec24 Diffusion

The document discusses the fundamentals of diffusion models in deep learning, particularly their generative capabilities and how they relate to variational autoencoders (VAEs). It outlines the forward and reverse processes of denoising diffusion models, emphasizing their training methodologies and the use of stochastic differential equations. Additionally, it highlights the limitations of traditional VAEs and presents diffusion models as an advanced approach to generating high-quality data.

Deep Learning
Diffusion
Hao Chen
Fall 2024
Attendance: @

1
Generative vs. Discriminative
• Generative models learn the data distribution p(x); discriminative models learn the conditional p(y|x)

2
Generative Models
• Learning to generate data
https://cvpr2022-tutorial-diffusion-models.github.io/ 3
Generative Models

4
https://lilianweng.github.io/posts/2021-07-11-diffusion-models/
Generative Models
Last Lecture

5
https://lilianweng.github.io/posts/2021-07-11-diffusion-models/
Generative Models

This Lecture

6
https://lilianweng.github.io/posts/2021-07-11-diffusion-models/
A Fast Evolving Field

SORA 2024

7
Content
• Denoising Diffusion Model Basics
• Diffusion Models from Stochastic Differential Equations and Score Matching Perspective
• Denoising Diffusion Implicit Model (DDIM)
• Conditional Diffusion Models
• Applications of Diffusion Models

8
Content
• Diffusion Model Basics
  – Diffusion Models as Stacking VAEs
  – Diffusion Models: Forward, Reverse, Training, Sampling
• Diffusion Models from Stochastic Differential Equations and Score Matching Perspective
• Denoising Diffusion Implicit Model (DDIM)
• Conditional Diffusion Models
• Applications of Diffusion Models

9
Denoising Diffusion Models
• What we often see about diffusion models
(Figure: forward diffusion process and reverse denoising process)

10
Denoising Diffusion Models
• What we often see about diffusion models
(Figure: forward diffusion process and reverse denoising process)
• This lecture: denoising diffusion is a stack of VAEs

11
Recap: Variational Autoencoders
• VAEs: a likelihood-based generative model
• Encoder: an inference model q(z|x) that approximates the posterior
• Decoder: a generative model p(x|z) that transforms a Gaussian variable z to real data
• Training: maximize the ELBO

12
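The ELBO itself is not shown in the extracted slide; for reference, the standard VAE evidence lower bound being maximized is

$$\log p_\theta(x) \;\ge\; \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big] \;-\; D_{\mathrm{KL}}\big(q_\phi(z \mid x) \,\|\, p(z)\big)$$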
Recap: Variational Autoencoders
• Decoder p(x|z): transforms a Gaussian variable to real data
• Encoder q(z|x): an inference model that approximates the posterior, i.e. a Gaussian
(Figure: VAE with encoder q(z|x), latent z, and decoder p(x|z))

13
VAEs are good, but…
• Blurry results

14
Kingma et al. Auto-Encoding Variational Bayes. 2013.
Limitations of VAEs
• Decoder must transform a standard Gaussian all the way to the target distribution in one step
  – Often too large a gap
  – Blurry results are generated
(Figure: z → D(·; φ) + e → x, with e ~ N(0, C))

15
Limitations of VAEs
• Decoder must transform a standard Gaussian all the way to the target distribution in one step
  – Often too large a gap
  – Blurry results are generated
(Figure: z → D(·; φ) + e → x, with e ~ N(0, C))
• Solution: introduce intermediate latent variables to reduce the gap at each step

16
Hierarchical VAEs
• Hierarchical VAEs: stacking VAEs on top of each other
  – Multiple (T) intermediate latents
  – Joint distribution
  – Posterior
• Better likelihood achieved!
(Figure: x ← p(x|z1) ← z1 ← p(z1|z2) ← z2 ← p(z2|z3) ← z3, with posteriors q(z1|x), q(z2|z1), q(z3|z2))

17
Sønderby et al. Ladder Variational Autoencoders. 2016.
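The joint and posterior factorizations referenced above did not survive extraction; under the usual Markovian assumption of a hierarchical VAE they take the form

$$p(x, z_{1:T}) = p(z_T)\, p(x \mid z_1) \prod_{t=1}^{T-1} p(z_t \mid z_{t+1}), \qquad q(z_{1:T} \mid x) = q(z_1 \mid x) \prod_{t=1}^{T-1} q(z_{t+1} \mid z_t)$$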
Stacking VAEs
• Each step, the decoder removes part of the noise
• Each step starts from a seed closer to the final distribution
(Figure: x3 → D(·; φ3) + e → x2 → D(·; φ2) + e → x1 → D(·; φ1) + e → x0, with e ~ N(0, C))

18
Stacking VAEs
• We can have many, many steps (T in total)…
• Each step incrementally recovers the final distribution
(Figure: x_T → Decoder T → x_{T-1} → Decoder T-1 → x_{T-2} → … → x_2 → Decoder 2 → x_1 → Decoder 1 → x_0)
• Looks familiar?

19
Diffusion Models are Stacking VAEs
• Diffusion models are special cases of stacking VAEs
(Figure: x_T → Decoder T → … → x_t → Decoder t → x_{t-1} → … → x_1 → Decoder 1 → x_0)
• The reverse denoising process is the stack of decoders
• What about encoders?

20
Diffusion Models are Stacking VAEs
• Diffusion models are special cases of stacking VAEs
(Figure: encoders x_0 → x_1 → … → x_T in the forward direction; decoders x_T → … → x_1 → x_0 in the reverse direction)
• In VAEs, encoders are learned with the KL-divergence between the posterior and the prior
  – Suffers from the 'posterior collapse' issue
• Diffusion models use fixed inference encoders

21
Chen et al. Variational Lossy Autoencoder. 2016.
Poll

22
Denoising Diffusion Models
• Diffusion models have two processes
• Forward diffusion process gradually adds noise to the input
• Reverse denoising process learns to generate data by denoising

23
Forward Diffusion Process
• Forward diffusion process is stacking fixed VAE encoders
  – Gradually adding Gaussian noise according to a schedule β_t

24
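The per-step transition equation is not in the extracted text; the standard DDPM forward kernel it refers to is

$$q(x_t \mid x_{t-1}) = \mathcal{N}\big(x_t;\; \sqrt{1-\beta_t}\, x_{t-1},\; \beta_t \mathbf{I}\big), \qquad q(x_{1:T} \mid x_0) = \prod_{t=1}^{T} q(x_t \mid x_{t-1})$$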
Forward Diffusion Process
• The forward process allows sampling of x_t at an arbitrary timestep t in closed form
• The noise schedule (β_t values) is designed such that x_T approaches a standard Gaussian

25
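For reference, the closed-form marginal in standard DDPM notation, with α_t = 1 − β_t and ᾱ_t the running product of the α's:

$$q(x_t \mid x_0) = \mathcal{N}\big(x_t;\; \sqrt{\bar{\alpha}_t}\, x_0,\; (1-\bar{\alpha}_t)\mathbf{I}\big), \qquad x_t = \sqrt{\bar{\alpha}_t}\, x_0 + \sqrt{1-\bar{\alpha}_t}\, \epsilon, \quad \epsilon \sim \mathcal{N}(0, \mathbf{I})$$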
Reverse Denoising Process
• Generation process
  – Sample x_T from the prior
  – Iteratively sample x_{t-1} given x_t
• The true reverse transition is not directly tractable
• But it can be estimated with a Gaussian distribution at each step, if β_t is small
  – The purpose of our stack of VAE decoders!

26
Reverse Denoising Process
• Reverse diffusion process is stacking learnable VAE decoders
  – Predicting the mean and std of the added Gaussian noise

27
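The decoder's parameterization is not in the extracted text; the standard DDPM form of each learnable reverse step is

$$p_\theta(x_{t-1} \mid x_t) = \mathcal{N}\big(x_{t-1};\; \mu_\theta(x_t, t),\; \sigma_t^2 \mathbf{I}\big)$$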
Reverse Denoising Process
• Reverse diffusion process is stacking learnable VAE decoders
  – Predicting the mean and std of the added Gaussian noise

28
Reverse Denoising Process
• Reverse diffusion process is stacking learnable VAE decoders
  – Predicting the mean and std of the added Gaussian noise
  – Trainable network, shared across all timesteps

29
Learning the Denoising Model
• Denoising models are trained with the variational upper bound (negative ELBO), as VAEs are
• which decomposes into per-timestep terms
• with a tractable (closed-form) posterior distribution

30
Ho et al. Denoising Diffusion Probabilistic Models. 2020.
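The decomposition on the slide is the standard DDPM bound; written out, it is

$$L = \mathbb{E}_q\Big[ D_{\mathrm{KL}}\big(q(x_T \mid x_0) \,\|\, p(x_T)\big) + \sum_{t>1} D_{\mathrm{KL}}\big(q(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\big) - \log p_\theta(x_0 \mid x_1) \Big]$$

where the forward posterior q(x_{t-1} | x_t, x_0) is Gaussian with closed-form mean and variance.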
Learning the Denoising Model
• Denoising models are trained with the variational upper bound (negative ELBO), as VAEs are
• which decomposes into per-timestep terms (up to a constant term and a scaling factor)
• with a tractable (closed-form) posterior distribution

31
Ho et al. Denoising Diffusion Probabilistic Models. 2020.
Learning the Denoising Model
• Denoising models are trained with the variational upper bound (negative ELBO), as VAEs are
• which decomposes into per-timestep terms
• with a tractable (closed-form) posterior distribution

32
Ho et al. Denoising Diffusion Probabilistic Models. 2020.
Parameterizing the Denoising Model
• The KL divergence between Gaussians has a simple closed form
• Recall the closed-form forward sample of x_t from x_0
• A trainable network predicts the noise, from which the mean is derived
• Final objective

33
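Concretely, in the standard DDPM noise parameterization (not in the extracted text), the network ε_θ predicts the added noise and the mean is recovered as

$$\mu_\theta(x_t, t) = \frac{1}{\sqrt{\alpha_t}}\Big(x_t - \frac{\beta_t}{\sqrt{1-\bar{\alpha}_t}}\, \epsilon_\theta(x_t, t)\Big)$$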
Simplified Training Objective
• λ_t ensures the correct weighting for maximum likelihood estimation
• In DDPM, this is further simplified by setting the weight to 1

34
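The simplified objective from the DDPM paper, for reference:

$$L_{\text{simple}} = \mathbb{E}_{t,\, x_0,\, \epsilon}\Big[\big\| \epsilon - \epsilon_\theta\big(\sqrt{\bar{\alpha}_t}\, x_0 + \sqrt{1-\bar{\alpha}_t}\, \epsilon,\; t\big) \big\|^2\Big]$$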
Summary: Training and Sampling

35
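The slide's algorithm boxes (DDPM Algorithms 1 and 2) did not survive extraction; below is a minimal PyTorch sketch of both. The network `model(x, t)`, the data, and the linear β-schedule endpoints are assumptions, not taken from the slides.

```python
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)      # assumed linear beta schedule
alphas = 1.0 - betas
alpha_bars = torch.cumprod(alphas, dim=0)  # \bar{alpha}_t

def training_loss(model, x0):
    """Algorithm 1: sample a timestep, add noise in closed form, predict it."""
    t = torch.randint(0, T, (x0.shape[0],))
    eps = torch.randn_like(x0)
    ab = alpha_bars[t].view(-1, *([1] * (x0.dim() - 1)))
    x_t = ab.sqrt() * x0 + (1 - ab).sqrt() * eps   # closed-form forward sample
    return ((eps - model(x_t, t)) ** 2).mean()

@torch.no_grad()
def sample(model, shape):
    """Algorithm 2: ancestral sampling from pure noise, t = T ... 1."""
    x = torch.randn(shape)
    for t in reversed(range(T)):
        z = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
        eps = model(x, torch.full((shape[0],), t))
        x = (x - betas[t] / (1 - alpha_bars[t]).sqrt() * eps) / alphas[t].sqrt() \
            + betas[t].sqrt() * z
    return x
```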
Summary: Noise Schedule

36
Strümke et al. Lecture Notes in Probabilistic Diffusion Models. 2020.


Connection with Hierarchical VAEs
• Diffusion models are a special case of hierarchical VAEs
  – Fixed inference models in the forward process
  – Latent variables have the same dimension as the data
  – The ELBO decomposes across timesteps: faster to train
  – The model is trained with some weighting of the ELBO
(Figure: x ↔ z1 ↔ z2 ↔ z3, with decoders p(x|z1), p(z1|z2), p(z2|z3) and encoders q(z1|x), q(z2|z1), q(z3|z2))

37
Ho et al. Denoising Diffusion Probabilistic Models. 2020.
Poll

38
Content
• Diffusion Model Basics
  – Diffusion Models as Stacking VAEs
  – Diffusion Models: Forward, Reverse, Training, Sampling
• Diffusion Models from Stochastic Differential Equations and Score Matching Perspective
• Classifier-Free Guidance for Conditional Models
• Applications of Diffusion Models

39
Why SDEs?
• A unified framework for interpreting diffusion models and score-based generative models
  – Covers variants of diffusion-based and flow-based models

40
Stochastic Differential Equations

41
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
Stochastic Differential Equations

42
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
Score Matching
• General form of a probability density function: an unnormalized energy divided by a normalizing constant Z
• Maximizing the log-likelihood requires us to know Z
  – Often intractable
• Instead, we can model the score function, the gradient of the log-density

43
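In symbols (the standard energy-based formulation; the slide's exact notation is not in the extracted text):

$$p_\theta(x) = \frac{e^{-E_\theta(x)}}{Z_\theta}, \qquad s_\theta(x) := \nabla_x \log p_\theta(x) = -\nabla_x E_\theta(x)$$

The score is independent of Z_θ, which is why modeling it sidesteps the intractable normalizer.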
Forward Diffusion Process as SDEs
• Consider a forward process with many, many small steps (continuous time)
(Derivation on the slide: Taylor expansion of the discrete update)

44
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
Forward Diffusion Process as SDEs
• Consider a forward process with many, many small steps
(Derivation on the slide: Taylor expansion; allows a different step size along t)

45
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
Forward Diffusion Process as SDEs
• Consider a forward process with many, many small steps
(Derivation on the slide: Taylor expansion)

46
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
Forward Diffusion Process as SDEs
• An iterative update that can be viewed as a Stochastic Differential Equation (SDE)

47
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
Forward Diffusion Process as SDEs
(Equation annotations: the drift term pulls toward the mode; the diffusion term injects noise)

48
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
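The SDE itself is not in the extracted text; the general form, and the variance-preserving SDE corresponding to DDPM, are

$$dx = \underbrace{f(x, t)\, dt}_{\text{drift}} + \underbrace{g(t)\, dw}_{\text{diffusion}}, \qquad dx = -\tfrac{1}{2}\beta(t)\, x\, dt + \sqrt{\beta(t)}\, dw$$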
49
Figure credit to: https://yang-song.net/blog/2021/score/
Generative Reverse SDEs
• The forward SDE has a reverse form

50
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
51
Figure credit to: https://yang-song.net/blog/2021/score/
Generative Reverse SDEs
• The forward SDE has a reverse form
(Equation annotation: the reverse drift involves the score function; how do we estimate it?)

52
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
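For reference, the reverse-time SDE (Anderson, 1982; used by Song et al. for score-based generation) is

$$dx = \big[f(x, t) - g(t)^2\, \nabla_x \log p_t(x)\big]\, dt + g(t)\, d\bar{w}$$

where ∇_x log p_t(x) is the score function the annotation points at, and dw̄ runs backward in time.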
Denoising Score Matching

53
Figure credit to: https://yang-song.net/blog/2021/score/
Denoising Score Matching

54
Figure credit to: https://yang-song.net/blog/2021/score/
Denoising Score Matching
(Looks similar?)

55
Figure credit to: https://yang-song.net/blog/2021/score/
Denoising Score Matching
• Denoising score matching objective
• Re-parametrized sampling of x_t
• Score function of the forward Gaussian
• Denoising network
• Final objective

56
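The equations behind these bullets did not survive extraction; in standard DDPM notation they are

$$x_t = \sqrt{\bar{\alpha}_t}\, x_0 + \sqrt{1-\bar{\alpha}_t}\, \epsilon, \qquad \nabla_{x_t} \log q(x_t \mid x_0) = -\frac{\epsilon}{\sqrt{1-\bar{\alpha}_t}}, \qquad s_\theta(x_t, t) = -\frac{\epsilon_\theta(x_t, t)}{\sqrt{1-\bar{\alpha}_t}}$$

so matching the score is equivalent, up to scaling, to the noise-prediction objective $\|\epsilon - \epsilon_\theta(x_t, t)\|^2$.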
Weighted Diffusion Objective
• Denoising score matching objective with loss weighting λ_t
• Loss weights trade off between
  – good perceptual quality (e.g. the simplified uniform weighting)
  – maximum likelihood (the ELBO weighting)
• More complicated model parameterizations and loss weightings lead to different diffusion model variants in the literature!

57
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
Poll

58
Content
• Diffusion Model Basics
• Diffusion Models from Stochastic Differential Equations and Score Matching Perspective
• Denoising Diffusion Implicit Model (DDIM)
• Conditional Diffusion Models
• Applications of Diffusion Models

59
Many Steps in Diffusion
• Slow generation
• In training, we randomly sample one timestep
• But in inference, we must traverse from T down to 0
  – 1000 steps
  – extremely slow for raw images/signals

60
Can we do generation with fewer steps?

61
Slide credit to: https://cvpr2022-tutorial-diffusion-models.github.io/
DDPM

62
DDPM
• The sampling update only depends on the previous step
• x_0 is only used during training

63
DDIM
• A Non-Markovian Forward Process

64
Song et al. Denoising Diffusion Implicit Models. 2021.


DDIM
• Backward process

65
Song et al. Denoising Diffusion Implicit Models. 2021.
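The DDIM backward update is not in the extracted text; the form from Song et al. (2021) is

$$x_{t-1} = \sqrt{\bar{\alpha}_{t-1}}\, \underbrace{\frac{x_t - \sqrt{1-\bar{\alpha}_t}\, \epsilon_\theta(x_t, t)}{\sqrt{\bar{\alpha}_t}}}_{\text{predicted } x_0} + \sqrt{1-\bar{\alpha}_{t-1} - \sigma_t^2}\; \epsilon_\theta(x_t, t) + \sigma_t z_t$$

with σ_t = 0 giving the fully deterministic DDIM sampler.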


DDPM vs DDIM

66
DDIM with Fewer Steps Sampling

67
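The slide's sampler is not in the extracted text; here is a minimal PyTorch sketch of DDIM sampling on a sub-sampled timestep sequence, reusing the `alpha_bars` and `model` assumed in the earlier training/sampling sketch. `n_steps` and `eta` are illustrative parameters.

```python
import torch

@torch.no_grad()
def ddim_sample(model, shape, alpha_bars, n_steps=50, eta=0.0):
    """DDIM sampling over a sub-sequence of timesteps.

    eta = 0 gives the deterministic DDIM sampler; eta = 1 recovers
    DDPM-like stochasticity. Far fewer than T steps are needed.
    """
    T = alpha_bars.shape[0]
    taus = torch.linspace(T - 1, 0, n_steps).long()  # sub-sampled timesteps
    x = torch.randn(shape)
    for i in range(n_steps - 1):
        t, t_prev = taus[i], taus[i + 1]
        ab_t, ab_prev = alpha_bars[t], alpha_bars[t_prev]
        eps = model(x, torch.full((shape[0],), t.item()))
        x0_pred = (x - (1 - ab_t).sqrt() * eps) / ab_t.sqrt()  # predicted x_0
        sigma = eta * ((1 - ab_prev) / (1 - ab_t)).sqrt() * (1 - ab_t / ab_prev).sqrt()
        dir_xt = (1 - ab_prev - sigma**2).sqrt() * eps         # direction toward x_{t_prev}
        x = ab_prev.sqrt() * x0_pred + dir_xt + sigma * torch.randn_like(x)
    return x
```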
DDIM Results

68
Poll

69
Content
• Diffusion Model Basics
• Diffusion Models from Stochastic Differential Equations and Score Matching Perspective
• Denoising Diffusion Implicit Model (DDIM)
• Conditional Diffusion Models
• Applications of Diffusion Models

70
Conditional Diffusion Models
• Unconditional vs. conditional generation
• More controllable!

71
Conditional Score Matching
• Score matching with conditional information

72
Classifier Guidance
• Use a discriminative classifier to supply the conditional signal
• γ controls the strength of the condition
• Limitations:
  – Needs a separate classifier
  – Conditioning quality depends on the performance of the classifier

73
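The guidance rule itself is not in the extracted text; the standard classifier-guidance form (Dhariwal & Nichol, 2021) combines the unconditional score with the classifier gradient:

$$\nabla_x \log p_\gamma(x \mid c) = \nabla_x \log p(x) + \gamma\, \nabla_x \log p(c \mid x)$$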
Classifier-Free Guidance
• Score matching with conditional information
• Classifier-free guidance: combine conditional and unconditional predictions from one model

74
Ho et al. Classifier-Free Diffusion Guidance. 2022.
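For reference, the classifier-free guidance rule in noise-prediction form, with guidance weight γ and null condition ∅:

$$\tilde{\epsilon}_\theta(x_t, c) = \epsilon_\theta(x_t, \varnothing) + \gamma\,\big(\epsilon_\theta(x_t, c) - \epsilon_\theta(x_t, \varnothing)\big)$$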
Training of Classifier-Free Guidance
• For conditional embeddings
  – Randomly drop the original conditional with probability p, replacing it with an additional unconditional (null) class

75
Ho et al. Classifier-Free Diffusion Guidance. 2022.
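A minimal sketch of both pieces, assuming a conditional noise predictor `model(x, t, c)` over integer class labels and an extra `null_class` index reserved for the unconditional case (both assumptions, not from the slides):

```python
import torch

def drop_condition(c, null_class, p_drop=0.1):
    """Training-time: replace the condition with the null class w.p. p_drop."""
    drop = torch.rand(c.shape[0]) < p_drop
    return torch.where(drop, torch.full_like(c, null_class), c)

def guided_eps(model, x, t, c, null_class, gamma=3.0):
    """Sampling-time: eps_uncond + gamma * (eps_cond - eps_uncond)."""
    eps_cond = model(x, t, c)
    eps_uncond = model(x, t, torch.full_like(c, null_class))
    return eps_uncond + gamma * (eps_cond - eps_uncond)
```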
Content
• Diffusion Model Basics
• Diffusion Models from Stochastic Differential Equations and Score Matching Perspective
• Denoising Diffusion Implicit Model (DDIM)
• Conditional Diffusion Models
• Applications of Diffusion Models

76
DDPM
• Training diffusion models on raw images with a U-Net model

77
Ho et al. Denoising Diffusion Probabilistic Models. 2020.


Diffusion Models Beat GANs
• Larger denoising model with a sophisticated design
  – Adaptive group normalization
  – Attention layers in the U-Net

78
Dhariwal et al. Diffusion Models Beat GANs on Image Synthesis. 2021.
Latent Diffusion Models (LDMs)
• Learn diffusion in a VAE's latent space
  – Yet another VAE! Except pre-trained.

79
Rombach et al. High-Resolution Image Synthesis with Latent Diffusion Models. 2022.
Stable Diffusion
• Large-scale text-conditional LDMs
  – With VAEs also trained on larger datasets

80
Stability AI. https://github.com/Stability-AI/stablediffusion


DALL-E

81
Ramesh et al. Hierarchical Text-Conditional Image Generation with CLIP Latents. 2022.
DiT
• A transformer architecture for diffusion models

82
Peebles et al. Scalable Diffusion Models with Transformers. 2022.
MAR
• An autoregressive model with a diffusion loss

83
Li et al. Autoregressive Image Generation without Vector Quantization. 2024.
