Lecture 4: Diffusion Models, Part I


CAP6412

Advanced Computer Vision


Mubarak Shah
HEC-245
Lecture-4: Diffusion Models



Diffusion models in vision: A survey
https://arxiv.org/pdf/2209.04747.pdf

Alin Croitoru (University of Bucharest, Romania), Vlad Hondru (University of Bucharest, Romania), Radu Tudor Ionescu (University of Bucharest, Romania), Mubarak Shah (University of Central Florida, US)
Agenda

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Applications
8. Research directions
Outline

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Applications
8. Research directions
Motivation

[Figures: example text-to-image generations for the prompts below.]

• "A hedgehog using a calculator."
• "A corgi wearing a red bowtie and a purple party hat."
• "A transparent sculpture of a duck made out of glass."
• "A photo of a Corgi dog riding a bike in Times Square. It is wearing sunglasses and a beach hat."
• "Pomeranian king with tiger soldiers."
• "Zebras roaming in the field."
Outline

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Research directions
High-level overview
• Diffusion models are probabilistic models used for image generation.
• They generate images by reversing a process that gradually degrades the data.
• They consist of two processes:
  • The forward process: data is progressively destroyed by adding noise over multiple time steps.
  • The reverse process: a neural network sequentially removes the noise to recover the original data.

[Figure: the forward process maps the data distribution to a standard Gaussian; the reverse process maps it back to the data distribution.]
High-level overview

• Three categories:
  • Denoising Diffusion Probabilistic Models (DDPM)
  • Noise Conditioned Score Networks (NCSN)
  • Stochastic Differential Equations (SDE)


Outline

1. Motivation
2. High-level overview
3. Denoising diffusion probabilistic models
4. Noise Conditioned Score Network
5. Conditional Generation
6. Stochastic Differential Equations
7. Research directions
Notations
$p(x)$ – the data distribution.

$\mathcal{N}(x;\ \mu,\ \sigma^2 I)$ – a Gaussian distribution over the random variable $x$ (an image), with mean vector $\mu$ and covariance matrix $\sigma^2 I$, where $I$ is the identity matrix.

A sample from this distribution can be drawn as

$x = \mu + \sigma \cdot z, \quad z \sim \mathcal{N}(0, I)$
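As a concrete illustration of the notation above, here is a minimal sketch of drawing a sample via $x = \mu + \sigma \cdot z$, assuming PyTorch; the function name `sample_gaussian` and the tensor shapes are purely illustrative:

```python
# Minimal sketch (assumes PyTorch): draw a sample from N(mu, sigma^2 * I)
# via the reparameterization x = mu + sigma * z, with z ~ N(0, I).
import torch

def sample_gaussian(mu: torch.Tensor, sigma: float) -> torch.Tensor:
    """Return one sample from N(mu, sigma^2 * I), same shape as mu."""
    z = torch.randn_like(mu)      # z ~ N(0, I)
    return mu + sigma * z         # x = mu + sigma * z

# Example: a "mean image" of shape (3, 64, 64) with isotropic noise.
mu = torch.zeros(3, 64, 64)
x = sample_gaussian(mu, sigma=0.5)
```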


Denoising Diffusion Probabilistic Models (DDPMs)

Forward process:

$x_0 \sim p(x_0) \ \rightarrow\ x_1 \ \rightarrow\ \cdots \ \rightarrow\ x_{T-1} \ \rightarrow\ x_T \sim \mathcal{N}(0, I)$
Denoising Diffusion Probabilistic Models (DDPMs)

Reverse process:

$x_T \sim \mathcal{N}(0, I) \ \rightarrow\ x_{T-1} \ \rightarrow\ \cdots \ \rightarrow\ x_1 \ \rightarrow\ x_0 \sim p(x_0)$


Denoising Diffusion Probabilistic Models (DDPMs)

Forward process (iterative). Step by step, the image is gradually replaced with noise:

$x_t \sim p(x_t \mid x_{t-1}) = \mathcal{N}\!\left(x_t;\ \sqrt{1-\beta_t}\, x_{t-1},\ \beta_t I\right), \quad \beta_t \ll 1,\ t \in \{1,\dots,T\}$

$x_0 \ \rightarrow\ x_1 \ \rightarrow\ \cdots \ \rightarrow\ x_{T-1} \ \rightarrow\ x_T$
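A minimal sketch of one iterative noising step, assuming PyTorch; the constant $\beta$ used here (instead of a schedule $\beta_t$) and the image shape are illustrative:

```python
# Minimal sketch (assumes PyTorch): one forward step
# x_t ~ N(x_t; sqrt(1 - beta_t) * x_{t-1}, beta_t * I).
import torch

def forward_step(x_prev: torch.Tensor, beta_t: float) -> torch.Tensor:
    """Sample x_t given x_{t-1} for a single diffusion step."""
    z = torch.randn_like(x_prev)                               # z ~ N(0, I)
    return (1.0 - beta_t) ** 0.5 * x_prev + beta_t ** 0.5 * z

# Example: run the chain x_0 -> x_T with a small, constant beta.
x = torch.rand(3, 32, 32)        # stand-in for an image x_0 ~ p(x_0)
T, beta = 1000, 0.01
for t in range(1, T + 1):
    x = forward_step(x, beta)    # after many steps, x is close to N(0, I)
```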
Denoising Diffusion Probabilistic Models (DDPMs)

Forward process, ancestral sampling (one shot). $x_t$ can be sampled directly from $x_0$:

$x_t \sim p(x_t \mid x_0) = \mathcal{N}\!\left(x_t;\ \sqrt{\hat\beta_t}\, x_0,\ (1-\hat\beta_t)\, I\right)$

Notations: $\hat\beta_t = \prod_{i=1}^{t} \alpha_i$, $\quad \alpha_t = 1 - \beta_t$

$x_0 \ \rightarrow\ x_1 \ \rightarrow\ \cdots \ \rightarrow\ x_{T-1} \ \rightarrow\ x_T$
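A minimal sketch of the one-shot formula, assuming PyTorch and a linear $\beta$ schedule (the schedule values are an assumption for illustration, not something specified on the slide):

```python
# Minimal sketch (assumes PyTorch): sample x_t directly from x_0 via
# x_t = sqrt(beta_hat_t) * x_0 + sqrt(1 - beta_hat_t) * z,  z ~ N(0, I),
# where beta_hat_t = prod_{i<=t} alpha_i and alpha_i = 1 - beta_i.
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)      # assumed linear beta schedule
alphas = 1.0 - betas
beta_hat = torch.cumprod(alphas, dim=0)    # beta_hat[t-1] = prod_{i=1..t} alpha_i

def sample_xt(x0: torch.Tensor, t: int) -> torch.Tensor:
    """Sample x_t ~ N(sqrt(beta_hat_t) * x_0, (1 - beta_hat_t) * I) in one shot."""
    z = torch.randn_like(x0)
    bh = beta_hat[t - 1]
    return bh.sqrt() * x0 + (1.0 - bh).sqrt() * z

# Example: jump straight to t = 500 without iterating.
x0 = torch.rand(3, 32, 32)
x500 = sample_xt(x0, t=500)
```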
DDPMs. Properties of the forward process

1. $\beta_t \ll 1$, $t \in \{1,\dots,T\}$

$x_t \sim p(x_t \mid x_{t-1}) = \mathcal{N}\!\left(x_t;\ \sqrt{1-\beta_t}\, x_{t-1},\ \beta_t I\right)$

$x_t$ is created from $x_{t-1}$ by a small step, whose size is controlled by $\beta_t$. Because $x_t$ comes from a region close to $x_{t-1}$, the reverse transition can also be modeled with a Gaussian:

$x_{t-1} \sim p(x_{t-1} \mid x_t) = \mathcal{N}\!\left(x_{t-1};\ \mu(x_t, t),\ \Sigma(x_t, t)\right)$
DDPMs. Properties of the forward process

1. $\beta_t \ll 1$, $t \in \{1,\dots,T\}$

$x_t \sim p(x_t \mid x_{t-1}) = \mathcal{N}\!\left(x_t;\ \sqrt{1-\beta_t}\, x_{t-1},\ \beta_t I\right)$

Going backwards, however, we are less certain where $x_{t-1}$ was, because $x_t$ could have been reached from many more regions.
DDPMs. Properties of the forward process

1. $\beta_t \ll 1$, $t \in \{1,\dots,T\}$ $\ \Rightarrow\ $ 2. $T$ must be large

After $T$ iterations of small noising steps, $x_T$ is pure noise: $x_0 \rightarrow \cdots \rightarrow x_T$.
DDPMs. Training objective
Remember that:

Reverse process: $x_0 \ \leftarrow\ x_1 \ \leftarrow\ \cdots \ \leftarrow\ x_{T-1} \ \leftarrow\ x_T$

$p(x_{t-1} \mid x_t) \approx p_\theta(x_{t-1} \mid x_t) = \mathcal{N}\!\left(x_{t-1};\ \mu_\theta(x_t, t),\ \Sigma_\theta(x_t, t)\right)$

The true reverse transition is approximated by a neural network with weights $\theta$.
DDPMs. Training objective
Simplification:

Reverse process: $x_0 \ \leftarrow\ x_1 \ \leftarrow\ \cdots \ \leftarrow\ x_{T-1} \ \leftarrow\ x_T$

$p(x_{t-1} \mid x_t) \approx p_\theta(x_{t-1} \mid x_t) = \mathcal{N}\!\left(x_{t-1};\ \mu_\theta(x_t, t),\ \sigma_t^2 I\right)$

Instead of learning the variance, we fix it to $\sigma_t^2 I$ and let the neural network (with weights $\theta$) predict/learn only the mean.
DDPMs. Training objective
UNet-like neural network

A U-Net takes the noisy image $x_t$ and the time step $t$ as input and predicts $\mu_\theta(x_t, t)$; the next (less noisy) image is then sampled as

$x_{t-1} \sim \mathcal{N}\!\left(x_{t-1};\ \mu_\theta(x_t, t),\ \sigma_t^2 I\right)$
Slide from: Denoising Diffusion-based Generative Modeling: Foundations and Applications (Karsten Kreis, Ruiqi Gao, Arash Vahdat)
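The slide above uses a full U-Net. As a much smaller stand-in, here is a minimal sketch of a time-conditioned convolutional denoiser, assuming PyTorch; the class name `TinyDenoiser`, the layer sizes, and the learned time embedding are illustrative choices, not the architecture used in the referenced work:

```python
# Minimal sketch (assumes PyTorch): a small time-conditioned convolutional
# denoiser standing in for the U-Net on the slide. It maps a noisy image x_t
# and a time step t to an output with the same shape as x_t (the predicted
# noise z_theta(x_t, t), or equivalently a mean).
import torch
import torch.nn as nn

class TinyDenoiser(nn.Module):
    def __init__(self, channels: int = 3, hidden: int = 64, T: int = 1000):
        super().__init__()
        self.time_embed = nn.Embedding(T + 1, hidden)    # learned embedding of the time step
        self.conv_in = nn.Conv2d(channels, hidden, 3, padding=1)
        self.conv_mid = nn.Conv2d(hidden, hidden, 3, padding=1)
        self.conv_out = nn.Conv2d(hidden, channels, 3, padding=1)

    def forward(self, x_t: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
        h = torch.relu(self.conv_in(x_t))
        h = h + self.time_embed(t)[:, :, None, None]      # broadcast over spatial dims
        h = torch.relu(self.conv_mid(h))
        return self.conv_out(h)

# Example forward pass.
net = TinyDenoiser()
x_t = torch.randn(8, 3, 32, 32)
t = torch.randint(1, 1001, (8,))
out = net(x_t, t)          # same shape as x_t: (8, 3, 32, 32)
```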
DDPMs. Training Objective

Cross Entropy and KL (Kullback–Leibler) divergence

• Entropy: $E(P) = -\sum_i P(i) \log P(i)$
• Cross Entropy: $C(P, Q) = -\sum_i P(i) \log Q(i)$
• KL divergence: $D_{KL}(P \,\|\, Q) = \sum_i P(i) \log \frac{P(i)}{Q(i)} = \sum_i P(i)\left[\log P(i) - \log Q(i)\right]$

Slides from Ming Li, University of Waterloo, CS 886 Deep Learning and NLP
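To make the definitions concrete, here is a minimal sketch, assuming PyTorch, that evaluates the three quantities for two small discrete distributions (the distributions themselves are arbitrary examples):

```python
# Minimal sketch (assumes PyTorch): entropy, cross entropy, and KL divergence
# for two discrete distributions P and Q, matching the formulas above.
import torch

P = torch.tensor([0.1, 0.4, 0.5])
Q = torch.tensor([0.2, 0.3, 0.5])

entropy = -(P * P.log()).sum()           # E(P)       = -sum_i P(i) log P(i)
cross_entropy = -(P * Q.log()).sum()     # C(P, Q)    = -sum_i P(i) log Q(i)
kl = (P * (P.log() - Q.log())).sum()     # D_KL(P||Q) = sum_i P(i)[log P(i) - log Q(i)]

# D_KL(P || Q) = C(P, Q) - E(P)
assert torch.allclose(kl, cross_entropy - entropy)
```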
DDPMs. Training Objective

$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0)}\left[-\log p_\theta(x_0 \mid x_1) + KL\!\left(p(x_T \mid x_0) \,\|\, p(x_T)\right) + \sum_{t>1} KL\!\left(p(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\right)\right]$

The middle term can be ignored because $p(x_T)$ is $\mathcal{N}(0, I)$ and does not depend on $\theta$. The last term requires that, at each time step $t$, $p_\theta(x_{t-1} \mid x_t)$ be as close as possible to the true posterior of the forward process conditioned on the original image.
DDPMs. Training Objective. Simplifications
$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0)}\left[-\log p_\theta(x_0 \mid x_1) + \sum_{t>1} KL\!\left(p(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\right)\right]$

• For two Gaussians with the same fixed variance, the KL divergence reduces to a (scaled) L2 distance between their means (see the check below).
• The first term measures the reconstruction error and can be addressed with an independent decoder.
• The DDPM paper introduced two simplifications that lead to a much simpler objective, based on the noise in the image.
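A minimal sketch, assuming PyTorch, checking the first bullet for two Gaussians with the same fixed variance: the KL divergence equals $\|\mu_1 - \mu_2\|^2 / (2\sigma^2)$, i.e. a scaled L2 distance between the means (the dimensions and values are arbitrary):

```python
# Minimal sketch (assumes PyTorch): for p = N(mu1, sigma^2 I) and q = N(mu2, sigma^2 I),
# KL(p || q) = ||mu1 - mu2||^2 / (2 * sigma^2).
import torch

mu1, mu2, sigma = torch.randn(5), torch.randn(5), 0.7

p = torch.distributions.Normal(mu1, sigma)
q = torch.distributions.Normal(mu2, sigma)
kl = torch.distributions.kl_divergence(p, q).sum()       # sum over independent dims

closed_form = ((mu1 - mu2) ** 2).sum() / (2 * sigma ** 2)
assert torch.allclose(kl, closed_form)
```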
DDPMs. Training Objective. Simplifications
$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0)}\left[-\log p_\theta(x_0 \mid x_1) + \sum_{t>1} KL\!\left(p(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\right)\right]$

Tractable posterior:

$p(x_{t-1} \mid x_t, x_0) = \mathcal{N}\!\left(x_{t-1};\ \tilde\mu(x_t, x_0),\ \tilde\beta_t I\right)$

$\tilde\mu(x_t, x_0) = \frac{1}{\sqrt{\alpha_t}}\left(x_t - \frac{1-\alpha_t}{\sqrt{1-\hat\beta_t}}\, z_t\right), \quad z_t \sim \mathcal{N}(0, I)$

Notations: $\hat\beta_t = \prod_{i=1}^{t}\alpha_i$, $\quad \alpha_t = 1 - \beta_t$, $\quad \tilde\beta_t = \frac{1-\hat\beta_{t-1}}{1-\hat\beta_t}\,\beta_t$
DDPMs. Training Objective. Simplifications
$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0)}\left[-\log p_\theta(x_0 \mid x_1) + \sum_{t>1} KL\!\left(p(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\right)\right]$

The mean of $p_\theta(x_{t-1} \mid x_t)$ is parameterized in the same form as the tractable posterior mean above, with a network $z_\theta$ that predicts the noise:

$\mu_\theta(x_t, t) = \frac{1}{\sqrt{\alpha_t}}\left(x_t - \frac{1-\alpha_t}{\sqrt{1-\hat\beta_t}}\, z_\theta(x_t, t)\right)$

$\Rightarrow\ KL\!\left(p(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\right) = \mathbb{E}_{z \sim \mathcal{N}(0, I)}\left[\frac{\beta_t^2}{2\sigma_t^2\, \alpha_t\, (1-\hat\beta_t)}\, \big\|z - z_\theta(x_t, t)\big\|^2\right]$
DDPMs. Training Objective. Simplifications
$\min_\theta\ \mathbb{E}_{x_0 \sim p(x_0)}\left[-\log p_\theta(x_0 \mid x_1) + \sum_{t>1} KL\!\left(p(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\right)\right]$

$KL\!\left(p(x_{t-1} \mid x_t, x_0) \,\|\, p_\theta(x_{t-1} \mid x_t)\right) = \mathbb{E}_{z \sim \mathcal{N}(0, I)}\left[\frac{\beta_t^2}{2\sigma_t^2\, \alpha_t\, (1-\hat\beta_t)}\, \big\|z - z_\theta(x_t, t)\big\|^2\right]$

The weighting factor $\frac{\beta_t^2}{2\sigma_t^2\, \alpha_t\, (1-\hat\beta_t)}$ is ignored, so each KL term reduces to a plain noise-matching loss $\big\|z - z_\theta(x_t, t)\big\|^2$.
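A minimal sketch, assuming PyTorch, of how the quantities above can be computed for a concrete schedule; the linear $\beta$ schedule and the choice $\sigma_t^2 = \beta_t$ are common conventions assumed here for illustration, not taken from the slide:

```python
# Minimal sketch (assumes PyTorch): compute alpha_t, beta_hat_t, the posterior
# variance beta_tilde_t, and the KL weight beta_t^2 / (2 sigma_t^2 alpha_t (1 - beta_hat_t)).
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)                      # assumed linear schedule: beta_1..beta_T
alphas = 1.0 - betas                                       # alpha_t = 1 - beta_t
beta_hat = torch.cumprod(alphas, dim=0)                    # beta_hat_t = prod_{i<=t} alpha_i
beta_hat_prev = torch.cat([torch.ones(1), beta_hat[:-1]])  # beta_hat_{t-1}, with beta_hat_0 = 1

beta_tilde = (1.0 - beta_hat_prev) / (1.0 - beta_hat) * betas   # posterior variance
sigma2 = betas                                             # common choice: sigma_t^2 = beta_t
kl_weight = betas ** 2 / (2.0 * sigma2 * alphas * (1.0 - beta_hat))

# The simplified DDPM objective drops kl_weight and trains on ||z - z_theta(x_t, t)||^2 alone.
```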
DDPMs. Training Algorithm

$\min_\theta\ \frac{1}{T}\sum_{t=1}^{T}\ \mathbb{E}_{x_0 \sim p(x_0),\ z \sim \mathcal{N}(0, I)}\,\big\|z - z_\theta(x_t, t)\big\|^2$

Training algorithm:

Repeat
  $x_0 \sim p(x_0)$
  $t \sim \mathcal{U}\{1, \dots, T\}$
  $z \sim \mathcal{N}(0, I)$
  $x_t = \sqrt{\hat\beta_t}\, x_0 + \sqrt{1-\hat\beta_t}\, z$  (with $\hat\beta_t = \prod_{i=1}^{t}\alpha_i$)
  $\theta = \theta - lr \cdot \nabla_\theta\, \big\|z - z_\theta(x_t, t)\big\|^2$
Until convergence
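A minimal sketch of the training loop above, assuming PyTorch; `model` is any network predicting the noise $z_\theta(x_t, t)$ (e.g. the tiny denoiser sketched earlier), and the data loader, schedule, and hyperparameters are illustrative assumptions:

```python
# Minimal sketch (assumes PyTorch) of the DDPM training loop above.
import torch

def train(model, data_loader, T=1000, lr=1e-4, epochs=10, device="cpu"):
    betas = torch.linspace(1e-4, 0.02, T, device=device)      # assumed schedule
    beta_hat = torch.cumprod(1.0 - betas, dim=0)               # beta_hat_t = prod alpha_i
    opt = torch.optim.SGD(model.parameters(), lr=lr)           # plain step: theta <- theta - lr * grad

    for _ in range(epochs):                                    # "Repeat ... Until convergence"
        for x0 in data_loader:                                 # x_0 ~ p(x_0)
            x0 = x0.to(device)
            t = torch.randint(1, T + 1, (x0.shape[0],), device=device)  # t ~ U{1,...,T}
            z = torch.randn_like(x0)                           # z ~ N(0, I)
            bh = beta_hat[t - 1].view(-1, 1, 1, 1)
            x_t = bh.sqrt() * x0 + (1.0 - bh).sqrt() * z       # one-shot forward sample
            loss = ((z - model(x_t, t)) ** 2).mean()           # ||z - z_theta(x_t, t)||^2
            opt.zero_grad()
            loss.backward()
            opt.step()
```

In practice an adaptive optimizer such as Adam is commonly used instead of the plain gradient step written on the slide.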
DDPMs. Sampling

• Pass the current noisy image $x_t$, along with the time step $t$, to the neural network to obtain $z_\theta(x_t, t)$.
• From this output, compute the mean $\mu_\theta(x_t, t)$ of the Gaussian distribution over $x_{t-1}$.

DDPMs. Sampling

• Sample the image for the next iteration:

$x_{t-1} \sim \mathcal{N}\!\left(x_{t-1};\ \mu_\theta(x_t, t),\ \sigma_t^2 I\right), \quad \mu_\theta(x_t, t) = \frac{1}{\sqrt{\alpha_t}}\left(x_t - \frac{1-\alpha_t}{\sqrt{1-\hat\beta_t}}\, z_\theta(x_t, t)\right)$
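A minimal sketch of the sampling loop, assuming PyTorch; `model` predicts the noise $z_\theta(x_t, t)$, and the schedule, image shape, and the choice $\sigma_t^2 = \beta_t$ are illustrative assumptions:

```python
# Minimal sketch (assumes PyTorch): DDPM sampling, starting from x_T ~ N(0, I)
# and repeatedly drawing x_{t-1} ~ N(mu_theta(x_t, t), sigma_t^2 * I).
import torch

@torch.no_grad()
def sample(model, shape=(1, 3, 32, 32), T=1000, device="cpu"):
    betas = torch.linspace(1e-4, 0.02, T, device=device)      # assumed schedule
    alphas = 1.0 - betas
    beta_hat = torch.cumprod(alphas, dim=0)

    x = torch.randn(shape, device=device)                      # x_T ~ N(0, I)
    for t in range(T, 0, -1):
        t_batch = torch.full((shape[0],), t, dtype=torch.long, device=device)
        z_pred = model(x, t_batch)                              # z_theta(x_t, t)
        mean = (x - (1.0 - alphas[t - 1]) / (1.0 - beta_hat[t - 1]).sqrt() * z_pred) \
               / alphas[t - 1].sqrt()                           # mu_theta(x_t, t)
        noise = torch.randn_like(x) if t > 1 else torch.zeros_like(x)  # no noise at the last step
        x = mean + betas[t - 1].sqrt() * noise                  # sigma_t^2 = beta_t (a common choice)
    return x                                                    # approximate sample from p(x_0)
```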
Thank You
