Lec1 Intro

The lecture introduces deep generative models, emphasizing their applications in various fields such as natural language processing, image and video generation, and protein design. It contrasts generative models with discriminative models, highlighting their ability to produce multiple plausible outputs from a single input. The course will explore the formulation of real-world problems as generative models, along with their probabilistic foundations and associated challenges.


Lecture 1

Introduction

6.S978 Deep Generative Models

Kaiming He
Fall 2024, EECS, MIT
The “GenAI” Era
• Chatbot and natural language conversation
• Text-to-image generation (example generated by Stable Diffusion 3 Medium; prompt: teddy bear teaching a course, with "generative models" written on blackboard)
• Text-to-video generation (example generated by Sora)
• AI assistant for code generation
• Protein design and generation (Watson, et al. De novo design of protein structure and function with RFdiffusion, Nature 2023)
• Weather forecasting (Skilful precipitation nowcasting using deep generative models of radar, Nature 2021)
Generative Models before the “GenAI” Era
2009, PatchMatch: Photoshop’s Content-aware Fill

PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing, SIGGRAPH 2009
Generative Models before the “GenAI” Era
1999, the Efros-Leung algorithm for texture synthesis
In today’s terms, this is an autoregressive model.

Texture Synthesis by Non-parametric Sampling, ICCV 1999
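To make the autoregressive reading concrete, here is a minimal 1-D sketch in the spirit of Efros-Leung (my own simplification, not the paper's 2-D algorithm; the function name and parameters are made up): each new value is sampled by matching its recent context against the source "texture" and copying the continuation of a randomly chosen close match.

```python
# A 1-D, non-parametric "texture synthesis" sketch: sample each new element
# conditioned on the previous `window` elements, by matching against the source.
import numpy as np

def synthesize(source, length, window=4, n_candidates=5, seed=None):
    rng = np.random.default_rng(seed)
    out = list(source[:window])                           # seed with the start of the source
    windows = np.lib.stride_tricks.sliding_window_view(source, window + 1)
    for _ in range(length - window):
        context = np.array(out[-window:])
        # distance between the current context and every context in the source
        dists = np.sum((windows[:, :window] - context) ** 2, axis=1)
        best = np.argsort(dists)[:n_candidates]           # keep the closest matches
        pick = rng.choice(best)                           # sample one match at random
        out.append(windows[pick, -1])                     # copy its next value
    return np.array(out)

# usage: grow a longer sequence from a short repeating "texture"
src = np.array([0, 1, 2, 3, 2, 1] * 5, dtype=float)
print(synthesize(src, length=40, window=3, seed=0))
```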


What are Generative Models?
What do these scenarios have in common? (Examples: chatbot, image generation, video generation, protein generation.)
• There are multiple or infinite predictions to one input.
• Some predictions are more “plausible” than others.
• Training data may contain no exact solution.
• Predictions may be more complex, more informative, and higher-dimensional than the input.
Discriminative vs. Generative models
Discriminative model:
• “sample” x ⇨ “label” y (e.g., an image x ⇨ “dog”)
• one desired output

Generative model:
• “label” y ⇨ “sample” x (e.g., “dog” ⇨ an image x)
• many possible outputs
Discriminative vs. Generative models
discriminative: p(y|x)        generative: p(x|y)

• Generative models can be discriminative, via Bayes’ rule:

  p(y|x) = p(x|y) p(y) / p(x) ∝ p(x|y) p(y)

  assuming a known prior p(y); p(x) is a constant for a given x.

• Can discriminative models be generative? By the same rule:

  p(x|y) = p(y|x) p(x) / p(y) ∝ p(y|x) p(x)

  but we still need to model the prior distribution of x; p(y) is a constant for a given y.
• The challenge is about representing and predicting distributions
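A minimal numerical sketch of using a generative model discriminatively via Bayes’ rule, assuming a toy setup with two classes and 1-D Gaussian class-conditionals (all numbers here are made up for illustration):

```python
import numpy as np
from scipy.stats import norm

prior = {"cat": 0.7, "dog": 0.3}                              # assumed known prior p(y)
likelihood = {"cat": norm(0.0, 1.0), "dog": norm(2.0, 1.0)}   # generative model p(x|y)

def posterior(x):
    # p(y|x) ∝ p(x|y) p(y); the normalizer p(x) is constant for a given x
    unnorm = {y: likelihood[y].pdf(x) * prior[y] for y in prior}
    z = sum(unnorm.values())
    return {y: v / z for y, v in unnorm.items()}

print(posterior(1.5))   # posterior class probabilities for an observed x
```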


Probabilistic modeling
• Where does probability come from?
• Assume an underlying distribution of the data generation process.
  Example:
  • latent factors z (pose, lighting, scale, ...)
  • z has simple distributions
  • observations x are rendered by a “world model” that is a function of z
  • observations x have complex distributions

• Probability is part of the modeling.


Figure from: W. T. Freeman, J. B. Tenenbaum, “Learning Bilinear Models for Two-Factor Problems in Vision”, 1996
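A toy sketch of this “world model” view (my own illustrative example, not from the lecture): latent factors with simple distributions are rendered into observations whose distribution becomes complex, even though the renderer itself is deterministic.

```python
import numpy as np

rng = np.random.default_rng(0)

def render(pose, scale):
    # a made-up "world model": a 2-D observation computed from the latent factors
    return np.array([scale * np.cos(pose), scale * np.sin(pose) ** 2])

# simple latent distributions: uniform pose, log-normal scale
pose = rng.uniform(0.0, 2 * np.pi, size=1000)
scale = rng.lognormal(mean=0.0, sigma=0.25, size=1000)
x = np.stack([render(p, s) for p, s in zip(pose, scale)])
print(x.mean(axis=0), x.std(axis=0))   # the observations x follow a non-trivial distribution
```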
Probability is part of the modeling
• There may not be “underlying” distributions.
• Even if there are, what we can observe is a finite set of data points.
• The models extrapolate from the observations to model distributions.

• Overfitting vs. underfitting: like discriminative models


[Figure: overfit, “right” fit, and underfit curves, for discriminative models]

Figure credit: https://www.mathworks.com/discovery/overfitting.html


Probability is part of the modeling
[Figures: density estimates p over data points x, illustrating underfit and overfit estimates.]
• To the extreme, using delta functions is like sampling from the training data.
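A small sketch contrasting the extreme “delta function” model (which can only replay the training points) with a smoothed density model (a Gaussian kernel density estimate; the data and bandwidth are arbitrary illustrations):

```python
import numpy as np

rng = np.random.default_rng(0)
train = rng.normal(0.0, 1.0, size=20)          # a finite set of observed data points

def sample_delta_model(n):
    # extreme overfitting: the "model" just resamples the training data
    return rng.choice(train, size=n, replace=True)

def sample_kde_model(n, bandwidth=0.3):
    # a smoothed model: pick a training point, then perturb it (a Gaussian KDE)
    centers = rng.choice(train, size=n, replace=True)
    return centers + bandwidth * rng.normal(size=n)

print(np.unique(sample_delta_model(5)))        # only values already seen in training
print(sample_kde_model(5))                     # new values near the training data
```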
Generative models w/ probabilistic modeling
• Observed data are assumed to come from an underlying distribution of data; this assumption is already part of the modeling.
• Optimize a loss function so that an estimated distribution of data approximates the true one.
• With the estimated distribution, we can:
  • sample new “data”
  • estimate the probability density of a given x (how likely is it under the estimated distribution?)
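A minimal sketch of this pipeline with the simplest possible model family (a single 1-D Gaussian; the data and the model choice are purely illustrative): estimate the distribution from data, sample new “data”, and evaluate the density at a query point.

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)
data = rng.normal(3.0, 0.5, size=500)             # observed data

# "optimize a loss function": for a Gaussian, maximum likelihood has a closed form
mu, sigma = data.mean(), data.std()
model = norm(mu, sigma)                           # estimated distribution of data

new_samples = model.rvs(size=5, random_state=0)   # sample new "data"
density_at_query = model.pdf(3.2)                 # estimate prob density at a query x
print(new_samples, density_at_query)
```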
Generative models w/ probabilistic modeling
Notes:
• Generative models involve statistical models which are often designed and
derived by humans.
• Probabilistic modeling is not just the work of neural nets.
• Probabilistic modeling is a popular way, but not the only way.
• "All models are wrong, but some are useful.” - George Box
What are Deep Generative Models?
Deep Generative Models
• Deep learning is representation learning
• Learning to represent data instances
  • map data to a feature: h = f(x)
  • minimize a loss w/ a target: L(h, target)

• Learning to represent probability distributions
  • map a simple distribution (Gaussian/uniform) to a complex one: x = g(z), z ~ simple
  • minimize a loss w/ the data distribution: L(p_model, p_data)

• Often perform both together
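A minimal sketch of “mapping a simple distribution to a complex one” with a neural net (untrained here; the architecture and sizes are arbitrary illustrations, not the course’s model):

```python
import torch
from torch import nn

generator = nn.Sequential(            # g_theta: maps a latent z to a sample x
    nn.Linear(16, 128), nn.ReLU(),
    nn.Linear(128, 2),                # e.g., 2-D "data" samples
)

z = torch.randn(1000, 16)             # z ~ simple distribution (Gaussian)
x = generator(z)                      # pushforward of the simple distribution
# training would then minimize a loss between the distribution of x and the data
# distribution (e.g., a likelihood-based, adversarial, or diffusion objective).
print(x.shape)
```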


Learning to represent probability distributions
• From simple to complex distributions: map a simple distribution into one that approximates the data distribution.
• Not all parts of distribution modeling are done by learning.

Case study: Autoregressive model
• The dependency graph (each element conditioned on the previous ones) is designed, not learned.
• The mapping function is learned (e.g., a Transformer).
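A minimal sketch of the autoregressive decomposition p(x) = ∏_i p(x_i | x_1, ..., x_{i-1}): the left-to-right chain is the designed part, and only the per-step predictor is learned. Here `next_token_distribution` is a placeholder standing in for a learned model such as a Transformer, not a real API.

```python
import numpy as np

VOCAB = 10
rng = np.random.default_rng(0)

def next_token_distribution(prefix):
    # placeholder for a learned network mapping a prefix x_<i to p(x_i | x_<i)
    logits = rng.normal(size=VOCAB)
    return np.exp(logits) / np.exp(logits).sum()

def sample_sequence(length):
    seq = []
    for _ in range(length):                       # the designed left-to-right dependency graph
        probs = next_token_distribution(seq)
        seq.append(int(rng.choice(VOCAB, p=probs)))
    return seq

print(sample_sequence(8))
```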
Case study: Diffusion model
• The dependency graph (the noising and denoising chain) is designed, not learned.
• The mapping function (the denoiser) is learned (e.g., a UNet).
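A minimal sketch of that split for a DDPM-style diffusion model: a fixed forward noising chain and a learned denoiser. The noise schedule and the tiny `denoiser` network below are placeholders for illustration, not the actual model from the lecture.

```python
import torch
from torch import nn

T = 100
betas = torch.linspace(1e-4, 0.02, T)            # a designed (not learned) noise schedule

def noising_step(x_prev, t):
    # forward process: x_t = sqrt(1 - beta_t) * x_{t-1} + sqrt(beta_t) * eps
    return torch.sqrt(1 - betas[t]) * x_prev + torch.sqrt(betas[t]) * torch.randn_like(x_prev)

denoiser = nn.Sequential(                        # stands in for a learned UNet
    nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, 2)
)

x = torch.randn(4, 2)                            # toy 2-D "data"
for t in range(T):
    x = noising_step(x, t)                       # designed noising chain
x_denoised = denoiser(x)                         # learned mapping (untrained here)
print(x_denoised.shape)
```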
Deep Generative Models may involve:
• Formulation:
• formulate a problem as probabilistic modeling
• decompose complex distributions into simple and tractable ones
• Representation: deep neural networks to represent data and their
distributions
• Objective function: to measure how good the predicted distribution is
• Optimization: optimize the networks and/or the decomposition
• Inference:
• sampler: to produce new samples
• probability density estimator (optional)
Formulating Real-world Problems as Generative Models
• Generative models are about p(x|y)

What can be y?
• condition, constraint, labels, attributes
• more abstract, less informative

What can be x?
• “data”, samples, observations, measurements
• more concrete, more informative
Case study: Formulating as p(x|y)
• Natural language conversation
y: prompt

x: response of the chatbot


Case study: Formulating as p(x|y)
• Text-to-image/video generation
y: text prompt (“teddy bear teaching a course, with ‘generative models’ written on blackboard”)

x: generated visual content

Image generated by Stable Diffusion 3 Medium


Case study: Formulating as p(x|y)
• Text-to-3D structure generation

y: text prompt

x: generated 3D structures

Figure credit: Tang, et al. LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation. ECCV 2024
Case study: Formulating as p(x|y)
• Protein structure generation

y: condition/constraint (e.g., symmetry)
x: generated protein structures

Watson, et al. De novo design of protein structure and function with RFdiffusion, Nature 2023
Case study: Formulating as p(x|y)
• Class-conditional image generation
y: class label (e.g., “red fox”)

x: generated image

Image generated by: Li, et al. Autoregressive Image Generation without Vector Quantization, 2024
Case study: Formulating as p(x|y)
• “Unconditional” image generation
y: an implicit condition (“images following CIFAR10 distribution”)

x: generated CIFAR10-like images

• p(x|y): images ~ CIFAR10


• p(x): all images

Images generated by: Karras, et al. Elucidating the Design Space of Diffusion-Based Generative Models, NeurIPS 2022
Case study: Formulating as p(x|y)
• Classification (a generative perspective)

y: an image as the “condition”
x: probability of classes conditioned on the image (e.g., cat, bird, horse, dog)
Case study: Formulating as p(x|y)
• Open-vocabulary recognition

y: an image as the “condition”
x: plausible descriptions conditioned on the image (e.g., “bird”, “flamingo”, “red color”, “orange color”, ...)
Case study: Formulating as p(x|y)
• Image captioning

y: an image as the “condition”
x: plausible descriptions conditioned on the image

figure credit: https://github.com/GoogleCloudPlatform/asl-ml-immersion/blob/master/notebooks/multi_modal/solutions/image_captioning.ipynb


Case study: Formulating as p(x|y)
• Chatbot with visual inputs

y: image and text prompt

x: response of the chatbot

Figure from: GPT-4 Technical Report, 2023


Case study: Formulating as p(x|y)
• Policy learning in robotics
y: visual and other sensory observations
x: policies (probability of actions)

Chi, et al. Diffusion Policy: Visuomotor Policy Learning via Action Diffusion, RSS 2023
Formulating Real-world Problems as Generative Models
• Generative models are about p(x|y)
• Many problems can be formulated as generative models
• What’s x? What’s y?
• How to represent x, y, and their dependence?
About this course
This course will cover:
• How real-world problems are formulated as generative models
• Probabilistic foundations and learning algorithms
• Challenges, opportunities, open questions
