
(Seminar) An Introduction To Simulation-Based Inference

The document presents an introduction to simulation-based inference, highlighting its significance in statistical analysis and the advancements made possible by deep learning. It discusses various algorithms for inference, including neural ratio estimation and diagnostics for validating results. Challenges such as the curse of dimensionality and the need for extensive data are acknowledged as areas requiring further development.


An introduction to

simulation-based inference

51st SLAC Summer Institute

August 16, 2023

Gilles Louppe
[email protected]

1 / 36
2 / 36
v_x = v cos(α),  v_y = v sin(α),
dx/dt = v_x,  dy/dt = v_y,  dv_y/dt = −G.

3 / 36
import numpy as np
from numpy import random

G = 9.81  # gravitational acceleration (m/s^2); the imports and this constant are added here, not shown on the slide

def simulate(v, alpha, dt=0.001):
    v_x = v * np.cos(alpha)  # x velocity (m/s)
    v_y = v * np.sin(alpha)  # y velocity (m/s)
    y = 1.1 + 0.3 * random.normal()  # initial height with random perturbation (m)
    x = 0.0

    while y > 0:  # simulate until ball hits floor
        v_y += dt * -G  # acceleration due to gravity
        x += dt * v_x
        y += dt * v_y

    return x + 0.25 * random.normal()  # noisy measurement of the landing position (m)
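
A minimal usage sketch (the prior ranges below are illustrative assumptions, not from the slides): draw parameters and generate one synthetic observation.

v_true = random.uniform(5.0, 15.0)           # hypothetical prior over the initial speed (m/s)
alpha_true = random.uniform(0.0, np.pi / 2)  # hypothetical prior over the launch angle (rad)
x_obs = simulate(v_true, alpha_true)         # one noisy landing position (m)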

4 / 36
5 / 36
What parameter values θ are the most plausible?

6 / 36
7 / 36
Outline
1. Simulation-based inference

2. Algorithms

Neural ratio estimation

Neural posterior estimation

Neural score estimation

3. Diagnostics

8 / 36
Simulation-based inference

8 / 36
Scientific simulators

9 / 36
θ, z, x ∼ p(θ, z, x)

10 / 36
θ, z ∼ p(θ, z∣x)

11 / 36
12 / 36
p(x∣θ) = ∭ p(z_p∣θ) p(z_s∣z_p) p(z_d∣z_s) p(x∣z_d) dz_p dz_s dz_d

yikes!

13 / 36
Bayesian inference

Start with

a simulator that can generate N samples x_i ∼ p(x_i∣θ_i),

a prior model p(θ),

observed data x_obs ∼ p(x_obs∣θ_true).

Then, estimate the posterior

p(θ∣x_obs) = p(x_obs∣θ) p(θ) / p(x_obs).

14 / 36
15 / 36
Algorithms

15 / 36

Credits: Cranmer, Brehmer and Louppe, 2020. 16 / 36
Approximate Bayesian Computation (ABC)

Issues:

How to choose x′ ? ϵ? ∣∣ ⋅ ∣∣?

No tractable posterior.

Need to run new simulations for new data or new prior.


Credits: Johann Brehmer. 17 / 36
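
A minimal rejection-ABC sketch for the projectile example (reusing simulate and the numpy imports above; the prior ranges, tolerance eps, and number of proposals are illustrative assumptions):

def abc_rejection(x_obs, n_proposals=100_000, eps=0.1):
    # Propose parameters from the prior, simulate, and keep proposals whose
    # simulated landing position falls within eps of the observation.
    accepted = []
    for _ in range(n_proposals):
        v = random.uniform(5.0, 15.0)              # assumed prior over speed (m/s)
        alpha = random.uniform(0.0, np.pi / 2)     # assumed prior over angle (rad)
        if abs(simulate(v, alpha) - x_obs) < eps:  # the distance and eps must be chosen by hand
            accepted.append((v, alpha))
    return np.array(accepted)  # samples from the ABC approximation of p(θ∣x_obs)

Changing x_obs, the prior, or eps requires running all the simulations again, which is one of the issues listed above.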

Credits: Cranmer, Brehmer and Louppe, 2020. 18 / 36
Neural ratio estimation
The likelihood-to-evidence ratio r(x∣θ) = p(x∣θ)/p(x) = p(x,θ)/(p(x)p(θ)) can be learned, even
if neither the likelihood nor the evidence can be evaluated:

A classifier is trained to distinguish pairs x, θ ∼ p(x, θ) from pairs x, θ ∼ p(x)p(θ); its output yields r^(x∣θ).

Credits: Cranmer et al, 2015; Hermans et al, 2020. 19 / 36
The solution d found after training approximates the optimal classifier

d(x, θ) ≈ d∗(x, θ) = p(x, θ) / (p(x, θ) + p(x)p(θ)).

Therefore,

r(x∣θ) = p(x∣θ)/p(x) = p(x, θ)/(p(x)p(θ)) ≈ d(x, θ)/(1 − d(x, θ)) = r^(x∣θ).

20 / 36
p(θ∣x) ≈ r^(x∣θ)p(θ)

21 / 36
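
A minimal PyTorch sketch of this training loop (assuming PyTorch is available; the network width, optimizer settings, and the prior_sample / simulator_sample helpers returning tensors of shape (batch, 1) are illustrative assumptions, not part of the slides):

import torch
import torch.nn as nn

# Classifier d(x, θ); its raw output is the logit of d.
classifier = nn.Sequential(nn.Linear(2, 64), nn.ReLU(),
                           nn.Linear(64, 64), nn.ReLU(),
                           nn.Linear(64, 1))
optimizer = torch.optim.Adam(classifier.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()

for step in range(1000):
    theta = prior_sample(256)        # hypothetical helper: θ ∼ p(θ), shape (256, 1)
    x = simulator_sample(theta)      # hypothetical helper: x ∼ p(x∣θ), shape (256, 1)
    theta_marg = theta[torch.randperm(len(theta))]  # shuffling breaks the pairing: x, θ ∼ p(x)p(θ)

    logits_joint = classifier(torch.cat([x, theta], dim=1))      # label 1: joint pairs
    logits_marg = classifier(torch.cat([x, theta_marg], dim=1))  # label 0: marginal pairs
    loss = bce(logits_joint, torch.ones_like(logits_joint)) \
         + bce(logits_marg, torch.zeros_like(logits_marg))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Since d/(1 − d) ≈ r, the classifier logit is directly log r^(x∣θ),
# and the approximate posterior is p^(θ∣x) ∝ exp(logit) · p(θ).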
Constraining dark matter with stellar streams

Interaction of Pal 5 with two …


Image credits: C. Bickel/Science; D. Erkal. 22 / 36

Credits: Hermans et al, 2021. 23 / 36
Preliminary results for GD-1 suggest a preference for CDM over WDM.

24 / 36
Neural Posterior Estimation

min_φ E_p(x) [KL(p(θ∣x) ∣∣ q_φ(θ∣x))]


25 / 36
Normalizing flows

A normalizing flow is a sequence of invertible transformations f_k that map a
simple distribution p_0 to a more complex distribution p_K:

By the change of variables formula, the log-likelihood of a sample x is given by

log p(x) = log p(z_0) − ∑_{k=1}^{K} log ∣det J_{f_k}(z_{k−1})∣.

26 / 36
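
To connect the two previous slides, here is a minimal sketch of neural posterior estimation with a single conditional affine transformation, i.e. a one-step normalizing flow (assuming PyTorch; practical applications stack many invertible steps, and the prior_sample / simulator_sample helpers are the same illustrative assumptions as before). Maximizing log q_φ(θ∣x) over simulated pairs minimizes the expected KL objective above.

import math
import torch
import torch.nn as nn

class ConditionalAffineFlow(nn.Module):
    # One invertible step θ = μ(x) + exp(s(x)) · z with z ∼ N(0, I);
    # deeper flows compose several such transformations f_k.
    def __init__(self, x_dim=1, theta_dim=1):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(x_dim, 64), nn.ReLU(),
                                 nn.Linear(64, 2 * theta_dim))
        self.theta_dim = theta_dim

    def log_prob(self, theta, x):
        mu, log_sigma = self.net(x).chunk(2, dim=-1)
        z = (theta - mu) * torch.exp(-log_sigma)   # inverse transformation z = f^{-1}(θ)
        log_base = -0.5 * (z ** 2).sum(-1) - 0.5 * self.theta_dim * math.log(2 * math.pi)
        log_det = log_sigma.sum(-1)                # log ∣det J_f∣ of the forward map
        return log_base - log_det                  # change of variables, as on the slide

flow = ConditionalAffineFlow()
optimizer = torch.optim.Adam(flow.parameters(), lr=1e-3)

for step in range(1000):
    theta = prior_sample(256)       # hypothetical helper: θ ∼ p(θ), shape (256, 1)
    x = simulator_sample(theta)     # hypothetical helper: x ∼ p(x∣θ), shape (256, 1)
    loss = -flow.log_prob(theta, x).mean()   # maximize E[log q_φ(θ∣x)]
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()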
Exoplanet atmosphere characterization


Credits: NASA/JPL-Caltech, 2010. 27 / 36

Credits: Vasist et al, 2023. 28 / 36
Diagnostics

28 / 36
p^(θ∣x) = sbi(p(x∣θ), p(θ), x)

We must make sure our approximate simulation-based inference algorithms can (at least) actually realize faithful inferences on the (expected) observations.

How do we know this is good enough?

29 / 36
Mode convergence

The maximum a posteriori estimate converges towards the nominal value θ∗ for an increasing number of independent and identically distributed observables x_i ∼ p(x∣θ∗):

lim_{N→∞} arg max_θ p(θ∣{x_i}_{i=1}^N) = lim_{N→∞} arg max_θ p(θ) ∏_i r(x_i∣θ) = θ∗


Credits: Brehmer et al, 2019. 30 / 36
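
In practice, with a ratio estimator such as the classifier sketched earlier (1-D x and θ assumed), the pooled posterior for N i.i.d. observations can be evaluated on a grid, up to a constant, as log p(θ) + Σ_i log r^(x_i∣θ); a small illustrative sketch:

def log_posterior_pooled(theta_grid, x_list, log_prior, classifier):
    # theta_grid: (M, 1) tensor of candidate parameter values
    # x_list:     observed tensors x_i, each of shape (1,)
    # Returns log p(θ) + Σ_i log r^(x_i∣θ), up to an additive constant.
    log_post = log_prior(theta_grid)                 # hypothetical prior log-density, shape (M,)
    for x in x_list:
        pairs = torch.cat([x.expand(len(theta_grid), 1), theta_grid], dim=1)
        log_post = log_post + classifier(pairs).squeeze(-1)   # classifier logit = log r^
    return log_post

# The MAP estimate is theta_grid[log_posterior_pooled(...).argmax()];
# as N grows, it converges towards θ∗ as stated above.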
Coverage diagnostic

For x, θ ∼ p(x, θ), compute the 1 − α credible interval based on p^(θ∣x).

If the fraction of samples for which θ is contained within the interval is larger than the nominal coverage probability 1 − α, then the approximate posterior p^(θ∣x) has coverage.


Credits: Hermans et al, 2021; Siddharth Mishra-Sharma, 2021. 31 / 36
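
A minimal sketch of this diagnostic for a 1-D parameter (assuming a posterior_sample(x, n) helper that draws from p^(θ∣x); central credible intervals are used here for simplicity, whereas highest-density regions are common in practice):

def empirical_coverage(pairs, posterior_sample, alpha=0.05, n_post=1000):
    # pairs: list of (theta_true, x) drawn from p(θ, x) with the simulator.
    # Returns the fraction of pairs whose true θ lies inside the central
    # 1 − alpha credible interval of the approximate posterior.
    hits = 0
    for theta_true, x in pairs:
        samples = posterior_sample(x, n_post)   # hypothetical helper: θ ∼ p^(θ∣x)
        lo, hi = np.quantile(samples, [alpha / 2, 1 - alpha / 2])
        hits += int(lo <= theta_true <= hi)
    return hits / len(pairs)

# If the returned fraction is at least 1 − alpha, the approximate posterior
# has coverage at that credibility level (it is not overconfident).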

Credits: Hermans et al, 2021. 32 / 36
What if diagnostics fail?

33 / 36
Balanced NRE
Enforce neural ratio estimation to be conservative by using binary classifiers d^ that are balanced, i.e. such that

E_{p(θ,x)}[d^(θ, x)] = E_{p(θ)p(x)}[1 − d^(θ, x)].


Credits: Delaunoy et al, 2022. 34 / 36
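
A sketch of how this condition could be imposed during training (reusing logits_joint, logits_marg and bce from the NRE sketch above; the quadratic penalty and its weight lam are illustrative assumptions — see Delaunoy et al. for the exact formulation):

lam = 100.0  # penalty strength (assumed)

d_joint = torch.sigmoid(logits_joint)   # d^ on pairs from p(θ, x)
d_marg = torch.sigmoid(logits_marg)     # d^ on pairs from p(θ)p(x)

# Balancing condition: E_{p(θ,x)}[d^] = E_{p(θ)p(x)}[1 − d^],
# i.e. d_joint.mean() + d_marg.mean() should equal 1.
balance = d_joint.mean() + d_marg.mean() - 1.0
loss = bce(logits_joint, torch.ones_like(logits_joint)) \
     + bce(logits_marg, torch.zeros_like(logits_marg)) \
     + lam * balance ** 2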

Credits: Delaunoy et al, 2022. 35 / 36
Summary
Advances in deep learning have enabled new approaches to statistical inference.

This is a major evolution in the statistical capabilities for science, as it enables the analysis of complex models and data without simplifying assumptions.

Inference remains approximate and requires careful validation.

Obstacles remain to be overcome, such as the curse of dimensionality and the need for large amounts of data.

36 / 36
The end.

36 / 36
