
Tools for Probabilistic Data Analysis in Python *

Dan Foreman-Mackey | #pyastro16

* in 15 minutes
What have I done?
Tools for Probabilistic Data Analysis in Python
Physics

mean model
(physical parameters → predicted data)

inference
(parameter estimation)

Data

noise
(stochastic; instrument, systematics, etc.)
A few examples

1 linear regression

2 maximum likelihood

3 uncertainty quantification
Linear regression

if you have:
a linear mean model and
known Gaussian uncertainties

and you want:


"best" parameters and uncertainties
Linear (mean) models

y = m x + b

y = a_2 x^2 + a_1 x + a_0

y = a \sin(x + w)

+ known Gaussian uncertainties

Linear regression

# x, y, yerr are numpy arrays of the same shape

import numpy as np

A = np.vander(x, 2)                      # design matrix: rows are [x_n, 1]
ATA = np.dot(A.T, A / yerr[:, None]**2)  # A^T C^{-1} A, with C = diag(yerr^2)
sigma_w = np.linalg.inv(ATA)             # covariance of the parameters
mean_w = np.linalg.solve(ATA, np.dot(A.T, y / yerr**2))  # best-fit parameters

where

A = \begin{pmatrix} x_1 & 1 \\ x_2 & 1 \\ \vdots & \vdots \\ x_N & 1 \end{pmatrix}
\quad \mathrm{and} \quad
w = \begin{pmatrix} m \\ b \end{pmatrix}
That's it!
(in other words: "Don't use MCMC for linear regression!")
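
As a sanity check, here is a minimal end-to-end sketch of the snippet above on synthetic data (the seed, true parameters, and error bars are illustrative, not from the talk):

import numpy as np

np.random.seed(42)
x = np.sort(np.random.uniform(0, 10, 50))
yerr = 0.1 + 0.4 * np.random.rand(50)           # heteroskedastic error bars
y = 0.5 * x + 1.0 + yerr * np.random.randn(50)  # truth: m = 0.5, b = 1.0

A = np.vander(x, 2)
ATA = np.dot(A.T, A / yerr[:, None]**2)
sigma_w = np.linalg.inv(ATA)
mean_w = np.linalg.solve(ATA, np.dot(A.T, y / yerr**2))

print(mean_w)                     # should land near [0.5, 1.0]
print(np.sqrt(np.diag(sigma_w)))  # 1-sigma parameter uncertainties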
Maximum likelihood

if you have:
a non-linear mean model and/or
non-Gaussian/unknown noise

and you want:


"best" parameters
Likelihoods

p(data | physics)
"probability of the data given physics"

parameterized by some parameters θ

Example likelihood function (log-likelihood, with mean model f_θ):

\ln p(\{y_n\} \,|\, \theta) = -\frac{1}{2} \sum_{n=1}^{N} \frac{[y_n - f_\theta(x_n)]^2}{\sigma_n^2} + \mathrm{constant}
Likelihoods

SciPy

# x, y, yerr are numpy arrays of the same shape

import numpy as np
from scipy.optimize import minimize

def model(theta, x):
    # f_theta(x_n) = a / (1 + exp(-b (x_n - c)))
    a, b, c = theta
    return a / (1 + np.exp(-b * (x - c)))

def neg_log_like(theta):
    # -ln p({y_n} | theta), up to the constant
    return 0.5 * np.sum(((model(theta, x) - y) / yerr)**2)

r = minimize(neg_log_like, [1.0, 10.0, 1.5])

print(r)

f_\theta(x_n) = \frac{a}{1 + e^{-b (x_n - c)}}
\qquad
\ln p(\{y_n\} \,|\, \theta) = -\frac{1}{2} \sum_{n=1}^{N} \frac{[y_n - f_\theta(x_n)]^2}{\sigma_n^2} + \mathrm{constant}
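
A minimal way to exercise this fit on synthetic data (all numbers illustrative, not from the talk):

import numpy as np
from scipy.optimize import minimize

np.random.seed(1234)
x = np.sort(np.random.uniform(0.0, 3.0, 50))
yerr = np.full_like(x, 0.05)
y = 1.0 / (1.0 + np.exp(-10.0 * (x - 1.5))) + yerr * np.random.randn(50)  # truth: a=1, b=10, c=1.5

def model(theta, x):
    a, b, c = theta
    return a / (1 + np.exp(-b * (x - c)))

def neg_log_like(theta):
    return 0.5 * np.sum(((model(theta, x) - y) / yerr)**2)

print(minimize(neg_log_like, [1.0, 10.0, 1.5]).x)  # should land near [1.0, 10.0, 1.5]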
"But it doesn't work…"
— everyone
1 initialization

2 bounds

3 convergence

4 gradients
Gradients

\frac{\mathrm{d}}{\mathrm{d}\theta} \ln p(\{y_n\} \,|\, \theta)

seriously?
AutoDiff to the rescue!

"The most criminally underused tool in the [PyAstro] toolkit"
— adapted from justindomke.wordpress.com
AutoDiff

"Compile"-time exact gradients, built by applying the chain rule to every primitive operation:

GradType sin(GradType x):
    return GradType(
        sin(x.value),           # value of the result
        x.grad * cos(x.value)   # chain rule for the gradient
    )
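
For intuition, here is a minimal runnable forward-mode sketch of the same idea in Python (the Dual class and dual_sin helper are illustrative, not part of any library):

import math

class Dual:
    # a value paired with its derivative
    def __init__(self, value, grad=0.0):
        self.value = value
        self.grad = grad

def dual_sin(x):
    # chain rule: d/dt sin(x(t)) = cos(x) * dx/dt
    return Dual(math.sin(x.value), x.grad * math.cos(x.value))

x = Dual(1.0, 1.0)      # seed with dx/dx = 1
y = dual_sin(x)
print(y.value, y.grad)  # sin(1) and cos(1)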
AutoDiff in Python

1 Theano: deeplearning.net/software/theano

2 HIPS/autograd: github.com/HIPS/autograd
HIPS/autograd just works

import autograd.numpy as np
from autograd import elementwise_grad

def f(x):
    y = np.exp(-x)
    return (1.0 - y) / (1.0 + y)

df = elementwise_grad(f)    # first derivative
ddf = elementwise_grad(df)  # second derivative
HIPS/autograd just works

[Figure: f(x) and its first and second derivatives (df, ddf) plotted over x from -4 to 4]
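
A sketch of how that figure could be reproduced (the matplotlib usage here is an assumption, not from the talk):

import matplotlib.pyplot as plt
import autograd.numpy as np
from autograd import elementwise_grad

def f(x):
    y = np.exp(-x)
    return (1.0 - y) / (1.0 + y)

df = elementwise_grad(f)
ddf = elementwise_grad(df)

x = np.linspace(-4, 4, 200)
for g in (f, df, ddf):
    plt.plot(x, g(x))   # f, f', f'' on one set of axes
plt.xlabel("x")
plt.show()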
before autograd

# x, y, yerr are numpy arrays of the same shape

import numpy as np
from scipy.optimize import minimize

def model(theta, x):
    a, b, c = theta
    return a / (1 + np.exp(-b * (x - c)))

def neg_log_like(theta):
    r = (y - model(theta, x)) / yerr
    return 0.5 * np.sum(r*r)

r = minimize(neg_log_like, [1.0, 10.0, 1.5])

print(r)
after autograd

# x, y, yerr are numpy arrays of the same shape

from autograd import grad
import autograd.numpy as np
from scipy.optimize import minimize

def model(theta, x):
    a, b, c = theta
    return a / (1 + np.exp(-b * (x - c)))

def neg_log_like(theta):
    r = (y - model(theta, x)) / yerr
    return 0.5 * np.sum(r*r)

r = minimize(neg_log_like, [1.0, 10.0, 1.5],
             jac=grad(neg_log_like))
print(r)

(115 function calls without the gradient; 66 with it)
HIPS/autograd just works

but… HIPS/autograd is not super fast

you might need to drop down to a compiled language

or...
Use Julia?
Uncertainty quantification

if you have:
a non-linear mean model and/or
non-Gaussian/unknown noise

and you want:


parameter uncertainties
Uncertainty

p(physics | data) ∝ p(data | physics) × p(physics)

posterior ∝ likelihood × prior
(the posterior is the distribution of physical parameters consistent with the data)
You're going to have to

SAMPLE

(photo: CC BY-ND, Flickr user Franz Jachim)
MCMC sampling

it's hammer time!

emcee
The MCMC Hammer
MCMC sampling with emcee

dfm.io/emcee; github.com/dfm/emcee
MCMC sampling with emcee

# x, y, yerr are numpy arrays of the same shape

import emcee
import numpy as np

def model(theta, x):
    a, b, c = theta
    return a / (1 + np.exp(-b * (x - c)))

def log_prob(theta):
    log_prior = 0.0  # improper flat prior
    r = (y - model(theta, x)) / yerr
    return -0.5 * np.sum(r*r) + log_prior

# 32 walkers started in a small ball around the maximum-likelihood point
ndim, nwalkers = 3, 32
p0 = np.array([1.0, 10.0, 1.5])
p0 = p0 + 0.01*np.random.randn(nwalkers, ndim)
sampler = emcee.EnsembleSampler(nwalkers, ndim, log_prob)
sampler.run_mcmc(p0, 1000)
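
The log_prior = 0.0 above is an improper flat prior. A minimal sketch of a proper flat prior inside bounds (the bounds are illustrative, not from the talk):

def log_prob(theta):
    a, b, c = theta
    # zero prior probability outside the (illustrative) bounds
    if not (0.0 < a < 10.0 and 0.0 < b < 100.0 and 0.0 < c < 10.0):
        return -np.inf
    r = (y - model(theta, x)) / yerr
    return -0.5 * np.sum(r * r)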
MCMC sampling with emcee

[Figure: corner plot of the posterior samples in a, b, and c for f_θ(x_n) = a / (1 + e^{-b (x_n - c)}); made using github.com/dfm/corner.py]
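
A sketch of how such a plot could be made from the sampler above (get_chain with discard/flat is the emcee v3 API; the number of discarded burn-in steps is illustrative):

import corner

samples = sampler.get_chain(discard=100, flat=True)  # (n_steps * n_walkers, 3)
fig = corner.corner(samples, labels=["a", "b", "c"])
fig.savefig("corner.png")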
1 initialization

2 bounds

3 convergence

4 gradients

with MCMC, that list becomes:

1 initialization

2 priors

3 convergence

4 gradients?
Other MCMC samplers in Python

1 pymc-devs/pymc3 (hierarchical inference)

2 stan-dev/pystan (hierarchical inference)

3 JohannesBuchner/PyMultiNest (nested sampling)

4 eggplantbren/DNest4 (nested sampling)
in summary…
If your data analysis problem looks like this… *

Physics

mean model
(physical parameters → predicted data)

inference
(parameter estimation)

Data

noise
(stochastic; instrument, systematics, etc.)

* it probably does
… now you know how to solve it! *

https://speakerdeck.com/dfm/pyastro16

* in theory
