Machine Learning For Astronomy: Rob Fergus
Machine Learning For Astronomy: Rob Fergus
Astronomy
Rob Fergus
• Want to compute
Posterior Likelihood Prior
• Alternatively:
Discriminative Generative
approaches approaches
compute this compute this
Generative Modeling
• Top-down interpretation of data
– i.e. adjust model parameters to fit observed data
• E.g. Gaussian model, estimate
that maximizes likelihood of data:
+
Generative Modeling
• Given new point, we can compute
• Combine with prior to give posterior
• Likelihood ratio defines decision surface
+
Discriminative Modeling
• Model posterior directly (no model of data density)
• Fit decision surface directly
• Bottom-up model: input=x, output=class prediction
Principal Components Analysis (PCA)
• Example of generative model (objective: compression)
• Observed data points:
• Hidden manifold coords.:
• Hidden linear mapping:
r
Comparison
Generative Models Discriminative Models
+ Labels not essential - Need labels
+ Unsupervised or - Supervised only
supervised • Model only fits
• Models whole density decision
+ Interpretable result surface
- Can be hard to specify + Fast to evaluate
model structure + Can be very powerful
Detour
• Three ingredients:
1. Discriminative neural network models
(supervised training)
2. Big labeled datasets
3. Lots of computation
Computer Vision
• Image Recognition
– Input: Pixels
– Output: Class Label
Ground Truth
Model
Predictions
• Latest generation
nVidia GPUs (Pascal)
deliver 10 TFlops / card
– Faster than fastest
super-computer
in world in 2000
ImageNet Performance over time
30
Convolutional
25 Neural Nets
Top-5 Classification Error (%)
20
15
10
0
2010 2011 2012 2013 2014 Human 2015
• Crowd-sourced
labels for
different galaxy
shapes
F i gu r e 1. T he Galaxy Zoo 2 decision t ree. Reproduced from Figure 1 in W illet t et al. (2013).
Galaxy Morphology Classification
[Rotation-invariant convolutional neural networks for galaxy morphology prediction,
Dieleman, Willett, Dambre, R. Astron. Soc. March 2015]
1 Dept. of Computer Science, 2 Center for Cosmology 3 Dept. of Astrophysics 4 Space Telescope
Courant Institute, & Particle Physics, American Museum Science Institute
New York University Dept. of Physics, of Natural History
New York University
P1640 Data Cubes
• Each exposure gives 32 wavelength bands
(near IR 950-1770nm)
• Speckles are
diffraction artifacts
• Planet stationary
Use Polar Representation
• Speckles become diagonal structures
Wavelength
• Planet is vertical
– Key to separating the two
• Assume: independence to
angle and exposure
Wavelength
Wavelength
Three versions of S4
Machine
1. S4 Detect
Background data +
Support Vector Machine Prediction value of the
DS4 artificially generated
(Supervised learning) model
data
S4 Detect vs DS4 Detect
Relative brightness of companion vs speckle flux
3. S4 Spectra
True Generative Model for Spectra
Wavelength
-- Treat as latent variable
Radius
• Observed data = PCA speckle model
+
Fixed (spatial) planet model with latent spectra
Planet
Estimated Speckles + Estimated Planet Model
Speckle Model Radius
Wavelength
Raw data
+ synthetic planet transit
Inferred
depth map
“Unified” Generative Model of Astronomical Images
• Unified
Bayesian
model
• Propagate
uncertainty
from pixels
• Physics-
informed
priors
Hogg & Fergus, NSF #1124794 “CDI: A Unified Probabilistic Model of Astronomical Imaging”
Detection of Planets
Wavelength
S4 Graphical Model
Speckle coeffs. Planet position Planet
spectrum
Coeff.
prior ϕ zi μi s
W xi pi g
Speckle
Planet shape
basis
Pixel j
Speckle image: from yji Planet image:
cube i pixel j
exposure i
Assume Gaussian distributions,
yields overall cost:
Approach
• Build statistical model of speckles
– Physical model of optics too complex
Broadband
white light
source
y
• Obtained from
instrument calibration
(spatially invariant) x
• Spectra fixed:
Wavelength
assume white
Radius
Correlation with Planet Model
Radius
Data Cubes
• Each exposure gives 32 wavelength bands
(near IR 950-1770nm)
• Speckles are
diffraction artifacts
• Planet stationary
Leave-Out Strategy
• Use model to
reconstruct test slices
Evaluation
• 10 exposures of star HR8799 from June 2012