2006 March 21 MRF
[Figure: left (L) and right (R) stereo images; the matching cost is the squared difference, (L[x] – R[x-d])^2, for some position x and candidate disparity d.]
[Figure: nodes x1, x2, x3 and y, z; Tinker Toys car photo: http://mark.michaelis.net/weblog/2002/12/29/Tinker%20Toys%20Car.jpg]
Steps in building and using graphical models
$$\begin{pmatrix} 1 & \alpha & \alpha \\ \alpha & 1 & \alpha \\ \alpha & \alpha & 1 \end{pmatrix}$$
A more general compatibility matrix
(values shown as grey scale)
Derivation of belief propagation

[Figure: a three-node chain MRF. Observations y1, y2, y3 connect to hidden nodes x1, x2, x3 through Φ(x1, y1), Φ(x2, y2), Φ(x3, y3); neighboring hidden nodes are linked by Ψ(x1, x2) and Ψ(x2, x3).]

$$x_1^{\text{MMSE}} = \operatorname{mean}_{x_1} \sum_{x_2} \sum_{x_3} P(x_1, x_2, x_3, y_1, y_2, y_3)$$

Over this chain the joint probability factorizes:

$$P(x_1, x_2, x_3, y_1, y_2, y_3) = \Phi(x_1, y_1)\, \Phi(x_2, y_2)\, \Phi(x_3, y_3)\, \Psi(x_1, x_2)\, \Psi(x_2, x_3)$$

Propagation rules

Each factor involves only neighboring variables, so the sums can be pushed inside the products:

$$x_1^{\text{MMSE}} = \operatorname{mean}_{x_1}\, \Phi(x_1, y_1) \sum_{x_2} \Phi(x_2, y_2)\, \Psi(x_1, x_2) \sum_{x_3} \Phi(x_3, y_3)\, \Psi(x_2, x_3)$$
Propagation rules

[Figure: the same chain; y1, y2, y3 observed through Φ(x1, y1), Φ(x2, y2), Φ(x3, y3); links Ψ(x1, x2), Ψ(x2, x3).]

$$x_1^{\text{MMSE}} = \operatorname{mean}_{x_1}\, \Phi(x_1, y_1) \sum_{x_2} \Phi(x_2, y_2)\, \Psi(x_1, x_2) \sum_{x_3} \Phi(x_3, y_3)\, \Psi(x_2, x_3)$$

Each partial sum is a message; the message that node 2 passes to node 1 is

$$M_1^2(x_1) = \sum_{x_2} \Psi(x_1, x_2)\, \Phi(x_2, y_2)\, M_2^3(x_2)$$
Belief propagation: the nosey
neighbor rule
“Given everything that I know, here’s what I
think you should think”
$$M_i^j(x_i) = \sum_{x_j} \psi_{ij}(x_i, x_j) \prod_{k \in N(j)\setminus i} M_j^k(x_j)$$

[Figure: node j gathers the messages from its other neighbors and sends a single message to node i.]
Beliefs
To find a node’s beliefs: Multiply together all the
messages coming in to that node.
$$b_j(x_j) = \prod_{k \in N(j)} M_j^k(x_j)$$
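To make the two rules concrete, here is a minimal numpy sketch (the function and variable names are my own, not from the lecture): a message is a vector over the states of the receiving node, and a belief is the normalized product of the messages arriving at a node.

```python
import numpy as np

def send_message(psi_ij, msgs_into_j_except_i):
    """Message that node j sends to node i:
    M_i^j(x_i) = sum over x_j of psi(x_i, x_j) * product over k in N(j) minus i of M_j^k(x_j).

    psi_ij: (n_i, n_j) compatibility table, indexed [x_i, x_j].
    msgs_into_j_except_i: messages arriving at j from every neighbor except i
        (the local evidence Phi(x_j, y_j) is just the message from the observed
        node y_j, so it belongs in this list too).
    """
    prod = np.ones(psi_ij.shape[1])
    for m in msgs_into_j_except_i:
        prod = prod * m
    msg = psi_ij @ prod            # marginalize over x_j
    return msg / msg.sum()         # normalization is optional; it just keeps the numbers stable

def belief(msgs_into_j):
    """Belief b_j(x_j): the product of all messages arriving at node j, normalized."""
    b = np.ones_like(msgs_into_j[0])
    for m in msgs_into_j:
        b = b * m
    return b / b.sum()
```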
Simple BP example

[Figure: chain x1 – x2 – x3, with observation y1 at x1 and y3 at x3; x2 has no observation of its own.]

$$M_1^{y_1} = \begin{pmatrix} .4 \\ .6 \end{pmatrix}, \qquad M_3^{y_3} = \begin{pmatrix} .8 \\ .2 \end{pmatrix}$$

$$\Psi(x_1, x_2) = \Psi(x_2, x_3) = \begin{pmatrix} .9 & .1 \\ .1 & .9 \end{pmatrix}$$
Rather than carrying out all the sums by brute force, you can run belief propagation (BP). BP redistributes the various partial sums, leading to a very efficient calculation.
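Plugging the example's numbers into the message and belief rules gives a small numpy sketch (variable names are mine):

```python
import numpy as np

psi = np.array([[0.9, 0.1],
                [0.1, 0.9]])      # Psi(x1, x2) = Psi(x2, x3)
ev1 = np.array([0.4, 0.6])        # M_1^{y1}: evidence message into x1
ev3 = np.array([0.8, 0.2])        # M_3^{y3}: evidence message into x3

def normalize(v):
    return v / v.sum()

# Messages along the chain (x2 has no local evidence of its own).
m_1_to_2 = psi.T @ ev1            # sum over x1 of Psi(x1, x2) * ev1(x1)
m_3_to_2 = psi @ ev3              # sum over x3 of Psi(x2, x3) * ev3(x3)
m_2_to_1 = psi @ m_3_to_2         # sum over x2 of Psi(x1, x2) * m_3_to_2(x2)
m_2_to_3 = psi.T @ m_1_to_2       # sum over x2 of Psi(x2, x3) * m_1_to_2(x2)

b1 = normalize(ev1 * m_2_to_1)    # belief at x1
b2 = normalize(m_1_to_2 * m_3_to_2)
b3 = normalize(ev3 * m_2_to_3)
print(b1, b2, b3)
```

With these numbers b1 comes out near (0.60, 0.40): the strong evidence at x3 propagates down the chain and outweighs x1's own weak evidence of (0.4, 0.6).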
Belief and message updates

$$b_j(x_j) = \prod_{k \in N(j)} M_j^k(x_j)$$

$$M_i^j(x_i) = \sum_{x_j} \psi_{ij}(x_i, x_j) \prod_{k \in N(j)\setminus i} M_j^k(x_j)$$
Optimal solution in a chain or tree:
Belief Propagation
• “Do the right thing” Bayesian algorithm.
• For Gaussian random variables over time:
Kalman filter.
• For hidden Markov models:
forward/backward algorithm (and MAP
variant is Viterbi).
Making probability distributions modular, and
therefore tractable:
Probabilistic graphical models
If we want to find out what the likely state of variable x1 is (say, the
position of the hand of some person we are observing), what can we do?
Two reasonable choices are: (a) find the value of x1 (and of all the other
variables) that gives the maximum of P(x1, x2, x3, x4, x5); that’s the MAP
solution.
Or (b) marginalize over all the other variables and then take the mean or the maximum of the resulting marginal over x1. Marginalizing, then taking the mean, is equivalent to finding the MMSE solution. Marginalizing, then taking the max, is called the max-marginal solution and is sometimes a useful thing to do.
To find the marginal probability at x1, we have to take this sum:

$$\sum_{x_2, x_3, x_4, x_5} P(x_1, x_2, x_3, x_4, x_5)$$
Suppose the variables form a Markov chain: x1 causes x2 which causes x3,
etc. We might draw out this relationship as follows:
x1 → x2 → x3 → x4 → x5
P(a,b) = P(b|a) P(a)
P ( x1 , x2 , x3 , x4 , x5 ) = P ( x1 ) P( x2 , x3 , x4 , x5 | x1 )
= P ( x1 ) P( x2 | x1 ) P ( x3 , x4 , x5 | x1 , x2 )
= P ( x1 ) P ( x2 | x1 ) P ( x3 | x1 , x2 ) P ( x4 , x5 | x1 , x2 , x3 )
= P ( x1 ) P ( x2 | x1 ) P ( x3 | x1 , x2 ) P ( x4 | x1 , x2 , x3 ) P ( x5 | x1 , x2 , x3 , x4 )
By the Markov property, P(x3 | x1, x2) = P(x3 | x2), and similarly for the later terms, so each sum can be pushed in past the factors that do not involve its variable:

$$\sum_{x_2, x_3, x_4, x_5} P(x_1, x_2, x_3, x_4, x_5) = P(x_1) \sum_{x_2} P(x_2 \mid x_1) \sum_{x_3} P(x_3 \mid x_2) \sum_{x_4} P(x_4 \mid x_3) \sum_{x_5} P(x_5 \mid x_4)$$
Belief propagation

Performing the marginalization by doing the partial sums is called "belief propagation".

$$\sum_{x_2, x_3, x_4, x_5} P(x_1, x_2, x_3, x_4, x_5) = P(x_1) \sum_{x_2} P(x_2 \mid x_1) \sum_{x_3} P(x_3 \mid x_2) \sum_{x_4} P(x_4 \mid x_3) \sum_{x_5} P(x_5 \mid x_4)$$

[Figure: chain x1 – x2 – x3 – x4 – x5.]

In terms of pairwise compatibility functions, the joint is

$$P(x_1, x_2, x_3, x_4, x_5) = \Phi(x_1, x_2)\, \Phi(x_2, x_3)\, \Phi(x_3, x_4)\, \Phi(x_4, x_5)$$
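As a sanity check on the partial-sums idea, here is a small numpy sketch (entirely illustrative: the chain length, state count, and random compatibilities are made up) comparing brute-force marginalization with the pushed-in sums:

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(0)
K = 4                                          # states per variable
# Pairwise compatibilities Phi(x1,x2), Phi(x2,x3), Phi(x3,x4), Phi(x4,x5)
Phi = [rng.random((K, K)) + 0.1 for _ in range(4)]

# Brute force: sum the (unnormalized) joint over x2..x5 for each x1 -- K^5 terms.
brute = np.zeros(K)
for x1, x2, x3, x4, x5 in product(range(K), repeat=5):
    brute[x1] += Phi[0][x1, x2] * Phi[1][x2, x3] * Phi[2][x3, x4] * Phi[3][x4, x5]

# Partial sums, innermost sum first: each step is one K x K matrix-vector product.
msg = np.ones(K)
for phi in reversed(Phi):
    msg = phi @ msg                            # sum out the right-hand variable of this link
print(np.allclose(brute, msg))                 # True: same marginal, ~K^2 work per link instead of K^5
```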
[Figure: now the three hidden nodes x1, x2, x3 are pairwise connected, forming a loop, with observations y1, y2, y3.]

$$\sum_{x_2} \Phi(x_2, y_2)\, \Psi(x_1, x_2) \sum_{x_3} \Phi(x_3, y_3)\, \Psi(x_2, x_3)\, \Psi(x_1, x_3)$$

With the extra factor Ψ(x1, x3), the sum over x3 depends on both x1 and x2, so the neat chain of partial sums no longer separates.
Justification for running belief propagation
in networks with loops
• Experimental results:
– Error-correcting codes: Kschischang and Frey, 1998; McEliece et al., 1998
– Vision applications: Freeman and Pasztor, 1999; Frey, 2000
• Theoretical results:
– For Gaussian processes, means are correct.
Weiss and Freeman, 1999
– Large neighborhood local maximum for MAP.
Weiss and Freeman, 2000
– Equivalent to Bethe approx. in statistical physics.
Yedidia, Freeman, and Weiss, 2000
– Tree-weighted reparameterization
Wainwright, Willsky, Jaakkola, 2001
Region marginal probabilities
$$b_i(x_i) = k\, \Phi(x_i) \prod_{j \in N(i)} M_i^j(x_i)$$

$$b_{ij}(x_i, x_j) = k\, \Psi(x_i, x_j) \prod_{l \in N(i)\setminus j} M_i^l(x_i) \prod_{m \in N(j)\setminus i} M_j^m(x_j)$$
Belief propagation equations
Belief propagation equations come from the
marginalization constraints.
[Figure: marginalizing the pairwise belief over x_j must reproduce the single-node belief at node i.]

$$M_i^j(x_i) = \sum_{x_j} \psi_{ij}(x_i, x_j) \prod_{k \in N(j)\setminus i} M_j^k(x_j)$$
Results from Bethe free energy analysis
• Fixed points of the belief propagation equations correspond to stationary points of the Bethe approximation.
• Belief propagation always has a fixed point.
• Connection with variational methods for inference: both minimize approximations to the free energy,
– variational: usually use primal variables.
– belief propagation: fixed-point equations for dual variables.
• Kikuchi approximations lead to more accurate belief
propagation algorithms.
• Other Bethe free energy minimization algorithms—
Yuille, Welling, etc.
Kikuchi message-update rules
Groups of nodes send messages to other groups of nodes.
[Figure: messages pass between groups of nodes, e.g. from a four-node cluster {i, j, k, l} to the pair {i, j}, and from the pair {i, j} to the single node i.]
(Figure from Winkler, 1995, p. 32.)
MRF nodes as patches
[Figure: the observed image is divided into image patches (nodes y_i) and the hidden scene into scene patches (nodes x_i). Φ(x_i, y_i) ties each scene patch to its image patch; Ψ(x_i, x_j) ties neighboring scene patches.]
Network joint probability
$$P(x, y) = \frac{1}{Z} \prod_{i,j} \Psi(x_i, x_j) \prod_i \Phi(x_i, y_i)$$

Here x is the scene and y is the image; Ψ(x_i, x_j) is the scene-scene compatibility function between neighboring scene nodes, and Φ(x_i, y_i) is the image-scene compatibility function linking each scene node to its local observation.
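A minimal sketch of that joint probability for a toy model (the tables, labels, and function name are my own illustration, not the lecture's code):

```python
import numpy as np

def unnormalized_joint(x, y, psi, phi, edges):
    """Product over neighboring pairs of Psi(x_i, x_j) times product over nodes of Phi(x_i, y_i).

    x, y  : integer scene labels and observations, one per node
    psi   : (K, K) scene-scene compatibility table
    phi   : (K, M) image-scene compatibility table, indexed [x_i, y_i]
    edges : list of neighboring-node pairs (i, j)
    """
    p = 1.0
    for i, j in edges:
        p *= psi[x[i], x[j]]
    for xi, yi in zip(x, y):
        p *= phi[xi, yi]
    return p   # P(x, y) is this value divided by the partition function Z

# Tiny example: a 3-node chain with 2 scene states and 2 possible observations per node.
psi = np.array([[0.9, 0.1], [0.1, 0.9]])
phi = np.array([[0.8, 0.2], [0.3, 0.7]])
print(unnormalized_joint([0, 0, 1], [0, 0, 1], psi, phi, edges=[(0, 1), (1, 2)]))
```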
In order to use MRFs:
• Given the observations y and the parameters of the MRF, how do we infer the hidden variables x?
• How do we learn the parameters of the MRF?
Outline of MRF section
• Inference in MRF’s.
– Iterated conditional modes (ICM)
– Gibbs sampling, simulated annealing
– Variational methods
– Belief propagation
– Graph cuts
• Vision applications of inference in MRF’s.
• Learning MRF parameters.
– Iterative proportional fitting (IPF)
Iterated conditional modes
• For each node:
– Condition on all the neighbors
– Find the mode
– Repeat (a sketch of the ICM loop appears below).
• Gibbs sampling:
– A way to generate random samples from a (potentially
very complicated) probability distribution.
• Simulated annealing:
– A schedule for modifying the probability distribution so
that, at “zero temperature”, you draw samples only
from the MAP solution.
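Here is a minimal sketch of the ICM loop described in the first bullet above (names and data layout are my own; it assumes a symmetric pairwise compatibility table):

```python
import numpy as np

def icm(x, phi, psi, neighbors, n_sweeps=10):
    """Iterated conditional modes for a pairwise MRF (illustrative sketch).

    x         : initial labels, shape (N,)
    phi       : (N, K) local evidence, phi[i, k] = Phi(x_i = k, y_i)
    psi       : (K, K) pairwise compatibility, assumed symmetric
    neighbors : dict mapping node index -> list of neighboring node indices
    """
    x = x.copy()
    for _ in range(n_sweeps):
        changed = False
        for i in range(len(x)):
            # Conditional score of every candidate label for node i,
            # given the current labels of its neighbors.
            scores = phi[i].copy()
            for j in neighbors[i]:
                scores = scores * psi[:, x[j]]
            best = int(np.argmax(scores))          # the conditional mode
            if best != x[i]:
                x[i], changed = best, True
        if not changed:                            # no label changed: local optimum
            break
    return x
```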
Sampling from a discretized 1-D density f(x):
• Compute the distribution function F(k) from the density function f(k).
• Draw α ~ U(0,1); for k = 1 to n, if F(k) ≥ α, break; return x = x0 + kτ.

Gibbs Sampling

$$x_1^{(t+1)} \sim \pi\!\left(x_1 \mid x_2^{(t)}, x_3^{(t)}, \ldots, x_K^{(t)}\right)$$

[Figure: successive Gibbs updates shown as axis-aligned moves in the (x1, x2) plane.]
Slide by Ce Liu
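A small Python sketch of the sampling recipe above (illustrative only; `conditional` stands for whatever model-specific conditional density you can evaluate):

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_from_density(f):
    """Draw one index from a discrete density f by the recipe above:
    build the distribution function F, draw alpha ~ U(0,1),
    and return the first k with F(k) >= alpha."""
    F = np.cumsum(f) / np.sum(f)
    alpha = rng.uniform()
    return int(np.searchsorted(F, alpha))

def gibbs_sweep(x, conditional):
    """One Gibbs sweep: resample each variable from pi(x_k | all the other variables).
    `conditional(k, x)` must return the (unnormalized) density of x_k given the rest."""
    for k in range(len(x)):
        x[k] = sample_from_density(conditional(k, x))
    return x
```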
Gibbs sampling and simulated
annealing
In simulated annealing, you gradually lower the "temperature" of the probability distribution, ultimately giving zero probability to all but the MAP estimate.
What’s good about it: finds global MAP
solution.
What’s bad about it: takes forever. Gibbs
sampling is in the inner loop…
Gibbs sampling and simulated
annealing
So you can find the mean value (MMSE
estimate) of a variable by doing Gibbs
sampling and averaging over the values that
come out of your sampler.
You can find the MAP value of a variable by
doing Gibbs sampling and gradually
lowering the temperature parameter to zero.
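As a sketch of how those two estimates might be organized in code (illustrative; `sweep` and `sweep_at_temperature` stand for a model-specific Gibbs sweep such as the one sketched earlier, and the cooling schedule is made up):

```python
import numpy as np

def mmse_by_gibbs(x0, sweep, n_burn=100, n_samples=500):
    """MMSE estimate: Gibbs-sample (one full sweep per call to `sweep`)
    and average the samples collected after a burn-in period."""
    x = np.array(x0, dtype=float)
    total = np.zeros_like(x)
    for t in range(n_burn + n_samples):
        x = sweep(x)
        if t >= n_burn:
            total += x
    return total / n_samples

def map_by_annealing(x0, sweep_at_temperature, n_sweeps=500, T0=2.0):
    """MAP estimate by simulated annealing: keep Gibbs sampling while lowering
    the temperature toward zero, so the samples concentrate on the MAP solution."""
    x = np.array(x0)
    for t in range(n_sweeps):
        T = T0 * (1.0 - t / n_sweeps) + 1e-3   # an illustrative cooling schedule
        x = sweep_at_temperature(x, T)
    return x
```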
Outline of MRF section
• Inference in MRF’s.
– Iterated conditional modes (ICM)
– Gibbs sampling, simulated annealing
– Variational methods
– Belief propagation
– Graph cuts
• Vision applications of inference in MRF’s.
• Learning MRF parameters.
– Iterative proportional fitting (IPF)
Variational methods
• Reference: Tommi Jaakkola’s tutorial on
variational methods,
http://www.ai.mit.edu/people/tommi/
• Example: mean field
– For each node
• Calculate the expected value of the node,
conditioned on the mean values of the neighbors.
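A minimal sketch of a mean-field update for a discrete pairwise MRF (my own illustration of the idea on this slide; for discrete states the "mean value" of a neighbor becomes its current marginal q):

```python
import numpy as np

def mean_field(phi, psi, neighbors, n_iters=50):
    """Mean-field inference for a discrete pairwise MRF (illustrative sketch).

    phi       : (N, K) local evidence, phi[i, k] = Phi(x_i = k, y_i)
    psi       : (K, K) pairwise compatibility
    neighbors : dict mapping node index -> list of neighboring node indices
    Returns q : (N, K) approximate marginals, one per node.
    """
    N, K = phi.shape
    q = np.full((N, K), 1.0 / K)           # start from uniform beliefs
    log_psi = np.log(psi)
    for _ in range(n_iters):
        for i in range(N):
            # Expected log-compatibility with each neighbor under its current marginal.
            s = np.log(phi[i])
            for j in neighbors[i]:
                s = s + log_psi @ q[j]
            q[i] = np.exp(s - s.max())     # subtract max for numerical stability
            q[i] /= q[i].sum()
    return q
```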
Outline of MRF section
• Inference in MRF’s.
– Iterated conditional modes (ICM)
– Gibbs sampling, simulated annealing
– Variational methods
– Belief propagation
– Graph cuts
• Vision applications of inference in MRF’s.
• Learning MRF parameters.
– Iterative proportional fitting (IPF)
Graph cuts
• Algorithm: uses node label swaps or expansions as moves to reduce the energy. Swaps many labels at once, not just one at a time as with ICM.
• Find which pixel labels to swap using min cut/max
flow algorithms from network theory.
• Can offer bounds on optimality.
• See Boykov, Veksler, Zabih, IEEE PAMI 23 (11)
Nov. 2001 (available on web).
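The label swaps and expansions above are built on a binary subproblem solved exactly by min cut / max flow. Here is a sketch of that classic binary construction (in the spirit of Greig, Porteous and Seheult), using networkx as an assumed dependency; it illustrates the reduction, not the Boykov-Veksler-Zabih code:

```python
import networkx as nx

def binary_mrf_mincut(unary, pairwise_weight, edges):
    """Exact MAP for a binary MRF with energy
        E(x) = sum_i unary[i][x_i] + pairwise_weight * sum_(i,j) [x_i != x_j]
    by reduction to an s-t minimum cut (illustrative sketch)."""
    G = nx.DiGraph()
    for i, (cost0, cost1) in enumerate(unary):
        G.add_edge('s', i, capacity=cost1)   # this edge is cut when node i takes label 1
        G.add_edge(i, 't', capacity=cost0)   # this edge is cut when node i takes label 0
    for i, j in edges:
        G.add_edge(i, j, capacity=pairwise_weight)
        G.add_edge(j, i, capacity=pairwise_weight)
    cut_value, (source_side, sink_side) = nx.minimum_cut(G, 's', 't')
    labels = [1 if i in sink_side else 0 for i in range(len(unary))]
    return labels, cut_value

# Tiny chain: node 0 prefers label 0, node 2 prefers label 1, node 1 is indifferent.
labels, energy = binary_mrf_mincut(
    unary=[(0.1, 2.0), (1.0, 1.0), (2.0, 0.1)],
    pairwise_weight=0.5,
    edges=[(0, 1), (1, 2)])
print(labels, energy)
```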
Comparison of graph cuts and belief
propagation
Comparison of Graph Cuts with Belief
Propagation for Stereo, using Identical
MRF Parameters, ICCV 2003.
Marshall F. Tappen and William T. Freeman
Ground truth, graph cuts, and belief
propagation disparity solution energies
Graph cuts versus belief propagation
• Graph cuts consistently gave slightly lower-energy solutions for that stereo-problem MRF, although BP ran faster; there is now a faster graph cuts implementation than the one we used…
• However, here’s why I still use Belief
Propagation:
– Works for any compatibility functions, not a restricted
set like graph cuts.
– I find it very intuitive.
– Extensions: sum-product algorithm computes MMSE,
and Generalized Belief Propagation gives you very
accurate solutions, at a cost of time.
MAP versus MMSE
Show program comparing some
methods on a simple MRF
testMRF.m
Outline of MRF section
• Inference in MRF’s.
– Gibbs sampling, simulated annealing
– Iterated conditional modes (ICM)
– Variational methods
– Belief propagation
– Graph cuts
• Applications of inference in MRF’s.
• Learning MRF parameters.
– Iterative proportional fitting (IPF)
Applications of MRF’s
• Stereo
• Motion estimation
• Labelling shading and reflectance
• Many others…
Applications of MRF’s
• Stereo
• Motion estimation
• Labelling shading and reflectance
• Many others…
Motion application
[Figure: the same patch-based MRF, now for motion. Image patches are the observations; the corresponding scene patches hold the local motion to be estimated.]
What behavior should we see in a
motion algorithm?
• Aperture problem
• Resolution through propagation of
information
• Figure/ground discrimination
The aperture problem
The aperture problem
Program demo
Motion analysis: related work
• Markov network
– Luettgen, Karl, Willsky and collaborators.
• Neural network or learning-based
– Nowlan & T. J. Sejnowski; Sereno.
• Optical flow analysis
– Weiss & Adelson; Darrell & Pentland; Ju,
Black & Jepson; Simoncelli; Grzywacz &
Yuille; Hildreth; Horn & Schunck; etc.
Inference: Motion estimation results
(maxima of scene probability distributions displayed)
Image data
Iterations 0 and 1
Initial guesses only
show motion at edges.
Motion estimation results
(maxima of scene probability distributions displayed)
Iterations 2 and 3
Figure/ground still
unresolved here.
Motion estimation results
(maxima of scene probability distributions displayed)
Iterations 4 and 5
[Figure: scene and image.]
Add a reflectance pattern to the surface. Points inside the squares should reflect less light.
Goal
Results without
considering gray-scale
Some Areas of the Image Are
Locally Ambiguous
[Figure: an input patch that could be explained either as shading or as reflectance.]
Propagating Information
• Can disambiguate areas by propagating
information from reliable areas of the image
into ambiguous areas of the image
Propagating Information
• Consider relationship between
neighboring derivatives
[Figure: observed marginal distributions, an initial guess at the joint probability, the IPF update equation, and the true joint probability.]
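As a concrete reading of the IPF update, here is a small numpy sketch (the target marginals and variable names are made up): a joint table is repeatedly rescaled until its clique marginals match the observed ones.

```python
import numpy as np

def ipf_fit(target_marginals, shape, n_iters=50):
    """Iterative proportional fitting (a sketch, not the lecture's demo code).

    Starts from a uniform joint table and repeatedly rescales it so that its marginal
    over each clique matches the observed one:
        P <- P * observed_marginal(clique) / current_marginal(clique)

    target_marginals: dict mapping a tuple of variable indices (i, j), with i < j,
                      to the observed marginal table over those variables
    shape           : number of states of each variable, e.g. (2, 2, 2)
    """
    P = np.ones(shape)
    P /= P.sum()
    axes = tuple(range(len(shape)))
    for _ in range(n_iters):
        for clique, tgt in target_marginals.items():
            other = tuple(a for a in axes if a not in clique)
            cur = P.sum(axis=other)                                  # current clique marginal
            ratio = np.divide(tgt, cur, out=np.zeros_like(cur), where=cur > 0)
            # Broadcast the ratio back over the full joint table.
            idx = [np.newaxis] * len(shape)
            for a in clique:
                idx[a] = slice(None)
            P = P * ratio[tuple(idx)]
    return P

# Tiny usage: three binary variables, observed pairwise marginals for (0,1) and (1,2).
m01 = np.array([[0.4, 0.1], [0.1, 0.4]])
m12 = np.array([[0.35, 0.15], [0.15, 0.35]])
P = ipf_fit({(0, 1): m01, (1, 2): m12}, shape=(2, 2, 2))
print(P.sum(axis=2))        # close to m01 after fitting
```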
http://research.microsoft.com/vision/Cambridge/papers/siggraph04.pdf
end