11 Probabilistic Temporal Models

Midterm

▪ Request regrading on Gradescope by this Saturday


Announcement
▪ Homework 4
▪ Due: Nov. 18, 11:59pm
▪ Programming Assignment 4
▪ Due: Nov. 25, 11:59pm
Probabilistic Reasoning over Time

AIMA Chapter 15

[Adapted from slides by Dan Klein and Pieter Abbeel at UC Berkeley]


Uncertainty and Time

▪ Often, we want to reason about a sequence of observations


▪ Speech recognition
▪ Robot localization
▪ Medical monitoring
▪ User attention

▪ Need to introduce time into our models


Markov Models
Markov Models (aka Markov chain/process)
▪ Assume discrete variables that share the same finite domain
▪ Values in the domain are called states

X0 X1 X2 X3
P(X0) P(Xt | Xt-1)

▪ The transition model P(Xt | Xt-1) specifies how the state evolves
over time
▪ Stationarity assumption: same transition probabilities at all time
steps
▪ Joint distribution: P(X0, …, XT) = P(X0) ∏t P(Xt | Xt-1)
Quiz: are Markov models a special case of Bayes nets?

X0 X1 X2 X3

▪ Yes and no!


▪ Yes:
▪ Directed acyclic graph, joint = product of conditionals
▪ No:
▪ Infinitely many variables (unless we truncate)
▪ Repetition of transition model not part of standard Bayes net syntax
Markov Assumption: Conditional Independence

▪ Markov assumption: Xt+1, … are independent of X0, …, Xt-1 given Xt
▪ Past and future independent given the present
▪ Each time step only depends on the previous
▪ This is a first-order Markov model
▪ A kth-order model allows dependencies on k earlier steps
Example: Weather
▪ States {rain, sun}

▪ Initial distribution P(X0)


P(X0)
sun rain
0.5 0.5
Two new ways of representing the same CPT
▪ Transition model P(Xt | Xt-1)

As a conditional probability table:
Xt-1    P(Xt = sun | Xt-1)   P(Xt = rain | Xt-1)
sun     0.9                  0.1
rain    0.3                  0.7

As a state-transition diagram:
sun → sun 0.9,  sun → rain 0.1,  rain → sun 0.3,  rain → rain 0.7
Weather prediction
▪ Time 0: <0.5, 0.5>

Transition model P(Xt | Xt-1):
Xt-1    sun    rain
sun     0.9    0.1
rain    0.3    0.7

▪ What is the weather like at time 1?


▪ P(X1) = x0 P(X1,X0=x0)
▪ = x0 P(X0=x0) P(X1| X0=x0)
▪ = 0.5<0.9,0.1> + 0.5<0.3,0.7> = <0.6,0.4>
Weather prediction, contd.
▪ Time 1: <0.6, 0.4>

Transition model P(Xt | Xt-1):
Xt-1    sun    rain
sun     0.9    0.1
rain    0.3    0.7

▪ What is the weather like at time 2?


▪ P(X2) = x1 P(X2,X1=x1)
▪ = x1 P(X1=x1) P(X2| X1=x1)
▪ = 0.6<0.9,0.1> + 0.4<0.3,0.7> = <0.66,0.34>
Weather prediction, contd.
▪ Time 2: <0.66, 0.34>

Transition model P(Xt | Xt-1):
Xt-1    sun    rain
sun     0.9    0.1
rain    0.3    0.7

▪ What is the weather like at time 3?


▪ P(X3) = x2 P(X3,X2=x2)
▪ = x2 P(X2=x2) P(X3| X2=x2)
▪ = 0.66<0.9,0.1> + 0.34<0.3,0.7> = <0.696,0.304>
Forward algorithm (simple form)
▪ What is the state at time t (given an initial distribution P(X0))?
▪ P(Xt) = xt-1 P(Xt,Xt-1=xt-1)
▪ = xt-1 P(Xt-1=xt-1) P(Xt| Xt-1=xt-1)

Probability from
Transition model
previous iteration

▪ Iterate this update starting at t=0
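
As a concrete illustration, here is a minimal Python sketch of this update; the transition table is the sun/rain weather model used on the following slides, and all names are my own:

```python
# Mini-forward update: P(X_t) = sum over x_{t-1} of P(X_{t-1} = x_{t-1}) * P(X_t | X_{t-1} = x_{t-1})
STATES = ["sun", "rain"]
TRANSITION = {  # TRANSITION[prev][nxt] = P(X_t = nxt | X_{t-1} = prev)
    "sun":  {"sun": 0.9, "rain": 0.1},
    "rain": {"sun": 0.3, "rain": 0.7},
}

def predict(belief):
    """Push a distribution over X_{t-1} through the transition model to get P(X_t)."""
    return {s: sum(belief[p] * TRANSITION[p][s] for p in STATES) for s in STATES}

belief = {"sun": 0.5, "rain": 0.5}  # P(X_0)
for t in range(1, 4):
    belief = predict(belief)
    print(t, belief)  # t=1: <0.6, 0.4>, t=2: <0.66, 0.34>, t=3: <0.696, 0.304>
```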


Example Run of Mini-Forward Algorithm
Transition model:
Xt-1    Xt      P(Xt|Xt-1)
sun     sun     0.9
sun     rain    0.1
rain    sun     0.3
rain    rain    0.7

▪ From initial observation of sun:   P(X0), P(X1), P(X2), P(X3), …, P(X∞)
▪ From initial observation of rain:  P(X0), P(X1), P(X2), P(X3), …, P(X∞)
▪ From yet another initial distribution P(X0):   P(X0), …, P(X∞)

(The plotted distributions converge to the same limit regardless of the initial distribution.)
Stationary Distributions

▪ For most chains:
▪ Influence of the initial distribution gets less and less over time
▪ The distribution we end up in is independent of the initial distribution
▪ Stationary distribution:
▪ The distribution we end up with is called the stationary distribution P∞ of the chain
▪ It satisfies P∞(X) = Σx P(X | x) P∞(x)
Example: Stationary Distributions
▪ Computing the stationary distribution

X0 X1 X2 X3

Transition model:
Xt-1    Xt      P(Xt|Xt-1)
sun     sun     0.9
sun     rain    0.1
rain    sun     0.3
rain    rain    0.7

Also: the stationary probabilities must sum to 1 (see the worked computation below).
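
The computation itself was shown graphically on the slide; a short worked version of it, writing P∞ for the stationary distribution, is:
P∞(sun) = 0.9 P∞(sun) + 0.3 P∞(rain)
P∞(rain) = 0.1 P∞(sun) + 0.7 P∞(rain)
Together with P∞(sun) + P∞(rain) = 1, this gives P∞(sun) = 0.75 and P∞(rain) = 0.25.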
Application of Stationary Distribution: Web Link Analysis

▪ Web browsing
▪ Each web page is a state
▪ Initial distribution: uniform over pages
▪ Transitions:
▪ With prob. c, uniform jump to a random page
▪ With prob. 1-c, follow a random outlink
▪ Stationary distribution: PageRank
▪ Will spend more time on highly reachable pages
▪ Google 1.0 returned the set of pages containing all
your keywords in decreasing rank
▪ Now: use link analysis along with many other factors
(rank actually getting less important)
Application of Stationary Distributions: Gibbs Sampling

▪ Each joint instantiation over all hidden and


query variables is a state: {X1, …, Xn} = H ∪ Q
▪ Transitions:
▪ Pick a variable and resample its value conditioned
on its Markov blanket
▪ Stationary distribution:
▪ Conditional distribution P(X1, X2 , … , Xn|e1, …, em)
▪ When running Gibbs sampling long enough, we
get a sample from the desired distribution
Hidden Markov Models
Hidden Markov Models
▪ Usually the true state is not observed directly
▪ E.g., you stay indoors and cannot see the weather,
but you can see whether people come in with umbrellas.
▪ Hidden Markov models (HMMs)
▪ Underlying Markov chain over states X
▪ You observe evidence E at each time step

X0 X1 X2 X3 …

E1 E2 E3 …
Example: Weather HMM
▪ An HMM is defined by:
▪ Initial distribution: P(X0)
▪ Transition model: P(Xt | Xt-1)
▪ Emission model: P(Et | Xt)

Transition model P(Wt | Wt-1):
Wt-1    sun    rain
sun     0.9    0.1
rain    0.3    0.7

Emission model P(Ut | Wt):
Wt      true   false
sun     0.2    0.8
rain    0.9    0.1

Weathert-1 → Weathert → Weathert+1
Umbrellat-1   Umbrellat   Umbrellat+1

HMM as probability model
▪ Joint distribution for Markov model: P(X0, …, XT) = P(X0) ∏t=1:T P(Xt | Xt-1)
▪ Joint distribution for hidden Markov model:
P(X0, X1, E1, …, XT, ET) = P(X0) ∏t=1:T P(Xt | Xt-1) P(Et | Xt)
▪ Independence in HMM
▪ Future states are independent of the past given the present
▪ Current evidence is independent of everything else given the current state

X0 X1 X2 X3 …

E1 E2 E3 …
Real HMM Examples
▪ Speech recognition HMMs:
▪ Observations are acoustic signals (continuous valued)
▪ States are specific positions in specific words (so, tens of thousands)
▪ Machine translation HMMs:
▪ Observations are words (tens of thousands)
▪ States are translation options
▪ Robot tracking:
▪ Observations are range readings (continuous)
▪ States are positions on a map (continuous)
▪ Molecular biology:
▪ Observations are nucleotides ACGT
▪ States are coding/non-coding/start/stop/splice-site etc.
Inference tasks

▪ Useful notation: Xa:b = Xa , Xa+1, …, Xb


▪ Filtering: P(Xt|e1:t)
▪ belief state — posterior distribution over the most recent state given all evidence
▪ Ex: robot localization
▪ Prediction: P(Xt+k|e1:t) for k > 0
▪ posterior distribution over a future state given all evidence
▪ Smoothing: P(Xk|e1:t) for 0 ≤ k < t
▪ posterior distribution over a past state given all evidence
▪ Most likely explanation: arg maxx0:t P(x0:t | e1:t)
▪ Ex: speech recognition, decoding with a noisy channel
Filtering
▪ Filtering: infer current state given all evidence
▪ Aim: a recursive filtering algorithm of the form
▪ P(Xt+1|e1:t+1) = g(et+1, P(Xt|e1:t) )
Apply Bayes’ rule

▪ P(Xt+1|e1:t+1) = P(Xt+1|e1:t, et+1)


▪ = α P(et+1|Xt+1, e1:t) P(Xt+1| e1:t)

α = 1 / P(et+1|e1:t)
Filtering
▪ Filtering: infer current state given all evidence
▪ Aim: a recursive filtering algorithm of the form
▪ P(Xt+1|e1:t+1) = g(et+1, P(Xt|e1:t) )

▪ P(Xt+1|e1:t+1) = P(Xt+1|e1:t, et+1) Apply conditional independence

▪ = α P(et+1|Xt+1, e1:t) P(Xt+1| e1:t)


▪ = α P(et+1|Xt+1) P(Xt+1| e1:t)

Normalize Update Predict


Filtering
▪ Filtering: infer current state given all evidence
▪ Aim: a recursive filtering algorithm of the form
▪ P(Xt+1|e1:t+1) = g(et+1, P(Xt|e1:t) )

▪ P(Xt+1|e1:t+1) = P(Xt+1|e1:t, et+1)


▪ = α P(et+1|Xt+1, e1:t) P(Xt+1| e1:t)    Condition on Xt

▪ = α P(et+1|Xt+1) P(Xt+1| e1:t)


▪ = α P(et+1|Xt+1) xt P(xt | e1:t) P(Xt+1| xt, e1:t)
Filtering
▪ Filtering: infer current state given all evidence
▪ Aim: a recursive filtering algorithm of the form
▪ P(Xt+1|e1:t+1) = g(et+1, P(Xt|e1:t) )

▪ P(Xt+1|e1:t+1) = P(Xt+1|e1:t, et+1)


▪ = α P(et+1|Xt+1, e1:t) P(Xt+1| e1:t)
▪ = α P(et+1|Xt+1) P(Xt+1| e1:t) Apply conditional
independence

▪ = α P(et+1|Xt+1) xt P(xt | e1:t) P(Xt+1| xt, e1:t)


▪ = α P(et+1|Xt+1) x P(xt | e1:t) P(Xt+1| xt)
t
Filtering
▪ P(Xt+1|e1:t+1) = α P(et+1|Xt+1) x P(xt | e1:t) P(Xt+1| xt)
t

Normalize Update Predict

Forward algorithm
▪ P(Xt+1|e1:t+1) = α P(et+1|Xt+1) x P(xt | e1:t) P(Xt+1| xt)
t

Normalize Update Predict

▪ f1:t+1 = FORWARD(f1:t , et+1)


▪ We start with f1:0 = P(X0) and then iterate
▪ Cost per time step: O(|X|^2) where |X| is the number of states
Example: Weather HMM
Predict:
P(s) = 0.5 × 0.9 + 0.5 × 0.3 = 0.6
P(r) = 0.5 × 0.1 + 0.5 × 0.7 = 0.4
Update & normalize (with Umbrella1 = true):
P(s|u) ∝ 0.6 × 0.2 = 0.12
P(r|u) ∝ 0.4 × 0.9 = 0.36
f(sun) = 0.5, f(rain) = 0.5   →   f(sun) = 0.25, f(rain) = 0.75

Weather0 → Weather1, with Umbrella1 observed

Initial distribution P(W0):
sun    rain
0.5    0.5

Transition model P(Wt | Wt-1):
Wt-1    sun    rain
sun     0.9    0.1
rain    0.3    0.7

Emission model P(Ut | Wt):
Wt      true   false
sun     0.2    0.8
rain    0.9    0.1
Example: Weather HMM

Weather0:            f(sun) = 0.5,   f(rain) = 0.5
  predict            →  <0.6, 0.4>
  update & normalize →  Weather1:  f(sun) = 0.25,  f(rain) = 0.75
  predict            →  <0.45, 0.55>
  update & normalize →  Weather2:  f(sun) = 0.154, f(rain) = 0.846

Weather0 → Weather1 → Weather2 → …, with Umbrella1, Umbrella2, … observed

(Same initial distribution P(W0), transition model P(Wt|Wt-1), and emission model P(Ut|Wt) as on the previous slide.)
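
A minimal Python sketch of one FORWARD step (predict, update with the emission model, normalize); assuming the umbrella is observed (true) on both days, which matches the numbers shown above, and with names of my own choosing:

```python
# One forward step: f_{1:t+1} = alpha * P(e_{t+1} | X_{t+1}) * sum over x_t of P(X_{t+1} | x_t) * f_{1:t}[x_t]
STATES = ["sun", "rain"]
TRANSITION = {"sun": {"sun": 0.9, "rain": 0.1}, "rain": {"sun": 0.3, "rain": 0.7}}
EMISSION = {"sun": {True: 0.2, False: 0.8}, "rain": {True: 0.9, False: 0.1}}  # P(Umbrella | Weather)

def forward(f, umbrella):
    """Predict with the transition model, weight by the evidence, then normalize."""
    predicted = {s: sum(f[p] * TRANSITION[p][s] for p in STATES) for s in STATES}  # predict
    updated = {s: EMISSION[s][umbrella] * predicted[s] for s in STATES}            # update
    z = sum(updated.values())
    return {s: v / z for s, v in updated.items()}                                  # normalize

f = {"sun": 0.5, "rain": 0.5}  # f_{1:0} = P(W_0)
for u in [True, True]:         # umbrella observed at t = 1 and t = 2
    f = forward(f, u)
    print(f)  # step 1: {'sun': 0.25, 'rain': 0.75}; step 2: {'sun': ~0.154, 'rain': ~0.846}
```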
Forward algorithm
▪ P(Xt+1|e1:t+1) = α P(et+1|Xt+1) x P(xt | e1:t) P(Xt+1| xt)
t

Normalize Update Predict

α is just a normalization constant. So if we only want to compute P(Xt | e1:t), we can skip the
normalization when computing P(X1 | e1), P(X2 | e1:2), …, P(Xt-1 | e1:t-1) and normalize once at the end.

Q: How is the algorithm related to variable elimination?


Another view of the algorithm
▪ State trellis: graph of states and transitions over time
sun sun sun sun

rain rain rain rain

X0 X1 … XT

▪ Each arc represents some transition xt-1 → xt


▪ Each arc has weight P(xt | xt-1) P(et | xt) (arcs to initial states have weight P(x0) )
▪ Each path is a sequence of states
▪ The product of weights on a path is proportional to that state sequence’s probability
P(x0) t P(xt | xt-1) P(et | xt) = P(x1:t , e1:t)  P(x1:t | e1:t)
Another view of the algorithm

sun sun sun sun

rain rain rain rain

X0 X1 … Xt+1
• Forward algorithm computes sum over all possible paths
P(xt+1|e1:t+1) = x0:t P(x0:t+1 | e1:t+1)
• It uses dynamic programming to sum over all paths
• For each state at time t, keep track of the total probability of all paths to it
f1:t+1 = FORWARD(f1:t, et+1)
       = α P(et+1|Xt+1) Σxt P(Xt+1 | xt) f1:t[xt]
Most Likely Explanation
Inference tasks
▪ Filtering: P(Xt|e1:t)
▪ belief state—input to the decision process of a rational agent
▪ Prediction: P(Xt+k|e1:t) for k > 0
▪ evaluation of possible action sequences; like filtering without the evidence
▪ Smoothing: P(Xk|e1:t) for 0 ≤ k < t
▪ better estimate of past states, essential for learning
▪ Most likely explanation: arg maxx0:t P(x0:t | e1:t)
▪ speech recognition, decoding with a noisy channel
Most likely explanation = most probable path
▪ State trellis: graph of states and transitions over time
sun sun sun sun

rain rain rain rain

X0 X1 … XT

▪ The product of weights on a path is proportional to that state sequence’s probability


P(x0) t P(xt | xt-1) P(et | xt) = P(x0:t , e1:t)  P(x0:t | e1:t)
▪ Viterbi algorithm computes best paths
arg maxx0:t P(x0:t | e1:t)
Forward / Viterbi algorithms
sun sun sun sun

rain rain rain rain

X0 X1 … XT
Viterbi Algorithm (max):
For each state at time t, keep track of the (unnormalized) maximum probability of any path to it:
m1:t+1(xt+1) = maxx1:t P(x1:t+1 | e1:t+1)
m1:t+1 = VITERBI(m1:t, et+1)
       = P(et+1|Xt+1) maxxt P(Xt+1 | xt) m1:t[xt]

Forward Algorithm (sum):
For each state at time t, keep track of the total probability of all paths to it:
f1:t+1(xt+1) = P(xt+1 | e1:t+1) = Σx1:t P(x1:t+1 | e1:t+1)
f1:t+1 = FORWARD(f1:t, et+1)
       = α P(et+1|Xt+1) Σxt P(Xt+1 | xt) f1:t[xt]
Viterbi algorithm contd.
P(W0):
sun    rain
0.5    0.5

Transition model P(Wt | Wt-1):
Wt-1    sun    rain
sun     0.9    0.1
rain    0.3    0.7

Emission model P(Ut | Wt):
Wt      true   false
sun     0.2    0.8
rain    0.9    0.1

Trellis (states sun/rain at X0, X1, X2, X3; evidence U1=true, U2=false, U3=true):
        X0      X1
sun     0.5     0.09
rain    0.5

▪ m1:t+1 = P(et+1|Xt+1) maxxt P(Xt+1 | xt) m1:t[xt]

m1:1(sun) = 0.2 × max(0.9 × 0.5, 0.3 × 0.5) = 0.09


Viterbi algorithm contd.
P(W0):
sun    rain
0.5    0.5

Transition model P(Wt | Wt-1):
Wt-1    sun    rain
sun     0.9    0.1
rain    0.3    0.7

Emission model P(Ut | Wt):
Wt      true   false
sun     0.2    0.8
rain    0.9    0.1

Trellis values (evidence U1=true, U2=false, U3=true):
        X0      X1      X2      X3
sun     0.5     0.09    0.076   0.0136080
rain    0.5     0.315   0.022   0.0138495

▪ m1:t+1 = P(et+1|Xt+1) maxxt P(Xt+1 | xt) m1:t[xt]

▪ Time complexity: O(|X|^2 T)


▪ Space complexity: O(|X| T)
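
A minimal Python sketch of the Viterbi recursion on the same model (max in place of sum, with backpointers to recover the best path); it reproduces the unnormalized trellis values above, and all names are my own:

```python
# Viterbi: m_{1:t+1}(x) = P(e_{t+1} | x) * max over x_t of P(x | x_t) * m_{1:t}(x_t)
STATES = ["sun", "rain"]
TRANSITION = {"sun": {"sun": 0.9, "rain": 0.1}, "rain": {"sun": 0.3, "rain": 0.7}}
EMISSION = {"sun": {True: 0.2, False: 0.8}, "rain": {True: 0.9, False: 0.1}}

def viterbi(evidence, prior):
    """Return the most likely state sequence x_1..x_T and its (unnormalized) score."""
    m = dict(prior)  # m_{1:0} = P(X_0)
    back = []        # back[t][s] = best predecessor of state s at time t+1
    for e in evidence:
        step, ptr = {}, {}
        for s in STATES:
            best_prev = max(STATES, key=lambda p: TRANSITION[p][s] * m[p])
            ptr[s] = best_prev
            step[s] = EMISSION[s][e] * TRANSITION[best_prev][s] * m[best_prev]
        back.append(ptr)
        m = step
        print(m)  # t=1: sun 0.09, rain 0.315; t=2: sun 0.0756, rain ~0.022; ...
    path = [max(STATES, key=lambda s: m[s])]  # best final state
    for ptr in reversed(back[1:]):            # follow backpointers to recover the path
        path.append(ptr[path[-1]])
    path.reverse()
    return path, m[path[-1]]

print(viterbi([True, False, True], {"sun": 0.5, "rain": 0.5}))
```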
Dynamic Bayes Nets
Dynamic Bayes Nets (DBNs)
▪ We want to track multiple variables over time, using
multiple sources of evidence
▪ Idea: Repeat a fixed Bayes net structure at each time
▪ Variables from time t can condition on those from t-1
t =1 t =2 t =3

G 1a G 2a G 3a

G 1b G 2b G 3b

E1a E1b E2a E2b E3a E3b


DBNs and HMMs
▪ Every HMM is a DBN
▪ Every discrete DBN can be represented by an HMM
▪ Each HMM state is the Cartesian product of the DBN state variables
▪ E.g., 3 binary state variables => one state variable with 2^3 = 8 possible values
▪ Advantage of DBN vs. HMM?
▪ Sparse dependencies => exponentially fewer parameters
▪ E.g., 20 binary state variables, 2 parents each;
DBN has 20 × 2^(2+1) = 160 parameters, HMM has 2^20 × 2^20 ≈ 10^12 parameters
Exact Inference in DBNs
▪ Variable elimination applies to dynamic Bayes nets
▪ Offline: “unroll” the network for T time steps, then eliminate variables to find P(XT|e1:T)
▪ Problem: results in very large BN
t =1 t =2 t =3

G 1a G 2a G 3a
G 1b G 2b G 3b

E1a E1b E2a E2b E3a E3b

▪ Can we do better?
▪ Do we need to unroll for many steps? What is the best variable order of elimination?
▪ Online: unroll as we go, eliminate all variables from the previous time step
▪ A generalization of the Forward algorithm
Particle Filtering
Large state space
▪ When |X| is huge (e.g., position in a building), exact inference becomes
infeasible
▪ Can we use approximate inference, e.g., likelihood weighting?
▪ Evidence variables are “downstream” of the states
▪ Likelihood weighting ignores the evidence when sampling the states, so as more states are
sampled over time the weights drop quickly (the samples drift into low-probability regions)
▪ Hence: too few “reasonable” samples

X0 X1 X2 X3

E1 E2 E3
Particle Filtering

▪ Represent the belief state at each step by a set of samples
▪ Samples are called particles
▪ Our representation of P(X) is now a list of N particles (samples)
▪ P(x) is approximated by the fraction of particles with value x
▪ So, many x may have P(x) = 0
▪ Generally, N << |X|
▪ More particles, more accuracy; but a very large N would defeat the point

Example grid of approximate probabilities:
0.0   0.1   0.0
0.0   0.0   0.2
0.0   0.2   0.5
Representation: Particles

▪ Initialization
▪ sample N particles from the initial distribution P(X0)
▪ All particles have a weight of 1

Particles:
(3,3)
(2,3)
(3,3)
(3,2)
(3,3)
(3,2)
(1,2)
(3,3)
(3,3)
(2,3)
Particle Filtering: Propagate forward

▪ Each particle is moved by sampling its next position from the transition model:
▪ xt+1 ~ P(Xt+1 | xt)
▪ This captures the passage of time
▪ If enough samples, close to the exact probabilities (consistent)

Particles (before): (3,3) (2,3) (3,3) (3,2) (3,3) (3,2) (1,2) (3,3) (3,3) (2,3)
Particles (after):  (3,2) (2,3) (3,2) (3,1) (3,3) (3,2) (1,3) (2,3) (3,2) (2,2)
Particle Filtering: Observe

▪ Similar to likelihood weighting, weight samples based on the evidence
▪ w = P(et | xt)
▪ Particles that fit the evidence better get higher weights, others get lower weights
▪ What happens if we repeat the Propagate-Observe procedure over time?
▪ It is exactly likelihood weighting (if we multiply the weights)
▪ Weights drop quickly…

Particles (unweighted): (3,2) (2,3) (3,2) (3,1) (3,3) (3,2) (1,3) (2,3) (3,2) (2,2)
Particles (weighted):   (3,2) w=.9, (2,3) w=.2, (3,2) w=.9, (3,1) w=.4, (3,3) w=.4,
                        (3,2) w=.9, (1,3) w=.1, (2,3) w=.2, (3,2) w=.9, (2,2) w=.4
Particle Filtering: Resample

▪ Rather than tracking weighted samples, we resample
▪ Generate N new samples from our weighted samples
▪ Each new sample is selected from the current population of samples; the probability of
picking a sample is proportional to its weight
▪ The new samples have weight of 1
▪ Now the update is complete for this time step; continue with the next one

Particles (weighted):  (3,2) w=.9, (2,3) w=.2, (3,2) w=.9, (3,1) w=.4, (3,3) w=.4,
                       (3,2) w=.9, (1,3) w=.1, (2,3) w=.2, (3,2) w=.9, (2,2) w=.4
(New) Particles:       (3,2) (2,2) (3,2) (2,3) (3,3) (3,2) (1,3) (2,3) (3,2) (3,2)
Summary: Particle Filtering
▪ Particles: track samples of states rather than an explicit distribution
Propagate forward Weight Resample

≈ P(Xt | e1:t)

Particles: Particles: Particles: (New) Particles:


(3,3) (3,2) (3,2) w=.9 (3,2)
(2,3) (2,3) (2,3) w=.2 (2,2)
(3,3) (3,2) (3,2) w=.9 (3,2)
(3,2) (3,1) (3,1) w=.4 (2,3)
(3,3) (3,3) (3,3) w=.4 (3,3)
(3,2) (3,2) (3,2) w=.9 (3,2)
(1,2) (1,3) (1,3) w=.1 (1,3)
(3,3) (2,3) (2,3) w=.2 (2,3)
(3,3) (3,2) (3,2) w=.9 (3,2)
(2,3) (2,2) (2,2) w=.4 (3,2)

Consistency: see proof in AIMA Ch. 15
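
A minimal Python sketch of one particle-filtering update (propagate forward, weight, resample); to keep it self-contained it reuses the two-state umbrella HMM from the filtering slides instead of the grid example, and all names are my own:

```python
import random

STATES = ["sun", "rain"]
TRANSITION = {"sun": {"sun": 0.9, "rain": 0.1}, "rain": {"sun": 0.3, "rain": 0.7}}
EMISSION = {"sun": {True: 0.2, False: 0.8}, "rain": {True: 0.9, False: 0.1}}  # P(Umbrella | Weather)

def particle_filter_step(particles, umbrella):
    # Propagate forward: move each particle by sampling from the transition model
    moved = [random.choices(STATES, weights=[TRANSITION[x][s] for s in STATES])[0]
             for x in particles]
    # Weight: score each particle by how well it explains the evidence
    weights = [EMISSION[x][umbrella] for x in moved]
    # Resample: draw N new unweighted particles with probability proportional to weight
    return random.choices(moved, weights=weights, k=len(moved))

N = 10_000
particles = random.choices(STATES, k=N)  # sample N particles from the uniform P(X_0)
for u in [True, True]:                   # umbrella observed on both days
    particles = particle_filter_step(particles, u)
print(particles.count("sun") / N)        # approximates P(sun | e_1:2); exact value ~0.154
```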


Robot Localization
▪ In robot localization:
▪ We know the map, but not the robot’s position
▪ Observations may be vectors of range finder readings
▪ State space and readings are typically continuous so we
cannot usually represent or compute an exact posterior
▪ Particle filtering is a main technique
Particle Filter Localization (Sonar)

[Dieter Fox, et al.]


Particle Filter Localization (Laser)

[Dieter Fox, et al.]


Robot Mapping
▪ SLAM: Simultaneous Localization And Mapping
▪ We do not know the map or our location
▪ State consists of position AND map!
▪ Main techniques: Kalman filtering (Gaussian HMMs)
and particle methods

DP-SLAM, Ron Parr


Particle Filter SLAM – Video 1

[Sebastian Thrun, et al.]


Particle Filter SLAM – Video 2

[Dirk Haehnel, et al.]


Summary
▪ Probabilistic temporal models
▪ Markov model
▪ Hidden Markov model
▪ Filtering: forward algorithm
▪ MLE: Viterbi algorithm
▪ Dynamic Bayesian network
▪ Approximate inference by particle filtering
