
CS 461: Artificial Intelligence

Hidden Markov Models

Reasoning over Time or Space
– Often, we want to reason about a sequence of observations where the
state of the underlying system is changing
● Speech recognition
● Robot localization
● User attention
● Medical monitoring

– Need to introduce time into our models

Markov Models (aka Markov chain/process)

X0 → X1 → X2 → X3 → ⋯
Quiz: are Markov models a special case of Bayes nets?
– Yes and no!

– Yes:
● Directed acyclic graph, joint = product of conditionals

– No:
● Infinitely many variables (unless we truncate)

● Repetition of transition model not part of standard Bayes net syntax

Example: Random walk in one dimension

States: …, -4, -3, -2, -1, 0, 1, 2, 3, 4, …
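– A minimal simulation sketch (assuming, as in the classic version of this example, moves of ±1 with equal probability):

```python
import random

# Simulate the 1-D random walk chain: at each step move left or right
# with probability 0.5 each (an assumed transition model for illustration).
random.seed(42)
state = 0
for t in range(1, 11):
    state += random.choice([-1, +1])
    print(f"t={t}: X = {state}")
```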
Example: n-gram models
We call ourselves Homo sapiens—man the wise—because our intelligence is so important to us.
For thousands of years, we have tried to understand how we think; that is, how a mere handful of matter can
perceive, understand, predict, and manipulate a world far larger and more complicated than itself. ….

– State: word at position t in text (can also build letter n-grams)


– Transition model (probabilities come from empirical frequencies):
● Unigram (zero-order): P(Wordt = i)
■ “logical are as are confusion a may right tries agent goal the was . . .”

● Bigram (first-order): P(Wordt = i | Wordt-1= j)


■ “systems are very similar computational approach would be represented . . .”

● Trigram (second-order): P(Wordt = i | Wordt-1= j, Wordt-2= k)


■ “planning and scheduling are integrated the success of naive bayes model is . . .”

– Applications: text classification, spam detection, author identification, language classification, speech recognition
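– A minimal sketch of a bigram model whose transition probabilities are empirical pair frequencies (the tiny corpus is a stand-in; any text works):

```python
import random
from collections import defaultdict

# Bigram (first-order) model: P(Word_t = i | Word_t-1 = j) estimated from
# empirical word-pair frequencies, then sampled to generate text.
corpus = ("we call ourselves homo sapiens man the wise because our "
          "intelligence is so important to us").split()

counts = defaultdict(lambda: defaultdict(int))
for prev, word in zip(corpus, corpus[1:]):
    counts[prev][word] += 1

def sample_next(prev):
    words = list(counts[prev])
    weights = [counts[prev][w] for w in words]
    return random.choices(words, weights=weights)[0]

random.seed(0)
word, out = "we", ["we"]
for _ in range(8):
    if word not in counts:      # no observed successor: stop
        break
    word = sample_next(word)
    out.append(word)
print(" ".join(out))
```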
Example: Web browsing
– State: URL visited at step t
– Transition model:
● With probability p, choose an outgoing link at random
● With probability (1-p), choose an arbitrary new page

– Question: What is the stationary distribution over pages?
  ● i.e., if the process runs forever, what fraction of its time does it spend on any given page?

– Application: Google PageRank
  ● Google 1.0 returned the set of pages containing all your keywords, in decreasing rank; now all search engines use link analysis along with many other factors (rank is actually getting less important over time)
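– A power-iteration sketch of that stationary distribution (the 4-page link graph and p = 0.85 are made-up illustration values):

```python
import numpy as np

# Random-surfer chain: with prob. p follow a random outgoing link,
# with prob. 1-p jump to a page chosen uniformly at random.
links = {0: [1, 2], 1: [2], 2: [0], 3: [0, 2]}   # page -> outgoing links
n, p = 4, 0.85

T = np.full((n, n), (1 - p) / n)                 # random-jump part
for i, outs in links.items():
    for j in outs:
        T[i, j] += p / len(outs)                 # link-following part

dist = np.full(n, 1 / n)
for _ in range(100):
    dist = dist @ T                              # run the chain forward
print(np.round(dist, 4))                         # ≈ stationary distribution (PageRank)
```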

Example: Weather
– States: {rain, sun}

▪ Initial distribution P(X0):

    sun 0.5 | rain 0.5

▪ Transition model P(Xt | Xt-1):

    Xt-1   P(Xt = sun | Xt-1)   P(Xt = rain | Xt-1)
    sun    0.9                  0.1
    rain   0.3                  0.7

  (Two ways of representing the same CPT: the table above, or a state diagram with arcs sun→sun 0.9, sun→rain 0.1, rain→rain 0.7, rain→sun 0.3.)
Weather prediction
– Time 0: <0.5, 0.5>

– What is the weather like at time 1? (using the transition model above)
  ● P(X1) = ∑x0 P(X1, X0=x0)
  ●       = ∑x0 P(X0=x0) P(X1 | X0=x0)
  ●       = 0.5<0.9,0.1> + 0.5<0.3,0.7> = <0.6,0.4>
Weather prediction
– Time 1: <0.6, 0.4>

– What is the weather like at time 2?
  ● P(X2) = ∑x1 P(X2, X1=x1)
  ●       = ∑x1 P(X1=x1) P(X2 | X1=x1)
  ●       = 0.6<0.9,0.1> + 0.4<0.3,0.7> = <0.66,0.34>
Weather prediction
– Time 2: <0.66, 0.34>

– What is the weather like at time 3?
  ● P(X3) = ∑x2 P(X3, X2=x2)
  ●       = ∑x2 P(X2=x2) P(X3 | X2=x2)
  ●       = 0.66<0.9,0.1> + 0.34<0.3,0.7> = <0.696,0.304>
Forward algorithm (simple form)

    P(Xt) = ∑xt-1 P(Xt | xt-1) P(xt-1)
                  ↑ transition   ↑ probability from
                    model          previous iteration
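– A NumPy sketch of this update on the weather chain above (state order [sun, rain]); it reproduces <0.6,0.4>, <0.66,0.34>, <0.696,0.304>:

```python
import numpy as np

# Mini-forward algorithm: push P(X_t-1) through the transition model.
T = np.array([[0.9, 0.1],        # row = X_t-1, col = X_t
              [0.3, 0.7]])
p = np.array([0.5, 0.5])         # P(X0)

for t in range(1, 4):
    p = T.T @ p                  # P(X_t) = sum_x P(X_t | x) P(X_t-1 = x)
    print(f"P(X{t}) = {np.round(p, 3)}")
```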
And the same thing in linear algebra form
● What is the weather like at time 2?
  ○ P(X2) = 0.6<0.9,0.1> + 0.4<0.3,0.7> = <0.66,0.34>

● In matrix-vector form:

  ○ P(X2) = ( 0.9  0.3 ) ( 0.6 )  =  ( 0.66 )
            ( 0.1  0.7 ) ( 0.4 )     ( 0.34 )

● i.e., multiply by Tᵀ, the transpose of the transition matrix
Stationary Distributions
– The limiting distribution is called the stationary distribution P∞ of the chain
– It satisfies P∞ = P∞+1 = Tᵀ P∞
– Solving for P∞ in the example:

    ( 0.9  0.3 ) (  p  )  =  (  p  )
    ( 0.1  0.7 ) ( 1-p )     ( 1-p )

    0.9p + 0.3(1-p) = p  ⟹  p = 0.75

  Stationary distribution is <0.75, 0.25> regardless of the starting distribution
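– The same answer via linear algebra (a sketch using NumPy):

```python
import numpy as np

# The stationary distribution satisfies T^T p = p, i.e., it is the
# eigenvector of T^T for eigenvalue 1, normalized to sum to 1.
T = np.array([[0.9, 0.1],
              [0.3, 0.7]])
vals, vecs = np.linalg.eig(T.T)
v = np.real(vecs[:, np.argmax(np.isclose(vals, 1.0))])
print(v / v.sum())               # -> [0.75 0.25]
```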
Example Run of Mini-Forward Algorithm
▪ From an initial observation of sun: P(X1), P(X2), P(X3), P(X4), … → P(X∞) = <0.75, 0.25>

▪ From an initial observation of rain: the sequence converges to the same P(X∞)

▪ From yet another initial distribution P(X1): again P(X∞) = <0.75, 0.25>

[Demo: Ghostbusters – Basic Dynamics]
[Demo: Ghostbusters – Circular Dynamics]
[Demo: Ghostbusters – Whirlpool Dynamics]
Application of Stationary Distributions: Gibbs Sampling*
– Each joint instantiation over all hidden and query variables is a state:
  {X1, …, Xn} = H ∪ Q

– Transitions:
  ● With probability 1/n, resample variable Xj according to
    P(Xj | x1, …, xj-1, xj+1, …, xn, e1, …, em)

– Stationary distribution:
  ● The conditional distribution P(X1, …, Xn | e1, …, em)
  ● This means that if we run Gibbs sampling long enough, we get a sample from the desired distribution
  ● Requires some proof to show this is true!
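– A toy sketch of the idea (the 2-variable joint below is made up for illustration; the sampler's long-run state frequencies should match it):

```python
import random

joint = {(0, 0): 0.3, (0, 1): 0.1, (1, 0): 0.2, (1, 1): 0.4}

def p_one(var, other):
    """P(X_var = 1 | X_other = other), read off the joint table."""
    if var == 0:
        p1, p0 = joint[(1, other)], joint[(0, other)]
    else:
        p1, p0 = joint[(other, 1)], joint[(other, 0)]
    return p1 / (p0 + p1)

random.seed(0)
x, N = [0, 0], 100_000
counts = {k: 0 for k in joint}
for _ in range(N):
    j = random.randrange(2)                         # pick a variable w.p. 1/n
    x[j] = 1 if random.random() < p_one(j, x[1 - j]) else 0
    counts[tuple(x)] += 1                           # record the chain's state

for k in sorted(joint):
    print(k, "empirical:", round(counts[k] / N, 3), "true:", joint[k])
```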
MC Example
– By forward simulation (Monte Carlo): 100K steps
– By linear algebra (“Pizza day!”)
– Monte Carlo estimate
Hidden Markov Models

X0 → X1 → X2 → X3 → ⋯ → X5
      ↓    ↓    ↓         ↓
      E1   E2   E3   ⋯   E5
Example: Weather HMM
– An HMM is defined by:
  ● Initial distribution: P(X0)
  ● Transition model: P(Xt | Xt-1)
  ● Sensor model: P(Et | Xt)

    Wt-1   P(Wt = sun | Wt-1)   P(Wt = rain | Wt-1)
    sun    0.9                  0.1
    rain   0.3                  0.7

    Wt     P(Ut = true | Wt)    P(Ut = false | Wt)
    sun    0.2                  0.8
    rain   0.9                  0.1

  Weathert-1 → Weathert → Weathert+1, with Umbrellat observed at each step
HMM as probability model

X0 → X1 → X2 → X3 → ⋯ → X5, with evidence variables E1, E2, E3, …, E5

Useful notation: Xa:b = Xa, Xa+1, …, Xb

Joint model: P(X0:t, E1:t) = P(X0) ∏t P(Xt | Xt-1) P(Et | Xt)
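– A sampling sketch of this joint, using the umbrella HMM numbers from the Weather HMM slide above (the uniform P(X0) is an assumption):

```python
import random

P0 = {"sun": 0.5, "rain": 0.5}                   # assumed initial distribution
T  = {"sun":  {"sun": 0.9, "rain": 0.1},
      "rain": {"sun": 0.3, "rain": 0.7}}
P_umbrella = {"sun": 0.2, "rain": 0.9}           # P(Et = true | Xt)

def draw(dist):
    return random.choices(list(dist), weights=list(dist.values()))[0]

random.seed(1)
x = draw(P0)                                     # X0 ~ P(X0)
for t in range(1, 6):
    x = draw(T[x])                               # Xt ~ P(Xt | Xt-1)
    e = random.random() < P_umbrella[x]          # Et ~ P(Et | Xt)
    print(f"t={t}: weather={x}, umbrella={e}")
```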
HMMs: Some Relevant Problems

Real HMM Examples
– Speech recognition HMMs:
● Observations are acoustic signals (continuous valued)
● States are specific positions in specific words (so, tens of thousands)

– Machine translation HMMs:


● Observations are words (tens of thousands)
● States are translation options

– Robot tracking:
● Observations are range readings (continuous)
● States are positions on a map (continuous)

– Molecular biology:
  ● Observations are nucleotides (A, C, G, T)
  ● States are coding/non-coding/start/stop/splice-site etc.
Inference tasks
– Filtering: P(Xt | e1:t)
  ● belief state—input to the decision process of a rational agent

– Prediction: P(Xt+k | e1:t) for k > 0
  ● evaluation of possible action sequences; like filtering without the evidence

– Smoothing: P(Xk | e1:t) for 0 ≤ k < t
  ● better estimate of past states, essential for learning

– Most likely explanation: arg maxx1:t P(x1:t | e1:t)
  ● speech recognition, decoding with a noisy channel
Inference tasks

Filtering: P(Xt | e1:t) — evidence e1…e4, query the current state X4
Prediction: P(Xt+k | e1:t) — evidence e1…e3, query the future state X4
Smoothing: P(Xk | e1:t), k < t — evidence e1…e4, query an earlier state Xk
Explanation: P(X1:t | e1:t) — evidence e1…e4, query the whole sequence X1…X4
Example: Ghostbusters HMM
– P(X1) = uniform over the 3×3 grid:

      1/9  1/9  1/9
      1/9  1/9  1/9
      1/9  1/9  1/9

– P(X | X') = usually move clockwise, but sometimes move in a random direction or stay in place; e.g., P(X | X' = <1,2>):

      1/6  1/6  1/2
      0    1/6  0
      0    0    0

– P(Rij | X) = same sensor model as before: red means close, green means far away

X1 → X2 → X3 → X4 → ⋯ → X5, with readings Ri,j at each step

[Demo: Ghostbusters – Circular Dynamics – HMM (L14D2)]
Video of Demo Ghostbusters – Circular Dynamics – HMM
Example 1: Weather-Mood (states observed)
– Using left eigenvectors
Example 2: Best Explanation (HMM)
– What is the most likely weather sequence for the observed mood sequence?
Example 3: Likelihood of Evidence (HMM)
– What is the probability of an observed mood sequence given an HMM model?
– One way to compute it: the forward algorithm
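– A sketch of that computation: run the forward update without normalizing and sum the final message. The weather-mood parameters are not recoverable from this export, so the umbrella HMM numbers are reused for illustration:

```python
import numpy as np

prior = np.array([0.5, 0.5])     # [sun, rain] (assumed uniform)
T = np.array([[0.9, 0.1],
              [0.3, 0.7]])
P_u = np.array([0.2, 0.9])       # P(+u | sun), P(+u | rain)

def evidence_likelihood(observations):
    """P(e1:t): unnormalized forward messages, summed at the end."""
    f = prior
    for u in observations:
        e = P_u if u else 1 - P_u
        f = e * (T.T @ f)        # forward update, no normalization
    return f.sum()

print(evidence_likelihood([True, True, False]))
```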
Filtering / Monitoring
– Filtering, or monitoring, or state estimation, is the task of tracking/maintaining the distribution Bt(X) = f1:t = P(Xt | e1:t) (the belief state) over time

– We start with f0 in an initial setting, usually uniform

– As time passes, or as we get observations, we update f

– The Kalman filter was invented in the 1960s and first implemented as a method of trajectory estimation for the Apollo program; 1,120,000 papers on Google Scholar
Example: Robot Localization
(Example from Michael Pfeiffer; each frame shows a belief grid on a probability scale from 0 to 1)

– Sensor model: four bits for wall/no-wall in each direction, never more than 1 mistake
– Transition model: action may fail with small probability

t=0: [belief grid]
t=1: [belief grid] lighter grey: it was possible to get the reading, but less likely (required 1 mistake)
t=2 through t=5: [belief grids]
Inference: Base Cases

– Observation: a single state X1 with evidence E1
    P(x1 | e1) = P(e1 | x1) P(x1) / P(e1) ∝ P(e1 | x1) P(x1)

– Passage of time: X1 → X2 with no evidence
    P(x2) = ∑x1 P(x2 | x1) P(x1)
Passage of Time
– Aim: devise a recursive filtering algorithm of the form P(Xt+1 | e1:t+1) = g(et+1, P(Xt | e1:t))

– Assume we have the current belief P(Xt | evidence to date)

– Then, after one time step passes:
    P(Xt+1 | e1:t) = ∑xt P(Xt+1 | xt) P(xt | e1:t)
  (be careful about what time step t the belief is about, and what evidence it includes)

▪ Or compactly: B'(Xt+1) = ∑xt P(Xt+1 | xt) B(xt)

– Basic idea: beliefs get “pushed” through the transitions
Filtering
– Filtering allows us to update our belief (a probability distribution over the true state) with observations
  ● We need a prior distribution for the belief as well

– Let our belief probability for state s at time t be denoted Bt(s)

– We would like a recursive, Markov filtering algorithm
  ● Otherwise computations would become more difficult as time goes on
Passage of Time

– As time passes, uncertainty “accumulates” (transition model: ghosts usually go clockwise)

T=1, T=2, T=5: [belief grids]
Bayesian Filtering (Forward Algorithm)
– We can directly derive a belief update (the so-called forward algorithm):

    B(Xt+1) = α P(et+1 | Xt+1) ∑xt P(Xt+1 | xt) B(xt)
Observation
– Assume we have the current belief P(X1 | previous evidence)

– Then, after evidence E1 = e1 comes in:
    P(X1 | e1) ∝ P(e1 | X1) P(X1 | previous evidence)

– Or, compactly: B(Xt) ∝ P(et | Xt) B'(Xt)

▪ Basic idea: beliefs are “reweighted” by the likelihood of the evidence
▪ Unlike the passage of time, we must renormalize
Example: Observation

– As we get observations, beliefs get reweighted, uncertainty “decreases”

Before observation After observation


Example: Weather HMM

– Start at Rain0:              B(+r) = 0.5,   B(-r) = 0.5
– Passage of time to Rain1:    B'(+r) = 0.5,   B'(-r) = 0.5
– Observe Umbrella1 = true:    B(+r) = 0.818,  B(-r) = 0.182
– Passage of time to Rain2:    B'(+r) = 0.627, B'(-r) = 0.373
– Observe Umbrella2 = true:    B(+r) = 0.883,  B(-r) = 0.117

  Transition model P(Rt+1 | Rt):           Sensor model P(Ut | Rt):
    +r → +r : 0.7    +r → -r : 0.3           +r : P(+u) = 0.9,  P(-u) = 0.1
    -r → +r : 0.3    -r → -r : 0.7           -r : P(+u) = 0.2,  P(-u) = 0.8
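– A filtering sketch that reproduces the numbers above (state order [+r, -r]):

```python
import numpy as np

prior = np.array([0.5, 0.5])                 # B(X0)
T = np.array([[0.7, 0.3],                    # row = R_t, col = R_t+1
              [0.3, 0.7]])
P_u = np.array([0.9, 0.2])                   # P(+u | +r), P(+u | -r)

b = prior
for t, umbrella in enumerate([True, True], start=1):
    b_pred = T.T @ b                         # passage of time: B'(X_t)
    e = P_u if umbrella else 1 - P_u         # sensor model for the observation
    b = e * b_pred                           # reweight by evidence likelihood
    b /= b.sum()                             # renormalize
    print(f"t={t}: B' = {np.round(b_pred, 3)}, B = {np.round(b, 3)}")
# t=1: B' = [0.5 0.5],     B = [0.818 0.182]
# t=2: B' = [0.627 0.373], B = [0.883 0.117]
```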
The Forward Algorithm
– We are given evidence at each time step and want to know P(Xt | e1:t)

– We can derive the following update:

    f1:t+1 = α P(et+1 | Xt+1) ∑xt P(Xt+1 | xt) f1:t

  We can normalize as we go if we want to have P(x | e) at each time step, or just once at the end…

– Base case: the prior P(X0) and the transition and sensor models are known
Online Belief Updates
– Every time step, we start with the current P(X | evidence)

– We update for time:
    P(X2 | e1) = ∑x1 P(X2 | x1) P(x1 | e1)

– We update for evidence:
    P(X2 | e1:2) ∝ P(e2 | X2) P(X2 | e1)

– The forward algorithm does both at once (and doesn’t normalize)
Pacman – Sonar (P4)

[Demo: Pacman – Sonar – No Beliefs (L14D1)]
Video of Demo Pacman – Sonar (with beliefs)
Most Likely Explanation
Inference tasks

– Filtering: P(Xt | e1:t)
  ● belief state—input to the decision process of a rational agent

– Prediction: P(Xt+k | e1:t) for k > 0
  ● evaluation of possible action sequences; like filtering without the evidence

– Smoothing: P(Xk | e1:t) for 0 ≤ k < t
  ● better estimate of past states, essential for learning

– Most likely explanation: arg maxx1:t P(x1:t | e1:t)
  ● speech recognition, decoding with a noisy channel
Most likely explanation = most probable path
– State trellis: graph of states and transitions over time

    sun   sun   sun   sun
    rain  rain  rain  rain
    X0    X1    …     XT

  arg maxx1:t P(x1:t | e1:t)
    = arg maxx1:t α P(x1:t, e1:t)
    = arg maxx1:t P(x1:t, e1:t)
    = arg maxx1:t P(x0) ∏t P(xt | xt-1) P(et | xt)

– Each arc represents some transition xt-1 → xt
– Each arc has weight P(xt | xt-1) P(et | xt) (arcs to initial states have weight P(x0))
– The product of weights on a path is proportional to that state sequence’s probability
– The forward algorithm computes sums over paths; the Viterbi algorithm computes best paths
Forward / Viterbi algorithms

    sun   sun   sun   sun
    rain  rain  rain  rain
    X0    X1    …     XT

Forward Algorithm (sum)
  For each state at time t, keep track of the total probability of all paths to it:
    f1:t+1 = FORWARD(f1:t, et+1) = α P(et+1 | Xt+1) ∑xt P(Xt+1 | xt) f1:t

Viterbi Algorithm (max)
  For each state at time t, keep track of the maximum probability of any path to it:
    m1:t+1 = VITERBI(m1:t, et+1) = P(et+1 | Xt+1) maxxt P(Xt+1 | xt) m1:t
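– A Viterbi sketch on the weather/umbrella HMM from the Weather HMM slide (state order [sun, rain]; a uniform P(X0) is assumed):

```python
import numpy as np

prior = np.array([0.5, 0.5])
T = np.array([[0.9, 0.1],            # row = X_t-1, col = X_t
              [0.3, 0.7]])
P_u = np.array([0.2, 0.9])           # P(+u | sun), P(+u | rain)

def viterbi(observations):
    """Most likely state sequence x0:T for a list of umbrella observations."""
    m = prior.copy()                 # m[x] = max prob. of any path ending in x
    backptr = []
    for u in observations:
        e = P_u if u else 1 - P_u
        scores = m[:, None] * T * e[None, :]   # m[i] P(x_j | x_i) P(e | x_j)
        backptr.append(scores.argmax(axis=0))  # best predecessor of each x_j
        m = scores.max(axis=0)
    path = [int(m.argmax())]                   # best final state...
    for bp in reversed(backptr):
        path.append(int(bp[path[-1]]))         # ...then follow back-pointers
    path.reverse()
    return [["sun", "rain"][s] for s in path]  # includes X0

print(viterbi([True, True, False]))
```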
Next Time: Particle Filtering and Applications of HMMs
