
9: Viterbi Algorithm for HMM Decoding

Machine Learning and Real-world Data

Andreas Vlachos
(slides adapted from Simone Teufel)

Department of Computer Science and Technology


University of Cambridge
Last session: estimating parameters of an HMM

The dishonest casino, dice edition.


Two hidden states: L (loaded dice), F (fair dice).
You don’t know which dice is currently in use. You can only
observe the numbers that are thrown.
You estimated transition and emission probabilities (Problem
1 from last time).
We are now turning to Problem 4.
We want the HMM to find out when the fair dice was out,
and when the loaded dice was out.
We need to write a decoder.
Decoding: finding the most likely path

Definition of decoding: Finding the most likely hidden state sequence X that explains the observation O given the HMM parameters µ = (A, B).

$$
\begin{aligned}
\hat{X} &= \operatorname*{argmax}_{X} P(X, O \mid \mu) \\
        &= \operatorname*{argmax}_{X} P(O \mid X, B)\, P(X \mid A) \\
        &= \operatorname*{argmax}_{X_1 \ldots X_T} \prod_{t=1}^{T} P(O_t \mid X_t, B)\, P(X_t \mid X_{t-1}, A)
\end{aligned}
$$

The search space of possible state sequences X is $O(N^T)$; too large for brute-force search.
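To see why brute force is hopeless, here is a minimal sketch (not part of the course code) that scores every candidate state sequence directly; `trans`, `emis` and `start` are hypothetical dictionaries standing in for the A and B parameters and the initial-state probabilities.

```python
from itertools import product

def brute_force_decode(observations, states, trans, emis, start):
    """Score every possible state sequence and keep the best one.

    Illustration only, never practical: with N states and T observations there
    are N**T candidate sequences. trans[i][j], emis[j][o] and start[j] are
    hypothetical dictionaries for the transition, emission and initial
    probabilities.
    """
    best_seq, best_prob = None, 0.0
    for seq in product(states, repeat=len(observations)):  # N**T candidates
        prob = start[seq[0]] * emis[seq[0]][observations[0]]
        for t in range(1, len(observations)):
            prob *= trans[seq[t - 1]][seq[t]] * emis[seq[t]][observations[t]]
        if prob > best_prob:
            best_seq, best_prob = seq, prob
    return best_seq, best_prob
```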
Viterbi is a Dynamic Programming Application

(Reminder from Algorithms course)


We can use Dynamic Programming if two conditions apply:
Optimal substructure property:
An optimal state sequence $X_1 \ldots X_j \ldots X_T$ contains inside it the sequence $X_1 \ldots X_j$, which is also optimal.
Overlapping subsolutions property:
If both $X_t$ and $X_u$ are on the optimal path, with $u > t$, then the calculation of the probability for being in state $X_t$ is part of each of the many calculations for being in state $X_u$.
The intuition behind Viterbi

Here’s how we can save ourselves a lot of time.


Because of the Limited Horizon of the HMM, we don’t need
to keep a complete record of how we arrived at a certain state.
For the first-order HMM, we only need to record one previous
step.
Just do the calculation of the probability of reaching each state once for each time step (variable δ).
Then memoise this probability in a Dynamic Programming table.
This reduces our effort to $O(N^2 T)$.
This is for the first order HMM, which only has a memory of
one previous state.
Viterbi: main data structure

Memoisation is done using a trellis.


A trellis is equivalent to a Dynamic Programming table.
The trellis is $(N + 2) \times (T + 2)$ in size, with states $j$ as rows and time steps $t$ as columns.
Each cell $(j, t)$ records the Viterbi probability $\delta_j(t)$, the probability of the most likely path that ends in state $s_j$ at time $t$:

$$\delta_j(t) = \max_{1 \le i \le N} \left[\, \delta_i(t-1)\, a_{ij}\, b_j(O_t) \,\right]$$

This probability is calculated by maximising over the best ways of going to $s_j$ for each $s_i$:
$a_{ij}$: the transition probability from $s_i$ to $s_j$
$b_j(O_t)$: the probability of emitting $O_t$ from destination state $s_j$
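A minimal sketch of this recursion for a single trellis column, assuming the parameters are stored as hypothetical dict-of-dicts with `trans[i][j]` = $a_{ij}$ and `emis[j][o]` = $b_j(o)$:

```python
def viterbi_column(prev_delta, states, trans, emis, obs_t):
    """Fill one trellis column: delta_j(t) = max_i [delta_i(t-1) * a_ij] * b_j(O_t).

    Sketch only: trans[i][j] = a_ij and emis[j][o] = b_j(o) are hypothetical
    dict-of-dicts holding the HMM parameters; prev_delta maps each state i to
    delta_i(t-1).
    """
    delta = {}
    for j in states:
        delta[j] = max(prev_delta[i] * trans[i][j] for i in states) * emis[j][obs_t]
    return delta
```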
Viterbi algorithm, initialisation

Note: the probability of a state starting the sequence at $t = 0$ is just the probability of it emitting the first symbol.
[Trellis diagrams: initialising the first column of the trellis]
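A sketch of the corresponding initialisation, assuming an explicit start state as in the $(N + 2) \times (T + 2)$ trellis above; the `START` label and the dictionary layout are illustrative, not the course's actual data structures.

```python
START = "<s>"  # hypothetical label for the explicit start state in the trellis

def viterbi_init(states, trans, emis, first_obs):
    """First real trellis column: delta_j(1) = a_<s>,j * b_j(O_1).

    Assumes the start state's own delta is 1, so the probability of a state
    beginning the sequence reduces to entering it from <s> and emitting the
    first symbol. Dictionary layouts are illustrative only.
    """
    return {j: trans[START][j] * emis[j][first_obs] for j in states}
```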
Viterbi algorithm, main step
Viterbi algorithm, main step: observation is 4
[Trellis diagrams: starting the column for observation 4]
Viterbi algorithm, main step, ψ

ψj (t) is a helper variable that stores the t − 1 state index i on


the highest probability path.

ψj (t) = argmax[δi (t − 1) aij bj (Ot )]


1≤i≤N

In the backtracing phase, we will use ψ to find the previous


cell/state in the best path.
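A sketch of the main step extended to record ψ, plus the backtracing it enables. Note that $b_j(O_t)$ does not depend on $i$, so leaving it out of the argmax does not change which predecessor wins. Names and data structures are again illustrative, not the course's own.

```python
def viterbi_step_with_backpointers(prev_delta, states, trans, emis, obs_t):
    """One trellis column that also records psi_j(t), the best predecessor of j.

    Sketch only: trans[i][j] = a_ij and emis[j][o] = b_j(o) are hypothetical
    dict-of-dicts holding the HMM parameters; prev_delta maps i to delta_i(t-1).
    """
    delta, psi = {}, {}
    for j in states:
        # b_j(O_t) does not depend on i, so it can be left out of the argmax.
        best_i = max(states, key=lambda i: prev_delta[i] * trans[i][j])
        psi[j] = best_i
        delta[j] = prev_delta[best_i] * trans[best_i][j] * emis[j][obs_t]
    return delta, psi

def backtrace(psi_columns, last_state):
    """Recover the best state sequence by following the psi pointers backwards.

    psi_columns holds the psi dictionaries for t = 2..T in order; last_state is
    the most probable state at the final time step.
    """
    path = [last_state]
    for psi in reversed(psi_columns):
        path.append(psi[path[-1]])
    return list(reversed(path))
```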
Viterbi algorithm, main step: observation is 4
[Trellis diagrams: completing the column for observation 4]
Viterbi algorithm, main step: observation is 3
[Trellis diagrams: filling in the column for observation 3]
Viterbi algorithm, main step: observation is 5
[Trellis diagrams: filling in the column for observation 5]
Viterbi algorithm, termination
[Trellis diagram: selecting the most probable final state]
Viterbi algorithm, backtracing
[Trellis diagrams: following the ψ backpointers from the final state back to the start]
Why is it necessary to keep N states at each time step?

We have convinced ourselves that it's not necessary to keep more than N ("real") states per time step.
But could we cut the table down to just a one-dimensional table of T time slots, by keeping only the probability of the best path overall ending in that time slot, in whichever state?
This would be the greedy choice.
But think about what could happen in a later time slot.
You could encounter a zero or very low probability for all paths going through your chosen state $s_j$ at time $t$.
Now a state $s_k$ that looked suboptimal in comparison to $s_j$ at time $t$ becomes the best candidate.
As we don't know the future, this could happen to any state, so we need to keep the probabilities for each state at each time slot.
But thankfully, no more.
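A tiny made-up two-state HMM (the numbers are illustrative, not the casino's) shows the greedy one-column table going wrong: the state that loses the first column still lies on the globally best path.

```python
# A made-up two-state HMM (illustrative numbers, not the casino's parameters).
start = {"F": 0.6, "L": 0.4}
trans = {"F": {"F": 0.5, "L": 0.5}, "L": {"F": 0.01, "L": 0.99}}
emis  = {"F": {"o1": 0.5, "o2": 0.01}, "L": {"o1": 0.5, "o2": 0.99}}

# t = 1, observation o1: F wins the column (0.30 vs 0.20).
delta1 = {j: start[j] * emis[j]["o1"] for j in ("F", "L")}

# t = 2, observation o2: the best path ends in L and came from L at t = 1
# (0.20 * 0.99 * 0.99 ~= 0.196), beating any path through F at t = 1
# (at most 0.30 * 0.5 * 0.99 ~= 0.149). Pruning to the column winner F at
# t = 1 would therefore have discarded the globally best path.
delta2 = {j: max(delta1[i] * trans[i][j] for i in ("F", "L")) * emis[j]["o2"]
          for j in ("F", "L")}
```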
Precision and Recall

So far, we have measured system success in terms of accuracy, or agreement in terms of Kappa.
But sometimes it's only one type of instance that we find interesting.
We don't want a summary measure that averages over interesting and non-interesting instances, as accuracy does.
In those cases, we use precision, recall and F-measure.
These metrics are imported from the field of information retrieval, where the difference (in numbers) between interesting and non-interesting examples is particularly high.
Accuracy doesn't work well when the types of instances are unbalanced.
True positives, false negatives...

                     System says:
                     L         F         Total
Truth is:   L        TP        FN        TP+FN
            F        FP        TN        FP+TN
            Total    TP+FP     FN+TN     TP+FP+FN+TN

L is the category we are interested in.

TP are the true positives.
The system correctly declared them as positive.
FN are the false negatives.
The system didn't declare them as positive, but should have.
TN are the true negatives.
The system didn't declare them as positive, and was right.
FP are the false positives.
The system declared them as positive, but shouldn't have.
Precision and Recall

                     System says:
                     L         F         Total
Truth is:   L        TP        FN        TP+FN
            F        FP        TN        FP+TN
            Total    TP+FP     FN+TN     TP+FP+FN+TN

Precision of L: $P_L = \frac{TP}{TP + FP}$
Recall of L: $R_L = \frac{TP}{TP + FN}$
F-measure of L: $F_L = \frac{2 P_L R_L}{P_L + R_L}$
Accuracy: $A = \frac{TP + TN}{TP + FP + FN + TN}$
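As a sketch, the four formulas in code, assuming the counts have already been collected:

```python
def precision_recall_f(tp, fp, fn, tn):
    """Precision, recall and F-measure for the class of interest, plus accuracy."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_measure = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return precision, recall, f_measure, accuracy
```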
Your task today

Task 8:
Implement the Viterbi algorithm.
Run it on the dice dataset and measure precision of L ($P_L$), recall of L ($R_L$) and F-measure of L ($F_L$).
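One way to collect the counts for the dice data is to compare the Viterbi output with the true hidden states position by position; the function below is a sketch, with `predicted` and `truth` as hypothetical equal-length lists of state labels.

```python
def confusion_counts(predicted, truth, positive="L"):
    """Count TP, FP, FN and TN for the state of interest (here L).

    `predicted` and `truth` are hypothetical equal-length lists of hidden-state
    labels: the Viterbi output and the true dice sequence respectively.
    """
    tp = sum(p == positive and t == positive for p, t in zip(predicted, truth))
    fp = sum(p == positive and t != positive for p, t in zip(predicted, truth))
    fn = sum(p != positive and t == positive for p, t in zip(predicted, truth))
    tn = sum(p != positive and t != positive for p, t in zip(predicted, truth))
    return tp, fp, fn, tn
```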
Literature

Manning and Schütze (2000). Foundations of Statistical Natural Language Processing, MIT Press. Chapter 9.3.2.
We use a state-emission HMM, but this textbook uses an arc-emission HMM. There is therefore a slight difference in the algorithm as to the step in which the initial and final $b_j(k_t)$ are multiplied in.
Jurafsky and Martin, 3rd Edition (online), Chapter 8.4 (but careful: the notation differs).
Smith, Noah A. (2004). Hidden Markov Models: All the Glorious Gory Details.
Bockmayr and Reinert (2011). Markov chains and Hidden Markov Models. Discrete Math for Bioinformatics WS 10/11.
