
Hidden Markov Models

Dr. Md. Golam Rabiul Alam


BRAC University
Markov Models
Example of Markov Model
[State diagram: two states, Rain and Dry, with self-loop probabilities 0.3 (Rain) and 0.8 (Dry), and cross transitions Rain→Dry 0.7 and Dry→Rain 0.2]
• Two states : ‘Rain’ and ‘Dry’.


• Transition probabilities:
P(‘Rain’|‘Rain’)=0.3 , P(‘Dry’|‘Rain’)=0.7 ,
P(‘Rain’|‘Dry’)=0.2, P(‘Dry’|‘Dry’)=0.8
• Initial probabilities: say P(‘Rain’)=0.4 , P(‘Dry’)=0.6 .

Calculation of sequence probability

State sequence: Dry, Dry, Rain

P(S1 = Dry, S2 = Dry, S3 = Rain)
  = P(S1 = Dry) · P(S2 = Dry | S1 = Dry) · P(S3 = Rain | S2 = Dry)
  = 0.6 × 0.8 × 0.2

A Markov model is just a Bayesian network; in this network, P(S1, S2, S3) = P(S1) · P(S2 | S1) · P(S3 | S2).
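To make the chain-rule computation concrete, here is a minimal Python sketch (not from the slides) that evaluates the same quantity for the Rain/Dry model; the `initial` and `transition` dictionaries simply restate the probabilities listed above.

```python
# Two-state Markov chain from the example above.
initial = {'Rain': 0.4, 'Dry': 0.6}
transition = {                     # transition[prev][cur] = P(cur | prev)
    'Rain': {'Rain': 0.3, 'Dry': 0.7},
    'Dry':  {'Rain': 0.2, 'Dry': 0.8},
}

def sequence_probability(states):
    """P(S1, ..., ST) of a state sequence under a first-order Markov chain."""
    p = initial[states[0]]
    for prev, cur in zip(states, states[1:]):
        p *= transition[prev][cur]
    return p

print(sequence_probability(['Dry', 'Dry', 'Rain']))   # 0.6 * 0.8 * 0.2 = 0.096
```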
Hidden Markov Models

• Based on Markov Models


• Differences include
  – The state becomes "hidden"
  – The state information is not available; instead, there are some "observations" which are correlated with the hidden state
Hidden Markov Model

In a Markov model, the emission probability is not used.
Hidden Markov Models
HMM Example

HMM Problems

Evaluation Problem

Hidden Markov Model
Problems can be solved using HMM
1) Calculation of observation sequence probability

[HMM graph: hidden states S1 → S2 → S3, each emitting an observation O1, O2, O3; observed sequence Rain, Rain, Dry]

P(O1 = Rain, O2 = Rain, O3 = Dry) = ?


P(O1 = Rain, O2 = Rain, O3 = Dry)
  = Σ_{S3 ∈ {Low, High}} P(O1 = Rain, O2 = Rain, O3 = Dry, S3)

P(O1 = Rain, O2 = Rain, O3 = Dry, S3 = Low)  ≡ α_3^Low
P(O1 = Rain, O2 = Rain, O3 = Dry, S3 = High) ≡ α_3^High

P(O1 = Rain, O2 = Rain, O3 = Dry) = α_3^Low + α_3^High = Σ_{i ∈ {Low, High}} α_3^i

* Now the problem is how to calculate α_3^Low and α_3^High.
α_3^i = P(O1, O2, O3, S3 = i)
α_2^i = P(O1, O2, S2 = i)
α_1^i = P(O1, S1 = i)
Calculation Difficulty
Can we find some relationship between α_3^i, α_2^i and α_1^i?

If we can find the relationship, then we can:
1) Calculate α_1^i
2) Calculate α_2^i based on α_1^i
3) Calculate α_3^i based on α_2^i

Recursively!
Can we find the relationship? (Yes)
α_3^Low = P(O1, O2, O3, S3 = Low)
  = P(O1, O2, O3, S3 = Low, S2 = High) + P(O1, O2, O3, S3 = Low, S2 = Low)
  = Σ_{j ∈ {Low, High}} P(O1, O2, O3, S3 = Low, S2 = j)
  = Σ_{j ∈ {Low, High}} P(O1, O2, S2 = j) · P(O3 | S3 = Low, O1, O2, S2 = j) · P(S3 = Low | O1, O2, S2 = j)

By d-separation in the HMM graph:
P(O3 | S3 = Low, O1, O2, S2 = j) = P(O3 | S3 = Low)
P(S3 = Low | O1, O2, S2 = j) = P(S3 = Low | S2 = j)
Forward Algorithm
α_3^Low = Σ_{j ∈ {Low, High}} P(O1, O2, S2 = j) · P(O3 | S3 = Low) · P(S3 = Low | S2 = j)
  = Σ_{j ∈ {Low, High}} α_2^j · P(O3 | S3 = Low) · P(S3 = Low | S2 = j)
  = P(O3 | S3 = Low) · Σ_{j ∈ {Low, High}} α_2^j · P(S3 = Low | S2 = j)

This is dynamic programming: each α reuses the α values of the previous time step, combined with an emission probability P(O3 | S3 = Low) and a transition probability P(S3 = Low | S2 = j).
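The recursion above translates directly into code. The following Python sketch is illustrative only: the slides' actual Low/High model parameters are not in the extracted text, so the `start`, `trans` and `emit` values below are made-up placeholders.

```python
def forward(obs, states, start, trans, emit):
    """Forward algorithm: alpha[t][i] = P(O1..Ot, St = i)."""
    alpha = [{i: start[i] * emit[i][obs[0]] for i in states}]
    for t in range(1, len(obs)):
        alpha.append({
            j: emit[j][obs[t]] * sum(alpha[t - 1][i] * trans[i][j] for i in states)
            for j in states
        })
    return alpha, sum(alpha[-1].values())   # P(O1..OT) = sum_i alpha[T][i]

# Placeholder Low/High model (illustrative numbers, not from the slides).
states = ['Low', 'High']
start = {'Low': 0.5, 'High': 0.5}
trans = {'Low': {'Low': 0.7, 'High': 0.3}, 'High': {'Low': 0.4, 'High': 0.6}}
emit  = {'Low': {'Rain': 0.6, 'Dry': 0.4}, 'High': {'Rain': 0.1, 'Dry': 0.9}}

alpha, likelihood = forward(['Rain', 'Rain', 'Dry'], states, start, trans, emit)
print(likelihood)   # P(O1 = Rain, O2 = Rain, O3 = Dry)
```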
HMM Example

Decoding Problem 1
Problems can be Solved by HMM

• Decoding problem 1

[HMM graph: hidden states S1 → S2 → S3 → S4, each emitting an observation O1, O2, O3, O4]

P(O1, O2, O3, O4, S3 = High) = ?
P(O1, O2, O3, O4, S3 = High)
  = P(O1, O2, O3, S3 = High) · P(O4 | O1, O2, O3, S3 = High)
  = P(O1, O2, O3, S3 = High) · P(O4 | S3 = High)
  = α_3^High · β_3^High

We already know how to calculate the first factor, α_3^High; the question is how to calculate β_3^High.
Calculation Difficulty

[Three HMM graphs over states S1, ..., S6 and observations O1, ..., O6, illustrating the direct computation of β_3^High, β_4^High and β_5^High]
Can we find some relationship among β_3^i, β_4^i, β_5^i and β_6^i?

If we can find the relationship, then we can:
1) Calculate β_6^i = 1
2) Calculate β_5^i based on β_6^i
3) Calculate β_4^i based on β_5^i
4) Calculate β_3^i based on β_4^i

Recursively!
Can we find the relationship? (Yes)
β_3^High = P(O4, O5, O6 | S3 = High)
  = Σ_{j ∈ {Low, High}} P(O4, O5, O6, S4 = j | S3 = High)
  = Σ_{j ∈ {Low, High}} P(O5, O6 | S4 = j, O4, S3 = High) · P(O4 | S4 = j, S3 = High) · P(S4 = j | S3 = High)

By d-separation in the HMM graph:
P(O5, O6 | S4 = j, O4, S3 = High) = P(O5, O6 | S4 = j) = β_4^j
P(O4 | S4 = j, S3 = High) = P(O4 | S4 = j)
Backward Algorithm

β_3^High = Σ_{j ∈ {Low, High}} β_4^j · P(O4 | S4 = j) · P(S4 = j | S3 = High)

where P(O4 | S4 = j) is an emission probability and P(S4 = j | S3 = High) is a transition probability.
Backward Probability

Backward Algorithm
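A matching sketch of the backward recursion, written against the same hypothetical Low/High model used in the forward sketch above:

```python
def backward(obs, states, trans, emit):
    """Backward algorithm: beta[t][i] = P(O(t+1)..OT | St = i)."""
    T = len(obs)
    beta = [dict() for _ in range(T)]
    beta[T - 1] = {i: 1.0 for i in states}        # beta_T^i = 1
    for t in range(T - 2, -1, -1):
        beta[t] = {
            i: sum(beta[t + 1][j] * emit[j][obs[t + 1]] * trans[i][j]
                   for j in states)
            for i in states
        }
    return beta

# Together with the forward pass: P(O1..OT, St = i) = alpha[t][i] * beta[t][i],
# which is exactly the alpha * beta decomposition used in Decoding Problem 1.
```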
Decoding Problem 2

[HMM graph: hidden states S1 → S2 → S3, each emitting an observation O1, O2, O3; observed sequence Rain, Rain, Dry]

O1, O2, O3 are known; what is the most probable state sequence S1, S2, S3?

For example: {High, High, Low}, {Low, High, Low}, {Low, High, High}, ...
argmax_{S1, S2, S3} P(S1, S2, S3 | O1, O2, O3)
  = argmax_{S1, S2, S3} P(S1, S2, S3, O1, O2, O3)
  = argmax_k max_{S1, S2} P(S3 = k, S1, S2, O1, O2, O3)

V_3^k = max_{S1, S2} P(S3 = k, S1, S2, O1, O2, O3) is the probability of the most likely sequence of states ending at state S3 = k.
V_3^k = max_{S1, S2} P(S3 = k, S2, S1, O1, O2, O3)
  = max_i max_{S1} P(S3 = k, S2 = i, S1, O1, O2, O3)
  = max_i max_{S1} P(S2 = i, S1, O1, O2) · P(S3 = k, O3 | S2 = i, S1, O1, O2)
  = max_i max_{S1} P(S2 = i, S1, O1, O2) · P(O3 | S3 = k, S2 = i, S1, O1, O2) · P(S3 = k | S2 = i, S1, O1, O2)

By d-separation: P(O3 | S3 = k, S2 = i, S1, O1, O2) = P(O3 | S3 = k) and P(S3 = k | S2 = i, S1, O1, O2) = P(S3 = k | S2 = i). Hence

V_3^k = max_i V_2^i · P(O3 | S3 = k) · P(S3 = k | S2 = i)
      = P(O3 | S3 = k) · max_i P(S3 = k | S2 = i) · V_2^i
Viterbi algorithm

V_3^k = max_i V_2^i · P(O3 | S3 = k) · P(S3 = k | S2 = i) = P(O3 | S3 = k) · max_i P(S3 = k | S2 = i) · V_2^i

V_2^k = P(O2 | S2 = k) · max_i P(S2 = k | S1 = i) · V_1^i
Viterbi algorithm

HMM

What is the most likely sequence of health status for the observation sequence [Normal, Cold, Dizzy]?
Viterbi Algorithm

The most likely sequence of health status is [Healthy, Healthy, Fever] for the observation sequence [Normal, Cold, Dizzy].
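For reference, a compact Viterbi implementation in Python. The Healthy/Fever parameters below are assumed for illustration (the slides' actual numbers are in figures that did not survive extraction); with these assumed values the decoder does return [Healthy, Healthy, Fever] for [Normal, Cold, Dizzy].

```python
def viterbi(obs, states, start, trans, emit):
    """Viterbi: V[t][k] = probability of the best state path ending in state k at time t."""
    V = [{k: start[k] * emit[k][obs[0]] for k in states}]
    back = []
    for t in range(1, len(obs)):
        V.append({})
        back.append({})
        for k in states:
            prev, p = max(((i, V[t - 1][i] * trans[i][k]) for i in states),
                          key=lambda x: x[1])
            V[t][k] = emit[k][obs[t]] * p
            back[t - 1][k] = prev
    # Backtrack from the best final state.
    path = [max(states, key=lambda k: V[-1][k])]
    for ptr in reversed(back):
        path.insert(0, ptr[path[0]])
    return path

states = ['Healthy', 'Fever']
start = {'Healthy': 0.6, 'Fever': 0.4}                            # assumed
trans = {'Healthy': {'Healthy': 0.7, 'Fever': 0.3},               # assumed
         'Fever':   {'Healthy': 0.4, 'Fever': 0.6}}
emit  = {'Healthy': {'Normal': 0.5, 'Cold': 0.4, 'Dizzy': 0.1},   # assumed
         'Fever':   {'Normal': 0.1, 'Cold': 0.3, 'Dizzy': 0.6}}

print(viterbi(['Normal', 'Cold', 'Dizzy'], states, start, trans, emit))
# -> ['Healthy', 'Healthy', 'Fever']
```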
Expectation Maximization (EM)
Expected number of transitions from state i to state j at time t.

Expected number of times of being in state i at time t.
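These two quantities are the E-step statistics of the EM procedure. A sketch of how they can be obtained from the forward and backward probabilities, reusing the `forward`/`backward` helpers (and hence the same hypothetical parameters) sketched earlier:

```python
def expected_counts(obs, states, trans, emit, alpha, beta, likelihood):
    """E-step statistics.
    gamma[t][i]  = P(St = i | O)            -- expected count of being in state i at time t
    xi[t][i][j]  = P(St = i, St+1 = j | O)  -- expected count of an i -> j transition at time t
    """
    T = len(obs)
    gamma = [{i: alpha[t][i] * beta[t][i] / likelihood for i in states}
             for t in range(T)]
    xi = [{i: {j: alpha[t][i] * trans[i][j] * emit[j][obs[t + 1]] * beta[t + 1][j] / likelihood
               for j in states}
           for i in states}
          for t in range(T - 1)]
    return gamma, xi
```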
Example

Observations: R, D

Training data: five labeled examples, each a (state sequence, observation sequence) pair — state sequence 1 with obs sequence 1, ..., state sequence 5 with obs sequence 5.
Example

Expected number of transitions from state i to state j at time t.

Expected number of times of being in state i at time t.
HMM Parameter Learning

HMM Parameter Learning (Given fully labeled sequences)
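The slides for this part are figures, but the underlying idea is ordinary maximum-likelihood estimation by counting. A minimal sketch, assuming the training data is given as fully labeled (state sequence, observation sequence) pairs such as the example above:

```python
from collections import Counter, defaultdict

def mle_from_labeled(pairs):
    """MLE of HMM parameters when the hidden state sequences are fully observed.
    `pairs` is a list of (state_sequence, observation_sequence) tuples."""
    start, trans, emit = Counter(), defaultdict(Counter), defaultdict(Counter)
    for states, obs in pairs:
        start[states[0]] += 1                      # initial-state counts
        for s, o in zip(states, obs):
            emit[s][o] += 1                        # emission counts
        for s, s_next in zip(states, states[1:]):
            trans[s][s_next] += 1                  # transition counts
    normalize = lambda c: {k: v / sum(c.values()) for k, v in c.items()}
    return (normalize(start),
            {s: normalize(c) for s, c in trans.items()},
            {s: normalize(c) for s, c in emit.items()})
```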
HMM Parameter Learning without fully
labeled hidden sequences

We have seen the procedure to calculate the optimal parameters given the
hidden state sequence.

However, it is common that the hidden state sequence is unknown. In such a case, we first try to "estimate" the "expected" state sequence based on some initial estimates of the parameters.

Then, we use the principles of MLE, as if this expected state sequence had been observed, to refine the parameters.

We apply these two steps iteratively, via an algorithm called Expectation-Maximization.
HMM Parameter Learning
HMM Parameter Learning

By repeating the Expectation and Maximization steps till convergence, we reach a local optimum. We may run the algorithm multiple times with different initializations and finally choose the set of parameters giving the highest likelihood.
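Putting the pieces together, a sketch of one EM (Baum-Welch) iteration for a single observation sequence, built from the `forward`, `backward` and `expected_counts` helpers sketched earlier (the parameters remain illustrative, not slide-supplied):

```python
def baum_welch_step(obs, states, start, trans, emit):
    """One Expectation-Maximization iteration for a single observation sequence."""
    # E-step: expected counts under the current parameters.
    alpha, likelihood = forward(obs, states, start, trans, emit)
    beta = backward(obs, states, trans, emit)
    gamma, xi = expected_counts(obs, states, trans, emit, alpha, beta, likelihood)

    # M-step: re-estimate the parameters from the expected counts.
    T = len(obs)
    new_start = {i: gamma[0][i] for i in states}
    new_trans = {i: {j: sum(xi[t][i][j] for t in range(T - 1)) /
                        sum(gamma[t][i] for t in range(T - 1))
                     for j in states}
                 for i in states}
    new_emit = {i: {o: sum(gamma[t][i] for t in range(T) if obs[t] == o) /
                       sum(gamma[t][i] for t in range(T))
                    for o in set(obs)}
                for i in states}
    return new_start, new_trans, new_emit, likelihood

# Iterate until the likelihood stops improving; restart from several random
# initializations and keep the parameters with the highest final likelihood.
```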
HMM Parameter Learning Example (without fully labeled sequences)

• Assume we have the observations for a single example in our training set from the Fair and Biased coin HMM, like the following:

We wish to compute the parameters using the EM algorithm. Assume that K = 2: Fair coin and Biased coin.
HMM Parameter Learning (Example)
Thank You
• Standard HMM reference:
L. R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," Proceedings of the IEEE, Vol. 77, No. 2, pp. 257-286, 1989.

• Excellent reference for Dynamic Bayes Nets as a unifying framework for probabilistic temporal models (including HMMs and Kalman filters):
Chapter 15 of Artificial Intelligence: A Modern Approach, 2nd Edition, by Russell & Norvig.
