
Part-of-Speech Tagging and Hidden Markov Model


Md Shad Akhtar
[email protected]

shadakhtar:nlp:iiitd:2025:POS:HMM
Part of Speech Tagging
● Given a PoS tag set T = [t1, t2, ⋯, tm] and a sentence S = [w1, w2, ⋯, wn]
○ Assign each word wi a tag tj that defines its grammatical class.

He ate an apple .

PRP VBD DT NN .
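As a quick sanity check of the running example, an off-the-shelf tagger can be asked for Penn Treebank tags. A minimal sketch, assuming NLTK is installed and its tokenizer and tagger resources have been downloaded:

# Minimal sketch: tag the example sentence with NLTK's default tagger.
# Assumes: pip install nltk, plus the tokenizer and tagger resources
# fetched once via nltk.download(...).
import nltk

tokens = nltk.word_tokenize("He ate an apple .")
print(nltk.pos_tag(tokens))
# Typically prints (Penn Treebank tags):
# [('He', 'PRP'), ('ate', 'VBD'), ('an', 'DT'), ('apple', 'NN'), ('.', '.')]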

Part of Speech Tagging
● A word can have more than one PoS tag.

He_PRP went_VBD to_TO the_DT park_NN in_IN a_DT car_NN ._.

He_PRP went_VBD to_TO park_VB the_DT car_NN in_IN the_DT shed_NN ._.

● Context can help in disambiguation!

I bank1 on the bank2 by the river bank3.
(bank1 → Verb, bank2 → Noun, bank3 → Noun)

Part of Speech
● Grouping of words into equivalence classes w.r.t. the role they play in the syntactic structure.
○ Contains information about the word and its neighbours, e.g., an article always precedes a noun/noun phrase.

● Traditionally,
○ Content words (Open class): Noun, Verb, Adjective, Adverb
■ Semantically rich words
■ Not a fixed set
○ Functional words (Closed class): Pronoun, Preposition, Conjunction, Interjection
■ Important to bind the sentence
■ Usually carry less information
■ Fixed and limited set
Part of Speech
Penn Treebank tagset: 45 tags.
[Figure: table of the 45 Penn Treebank tags.]

Ambiguity in PoS tags
● Most words in English are unambiguous
○ Roughly 88.5% of word types (Brown corpus)
○ If there were no ambiguity, PoS tagging would be simple:
■ learn a table of words and their corresponding tags.

● However, most common words are ambiguous.
○ Roughly 40% of tokens (Brown corpus)

● Disambiguation
○ Use the contextual information, i.e., look back or look ahead.
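These proportions are easy to recompute. A rough sketch, assuming NLTK and its Brown corpus are available locally; it counts word types, whereas the token-level figure weights each type by its frequency:

# Rough sketch: how many Brown-corpus word types carry more than one tag.
# Assumes nltk is installed and the 'brown' corpus has been downloaded.
from collections import defaultdict
from nltk.corpus import brown

tags_per_word = defaultdict(set)
for word, tag in brown.tagged_words():
    tags_per_word[word.lower()].add(tag)

ambiguous = sum(1 for tags in tags_per_word.values() if len(tags) > 1)
print(f"ambiguous word types: {ambiguous} / {len(tags_per_word)}")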

Illustration of PoS tagging

An example
● S = Brown foxes jumped over the fence.
● T = ??

● Tag set = [NN, NNS, VB, VBS, VBD, JJ, RB, DT, IN, . ]

● Exhaustive search
○ For each token, we have to estimate the probability w.r.t. each tag.
○ O(|T|^|S|) possible tag sequences
○ Very expensive!!
● If we know that a word can take only a handful of PoS tags, we can reduce this search.
○ How to get this information?
■ Corpus (see the sketch after this list)
○ Still, there can be lots of possibilities.
■ Retain only the most probable path so far, and discard the others.
■ Viterbi algorithm (cost drops to O(|S| · |T|²))
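A rough sketch of the corpus idea, assuming NLTK's Brown corpus is available; the tag dictionary and the fallback tag-set size of 45 are illustrative choices, not part of the slides:

# Sketch: how much a corpus-derived tag dictionary prunes the search space.
# Assumes nltk and the 'brown' corpus; unseen words fall back to the full tag set.
from collections import defaultdict
from nltk.corpus import brown

tag_dict = defaultdict(set)
for word, tag in brown.tagged_words():
    tag_dict[word.lower()].add(tag)

sentence = ["Brown", "foxes", "jumped", "over", "the", "fence", "."]
FULL_TAGSET = 45                      # size of a Penn Treebank-style tag set

exhaustive = FULL_TAGSET ** len(sentence)
pruned = 1
for w in sentence:
    pruned *= len(tag_dict[w.lower()]) or FULL_TAGSET
print(f"exhaustive paths: {exhaustive}, with a tag dictionary: {pruned}")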
Position:  0   1      2      3       4     5    6      7
W:         ^   Brown  foxes  jumped  over  the  fence  .
T:         ^   JJ     NNS    VBD     NN    DT   NN     .
               NN     VBS    JJ      IN         VB
                                     JJ
                                     RB

[Figure: the candidate tags above, expanded into a tree of possible tag sequences rooted at ^.]
[Figure: the same tree of candidate tag sequences, repeated.]
^ Brown foxes jumped over the fence .

● Pick a path to the leaf that maximizes the likelihood of the tag sequence.
○ E.g., the topmost path:
■ ^ JJ NNS VBD NN DT NN .

Hidden Markov Model (HMM)

Motivation
      Bag 1   Bag 2   Bag 3
R      30      10      60
G      50      40      10
B      20      50      30

● Assume we have an observation of N balls withdrawn from these bags:


Red Red Green Green Blue Red Green Red
B1, B2, or B3? B1, B2, or B3? B1, B2, or B3? … B1, B2, or B3?

● Question
○ Give the most likely sequence of bags for the withdrawal of the above sequence of balls.
■ Not an easily computable problem.
Given two probability matrices:

Transition probabilities P(next bag | current bag)
      B1    B2    B3
B1    0.1   0.4   0.5
B2    0.6   0.2   0.2
B3    0.3   0.4   0.3

Emission probabilities P(ball | bag)
      R     G     B
B1    0.3   0.5   0.2
B2    0.1   0.4   0.5
B3    0.6   0.1   0.3

[Figure: state-transition diagram over B1, B2, B3. Each arc from Bi to Bj is labelled with the products P(ball | Bi) × P(Bj | Bi); e.g., the B1 → B3 arc carries R: 0.3 × 0.5 = 0.15, G: 0.5 × 0.5 = 0.25, B: 0.2 × 0.5 = 0.10.]
With the same transition and emission matrices, expand the trellis for the observation sequence ^ R R G, starting from the initial state B0 (the slide assumes P(B1 | B0) = 1). Each arc multiplies the probability so far by the transition probability into the next bag and the emission probability of the observed ball from that bag:

Step 1 (R):
  B0 → B1 : 1 × 0.3 = 0.3

Step 2 (R), from B1 (0.3):
  B1 → B1 : 0.3 × 0.1 × 0.3 = 0.009
  B1 → B2 : 0.3 × 0.4 × 0.1 = 0.012
  B1 → B3 : 0.3 × 0.5 × 0.6 = 0.09

Step 3 (G), from B1 (0.009):
  B1 → B1 : 0.009 × 0.1 × 0.5 = 0.00045
  B1 → B2 : 0.009 × 0.4 × 0.4 = 0.00144
  B1 → B3 : 0.009 × 0.5 × 0.1 = 0.00045
Step 3 (G), from B2 (0.012):
  B2 → B1 : 0.012 × 0.6 × 0.5 = 0.0036
  B2 → B2 : 0.012 × 0.2 × 0.4 = 0.00096
  B2 → B3 : 0.012 × 0.2 × 0.1 = 0.00024
Step 3 (G), from B3 (0.09):
  B3 → B1 : 0.09 × 0.3 × 0.5 = 0.0135
  B3 → B2 : 0.09 × 0.4 × 0.4 = 0.0144
  B3 → B3 : 0.09 × 0.3 × 0.1 = 0.0027

The most probable path after three observations is B1 → B3 → B2, with probability 0.0144.
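The same expansion can be reproduced programmatically. A small sketch (exhaustive enumeration, not yet Viterbi), with the matrices copied from the slide and the assumption above that the first ball is drawn from B1:

# Sketch: enumerate all bag sequences for the observations R R G and score them
# exactly as in the trellis above (emission taken at the bag being entered).
trans = {"B1": {"B1": 0.1, "B2": 0.4, "B3": 0.5},
         "B2": {"B1": 0.6, "B2": 0.2, "B3": 0.2},
         "B3": {"B1": 0.3, "B2": 0.4, "B3": 0.3}}
emit = {"B1": {"R": 0.3, "G": 0.5, "B": 0.2},
        "B2": {"R": 0.1, "G": 0.4, "B": 0.5},
        "B3": {"R": 0.6, "G": 0.1, "B": 0.3}}

# Start as on the slide: the first R is drawn from B1 with probability 1 * P(R | B1).
paths = {("B1",): 1.0 * emit["B1"]["R"]}

for ball in ["R", "G"]:                       # the remaining observations
    new_paths = {}
    for path, prob in paths.items():
        cur = path[-1]
        for nxt in trans[cur]:
            new_paths[path + (nxt,)] = prob * trans[cur][nxt] * emit[nxt][ball]
    paths = new_paths

for path, prob in sorted(paths.items(), key=lambda kv: -kv[1]):
    print(" -> ".join(path), round(prob, 5))   # best: B1 -> B3 -> B2, 0.0144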
Formulation of HMM-based PoS tagging

Observation:  o1=R  o2=R  o3=G  o4=G  o5=B  o6=R  o7=G  o8=R
State:        s1    s2    s3    s4    s5    s6    s7    s8

where,
si ∈ [B1, B2, B3]
S*: best possible state sequence
Goal: maximize P(S | O) by choosing the best state sequence S

S* = argmaxS P(S | O)

Formulation of HMM
S* = argmaxS P(S | O)

P(S | O) = P({s1, s2, s3, s4, s5, s6, s7, s8} | {o1, o2, o3, o4, o5, o6, o7, o8})

1. Apply chain-rule
= P(s1 | O)P(s2 | s1, O)P(s3 | s1s2, O)P(s4 | s1s2s3, O)⋯P(s8 | s1s2⋯s7, O)

2. Apply Markov assumption


= P(s1 | O)P(s2 | s1, O)P(s3 | s2, O)P(s4 | s3, O)⋯P(s8 | s7, O)

Bayes’ Theorem

P(A | B) = P(B | A) P(A) / P(B)

where,
P(A | B) = Posterior
P(B | A) = Likelihood
P(A) = Prior
P(B) = Normalizing constant

Formulation of HMM
3. Apply Bayes’ theorem
S* = argmaxS P(S | O) = argmaxS P(O | S) . P(S)

Prior
P(S) = P(s1) . P(s2|s1) . P(s3|s2) ... P(s8|s7)

Likelihood
P(O | S) = P(o1 | S) P(o2 | o1, S) P(o3 | o2, S) P(o4 | o3, S) … P(o8 | o7 , S)
4. Ball withdrawal depends on the current bag/state only.

P(O | S) = P(o1 | s1) P(o2 | s2) P(o3 | s3) P(o4 | s4) … P(o8 | s8)

Putting prior and likelihood together

P(S | O) ∝ P(O | S) . P(S)
         = P(o1 | s1) P(o2 | s2) P(o3 | s3) P(o4 | s4) … P(o8 | s8) . P(s1) . P(s2|s1) . P(s3|s2) ... P(s8|s7)
Formulation of HMM

Observation:  o0=ε  o1=R  o2=R  o3=G  o4=G  o5=B  o6=R  o7=G  o8=R
State:        s0    s1    s2    s3    s4    s5    s6    s7    s8    s9

We introduce two new states, s0 and s9, to represent the initial and final states; the ε symbol represents the start of the observation.

P(S | O) = [P(o0 | s0) P(s1|s0)] .
           [P(o1 | s1) P(s2|s1)] .
           [P(o2 | s2) P(s3|s2)] .
           [P(o3 | s3) P(s4|s3)] .
           [P(o4 | s4) P(s5|s4)] .
           [P(o5 | s5) P(s6|s5)] .
           [P(o6 | s6) P(s7|s6)] .
           [P(o7 | s7) P(s8|s7)] .
           [P(o8 | s8) P(s9|s8)]

Compactly,
P(S | O) = ∏k=0..8 P(ok | sk) P(sk+1|sk) = ∏k=0..8 P(sk →ok sk+1)

where,
P(s9|s8) = 1 is the transition probability from the state of the last observation s8 to the final state s9,
P(s1|s0) is the initial transition probability, and
P(o0 | s0) is the initial emission probability.
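A tiny sketch of this factorised score on the bag example. The '^' symbol stands in for s0 with the earlier assumption P(B1 | ^) = 1, and the final transition to s9 is taken as 1, so it is dropped:

# Sketch: score one candidate state sequence S against an observation sequence O
# as a product of transition and emission probabilities, P(sk | sk-1) * P(ok | sk).
trans = {"^":  {"B1": 1.0, "B2": 0.0, "B3": 0.0},
         "B1": {"B1": 0.1, "B2": 0.4, "B3": 0.5},
         "B2": {"B1": 0.6, "B2": 0.2, "B3": 0.2},
         "B3": {"B1": 0.3, "B2": 0.4, "B3": 0.3}}
emit = {"B1": {"R": 0.3, "G": 0.5, "B": 0.2},
        "B2": {"R": 0.1, "G": 0.4, "B": 0.5},
        "B3": {"R": 0.6, "G": 0.1, "B": 0.3}}

def score(states, obs):
    p, prev = 1.0, "^"                 # '^' plays the role of the initial state s0
    for s, o in zip(states, obs):
        p *= trans[prev][s] * emit[s][o]
        prev = s
    return p

print(score(["B1", "B3", "B2"], ["R", "R", "G"]))   # 0.3 * (0.5*0.6) * (0.4*0.4) = 0.0144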
Decoding a state sequence

Automaton: states S1 and S2, with a start state S0 and an ε-transition S0 → S1 of probability 1.0 (S0 → S2 has probability 0.0). Each arc carries a symbol and a combined transition-emission probability:

from S1:   S1 → S1: a 0.1, b 0.2     S1 → S2: a 0.3, b 0.4
from S2:   S2 → S1: a 0.2, b 0.3     S2 → S2: a 0.3, b 0.2

Observation sequence: a b a b

Viterbi trellis (at every step, keep only the best score for each state):

→ a:  S1: 1 × 0.1 = 0.1                            S2: 1 × 0.3 = 0.3
→ b:  S1: max(0.1 × 0.2, 0.3 × 0.3) = 0.09         S2: max(0.1 × 0.4, 0.3 × 0.2) = 0.06
→ a:  S1: max(0.09 × 0.1, 0.06 × 0.2) = 0.012      S2: max(0.09 × 0.3, 0.06 × 0.3) = 0.027
→ b:  S1: max(0.012 × 0.2, 0.027 × 0.3) = 0.0081   S2: max(0.012 × 0.4, 0.027 × 0.2) = 0.0054

The best final score is 0.0081, ending in S1; backtracking the winning arcs gives the decoded state sequence S1 → S2 → S1 → S2 → S1.
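A minimal Viterbi sketch for this automaton (probabilities as above; the dictionary layout and function name are illustrative, not from the slides):

# Minimal Viterbi sketch: arc[s][s_next][symbol] = combined transition-emission probability.
arc = {
    "S1": {"S1": {"a": 0.1, "b": 0.2}, "S2": {"a": 0.3, "b": 0.4}},
    "S2": {"S1": {"a": 0.2, "b": 0.3}, "S2": {"a": 0.3, "b": 0.2}},
}

def viterbi(obs, start="S1"):
    # best[s] = (score, path) of the best path that ends in state s
    best = {start: (1.0, [start])}
    for sym in obs:
        new_best = {}
        for s, (score, path) in best.items():
            for nxt, probs in arc[s].items():
                cand = score * probs[sym]
                if nxt not in new_best or cand > new_best[nxt][0]:
                    new_best[nxt] = (cand, path + [nxt])
        best = new_best
    return max(best.values(), key=lambda sp: sp[0])   # highest-scoring final state

print(viterbi(["a", "b", "a", "b"]))
# -> (0.0081, ['S1', 'S2', 'S1', 'S2', 'S1'])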
Summary of HMM
● Problem definition:
○ S* = argmaxS P(S | O, M) where,
■ S → State sequence,
■ O → Observation sequence,
■ M → Model
● M = [Q, Q0, A, T] where,
○ Q → Set of states,
○ Q0 → Start state,
○ A → Alphabet, and
○ T → Transition function, which gives P(si →ak sj) ∀ i, j, k
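One possible way to package M in code; this is only a sketch, and the class and field names are illustrative rather than anything defined on the slides:

# Sketch of the model M = [Q, Q0, A, T] as a plain data structure.
from dataclasses import dataclass

@dataclass
class HMM:
    Q: list            # set of states, e.g. ["B1", "B2", "B3"] or PoS tags
    Q0: str            # start state
    A: list            # alphabet of observation symbols (ball colours, words)
    T: dict            # T[si][ak][sj] = P(si --ak--> sj), transition with emission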

Computing P(.) values for PoS tagging
• States (sk) are the tags (e.g., NN, JJ, VB), and
• Observations (ok) are the words in a sentence.

P(S | O) = ∏k P(ok | sk) P(sk+1|sk)
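The two probability tables can be estimated from a tagged corpus by simple counting. A rough sketch, assuming NLTK's Brown corpus is available; the pseudo-tags <s> and </s> stand in for the initial and final states, and smoothing of unseen events is omitted:

# Sketch: maximum-likelihood estimates of the transition and emission tables.
from collections import Counter, defaultdict
from nltk.corpus import brown

trans_counts = defaultdict(Counter)    # tag -> Counter of next tags
emit_counts = defaultdict(Counter)     # tag -> Counter of words

for sent in brown.tagged_sents(tagset="universal"):
    prev = "<s>"                       # start-of-sentence pseudo-tag (plays the role of s0)
    for word, tag in sent:
        trans_counts[prev][tag] += 1
        emit_counts[tag][word.lower()] += 1
        prev = tag
    trans_counts[prev]["</s>"] += 1    # end-of-sentence pseudo-tag (final state)

def p_trans(t_next, t_prev):
    return trans_counts[t_prev][t_next] / sum(trans_counts[t_prev].values())

def p_emit(word, tag):
    return emit_counts[tag][word.lower()] / sum(emit_counts[tag].values())

print(p_trans("NOUN", "DET"), p_emit("apple", "NOUN"))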

Associated tasks with HMM
● Given an automaton and an observation sequence, find the best possible state sequence
○ S* = ?
○ Viterbi algorithm

● Given an automaton, find the probability of an observation sequence
○ P(O) = ?
○ Forward algorithm (see the sketch after this list): F(k, i) = P(o1 o2 o3 ⋯ ok, si) = ∑j=0..N F(k − 1, j) . P(sj →ok si)
○ Probability of being in state si having seen o1 o2 o3 ⋯ ok
○ Backward algorithm: B(k, i) = P(ok ok+1 ok+2 ⋯ om, si) = ∑j=0..N B(k + 1, j) . P(si →ok sj)
○ Probability of seeing ok ok+1 ok+2 ⋯ om given that the state was si

● Given the observation sequence, find the HMM parameters
○ M(T) = ?
○ Baum-Welch algorithm or Forward-Backward algorithm: EM
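A rough sketch of the forward algorithm on the bag example. It follows the trellis convention used earlier (the emission is scored at the state being entered), reuses the trans/emit tables from the slide, and assumes the process starts in B1; it is illustrative rather than the slides' exact formulation:

# Sketch: forward algorithm, summing (rather than maximising) over paths.
trans = {"B1": {"B1": 0.1, "B2": 0.4, "B3": 0.5},
         "B2": {"B1": 0.6, "B2": 0.2, "B3": 0.2},
         "B3": {"B1": 0.3, "B2": 0.4, "B3": 0.3}}
emit = {"B1": {"R": 0.3, "G": 0.5, "B": 0.2},
        "B2": {"R": 0.1, "G": 0.4, "B": 0.5},
        "B3": {"R": 0.6, "G": 0.1, "B": 0.3}}

def forward(obs, start="B1"):
    states = list(trans)
    # F[si] = P(o1 ... ok, si): probability of the prefix seen so far, ending in si
    F = {s: 0.0 for s in states}
    F[start] = 1.0
    for k, o in enumerate(obs):
        if k == 0:
            F = {s: F[s] * emit[s][o] for s in states}       # first ball drawn in the start bag
        else:
            F = {si: sum(F[sj] * trans[sj][si] * emit[si][o] for sj in states)
                 for si in states}
    return sum(F.values())                                   # P(O)

print(forward(["R", "R", "G"]))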
Thanks
