0% found this document useful (0 votes)

32 views52 pages

NLP04 PartOfSpeechTagging

The document discusses part-of-speech tagging, including motivation for its use in applications like speech synthesis, machine translation, syntactic parsing and information extraction. It covers topics like POS tagsets, ambiguity, baseline methods, rule-based and statistical tagging using hidden Markov models.

Uploaded by

ipn7 birta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views52 pages

NLP04 PartOfSpeechTagging

Uploaded by

ipn7 birta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 52

Natural Language Processing

SoSe 2017

Part-of-speech tagging

Dr. Mariana Neves May 22nd, 2017

Part-of-Speech (POS) Tags

● Also known as:

– Part-of-speech tags, lexical categories, word classes,
morphological classes, lexical tags

Plays[VERB] well[ADVERB] with[PREPOSITION] others[NOUN]

Plays[VBZ] well[RB] with[IN] others[NNS]

2
Examples of POS tags

●
Noun: book/books, nature, Germany, Sony
●
Verb: eat, wrote
●
Auxiliary: can, should, have
●
Adjective: new, newer, newest
●
Adverb: well, urgently
●
Number: 872, two, first
●
Article/Determiner: the, some
●
Conjuction: and, or
●
Pronoun: he, my
●
Preposition: to, in
●
Particle: off, up
●
Interjection: Ow, Eh

3
Motivation: Speech Synthesis

●
Word „content“
– „Eggs have a high protein content.“
– „She was content to step down after four years as chief
executive.“

4 (http://www.thefreedictionary.com/content)
Motivation: Machine Translation

● e.g., translation from English to German:

– „I like ...“
● „Ich mag ….“ (verb)
● „Ich wie ...“ (preposition)

5
Motivation: Syntactic parsing

6
(http://nlp.stanford.edu:8080/parser/index.jsp)
Motivation: Information extraction

● Named-entity recognition (usually nouns)

7 (http://www.nactem.ac.uk/tsujii/GENIA/tagger/)
Motivation: Information extraction

● Relation extraction (triggers are usually verbs)

8 (http://www.nactem.ac.uk/tsujii/GENIA/tagger/)
Open vs. Closed Classes

● Closed
– limited number of words, do not grow usually
– e.g., Auxiliary, Article, Determiner, Conjuction, Pronoun,
Preposition, Particle, Interjection

● Open
– unlimited number of words
– e.g., Noun, Verb, Adverb, Adjective

9
POS Tagsets

●
There are many parts of speech tagsets
●
Tag types
– Coarse-grained
●
Noun, verb, adjective, ...
– Fine-grained
●
noun-proper-singular, noun-proper-plural, noun-
common-mass, ..
●
verb-past, verb-present-3rd, verb-base, ...
●
adjective-simple, adjective-comparative, ...

10
POS Tagsets

●
Brown tagset (87 tags)
– Brown corpus

●
C5 tagset (61 tags)
●
C7 tagset (146 tags!)

●
Penn TreeBank (45 tags) – most used
– A large annotated corpus of English tagset

11
POS Tagging

●
The process of assigning a part of speech to each word in a
text

●
Challenge: words often have more than one POS
– On my back[NN] (noun)
– The back[JJ] door (adjective)
– Win the voters back[RB] (adverb)
– Promised to back[VB] the bill (verb)

12
Ambiguity in POS tags

●
45-tags Brown corpus (word types)
– Unambiguous (1 tag): 38,857
– Ambiguous: 8,844
●
2 tags: 6,731
●
3 tags: 1,621
●
4 tags: 357
●
5 tags: 90
●
6 tags: 32
●
7 tags: 6 (well, set, round, open, fit, down)
●
8 tags: 4 ('s, half, back, a)
●
9 tags: 3 (that, more, in)

13
Baseline method

1. Tagging unambiguous words with the correct label

2. Tagging ambiguous words with their most frequent label
3. Tagging unknown words as a noun

●
This method performs around 90% precision

14
POS Tagging

●
The process of assigning a POS tag to each word in a text.
Choosing the best candidate tag for each word.

– Plays (NNS/VBZ)
– well (UH/JJ/NN/RB)
– with (IN)
– others (NNS)

– Plays[VBZ] well[RB] with[IN] others[NNS]

15
Rule-Based Tagging

●
Standard approach (two steps):
1. Dictionaries to assign a list of potential tags
●
Plays (NNS/VBZ)
●
well (UH/JJ/NN/RB)
●
with (IN)
●
others (NNS)
2. Hand-written rules to restrict to a POS tag
●
Plays (VBZ)
●
well (RB)
●
with (IN)
●
others (NNS)

16
Rule-Based Tagging

●
Some approaches rely on morphological parsing
– e.g., EngCG Tagger below

….

17
(http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.57.972&rep=rep1&type=pdf)
Sequential modeling

● Many of the NLP techniques should deal with data represented

as sequence of items
– Characters, Words, Phrases, Lines, …

● e.g., for part-of-speech tagging

– I[PRP] saw[VBP] the[DT] man[NN] on[IN] the[DT] roof[NN] .

● e.g., for named-entity recognition

– Steven[PER] Paul[PER] Jobs[PER] ,[O] co-founder[O] of[O]
Apple[ORG] Inc[ORG] ,[O] was[O] born[O] in[O] California[LOC].

18
Sequential modeling

●
Making a decision based on:
– Current Observation:
● Word (W0): „35-years-old“
●
Prefix, Suffix: „computation“  „comp“, „ation“
●
Lowercased word: „New“  „new“
●
Word shape: „35-years-old“  „d-a-a“
– Surrounding observations
● Words (W+1, W−1)
– Previous decisions
● POS tags (T−1, T−2)

19
Sequential modeling

●
Greedy inference
– Start in the beginning of the sequence
– Assign a label to each item using the classifier
– Using previous decisions as well as the observed data

20
Sequential modeling

●
Beam inference
– Keeping the top k labels in each position
– Extending each sequence in each local way
– Finding the best k labels for the next position

21
Hidden Markov Model (HMM)

● Finding the best sequence of tags (t1 ...tn ) that corresponds to

the sequence of observations (w1 ...wn )

●
Probabilistic View
– Considering all possible sequences of tags
– Choosing the tag sequence from this universe of
sequences, which is most probable given the observation
sequence

t̂1n =argmax t P (t 1n∣w n1 )

n
1

22
Using the Bayes Rule

t̂1n=argmax t P (t 1n∣w 1n )
n
1

P ( B∣A)⋅P ( A)
P ( A∣B)=
P (B )

n n P (w n1∣t n1 )⋅P (t n1 )
P (t ∣w )=
1 1
P (w n1 )

t̂1n =argmax t P (w n1∣t 1n )⋅P (t n1 )

n
1

likelihood prior probability

23
Using Markov Assumption

t̂1n =argmax t P (w n1∣t 1n )⋅P (t n1 )

n
1

n
P (w n1∣t n1 )≃ i=1 ∏ P (w i∣t i ) (it depends only on its POS tag and independent of other words)

n
P(t 1n )≃ i=1 ∏ P (t i∣t i−1) (it depends only on the previous POS tag, thus, bigram)

n
t̂1n =argmax t n
1 i=1 ∏ P (wi∣t i )⋅P (t i∣t i −1)

24
Two Probabilities

● The tag transition probabilities: P(ti|ti−1)

– Finding the likelihood of a tag to proceed by another tag
– Similar to the normal bigram model

C (t i −1 , t i )
P (t i∣t i−1 )=
C (t i −1 )

25
Two Probabilities

● The word likelihood probabilities: P(wi|ti)

– Finding the likelihood of a word to appear given a tag

C (t i , w i )
P (w i∣t i )=
C (t i )

26
Two Probabilities

I[PRP] saw[VBP] the[DT] man[NN?] on[] the[] roof[] .

C ([ DT ] ,[ NN ])
P([ NN ]∣[ DT ])=
C ([ DT ])

C ([ NN ] , man)
P(man∣[ NN ])=
C ([ NN ])

27
Ambiguity in POS tagging

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[VB] tomorrow[NR] .

People[NNS] inquire[VB] the[DT] reason[NN] for[IN] the[DT] race[NN] .

28
Ambiguity

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[?] tomorrow[NR] .

NNP VBZ VBN TO VB NR

Secretariat is expected to race tomorrow

NNP VBZ VBN TO NN NR

Secretariat is expected to race tomorrow

29
Ambiguity

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[VB] tomorrow[NR] .

NNP VBZ VBN TO VB NR

Secretariat is expected to race tomorrow

30
Ambiguity

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[VB] tomorrow[NR] .

NNP VBZ VBN TO NN NR

Secretariat is expected to race tomorrow

31
Viterbi algorithm

● Decoding algorithm for HMM

– Determine the best sequence of POS tags

● Probability matrix
– Columns corresponding to inputs (words)
– Rows corresponding to possible states (POS tags)

32
Viterbi algorithm

1. Move through the matrix in one pass filling the columns left to
right using the transition probabilities and observation
probabilities
2. Store the max probability path to each cell (not all paths) using
dynamic programming

33
qend end

q4 NN

q3 TO

q2 VB

q1 PPSS

q0 start

i want to race

34
Q1 Q2 Q3 Q4
qend end

vt-1: previous Viterbi path probability

q4 NN
(from the previous time step)

q3 TO

q2 VB

q1 PPSS

q0 start

V0(0)=1.0
i want to race

35
Q1 Q2 Q3 Q4
qend end end

aij: transition probability

q4 NN NN (from previous state qi to current state qj)
P(NN|start)·P(start) bj(ot): state observation likelihood
.0041·1.0=0.0041
(observation ot given the current state j)
q3 TO TO
P(TO|start)·P(start)
.0043·1.0=0.0043

q2 VB VB
P(VB|start)·P(start)
.019·1.0=0.019
q1 PPSS PPSS
P(PPSS|start)·P(start)
.067·1.0=0.067
q0 start start
v0(0)=1.0
i want to race

36
Q1 Q2 Q3 Q4
N
qend end end v t ( j )=max v t −1 (i)⋅a ij⋅b j (ot )
i=1

q4 NN NN
v1(4)=P(PPSS|start)·P(start)·P(I|NN)=0.041·0=0

q3 TO TO
v1(3)=P(PPSS|start)·P(start)·P(I|TO)=0.043·0=0

q2 VB VB
v1(2)=P(PPSS|start)·P(start)·P(I|VB)=0.019·0=0

q1 PPSS PPSS
v1(1)=P(PPSS|start)·P(start)·P(I|PPSS)=0.067·0.37=0.025

q0 start start

i want to race

37
Q1 Q2 Q3 Q4
qend end end end

q4 NN NN NN
v1(4)·P(VB|NN)
0·.0040=0
q3 TO TO TO
v1(3)·P(VB|TO)
0·.83=0
v1(2)·P(VB|VB)
q2 VB VB VB
0·.0038=0
v1(1)·P(VB|PPSS)
.025·.23=0.0055
q1 PPSS PPSS PPSS

q0 start start start

i want to race

38
Q1 Q2 Q3 Q4
qend end end end

q4 NN NN NN

q3 TO TO TO

q2 VB VB VB
v2(2)=max(0,0,0,.0055)·.0093=.000051

q1 PPSS PPSS PPSS

q0 start start start

i want to race

39
Q1 Q2 Q3 Q4
qend end end end

q4 NN NN NN

q3 TO TO TO

q2 VB VB VB

q1 PPSS PPSS PPSS

q0 start start start

i want to race

40
Q1 Q2 Q3 Q4
qend end end end end end end

q4 NN NN NN NN NN NN

q3 TO TO TO TO TO TO

q2 VB VB VB VB VB VB

q1 PPSS PPSS PPSS PPSS PPSS PPSS

q0 start start start start start start

i want to race

41
Q1 Q2 Q3 Q4
qend end end end end end end

q4 NN NN NN NN NN NN

q3 TO TO TO TO TO TO

q2 VB VB VB VB VB VB

q1 PPSS PPSS PPSS PPSS PPSS PPSS

q0 start start start start start start

i want to race

42
Q1 Q2 Q3 Q4
POS tagging using machine learning

● Classification problem (token by token) using a rich set of

features

(https://link.springer.com/chapter/10.1007/11573036_36)

43
POS tagging using
neural networks

● e.g., using Bidirectional Long

Short-Term Memory
Recurrent Neural Network
(bi-LSTM)
● Input based on tokens,
characters and bytes

44 (https://www.aclweb.org/anthology/P/P16/P16-2067.pdf)
Evaluation

● Corpus
– Training and test, and optionally also development set
– Training (cross-validation) and test set

● Evaluation
– Comparison of gold standard (GS) and predicted tags
– Evaluation in terms of Precision, Recall and F-Measure

45
Precision and Recall

●
Precision:
– Amount of labeled items which are correct

tp
Precision=
tp+ fp

●
Recall:
– Amount of correct items which have been labeled

tp
Recall =
tp+ fn

46
F-Measure

●
There is a strong anti-correlation between precision and recall
●
Having a trade off between these two metrics
●
Using F-measure to consider both metrics together
●
F -measure is a weighted harmonic mean of precision and
recall

(β2 +1) P R
F= 2
β P+R

47
Error Analysis

●
Confusion matrix or contingency table
– Percentage of overall tagging error

IN JJ NN NNP RB VBD VBN

IN - .2 .7
JJ .2 - 3.3 2.1 1.7 .2 2.7
NN 8.7 - .2
NNP .2 3.3 4.1 - .2
RB 2.2 2.0 .5 -
VBD .3 .5 - 4.4
VBN 2.8 2.6

48
Summary

● POS tagging and tagsets

● Rule-based algorithms
● Sequential algorithms
● Neural networks
● Evaluation (P,R,FM)

49
Tools for POS tagging

●
Spacy: https://spacy.io/
●
OpenNLP: https://opennlp.apache.org/
●
Stanford CoreNLP: https://stanfordnlp.github.io/CoreNLP/
●
NLTK Python: http://www.nltk.org/
●
and others...

50
Further reading

●
Book Jurafski & Martin
– Chapter 5

51
Exercise

●
Project: choose a POS tagger and use it in your project.
– Can POS tags support your task?

Pos Tagging Pushpak
No ratings yet
Pos Tagging Pushpak
88 pages
Lec 10
No ratings yet
Lec 10
77 pages
Lec8 - Bayesian Network II
No ratings yet
Lec8 - Bayesian Network II
50 pages
AI-Artificial Intelligence Akash
100% (1)
AI-Artificial Intelligence Akash
64 pages
05 Ar 4
No ratings yet
05 Ar 4
145 pages
Final Solution
No ratings yet
Final Solution
12 pages
Lecture 04
No ratings yet
Lecture 04
42 pages
ECE 368 Course Review: Probabilistic Reasoning 2023
No ratings yet
ECE 368 Course Review: Probabilistic Reasoning 2023
138 pages
L4 Tagging
No ratings yet
L4 Tagging
107 pages
Communication Skills Notes Class X IT
0% (1)
Communication Skills Notes Class X IT
4 pages
Std. 8th Med. English First Language and Mathematics
No ratings yet
Std. 8th Med. English First Language and Mathematics
11 pages
Lecture05-Hmm Pos Tagging
No ratings yet
Lecture05-Hmm Pos Tagging
38 pages
English Grammar Book-Final - 2-5-21
No ratings yet
English Grammar Book-Final - 2-5-21
42 pages
PoSTagging-HMM
No ratings yet
PoSTagging-HMM
24 pages
Theory Test I PDF
No ratings yet
Theory Test I PDF
11 pages
AI Assignment Detailed Solutions
No ratings yet
AI Assignment Detailed Solutions
5 pages
HMM Isolated Word Recognition
No ratings yet
HMM Isolated Word Recognition
23 pages
ACTIVITIESSSA
No ratings yet
ACTIVITIESSSA
11 pages
E-Book - English Grammar Practice Worksheets - CBSE-ICSE-All Indian Boards - Advt - 3
100% (1)
E-Book - English Grammar Practice Worksheets - CBSE-ICSE-All Indian Boards - Advt - 3
18 pages
NLP 4
No ratings yet
NLP 4
83 pages
VTU Exam Question Paper With Solution of 18MCA53 Machine Learning Feb-2022-Dr - Gnaneswari
No ratings yet
VTU Exam Question Paper With Solution of 18MCA53 Machine Learning Feb-2022-Dr - Gnaneswari
27 pages
Solution and Result
No ratings yet
Solution and Result
9 pages
6.2 NLP
No ratings yet
6.2 NLP
20 pages
CSCI 5832 Natural Language Processing: Jim Martin
No ratings yet
CSCI 5832 Natural Language Processing: Jim Martin
47 pages
A Primer in Chinese Buddhist Writings 1
100% (2)
A Primer in Chinese Buddhist Writings 1
157 pages
Artificial Intelligence, NLP
No ratings yet
Artificial Intelligence, NLP
6 pages
cs188 Su19 Final - Sol
No ratings yet
cs188 Su19 Final - Sol
29 pages
NLP Assignment 5
No ratings yet
NLP Assignment 5
5 pages
EIS - VGU - 24s - Exercises 2024-04-22 Solution
No ratings yet
EIS - VGU - 24s - Exercises 2024-04-22 Solution
10 pages
19CSE453 - Natural Language Processing: Part of Speech Tagging
No ratings yet
19CSE453 - Natural Language Processing: Part of Speech Tagging
59 pages
Equation Sheet
No ratings yet
Equation Sheet
4 pages
2 cs626 Pos Tagging Week of 1aug22
No ratings yet
2 cs626 Pos Tagging Week of 1aug22
57 pages
This Is AI4001: GCR: t37g47w
No ratings yet
This Is AI4001: GCR: t37g47w
51 pages
Lec PoS Tagging 2022
No ratings yet
Lec PoS Tagging 2022
67 pages
4MS Sequence 1 Plan by BERRAHAL AHLEM
No ratings yet
4MS Sequence 1 Plan by BERRAHAL AHLEM
5 pages
Key Dec 2022
No ratings yet
Key Dec 2022
9 pages
AI Assignment Complete
No ratings yet
AI Assignment Complete
3 pages
Roark - Lec 2 - HMM Viterbi Forward
No ratings yet
Roark - Lec 2 - HMM Viterbi Forward
37 pages
5 Sequence Learning
No ratings yet
5 Sequence Learning
50 pages
Tutorial I
No ratings yet
Tutorial I
6 pages
Over Fitting and TBL
No ratings yet
Over Fitting and TBL
46 pages
Hidden Markov Models in Speech Recognition: Wayne Ward
No ratings yet
Hidden Markov Models in Speech Recognition: Wayne Ward
35 pages
Using MALLET For Conditional Random Fields: Matthew Michelson & Craig A. Knoblock CSCI 548 - Lecture 3
No ratings yet
Using MALLET For Conditional Random Fields: Matthew Michelson & Craig A. Knoblock CSCI 548 - Lecture 3
41 pages
CSE3008
No ratings yet
CSE3008
30 pages
Grammar Translation Method
100% (1)
Grammar Translation Method
2 pages
Unit 4. Conditional Clauses and Future Time Clauses
100% (1)
Unit 4. Conditional Clauses and Future Time Clauses
4 pages
A Concept of Limits
From Everand
A Concept of Limits
Donald W. Hight
4/5 (4)
Model With One-Word Context: 2vec 2vec 2vec 2vec
100% (1)
Model With One-Word Context: 2vec 2vec 2vec 2vec
17 pages
Assignment 3
No ratings yet
Assignment 3
12 pages
CS 224n Assignment #2: Word2Vec and Dependency Parsing
No ratings yet
CS 224n Assignment #2: Word2Vec and Dependency Parsing
10 pages
Noun Phrase Extraction: A Description of Current Techniques
No ratings yet
Noun Phrase Extraction: A Description of Current Techniques
36 pages
NLP Final
No ratings yet
NLP Final
11 pages
CT-1 Sem6
No ratings yet
CT-1 Sem6
6 pages
NLP Assignment 5
No ratings yet
NLP Assignment 5
5 pages
Probabilistic Robotics Exam, Spring 2012: 4 Points
No ratings yet
Probabilistic Robotics Exam, Spring 2012: 4 Points
7 pages
Unit 1 ML
No ratings yet
Unit 1 ML
60 pages
CSCI 5832 Natural Language Processing: Jim Martin
No ratings yet
CSCI 5832 Natural Language Processing: Jim Martin
46 pages
Comparative Typology Midterm Test
No ratings yet
Comparative Typology Midterm Test
10 pages
Bahl, Cocke, Jelinek and Raviv (BCJR) Algorithm: Markov Source Discrete Memoryles S Channel Receiver
No ratings yet
Bahl, Cocke, Jelinek and Raviv (BCJR) Algorithm: Markov Source Discrete Memoryles S Channel Receiver
12 pages
Crib Sheet
No ratings yet
Crib Sheet
2 pages
Common Irregular Verbs
No ratings yet
Common Irregular Verbs
2 pages
Assignment On Morphology
No ratings yet
Assignment On Morphology
2 pages
Mitchell Machine Learning
No ratings yet
Mitchell Machine Learning
37 pages
Hidden Markov Models: Ts. Nguyễn Văn Vinh Bộ môn KHMT, Trường ĐHCN, ĐH QG Hà nội
No ratings yet
Hidden Markov Models: Ts. Nguyễn Văn Vinh Bộ môn KHMT, Trường ĐHCN, ĐH QG Hà nội
51 pages
Libro English B1 Threshold Unit 9
No ratings yet
Libro English B1 Threshold Unit 9
12 pages
Curriculum Vitae (CV) : 1. Personal Information
No ratings yet
Curriculum Vitae (CV) : 1. Personal Information
3 pages
Amber Strack - Proofreading Practice
No ratings yet
Amber Strack - Proofreading Practice
4 pages
Midterm 2: CS 188 Summer 2019 Introduction To Artificial Intelligence
No ratings yet
Midterm 2: CS 188 Summer 2019 Introduction To Artificial Intelligence
11 pages
Sping 2009 Final
No ratings yet
Sping 2009 Final
15 pages
Faculty of Engineering: - Answer Any Four Full Questions Missing Data, If Any, May Be Assumed Suitably. 1. (A)
No ratings yet
Faculty of Engineering: - Answer Any Four Full Questions Missing Data, If Any, May Be Assumed Suitably. 1. (A)
2 pages
Practice Final CS61c
No ratings yet
Practice Final CS61c
19 pages
World of Self, Family and Friends Free Time Listening Listening 4 Sunday
No ratings yet
World of Self, Family and Friends Free Time Listening Listening 4 Sunday
8 pages
CS 540: Introduction To Artificial Intelligence: Final Exam: 12:25-2:25pm, December 17, 2014 Room 132 Noland
No ratings yet
CS 540: Introduction To Artificial Intelligence: Final Exam: 12:25-2:25pm, December 17, 2014 Room 132 Noland
11 pages
Applying Deep Learning To Answer Selection - A Study and An Open Task
No ratings yet
Applying Deep Learning To Answer Selection - A Study and An Open Task
8 pages
IELTS : From Failure To Success
From Everand
IELTS : From Failure To Success
YASH AKBARI
No ratings yet
Second Term Exam 6e
No ratings yet
Second Term Exam 6e
2 pages
Language Situation and Language Policy In-Education in Zimbabwe: A Perspective Towards Tonga Learners
No ratings yet
Language Situation and Language Policy In-Education in Zimbabwe: A Perspective Towards Tonga Learners
15 pages
Gradable Non Gradable Adjective Pratic Sheet With Sloution
No ratings yet
Gradable Non Gradable Adjective Pratic Sheet With Sloution
5 pages
Types of Adverbs
No ratings yet
Types of Adverbs
4 pages
Lexical Processing in Children and Adults During Word Copying
No ratings yet
Lexical Processing in Children and Adults During Word Copying
17 pages
Hard and Soft C and G Phon - 3-5 - L4
No ratings yet
Hard and Soft C and G Phon - 3-5 - L4
7 pages
Eng Y4 SK-1
No ratings yet
Eng Y4 SK-1
62 pages
J 00018 Paper I Set P With Key
No ratings yet
J 00018 Paper I Set P With Key
44 pages
They Are Made in Indonesia
No ratings yet
They Are Made in Indonesia
10 pages
50 Python Concepts Every Developer Should Know
From Everand
50 Python Concepts Every Developer Should Know
Hernando Abella
No ratings yet
LDC - Model Questions and Answers-11
No ratings yet
LDC - Model Questions and Answers-11
3 pages
Narrative: Term 2 Objectives/ Non-Negotiables
No ratings yet
Narrative: Term 2 Objectives/ Non-Negotiables
4 pages
00 Spanish Battle Plans All PDF
No ratings yet
00 Spanish Battle Plans All PDF
12 pages
Universidad Abierta para Adultos: (Uapa)
No ratings yet
Universidad Abierta para Adultos: (Uapa)
5 pages
Six Approaches To Stylistic Analysis - W
No ratings yet
Six Approaches To Stylistic Analysis - W
5 pages

NLP04 PartOfSpeechTagging

Uploaded by

NLP04 PartOfSpeechTagging

Uploaded by

Natural Language Processing

Dr. Mariana Neves May 22nd, 2017

● Also known as:

Plays[VERB] well[ADVERB] with[PREPOSITION] others[NOUN]

Plays[VBZ] well[RB] with[IN] others[NNS]

● e.g., translation from English to German:

● Named-entity recognition (usually nouns)

● Relation extraction (triggers are usually verbs)

1. Tagging unambiguous words with the correct label

– Plays[VBZ] well[RB] with[IN] others[NNS]

● Many of the NLP techniques should deal with data represented

● e.g., for part-of-speech tagging

● e.g., for named-entity recognition

● Finding the best sequence of tags (t1 ...tn ) that corresponds to

t̂1n =argmax t P (t 1n∣w n1 )

t̂1n =argmax t P (w n1∣t 1n )⋅P (t n1 )

likelihood prior probability

t̂1n =argmax t P (w n1∣t 1n )⋅P (t n1 )

● The tag transition probabilities: P(ti|ti−1)

● The word likelihood probabilities: P(wi|ti)

I[PRP] saw[VBP] the[DT] man[NN?] on[] the[] roof[] .

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[VB] tomorrow[NR] .

People[NNS] inquire[VB] the[DT] reason[NN] for[IN] the[DT] race[NN] .

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[?] tomorrow[NR] .

NNP VBZ VBN TO VB NR

Secretariat is expected to race tomorrow

NNP VBZ VBN TO NN NR

Secretariat is expected to race tomorrow

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[VB] tomorrow[NR] .

NNP VBZ VBN TO VB NR

Secretariat is expected to race tomorrow

Secretariat[NNP] is[VBZ] expected[VBN] to[TO] race[VB] tomorrow[NR] .

NNP VBZ VBN TO NN NR

Secretariat is expected to race tomorrow

● Decoding algorithm for HMM

vt-1: previous Viterbi path probability

aij: transition probability

q0 start start start

q1 PPSS PPSS PPSS

q0 start start start

q1 PPSS PPSS PPSS

q0 start start start

q1 PPSS PPSS PPSS PPSS PPSS PPSS

q0 start start start start start start

q1 PPSS PPSS PPSS PPSS PPSS PPSS

q0 start start start start start start

● Classification problem (token by token) using a rich set of

● e.g., using Bidirectional Long

IN JJ NN NNP RB VBD VBN

● POS tagging and tagsets

You might also like