Lecture 20-23 Part of Speech Tagging
Lecture 20-23 Part of Speech Tagging
𝑝 𝑥1 … 𝑥𝑛 𝑦𝑖 … 𝑦𝑛+1 = ෑ 𝑞 𝑦𝑖 ෑ 𝑒 𝑥𝑖 𝑦𝑖 )
𝑖=1 𝑖=1
𝑝 𝑥1 … 𝑥𝑛 𝑦𝑖 … 𝑦𝑛+1 = ෑ 𝑞 𝑦𝑖 | 𝑦𝑖−1 ෑ 𝑒 𝑥𝑖 𝑦𝑖 )
𝑖=1 𝑖=1
Unknown Words
• strongest source of information for guessing the part-of-speech of
unknown words is morphology.
• Words that end in -s are likely to be plural nouns (NNS),
• words ending with -ed tend to be past participles (VBN),
• words ending with -able adjectives (JJ),
Unknown Words
• Store for each final letter sequence (word suffixes) of up to 10 letters,
the statistics of the tag it was associated with in training.
• We are thus computing for each suffix of length i the probability of
the tag ti given the suffix letters
= *
Tagging Problem
Let |S| = 50, length of sequence = n = 15
|S|n = 5015
Standard Algorithms for POS Tagging
• HMM (with Viterbi algorithm)
• Neural Sequence Models (RNN, Transformers)
• Large Language Models (like BERT), fine-tuned
• Test sentence: He likes fish rice, tag sequence: PRON VERB NOUN
NOUN
Reading
• Chapter 8, Speech and Language Processing, Third Edition