
The Expectation Maximization (EM) Algorithm

General Idea
▪ Start by devising a noisy channel
  ▪ Any model that predicts the corpus observations via some hidden structure (tags, parses, …)
▪ Initially guess the parameters of the model!
  ▪ An educated guess is best, but random can work
▪ Expectation step: Use current parameters (and observations) to reconstruct hidden structure
▪ Maximization step: Use that hidden structure (and observations) to reestimate parameters
▪ Repeat until convergence! (A minimal loop sketch follows below.)
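A minimal Python sketch of this loop, not from the slides: the E and M steps are passed in as functions because their details depend on the model, and the names run_em, e_step, and m_step are hypothetical.

    # Minimal EM loop sketch; run_em, e_step, and m_step are hypothetical names,
    # and the two step functions must be supplied by the model (HMM, PCFG, clustering, ...).
    def run_em(observations, initial_params, e_step, m_step, n_iterations=50):
        params = initial_params                       # educated or random initial guess
        for _ in range(n_iterations):
            hidden = e_step(observations, params)     # E: reconstruct hidden structure / expected counts
            params = m_step(observations, hidden)     # M: reestimate parameters from that structure
        return params                                 # in practice, stop once params stop changing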
General Idea
[Diagram: the EM cycle. An initial guess of the unknown parameters (probabilities) feeds the E step, which uses the observed structure (words, ice cream) to produce a guess of the unknown hidden structure (tags, parses, weather); the M step then reestimates the parameters from that hidden structure, and the cycle repeats.]
Grammar Reestimation
[Diagram: the reestimation cycle for parsing. E step – the current Grammar drives a PARSER over test sentences; scoring its output against correct test trees gives accuracy, but such gold trees are expensive and/or from the wrong sublanguage. Raw sentences are cheap, plentiful and appropriate, so the parser's output on them serves as training trees. M step – the LEARNER reestimates the Grammar from those training trees.]
EM by Dynamic Programming: Two Versions

▪ The Viterbi approximation
  ▪ Expectation: pick the best parse of each sentence
  ▪ Maximization: retrain on this best-parsed corpus
  ▪ Advantage: Speed!
▪ Real EM (why slower?)
  ▪ Expectation: find all parses of each sentence
  ▪ Maximization: retrain on all parses in proportion to their probability (as if we observed fractional counts – sketched below)
  ▪ Advantage: p(training corpus) guaranteed to increase
  ▪ Exponentially many parses, so don’t extract them from the chart – need some kind of clever counting
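To make the fractional counts concrete, here is a toy Python sketch (not from the slides) that enumerates one sentence's parses explicitly and contrasts the two E steps. The parses and probabilities are invented for illustration; real EM computes the same expected counts inside the chart (via inside-outside) instead of enumerating parses.

    from collections import Counter

    # Toy E step for one sentence: each parse is (rules used, probability).
    # These parses and numbers are invented for illustration only.
    parses = [
        (["S -> NP VP", "NP -> stocks", "VP -> V PRT"], 0.09),
        (["S -> AdvP S", "S -> NP VP", "AdvP -> Today", "NP -> stocks", "VP -> V PRT"], 0.01),
    ]

    # Viterbi approximation: count only the single best parse (whole counts).
    best_rules, _ = max(parses, key=lambda p: p[1])
    viterbi_counts = Counter(best_rules)

    # Real EM: count every parse, weighted by its posterior p(parse | sentence).
    total = sum(prob for _, prob in parses)
    fractional_counts = Counter()
    for rules, prob in parses:
        posterior = prob / total                 # this parse's share of one observation
        for rule in rules:
            fractional_counts[rule] += posterior

    print("Viterbi:   ", dict(viterbi_counts))
    print("Fractional:", {r: round(c, 3) for r, c in fractional_counts.items()})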
Examples of EM
▪ Finite-State case: Hidden Markov Models
  ▪ “forward-backward” or “Baum-Welch” algorithm
  ▪ Applications:
    ▪ explain ice cream in terms of underlying weather sequence (sketched below)
    ▪ explain words in terms of underlying tag sequence
    ▪ explain phoneme sequence in terms of underlying word sequence
    ▪ explain sound sequence in terms of underlying phoneme sequence
    ▪ (compose these?)
▪ Context-Free case: Probabilistic CFGs
  ▪ “inside-outside” algorithm: unsupervised grammar learning!
  ▪ Explain raw text in terms of underlying context-free parse
  ▪ In practice, the local maximum problem gets in the way
  ▪ But can improve a good starting grammar via raw text
▪ Clustering case: explain points via clusters
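Below is a minimal sketch of the forward-backward E step for a tiny ice-cream/weather HMM. The transition and emission probabilities are invented for illustration (they are not the course's numbers); the sketch only computes the posterior weather distribution for each day, which Baum-Welch would then turn into expected counts for the M step.

    # Minimal forward-backward E step for a tiny ice-cream/weather HMM.
    # All probabilities below are invented for illustration.
    states = ["Hot", "Cold"]
    start = {"Hot": 0.5, "Cold": 0.5}                         # p(first day's weather)
    trans = {"Hot": {"Hot": 0.8, "Cold": 0.2},                # p(tomorrow | today)
             "Cold": {"Hot": 0.2, "Cold": 0.8}}
    emit = {"Hot": {1: 0.1, 2: 0.2, 3: 0.7},                  # p(# ice creams | weather)
            "Cold": {1: 0.7, 2: 0.2, 3: 0.1}}
    obs = [2, 3, 3, 1, 1]                                     # ice creams eaten each day

    # Forward pass: alpha[t][s] = p(obs[0..t], state_t = s)
    alpha = [{s: start[s] * emit[s][obs[0]] for s in states}]
    for t in range(1, len(obs)):
        alpha.append({s: emit[s][obs[t]] *
                         sum(alpha[t - 1][r] * trans[r][s] for r in states)
                      for s in states})

    # Backward pass: beta[t][s] = p(obs[t+1..] | state_t = s)
    beta = [{} for _ in obs]
    beta[-1] = {s: 1.0 for s in states}
    for t in range(len(obs) - 2, -1, -1):
        beta[t] = {s: sum(trans[s][r] * emit[r][obs[t + 1]] * beta[t + 1][r]
                          for r in states)
                   for s in states}

    # E step output: posterior weather distribution for each day, p(state_t | obs)
    evidence = sum(alpha[-1][s] for s in states)              # p(obs)
    for t in range(len(obs)):
        posterior = {s: alpha[t][s] * beta[t][s] / evidence for s in states}
        print("day", t + 1, {s: round(p, 3) for s, p in posterior.items()})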
Our old friend PCFG

[Parse tree for “time flies like an arrow”: S → NP VP; NP → time; VP → V PP; V → flies; PP → P NP; P → like; NP → Det N; Det → an; N → arrow]

p(time flies like an arrow | S) = p(S → NP VP | S) * p(NP → time | NP)
                                  * p(VP → V PP | VP)
                                  * p(V → flies | V) * …
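A short sketch of how that product is computed over an explicit tree. The rule probabilities below are invented for illustration; only the multiplication over the tree's rules comes from the slide.

    import math

    # Invented rule probabilities p(rule | left-hand side), for illustration only.
    rule_prob = {
        ("S", ("NP", "VP")): 0.6,   ("NP", ("time",)): 0.02,
        ("VP", ("V", "PP")): 0.3,   ("V", ("flies",)): 0.01,
        ("PP", ("P", "NP")): 0.9,   ("P", ("like",)): 0.2,
        ("NP", ("Det", "N")): 0.5,  ("Det", ("an",)): 0.3,
        ("N", ("arrow",)): 0.005,
    }

    # The parse tree above, as nested (label, children) pairs; words have no children.
    tree = ("S", (
        ("NP", (("time", ()),)),
        ("VP", (
            ("V", (("flies", ()),)),
            ("PP", (
                ("P", (("like", ()),)),
                ("NP", (("Det", (("an", ()),)), ("N", (("arrow", ()),)))),
            )),
        )),
    ))

    def tree_prob(node):
        """p(tree | root): multiply the probability of every rule used in the tree."""
        label, children = node
        if not children:                                   # a word: no rule below it
            return 1.0
        rhs = tuple(child[0] for child in children)
        return rule_prob[(label, rhs)] * math.prod(tree_prob(c) for c in children)

    print(tree_prob(tree))   # 0.6 * 0.02 * 0.3 * 0.01 * 0.9 * 0.2 * 0.5 * 0.3 * 0.005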
Viterbi reestimation for parsing

▪ Start with a “pretty good” grammar
  ▪ E.g., it was trained on supervised data (a treebank) that is small, imperfectly annotated, or has sentences in a different style from what you want to parse.
▪ Parse a corpus of unparsed sentences:
  ▪ e.g., the sentence “Today stocks were up” occurs 12 times in the corpus, and its best parse is
    [S [AdvP Today] [S [NP stocks] [VP [V were] [PRT up]]]]
▪ Reestimate:
  ▪ Collect counts: …; c(S → NP VP) += 12; c(S) += 2*12 (the parse contains two S nodes); …
  ▪ Divide: p(S → NP VP | S) = c(S → NP VP) / c(S)
  ▪ May be wise to smooth (a count-and-divide sketch follows below)
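A minimal Python sketch of this count-and-divide step, hard-coding the slide's example parse and its 12 copies; the rule-string format is just for illustration.

    from collections import Counter

    # Count-and-divide M step for the Viterbi case, using the slide's example:
    # the best parse of “Today stocks were up”, which occurs 12 times in the corpus.
    best_parse_rules = ["S -> AdvP S", "S -> NP VP", "AdvP -> Today",
                        "NP -> stocks", "VP -> V PRT", "V -> were", "PRT -> up"]
    copies = 12

    rule_count, lhs_count = Counter(), Counter()
    for rule in best_parse_rules:
        rule_count[rule] += copies                      # c(S -> NP VP) += 12, etc.
        lhs_count[rule.split(" -> ")[0]] += copies      # c(S) += 2*12 (two S nodes)

    # Divide (unsmoothed relative frequencies); in practice it may be wise to smooth.
    prob = {rule: count / lhs_count[rule.split(" -> ")[0]]
            for rule, count in rule_count.items()}
    print(prob["S -> NP VP"])                           # 12 / 24 = 0.5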
True EM for parsing

▪ Similar, but now we consider all parses of each sentence
▪ Parse our corpus of unparsed sentences:
  ▪ “Today stocks were up” occurs 12 times; its parses receive fractional counts in proportion to their probability:
    ▪ 10.8 copies of [S [AdvP Today] [S [NP stocks] [VP [V were] [PRT up]]]]
    ▪ 1.2 copies of [S [NP [NP Today] [NP stocks]] [VP [V were] [PRT up]]]
▪ Collect counts fractionally (see the arithmetic sketch below):
  ▪ …; c(S → NP VP) += 10.8; c(S) += 2*10.8; …
  ▪ …; c(S → NP VP) += 1.2; c(S) += 1*1.2; …
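A small sketch tying these numbers together: the two parses above contribute weighted counts of every rule they use, and the M step then divides as before. (10.8 + 1.2 = 12, i.e. posteriors of 0.9 and 0.1 per copy of the sentence.)

    from collections import Counter

    # Fractional counts for the 12 copies of “Today stocks were up”, split 10.8 / 1.2
    # between the two parses shown above (rule lists written out by hand).
    weighted_parses = [
        (10.8, ["S -> AdvP S", "S -> NP VP", "AdvP -> Today", "NP -> stocks",
                "VP -> V PRT", "V -> were", "PRT -> up"]),
        (1.2,  ["S -> NP VP", "NP -> NP NP", "NP -> Today", "NP -> stocks",
                "VP -> V PRT", "V -> were", "PRT -> up"]),
    ]

    rule_count, lhs_count = Counter(), Counter()
    for weight, rules in weighted_parses:
        for rule in rules:
            rule_count[rule] += weight                    # c(S -> NP VP) += 10.8, then += 1.2
            lhs_count[rule.split(" -> ")[0]] += weight    # c(S) += 2*10.8 + 1*1.2

    print(round(rule_count["S -> NP VP"], 3),             # 12.0
          round(lhs_count["S"], 3))                       # 22.8
    print(round(rule_count["S -> NP VP"] / lhs_count["S"], 3))   # reestimated p(S -> NP VP | S)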


Why do we want this info?

▪ Grammar reestimation by the EM method
  ▪ E step collects those expected counts
  ▪ M step sets the rule probabilities from them, e.g., p(S → NP VP | S) = c(S → NP VP) / c(S)
▪ Minimum Bayes Risk decoding
  ▪ Find a tree that maximizes expected reward, e.g., expected total # of correct constituents
  ▪ CKY-like dynamic programming algorithm (a simplified sketch follows below)
  ▪ The input specifies the probability of correctness for each possible constituent (e.g., a VP spanning words 1 to 5)
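Below is a simplified, unlabeled sketch of the idea: given posterior probabilities for spans (which a real system would obtain from inside-outside), a CKY-like dynamic program picks the binary bracketing whose expected number of correct constituents is largest. The span posteriors here are invented, constituent labels are ignored, and single words are treated as trivially correct spans to keep the sketch short.

    # Simplified Minimum-Bayes-Risk-style decoding sketch (unlabeled spans only).
    # post[(i, j)] = p(some constituent covers words i..j-1 | sentence); invented numbers.
    words = ["Today", "stocks", "were", "up"]
    n = len(words)
    post = {(0, 4): 1.0, (1, 4): 0.9, (2, 4): 0.95, (0, 2): 0.1,
            (0, 1): 1.0, (1, 2): 1.0, (2, 3): 1.0, (3, 4): 1.0}

    best = {}   # best[(i, j)] = (max expected # correct constituents inside span, best split)
    for length in range(1, n + 1):
        for i in range(0, n - length + 1):
            j = i + length
            if length == 1:
                best[(i, j)] = (post.get((i, j), 0.0), None)
                continue
            split, score = max(((k, best[(i, k)][0] + best[(k, j)][0])
                                for k in range(i + 1, j)), key=lambda x: x[1])
            best[(i, j)] = (post.get((i, j), 0.0) + score, split)

    def brackets(i, j):
        """Read off the chosen bracketing by following the stored split points."""
        k = best[(i, j)][1]
        return [(i, j)] if k is None else [(i, j)] + brackets(i, k) + brackets(k, j)

    print(best[(0, n)][0], brackets(0, n))   # expected reward and the chosen spans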
