Anaphora Resolution

Spring 2010, UCSC – Adrian Brasoveanu

[Slides based on various sources, collected over a couple of years and repeatedly modified – the work required to track them down & list them would take too much time at this point. Please email me ([email protected]) if you can identify particular sources.]
There are more slides added to Adrian’s presentation.
Reference Phenomena
Five common types of referring expression

Type                     Example
Indefinite noun phrase   I saw a Ford Escort today.
Definite noun phrase     I saw a Ford Escort today. The Escort was white.
Pronoun                  I saw a Ford Escort today. It was white.
Demonstratives           I like this better than that.
One-anaphora             I saw 6 Ford Escorts today. Now I want one.

Three types of referring expression that complicate reference resolution

Type                 Example
Inferrables          I almost bought a Ford Escort, but a door had a dent.
Discontinuous sets   John and Mary love their Escorts. They often drive them.
Generics             I saw 6 Ford Escorts today. They are the coolest cars.
Reference Resolution

• How do we develop successful algorithms for reference resolution? There are two necessary steps.
• First, filter the set of possible referents using certain hard-and-fast constraints.
• Second, rank the remaining possible referents by preference.

Constraints (for English)
• Number Agreement:
  - Distinguishes between singular and plural references.
  - *John has a new car. They are red.
• Gender Agreement:
  - Distinguishes male, female, and non-personal genders.
  - John has a new car. It is attractive. [It = the new car]
• Person and Case Agreement:
  - Distinguishes between the three forms of person.
  - *You and I have Escorts. They love them.
  - Distinguishes between subject position, object position, and genitive position.
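The agreement constraints above can be sketched as a hard filter over candidate antecedents. This is a minimal illustration, not part of the original slides: the `Mention` class, its feature labels ("sg", "masc", and so on), and the convention that a gender of `None` means "unspecified" are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Mention:
    """Hypothetical feature bundle for a referring expression."""
    text: str
    number: str             # "sg" or "pl"
    gender: Optional[str]   # "masc", "fem", "neut", or None if unspecified
    person: int             # 1, 2, or 3

def agrees(pronoun: Mention, candidate: Mention) -> bool:
    """True if the candidate passes number, gender, and person agreement."""
    gender_ok = (pronoun.gender is None or candidate.gender is None
                 or pronoun.gender == candidate.gender)
    return (pronoun.number == candidate.number
            and pronoun.person == candidate.person
            and gender_ok)

def filter_candidates(pronoun, candidates):
    """Keep only the candidates that agree with the pronoun."""
    return [c for c in candidates if agrees(pronoun, c)]

john = Mention("John", "sg", "masc", 3)
car = Mention("a new car", "sg", "neut", 3)
it = Mention("It", "sg", "neut", 3)
they = Mention("They", "pl", None, 3)

print([c.text for c in filter_candidates(it, [john, car])])    # ['a new car']
print([c.text for c in filter_candidates(they, [john, car])])  # []
```

The empty result for "They" mirrors the starred example: neither singular candidate survives the number check. Case is left out of the sketch because it constrains the form of the pronoun rather than the choice of antecedent.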
Constraints (for English)
• Syntactic Constraints:
  - Syntactic relationships between a referring expression and a possible antecedent noun phrase.
  - John bought himself a new car. [himself = John]
  - John bought him a new car. [him ≠ John]
• Selectional Restrictions:
  - A verb places restrictions on its arguments.
  - John parked his Acura in the garage. He had driven it around for hours. [it = Acura, it ≠ garage]
  - I picked up the book and sat in a chair. It broke.
Syntax can’t be all there is
• John hit Bill. He was severely injured.
• Margaret Thatcher admires Hillary Clinton, and George W. Bush absolutely worships her.
Preferences in Pronoun Interpretation

• Recency:
  - Entities introduced recently are more salient than those introduced earlier.
  - John has a Legend. Bill has an Escort. Mary likes to drive it. [it = Escort]
• Grammatical Role:
  - Entities mentioned in subject position are more salient than those in object position.
  - Bill went to the Acura dealership with John. He bought an Escort. [he = Bill]
Preferences in Pronoun Interpretation

• Repeated Mention:
  - Entities that have been focused on in the prior discourse are more salient.
  - John needed a car to get to his new job. He decided that he wanted something sporty. Bill went to the Acura dealership with him. He bought an Integra. [he = John]
Preferences in Pronoun Interpretation
• Parallelism (more generally – discourse structure):
  - There are also strong preferences that appear to be induced by parallelism effects.
  - Mary went with Sue to the cinema. Sally went with her to the mall. [her = Sue]
  - Jim surprised Paul and then Julie shocked him. [him = Paul]
Preferences in Pronoun Interpretation
• Verb Semantics:
  - Certain verbs appear to place a semantically-oriented emphasis on one of their argument positions.
  - John telephoned Bill. He had lost the book in the mall. [He = John]
  - John criticized Bill. He had lost the book in the mall. [He = Bill]
  - David praised Hans because he … [he = Hans]
  - David apologized to Hans because he … [he = David]
Preferences in Pronoun Interpretation
• World knowledge in general:
  - The city council denied the demonstrators a permit because they {feared|advocated} violence. [feared: they = the city council; advocated: they = the demonstrators]
The Plan
Introduce and compare 3 algorithms for anaphora resolution:
• Hobbs 1978
• Lappin and Leass 1994
• Centering Theory
Hobbs 1978
• Hobbs, Jerry R. 1978. “Resolving Pronoun References.” Lingua, Vol. 44, pp. 311-338.
• Also in Readings in Natural Language Processing, B. Grosz, K. Sparck-Jones, and B. Webber, editors, pp. 339-352, Morgan Kaufmann Publishers, Los Altos, California.
Hobbs 1978

• Hobbs (1978) proposes an algorithm that searches parse trees (i.e., basic syntactic trees) for antecedents of a pronoun:
  - starting at the NP node immediately dominating the pronoun
  - in a specified search order
  - looking for the first match of the correct gender and number
• Idea: discourse and other preferences will be approximated by search order.
Hobbs’s point
… the naïve approach is quite good. Computationally speaking, it will be a long time before a semantically based algorithm is sophisticated enough to perform as well, and these results set a very high standard for any other approach to aim for.

Yet there is every reason to pursue a semantically based approach. The naïve algorithm does not work. Anyone can think of examples where it fails. In these cases it not only fails; it gives no indication that it has failed and offers no help in finding the real antecedent. (p. 345)
Hobbs 1978

• This simple algorithm has become a baseline: more complex algorithms should do better than this.
• Hobbs distance: the i-th candidate NP considered by the algorithm is at a Hobbs distance of i.
Hobbs’s “Naïve” Algorithm
1. Begin at the NP node immediately dominating the pronoun.
2. Go up the tree to the first NP or S node encountered.
  - Call this node X, and the path to it p.
  - Search left-to-right, breadth-first, below X and to the left of p, proposing any NP node that has an NP or S node between it and X.
3. If X is the highest S node in the sentence:
  - Search previous trees, in order of recency, left-to-right, breadth-first, proposing any NP encountered.
4. Otherwise, from X, go up to the first NP or S node encountered.
  - Call this node X, and the path to it p.
5. If X is an NP, and p does not pass through an N-bar that X immediately dominates, propose X.
6. Search below X, to the left of p, left-to-right, breadth-first, proposing any NP encountered.
7. If X is an S, search below X to the right of p, left-to-right, breadth-first, but without going below any NP or S node encountered, proposing any NP encountered.
8. Go to 3.
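As a partial illustration, the search over previous trees (step 3 only) can be sketched as follows. This is not a full implementation of the algorithm: the nested-tuple tree encoding, the function names, and the toy head-word lexicons used for the gender check are all assumptions made for the example.

```python
from collections import deque

# Assumed encoding: a tree is (label, child, child, ...); a leaf is a word (str).

def np_nodes_breadth_first(tree):
    """Yield the NP subtrees of `tree` in left-to-right, breadth-first order."""
    queue = deque([tree])
    while queue:
        node = queue.popleft()
        if isinstance(node, str):
            continue  # words have no children and are never proposed
        label, *children = node
        if label == "NP":
            yield node
        queue.extend(children)

def words(tree):
    """Return the words spanned by a (sub)tree, in order."""
    if isinstance(tree, str):
        return [tree]
    return [w for child in tree[1:] for w in words(child)]

def propose_from_previous(previous_trees, matches):
    """Search earlier sentences, most recent first, and return the first NP
    accepted by `matches` (e.g. a gender and number check), or None."""
    for tree in reversed(previous_trees):
        for np in np_nodes_breadth_first(tree):
            if matches(np):
                return " ".join(words(np))
    return None

s1 = ("S", ("NP", "John"), ("VP", "has", ("NP", "a", "Legend")))
s2 = ("S", ("NP", "Bill"), ("VP", "has", ("NP", "an", "Escort")))

# Toy agreement checks based on the head word (hypothetical lexicons).
is_neuter = lambda np: words(np)[-1] in {"Legend", "Escort"}
is_masculine = lambda np: words(np)[-1] in {"John", "Bill"}

print(propose_from_previous([s1, s2], is_neuter))     # an Escort
print(propose_from_previous([s1, s2], is_masculine))  # Bill
```

Because the most recent tree is searched first and subjects are reached before objects within a tree, the search order alone approximates the recency and grammatical-role preferences.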
Another example:

The referent for “he”: we follow the same path, get to the same place, but reject NP4,
then reject NP5. Finally, accept NP6.
Lappin and Leass 1994
• Idea: maintain a discourse model in which there are representations for potential referents (much like the DRSs we built throughout the quarter).
• Lappin and Leass (1994) propose a discourse model in which potential referents have degrees of salience.
• They try to resolve (pronoun) references by finding highly salient referents compatible with pronoun agreement features.
• In effect, they incorporate:
  - recency
  - syntax-based preferences
  - agreement, but no (other) semantics
Lappin and Leass 1994
• First, we assign a number of salience factors and salience values to each referring expression.
• The salience values (weights) were arrived at by experimentation on a certain corpus.
Lappin and Leass 1994

Salience Factor            Salience Value
Sentence recency           100
Subject emphasis            80
Existential emphasis        70
Accusative emphasis         50
Indirect object emphasis    40
Non-adverbial emphasis      50
Head noun emphasis          80
Lappin and Leass 1994
• Non-adverbial emphasis penalizes “demarcated adverbial PPs” (e.g., “In his hand, …”) by giving points to all other types.
• Head noun emphasis likewise penalizes embedded referents.
• Other factors and values:
  - Grammatical role parallelism: 35
  - Cataphora: -175
Lappin and Leass 1994
• The algorithm employs a simple weighting scheme that integrates the effects of several preferences:
  - For each new entity, a representation is added to the discourse model and a salience value is computed for it.
  - The salience value is computed as the sum of the weights assigned by a set of salience factors.
  - The weight a salience factor assigns to a referent is the highest one the factor assigns to the referent’s referring expressions.
• Salience values are cut in half each time a new sentence is processed.
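The weighting scheme can be sketched in a few lines. The weights are the ones listed in the salience factor table; the `DiscourseModel` class, its method names, and the factor keys are hypothetical, and the sketch assumes at most one mention of each referent per sentence.

```python
# Weights from the salience factor table (factor keys are made-up identifiers).
WEIGHTS = {
    "sentence_recency": 100,
    "subject": 80,
    "existential": 70,
    "accusative": 50,
    "indirect_object": 40,
    "non_adverbial": 50,
    "head_noun": 80,
}

def mention_salience(factors):
    """Sum the weights of the salience factors that apply to one mention."""
    return sum(WEIGHTS[f] for f in factors)

class DiscourseModel:
    """Hypothetical container mapping each referent to its current salience."""

    def __init__(self):
        self.salience = {}

    def start_sentence(self):
        """Cut every stored salience value in half before a new sentence."""
        for referent in self.salience:
            self.salience[referent] /= 2

    def add_mention(self, referent, factors):
        """Add a mention's salience to its referent (one mention per sentence)."""
        self.salience[referent] = (
            self.salience.get(referent, 0) + mention_salience(factors)
        )

model = DiscourseModel()
model.start_sentence()
model.add_mention("John", ["sentence_recency", "subject", "non_adverbial", "head_noun"])
print(model.salience["John"])  # 310
model.start_sentence()
print(model.salience["John"])  # 155.0
```

Halving on every new sentence is what makes recency fall out of the arithmetic: a referent that is not mentioned again loses half its salience per sentence.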
Lappin and Leass 1994
The steps taken to resolve a pronoun are as follows:
• Collect potential referents (up to four sentences back);
• Remove potential referents that don’t agree with the pronoun semantically (number and gender);
• Remove potential referents that don’t satisfy the syntactic constraints;
• Compute salience values for the remaining potential referents;
• Select the referent with the highest salience value.
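The steps above can be sketched as one filtering-and-ranking function. Everything here is a simplification for illustration: candidates are plain dictionaries, salience values are supplied as inputs, and the syntactic filter is reduced to a hypothetical `blocked` set naming referents the pronoun cannot corefer with. The example discourse ("Bill introduced Mary. John bought him a new car.") and its salience values are made up.

```python
def resolve(pronoun, candidates, current_sentence, window=4):
    """Return the name of the most salient admissible candidate, or None."""
    # Step 1: collect referents mentioned at most `window` sentences back.
    pool = [c for c in candidates
            if 0 <= current_sentence - c["sentence"] <= window]
    # Step 2: drop candidates that fail number or gender agreement.
    pool = [c for c in pool
            if (c["number"], c["gender"]) == (pronoun["number"], pronoun["gender"])]
    # Step 3: drop candidates ruled out syntactically (toy stand-in).
    pool = [c for c in pool if c["name"] not in pronoun["blocked"]]
    # Steps 4-5: salience is taken as given; pick the highest value.
    if not pool:
        return None
    return max(pool, key=lambda c: c["salience"])["name"]

# Sentence 1: "Bill introduced Mary."  Sentence 2: "John bought him a new car."
candidates = [
    {"name": "Bill", "sentence": 1, "number": "sg", "gender": "masc", "salience": 155},
    {"name": "Mary", "sentence": 1, "number": "sg", "gender": "fem", "salience": 140},
    {"name": "John", "sentence": 2, "number": "sg", "gender": "masc", "salience": 310},
]
# "him" cannot corefer with the subject of its own clause, so John is blocked.
him = {"number": "sg", "gender": "masc", "blocked": {"John"}}

print(resolve(him, candidates, current_sentence=2))  # Bill
```

Without the blocked set, John would win on salience alone, which is why the hard filters run before the ranking step.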

Lappin and Leass 1994
• Salience factors apply per NP, i.e., per referring expression.
• However, we want the salience of a potential referent.
  - So, all NPs determined to have the same referent are examined.
  - For each salience factor, the referent receives the highest weight that factor assigns to any such referring expression; the referent’s salience is the sum of these weights.
• Salience factors are considered to have scope over a sentence,
  - so references to the same entity over multiple sentences add up,
  - while multiple references within the same sentence don’t.
Example (from Jurafsky and Martin)
• John saw a beautiful Acura Integra at the dealership.
• He showed it to Bob.
• He bought it.
Salience factors for "John saw a beautiful Acura Integra at the dealership."

Salience Factor            John   Integra   dealership
Sentence recency            100       100          100
Subject emphasis             80         -            -
Existential emphasis          -         -            -
Accusative emphasis           -        50            -
Indirect object emphasis      -         -            -
Non-adverbial emphasis       50        50           50
Head noun emphasis           80        80           80

Salience factors for "He showed it to Bob."

Salience Factor              He    It   Bob
Sentence recency            100   100   100
Subject emphasis             80     -     -
Existential emphasis          -     -     -
Accusative emphasis           -    50     -
Indirect object emphasis      -     -    40
Non-adverbial emphasis       50    50    50
Head noun emphasis           80    80    80

Salience factors for "He bought it."

Salience Factor              He    It
Sentence recency            100   100
Subject emphasis             80     -
Existential emphasis          -     -
Accusative emphasis           -    50
Indirect object emphasis      -     -
Non-adverbial emphasis       50    50
Head noun emphasis           80    80
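The arithmetic behind this example can be traced with a short script. It is a sketch under simplifying assumptions: the gender lexicon and helper names are hypothetical, agreement is reduced to a gender check, and each pronoun is resolved to the most salient agreeing referent before its own mention is scored.

```python
WEIGHTS = {"recency": 100, "subject": 80, "existential": 70, "accusative": 50,
           "indirect_object": 40, "non_adverbial": 50, "head_noun": 80}
# Factors that apply to every mention in this example.
BASE = ("recency", "non_adverbial", "head_noun")
# Hypothetical gender lexicon standing in for the agreement filter.
GENDER = {"John": "masc", "Bob": "masc", "Integra": "neut", "dealership": "neut"}

salience = {}

def add(referent, *roles):
    """Add one mention's salience (base factors plus role factors)."""
    score = sum(WEIGHTS[f] for f in BASE + roles)
    salience[referent] = salience.get(referent, 0) + score

def halve():
    """Degrade all salience values at the start of a new sentence."""
    for referent in salience:
        salience[referent] /= 2

def best(gender):
    """Most salient referent with the given gender."""
    return max((r for r in salience if GENDER[r] == gender), key=salience.get)

# Sentence 1: John saw a beautiful Acura Integra at the dealership.
add("John", "subject"); add("Integra", "accusative"); add("dealership")
after_s1 = dict(salience)   # John 310, Integra 280, dealership 230

# Sentence 2: He showed it to Bob.
halve()
he2, it2 = best("masc"), best("neut")   # John; Integra (140 beats 115)
add(he2, "subject"); add(it2, "accusative"); add("Bob", "indirect_object")
after_s2 = dict(salience)   # John 465, Integra 420, Bob 270, dealership 115

# Sentence 3: He bought it.
halve()
he3, it3 = best("masc"), best("neut")   # John (232.5 vs 135); Integra (210 vs 57.5)
print(he2, it2, he3, it3)   # John Integra John Integra
```

In the last sentence, Bob is more recent than John, but John's accumulated salience from two subject mentions still outweighs Bob's single indirect-object mention.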
Evaluation of Lappin and Leass 1994
• Weights were arrived at by experimentation on a corpus of computer training manuals.
• Combined with other filters, the algorithm achieves 86% accuracy (74% / 89% inter- / intra-sentential):
  - applied to unseen data of the same genre
• Hobbs’ algorithm applied to the same data is 82% accurate (87% / 81% inter- / intra-sentential).
