P02 1012 PDF
P02 1012 PDF
P02 1012 PDF
Figure 2: A Brief Trace of the Pronominalization Algorithm for Anaphoric Pronouns from S TORY B OOK
Without a full-scale implementation, it is impossible texts. Previous approaches, based largely on theo-
to determine whether an algorithm performs imper- retical approaches such as Centering Theory, deal
fectly due to human error, a lack of available corpus exclusively with anaphoric pronouns and have com-
data for making decisions, or if it is a fault with the plex processing and definitional requirements.
algorithm itself. Given the full rhetorical structure available to an
Using the algorithm described in Figure 1, we implemented generation system, we devised a sim-
modified S TORY B OOK to substitute the types of pler method of determining appropriate pronom-
pronouns described in Section 3. We then created inalizations which was more accurate than exist-
the discourse plan and lexicon necessary to generate ing methods simulated by hand or performed semi-
the same three articles from the New York Times as automatically. This shows that approaches designed
(McCoy and Strube, 1999). The results for both the for use with anaphora resolution, which must build
newspaper texts and the Little Red Riding Hood nar- up discourse knowledge from scratch, may not be
rative described in (Callaway and Lester, in press) the most desirable method for use in NLG, where
are shown in Table 1. discourse knowledge already exists. The positive re-
With the same three texts from the New York sults from our simple counting algorithm, after only
Times, S TORY B OOK performed better than the pre- minor changes in parameters from a narrative do-
vious reported results of 85-90% described in (Mc- main to that of newspaper text, indicates that future
Coy and Strube, 1999; Henschel et al., 2000) on both high-quality prose generation systems are very near.
animate and all anaphora using a corpus matching
technique. Furthermore, this was obtained solely by 8 Acknowledgements
adjusting the recency parameter to 4 (it was 3 in our We would like to thank Michael Young and Renate
narrative domain), and without considering other en- Henschel for their helpful comments; Kathy McCoy
hancements such as gender/number constraints or very quickly provided the original 3 NYT articles
domain-specific alterations.1 upon request; the anonymous reviewers whose com-
ments greatly improved this paper. Support for this
7 Conclusions work was provided by ITC-irst and the IntelliMedia
Pronominalization is an important element in the au- Initiative of North Carolina State University.
tomatic creation of multi-paragraph and multi-page
1
It is important to note, however, that our counts of pronouns References
and antecedents do not match theirs. This may stem from a vari-
ety of factors, such as including single instances of nominal de- Douglas E. Appelt. 1985. Planning English referring
scriptions, whether dialogue pronouns were considered, and if expressions. Artificial Intelligence, 26:1–33.
borderline quantifiers and words like “everyone” were counted.
The generation community to-date has not settled on standard, Frederick Baldwin. 1995. CogNIAC: A Discourse Pro-
marked corpora for comparison purposes as has the rest of the cessing Engine. Ph.D. thesis, The University of Penn-
computational linguistics community. sylvania, Philadelphia, PA.
Charles B. Callaway and James C. Lester. 2001a. Eval- Kathleen F. McCoy and Michael Strube. 1999. Taking
uating the effects of natural language generation on time to structure discourse: Pronoun generation be-
reader satisfaction. In Proceedings of the Twenty- yond accessibility. In Proceedings of the Twenty-First
Third Annual Conference of the Cognitive Science So- Conference of the Cognitive Science Society, pages
ciety, pages 164–169, Edinburgh, UK. 378–383, Vancouver, CA, August.
Charles B. Callaway and James C. Lester. 2001b. Nar- Elena Not. 1996. A computational model for generating
rative prose generation. In Proceedings of the Seven- referring expressions in a multilingual application do-
teenth International Joint Conference on Artificial In- main. In COLING–1996: Proceedings of the 16th In-
telligence, pages 1241–1248, Seattle, WA. ternational Conference on Computational Linguistics,
Copenhagen, Denmark, August.
Charles B. Callaway and James C. Lester. 2003. Narra-
tive prose generation. Artificial Intelligence. In press. M. Poesio, H. Cheng, R. Henschel, J. Hitzeman, R. Kib-
ble, and R. Stevenson. 1999a. Specifying the parame-
Wim Claassen. 1992. Generating referring expressions ters of centering theory: A corpus-based evaluation us-
in a multimodal environment. In R. Dale, E. Hovy, ing text from application-oriented domains. In Book-
D. Rosner, and O. Stock, editors, Aspects of Auto- title, page Pages, Address, Month.
mated Natural Language Generation, pages 247–62.
Springer-Verlag, Berlin. M. Poesio, R. Henschel, J. Hitzeman, R. Kibble, S. Mon-
tague, and K. van Deemter. 1999b. Towards an anno-
Robert Dale. 1992. Generating Referring Expressions. tation scheme for noun phrase generation. In Bookti-
MIT Press. tle, page Pages, Address, Month.
Barbara Di Eugenio. 1998. Centering in Italian. In R. Quirk, S. Greenbaum, G. Leech, and J. Svartvik. 1985.
Marilyn A. Walker, Aravind K. Joshi, and Ellen F. A Comprehensive Grammar of the English Language.
Prince, editors, Centering in Discourse. Oxford Uni- Longman Publishers.
versity Press, Cambridge, MA.
Ehud Reiter and Robert Dale. 1997. Building ap-
Barbara J. Grosz and Candace L. Sidner. 1986. Atten- plied natural-language generation systems. Journal of
tion, intentions, and the structure of discourse. Com- Natural-Language Engineering, 3:57–87.
putational Linguistics, 12(3):175–204.
Candace L. Sidner. 1983. Focusing in the com-
Barbara J. Grosz, Aravind K. Joshi, and Scott Weinstein. prehension of definite anaphora. In M. Brady and
1995. Centering: A framework for modelling the lo- R. Berwick, editors, Computational Models of Dis-
cal coherence of discourse. Computational Linguis- course, pages 267–330. MIT Press, Cambridge, MA.
tics, 21(2).
Marilyn A. Walker. 1998. Centering, anaphora resolu-
Barbara J. Grosz. 1977. The representation and use of tion, and discourse structure. In Marilyn A. Walker,
focus in a system for understanding dialogs. In Pro- Aravind K. Joshi, and Ellen F. Prince, editors, Center-
ceedings of the Fifth International Joint Conference on ing in Discourse. Oxford University Press, Cambridge,
Artificial Intelligence, pages 67–76, Cambridge, MA. MA.
Peter Heeman and Graeme Hirst. 1986. Collaborating Bonnie Webber. 1979. A Formal Approach to Discourse
on referring expressions. Computational Linguistics, Anaphora. Garland, NY.
12(3):351–382.
Maria Wolters and Donna K. Byron. 2000. Prosody and
Renate Henschel, Hua Cheng, and Massimo Poesio. the resolution of pronominal anaphora. In COLING–
2000. Pronominalization revisited. In COLING– 2000: Proceedings of the 18th International Con-
2000: Proceedings of the 18th International Con- ference on Computational Linguistics, Saarbruecken,
ference on Computational Linguistics, Saarbruecken, Germany.
Germany.
C. Yeh and C. Mellish. 1997. An empirical study on
Jerry R. Hobbs. 1976. Pronoun resolution. Technical the generation of anaphora in Chinese. Computational
Report 76-1, Department of Computer Science, City Linguistics, 23(1):169–190.
College, CUNY, New York, NY.