Squibs and Discussions

Memoization in Top-Down Parsing

Mark Johnson*
Brown University

1. Introduction

In a paper published in this journal, Norvig (1991) pointed out that memoization of a
top-down recognizer program produces a program that behaves similarly to a chart
parser. This is not surprising to anyone familiar with logic-programming approaches to
natural language processing (NLP). For example, the Earley deduction proof procedure
is essentially a memoizing version of the top-down selected literal deletion (SLD) proof
procedure employed by Prolog. Pereira and Warren (1983) showed that the steps of
the Earley deduction proof procedure proving the well-formedness of a string S from
the standard 'top-down' definite clause grammar (DCG) axiomatization of a context-
free grammar (CFG) G correspond directly to those of Earley's algorithm recognizing
S using G.
Yet as Norvig notes in passing, using his approach the resulting parsers in general
fail to terminate on left-recursive grammars, even with memoization. The goal of
this paper is to discover why this is the case and present a functional formalization
of memoized top-down parsing for which this is not so. Specifically, I show how
to formulate top-down parsers in a 'continuation-passing style,' which incrementally
enumerates the right string positions of a category, rather than returning a set of such
positions as a single value. This permits a type of memoization not described to my
knowledge in the context of functional programming before. This kind of memoization
is akin to that used in logic programming, and yields terminating parsers even in the
face of left recursion.
In this paper, algorithms are expressed in the Scheme programming language (Rees
and Clinger 1991). Scheme was chosen because it is a popular, widely known language
that many readers find easy to understand. Scheme's 'first-class' treatment of functions
simplifies the functional abstraction used in this paper, but the basic approach can be
implemented in more conventional languages as well. Admittedly elegance is a matter
of taste, but personally I find the functional specification of CFGs described here as
simple and elegant as the more widely known logical (DCG) formalization, and I hope
that the presentation of working code will encourage readers to experiment with the
ideas described here and in more substantial works such as Leermakers (1993). In
fact, my own observations suggest that with minor modifications (such as the use of
integers rather than lists to indicate string positions, and vectors indexed by string
positions rather than lists in the memoization routines) an extremely efficient chart
parser can be obtained from the code presented here.
Ideas related to the ones discussed here have been presented on numerous occasions.
Almost 20 years ago Shiel (1976) noticed the relationship between chart parsing
and top-down parsing. Leermakers (1993) presents a more abstract discussion of the
functional treatment of parsing, and avoids the left-recursion problem for memoized
functional parsers by using a 'recursive ascent' or PLR parsing strategy instead of a
top-down strategy. At a more abstract level than that of this paper, Shieber, Schabes,
and Pereira (1994) show that a variety of well-known parsing algorithms can be viewed
as computing the closure of a set of basic parsing operations on a representation of
the input string.

* Cognitive Science Department, Brown University, Box 1978, Providence, RI 02912

© 1995 Association for Computational Linguistics

Computational Linguistics Volume 21, Number 3

2. Formalizing Context-Free Grammars

It is fairly straightforward to implement a top-down parser in a functional programming
language. The key insight is that a nonterminal category $A$ in a grammar defines
a function $f_A$ that maps a string position $l$ in the input string $\gamma$ to a set of string
positions $f_A(l)$ such that $r \in f_A(l)$ iff $A$ can derive the substring of $\gamma$ spanning string
positions $l$ to $r$ (see e.g., Leermakers [1993] for discussion).
For example, suppose V, NP, and S are already bound to $f_V$, $f_{NP}$ and $f_S$, and the
grammar contains the following productions with VP on the left-hand side.

(1) VP → V NP    VP → V S

Then the following Scheme definition binds VP to $f_{VP}$.

(2) (define (VP p)
      (union (reduce union '() (map NP (V p)))
             (reduce union '() (map S (V p)))))

If sets are represented by unordered lists, union can be given the following definition.
The function reduce is defined such that an expression of the form (reduce
f e '(x1 ... xn)) evaluates to (f (... (f e x1) ...) xn).

(3) (define (reduce fn init args)
      (if (null? args)
          init
          (reduce fn (fn init (car args))
                  (cdr args))))
(4) (define (union set1 set2)
      (if (null? set1)
          set2
          (if (member (car set1) set2)
              (union (cdr set1) set2)
              (cons (car set1)
                    (union (cdr set1) set2)))))
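
The behaviour of reduce and union can be checked on a small example. The sketch below restates both definitions so that it runs on its own; the name merged and the sample sets are illustrative assumptions, not part of the original fragment.

```scheme
;; Left fold: (reduce f e '(x1 ... xn)) = (f (... (f e x1) ...) xn)
(define (reduce fn init args)
  (if (null? args)
      init
      (reduce fn (fn init (car args)) (cdr args))))

;; Set union on unordered lists; elements are compared with equal? (via member)
(define (union set1 set2)
  (if (null? set1)
      set2
      (if (member (car set1) set2)
          (union (cdr set1) set2)
          (cons (car set1) (union (cdr set1) set2)))))

;; Merge three small sets into one; duplicates are dropped along the way.
(define merged (reduce union '() '((a b) (b c) (c d))))
```

Folding union over a list of sets in this way is exactly the operation that flattens the per-position results returned by map in (2).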

When evaluated using Scheme's applicative-order reduction rule, such a system be-
haves as a depth-first, top-down recognizer in which nondeterminism is simulated by
backtracking. For example, in (2) the sequence V NP is first investigated as a potential
analysis of VP, and then the sequence V S is investigated.
Rather than defining the functions f by hand as in (2), higher-order functions can
be introduced to automate this task. It is convenient to use suffixes of the input string
to represent the string positions of the input string (as in DCGs).
The expression (terminal x) evaluates to a function that maps a string position $l$ to
the singleton set $\{r\}$ iff the terminal x spans from $l$ to $r$, and the empty set otherwise.


(5) (define (terminal X)
      (lambda (p)
        (if (and (pair? p)
                 (eq? (car p) X))
            (list (cdr p))
            '())))
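
A short usage sketch may make the suffix representation of string positions concrete. It restates terminal and applies a recognizer for the word likes to three positions; the names likes, hit, miss, and at-end are hypothetical and chosen only for this illustration.

```scheme
(define (terminal X)
  (lambda (p)
    (if (and (pair? p) (eq? (car p) X))
        (list (cdr p))       ; singleton set holding the right string position
        '())))               ; empty set: the terminal does not start here

(define likes (terminal 'likes))

(define hit    (likes '(likes Sandy)))   ; one right position, the suffix (Sandy)
(define miss   (likes '(knows Sandy)))   ; wrong word, so no right position
(define at-end (likes '()))              ; end of input, so no right position
```

Because a position is simply the list of words still to be read, the right position of a terminal is the cdr of its left position.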

The expression (seq $f_A$ $f_B$) evaluates to a function that maps a string position $l$ to the
set of string positions $\{r_i\}$ such that there exists an $m \in f_A(l)$ and $r_i \in f_B(m)$. Informally,
the resulting function recognizes substrings that are the concatenation of a substring
recognized by $f_A$ and a substring recognized by $f_B$.

(6) (define (seq A B)
      (lambda (p)
        (reduce union '() (map B (A p)))))

The expression (alt $f_A$ $f_B$) evaluates to a function that maps a string position $l$ to
$f_A(l) \cup f_B(l)$. Informally, the resulting function recognizes the union of the substrings
recognized by $f_A$ and $f_B$.

(7) (define (alt A B)
      (lambda (p)
        (union (A p) (B p))))

While terminal, seq, and alt suffice to define (epsilon-free) context-free grammars,
we can easily define other useful higher-order functions. For example, epsilon recognizes
the empty string (i.e., it maps every string position $l$ into the singleton set $\{l\}$),
(opt $f_A$) recognizes an optional constituent, and (k* $f_A$) recognizes zero or more occurrences
of the substrings recognized by $f_A$.

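(8) (define epsilon list)

(9) (define (opt A) (alt epsilon A))
(10) (define (k* A)
       (alt epsilon
            ; the lambda delays the recursive call to k*, which would otherwise
            ; recur without end under applicative-order evaluation
            (seq A (lambda args (apply (k* A) args)))))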

These higher-order functions can be used to provide simpler definitions, such as (2a)
or (2b), for the function VP defined in (2) above.

(2a) (define VP (alt (seq V NP) (seq V S)))

(2b) (define VP (seq V (alt NP S)))
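
As a sanity check, the two formulations can be compared on a toy input. The sketch below is self-contained and assumes a deliberately non-recursive stand-in for S (an NP followed by the word sleeps), so that plain define suffices; the names VP-a, VP-b, VP-opt, and input are hypothetical.

```scheme
(define (reduce fn init args)
  (if (null? args) init (reduce fn (fn init (car args)) (cdr args))))
(define (union set1 set2)
  (if (null? set1)
      set2
      (if (member (car set1) set2)
          (union (cdr set1) set2)
          (cons (car set1) (union (cdr set1) set2)))))
(define (terminal X)
  (lambda (p) (if (and (pair? p) (eq? (car p) X)) (list (cdr p)) '())))
(define (seq A B) (lambda (p) (reduce union '() (map B (A p)))))
(define (alt A B) (lambda (p) (union (A p) (B p))))
(define epsilon list)
(define (opt A) (alt epsilon A))

;; Toy categories (assumed for this example only).
(define V  (alt (terminal 'likes) (terminal 'knows)))
(define NP (alt (terminal 'Kim) (terminal 'Sandy)))
(define S  (seq NP (terminal 'sleeps)))

(define VP-a   (alt (seq V NP) (seq V S)))                  ; as in (2a)
(define VP-b   (seq V (alt NP S)))                          ; as in (2b)
(define VP-opt (seq V (seq NP (opt (terminal 'sleeps)))))   ; same language via opt

(define input '(knows Kim sleeps))
```

All three recognizers map input to the same two right positions, the suffix (sleeps) and the empty suffix, which reflects the two ways of reading a VP off the front of the string.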

This method of defining the functions corresponding to categories is quite appealing.

Unfortunately, Scheme is deficient in that it does not allow mutually recursive func-
tional definitions of the kind in (2a) or (2b). For example, suppose S is defined as in
(11) and VP is defined as in (2a).

(11) (define S (seq NP VP))


Further, suppose (11) precedes (2a) textually in the program. Then the variable VP in
(11) will be incorrectly interpreted as unbound. Changing the order of the definitions
will not help, as then the variable S will be unbound. 1 A work-around is to add a vacuous
lambda abstraction and application as in (11a), in effect delaying the evaluation
of the function definition.

(11a) (define S (lambda args (apply (seq NP VP) args)))

With a macro definition such as (12) (named to remind us of this deficiency in the
current Scheme specification and perhaps encourage the language designers to do
better in the future), the definition of functions such as (11a) can be written as (11b).

(12) (define-syntax vacuous
       (syntax-rules ()
         ((vacuous fn)
          (lambda args (apply fn args)))))
(11b) (define S (vacuous (seq NP VP)))

Figure 1 contains a fragment defined in this way. After these definitions have been
loaded, an expression such as the one in (13) can be evaluated. It returns a list of the
input string's suffixes that correspond to the right string position of an S.

(13) > (s '(Kim knows every student likes Sandy))

((likes sandy) ())

In example (13), the list resulting from the evaluation contains two suffixes, corresponding
to the fact that both Kim knows every student and Kim knows every student likes
Sandy can be analysed as Ss.
Finally, the recognize predicate can be defined as follows. The expression (recognize
words) is true iff words is a list of words that can be analysed as an S, i.e., if the
empty string is one of the right string positions of an S whose left string position is the
whole string to be recognized.

(14) (define (recognize words)
       (member '() (S words)))
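
For readers who want to run the recognizer, the pieces introduced so far can be assembled into a single program. The following is a minimal sketch that restates (3) to (7), (12), (14), and the fragment of Figure 1; it assumes a Scheme with syntax-rules and case-sensitive symbols, and the name suffixes is introduced only for this illustration.

```scheme
(define (reduce fn init args)
  (if (null? args) init (reduce fn (fn init (car args)) (cdr args))))
(define (union set1 set2)
  (if (null? set1)
      set2
      (if (member (car set1) set2)
          (union (cdr set1) set2)
          (cons (car set1) (union (cdr set1) set2)))))
(define (terminal X)
  (lambda (p) (if (and (pair? p) (eq? (car p) X)) (list (cdr p)) '())))
(define (seq A B) (lambda (p) (reduce union '() (map B (A p)))))
(define (alt A B) (lambda (p) (union (A p) (B p))))

;; Delay evaluation of a mutually recursive definition, as in (12).
(define-syntax vacuous
  (syntax-rules ()
    ((vacuous fn) (lambda args (apply fn args)))))

(define S   (vacuous (seq NP VP)))
(define VP  (vacuous (alt (seq V NP) (seq V S))))
(define NP  (vacuous (alt PN (seq Det N))))
(define PN  (alt (terminal 'Kim) (terminal 'Sandy)))
(define V   (alt (terminal 'likes) (terminal 'knows)))
(define Det (alt (terminal 'every) (terminal 'no)))
(define N   (alt (terminal 'student) (terminal 'professor)))

(define (recognize words)
  (member '() (S words)))

;; Right string positions of an S starting at the beginning of the sentence.
(define suffixes (S '(Kim knows every student likes Sandy)))
```

Note that recognize returns the tail of the result list found by member rather than #t, so its value should be treated as a generalized boolean.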

3. Memoization and Left Recursion

As noted above, the Scheme functions defined in this way behave as top-down, backtracking
recognizers. It is well known that such parsing methods suffer from two
major problems.

1 This problem can arise even if syntactic constructions specifically designed to express mutual recursion
are used, such as letrec. Although these variables are closed over, their values are not applied when
the defining expressions are evaluated, so such definitions should not be problematic for an
applicative-order evaluator. Apparently Scheme requires that mutually recursive functional expressions
syntactically contain a lambda expression. Note that this is not a question of reduction strategy (e.g.,
normal-order versus applicative-order), but an issue about the syntactic scope of variables.


(define S (vacuous (seq NP VP)))                 ; S → NP VP
(define VP (vacuous (alt (seq V NP)              ; VP → V NP
                         (seq V S))))            ;    | V S
(define NP (vacuous (alt PN                      ; NP → PN
                         (seq Det N))))          ;    | Det N
(define PN (alt (terminal 'Kim) (terminal 'Sandy)))
(define V (alt (terminal 'likes) (terminal 'knows)))
(define Det (alt (terminal 'every) (terminal 'no)))
(define N (alt (terminal 'student) (terminal 'professor)))
Figure 1
A CFG fragment defined using the higher-order constructors.

First, a top-down parser using a left-recursive grammar typically fails to terminate
on some inputs. This is true for recognizers defined in the manner just described; left-recursive
grammars yield programs that contain ill-founded recursive definitions. 2
Second, backtracking parsers typically involve a significant amount of redundant
computation, and parsing time is exponential in the length of the input string in the
worst case. Again, this is also true for the recognizers just described.
Memoization is a standard technique for avoiding redundant computation, and as
Norvig (1991) noted, it can be applied to top-down recognizers to convert exponential-time
recognizers into polynomial-time recognizers.
A general way of doing this is by defining a higher-order procedure memo that takes
a function as an argument and returns a memoized version of it. 3 This procedure is
essentially the same as the memoize predicate that is extensively discussed in Abelson
and Sussman (1985).

(15) (define (memo fn)
       (let ((alist '()))
         (lambda args
           (let ((entry (assoc args alist)))
             (if entry
                 (cdr entry)
                 (let ((result (apply fn args)))
                   (set! alist (cons (cons args result)
                                     alist))
                   result))))))

To memoize the recognizer, the original definitions of the functions should be replaced
with their memoized counterparts; e.g., (11b) should be replaced with (11c). Clearly
these definitions could be further simplified with suitable macro definitions or other
'syntactic sugar.'

2 Specifically, if A is a Scheme variable bound to the function corresponding to a left-recursive category,
then for any string position p the expression (A p) reduces to another expression containing (A p). Thus
the (applicative-order) reduction of such expressions does not terminate.
3 For simplicity, the memo procedure presented in (15) stores the memo table as an association list, in
general resulting in a less than optimal implementation. As Norvig notes, more specialized data
structures, such as hash tables, can improve performance. In the parsing context here, optimal
performance would probably be obtained by encoding string positions with integers, allowing memo
table lookup to be a single array reference.


(11c) (define S (memo (vacuous (seq NP VP))))

As an aside, it is interesting to note that memoization can be applied selectively in this
approach. For example, because of the overhead of table lookup in complex feature-based
grammars, it might be more efficient not to memoize all categories, but rather
restrict memoization to particular categories such as NP and S.
Now we turn to the problem of left recursion. In a logic programming setting,
memoization (specifically, the use of Earley deduction) avoids the nontermination
problems associated with left recursion, even when used with the DCG axiomatization
of a left-recursive grammar. But as Norvig mentions in passing, with parsers
defined in the manner just described, the memoized versions of programs derived
from left-recursive grammars fail to terminate.
It is easy to see why. A memo-ed procedure constructs an entry in a memo table
only after the result of applying the unmemoized function to its arguments has been
computed. Thus in cases of left recursion, memoization does nothing to prevent the
ill-founded recursion that leads to nontermination.
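
The point can be observed directly with a small experiment. The sketch below restates memo from (15) and applies it to a procedure that immediately calls itself on the same argument, as a left-recursive category would. The counter entries and the cut-off fuel are assumptions added purely so that the demonstration halts; a real left-recursive recognizer has no such cut-off.

```scheme
(define (memo fn)
  (let ((alist '()))
    (lambda args
      (let ((entry (assoc args alist)))
        (if entry
            (cdr entry)
            (let ((result (apply fn args)))
              (set! alist (cons (cons args result) alist))
              result))))))

(define entries 0)   ; how often the unmemoized body has been entered
(define fuel 5)      ; hypothetical cut-off that stands in for "forever"

(define A
  (memo (lambda (p)
          (set! entries (+ entries 1))
          (if (> entries fuel)
              '()          ; give up instead of descending without end
              (A p)))))    ; left recursion: same position again

(define first-result (A '(x)))
(define entries-after-first-call entries)
(define second-result (A '(x)))   ; now answered from the table
```

During the first call the body is entered six times for the same argument, because no table entry exists until the outermost application returns; only the second top-level call benefits from the table.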
In fact it is not clear how memoization could help in these cases, given that we
require that memo behaves semantically as the identity function; i.e., that (memo f) and
f are the same function. Of course, we could try to weaken this identity requirement
(e.g., by only requiring that (f x) and ((memo f) x) are identical when the reduction
of the former terminates), but it is not clear how to do this systematically.
Procedurally speaking, it seems as if memoization is applying 'too late' in the
left-recursive cases; reasoning by analogy with Earley deduction, we need to construct
an entry in the memo table when such a function is called; not when the result of
its evaluation is known. Of course, in the left recursive cases this seems to lead to
an inconsistency, since these are cases where the value of an expression is required to
compute that very value.
Readers familiar with Abelson and Sussman (1985) will know that in many cases
it is possible to circumvent such apparent circularity by using asynchronous 'lazy
streams' in place of the list representations (of string positions) used above. The
continuation-passing style encoding of CFGs discussed in the next section can be
seen as a more functionally oriented instantiation of this kind of approach.

4. Formalizing Relations in Continuation-Passing Style

The apparent circularity in the definition of the functions corresponding to left-recursive
categories suggests that it may be worthwhile reformulating the recognition problem
in such a way that the string position results are produced incrementally, rather than
in one fell swoop, as in the formalization just described. The key insight is that each
nonterminal category $A$ in a grammar defines a relation $r_A$ such that $r_A(l, r)$ iff $A$ can
derive the substring of the input string spanning string positions $l$ to $r$. 4 Informally
speaking, the $r$ can be enumerated one at a time, so the fact that the calculation of
$r_A(l, r)$ requires the result $r_A(l, r')$ need not lead to a vicious circularity.
One way to implement this in a functional programming language is to use a
'Continuation-Passing Style' (CPS) of programming. 5 It turns out that a memoized

4 The relation $r_A$ and the function $f_A$ mentioned above satisfy $\forall r\, \forall l\;\; r_A(l, r) \leftrightarrow r \in f_A(l)$.
5 Several readers of this paper, including a reviewer, suggested that this can be formulated more
succinctly using Scheme's call/cc continuation-constructing primitive. After this paper was accepted
for publication, Jeff Sisskind devised an implementation based on call/cc which does not require
continuations to be explicitly passed as arguments to functions.


top-down parser written in continuation-passing style will in fact terminate, even
in the face of left recursion. Additionally, the treatment of memoization in a CPS is
instructive because it shows the types of table lookup operations needed in chart
parsing.
Informally, in a CPS program an additional argument, call it c, is added to all
functions and procedures. When these functions and procedures are called c is always
bound to a procedure (called the continuation); the idea is that a result value v is
'returned' by evaluating (c v). For example, the standard definition of the function
square in (16) would be rewritten in CPS as in (17). (18) shows how this definition
could be used to compute and display (using the Scheme builtin display) the square
of the number 3.

(16) (define (square x) (* x x))
(17) (define (square cont x) (cont (* x x)))
(18) > (square display 3)
     9

Thus whereas result values in a non-CPS program flow 'upwards' in the procedure
call tree, in a CPS program result values flow 'downwards' in the procedure call tree. 6,7
The CPS style of programming can be used to formalize relations in a pure functional
language as procedures that can be thought of as 'returning' multiply valued results
any number of times.
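
To illustrate what 'returning' any number of times means in practice, consider a CPS procedure for the relation that holds between a string position and every position at or after it. This is a minimal sketch; the procedure later-positions and the accumulator seen are hypothetical names introduced for this illustration only.

```scheme
;; 'Returns' pos itself and then every later position, one call of cont each.
(define (later-positions cont pos)
  (cont pos)
  (if (pair? pos)
      (later-positions cont (cdr pos))
      #f))

;; The continuation records each value it receives, most recent first.
(define seen '())
(later-positions (lambda (r) (set! seen (cons r seen)))
                 '(every student))
```

The continuation is invoked three times here, once per position, whereas the list-based encoding of Section 2 would have had to build and return the whole set of positions as a single value.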
These features of CPS can be used to encode CFGs as follows. Each category $A$ is
associated with a function $g_A$ that represents the relation $r_A$, i.e., ($g_A$ c l) reduces (in an
applicative-order reduction) in such a fashion that at some stage in the reduction the
expression (c r) is reduced iff $A$ can derive the substring spanning string positions $l$
to $r$ of the input string. (The value of ($g_A$ c l) is immaterial and therefore unspecified,
but see footnote 8 below). That is, if ($g_A$ c l) is evaluated with l bound to the left string
position of category $A$, then (c r) will be evaluated zero or more times with r bound
to each of $A$'s right string positions $r$ corresponding to $l$.
For example, a CPS function recognizing the terminal item 'will' (arguably a future
auxiliary in a class of its own) could be written as in (19).

(19) (define (future-aux continuation pos)
       (if (and (pair? pos) (eq? (car pos) 'will))
           (continuation (cdr pos))))

For a more complicated example, consider the two rules defining VP in the fragment
above, repeated here as (20). These could be formalized as the CPS function defined
in (21).

(20) VP → V NP    VP → V S

(21) (define (VP continuation pos)
       (begin
         (V (lambda (pos1) (NP continuation pos1)) pos)
         (V (lambda (pos1) (S continuation pos1)) pos)))

6 Tail recursion optimization prevents the procedure call stack from growing unboundedly.
7 This CPS formalization of CFGs is closely related to the 'downward success passing' method of
translating Prolog into Lisp discussed by Kahn and Carlsson (1984).


In this example V, NP, and S are assumed to have CPS definitions. Informally, the
expression (lambda (pos1) (NP continuation pos1)) is a continuation that specifies
what to do if a V is found, viz., pass the V's right string position pos1 to the NP
recognizer as its left-hand string position, and instruct the NP recognizer in turn to
pass its right string positions to continuation.
The recognition process begins by passing the function corresponding to the root
category the string to be recognized, and a continuation (to be evaluated after suc-
cessful recognition) that records the successful analysis. 8

(22) (define (recognize words)
       (let ((recognized #f))
         (S (lambda (pos)
              (if (null? pos) (set! recognized #t)))
            words)
         recognized))

Thus rather than constructing a set of all the right string positions (as in the previous
encoding), this encoding exploits the ability of the CPS approach to 'return' a value
zero, one or more times (corresponding to the number of right string positions). And
although it is not demonstrated in this paper, the ability of a CPS procedure to 'return'
more than one value at a time can be used to pass other information besides right string
position, such as additional syntactic features or semantic values.
Again, higher-order functions can be used to simplify the definitions of the CPS
functions corresponding to categories. The CPS versions of the terminal, seq, and alt
functions are given as (23), (25), and (24) respectively.

(23) (define (terminal word)
       (lambda (continuation pos)
         (if (and (pair? pos) (eq? (car pos) word))
             (continuation (cdr pos)))))

8 Thus this formalization makes use of mutability to return final results, and so cannot be expressed in a
purely functional language. However, it is possible to construct a similar formalization in the purely
functional subset of Scheme by passing around an additional 'result' argument (here the last
argument). The examples above would be rewritten as the following under this approach.

(19') (define (future-aux continuation pos result)
        (if (and (pair? pos) (eq? (car pos) 'will))
            (continuation (cdr pos) result)
            result))   ; no match: hand the accumulated result back unchanged
(21') (define (VP continuation pos result)
        (V (lambda (pos1 result1)
             (NP continuation pos1 result1))
           pos
           (V (lambda (pos1 result1)
                (S continuation pos1 result1))
              pos
              result)))
(22') (define (recognize words)
        (S (lambda (pos result)
             (if (null? pos) #t result))
           words
           #f))        ; initial result: nothing has been recognized yet


(24) (define (alt alt1 alt2)
       (lambda (continuation pos)
         (begin (alt1 continuation pos)
                (alt2 continuation pos))))

(25) (define (seq seq1 seq2)
       (lambda (cont pos)
         (seq1 (lambda (pos1) (seq2 cont pos1))
               pos)))

If these three function definitions replace the earlier definitions given in (5), (6), and
(7), the fragment in Figure 1 defines a CPS recognizer. Note that just as in the first CFG
encoding, the resulting program behaves as a top-down recognizer. Thus in general
these programs fail to terminate when faced with a left-recursive grammar for essentially
the same reason: the procedures that correspond to left-recursive categories
involve ill-founded recursion.
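
The CPS recognizer can likewise be assembled into one runnable program. The sketch below restates (22) to (25), the vacuous macro of (12), and the (non-left-recursive) fragment of Figure 1. As an assumption made for portability, each one-armed if is given an explicit #f alternative; the names yes and no are hypothetical.

```scheme
(define (terminal word)
  (lambda (continuation pos)
    (if (and (pair? pos) (eq? (car pos) word))
        (continuation (cdr pos))
        #f)))                        ; no match: nothing is 'returned'

(define (alt alt1 alt2)
  (lambda (continuation pos)
    (alt1 continuation pos)          ; try the first alternative,
    (alt2 continuation pos)))        ; then the second, with the same continuation

(define (seq seq1 seq2)
  (lambda (cont pos)
    (seq1 (lambda (pos1) (seq2 cont pos1)) pos)))

(define-syntax vacuous
  (syntax-rules ()
    ((vacuous fn) (lambda args (apply fn args)))))

(define S   (vacuous (seq NP VP)))
(define VP  (vacuous (alt (seq V NP) (seq V S))))
(define NP  (vacuous (alt PN (seq Det N))))
(define PN  (alt (terminal 'Kim) (terminal 'Sandy)))
(define V   (alt (terminal 'likes) (terminal 'knows)))
(define Det (alt (terminal 'every) (terminal 'no)))
(define N   (alt (terminal 'student) (terminal 'professor)))

(define (recognize words)
  (let ((recognized #f))
    (S (lambda (pos) (if (null? pos) (set! recognized #t) #f))
       words)
    recognized))

(define yes (recognize '(Kim knows every student likes Sandy)))
(define no  (recognize '(Kim knows every)))
```

Only the three constructors differ from the list-based program; the grammar definitions themselves are unchanged, which is the sense in which the fragment of Figure 1 is independent of the encoding.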

5. Memoization in Continuation-Passing Style

The memo procedure defined in (15) is not appropriate for CPS programs because it as-
sociates the arguments of the functional expression with the value that the expression
reduces to, but in a CPS program the 'results' produced by an expression are the val-
ues it passes on to the continuation, rather than the value that the expression reduces
to. That is, a memoization procedure for a CPS procedure should associate argument
values with the set of values that the unmemoized procedure passes to its continua-
tion. Because an unmemoized CPS procedure can produce multiple result values, its
memoized version must store not only these results, but also the continuations passed
to it by its callers, which must receive any additional results produced by the original
unmemoized procedure.
The CPS version of memo defined in (26) achieves this by associating with each set of
argument values a table entry that has two components: a list of caller continuations
and a list of result values. The caller continuation entries are constructed when the
memoized procedure is called, and the result values are entered and propagated back
to callers each time the unmemoized procedure 'returns' a new value. 9

9 The dolist form used in (26) behaves as the dolist form in Common Lisp. It can be defined in terms
of Scheme primitives as follows:

(define-syntax dolist
  (syntax-rules ()
    ((dolist (var list) . body)
     (do ((to-do list (cdr to-do)))   ; step to the rest of the list on each pass
         ((null? to-do))
       (let ((var (car to-do)))
         . body)))))


(26) (define (memo cps-fn)
       (let ((table (make-table)))
         (lambda (continuation . args)
           (let ((entry (table-ref table args)))
             (cond ((null? (entry-continuations entry))
                    ; first time memoized procedure has been called with args
                    (push-continuation! entry continuation)
                    (apply cps-fn
                           (lambda result
                             (when (not (result-subsumed? entry result))
                               (push-result! entry result)
                               (dolist (cont (entry-continuations entry))
                                 (apply cont result))))
                           args))
                   (else
                    ; memoized procedure has been called with args before
                    (push-continuation! entry continuation)
                    (dolist (result (entry-results entry))
                      (apply continuation result))))))))

Specifically, when the memoized procedure is called, continuation is bound to the
continuation passed by the caller that should receive 'return' values, and args is bound
to a list of arguments that index the entry in the memo table and are passed to the
unmemoized procedure cps-fn if evaluation is needed. The memo table table initially
associates every set of arguments with empty caller continuation and empty result
value sets. The local variable entry is bound to the table entry that corresponds to
args; the set of caller continuations stored in entry is null iff the memoized function
has not been called with this particular set of arguments before.
The cond clause determines if the memoized function has been called with args
before by checking if the continuations component of the table entry is nonempty.
In either case, the caller continuation needs to be stored in the continuations compo-
nent of the table entry, so that it can receive any additional results produced by the
unmemoized procedure.
If the memoized procedure has not been called with args before, it is necessary
to call the unmemoized procedure cps-fn to produce the result values for args. The
continuation passed to cps-fn checks to see if each result of this evaluation is subsumed
by some other result already produced for this entry; if it is not, it is pushed
onto the results component of this entry, and finally passed to each caller continuation
associated with this entry.
If the memoized procedure has been called with args before, the results associ-
ated with this table entry can be reused. After storing the caller continuation in the
table entry, each result already accumulated in the table entry is passed to the caller
continuation.
Efficient implementations of the table and entry manipulation procedures would
be specialized for the particular types of arguments and results used by the unmemoized
procedures. Here we give a simple and general, but less than optimal, implementation
using association lists. 10

10 This formalization makes use of 'impure' features of Scheme, specifically destructive assignment to add
an element to the table list (which is why this list contains the dummy element *head*). Arguably,


A table is a headed association list (27), which is extended as needed by table-ref
(28). In this fragment there are no partially specified arguments or results (such as
would be involved if the fragment used feature structures), so the subsumption relation
is in fact equality.

(27) (define (make-table) (list '*head*))

(28) (define (table-ref table key)
       (let ((pair (assoc key (cdr table))))
         (if pair                          ; an entry already exists
             (cdr pair)                    ; return it
             (let ((new-entry (make-entry)))
               (set-cdr! table (cons (cons key new-entry)
                                     (cdr table)))
               new-entry))))

Entries are manipulated by the following procedures. Again, because this fragment
does not produce partially specified results, the result subsumption check can be per-
formed by the Scheme function member.

(29) (define (make-entry) (cons '() '()))

(30) (define entry-continuations car)
(31) (define entry-results cdr)
(32) (define (push-continuation! entry continuation)
(set-car! entry (cons continuation (car entry))))
(33) (define (push-result! entry result)
(set-cdr! entry (cons result (cdr entry))))
(34) (define (result-subsumed? entry result)
(member result (entry-results entry)))
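
The table and entry procedures can be exercised on their own. The sketch below restates (27) to (34) without their example numbers and walks through one create, update, and lookup cycle. It assumes a Scheme with mutable pairs (set-car! and set-cdr!); the names demo-table, key, e1, and e2 are hypothetical.

```scheme
(define (make-table) (list '*head*))
(define (make-entry) (cons '() '()))

(define (table-ref table key)
  (let ((pair (assoc key (cdr table))))
    (if pair
        (cdr pair)
        (let ((new-entry (make-entry)))
          (set-cdr! table (cons (cons key new-entry) (cdr table)))
          new-entry))))

(define entry-continuations car)
(define entry-results cdr)
(define (push-continuation! entry continuation)
  (set-car! entry (cons continuation (car entry))))
(define (push-result! entry result)
  (set-cdr! entry (cons result (cdr entry))))
(define (result-subsumed? entry result)
  (member result (entry-results entry)))

(define demo-table (make-table))
(define key '((Kim knows Sandy)))        ; argument list holding one string position
(define e1 (table-ref demo-table key))   ; first lookup creates an empty entry
(push-result! e1 '((knows Sandy)))       ; result list holding one right position
(define e2 (table-ref demo-table key))   ; second lookup finds the same entry
```

Because table-ref returns the stored entry object itself rather than a copy, a result pushed through e1 is visible through e2; the memoizer in (26) relies on this sharing.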

As claimed above, the memoized version of the CPS top-down parser does terminate,
even if the grammar is left-recursive. Informally, memoized CPS top-down parsers
terminate in the face of left-recursion because they ensure that no unmemoized pro-
cedure is ever called twice with the same arguments. For example, we can replace
the definition of NP in the fragment with the left-recursive one given in (35) with-
out compromising termination, as shown in (36) (where the input string is meant to
approximate Kim's professor knows every student).

(35) (define NP (memo (vacuous
                        (alt PN                     ; NP → PN
                             (alt (seq NP N)        ;    | NP N
                                  (seq Det N))))))  ;    | Det N
(36) > (recognize '(Kim professor knows every student))
     #t
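
The termination claim can be tried out with the following self-contained sketch. To keep it short it does not reuse (26) to (34) verbatim: it assumes a compact memoizer restricted to one-argument CPS recognizers, stores each table entry as a two-slot vector (continuations, results), and uses for-each in place of dolist. These are simplifications made for this illustration, not the definitions given above.

```scheme
(define (terminal word)
  (lambda (cont pos)
    (if (and (pair? pos) (eq? (car pos) word)) (cont (cdr pos)) #f)))
(define (alt a b)
  (lambda (cont pos) (a cont pos) (b cont pos)))
(define (seq a b)
  (lambda (cont pos) (a (lambda (pos1) (b cont pos1)) pos)))
(define-syntax vacuous
  (syntax-rules ()
    ((vacuous fn) (lambda args (apply fn args)))))

;; Simplified memoizer: entry = #(caller-continuations results).
(define (memo cps-fn)
  (let ((table '()))                          ; association list: pos -> entry
    (lambda (cont pos)
      (let ((entry (let ((hit (assoc pos table)))
                     (if hit
                         (cdr hit)
                         (let ((new (vector '() '())))
                           (set! table (cons (cons pos new) table))
                           new)))))
        (let ((first-call? (null? (vector-ref entry 0))))
          (vector-set! entry 0 (cons cont (vector-ref entry 0)))
          (if first-call?
              ;; compute results once; fan each new one out to all callers
              (cps-fn (lambda (result)
                        (if (member result (vector-ref entry 1))
                            #f
                            (begin
                              (vector-set! entry 1
                                           (cons result (vector-ref entry 1)))
                              (for-each (lambda (k) (k result))
                                        (vector-ref entry 0)))))
                      pos)
              ;; called before: replay the results found so far
              (for-each cont (vector-ref entry 1))))))))

(define PN  (alt (terminal 'Kim) (terminal 'Sandy)))
(define V   (alt (terminal 'likes) (terminal 'knows)))
(define Det (alt (terminal 'every) (terminal 'no)))
(define N   (alt (terminal 'student) (terminal 'professor)))
(define NP  (memo (vacuous (alt PN (alt (seq NP N) (seq Det N))))))
(define VP  (memo (vacuous (alt (seq V NP) (seq V S)))))
(define S   (memo (vacuous (seq NP VP))))

(define (recognize words)
  (let ((recognized #f))
    (S (lambda (pos) (if (null? pos) (set! recognized #t) #f)) words)
    recognized))
```

With these definitions the left-recursive call of NP at the position where it is already being evaluated takes the second branch of the memoizer: it registers its continuation and returns, instead of descending again.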

this is a case in which impure features result in a more comprehensible overall program.


Memoized CPS top-down recognizers do in fact correspond fairly closely to chart
parsers. Informally, the memo table for the procedure corresponding to a category $A$
will have an entry for an argument string position $l$ just in case a predictive chart
parser predicts a category $A$ at position $l$, and that entry will contain string position
$r$ as a result just in case the corresponding chart contains a complete edge spanning
from $l$ to $r$. Moreover, the evaluation of the procedure $p_A$ corresponding to a category
$A$ at string position $l$ corresponds to predicting $A$ at position $l$, and the evaluation of
the caller continuations corresponds to the completion steps in chart parsing. The CPS
memoization described here caches such evaluations in the same way that the chart
caches predictions, and the termination in the face of left recursion follows from the
fact that no procedure $p_A$ is ever called with the same arguments twice. Thus given a
CPS formalization of the parsing problem and an appropriate memoization technique,
it is in fact the case that "the maintenance of well-formed substring tables or charts
can be seen as a special case of a more general technique: memoization" (Norvig 1991),
even if the grammar contains left recursion.

6. Conclusion and Future Work

This paper has shown how to generalize Norvig's application of memoization to
top-down recognizers to yield terminating recognizers for left-recursive grammars.
Although not discussed here, the techniques used to construct the CPS recognizers
can be generalized to parsers that construct parse trees, or associate categories with
"semantic values" or "unification-based" feature structures. Specifically, we add extra
arguments to each (caller) continuation whose value is the feature structure, parse tree
and/or the "semantic value" associated with each category. Doing this raises other
interesting questions not addressed by this paper. As noted by a CL reviewer, while the
use of memoization described here achieves termination in the face of left recursion
and polynomial recognition times for CFGs, it does not provide packed parse forest
representations of the strings analysed in the way that chart-based systems can (Lang
1991; Tomita 1985). Since the information that would be used to construct such packed
parse forest representations in a chart is encapsulated in the state of the memoized
functions, a straightforward implementation attempt would probably be very
complicated, and I suspect ultimately not very informative. I suggest that it might be more
fruitful to try to develop an appropriate higher level of abstraction. For example, the
packed parse forest representation exploits the fact that all that matters about a
subtree is its root label and the substring it spans; its other internal details are irrelevant.
This observation might be exploited by performing parse tree construction on streams
of subtrees with the same root labels and string positions (formulated using CPS as
described above) rather than individual subtrees; these operations would be 'delayed'
until the stream is actually read, as is standard, so the parse trees would not actually
be constructed during the parsing process. Whether or not this particular approach is
viable is not that important, but it does seem as if a functional perspective provides
useful and insightful ways to think about the parsing process.
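
As a rough illustration of the first generalization mentioned above, the following sketch gives each continuation a second argument that carries a parse tree. It is an assumption-laden example rather than a construction from this paper: the procedure names unary and parse, the labeled three-argument form of seq, and the representation of trees as nested lists are all introduced here for illustration. Each memoized result is now a (right position, tree) pair.

```scheme
;; Illustrative sketch only: `unary`, `parse`, and the labeled `seq` are
;; assumptions made for this example. Continuations take (position tree).
(define (memo cps-fn)
  (let ((table '()))   ; association list: left position -> entry
    (lambda (continuation pos)
      (let ((hit (assoc pos table)))
        (if hit
            (let ((entry (cdr hit)))
              (vector-set! entry 0 (cons continuation (vector-ref entry 0)))
              (for-each (lambda (result) (apply continuation result))
                        (vector-ref entry 1)))
            (let ((entry (vector (list continuation) '())))
              (set! table (cons (cons pos entry) table))
              (cps-fn (lambda result   ; result = (right-position tree)
                        (if (member result (vector-ref entry 1))
                            #f
                            (begin
                              (vector-set! entry 1
                                           (cons result (vector-ref entry 1)))
                              (for-each (lambda (k) (apply k result))
                                        (vector-ref entry 0)))))
                      pos)))))))

(define (terminal word)
  (lambda (continuation pos)
    (if (and (pair? pos) (eq? (car pos) word))
        (continuation (cdr pos) word)   ; the tree of a word is the word itself
        #f)))

(define (alt a b)
  (lambda (continuation pos)
    (a continuation pos)
    (b continuation pos)))

;; A category `label` with two daughters a and b.
(define (seq label a b)
  (lambda (continuation pos)
    (a (lambda (mid left)
         (b (lambda (right-pos right)
              (continuation right-pos (list label left right)))
            mid))
       pos)))

;; A category `label` with a single daughter a.
(define (unary label a)
  (lambda (continuation pos)
    (a (lambda (right-pos tree) (continuation right-pos (list label tree)))
       pos)))

(define-syntax vacuous
  (syntax-rules ()
    ((_ fn) (lambda args (apply fn args)))))

(define PN  (terminal 'Kim))
(define Det (terminal 'every))
(define N   (alt (terminal 'professor) (terminal 'student)))
(define V   (terminal 'knows))
(define NP  (memo (vacuous (alt (unary 'NP PN)
                                (alt (seq 'NP NP N) (seq 'NP Det N))))))
(define VP  (seq 'VP V NP))
(define S   (seq 'S NP VP))

;; Collect the trees of all analyses that span the whole input.
(define (parse words)
  (let ((trees '()))
    (S (lambda (pos tree)
         (if (null? pos) (set! trees (cons tree trees)) #f))
       words)
    trees))
```

With these definitions, (parse '(Kim professor knows every student)) returns a list containing the single tree (S (NP (NP Kim) professor) (VP knows (NP every student))). Note that each memo entry stores every (position, tree) pair separately, so an ambiguous input would produce one stored result per tree; this is consistent with the reviewer's observation that the technique does not by itself yield a packed parse forest.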


Acknowledgments

I would like to thank Jeff Sisskind, Edward Stabler, and the CL reviewers for their
stimulating comments. This paper was made available via the CMP-LG pre-print
server after it was accepted by Computational Linguistics, and I thank my colleagues
on the Internet for their numerous suggestions and technical improvements.

References

Abelson, Harold, and Sussman, Gerald Jay (1985). Structure and Interpretation of
Computer Programs. MIT Press.
Kahn, K. M., and Carlsson, M. (1984). "How to implement Prolog on a Lisp machine."
In Implementations of Prolog, edited by J. A. Campbell, 117-134. Ellis Horwood
Limited.
Lang, Bernard (1991). "Towards a uniform formal framework for parsing." In Current
Issues in Parsing Technology, edited by Masaru Tomita, 153-172. Kluwer
Academic Publishers.
Leermakers, René (1993). The Functional Treatment of Parsing. Kluwer Academic
Publishers.
Norvig, Peter (1991). "Techniques for automatic memoization with applications
to context-free parsing." Computational Linguistics, 17(1), 91-98.
Pereira, Fernando, and Warren, David H. D. (1983). "Parsing as deduction." In
Proceedings, 21st Annual Meeting of the Association for Computational Linguistics,
137-144.
Rees, Jonathan, and Clinger, William (1991). "Revised report on the algorithmic
language Scheme." Technical Report 341, Computer Science Department, Indiana
University.
Sheil, B. A. (1976). "Observations on context-free parsing." Technical Report TR
12-76, Center for Research in Computing Technology, Aiken Computation
Laboratory, Harvard University.
Shieber, Stuart M.; Schabes, Yves; and Pereira, Fernando C. N. (1994).
"Principles and implementation of deductive parsing." Technical Report
TR-11-94, Center for Research in Computing Technology (also available
from the cmp-lg server), Computer Science Department, Harvard University.
Tomita, Masaru (1985). Efficient Parsing for Natural Language. Kluwer Academic
Publishers.

* Cognitive Science Department, Brown University, Box 1978, Providence, RI 02912

© 1995 Association for Computational Linguistics
